Remote Senior Site Reliability Engineer at Ably realtime

get a remote job
you can do anywhere

The largest collection of Remote Jobs for Digital Nomads online. Get a remote job you can do anywhere at Remote Companies like Buffer, Zapier and Automattic who embrace the future. There are 32,450+ jobs that allow you to work anywhere and live everywhere.

Ably realtime


Senior Site Reliability Engineer


senior

sys admin

engineer

admin

2yr

Senior Site Reliability Engineer (remote / London)

What makes Ably special?
Ably helps power next generation digital experiences through its distributed, cloud-based global messaging platform: experiences that are live rather than static, where data is in motion rather than at rest. Read a recent blog post on the distributed systems problems we think about and work on each day.

What we can offer you
Working at Ably means you are working on a cutting-edge distributed internet-scale platform that spans 20+ data centres, soon to support multiple clouds delivering potentially trillions of messages for developers. You will learn with the best. You will have autonomy and freedom to experiment and improve. You will be part of a dynamic team and a business that is growing rapidly.

Job description
If you don't know what a Site Reliability Engineer is, we recommend you first read Google's definition of a Site Reliability Engineer, which we agree with.

As a Senior Engineer in our Site Reliability Engineering team, you’ll build solutions to enhance the availability, performance and stability of the Ably platform, develop new network services and automate away repetitive work. You'll also respond to pings, pages and alerts to investigate issues in our products that you can really sink your teeth into. You'll be working on non-production and production environments, monitoring, data collection and configuration management, as well as disaster recovery planning, capacity engineering, reliability improvement initiatives and platform automation. The team needs someone who can ask questions, learn from others and turn chaos into order.

This role would be a great fit for someone with creative and innovative problem-solving skills and a willingness to take responsibility for the code they write all the way to production.
You will develop and implement solutions that operate at scale - seeing your own technology efforts directly improve the reliability of our products. Our teams are empowered and expected to improve our products to truly deliver a reliable experience to customers.

If you're excited by working on truly complex problems at internet-scale with smart engineers, you'll enjoy working at Ably.

Our infrastructure stack:

* Infrastructure languages: Ruby, Go, Bash.

* Service languages: Go, Elixir, Node.js and some C.

* Mostly AWS based, but we are working on supporting other clouds.

* Architecture: Exclusively Docker containers for all services, servers are immutable, ephemeral and disposed of frequently, code is packaged as slugs, datacenters (circa 20) are isolated and autonomous, critical shared services always have redundancy baked in, manual configuration of any infrastructure is a smell.

* Data services: Cassandra (our realtime datastore, 3 regions, 6 data centers), Influx, Elastic, Kibana, Grafana, etc.

* Web site: We use Rails & Heroku for simplicity. The web service is not part of our "core product" and thus has lower uptime requirements.

See https://goo.gl/cDUirr and https://goo.gl/XDpmBi for a taster of the lengths we go to at each layer in the stack to ensure 100% service uptime.

Day to day you can expect to be working on:

* Writing Ruby code for our infrastructure automation, orchestration, configuration and continuous integration testing of our infrastructure.

* Writing Go code for our core routing, workers and infrastructure services.

* Making extensive use of a wide range of AWS services.
Whilst we primarily use AWS for our infrastructure, in time we expect that to change as we span other cloud services.

* Managing and developing our continuous integration services that test every aspect of the service, from infrastructure tools, to our health servers, routers, realtime services, protocol adaptors and client libraries. Our CI environment is mature, yet we would like to continue to evolve it to help improve the robustness of the platform and reduce the risk of regressions.

* Being exposed to our other development environments such as Node.js and Elixir, both used extensively in our realtime services.

* Working with the realtime engineering team to ensure our infrastructure supports the ever-changing networking, security and processing requirements.

* Collaborating with the team to design, discuss and implement new features and services.

* Diagnosing and fixing bugs in all areas of our platform. You will often be working at very low levels in the network stack to help diagnose difficult-to-identify distributed problems.

* Working with the engineering team to enable them to take responsibility for the complete lifecycle of the features and code they deliver, i.e. pull requests, reviews, testing, deployment to staging and sandbox environments, then into production environments. We are strong believers in all developers being responsible for deploying their own code.

* Contributing to open source projects that we support or use in our products.
All of our client libraries are open source as well and may require your support at times.

* Helping customers solve problems they are experiencing that may help us find bugs in the platform.

* Supporting the wider team with documentation and customer support.

* Suggesting new features or improvements to our protocol and API specifications.

Benefits

* Salary range: €40k to €85k.

* Employee options: Yes, negotiable.

* Holidays: 25+ days excluding national holidays.

* This role can be remote or on-site in our London office. However, if you are working remotely, you will need to be in a European timezone so that we can communicate effectively during business hours, and you will need to be close enough to visit our office in London occasionally. Our preference is to have a team member near enough to commute to our London office when necessary. You will benefit from a flexible working environment in which working from home and managing your own working hours sensibly is the norm.

* Work in an environment where code quality, technical challenges and delivery are what we all care about.

* Skills development is intrinsic to the job. We're largely working on unsolved problems each day and, as such, there is plenty of scope to widen your knowledge and skillset.

* Work with genuinely nice and smart people who care about code quality and enjoying their jobs.

Requirements

* Experience: A minimum of three years of professional experience with Go, as Go is used in all our routing and infrastructure services. Our infrastructure automation and orchestration layer requires you to be proficient in Ruby. You should have experience using both statically and dynamically typed languages. Experience with Node.js and Elixir/Erlang is beneficial. You must have solid experience managing infrastructure and CI environments, and experience with distributed or large-scale infrastructure management is preferred.
Understanding of distributed systems is beneficial.

* Pragmatic: A problem solver excited by the prospect of automating your job away and working autonomously to solve problems and bring solutions to the team.

* Fast Learner: We’re looking for software engineers who thrive on applying their knowledge and learning new technologies. Our stack is diverse, and we expect it to continue to grow.

* Testing: Experience using testing frameworks and adoption of test driven development where applicable.

* Communication: We use tools such as Slack throughout the day to communicate; however, we believe in voice conversations to discuss and solve problems. You must be proficient in spoken and written English, be eager to collaborate with the engineering team and constructively welcome code reviews.

* Customers: Comfortable talking to customers and assisting them with their technical issues and integration.

* Open source: We prefer developers who have contributed back to the open source community, even if those contributions are small.

Are you up for the challenge?
Apply now via the application form. For more information about our organisation, please visit our website.

**** NO AGENCIES PLEASE ****


# How do you apply?
This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please mention that you found the job on Remote OK; this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.