Site Reliability Engineer, Leading Open Source Company (closed)
This job post is closed and the position is probably filled. Please do not apply.
Thanks to our ongoing expansion, we have the opportunity to grow our Site Reliability team. We're the part of the Elastic Cloud team with an operations background, and we aren't afraid to get our hands dirty. We are the first line of consumers for Elastic's products, and our experience helps influence the direction of the product. While most organizations may have one or a handful of Elastic Stack deployments, here you'll be responsible for identifying, troubleshooting, and reporting platform problems to developers to ensure that the thousands of Elasticsearch clusters we manage provide a stable and reliable service. We're looking for people who are just as passionate about troubleshooting issues with distributed systems as they are about automating, coding, and collaborating to solve problems.

Responsibilities

* You will report and solve problems within the Elastic Cloud infrastructure services and collaborate on issues with developers
* Handle day-to-day operations around the Elastic Cloud, such as customer trouble tickets, managing cloud provider infrastructure (maintenance/expansion), and software deployments
* You will develop and improve tooling to deploy and run the Elastic Cloud product and infrastructure
* Demonstrate and promote best practices for teams using cloud platforms

Experience

* You have multiple years of hands-on experience administering Linux, preferably with distributed systems at some scale
* One or more years of AWS, GCP, and/or Azure experience is a requirement
* You have experience automating production Linux systems collaboratively, deriving configuration through version control
* You are comfortable writing software to automate API-driven tasks at scale; the SRE team uses Python and some Go, while the developers use Scala, Python, and Java
* You have used Ansible/Puppet/Chef or another config management suite, know where it's broken, and are open to trying new things

Key Skills

* Healthy knowledge of Linux (have compiled your own kernel at some point, know how to trace syscalls, understand TCP, care about the difference between sysvinit/runit/systemd, etc.)
* Relentless desire to automate and build software tools
* Desire to represent work in git, driven by a GitHub workflow through issues and pull requests
* Love open source development, and have contributed to some project somewhere (doesn't have to be ours), whether through mailing lists, patches, documentation, etc.
* Enjoy working remotely and the communication it requires
* Love a diverse environment, working with men and women all over the world
# How do you apply?

This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.