Remote Oncall DevOps SRE For Big Data Infrastructure at Ahrefs

Ahrefs


Oncall DevOps SRE For Big Data Infrastructure

closed

Tags: big data, devops

This job post is closed and the position is probably filled. Please do not apply.
What We Need

Ahrefs is looking for a Site Reliability Engineer to help take care of its distributed crawler, powered by 2,000 servers, and to ensure all systems are up and running 24/7. If you possess a healthy desire to automate everything while being able to quickly resolve urgent issues manually, then we want you! We strive to keep humans away from repetitive jobs that computers can do, and to focus instead on foreseeing problems and defining programmatic means to handle them.

Our system is in large part custom OCaml code and also employs third-party technologies: Debian, ELK, Puppet, ClickHouse, and anything else that will solve the task at hand. In this role, be prepared to deal with a 25-petabyte storage cluster, 2,000 bare-metal servers, experimental large-scale deployments, and all kinds of software bugs and hardware deviations on a daily basis.

Basic Requirements:

* Deep understanding of operating system and network fundamentals
* Practical knowledge of Linux userspace and kernel internals

The ideal candidate is expected to:

* Understand the whole technology stack at all levels: from network and user-space code to OS internals and hardware
* Independently investigate and deal with infrastructure issues on live production systems, including handling hardware problems and interacting with datacenters
* Develop internal automation: monitoring, setup, statistics
* Have the ability to foresee potential problems and prevent them from happening, and apply a first-aid response to infrastructure failures when necessary
* Help developers with deployment and integration
* Participate in the on-call rotation
* Make well-reasoned technical choices and take responsibility for them
* Approach problems with a practical mindset and suppress perfectionism when time is a priority
* Set up automatic systems to control infrastructure
* Possess a healthy detestation for complex shell scripts



# How do you apply?

This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.