Remote Oncall DevOps SRE For Big Data Infrastructure at Ahrefs 📈 Open Startup
RSS
API
Post a Job

get a remote job
you can do anywhere

The largest collection of Remote Jobs for Digital Nomads online. Get a remote job you can do anywhere at Remote Companies like Buffer, Zapier and Automattic who embrace the future. There are 31,900+ jobs that allow you to work anywhere and live everywhere.

The largest collection of Remote Jobs for Digital Nomads online. Get a remote job you can do anywhere at Remote Companies like Buffer, Zapier and Automattic who embrace the future. There are 31,900+ jobs that allow you to work anywhere and live everywhere.

  Jobs

  People

👉 Hiring for a remote Big Data position?

Post a Job - $299
on the 🏆 #1 remote jobs board

Ahrefs


Oncall Devops SRE For Big Data Infrastructure

Oncall Devops SRE For Big Data Infrastructure


Ahrefs


big data

devops

devops

big data

devops

devops

5mo
\nWhat We Need\n\nAhrefs is looking for a Site Reliability Engineer to help take care of its distributed crawler powered by 2,000 servers and ensure all systems are up and running 24/7. If you possess a healthy desire to automate everything while being able to quickly resolve urgent issues manually, then we want you! We strive to keep humans away from doing repetitive jobs that can be done by computers and focus instead on foreseeing problems and defining programmatic means to handle them.\n\nOur system is big part custom OCaml code and also employs third-party technologies - Debian, ELK, Puppet, Clickhouse, and anything else that will solve the task at hand. In this role, be prepared to deal with 25 petabytes storage cluster, 2,000 baremetal servers, experimental large-scale deployments and all kinds of software bugs and hardware deviations on a daily basis.\n\nBasic Requirements:\n\n\n* Deep understanding of operating systems and networks fundamentals\n\n* Practical knowledge of Linux userspace and kernel internals\n\n\n\n\nThe ideal candidate is expected to:\n\n\n* Understand the whole technology stack at all levels: from network and user-space code to OS internals and hardware\n\n* Independently deal with and investigate infrastructure issues on live production systems including dealing with hardware problems and interact with datacenters\n\n* Develop internal automation - monitoring, setup, statistics\n\n* Have the ability to foresee potential problems and prevent them from happening. Apply first-aid reaction to infrastructure failures when necessary\n\n* Help developers with deployment and integration\n\n* Participate in on-call rotation\n\n* Make well-reasoned technical choices and take responsibility for it\n\n* Approach problems with a practical mindset and suppress perfectionism when time is a priority\n\n* Setup automatic systems to control infrastructure\n\n* Possess a healthy detestation for complex shell scripts\n\n\n

See more jobs at Ahrefs

# How do you apply? This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.