Remote Site Reliability Engineer at YouGov


The largest collection of Remote Jobs for Digital Nomads online. Get a remote job you can do anywhere at Remote Companies like Buffer, Zapier and Automattic who embrace the future. There are 32,450+ jobs that allow you to work anywhere and live everywhere.


YouGov

 

Site Reliability Engineer

sys admin

engineer

admin

17d
Role:

As a Site Reliability Engineer at YouGov, you will join a talented team responsible for the delivery, optimization, resilience, and availability of high-value, high-transaction-rate services trusted and used by both the general public and some of the largest brands in the world. Site Reliability Engineering is a discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SREs ensure that YouGov's internally critical and externally visible systems maintain the appropriate service levels (availability, latency, and reliability) to serve our customers' needs, and they reduce the friction of managing change while being strategic about capacity and constantly managing performance. SRE is a mindset and a set of engineering approaches focused on delivering the appropriate architecture, building infrastructure, optimizing existing systems, and eliminating toil through automation.

SREs have the acumen and experience to provide direct technical contributions to major projects, both in code and in building and optimizing the production environment. You will identify and solve critical problems and build automation to prevent their recurrence. You align with your peers across engineering, deliver subject matter expertise for the infrastructure within your product area, and draw on your strong communication skills to collaborate with your peers in other geographies. Your perspectives help foster and support successful delivery of reliability engineering, and you influence by way of metrics, data, and automation.

Experience required:

* 3+ years' work experience in a similar job role.
* Design, develop, and implement supporting cloud services on the Kubernetes platform.
* Proven application production support experience.
* Strong analytical and problem-solving skills.
* Passion for automating repetitive tasks.
* Identify and solve critical problems and build automation to prevent their recurrence.
* Develop clean, well-documented, testable code.
* Work cross-functionally with developers, QA, and other teams.
* Troubleshoot and resolve issues in both production and lower environments.
* Participate in on-call rotation in support of critical products.
* Proven software engineering experience with Kubernetes (K8s) and Docker.
* Familiarity with running and scaling distributed software systems (load balancing, high availability, systems monitoring, etc.).
* Experience administering and/or designing databases, both SQL and NoSQL.
* Understanding of networking: TCP, UDP, firewalls, DNS, OSI layers, etc.
* Experience with log analysis and monitoring tools such as Splunk, Logstash, New Relic, etc.
* Establish Error Budgets for the products by monitoring SLIs, measuring SLOs, and publishing them to a dashboard.
* Design, build, and implement software features for the product that increase reliability, availability, and performance.
* Own the pipeline of deployments to production, including establishing and maintaining the CI/CD pipeline for the product.
* Drive blameless post-mortems with the product team and use the Error Budget to establish priorities for any necessary changes.
* Have experience with Networking, Linux OS, Security, Data Persistence, Containers, AWS, etc.

Any additional info:

This position is 100% remote, so experience working in a remote environment would be ideal.


Apply for this Job

👉 Please mention that you found the job on Remote OK; this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Always verify that you are actually talking to the company in the job post and not an impostor. Scams in remote work are rampant, so be careful! When you click the button to apply above, you will leave Remote OK and go to that company's job application page outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on external sites or here.