Remote Site Reliability Engineer Heroku at Salesforce πŸ“ˆ Open Startup
RSS
API
Post a Job

get a remote job
you can do anywhere

The largest collection of Remote Jobs for Digital Nomads online. Get a remote job you can do anywhere at Remote Companies like Buffer, Zapier and Automattic who embrace the future. There are 32,300+ jobs that allow you to work anywhere and live everywhere.

The largest collection of Remote Jobs for Digital Nomads online. Get a remote job you can do anywhere at Remote Companies like Buffer, Zapier and Automattic who embrace the future. There are 32,300+ jobs that allow you to work anywhere and live everywhere.

  Jobs

  People

πŸ‘‰ Hiring for a remote Sys Admin position?

Post a Job - $299
on the πŸ† #1 remote jobs board

Salesforce


Site Reliability Engineer Heroku

Site Reliability Engineer Heroku


Salesforce


sys admin

heroku

engineer

admin

sys admin

heroku

engineer

admin

8mo
\nSite Reliability Engineer, Heroku\nLocation: US Remote\n*We are a highly distributed team looking for candidates comfortable working remotely.\nAbout Heroku SRE\nHeroku, a subsidiary of Salesforce, operates the world’s largest Platform As A Service (PaaS), continuously delivering millions of apps with a high volume of deploys per day. Heroku's vision is for developers to focus on their applications and leave operations to us.\nWe are writing our team charter and we're looking for engineers who are interested in joining that effort. This is not an established team - you will be among the first people to implement this job role at Heroku. Because the team isn't established, we'll be looking to you to help define how the team should get work done and how it will communicate with other teams in the engineering org, so it will help if you're interested in the human communication problems of engineering.\nWhat's this job like?\nThis job is open to people anywhere in North America (the United States and Canada). You can work at a Salesforce office or work from home. Because the team is just getting started, here's what we can say so far:\nCurrently, we're helping development teams to develop Service Level Objectives (SLOs) for parts of the platform where they don't currently exist and defining minimum standards for service health metrics\nNext, we will define the SRE Entrance process that development teams follow in order to hand off the operation of a production service to the SRE team\nThe team will be on call for multiple production services, once they have gone through the SRE Entrance process. This includes:\n\nResponding to pages generated by automated monitoring and alerting\nResponding to pages created manually by other engineers and support personnel\nJoining an incident response team as a Subject Matter Expert and working with other SMEs and an Incident Commander to resolve the issue (we'll train you for this)\n\nWhen the team is more established, our goal is a 50% focus on engineering activities. Likely projects include:\nAutomated data and service management tooling\nInstrumenting for observability for troubleshooting\nHardening for resilience in the face of operational events and customer behavior\n\nWho are you?\nWe’re looking for people who are interested in complex distributed systems- how they work, how they can work better, how we even know if they’re working at all. We need someone who's spent time working as a developer (writing code with a team to fix operational issues or build features), but who has also spent time on operational concerns (investigating production incidents, creating or updating monitoring and alerting plans for production systems, or investigating performance issues, for instance).\nYou don't need to have “SRE” in your job title in order to have appropriate skills for this position. You might come from a DevOps environment or have been one of a handful of engineers in a shop so small that everyone does a little of everything. The important thing is that you have experience in both writing code and maintaining systems, and that you're willing to do both of those things in the future. If you're stronger in one area than the other, that's okay.\nBe sure to read or skim the Site Reliability Engineering book, which we are modeling our team structure around.\nRequirements\nExperience with complex distributed systems and familiarity with how the internet and web applications work. You don’t have to have built a datacenter or run a large cloud service at a major provider, but you do need to have used cloud services. Running LAN infrastructure or doing client-side system administration is not enough for this role.\nWilling to join an on-call rotation that you would participate in defining.\nWilling to work on a distributed (currently all-remote) team spanning multiple time zones. None of us currently lives in the same place or works out of the San Francisco headquarters; all of us are experienced remote workers.\nComfortable reading and writing code with a team in at least one of Ruby, Go, Python, or Erlang. It's fine if you know more than one of those languages and/or other languages, but they are the four most important languages at Heroku. We need people who are comfortable with them, and open to switching between them.\nHow do I know if I should apply?\nIf you have experience with any of the following topics, you should apply!\nContainers and container management technologies such as lxc, Docker and Kubernetes\nExperience with AWS services like EC2, ELB, DynamoDB, S3 (or their Azure or GCP equivalents- OpenStack experience is fine too)\nDatabases and big data stores, especially Postgres or Kafka\nREST APIs\nLoad balancing technologies, including L4 or L7 routing and CDNs\nMonitoring, instrumentation, or observability\nStandard parts of a web app's stack, such as TCP/IP, DNS, HTTP, etc.\nCloud computing patterns (and how they're different than using hardware)\nInfrastructure as code (Terraform, Chef, Puppet, Ansible, CloudFormation, etc)

See more jobs at Salesforce

# How do you apply? This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

πŸ‘‰ Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.