Shopify

Production Engineering Manager

United States, Canada. Originally posted on Remote OK (verified original posting).

This job is currently getting a relatively high number of applications (12% of viewers clicked Apply).

Tags: engineering manager, distributed systems


Shopify is hiring a Remote Production Engineering Manager

**Company Description**

Shopify is now permanently remote and working towards a future that is digital by default. Learn more about what this can mean for you.

Over 1.7 million businesses have bet their success on the stability and performance of the Shopify platform. In order to support these growing businesses, as well as the next million, our systems need to be fast, reliable, scalable and secure. Accomplishing that will require people like you: talented, curious, growth-minded and empathetic engineering managers who are excited to build, support and lead our infrastructure teams.

**Job Description**

Production Engineering, which is part of our core engineering organization, builds, operates and improves the heart of Shopify's technical platform. We are a fast-growing team focused on building and maintaining tools and services to unlock the power of planet-scale infrastructure for all of Shopify's merchants, buyers and developers.

Shopify has grown rapidly over the last several years. We need your help, as an experienced infrastructure engineering manager, to both start new teams and expand and grow the missions of our existing teams. There are multiple positions available on a variety of teams, and we will work with you as part of the interview process to identify which team best fits your interests, needs and experience.

**Here is a sampling of some of the teams, systems and projects to which you could contribute:**

* Expand the reach of our search systems to standardize the way we index documents in different languages and in various locations around the world
* Scale a team looking at solving issues with shopping cart access, configuration plane information and package tracking data using a globally accessible, high-write key/value store
* Grow the capacity of our worldwide distributed site reliability engineering teams, consulting with other engineering groups on how to build low-latency, highly resilient systems
* Take our observability systems to the next level, expanding and evangelizing the usage of tracing, metrics and structured logging across the company
* Work on expanding our highly scalable and configurable job system to support all of the applications on the platform
* Keep our databases operating optimally using proxies, load shedding, custom routing layers and application-transparent sharding
* Build manipulation primitives such as combination and filtering into our streaming infrastructure to allow teams to translate existing data streams into specific business problems

**Qualifications**

While we don't need you to have specific experience with our technology stack, these are leadership positions that do require that you have:

* Proven management and leadership skills, allowing you to develop and mentor others as well as build credibility with your team while executing broader engineering strategies
* Demonstrated proficiency designing and improving the development, delivery and automation of software infrastructure within a cloud environment
* Experience developing and designing solutions in a modern, high-level/systems programming language (Go, Ruby, Python, Java, C++, C, etc.)
* Familiarity working with senior stakeholders across the organization, both technical and non-technical, to develop roadmaps, integrate with larger company initiatives and deliver business and engineering value

**If you have experience in any of the following areas, that will certainly be put to good use. But if you don't, that's ok -- the faster you apply, the quicker we can get to teaching you about:**

* Building services and deploying them on top of Kubernetes and/or Google Cloud Platform
* Familiarity with how to design, build, understand and maintain distributed systems
* Working with Terraform and/or other infrastructure orchestration tooling
* Participating in an on-call rotation and/or site reliability engineering (SRE) experience
* Automating infrastructure operations

**Additional information**

We know that applying to a new role takes a lot of work and we truly value your time. We're looking forward to reading your application.

At Shopify, we are committed to building and fostering an environment where our employees feel included, valued, and heard. Our belief is that a strong commitment to diversity and inclusion enables us to truly make commerce better for everyone. We strongly encourage applications from Indigenous peoples, racialized people, people with disabilities, people from gender and sexually diverse communities and/or people with intersectional identities.

# Location

United States, Canada



# How do you apply?

Click here to apply to the role: https://smrtr.io/5wxL2

Previous Remote Amazon Web Services + Site Reliability Jobs

ZibaSec

Amazon Web Services Focused Site Reliability Engineer

🇺🇸 US-only. Originally posted on Remote OK (verified original posting, closed).

Tags: python, sre, sys admin

This job post is closed and the position is probably filled. Please do not apply.

We are looking for a senior infrastructure engineer, with deep AWS experience, who feels comfortable with new project implementation. Prior experience with serverless architectures is ideal.

The ideal candidate has experience with programmatically managing AWS infrastructure. We currently use Serverless Framework and Terraform.

We are looking for expert-level proficiency in Python. Experience with any of the following is an additional asset:

- Linux
- Node
- Working with the AWS SDK
- Infrastructure as Code
- Writing CLIs
- CI/CD architectures including developing CI-server workflows
- Terraform (or something similar like Ansible, etc.)

## **We'd be especially interested in you if you have:**

- Contributed to any infrastructure or security automation project in the open source world
- Built systems around observability and tracing
- Knowledge of chaos engineering concepts and theory
- Fought and won battles against AWS Lambda + AWS API Gateway
- Worked under the constraints of FedRAMP

# About ZibaSec

The best way to learn about our company is to look at our publicly available employee handbook at [https://www.notion.so/zibasec/Our-Why-f5245149408f4f43baad7ef4de4e0a91](https://www.notion.so/zibasec/Our-Why-f5245149408f4f43baad7ef4de4e0a91)

We're an early-stage, funded startup focused on helping organizations improve their security posture. We build easy-to-use tools that make it harder for attackers to exploit the people within an organization.

Our flagship product is focused on helping organizations run email phishing campaigns against their own employees. This lets organizations assess their risk levels while also providing insight as to what type of training might be necessary for their organization.

We are a growing company and can promise you the following:

- A diverse organization.
- A safe workplace with zero tolerance for discrimination and harassment of any kind.
- A solid workstation; your choice of a Linux, Mac, or Windows laptop.
- A 100% remote and balanced work life. We actually prefer you don't work for more than 40 hours a week. We don't have VCs or other outside entities to answer to, and we'd rather our people have a balanced life than no life.
- Flexible scheduling. Early riser? Night owl? No problem. We maintain an overlapping 3-hour window for synchronous work. Other than that, work any hours that work for you!
- We're a tight-knit group and we value each other. Your voice will carry the same weight as anyone else's.
- You'll have dedicated time to learn, and a budget to pay for it.

# **ZibaSec's Core Software Beliefs**

- **Testing is important:** Untested code does not get shipped, but hitting 100% unit test coverage can be detrimental to productivity for little or no gain; it's about the right balance. We're more fond of integration and end-to-end testing.
- **Git activity != actual productivity:** Developers need time to debug locally, research, and learn.
- **Continuous Deployment:** When code is ready and passes tests, it should make it into production within minutes.
- **Readability > clever code:** Slick code isn't so slick if it's hard to grok.
- **Continuous Improvement:** Everything can be improved and nobody knows any code, stack, or framework perfectly; there is always room to learn and improve. In fact, we'll provide you with a budget that you can spend on learning (conferences, courses, etc.).
- **Dogma is bad:** Some method, technique, etc., may have been the right answer 100 times, but on the 101st time it's possible that another way could be the best path.
- **Open Source is crucial:** As a company, we're very involved with open source; we are active consumers of and contributors to multiple projects. We feel so strongly about this that if we find that a particular internal library could be beneficial to the outside world, then we take the time to package it up and open source it as a standalone library (we did exactly this for a Django SAML2 authentication back end).

# Salary and compensation

$180,000/year

# Location

🇺🇸 US-only



# How do you apply?

This job post has been closed by the poster, which means they probably have enough applicants now. Please do not apply.

Netdata Inc

Senior Site Reliability

🌏 Worldwide. Originally posted on Remote OK (original posting, closed).

This job is currently getting a relatively high number of applications (16% of viewers clicked Apply).

Tags: javascript, go, c, python

This job post is closed and the position is probably filled. Please do not apply.

Netdata is looking for Senior Site Reliability / DevOps Engineers proficient in CI/CD methodologies, coupled with strong experience in software written in Javascript, Go, C, Python or other scripting languages, to join our distributed (remote) engineering team.

As a Senior SRE/DevOps engineer you will focus on supporting our Netdata cloud offerings, augmenting our existing development infrastructure by implementing the automations necessary to catalyze further development of both our open-source project and our commercial offerings and, last but certainly not least, participating in the development of Netdata by making sure it's a first-class citizen in various operating environments (e.g. orchestrated containers, IoT devices, etc.).

Your work will include building CI/CD pipelines, packaging, installation facilities and operational processes as well as developing custom solutions for our various teams and systems. As a Netdata SRE/DevOps engineer you will also assist engineers across our company, enabling them to provide world-class solutions for numerous platforms, and support our community, open-source contributors and team members with your deep knowledge of systems and troubleshooting skills.

**Responsibilities**

* Develop our automated CI/CD, packaging, deployment and execution environment infrastructure.
* Develop automation tools to catalyse existing development or operational processes.
* Evaluate, architect and develop technology options for our infrastructure and systems.
* Troubleshoot, maintain, enhance and augment our platform.
* Automate tasks wherever possible.
* Stay up-to-date on emerging technologies.

**Job Requirements**

**Required experience**

* A bachelor's degree in Computer Science or equivalent
* 3+ years of experience with CI/CD tools (Travis, Gitlab, AWS, Azure, etc.) and methodologies
* Minimum 3 years of Linux systems development and/or administration.
* Minimum 2 years of experience with at least one scripting language, coupled with related automation projects
* Previous experience with cloud-based technologies and surrounding operational processes
* Self-motivated, conscientious, with a problem-solving, hands-on mindset.
* Perfectionist where it matters, but also pragmatic, with effective time management skills.
* Team player, eager to help.
* Excellent analytical skills.
* Excellent command of spoken and written English.

**Preferred experience**

* Minimum 2 years of Go, Javascript and C development experience in demanding environments.
* Expert in Continuous Integration, with long experience in Test Automation
* 5+ years of shell scripting experience, in at least 2 languages (BASH, Python, Perl, Ruby, etc.)
* Minimum 2 years of experience with Google Cloud App Engine and surrounding operational processes
* Experience with configuration management and tools to support it (Ansible, Puppet, etc.)
* Experience with monitoring solutions and service assurance in general.
* A Linux cross-distribution artisan. A good amount of knowledge of Windows system administration
* Open source contributor
* Agile Development Methodology

# Location

🌏 Worldwide



# How do you apply?

This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.

Sticker Mule

Junior Site Reliability Engineer

Originally posted on Remote OK (verified original posting, closed).

Tags: sre, google cloud

This job post is closed and the position is probably filled. Please do not apply.

**About Sticker Mule**

We created Sticker Mule to be the best place to work and shop. That means making ordering fast, simple and fun while creating a stable, low-stress and enjoyable place for talented people to work.

We're searching for more people to join us as we look to build one of the Internet's best technical teams. Some of our current projects include migrating to a service architecture, inter-service communication with GCloud PubSub and gRPC, API Gateway based GraphQL, event sourcing persistence and CQRS, and manufacturing and artwork processing automation.

[Watch a brief video to learn more](https://www.stickermule.com/about)

**Why we enjoy working here**

1. We work flexible hours with an asynchronous culture.
2. We work at a sustainable pace without unreasonable external deadlines.
3. Varied, interesting technical challenges to work on.
4. Opportunities to make a large impact as part of a small, highly motivated team.

# Responsibilities

1. Help design, build and maintain tools to develop, test and deploy services efficiently.
2. Help improve the performance, reliability and security of the Sticker Mule cloud infrastructure.
3. Learn how to implement CI/CD pipelines and debug production services.

# Requirements

1. You have an interest in knowing not only how to write software, but also how to run it at scale.
2. You have a minimum of 1 year of professional software development experience.
3. You're competent in one general-purpose language, like Go, Ruby, or JavaScript.
4. You like Linux, the command line and bash scripts.
5. You have basic experience with one of AWS or Google Cloud.
6. You have used logging, monitoring and distributed tracing systems.
7. You possess strong analytical and critical thinking skills.
8. You have great written and verbal communication skills in English.

Applicants will be sent a Hackerrank test within 1-3 days of applying. The test must be completed within 5 days.



# How do you apply?

This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.