Remote Engineer + Sys Admin Jobs in Aug 2020 Open Startup
RSS
API
Remote HealthPost a job

find a remote job
work from anywhere

Browse 250+ Remote Engineer Sys Admin Jobs in August 2020 at companies like Customer.io, Doximity and Snowplow Analytics working as a Software Engineer, SRE, Engineering Manager Site Reliability or Site Reliability Engineer. Last post

Test A
Test B
Test C

Browse 250+ Remote Engineer Sys Admin Jobs in August 2020 at companies like Customer.io, Doximity and Snowplow Analytics working as a Software Engineer, SRE, Engineering Manager Site Reliability or Site Reliability Engineer. Last post

Remote HealthPost a job

Get a  email of all new remote Engineer + Sys Admin jobs

Subscribe
×

  Jobs

  People

👉 Hiring for a remote Engineer + Sys Admin position?

Post a job
on the 🏆 #1 remote jobs board

Previously

The first health insurance for remote startups
A fully equipped health insurance that works for all your global employees
The first health insurance for remote startups
A fully equipped health insurance that works for all your global employees

Customer.io

 

Site Reliability Engineer

Site Reliability Engineer  


Customer.io


golang

sys admin

engineer

admin

golang

sys admin

engineer

admin


👁 594 viewed | ✍️ 102 applied (17%)
Portland, United States - We're looking for a collaborative Site Reliability Engineer (SRE) who loves solving interesting puzzles and is excited to help us build out a scalable, reliable platform that our customers love. We understand that you might not have all the skills we've listed, and that's okay. I...

See more jobs at Customer.io

Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Last 30 days

Doximity


Software Engineer, SRE

verified
🇺🇸 US-only

Software Engineer, SRE


Doximity

🇺🇸 US-only

service reliability

sre

ruby

unix

service reliability

sre

ruby

unix


👁 4,561 viewed | ✍️ 144 applied (3%)
Doximity is transforming the healthcare industry. Our mission is to help doctors be more productive, informed, and connected. As a software engineer, you'll work within cross-functional delivery teams alongside other engineers, designers, and product managers in building software to help improve healthcare.  \n\nOur[ team](https://www.doximity.com/about/company#theteam) brings a diverse set of technical and cultural backgrounds and we like to think pragmatically in choosing the tools most appropriate for the job at hand.\n\n**One of Doximity's [core values](https://work.doximity.com/) is stretching ourselves. Even if you don't check off all the boxes below we encourage you to apply. Doximity is full of exceptional people that don't fit a mold, join us!**\n\n**About you**\n\n* You are a Ruby engineer at heart, very familiar and passionate about the Rails ecosystem\n* You are knowledgeable of memory and CPU profiling tools to help adjust Ruby jobs and processes to use resources effectively\n* You have experience working with Terraform and Chef (or similar tooling) either in a DevOps or product support capacity\n* You have experience deploying, configuring, and maintaining NGINX\n* You are proficient with Unix, AWS, and Git\n* You are self-motivated and able to manage yourself and your own queue\n* You are a problem solver with a passion for simple, clean, and maintainable solutions\n* You agree that concise and effective written and verbal communication is a must for a successful team\n* You are able to maintain a minimum of 5 hours overlap with 9:30 to 5:30 PM Pacific time\n* You can dedicate about two weeks per year for travel to company events\n\n**Here's How You Will Make an Impact**\n\n* Improve the performance and scalability of services, optimize our REST and GraphQL APIs\n* Address security concerns and proficiently maintain our application stack\n* Troubleshoot issues across the whole stack, such as high-load, memory full, network issues and come up with temporary/long term solutions based on the root cause\n* Hands-on maintenance on our Ruby on Rails and Go (Golang) applications\n* Increase our automated test coverage and deployment infrastructure robustness \n* Manage infrastructure using Chef and Terraform\n* Active involvement in design, implementation, and maintenance of the development, staging, and production infrastructure and services your team is responsible for\n* Create concise postmortems in the event of an outage\n* Write and maintain run-books for other engineers to leverage\n* Ensure proper security, monitoring, alerting, and reporting for the applications your team is responsible for\n* Collaborate with other engineers to make sound infrastructure decisions, improve workflow, and deploy applications ready for production\n* Monitor capacity, cost and plan for upgrades\n* Participate in an on-call rotation\n\n**About Us**\n\n* Here are some of the [ways we bring value to doctors](https://drive.google.com/file/d/1qimYh0mG3i1nTJe6jDCDepJt2i4o8MEB/view)\n* Our web applications are built primarily using Ruby, Rails, Javascript (Vue.js), and a bit of Golang\n* Our data engineering stack run on Python, MySQL, Spark, and Airflow\n* Our production application stack is hosted on AWS and we deploy to production on average 50 times per day\n* We have over 350 private repositories in Github containing our applications, forks of gems, our own internal gems, and [open-source projects](https://github.com/doximity)\n* We have worked as a distributed team for a long time; we're currently about [65% distributed](https://blog.brunomiranda.com/building-a-distributed-engineering-team-85d281b9b1c)\n* Find out more information on the [Doximity engineering blog](https://technology.doximity.com/)\n* Our [recruiting process](https://technology.doximity.com/articles/engineering-recruitment-process-doximity)\n* Our [product development cycle](https://technology.doximity.com/articles/mofo-driven-product-development)\n* Our [on-boarding & mentorship process](https://technology.doximity.com/articles/software-engineering-on-boarding-at-doximity)\n\n**Benefits & Perks**\n\n* Generous time off policy\n* Comprehensive benefits including medical, vision, dental, Life/ADD, 401k, flex spending accounts, commuter benefits, equipment budget, and continuous education budget\n* Family Planning and Support benefits\n* Pre-IPO stock incentives\n* .. and much more! For a full list, see our career page\n\n**More info on Doximity**\n\nWe’re thrilled to be named the Fastest Growing Company in the Bay Area, and one of Fast Company’s Most Innovative Companies. Joining Doximity means being part of an incredibly talented and humble team. We work on amazing products that over 70% of US doctors (and over one million healthcare professionals) use to make their busy lives a little easier. We’re driven by the goal of improving inefficiencies in our $3.5 trillion U.S. healthcare system and love creating technology that has a real, meaningful impact on people’s lives. To learn more about our team, culture, and users, check out our careers page, company blog, and engineering blog. We’re growing steadily, and there’s plenty of opportunity for you to make an impact.\n\n*Doximity is proud to be an equal opportunity employer, and committed to providing employment opportunities regardless of race, religious creed, color, national origin, ancestry, physical disability, mental disability, medical condition, genetic information, marital status, sex, gender, gender identity, gender expression, pregnancy, childbirth and breastfeeding, age, sexual orientation, military or veteran status, or any other protected classification. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law.*\n\n#Location\n- 🇺🇸 US-only

See more jobs at Doximity

Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Previously

Snowplow Analytics


Site Reliability Engineer

Site Reliability Engineer


Snowplow Analytics


sys admin

engineer

admin

sys admin

engineer

admin


👁 734 viewed | ✍️ 46 applied (6%)
This job post is archived and the position is probably filled. Please do not apply.
Site Reliability Engineer (AWS)\nRemote, located in the UTC +/- 2 region \n\nAt Snowplow, we are on a mission to empower people to differentiate with data. We provide the technology to enable our customers to take control of their data and empower them to do amazing things with it.\n\nThere are tens of thousands of pipelines using our open source pipeline worldwide, collecting data emitted from over half a million sites. Running on AWS and GCP data technologies, it is ideal for data teams who want to manage their data in real-time and in their own cloud. We also collect, validate, enrich and load in the region of 5 billion events for our customers each day and help them on their Snowplow journey through our management console. \n\nTo support our ongoing growth, we are now looking for an experienced Site Reliability Engineer (SRE) to join our Tech Ops Team. You’ll be taking the lead on all things AWS including development and improvements of the current stack and rolling out new features - all whilst keeping these environments running smoothly. We would love to hear from you if the idea of programmatically controlling thousands of remote production environments excites you!.\n\nThe Opportunity: \n\nOur Private SaaS offering has grown significantly over the past year and we now orchestrate and monitor Snowplow event pipelines across hundreds of customer-owned AWS & GCP sub-accounts.  Each account has its own individualised and optimised stack and all are capable of processing many billions of events per month.\n\nWe are looking for another SRE to help us grow to managing 1,000 and then 10,000 AWS, GCP and (in the future) Azure accounts. You will be pioneering solutions to managing estates of this size through cutting edge monitoring and automation. You’ll work closely with our Tech Ops Lead on all aspects of our proprietary deployment, orchestration and monitoring stacks.\n\nTech Ops has two areas of responsibility: the centralised services we provide customers and their pipeline infrastructure hosted in their own AWS or GCP accounts.  Within both domains we are striving to increase service reliability, fulfil customer requests in a timely fashion, and automate recurring tasks.  Task automation is essential as our customer base grows, because our infrastructure estate scales linearly with our customer numbers, unlike most software businesses.\n\nThe challenge of automating the maintenance and deployment of thousands of individualised stacks is an enormously ambitious undertaking and a hugely exciting infrastructure automation challenge you’re unlikely to find anywhere else!\n\nThe environment you’ll be working in:\n\nOur company values are Transparency, Honesty, Ownership, Inclusivity, Empowerment, Customer-centricity, Growth and Technical Excellence. These aren’t just words we plucked out of thin air, we came up with them together as a company and are continually looking to find new ways to weave these into our day to day operations. From flexible hours and working locations to the way we give feedback, we’re passionate about building a company that supports both company and individual development.\n\nWhat you’ll be doing:\n\n- Strategising and innovating around the Private SaaS model, helping Snowplow to plan for the future. \n- Collaborating closely with our team of SREs and other teams around the business to ensure we continue to provide our customers with an excellent service. \n- Maintaining and developing our growing Terraform infrastructure-as-code stacks which we use to deploy infrastructure for all internal and client use cases\n- Maintaining our internal infrastructure stacks which include the HashiCorp suite as well as our Snowplow Insights UI and VPNs\n- Improving the resilience and healing ability of our infrastructure estate.\n- Owning your share of support tickets and participation in an on-call rotation to help mitigate anything else and help us serve our client base 24/7.\n- Being a key part of our response to high-severity internal or customer incidents, ensuring we meet all SLAs\n\nWhat you bring to the team:\n\n- Has worked with AWS in a production capacity - experience in GCP and/or Azure is a bonus\n- Has worked with Terraform, CloudFormation or some form of infrastructure-as-code tooling\n- Any experience with the HashiCorp stack (Vault, Consul, Nomad) and understanding their role in infrastructure automation is a bonus\n- Has worked with Docker and is familiar with container-based architectures\n- Knowledgeable about the Linux operating system and how to manage servers in a production capacity\n- Knowledgeable about Cloud networking principles and how to troubleshoot issues in this space\n- Comfortable scripting in one or more of: Bash, Python, Ruby or Perl\n- Comfortable programming in one or more of: Java, Scala, Golang or Python\n\nWhat you’ll get in return\n\n- A competitive package based on experience, including share options\n- 25 days of holiday a year (plus bank holidays)\n- MacBook or Dell XPS 13/15\n- Two fantastic company Away Weeks in a different European city each year (the last one was in November 2019 in Bratislava)\n- Work alongside a supportive and talented team with the opportunity to work on cutting edge technology and challenging problems\n- Grow and develop in a fast-moving, collaborative organisation\n- Enjoy fun events in and around London organised by our Cultural Work Committee\n- If based in London, convenient office location in central London (Aldgate) and a continuous supply of Pact coffee and healthy snacks

See more jobs at Snowplow Analytics

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

PEMDAS Technologies and Innovations


Unmanned Aerial System Back End GIS Software Engineer

Unmanned Aerial System Back End GIS Software Engineer


PEMDAS Technologies and Innovations


sys admin

backend

dev

engineer

sys admin

backend

dev

engineer


👁 4,893 viewed | ✍️ 27 applied (1%)
This job post is archived and the position is probably filled. Please do not apply.
\nPEMDAS is looking for an experienced back-end GIS software engineer to join our remote team. You will be working on the back-end data services of our environmental intelligence system, building APIs, performance tuning algorithms, and architecting our solution going forward for unmanned aerial systems.  You need to be comfortable working with big environmental data sets: data flow setup, data processing, and data storage.\n\nYou will be expected to follow typical software development processes. Our developers use Jira, Git version control, continually integrate their software updates with automated builds, follow Agile software development processes, fully document their code, and follow accepted code style standards. This position offers a wide range of creative freedom, but utilizing these best practices allows us to maintain structure, consistency, and high quality products.\n\nAs we are a remote team, you must have the discipline to manage your time while working from home.  Some travel will be required (<25%) in order to better coordinate implementation of complex solutions with our team and to facilitate demonstrations of our solutions to our government clients on site.\n\n  The Basics\n\n\n* BS/MS degree in Software Engineering, Computer Science, or a related subject\n\n* Software Development: minimum 5 years (Required)\n\n* ESRI (ArcGIS) or GDAL: minimum 3-5 years (Required)\n\n* C# and or Python: minimum 5 years (Required)\n\n* Clearance (or clearable)\n\n* Skilled at developing OGC-compliant web services\n\n* Familiarity with Atlassian or similar tool suite for task tracking and development processes\n\n* Ability to document requirements and specifications\n\n\n\n\nPreference given to candidates with:\n\n\n* Familiarity IBL Visual Weather\n\n* Experience with meteorological data\n\n* Experience with containers\n\n\n\n\n  You will be a perfect fit if you:\n\n\n* Develop well-designed, implementable, and testable software\n\n* Enjoy working on new, unexplored problems\n\n* Do not like working mundane tasks, but prefer the ability to develop creative solutions\n\n* Can conduct feasibility studies and advise on alternative approaches (trades)\n\n* Work well as part of self-organizing team and are open to pair programming\n\n* Thrive when working in the comfort of your own home as part of a geographically separated team\n\n\n

See more jobs at PEMDAS Technologies and Innovations

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Giant Swarm


Site Reliability Engineer

verified

Site Reliability Engineer


Giant Swarm


sys admin

engineer

admin

sys admin

engineer

admin


👁 449 viewed | ✍️ 29 applied (6%)
This job post is archived and the position is probably filled. Please do not apply.
\nGiant Swarm is a fast-growing open-source infrastructure management platform used by modern enterprises. Our vision is to empower developers around the world to ship great products.\n\nWe're a distributed, diverse, and growing team, spread across Europe. The company is based in Cologne, Germany, where we have a small office in a coworking space. However, less than 5% of us actually work there. All workflows are designed to function remotely - but of course, if you want to visit Cologne, you are more than welcome!\n\nWhat we offer on top:\n\n\n* Choose the hardware you like the most!\n\n* Family first - we have more kids than employees!\n\n* Join our team at conferences all over the globe!\n\n* Internal Hackathons - we love to challenge ourselves!\n\n* 2 Off-Sites per year (check our photos on Instagram)!\n\n\n\n\nWhat’s the most outstanding part about working for Giant Swarm?\n"It's a long list, but for me, the most important thing is the people. It's great to be surrounded by so many smart people - there's a lot of work to do but it doesn't feel like an uphill struggle because everyone pulls their weight so well"\n(Simon Weald, SRE)\n\nWhile we are remote-first, we appreciate quality time with our co-workers, so we meet in person twice a year to work and have fun together.\n\nWork-life integration\n\n\n* Flexible working hours, and working from home or anywhere you prefer\n\n* Currently, the number of kids from our team members outnumbers the number of employees.\n\n* We don’t only care about the kids “within” the company, but also about all children - for example, we compensate the carbon of all our flights.\n\n* As an international company, we want to create similar standards for everyone, regardless of location. So, additional perks (for example, a location-aware, fixed amount paid each month to cover costs like co-working, phone contracts or gym memberships), paid parental leave and healthcare compensation are compulsory.\n\n\n\n\nYour Job\n\n\n* You maintain, operate and upgrade our own and our customer’s Kubernetes clusters.\n\n* You will design, configure, build, and maintain our core infrastructure, from kernel parameters to the cloud provider templates.\n\n* You understand how servers and systems work and you tweak their behavior to your needs.\n\n* You will be responsible for our monitoring, logging and alerting.\n\n* You will help resolve incidents on our own and our customer’s clusters.\n\n* You participate in the on-call support schedule (~ one 24 hours shift every 2 weeks)\n\n* You are a go-to person in case our developers need advice regarding infrastructure.\n\n* You will automate all the things.\n\n\n\n\nRequirements\n\n\n* You must have deep, hands-on knowledge of Kubernetes from both the end-user and the operation side.\n\n* You have wide experience with and are able to debug Networking, Security, Linux (Kernel, Namespaces, cgroups).\n\n* You have great debugging skills and you are not afraid to deep dive into thousands of lines of logs.\n\n* You have decent coding skills, preferably in Go. You have experience with maintaining infrastructure with code.\n\n* You know the good and bad parts of various automatization tools (Terraform, Chef, Puppet, Ansible or Saltstack).\n\n* You are fluent with CNCF products running on top of Kubernetes (prometheus, grafana, ingress controller, …) you know how to use them and how to configure them.\n\n* You have a decent knowledge of storage including software-defined storage.\n\n* You like reverse and performance engineering.\n\n* You automate all the things by writing code. Using bash scripts for it makes you sad :)\n\n* We are currently mostly distributed around Europe (around UTC), but we have recently won our first US client and are looking for someone in the same time zone. Thus, you are located somewhere at the American (North, South or Central) East Coast.\n\n\n\n\nWhy we think this job is worth applying for!\n\nImpact, Impact, Impact! We are a remote-first organization with a growing team from 15+ European countries. Every new team member changes the team. This is great! People who know things we don’t are highly welcome.\n\n“It's easier to ask forgiveness than it is to get permission” (Grace Hopper) - sure, it’s not 100% like this, but we have a strong culture of failure which, is part of our agile mindset. We don’t do things like in the guidebook. You can try things out! Our default to 100% transparency will help you here.\nWe play a key role in our customers' digital transformation. We have partnered up with Amazon and Microsoft to provide our solution on their cloud platforms - more will follow.\nWe have been in this ecosystem from the get-go and as part of the CNCF family, we feel at home in the community. As a part of Giant Swarm, you will also join this extended family.\nWe serve some of Europe's leading organizations and are talking to many more.\n\nWhat’s the most challenging part of your job?"Finding time to concentrate on a specific task (especially if it's in-depth) - SREs context-switch a lot"\n(Simon Weald, SRE)\n\nInterested? Questions? Coffee? Contact Mirco ([email protected]) or apply directly!

See more jobs at Giant Swarm

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

SpotMe


Site Reliability Engineer

Site Reliability Engineer


SpotMe


sys admin

engineer

admin

sys admin

engineer

admin


👁 348 viewed | ✍️ 17 applied (5%)
This job post is archived and the position is probably filled. Please do not apply.
\nSpotMe is the worldwide leader of enterprise engagement platforms with a focus on live events, virtual and hybrid meetings, as well as long-term engagement.\n\nThe Covid19 crisis has created a big shift in the way people work, meet and interact with one-another. As a result, we’re seeing a total reset of the industry, and while this is a big change, it is also a fantastic opportunity to transform the way people engage in meetings and events.\n\nIn the past months, we have fully embraced this opportunity, and have evolved our platform and apps to match these new needs. Our agility has allowed us to adapt with the fastest possible pace, by continually delivering and deploying new features and innovations.\n\nIn parallel, we have also had to adapt the way we work, with a focus on flexibility. Our engineers are now free to decide when they want to work from home, and when they come into our Lausanne or Sofia office. In fact, they can work from anywhere they want.\n\nDo you want to join us in this exciting adventure? Please do not hesitate to reach out to us.\n\nResponsibilities:\n\n\n* Work with engineering teams to design and build a scalable platform that provides mission-critical services to our end customers and users.\n\n* Participate in the design and development of internal tooling and scripts to monitor and automate our infrastructure related processes.\n\n* Implement automated and failsafe platform deployment concepts typically canary releases.\n\n* Define deployment strategy and tools to ensure smooth service operation through resistance to failure, automatic upscaling and downscaling as well as zero downtime deployments.\n\n* Solve issues across the entire stack it being software or hardware related.\n\n* Work with architects to help define new system architectures in order to achieve high availability and failsafe services\n\n* Responsible for on-going maintenance and support of internal tools, improve system health and reliability.\n\n* Document and provide cross-training to peers for projects and products worked on.\n\n\n\n\nRequirements & Skills:\n\n\n* Bachelor's or Master's degree in Computer Science, Information Systems, or equivalent.\n\n* Typically 4-6 years of relevant experience \n\n* In-depth understanding of software engineering and cloud operations.\n\n* Familiar with cloud automation concepts, tools, and processes. \n\n* Experience in designing large-scale distributed information systems, server load balancing architectures.\n\n* Working experience with Ansible.\n\n* Professional experience with Terraform, Docker and Packer.\n\n* Solid work experience with cloud platforms such as AWS or Azure.\n\n* Solid understanding of networking concepts, TCP/IP stack.\n\n* Programming experience in at least one of the following languages: Python, Go, JavaScript.\n\n* Practical experience with Linux administration (Debian is a plus), monitoring tools, troubleshooting and performance tuning.\n\n* JavaScript experience desired\n\n* Experience with deployment and maintaining of Erlang/OTP based systems is an asset\n\n* Strong analytical and problem-solving skills.\n\n* Excellent written and verbal communication skills; mastery in English and local language. \n\n\n

See more jobs at SpotMe

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Platform.sh


Devops Site Reliability Engineer

Devops Site Reliability Engineer


Platform.sh


devops

sys admin

engineer

admin

devops

sys admin

engineer

admin


👁 49 viewed | ✍️ 2 applied (4%)
This job post is archived and the position is probably filled. Please do not apply.
\nPlatform.sh is a groundbreaking hosting and development tool for web applications. \n\nTo reinforce our technical prowess, we are looking to grow our operations team. If you’re looking for an exciting, high-growth opportunity with an award-winning, cutting-edge company, this could be just the job for you\n\nFor its PaaS solution https://platform.sh is looking for an Operations and Service Reliability Engineer with a taste for Python and Go, great Linux system understanding, and a real hunger for the challenges of building robust, distributed systems.\n\nPlatform.sh is a PaaS shrouded in a lot of black magic (we can consistently clone a whole running cluster, with its state, databases, indexes in a matter of seconds). We want to get this down to the hundreds of milliseconds domain. Interested? There is more...\n\nOur external API is pure Hypermedia REST + oAuth on top of Pyramid. It mechanizes the Git layer and needs more features.\n\nWe can consistently generate from the same manifest a Docker container, an LXC one, or VM disk images (AWS, Azure, OpenStack), we want more targets.\n\nWe probably have the highest industry container density. We need to get it higher.\n\nWe support any Python, Ruby, NodeJS or PHP, Java and .NET, time to roll-out Elixir, of course, Elixir (and Rust. We need Rust).\n\nDirectly reporting to one of our Directors for the Operations Infrastructure Department and in close interaction with our Engineering and Customer Success teams, you will be responsible for:\n\n\n* cloud operations: configure clusters, deploy stuff, follow-up on alerts, help customer support debug issues.\n\n* automating all of the above so they can instead drink margaritas (or non-alcoholic beverages, of course)\n\n* creating systems, tools & processes that will enhance our support and operations efficiency\n\n* improving service quality, discipline and reliability throughout lifecycle\n\n* monitoring operating objectives, streamline and automate intervention\n\n* continuous learning from Operations experience, modeled as software\n\n\n\n\nThis is a fully remote position for a candidate based in EMEA.\n\nThe ideal candidate\n\n\n* has proven successful experience in an operations role,\n\n* has demonstrated the ability to successfully manage cloud-based infrastructure for a fast growing organization,\n\n* has experience with containerization technologies,\n\n* has had exposure to cloud services (AWS, Azure, GCP, ...),\n\n* understands how an OS works, knows networking, how git works, and the constraints of a distributed system,\n\n* Puppet experience,\n\n* is proficient in Python (Golang a plus).\n\n\n\n\nNice to have \n\n\n* knowledge of Magento Ecommerce, Symfony, Drupal, eZ Platform, or Typo3.\n\n\n\n\nNote: we don't like stress, so we build everything to be robust and resilient, but stuff does break. This is a role with on-call duties and fire drills. If this fills you with dread... well, this might not be a fit for you.

See more jobs at Platform.sh

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Numbrs


Site Reliability Engineer

Site Reliability Engineer


Numbrs


sys admin

engineer

admin

sys admin

engineer

admin


👁 301 viewed | ✍️ 16 applied (5%)
This job post is archived and the position is probably filled. Please do not apply.
\nNumbrs is reshaping the future of the workplace. We are a fully remote company, at which every employee is free to live and work wherever they want.\n\nNumbrs was founded with the vision to revolutionise banking. Therefore from day one Numbrs has always been a technology company, which is driven by a strong entrepreneurial spirit and the urge to innovate. We live and embrace technology.\n\nAt Numbrs, our engineers don’t just develop things – we have an impact. We change the way how people are managing their finances by building the best products and services for our users.\n\nNumbrs engineers are innovators, problem-solvers, and hard-workers who are building solutions in big data, mobile technology and much more. We look for professional, highly skilled engineers who evolve, adapt to change and thrive in a fast-paced, value-driven environment.\n\nJoin our dedicated technology team that builds massively scalable systems, designs low latency architecture solutions and leverages machine learning technology to turn financial data into action. Want to push the limit of personal finance management? Join Numbrs.\n\nJob Description\n\nYou will be a part of a team that is responsible for deploying, supporting, monitoring and troubleshooting large scale micro-service based distributed systems with high transaction volume; documenting the IT infrastructure, policies and procedures. You will also be part of an on-call rotation.\n\nKey Qualifications\n\n\n* a Bachelor's or higher degree in technical field of study\n\n* a minimum of 5 years experience deploying, monitoring and troubleshooting large scale distributed systems\n\n* background in Linux administration (mainly Debian)\n\n* scripting/programming knowledge of at least Unix shell scripting\n\n* good networking understanding (TCP/IP, DNS, routing, firewalls, etc.)\n\n* good understanding of technologies such as Apache, Nginx, Databases (relational and key-value), DNS servers, SMTP servers, etc.\n\n* understanding of cloud-based infrastructure, such as AWS\n\n* experience with systems for automating deployment, scaling and management of containerised applications, such as Kubernetes\n\n* quick to learn and fast to adapt to changing environments\n\n* excellent communication and documentation skills\n\n* excellent troubleshooting and creative problem-solving abilities\n\n* excellent communication and organisational skills in English\n\n\n\n\nIdeally, candidates will also have\n\n\n* experience deploying and supporting big data technologies, such as Kafka, Spark, Storm and Cassandra\n\n* experience maintaining continuous integration and delivery pipelines with tools such as Jenkins and Spinnaker\n\n* experience implementing, operating and supporting open source tools for network and security monitoring and management on Linux/Unix platforms\n\n* experience with encryption and cryptography standards\n\n\n

See more jobs at Numbrs

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Verys


Site Reliability Engineer C# .NET

Site Reliability Engineer C# .NET


Verys


c

c plus plus

sys admin

engineer

c

c plus plus

sys admin

engineer


👁 1,405 viewed | ✍️ 137 applied (10%)
This job post is archived and the position is probably filled. Please do not apply.
\nImportant note: We are not currently working with third parties or offering sponsorship. This position is fully remote, with the option to work out of our office in Orange County, CA.  Thank you! \n \nAt Verys, we build software to be proud of for world class clients like Blizzard, American Airlines, Kia, and Experian. Our developers have a chance to work with a variety of tech stacks across industries, allowing them constant growth and exposure to new challenges, all while enjoying the family-like culture of respect that has led to our success and growth over the last 8 years.\n \nRight now, we’re looking to welcome a new Site Reliability Engineer to join our team. In this role, you will work cross functionally within the organization to assure high availability and stability for a variety of products.\n \nIf you are excited by solving complex challenges and growing your career within an innovative software services company, we’d love to hear from you!\n \nWhat you will be doing:\n\n\n* You will be the Subject Matter Expert on how the applications works, its underlying architecture, and data relationships.\n\n* Address support escalations from Customer Support and IT Operations teams.\n\n* Debug and correct application configurations, codes, database queries, and data to restore an incident.\n\n* Collaborate with the product owner and product manager for any solutions that required non-operational development engagement.\n\n* Apply reverse engineering to troubleshoot on the issue.\n\n* Automate support processes.\n\n* Participate in software releases and peer review on code changes.\n\n* Deployment of hotfixes and ad hoc database changes to lower environments. Coordinate changes with IT Operations to production.\n\n* Collaborate with IT Operations and development team to ensure proactive monitoring, high availability, and performance of products to achieve the best customer experience.\n\n* Recommend and collaborate with others to innovate and implement solutions to optimize product and team performance.\n\n* Take ownership of root cause analysis to resolve problems permanently.\n\n* Create and maintain runbooks that outline development support procedures.\n\n\n\n\n \nPreferred Experience:\n\n\n* BS/MS degree in Computer Science, Engineering, or relevant professional experience.\n\n* Hands-on experience with Microsoft Windows and Microsoft .NET frameworks.\n\n* Exposure in writing database queries on SQL Server or equivalent database platforms.\n\n* Coding and scripting skills such as programming using C#, PowerShell, Python, or other relevant languages.\n\n* Good grasp in Agile software development methodologies.\n\n* Experience in navigating on Linux environment is preferred.\n\n* Cloud experience preferred (AWS, Azure, GCF)\n\n* Skilled in creating runbook for support processes.\n\n* Exposure to ITIL framework highly desired.\n\n* 3+ years of similar working experience.\n\n\n

See more jobs at Verys

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Packet Fabric


Senior Systems Reliability Engineer

Senior Systems Reliability Engineer


Packet Fabric


senior

sys admin

engineer

admin

senior

sys admin

engineer

admin


👁 285 viewed | ✍️ 17 applied (6%)
This job post is archived and the position is probably filled. Please do not apply.
\nDescription\nAs a well rounded systems reliability engineer with a diverse set of skills, this makes you one of the very best people to troubleshoot, monitor the platform, and be on top of releases. You should definitely be the type that appreciates diversity in your day, and challenges outside of your comfort level! A typical day might include these types of activities:\n\n- Taking charge of the build process and pipelines across the platform.\n- Being keenly aware of systems architecture and automatically adding in redundancy and backup for new systems and software.\n- Assist in troubleshooting a complex customer issues across network devices, server hardware, virtual machines, in-house software and open source software. Not only can you run tcpdump with filters on the command line, but you can read it there also.\n- Adding additional monitoring and alerting on all systems across the platform that will help you identify one of those annoying intermittent issues you have seen in the logs.\n\n\nSkills & Requirements\nThe right candidates will probably have a CS degree, solid scripting and automation skills, great troubleshooting skills across the OS and network, a good grasp on security concepts, experience with routing platforms and protocols, and enjoy working collaboratively.\n\nSpecific requirements include:\n\n- Experience in automating tasks through scripting. You should be very well versed with Python, and probably a few other languages. We will ask for script samples.\n- High degree of drive to improve and automate your environment with minimal guidance\n- Be able to solve for immediate, and plan to accommodate for future problems\n- Experience with Ansible, Salt, Chef, Puppet, Terraform, or CFEngine. Experience with Ansible and Terraform preferred.\n- Experience with build pipelines, integration testing and Jenkins.\n- Experience administering a wide variety of *nix platforms, including multiple Linux variants.\n- Solid understanding of Layer 2 and Layer 3 protocols including IPv4/6, 802.1Q, BGP, MPLS, etc., and understanding a multitude of different network architectures.\n- Experience with Google Compute, AWS, or other cloud based compute and database services.\n- Understand the importance and implementation of backup and redundancy across many layers of databases, systems, and network configurations.\n\nSome knowledge that would be a huge plus:\n\n- Familiarity administering/troubleshooting Juniper/Cisco/Arista platforms.\n- Experience with extremely large scale network management and monitoring.\n- Experience with Postgresql, TimescaleDB, ElasticSearch

See more jobs at Packet Fabric

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Hypatos


Senior Devops System Engineer

Senior Devops System Engineer


Hypatos


sys admin

devops

senior

engineer

sys admin

devops

senior

engineer


👁 1,338 viewed | ✍️ 130 applied (10%)
This job post is archived and the position is probably filled. Please do not apply.
\nABOUT US\n\nHypatos is the leader in applying deep learning technology to automate back office tasks. We build advanced machine learning models to process complex documents. Our technology is in high demand because it brings a step change to organizational efficiency. We are improving the way hundreds of millions of people work every day. Join us and change work for good.\n \n To achieve our goals, we need your support! \n\nYOUR MISSION\n\n\n* Prepare Hypatos line of products for deliver to Kuberneres on-premise and on-cloud;\n\n* Improve our CI/CD processes;\n\n* Contribute to our Machine Learning infrastructure;\n\n* Work with Terraform to manage our cloud products;\n\n* Write scripts to automate and improve development and delivery processes.\n\n\n\n\nOUR TECH STACK\n\n\n* Modern stack on Kubernetes (EKS/Amazon) for our cloud product;\n\n* We deliver on Kubernetes to our customers on-premise, often on Openshift;\n\n* We have Continuous Deployment (git-driven);\n\n* We use Terraform/Terragrunt for our own on-cloud products; We believe in infra-as-a-code.\n\n* Monitoring with Prometheus, Alertmanager, Grafana;\n\n* Our own product, a SaaS and a sophisticated FinTech/ML API.\n\n\n\n\nYOUR PROFILE\n\n\n* Have >4 years of system engineering and administration under your belt;\n\n* Feel well with automation technologies, such as Terraform, Ansible or Salt;\n\n* Proficency in writing scripts in Bash and Python;\n\n* Have the basics covered: bash, Linux, ..., nginx configuration, networking;\n\n* Experience with Docker, Kubernetes, public cloud (AWS, GCP, Azure) and OpenShift is a big plus;\n\n* You seek solutions that are elegant, yet pragmatic;\n\n* Good communication skills. Strong enthusiasm to learn.\n You communicate complex data solutions across teams and clients;\n\n* You are not afraid to talk with customers;\n\n* B.A., M.Sc. or equivalent experience in Computer Science, Engineering, or another relevant technical field.\n\n\n\n\nWHY US?\n\n\n* You will be able to personally and professionally grow with a young and striving company through technically challenging, diverse and increasingly international projects\n\n* Beyond an individually negotiated compensation package including company shares, you will enjoy a unique combination of professional opportunities, entrepreneurial spirit, technological excellence and industry exposure\n\n* Flat hierarchies and a large amount of individual responsibility\n\n* Flexible working hours and a pleasant working environment\n\n* Partial home office solution is possible\n\n* 28 days off\n\n* Tools: Macbook Pro, InteliJ Ultimate\n\n\n

See more jobs at Hypatos

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Improbable


Senior Software Engineer Reliability Engineering

Senior Software Engineer Reliability Engineering


Improbable


dev

senior

sys admin

engineer

dev

senior

sys admin

engineer


👁 10,007 viewed | ✍️ 140 applied (1%)
This job post is archived and the position is probably filled. Please do not apply.
Your Mission\n\nThe Reliability Engineering organisation aims to provide easy-to-use and useful tools and frameworks for both the Game Technology Unit and First Party Studios to enable them to test, release and operate their high quality products (e.g. components/services for the Games Technology Unit; and games for the First Party Studios),  quickly, reliably, repeatedly, safely with confidence which leads to higher customer satisfaction and more successful and resilient products. \n\nWe thrive for a faster, more effective, more flexible game development.\n\nAreas for Impact\n\nOur engineering teams are focussed on improving the stability and throughput of the products released by the Game Technology Unit and First Party Studios. We want to achieve this through different strategies.\n\nSome of the things you can impact:\n\n\n\n* Develop easy to use and useful tools and frameworks to: track, benchmark and alarm performance metrics, Develop capabilities to execute unit, end-to-end and performance tests and collect and display stack-trace and crash dumps\n\n* Implement the automation for repetitive tasks (report generation, playtests, dashboarding)\n\n* Implement continuous integration and delivery for the software stacks we support. \n\n* Educate teams to the software development best practices via consultancy, communities of practice, bottom up grassroots.\n\n* Being the domain expert and voice of quality and reliability through testing, automation, continuous integration, delivery and monitoring.\n\n\n\nWe'd like to hear from you if you identify with the following \n\n\n* Strong Object-Oriented software programming and design knowledge with one or more of the following: Java, C#, C++, Go\n\n* Developed software using Agile and modern development practices, including test automation at the various levels (i.e. unit, integration, end-to-end, performance tests).  \n\n* Love solving hard problems and developing simple tools and processes so everyone can solve those hard problems.\n\n* Have the ability and desire to help other developers improve their development, workflow and testing practices.\n\n* Released software in production via continuous integration and delivery systems (i.e. Jenkins, Buildkite, or other commercial solutions) and familiar with their setup and maintenance.\n\n* Experience with game engines (Unreal, Unity), game development and testing best practices.\n\n* Comfortable working in an environment with a high level of ambiguity.\n\n* Familiar with cloud services (i.e. Microsoft Azure, Google GCP or Amazon AWS)\n\n\n

See more jobs at Improbable

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Roon Labs

 

Lead Site Reliability Engineer

Lead Site Reliability Engineer  


Roon Labs


exec

sys admin

engineer

admin

exec

sys admin

engineer

admin


👁 1,479 viewed | ✍️ 159 applied (11%)
This job post is archived and the position is probably filled. Please do not apply.
\nWHAT YOU'LL DO\n\n\n* Lead the effort in creating the next generation of our deployment and operations infrastructure on Google Cloud Platform.\n\n* Engage in and improve the whole lifecycle for services: from inception and design, through deployment, to operation and optimization.\n\n* Be accountable for the technical vision and long-term technology strategy.\n\n* Work closely with your product peers to define product vision and strategy.\n\n* Maintain services once they are live by measuring and monitoring.\n\n* Scale systems proactively and sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.\n\n* Practice sustainable incident response and blameless postmortems – and teach others to do the same by example.\n\n\n

See more jobs at Roon Labs

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

ProKeep


System Engineer

verified

System Engineer


ProKeep


sys admin

engineer

admin

sys admin

engineer

admin


👁 1,931 viewed | ✍️ 172 applied (9%)
This job post is archived and the position is probably filled. Please do not apply.
\nSystems Engineer \n\nProKeep is seeking a systems/cloud engineer for a messaging platform that is experiencing rapid growth in the construction industry. This engineer will focus on improving our existing infrastructure and  application performance along with handling general devops/cloud engineering tasks.\n\n This is an exciting opportunity for a candidate who wants to grow with an established startup company!\n\nAbout ProKeep\n\nProKeep is a software company that has developed a messaging platform for the $100+ billion wholesale distribution market (i.e. plumbing, electrical, HVAC, etc). We are post product, post revenue and growing fast in the US and Canada. Our team is small, nimble, and devoted to making our customers lives easier with simple to use technology. We envision a world where distributors using our tools are more efficient and able to build stronger relationships with their customers.\n\nResponsibilities\n\nWe are looking for a dedicated systems engineer who is excited to make significant contributions to our integrated stack, data layer, and overall performance, security, and reliability. You will be primarily focused on systems and cloud engineering, but we are a small and highly collaborative team, so from time to time you may also take a diversion into other adjacent tech realms. You don’t have to be an expert with all the tools in our stack, but an interest in working on projects at all levels of the stack will be helpful as we expand the breadth of our platform. You should have strong opinions about tools and architecture, but should also be able to explain those opinions persuasively, not abrasively. You should also take pride in writing performant, logically structured, and readable code, config, and documentation, but be pragmatic enough to realize that sometimes shipped code is better than perfect code. Some training will be provided.\n\nRequirements\n\n\n* 5+ years experience working as a cloud/systems engineer.\n\n* general system infrastructure maintenance and improvement\n\n* shell scripting\n\n* linux system administration\n\n* docker\n\n* AWS services: ECS, CloudFront, SQS, S3, ECS, ALB, IAMs\n\n* configuration management\n\n* noSQL databases\n\n* app/systems security\n\n* envoy/service mesh\n\n* devops and CI/CD (ideally, GitLab CI) a plus.\n\n* Experience working in a geographically distributed team.\n\n* Desire to collaborate with other developers and ability to communicate over various channels (email, phone, Slack, Google Hangouts, etc.)\n\n\n\n\nIdeal Skills and Experience\n\n\n* Elixir/Erlang or strong desire to learn a plus.\n\n* Postgres/RDS capacity and performance improvements.\n\n* performance tests with jmeter\n\n* AWS/ECS-cli\n\n* Node experience a plus - our ECS orchestration code is written in Node.\n\n\n\n\nWorking Relationship\n\nThis is a full-time salaried position starting immediately. At this time we can only hire US based\n\nfull-time employees and cannot consider consultants or contractors. This is a remote work opportunity, but you should be able to work core hours in U.S. pacific time.\n\nNext Steps\n\nIf interested, please email [email protected] to introduce yourself and start the process.

See more jobs at ProKeep

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Covr


Site Reliability Engineer

Site Reliability Engineer


Covr


sys admin

engineer

admin

sys admin

engineer

admin


👁 233 viewed | ✍️ 8 applied (3%)
This job post is archived and the position is probably filled. Please do not apply.
\n\n* Design and provision infrastructure to support cloud native applications.\n\n* Manage and enhance our automation to provide Continuous Integration and Delivery services.\n\n* Contribute to the design and implementation of a micro-services architecture.\n\n* Deep-dive into client or server systems to optimize for performance, maintainability, scalability, extensibility as needed.\n\n* Extend the capabilities of our platform via custom partner integrations.\n\n* Work with a geographically distributed team.\n\n\n\n\nPreferred Qualifications\n\n\n* 5+ years of progressive professional software engineering experience\n\n* 3+ years of designing and implementing public cloud solutions (AWS/Azure)\n\n* 3+ years experience managing unix/linux systems\n\n* Demonstrable proficiency in one or more Infrastructure-as-Code tools (Ansible, Chef, CloudFormation, Terraform, etc)\n\n* Experience with the architecture and design of distributed software systems\n\n* Expertise in a variety of relational and non-relational data stores\n\n* Experience working with Docker containers and Kubernetes or another container scheduler\n\n* Bachelor's Degree in Computer Science or equivalent experience\n\n\n

See more jobs at Covr

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Eidosmedia

 

Linux System Engineer

Linux System Engineer  


Eidosmedia


sys admin

engineer

linux

admin

sys admin

engineer

linux

admin


👁 2,451 viewed | ✍️ 396 applied (16%)
This job post is archived and the position is probably filled. Please do not apply.
\nWe are looking for a Linux System Engineer that will join our Hotline support team providing 24x7x365 hotline service (Level 2 support).\n\nCompany Profile\nEidosmedia is a world leader in content management and digital publishing solutions. We produce software that covers the entire lifecycle of content, from authoring, management, workflow, design, to sharing, publishing and delivery, with open technologies and modern frameworks.\nAs established innovators and disruptors, we help our customers maximize the productivity and flexibility of their operations through the application of modern, digital technologies.\n\nWhat you will do\nYou will be serving as the first point of contact for customers seeking technical assistance over the phone or email, providing first level contact and conveying resolutions to their issues.\nYou will be also responsible for performing remote troubleshooting through diagnostic techniques and investigating user problems (systems, hardware or software) identifying their source, determining possible solutions in cooperation with support teams and advising user on appropriate action.\n\nWhat we offer\nWe are an international organisation, and you will deal with people all over the world. We love what we do and we love doing it well. Our enthusiasm drives us to take pride and pleasure in the routine delivery of excellence. We recognise that our people are our most valuable resource so we aim to create an environment where everyone can reach his/her full potential.\n\nPersonal Profile\n\n\n* Computer Engineering, Computer Science degree or IT High School diploma\n\n* Good knowledge of Linux Server\n\n* Good knowledge of Web and Application Server (Tomcat, Nginx, Apache)\n\n* Basic knowledge of framework HA (Veritas/Monit/RHCS)\n\n* Ability to troubleshoot and diagnose problems and to analyze log       \n\n* Good knowledge of scripting (Bash or Python on a UNIX or Linux platform)\n\n* Good English skills\n\n* Teamwork and excellent communication skills\n\n\n\n\nMore about us\nEidosmedia welcomes diversity among its people: whatever their race, religion, age, sex or sexual orientation, everybody’s contribution is valued equally in an atmosphere of mutual respect and regard.\n\nIf you think you have what it takes to succeed in this role then apply immediately!

See more jobs at Eidosmedia

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

White Hat Gaming


Site Reliability Engineer

verified
🌏 Worldwide

Site Reliability Engineer


White Hat Gaming

🌏 Worldwide

sre

devops

aws

sys admin

sre

devops

aws

sys admin


👁 2,838 viewed | ✍️ 4 applied (0%)
This job post is archived and the position is probably filled. Please do not apply.
You will be a part of a team that is responsible for deploying, supporting, monitoring and troubleshooting large scale micro-service based distributed systems host in AWS and bare metal with high transaction volume; documenting the IT infrastructure, policies and procedures.\n\n# Responsibilities\n As a Senior SRE/DevOps engineer you will focus on supporting our team and augment our existing development infrastructure by implementing the automations necessary to streamline our pipeline. Tooling includes Terraform, Kubernetes, ELK and AWS services. \n\n# Requirements\n* A minimum of 5 years experience deploying, monitoring and troubleshooting large scale distributed systems\n* Background in Linux administration\n* Scripting/programming knowledge of at least Unix shell scripting\n* Good networking understanding (TCP/IP, DNS, routing, firewalls, etc.)\n* Good understanding of technologies such as Apache, Nginx, Databases (relational and key-value), DNS servers, etc\n* Understanding of cloud-based infrastructure, such as AWS\n* Experience with systems for automating deployment, scaling and management of containerised applications, such as Kubernetes\n* Experience with Terraform for infrastructure\n* Quick to learn and fast to adapt to changing environments\n* Excellent communication and documentation skills\n* Excellent troubleshooting and creative problem-solving abilities\n* Excellent communication and organisational skills in English\n\n Ideally, candidates will also have\n* Experience deploying and supporting multiple staging/dev environments\n* Experience maintaining continuous integration and delivery pipelines with tools such as Jenkins and Spinnaker\n* Experience implementing, operating and supporting open source tools for network and security monitoring and management on Linux/Unix platforms\n* Experience with Postgres\n* Experience with security in AWS\n\n\n#Location\n- 🌏 Worldwide

See more jobs at White Hat Gaming

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

PeopleDoc

 

Cloud Devops Engineer Site Reliability Engineer

Cloud Devops Engineer Site Reliability Engineer  


PeopleDoc


devops

cloud

sys admin

engineer

devops

cloud

sys admin

engineer


👁 2,066 viewed | ✍️ 324 applied (16%)
This job post is archived and the position is probably filled. Please do not apply.
\nThe mission of a SRE at PeopleDoc is to secure, administrate and maintain the production infrastructure as it were a software, you will contribute in building our fault tolerant, highly scalable and low latency services on virtual and bare-metal servers over data centers in different regions of the globe. We are looking for a Software Engineer with a combination of strong skills in modern software development and sysadmin knowledge.\n\nWe are hiring in our Paris office, but Remote workers are welcome too!\n\nThe successful candidate will be required to:\n\n\n* Design, code and maintain the cloud infrastructure hosting PeopleDoc services\n\n* Collect and monitor KPIs (availability, response time, time to deploy) and ensure that they meet our SLAs\n\n* Lead the scalability & capacity planning strategy\n\n* Work with other teams to identify, troubleshoot, and resolve high impact issues\n\n* Team player with good communication skills\n\n\n\n\nCompetencies required:\n\n\n* Experience with at least one programming language (Python, Java or Go) and modern software development practices\n\n* Experience in automation tools (Ansible, Salt, Puppet or Chef) and CI/CD principles\n\n* Experience with Cloud services (AWS, GCP or Openstack) and its APIs\n\n* Good Linux system administration skills (DNS, RabbitMQ, Redis or HAProxy)\n\n* Good Networking knowledge (TCP/IP, Linux routing and firewall)\n\n\n\n\n\nTypical Interview Process:\n\n\n* If your application is selected, a Recruiter will reach out to schedule a phone screen with them.\n\n* If selected to move forward, you will complete a HackerRank Coding Assessment.\n\n* If you pass, you will either move forward to a technical phone call for an additional screening, OR directly to an onsite interview.\n\n* Offer stage.\n\n\n

See more jobs at PeopleDoc

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Concentric Sky

 

Site Reliability Engineer

Site Reliability Engineer  


Concentric Sky


sys admin

engineer

admin

sys admin

engineer

admin


👁 2,008 viewed | ✍️ 296 applied (15%)
This job post is archived and the position is probably filled. Please do not apply.
\nConcentric Sky is looking for excellent people to add to our team!\nWe are looking for intelligent people who are passionate about technology, detail oriented, and capable of working in a fast paced and collaborative environment. \nConcentric Sky develops web and mobile applications for clients in various sectors from education to industry to incubators.  We have an amazing team of more than 70 highly skilled technical professionals. We are a collaborative group of relaxed, cool people who share a love for technology and the positive impact it's having on our world. We offer a fantastic work environment with lots of perks, flexibility, freedom, and a full benefits package. See why we love Concentric Sky.\nJob Overview:\nWe are looking for a Site Reliability Engineer (SRE) to join our team. The SRE is responsible for the planning and implementation of our internal application: Badgr. The SRE relies on extensive experience and judgment to succeed.\nJob Requirements:\n* Hands-on experience building and supporting applications running on Amazon Web Services (AWS)\n* Hands-on experience using Ansible and Terraform\n* Hands-on experience with container-based deployments & service orchestration (e.g. Docker)\n* Hands-on experience with at least one programming and/or scripting languages (e.g. Python, Go)\n* Hands-on experience building and managing release systems, code merging and promotion, and CI/CD workflows and tools\n* Ability to debug and optimize code and to automate routine tasks\n* Strong Linux system administration, networking, security skills\n* Demonstrated experience using collaborative development using Git\n* Self-Starter - ability to quickly learn new tools and products\n* Excellent written and verbal communication skills as exemplified by clear issue explanations, documentation and effective intra- and inter-group communications\n* Candidates must be eligible to work in the United States (citizenship, green card, visa, etc)\n\n\nJob Responsibilities:\n* Design, develop and implement cloud-based (AWS) technology solutions\n* Maintain and support CI/CD control software (e.g. Jenkins)\n* Stay up to date on emerging technologies\n* Perform on-call duties\n\n\nStrong plus / Success factors:\n* MongoDB database administration\n* Data driven application performance analysis and optimization\n\n\nCompensation & Benefits:\nCompensation is based on your experience. We offer great perks and benefits including a fully stocked snack kitchen, kegerator, lunch delivery, tech toys, generously flexible work schedules, paid time off, excellent health, dental, and vision insurance plans, a 401k, and an FSA!\nIf you are an innovator and want an excellent opportunity to put your skills to work, learn some new ones, and be part of a cutting-edge team, please send us your resume. No phone calls please.\nWe look forward to hearing from you.\nConcentric Sky, Inc., is an equal opportunity employer.

See more jobs at Concentric Sky

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Hypothesis.is

 

Senior Site Reliability Engineer

Senior Site Reliability Engineer  


Hypothesis.is


senior

sys admin

engineer

admin

senior

sys admin

engineer

admin


👁 1,784 viewed | ✍️ 233 applied (13%)
This job post is archived and the position is probably filled. Please do not apply.
\nLocation: Remote. Candidates must be located between UTC-6 and UTC+2 time zones.\n\nSummary\n\nHypothesis is seeking a Senior Site Reliability Engineer to join our product delivery team and lead our work to help us build efficient, reliable, secure, and scalable infrastructure and code. This role combines the activities of development and site reliability engineering to ensure Hypothesis technologies and services support our vision of a world where annotation is as common as comments, but more useful and engaging. Join us as we extend what the web can do.\n\nAbout the role\n\nReporting to the Engineering Manager, the Senior Site Reliability Engineer leads the work to build, document and maintain efficient, reliable, scalable, secure and easy-to-use operations including deployment, QA and production environments, and monitoring.\n\n\n* Infrastructure:\n\n\n* Provision and administer infrastructure (hosts, cloud services, monitoring tools, etc.) for highly reliable and scalable web applications and data stores\n\n* Document our operations systems so that the whole team can understand and operate them. \n\n* Oversee deployment of Hypothesis application servers\n\n\n\n\n\n* Automation:\n\n\n* Build automated tooling to configure and maintain our systems and services\n\n* Guide the team in the best way to use configuration management to grow and administer our services\n\n\n\n\n\n* Performance, reliability, security, and scaling:\n\n\n* Identify and solve performance, reliability, security, and scaling issues in our stack\n\n* Stress test our stack to find cracks in the system and help us scale\n\n\n\n\n\n* Auditing for security vulnerabilities at regular intervals, and enacting the practices set forth in our security policy.\n\n\n\n\nSkills and experience you possess\n\n\n* You have experience in software development, site reliability, and backend/infrastructure engineering for an organization experiencing fast-paced growth.\n\n* You are knowledgeable in configuration management with a framework such as Ansible or Terraform.\n\n* You understand the ins and outs of AWS, Linux, and PostgreSQL well enough to teach others how to use them, and can comfortably operate all of them from the CLI.\n\n* You are proficient with a programming language like Python or Ruby, and with shell scripting.\n\n* You are familiar with security best practices and have helped to audit for and remediate security vulnerabilities in infrastructure.\n\n* Your documentation and verbal communication skills are excellent, and you’re able to collaborate and rally support with people on and off your team.\n\n* You are inclined to automate, but can discern when automation isn’t the best solution and present alternatives.\n\n* You’ve worked with continuous integration and deployment systems, and have ideas about how to build and improve them.\n\n* You strongly believe in the importance of security, and enjoy the idea of partnering with engineers to ensure the integrity of our customers’ data.\n\n* You have experience with remote work and understand the importance of good time management, self-motivation, and self-discipline as a remote worker. \n\n\n\n\nAbout you\n\nYou are someone who loves problem solving. You value simplicity over complexity. You take great satisfaction in helping others be more successful and productive and wouldn’t think to move on without documenting your work so 6-months-from-now you (or anybody else for that matter) can drop back in and understand it. We are interested in someone who wants to help everyone around them better understand how to operate software at scale and who is eager to take on the responsibilities outlined for this role. \n\nYou will be successful at Hypothesis if you:\n\n\n* Love learning new things,\n\n* Unafraid to ask questions \n\n* Are committed to improving both as a technologist and a human being,\n\n* Are tenacious, self-directed, and highly motivated,\n\n* Enjoy helping others around you grow as developers and be successful,\n\n* Communicate clearly and effectively (this is especially important in a remote organization), and\n\n* Approach your work with a mindset that allows for growth and change.\n\n\n\n\nWhat’s next\n\nDoes this sound interesting? Drop us a line to tell us what about this role intrigues you and why you think you would be great for Hypothesis. Resumes are helpful, but so are examples of your recent work. We can’t wait to hear from you!

See more jobs at Hypothesis.is

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

REPAY

 

Senior Site Reliability Engineer

Senior Site Reliability Engineer  


REPAY


senior

sys admin

engineer

admin

senior

sys admin

engineer

admin


👁 1,797 viewed | ✍️ 201 applied (11%)
This job post is archived and the position is probably filled. Please do not apply.
\nREPAY is looking for an experienced Senior Site Reliability Engineer to be part of our Agile and growing Technology team in expanding our core payment processing platform and products within the consumer finance industry.  Work in a small team, where you will have an immediate and measurable impact on our fast-growing business by helping us reach the next level and stage of growth. Your expertise building and managing scalable cloud hosted systems with open source components and tools will be critical in this role. \n\nYou’ll be responsible for:\n\n\n* Working with Terraform, Kubernetes, AWS, Jenkins, Packer, Envoy, Gloo, Bash, Go and Python on a daily basis\n\n* Playing a key role as a Site Reliability Engineer where you will be responsible for the overall success of product/solution deployment including designing automated installations, maintenance of stable production environments and on-time production releases.\n\n* Building tooling for self-service for developers to facilitate a culture of DevOps across the entire development org and not just a DevOps team\n\n* Continuously hone and improve our technology stack to keep up with the state of the art\n\n* Participating in company’s off-hours on-call rotation program\n\n\n\n\nBasic Skills & Requirements\n\n\n* 6+ years of experience building highly available systems and supporting SaaS cloud-based infrastructure\n\n* Experience with terraform, packer, Kubernetes or other Infrastructure/Configuration-as-code tooling\n\n* Experience using at least one scripting language such as Python, bash, PowerShell or similar\n\n* Experience deploying to cloud-based technology (AWS)\n\n* Experience with continuous integration (CI/CD) and automated build tools such as TeamCity and Jenkins\n\n* Ability to tackle problems both at large scale and the small scale\n\n* Good understanding of application/infrastructure security\n\n* Strong communication skills; ability to communicate with distributed teams\n\n\n\n\nPreferred Skills & Requirements\n\n\n* Good understanding of Payments processing and/or developing Payments products\n\n* Familiarity of PCI compliance\n\n* BS in Computer Science, Software Engineering, Computer Engineering, or equivalent experience\n\n\n

See more jobs at REPAY

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

amplified ai

 

Site Reliability Engineer

Site Reliability Engineer  


amplified ai


sys admin

engineer

admin

sys admin

engineer

admin


👁 860 viewed | ✍️ 103 applied (12%)
This job post is archived and the position is probably filled. Please do not apply.
\nWe're growing and we're looking for a dedicated Site Reliability Engineer to help us automate operations and build an infrastructure for growth. We've found product market fit and now we're working to ensure that our systems are available, performant, and visible as we develop architecture that is scalable, cost-effective, secure, and reliable. Your role will be to help us meet those goals and set even more ambitious ones, looking forward to design systems that will take us into the future while helping us become hyper-aware of what's happening in our systems right now.\n\nThis is an opportunity to engage with cutting-edge technology and work on a real-world problem at global scale. In addition to competitive compensation and benefits there is also room for the right person to take on increased responsibilities. And it’s a lot of fun (although fast-paced and even chaotic at times) working as part of a small, passionate team.\n\nResponsibilities\n\n\n* Take ownership of our infrastructure as code, which is currently in Terraform\n\n* Lead our DevOps culture, encouraging and enabling developer effectiveness through powerful and secure tooling\n\n* Keep us abreast of what's happening with our systems and our customers up to the second — we have one tracing obsessive in the team and we're all trying to be a bit more like him\n\n* Expand and improve our CI and CD systems\n\n* Help us develop and uphold SLIs and SLOs\n\n* Develop and maintain a (blameless) postmortem practice\n\n* Make monitoring and alerting alert on symptoms and not on outages\n\n\n\n\nQualifications\n\n\n* Excellent systems thinking: edge cases, failure modes, behaviours, specific implementations\n\n* Experience in a DevOps oriented role\n\n* Experience with at least one major cloud provider (we use AWS)\n\n* Experience with infrastructure as code, especially Terraform\n\n* Familiarity with and interest in security best practice\n\n* Operational experience with containers — Kubernetes a plus\n\n* Strong programming and shell scripting skills\n\n* An obsession with documentation\n\n* Ability to thrive in a remote-first team\n\n\n

See more jobs at amplified ai

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Bold Penguin

 

Cloud Site Reliability Engineer

Cloud Site Reliability Engineer  


Bold Penguin


cloud

sys admin

engineer

admin

cloud

sys admin

engineer

admin


👁 1,969 viewed | ✍️ 243 applied (12%)
This job post is archived and the position is probably filled. Please do not apply.
\nWe didn’t create Bold Penguin because commercial insurance is broken. It isn’t. But as the world has gotten more connected and digitized, commercial insurance lags behind—creating a fragmented landscape where businesses, agents, and insurance companies struggle to interact in a smooth and easy way. That’s why we’ve built a highly efficient exchange that cuts the friction out of commercial insurance by connecting everyone to the right quote in record time.\n\nPowering the world of insurance is no small feat, so we’ve brought on a team that's not only incredibly talented but also passionate about our potential to upgrade the entire industry. As more and more companies big and small depend on our technology to operate in the commercial insurance space, we’ll need the best talent all around to support our growth. That’s why we’re looking at you (yes, you!) to make a bold move and join our adventure.\n\nYour  Role\n\nAs a Cloud & Site Reliability Engineer, you will be a subject matter expert in building highly reliable, highly scalable features and infrastructure. You’ll use DevOps principles to ensure that Bold Penguin’s software systems are always available and ready to scale to meet growing demands. \n\nClick here to learn more about DevOps on the glacier\n\nWhat You’ll Do\n\n\n* Ensure the reliability, performance, and availability of our platform by working as part of a cross-functional product team\n\n* Participate in agile ceremonies such as iteration planning, retrospective, and daily standups\n\n* Be part of the shared on-call rotation and proactively research possible issues affected the availability of our platform\n\n* Understand and clearly articulate tradeoffs in architecture decisions with regards to cost, security, operational efficiencies, performance, and availability\n\n* Build and maintain infrastructure with executable code (IaC) and automated delivery pipelines\n\n* Be passionate about Cloud/DevOps/SRE concepts such as Immutable Infrastructure, Cattle vs Pets, Infrastructure as Code, Delivery Pipelines\n\n\n\n\nSkills & Qualifications\n\n\n* Deep, hands-on expertise with AWS Cloudformation and other Infrastructure as Code tools\n\n* Experience with Amazon Web Services; specifically EC2, ECS, ELB, CodePipeline, RDS, Redshift, S3, IAM, and Lambda\n\n* Ability to articulate Cloud & DevOps concepts to a variety of technical & non-technical team members\n\n* Bonus points for expertise in implementing security & compliance frameworks such as SOC/2, NIST 800-53, and NIST 800-171 especially in Amazon Web Services\n\n* Bonus points for AWS Certifications \n\n* Bonus points for familiarity with microservices architectures, Ruby on Rails and/or ETL tools such as Fivetran.\n\n* Experience working at technology companies and startups desirable\n\n* 2-4 years + of working remote, full time, and/or with full time co-located teams across different time zones.\n\n\n\n\nBONUS POINTS\n\n\n* Full-stack expertise in multiple tiers of modern web applications (e.g. front end, back end, infrastructure, etc.)\n\n* Open-source contributions and/or speaking experience.\n\n* Previous work experience in insurance and/or experience with policy rating very desirable.\n\n* You love Penguins! ;P\n\n\n\n\nTRAVEL TO THE "GLACIER" (please read)\n\n\n* We are firm proponents of "seeing eye to eye by meeting face to face". As such, our remote team travels in once a quarter for a full day of collaboration, goal setting, team building, etc.  Are you able to make this work?  In addition to this we also ask that, if hired, you are able to make the first week onsite for onboarding/training. \n\n\n\n\n\nPENGUIN PERKS\n\n\n* For a healthy colony.\n\n\n\n* Our plan covers 50% of your Medical Premiums – Health - HRA, Dental, Vision, and Life Insurance, as well as Short & Long Term Disability (Trust us, the benefits are great!)\n\n\n\n\n\n\n\n* Penguins plan for the future.\n\n\n\n* 401k Match program, up to 4%! \n\n\n\n\n\n\n\n* Parental Leave\n\n\n\n* 16 weeks of parental leave (your kids need you there!)\n\n\n\n\n\n\n\n* Need a vacation?\n\n\n\n* Unlimited PTO - Please take a vacation - you need it and we applaud it and in fact we require you take 10 days off!\n\n\n\n\n\n\n\n* Hungry? Thirsty?\n\n\n\n* We offer free snacks and drinks, as well as catered lunch every Monday (even to our remote employees...nomb nomb nomb)\n\n\n\n\n\n\n\n* Penguins need to learn!\n\n\n\n* We support your professional growth. Certifications, training, memberships, and conferences are actively encouraged—and often covered.\n\n\n\n\n\n\n\n* Penguins are social creatures and love to play!\n\n\n\n* We have frequent happy hours, company events, and outings. What kind of company would we be if we didn't have some fun!?!? \n\n\n\n\n\n\n\n* Penguins give back.\n\n\n\n* We offer volunteer opportunities every month!  There is no better feeling than giving back =)\n\n\n\n\n\n\n\n* Don’t want to move to Columbus?\n\n\n\n* We offer up to 100% remote engineers!\n\n* You must be OK visiting the office for a day or two every quarter - we are all about that camaraderie! \n\n\n\n\n\n\nPenguins believe in inclusion. That’s why we’re proud to be an equal opportunity employer that considers all qualified applicants regardless of race, color, religion, gender identity or expression, sexual orientation, national origin, genetics, disability, age, veteran status, beak size, or inability to fly.

See more jobs at Bold Penguin

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Quizlet

 

Snr Site Reliability Engineer Quizlet SF Denver

Snr Site Reliability Engineer Quizlet SF Denver  


Quizlet


golang

sys admin

engineer

admin

golang

sys admin

engineer

admin


👁 1,764 viewed | ✍️ 204 applied (12%)
This job post is archived and the position is probably filled. Please do not apply.
unknown, Unknown - - Company: Quizlet.com- Technial Recruiting partner: SourceCoders.io- Location: Onsite in San Francisco or Denver or Remote for CST or EST based candidates - Compensation: $120K-$200K (heavily dependent on experience and work location)- Work...

See more jobs at Quizlet

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Wikimedia Foundation


Site Reliability Engineer

Site Reliability Engineer


Wikimedia Foundation


sys admin

engineer

admin

sys admin

engineer

admin


👁 107 viewed | ✍️ 1 applied (1%)
This job post is archived and the position is probably filled. Please do not apply.
\nThe Wikimedia Foundation is hiring two Site Reliability Engineers to support and maintain (1) the data and statistics infrastructure that powers a big part of decision making in the Foundation and in the Wiki community, and (2) the search infrastructure that underpins all search on Wikipedia and its sister projects. This includes everything from eliminating boring things from your daily workflow by automating them, to upgrading a multi-petabyte Hadoop or multi-terabyte Search cluster to the next upstream version without impacting uptime and users.\n\nWe're looking for an experienced candidate who's excited about working with big data systems. Ideally you will already have some experience working with software like Hadoop, Kafka, ElasticSearch, Spark and other members of the distributed computing world. Since you'll be joining an existing team of SREs you'll have plenty of space and opportunities to get familiar with our tech (Analytics, Search, WDQS), so there's no need to immediately have the answer to every question.\n\nWe are a full-time distributed team with no one working out of the actual Wikimedia office, so we are all together in the same remote boat. Part of the team is in Europe and part in the United States. We see each other in person two or three times a year, either during one of our off-sites (most recently in Europe), the Wikimedia All Hands (once a year), or Wikimania, the annual international conference for the Wiki community.\n\nHere are some examples of projects we've been tackling lately that you might be involved with:\n\n\n*  Integrating an open-source GPU software platform like AMD ROCm in Hadoop and in the Tensorflow-related ecosystem\n\n*  Improving the security of our data by adding Kerberos authentication to the analytics Hadoop cluster and its satellite systems\n\n*  Scaling the Wikidata query service, a semantic query endpoint for graph databases\n\n*  Building the Foundation's new event data platform infrastructure\n\n*  Implementing alarms that alert the team of possible data loss or data corruption\n\n*  Building a new and improved Jupyter notebooks ecosystem for the Foundation and the community to use\n\n*  Building and deploying services in Kubernetes with Helm\n\n*  Upgrading the cluster to Hadoop 3\n\n*  Replacing Oozie by Airflow as a workflow scheduler\n\n\n\n\n\nAnd these are our more formal requirements:\n\n\n*    Couple years experience in an SRE/Operations/DevOps role as part of a team\n\n*    Experience in supporting complex web applications running highly available and high traffic infrastructure based on Linux\n\n*    Comfortable with configuration management and orchestration tools (Puppet, Ansible, Chef, SaltStack, etc.), and modern observability       infrastructure (monitoring, metrics and logging)\n\n*    An appetite for the automation and streamlining of tasks\n\n*    Willingness to work with JVM-based systems  \n\n*    Comfortable with shell and scripting languages used in an SRE/Operations engineering context (e.g. Python, Go, Bash, Ruby, etc.)\n\n*    Good understanding of Linux/Unix fundamentals and debugging skills\n\n*    Strong English language skills and ability to work independently, as an effective part of a globally distributed team\n\n*    B.S. or M.S. in Computer Science, related field or equivalent in related work experience. Do not feel you need a degree to apply; we value hands-on experience most of all.\n\n\n\n\n\n\nThe Wikimedia Foundation is... \n\n...the nonprofit organization that hosts and operates Wikipedia and the other Wikimedia free knowledge projects. Our vision is a world in which every single human can freely share in the sum of all knowledge. We believe that everyone has the potential to contribute something to our shared knowledge, and that everyone should be able to access that knowledge, free of interference. We host the Wikimedia projects, build software experiences for reading, contributing, and sharing Wikimedia content, support the volunteer communities and partners who make Wikimedia possible, and advocate for policies that enable Wikimedia and free knowledge to thrive. The Wikimedia Foundation is a charitable, not-for-profit organization that relies on donations. We receive financial support from millions of individuals around the world, with an average donation of about $15. We also receive donations through institutional grants and gifts. The Wikimedia Foundation is a United States 501(c)(3) tax-exempt organization with offices in San Francisco, California, USA.\n\nThe Wikimedia Foundation is an equal opportunity employer, and we encourage people with a diverse range of backgrounds to apply.\n\nU.S. Benefits & Perks*\n\n\n* Fully paid medical, dental and vision coverage for employees and their eligible families (yes, fully paid premiums!)\n\n* The Wellness Program provides reimbursement for mind, body and soul activities such as fitness memberships, baby sitting, continuing education and much more\n\n* The 401(k) retirement plan offers matched contributions at 4% of annual salary\n\n* Flexible and generous time off - vacation, sick and volunteer days, plus 19 paid holidays - including the last week of the year.\n\n* Family friendly! 100% paid new parent leave for seven weeks plus an additional five weeks for pregnancy, flexible options to phase back in after leave, fully equipped lactation room.\n\n* For those emergency moments - long and short term disability, life insurance (2x salary) and an employee assistance program\n\n* Pre-tax savings plans for health care, child care, elder care, public transportation and parking expenses\n\n* Telecommuting and flexible work schedules available\n\n* Appropriate fuel for thinking and coding (aka, a pantry full of treats) and monthly massages to help staff relax\n\n* Great colleagues - diverse staff and contractors speaking dozens of languages from around the world, fantastic intellectual discourse, mission-driven and intensely passionate people\n\n\n\n\n*Eligible international workers' benefits are specific to their location and dependent on their employer of record

See more jobs at Wikimedia Foundation

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Source Coders


Snr Site Reliability Engineer

Snr Site Reliability Engineer


Source Coders


sys admin

engineer

admin

sys admin

engineer

admin


👁 86 viewed | ✍️ 1 applied (1%)
This job post is archived and the position is probably filled. Please do not apply.
\n\n* Company: Quizlet.com\n\n* Technial Recruiting partner: SourceCoders.io\n\n* Location: Onsite in San Francisco or Denver or Remote for CST or EST based candidates \n\n* Compensation: $120K-$200K (heavily dependent on experience and work location)\n\n* Work visas accepted: US Citizen, Green Card, H-1B transfer, TN Visa\n\n\n\n\nQuizlet’s mission is to help students (and their teachers) practice and master whatever they are learning. Every month more than 50 million active learners from 130 countries practice and master more than 300 million study sets on every conceivable topic and subject. We are developing new learning experiences by modeling how students learn and drawing upon knowledge acquisition, retention, and pedagogy in cognitive science. We are always seeking to help students master any subject by optimizing study efficiency and engagement. Want to be a go-to person for site reliability on the most-used learning platform in the U.S.? Want to work on a service that is rapidly scaling and relied upon by millions of students and teachers worldwide?  Quizlet is an indispensable utility used daily by millions of students and teachers around the globe. If our site goes down, even just for a few minutes, the pain is felt intensely. Speed is crucial, and downtime is not an option as we grow — during the school year, we are in the top 20 most-visited websites in the U.S. These are challenges you will face on day one at Quizlet.\n\nWhat you'll do\n\n\n\n\n* Engage with service owners to improve the entire service lifecycle — from inception and design, through deployment, operation, maintenance, and sunset.\n\n* Help service owners drive their services through the service lifecycle through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.\n\n* Help service owners maintain their services once they are live by measuring and monitoring availability, latency, and overall system health.\n\n* Help scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity.\n\n* Practice and evangelize sustainable incident response and blameless postmortems.\n\n\n\n\n\n\nWhat we are looking for\n\n\n\n\n* Experience in designing, analyzing and troubleshooting distributed systems serving production traffic.\n\n* Experience with algorithmic thinking, data structures, and software complexity.\n\n* Experience in writing scripts in one or more languages such as Python or Go\n\n* Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.\n\n* Ability and desire to debug and optimize code and automate routine tasks.\n\n* Experience with on-call duty, know why it’s hard, work to improve it, and make it so well documented that every engineer wants to be on rotation.\n\n* {Passion|Interest|Experience} with automation of code testing and deployment through the use of containers.\n\n\n\n\n

See more jobs at Source Coders

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Source Coders


Senior Site Reliability Engineer

Senior Site Reliability Engineer


Source Coders


senior

sys admin

engineer

admin

senior

sys admin

engineer

admin


👁 97 viewed | ✍️ 1 applied (1%)
This job post is archived and the position is probably filled. Please do not apply.
\n\n* Company: Quizlet.com\n\n* Technial Recruiting partner: SourceCoders.io\n\n* Location: Onsite in San Francisco or Denver or Remote for CST or EST based candidates \n\n* Compensation: $120K-$200K (heavily dependent on experience and work location)\n\n* Work visas accepted: US Citizen, Green Card, H-1B transfer, TN Visa\n\n\n\n\nQuizlet’s mission is to help students (and their teachers) practice and master whatever they are learning. Every month more than 50 million active learners from 130 countries practice and master more than 300 million study sets on every conceivable topic and subject. We are developing new learning experiences by modeling how students learn and drawing upon knowledge acquisition, retention, and pedagogy in cognitive science. We are always seeking to help students master any subject by optimizing study efficiency and engagement. Want to be a go-to person for site reliability on the most-used learning platform in the U.S.? Want to work on a service that is rapidly scaling and relied upon by millions of students and teachers worldwide?  Quizlet is an indispensable utility used daily by millions of students and teachers around the globe. If our site goes down, even just for a few minutes, the pain is felt intensely. Speed is crucial, and downtime is not an option as we grow — during the school year, we are in the top 20 most-visited websites in the U.S. These are challenges you will face on day one at Quizlet.\n\nWhat you'll do\n\n\n\n\n* Engage with service owners to improve the entire service lifecycle — from inception and design, through deployment, operation, maintenance, and sunset.\n\n* Help service owners drive their services through the service lifecycle through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.\n\n* Help service owners maintain their services once they are live by measuring and monitoring availability, latency, and overall system health.\n\n* Help scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity.\n\n* Practice and evangelize sustainable incident response and blameless postmortems.\n\n\n\n\n\n\nWhat we are looking for\n\n\n\n\n* Experience in designing, analyzing and troubleshooting distributed systems serving production traffic.\n\n* Experience with algorithmic thinking, data structures, and software complexity.\n\n* Experience in writing scripts in one or more languages such as Python or Go\n\n* Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.\n\n* Ability and desire to debug and optimize code and automate routine tasks.\n\n* Experience with on-call duty, know why it’s hard, work to improve it, and make it so well documented that every engineer wants to be on rotation.\n\n* {Passion|Interest|Experience} with automation of code testing and deployment through the use of containers.\n\n\n\n\n

See more jobs at Source Coders

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

flanksource


Kubernetes Site Reliability Engineer

Kubernetes Site Reliability Engineer


flanksource


golang

sys admin

engineer

admin

golang

sys admin

engineer

admin


👁 8,088 viewed | ✍️ 212 applied (3%)
This job post is archived and the position is probably filled. Please do not apply.
unknown, Unknown - flanksource is a niche consultancy focusing exclusively on Kubernetes and the Cloud Native ecosystem. We help companies navigate the CNCF landscape by evaluating and integrating technology into an infrastructure continuous delivery pipeline, tailored to each ...

See more jobs at flanksource

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Smile.io


Site Reliability Engineer


🌏 Worldwide

Site Reliability Engineer


Smile.io

🌏 Worldwide

site reliability

sys admin

engineer

admin

site reliability

sys admin

engineer

admin


👁 2,392 viewed | ✍️ 177 applied (7%)
This job post is archived and the position is probably filled. Please do not apply.
Smile is the largest provider of reward programs in the world. We reward tens of millions of people every year, and power rewards programs for thousands of businesses around the world. We’ve got big scaling plans in 2020 and beyond - we need amazing talent to achieve our goals and you get to be the one to find them and bring them to Smile!\n\nA little more about the Smile Development team:\n\nThe engineering team at Smile.io believes in being proud of your code, owning what you ship, and embracing new tools to increase developer happiness. We also like to focus on learning, architecture, and platform health. We ship early and often, using feature flags to get our code in the hands of end-users as soon as possible. We also rely on data, user research, and product feedback to make and shape important features and decisions.\n\nWe are remote-friendly, with engineers on our team working from home offices in Romania, Cyprus, New Orleans and more! Tell us where you'd like to work when you apply!\n\nWhat’s it like to work at Smile.io?\nWe are a team of smart self-starters who build efficient and unique solutions to problems. You’ll be working with some amazing talent and you'll constantly be pushed to challenge yourself and improve your skills. This starts in the interview process, where you’ll be asked to show us your skills in real-time. It’s not an easy process, but we think you’ll find it rewarding and a great preview to what working here is really like.\n\nAs a team, we’re driven by these core values:\n\nBe Humble - think of the team before thinking of yourself. We have no room for massive egos.\nBe Hungry - set hard goals, ask lots of questions and learn every day.\nBe Human - show empathy towards others, consider the impact of your decisions on other teams.\nWe collaborate on everything. Our communication tools and our space are designed with this in mind - from physical areas to connect in comfort to Slack channels of all sorts, we enable you to reach out to those around you to make sure you have the information you need to make great decisions.\n\nWe know that Smile.io as a business is in constant evolution - the same is true of our people. We’re here to support each other in our growth, so we talk openly about our career goals, hopes & dreams. With such a diverse team of people, we know we can offer you the mentorship, tools and encouragement you need to grow.\n\nWe believe that diverse teams perform better and that fostering an inclusive work environment is a key part of growing a successful business. We welcome people of diverse backgrounds, experiences and perspectives. We are an equal opportunity employer and are committed to work with applicants requesting accommodation at any stage of the hiring process.\n\n# Responsibilities\n Responsibilities\nBuild scalable systems, using best practices around automation, pushing changes that improve reliability and velocity\nSupport services before they go live through activities such as system design consulting, developing software platforms and frameworks, planning and reviews\nMaintain services once they are live by measuring and monitoring availability, latency and overall system health.\nProvide mentorship and training to other team members on technologies and processes; drive education and knowledge transfer of design patterns, technical practices, and relevant technologies and tools\nDrive high standards around incident response practices and policies\nGather requirements and make thoughtful tradeoffs to ensure we are focusing our efforts on the most impactful projects.\nWork on services and tools to proactively improve the quality and reliability of our production API.\nDebug production issues across services and multiple levels of the stack. Improve operational standards, tooling, and processes. \n\n# Requirements\nWhat we’re seeking:\n\nPrior experience 2-4 years improving the reliability of a SaaS product.\nPrior experience operationally supporting a SaaS product.\nPrior software development experience.\nTraits or experience we’d love to see (you don’t need them all to apply):\nHave prior experience supporting the reliability of and/or developing on an API-based product.\nCan think intuitively about systems and services and write high quality code. We work mostly in Ruby, with very occasional Elixir and Go.\nHold yourself and others to a high bar when working with production systems.\nHave experience authoring and operating high-scale services, as well as debugging complex systems.\nTake pride in working on projects to successful completion involving a wide variety of technologies and systems.\nThrive in a collaborative environment involving different stakeholders and subject matter experts.\nEnjoy working with a diverse group of people with different expertise. Comfortable in collaborating with other teams such as Sales and Support in sharing feedback from our customers\n\n#Location\n- 🌏 Worldwide

See more jobs at Smile.io

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Wikimedia Foundation


Site Reliability Engineering Manager

Site Reliability Engineering Manager


Wikimedia Foundation


exec

sys admin

engineer

admin

exec

sys admin

engineer

admin


👁 354 viewed | ✍️ 1 applied (0%)
This job post is archived and the position is probably filled. Please do not apply.
\nSummary\n\nWikimedia’s Site Reliability Engineering team is principally responsible for ensuring our global top-10 web site, our public facing services and underlying infrastructure are healthy and developing further in support of Wikimedia’s mission. The SRE team comprises over 30 creative and talented staff members that are globally distributed and organized into 6 teams each with their own scope and focus area. We are strengthening the team and looking for several Engineering Managers to help our staff and teams achieve our goals.\n\nAs an Engineering Manager, you will support engineers developing services and infrastructure, deploying and building new features, products, and services used by hundreds of millions of people around the world. This is an opportunity to do good while improving one of the best known sites in the world. \n\nYour Responsibilities:\n\n\n* Manage one to two globally distributed teams within Site Reliability Engineering\n\n* Recruit, hire, and help onboard new team members\n\n* Work with team members to set individual performance goals, and support them in meeting and evolving their goals and career path\n\n* Triage incoming workload, maintain focus on priorities, and set realistic expectations for both peers and team members\n\n* Coordinate and communicate with other members of the Wikimedia engineering teams on relevant projects, and contribute to the organizational strategy\n\n* Continuously develop the roadmap of the team in alignment with other SRE and Technology teams, and help draft and execute the team’s annual and quarterly plans\n\n* Project manage new and existing initiatives\n\n* Lead the definition, refinement, and execution of the processes through which the team manages and performs work.\n\n* Lead incident response, diagnosis, and follow-up on system alerts and outages across Wikimedia’s production infrastructure\n\n* Facilitate the definition and establishment of Service Level Indicators and Objectives with service owners and stakeholders\n\n* Share our values and work in accordance with them\n\n\n\n\nSkills & Experience:\n\n\n* Prior experience managing teams\n\n* Strong technical background, including 5+ years experience as part of an SRE, TechOps or software engineering team\n\n* Experience working with or applying one or more project management methodologies to site reliability engineering work\n\n* Aptitude for automation and streamlining of tasks\n\n* Communicate effectively in both spoken and written English\n\n* Ability to work independently, as an effective part of a globally distributed team\n\n* Willing and able to travel several times a year for occasional in-person meetings\n\n* B.S. or M.S. in Computer Science or the equivalent in related work experience\n\n\n\n\nAdditionally, we would love it if you have:\n\n\n* Experience working in a distributed, largely remote environment\n\n* Experience contributing to open source projects\n\n\n\n\nTeams\n\n\n* Service Operations: Build and improve our new Kubernetes based Deployment pipeline and help our teams, service owners and developers across the organization test and deploy our existing application platform as well as new applications/features.\n\n* Data Persistence: Store, query and protect the sum of all human knowledge! Work together with our engineers to ensure existing and new data needs are met in an efficient and reliable manner, using the most appropriate boring and exciting open source technologies: MySQL, Cassandra, OpenStack Swift, Ceph.\n\n* Observability: Work across SRE and Technology to provide teams with tools, platforms, and insights into how systems and services are performing. Leverage exciting technologies such as Prometheus, AlertManager, Grafana, Logstash, Kibana, Kafka and more. Research emerging tools, trends and methodologies and work with the open source community to contribute back that knowledge to the commons.\n\n\n\n\nThe Wikimedia Foundation is... \n\n...the nonprofit organization that hosts and operates Wikipedia and the other Wikimedia free knowledge projects. Our vision is a world in which every single human can freely share in the sum of all knowledge. We believe that everyone has the potential to contribute something to our shared knowledge, and that everyone should be able to access that knowledge, free of interference. We host the Wikimedia projects, build software experiences for reading, contributing, and sharing Wikimedia content, support the volunteer communities and partners who make Wikimedia possible, and advocate for policies that enable Wikimedia and free knowledge to thrive. The Wikimedia Foundation is a charitable, not-for-profit organization that relies on donations. We receive financial support from millions of individuals around the world, with an average donation of about $15. We also receive donations through institutional grants and gifts. The Wikimedia Foundation is a United States 501(c)(3) tax-exempt organization with offices in San Francisco, California, USA.\n\nThe Wikimedia Foundation is an equal opportunity employer, and we encourage people with a diverse range of backgrounds to apply.\n\nU.S. Benefits & Perks*\n\n\n* Fully paid medical, dental and vision coverage for employees and their eligible families (yes, fully paid premiums!)\n\n* The Wellness Program provides reimbursement for mind, body and soul activities such as fitness memberships, baby sitting, continuing education and much more\n\n* The 401(k) retirement plan offers matched contributions at 4% of annual salary\n\n* Flexible and generous time off - vacation, sick and volunteer days, plus 19 paid holidays - including the last week of the year.\n\n* Family friendly! 100% paid new parent leave for seven weeks plus an additional five weeks for pregnancy, flexible options to phase back in after leave, fully equipped lactation room.\n\n* For those emergency moments - long and short term disability, life insurance (2x salary) and an employee assistance program\n\n* Pre-tax savings plans for health care, child care, elder care, public transportation and parking expenses\n\n* Telecommuting and flexible work schedules available\n\n* Appropriate fuel for thinking and coding (aka, a pantry full of treats) and monthly massages to help staff relax\n\n* Great colleagues - diverse staff and contractors speaking dozens of languages from around the world, fantastic intellectual discourse, mission-driven and intensely passionate people\n\n\n\n\n*Eligible international workers' benefits are specific to their location and dependent on their employer of record\n\nMore information\n\nWMF\nBlog\nWikimedia 2030\nWikimedia Medium Term Plan\nDiversity and inclusion information for Wikimedia workers, by the numbers\nWikimania 2019\nAnnual Report - 2017 \nThis is Wikimedia Foundation \nFacts Matter\nOur Projects\nFundraising Report

See more jobs at Wikimedia Foundation

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

YouGov


Site Reliability Engineer

Site Reliability Engineer


YouGov


sys admin

engineer

admin

sys admin

engineer

admin


👁 1,634 viewed | ✍️ 160 applied (10%)
This job post is archived and the position is probably filled. Please do not apply.
\nRole: \n\nAs a Site Reliability Engineer at YouGov, you will join our talented individuals in being responsible for the delivery, optimization, resilience, and availability of high-value and high-transaction-rate services trusted and used by both the general public and some of the largest brands in the world. Site Reliability Engineering is a discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE's ensure that YouGov's internally critical and externally visible systems maintain the appropriate service levels (availability, latency, and reliability) to serve our customers' needs, and reduce the friction for managing change, while being strategic about capacity, and constantly managing performance. SRE is a mindset and a set of engineering approaches focusing on delivery of the appropriate architecture, building infrastructure, optimizing existing systems, and eliminating toil through automation.\n\nSREs have the acumen and experience to provide direct technical contributions to major projects both in code, and in building and optimizing the production environment. You will identify and solve critical problems and build automation to prevent their recurrence. You align with your peers across engineering, deliver subject matter expertise for the infrastructure within your product area, and draw on your strong communication skills to collaborate with your peers in other geographies. Your perspectives help foster and support successful delivery of reliability engineering, and you influence by way of metrics, data, and automation.\n\n Experience required: \n\n\n* 3+ years' work experience in a similar job role.\n\n* Design, develop, and implement supporting cloud services on the Kubernetes platform.\n\n* Proven application production support experience.\n\n* Strong analytical and problem-solving skills.\n\n* Passion for automating repetitive tasks.\n\n* Identify and solve critical problems and build automation to prevent their recurrence.\n\n* Develop clean, well-documented, testable code.\n\n* Work cross-functionally across developers, QA, and other teams.\n\n* Troubleshoot and resolve issues in both production and lower environments.\n\n* Participate in on-call rotation in support of critical products.\n\n* Proven software engineering experience Kubernetes / K8's / Docker.\n\n* Familiarity with running and scaling distributed software systems (load balancing, high availability, systems monitoring, etc.)\n\n* Experience administering and/or designing databases - SQL and NoSQL\n\n* Understanding of networking: TCP, UDP, firewalls, DNS, OSI layers, etc.\n\n* Experience with log analysis and monitoring tools such as Splunk, Logstash, New Relic, etc.\n\n* Establish Error Budgets for the products by monitoring SLIs, measuring SLOs and publishing them to a dashboard\n\n* Design, build and implement software features for the product that increase reliability, availability and performance\n\n* Own the pipeline of deployments to production, this includes establishing and maintaining the CI/CD pipeline for the product\n\n* Drive blameless post-mortems with the product team and use the Error Budget to establish priorities for any necessary changes\n\n* Have experience with Networking, Linux OS, Security, Data Persistence, Containers, AWS, etc.\n\n\n\n\nAny additional info:\n\nThis position is 100% remote, therefore having experience within a remote environment would be ideal.

See more jobs at YouGov

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Container Solutions


Site Reliability Engineer

Site Reliability Engineer


Container Solutions


sys admin

engineer

admin

sys admin

engineer

admin


👁 930 viewed | ✍️ 72 applied (8%)
This job post is archived and the position is probably filled. Please do not apply.
We are looking for Site Reliability Engineers to join our new SRE team. As part of the team you will be taking responsibility for availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of customer applications and infrastructure.\n\n\n\n\n\nWhat your day-to-day will look like:\n\n\n\n\n* Participate in your team's effort to continuously improve our customer's production environments\n\n* Own your team's tech and tools stack and contribute to the relevant Open Source projects\n\n* Design, analyse and troubleshoot large-scale distributed systems\n\n* Being part of your team's on-call rotation\n\n* Learn and share by being part of the Cloud Native community through blog post and conference talks\n\n* Automate almost all the things\n\n\n\n\n\n\n\n\n\n\nSkills and requirements:\n\n\n\n\n* Strong engineering OR operations background and the urge to master both disciplines\n\n* An analytical mind, debugging and problem solving skills\n\n* Strong written and spoken technical communication\n\n* Flexibility to learn about and work with different technical environments and teams\n\n\n\n\n\n\n\n\n\n\nBonus Points (we value curiosity and ability to learn over previous experience):\n\n\n\n\n* Strong understanding of the Kubernetes API, core principles and components\n\n* Strong knowledge of Linux networking and security related to containers\n\n* Production experience with at least one common CI/CD system\n\n* Production experience with at least one major cloud provider\n\n* Production experience with at least one modern infrastructure automation or configuration management system\n\n* Ability to contribute to polyglot code bases\n\n\n\n\n\n\n\n\nWe are building a remote first team across multiple time zones to allow a follow the sun on-call rotation.\nWe are not hiring job descriptions. We hire humans. :)\nWe welcome applications from everybody, regardless of ethnic or national origin, religion, gender identity, sexual orientation or age.

See more jobs at Container Solutions

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Vistas recruitment

 

Site Reliability Engineer Fully

Site Reliability Engineer Fully  


Vistas recruitment


golang

sys admin

engineer

admin

golang

sys admin

engineer

admin


👁 2,199 viewed | ✍️ 308 applied (14%)
This job post is archived and the position is probably filled. Please do not apply.
Remote (+/- 3hrs CET), Europe - Location: Remote (+/- 3hrs CET)Type: PermanentSalary: €60,000 - €100,000 Per AnnumWant to work remotely in a 'remote-first' culture? Want to significantly impact the daily lives of other engineers? ...in a well-funded, fast growing European start up...

See more jobs at Vistas recruitment

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

SunteckTTS


Scalability Reliability Engineer

Scalability Reliability Engineer


SunteckTTS


sys admin

engineer

admin

sys admin

engineer

admin


👁 1,807 viewed | ✍️ 147 applied (8%)
This job post is archived and the position is probably filled. Please do not apply.
**Requirements:**\n\n- 4+ Years of DevOps experience
\n- Docker Experience
\n- Experience with DevOps tools such as Ansible or Terraform
\n- Experience with AWS\n- Excellent command of source control (preferably GIT)\n- Excellent command of Linux (preferably Ubuntu)
\n- Excellent command of relational databases (preferably MySQL)
\n- Excellent written and verbal communication skills to communicate with Management, Development Team and Business Analysts in a collaborative environment. \n- You analyze every change in the context of the *much* bigger picture
\n- Experience in and desire to participate in mentoring, shaping the development process\n- Ability to embrace every opportunity to automate and enhance existing processes.\n- A desire to help continue the transformation of heavily used enterprise systems through the implementation of best practices, standards, and strategic refactoring\n- Demonstrated aptitude for estimation, long term planning, and self direction\n- Enjoy working in a collaborative small team environment\n- Strong sense of task ownership \n\n**Nice to Haves:**\n\n- Experience with PHP\n- Command of object oriented programming concepts and coding best practices\n- Experience with Symfony\n- Experience with Golang\n- Experience with Ruby on Rails\n- Experience with Agile Development\n- Experience refactoring production enterprise systems \n\n\n**About the Role:**\n\nWe are looking for an outstanding Site Reliability Engineer who can lead our infrastructure efforts on the Application Development team. This individual must embrace every opportunity to automate and enhance existing processes of all our application instances. This individual will provide leadership and must have the communication skills to influence management in initiatives that will help provide business value. The ideal candidate is process-oriented, and always trying to expand their skills to include new tools and approaches. \n\nOn any given day this individual will be spinning up new instances of an app, working on Disaster Recovery planning and implementation, building out all infrastructure using programing languages that allow for easily repeatable deployment of all infrastructure. They will provide support to technical resources and development, work on root cause analysis and prevention. \n\nThis individual will assist us in Logging, Auditing, Metrics, Alerting, Backups, Queing, Clustering, Documentation and other Dev Operation initiatives to assist the organization in adding business value where appropriate. Keeps production systems working and ensures top level performance for all systems. \nEvaluates growth and is forward thinking to ensure proper scalability of all systems. \n\n**About SunteckTTS and the Team:**\n\nSunteckTTS is an industry-leading, full-service, transportation logistics provider that operates through a network of sales, operations, and capacity specialists. We are a billion dollar company with over 200 independently owned and operated agent offices across the U.S. and Canada.\nWe focus on providing asset and non-asset surface transportation to a wide range of end-use markets, including food, lumber, paper, printing, textiles, electronics, machinery, government, and more. \n\nThe Application Development Group is in the midst of an ambitious program to create best in breed software products and improve existing systems for our user base. Our team of Business Analysts and Developers share a culture of self-ownership and open communication to provide\nmaximum business value. We strive to ensure team-based component ownership, development, and quality assurance. We are a tight knit group that relies on ability and flexibility to ensure we balance our strides towards improved process and need to meet the priorities of a heavily used\nenterprise system. The Application Development Group focuses on a culture of open communication, learning, and transparency.\n\nSunteckTTS is an Equal Opportunity Employer and does not discriminate against qualified applicants with regard to race, color, religion, age, sex, national origin, handicap, sexual orientation or veteran status.

See more jobs at SunteckTTS

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

ShareStream


Site Reliability Engineer

Site Reliability Engineer


ShareStream


sys admin

engineer

admin

sys admin

engineer

admin


👁 1,520 viewed | ✍️ 118 applied (8%)
This job post is archived and the position is probably filled. Please do not apply.
\nAbout us:\n\nShareStream Education is a leader in online video and media management solutions for academic institutions. Our team is passionate about building a great product that is continually evolving and providing a service that allows our customers to realize the vast potential of streaming media for education.\n\nShareStream Education is deeply committed to achieving client successes and building strong relationships with the Company’s clients, whom we regard as our partners.  \n\nJoin us and contribute to changing the way online education takes place through the use of streaming media!\n\nThe Site Reliability will work remotely. ShareStream Education will not accept resumes from recruiters for this position.\n\nResponsibilities:\n\nShareStream is seeking a multitalented, dedicated Site Reliability Engineer who excels at automating engineering operations and building high-availability and fault-tolerant systems. The Site Reliability Engineer will:\n\n\n* Enhance and operate the continuous integration and continuous delivery (CI/CD) pipeline for multiple applications\n\n* Operate the Kubernetes platform and perform day-to-day monitoring and maintenance\n\n* Automate upgrades, scaling, and other operational needs as required\n\n* Deploy new releases across multiple SaaS customers\n\n* Implement and operate a central logging solution as well as a central metrics solution\n\n* Develop operational playbooks and dashboards to monitor production SaaS environments\n\n* Contribute to managing AWS cost and resource usage\n\n* Work with the Engineering team to implement new technologies, including Istio, CephFS, ElasticSearch, and InfluxDB\n\n\n\n\nRequirements:\n\n\n* BS and/or MS degree in Computer Science or a related degree\n\n* Extensive experience building and operating distributed systems in Amazon Web Services (AWS)\n\n* Expert-level Linux skills (CentOS and Ubuntu)\n\n* Extensive experience with container-based software development and management using Docker and Kubernetes\n\n* Extensive experience with Jenkins\n\n* Extensive experience with Ansible, Chef, or Puppet\n\n* Expert in at least one scripting language, preferably Bash or Python\n\n* Intermediate-level software-development skills using Java or another object-oriented programming language is a strong plus\n\n* Experience managing backups and participating in disaster-recovery planning and testing is a strong plus\n\n\n\n\nExperience working in a fast-moving startup environment is a strong plus

See more jobs at ShareStream

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Theorem


Experienced Site Reliability Engineer

Experienced Site Reliability Engineer


Theorem


sys admin

engineer

admin

sys admin

engineer

admin


👁 1,701 viewed | ✍️ 119 applied (7%)
This job post is archived and the position is probably filled. Please do not apply.
\nTheorem is a software consultancy that believes in simplicity in software design. We deliver solutions for startups and enterprises. You can see our portfolio to learn more about the results we've delivered for our clients.\n\nWe are a remote first company with offices in Los Angeles and New York, and team members all around the world.\nCandidates located within UTC + 1 to UTC - 8 will be given priority for team time zone alignment. Team members are expected to align a portion of their day with Pacific Timezone.\n\nJob Duties:\n\n\n* Mentor and teach SRE best practices, internally and with our customers.\n\n* Build and maintain high-availability systems.\n\n* Identify improvement opportunities on existing systems, build plans and execute improvements.\n\n* Ensure our clients and their users have the best and fastest experience possible.\n\n* Participate in code and design reviews, teaching and learning from other engineers.\n\n* Plan, estimate and prioritize work in a collaborative and distributed team.\n\n* Potentially travel to spend time with clients.\n\n\n\n\nJob Requirements:\n\n\n* Familiar with Python, C# or Ruby, and at least one other programming language.\n\n* Experience with Infrastructure as Code and Configuration Management tools.\n\n* Experience with alerting and monitoring tools.\n\n* Experience working in a highly distributed company.\n\n* Be open minded and always learning.\n\n* Experience with the following tools are preferred, but not necessarily required:\n\n\n\n* Terraform\n\n* CloudFormation\n\n* Chef\n\n* Docker + Kubernetes\n\n* Prometheus + Grafana\n\n* Elasticsearch + Logstash + Kibana\n\n* Splunk\n\n* Jenkins\n\n\n\n\n

See more jobs at Theorem

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Eidosmedia


Managed Service System Engineer

Managed Service System Engineer


Eidosmedia


sys admin

engineer

admin

sys admin

engineer

admin


👁 1,494 viewed | ✍️ 112 applied (7%)
This job post is archived and the position is probably filled. Please do not apply.
\nJob description\n\n\nEidosmedia is looking for a  Managed Services System Engineer who will be responsible for addressing customer service requests incidents, follow-up analysis and troubleshooting Eidosmedia’s proprietary content management solution, Méthode.\n\nThe team\n\nThis is a great opportunity to join a dynamic team of very talented people responsible for shaping the future of content management systems.\n\nWe are global leader in content management applied to complex, real-time scenarios. We’re deeply passionate about technology to keep our solutions at the leading edge. Today we’re on a mission to evolve a product suite leader in the Media industry and conquer other solid verticals like Finance and Government.\n\nYou will join our Systems and Operations Team based in New York and supporting Eidosmedia US Customers reporting directly to the System and Operations Manager and collaborating with colleagues all across the globe.\n\nWhat you will learn\n\nAs a Managed Services Systems Engineer will participate in creating maintenance plans, implement solutions, integrations and maintaining documentation for all US customers\n\nYou will assure activities are compliant with Managed Services SLA’s as it pertains to the customer’s subscribed services. This includes but is not limited to proactive system monitoring, system software upgrades, application software upgrades, patching.\n\nAdditional duties include general system administration tasks and participation of after-hours maintenance and activities and/or on call as needed.\n\nWhat we are looking for\n\n\n* Computer Engineering degree, or equivalent\n\n* Deep knowledge of UNIX / LINUX operating systems.\n\n* Good knowledge of scripting languages (Python, Ruby, Bash, Perl)\n\n* Good knowledge of high availability systems (Veritas/RHCS) and storage systems (NetApp, EMC)\n\n* Good knowledge of web and application servers (Tomcat)\n\n* Experience in using virtualization platforms (VMWare, XEN, etc.)\n\n* Experience in managing systems in complex cloud environments (AWS, GCP, MS Azure, etc)\n\n* Ability to troubleshoot and diagnose problems and to perform log analysis\n\n\n\n\nIt’s great (but not necessary) if you also have\n\nITIL or Agile/SCRUM qualified\n\nWork Authorization\n\nCandidate must be authorized to work in the U.S.

See more jobs at Eidosmedia

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Heetch


Senior Site Reliability Engineer

Senior Site Reliability Engineer


Heetch


senior

sys admin

engineer

admin

senior

sys admin

engineer

admin


👁 221 viewed | ✍️ 1 applied (0%)
This job post is archived and the position is probably filled. Please do not apply.
\nImportant note before applying :\n\nWe're a young company iterating over our remote culture so for now, we're only working with people in locations where the time zone is:\n\n-1 hour > Paris time zone < +1 hour\n\n\nSRE Team @Heetch\n\nOur infrastructure receives millions of events per day and processes millions of API requests. We also serve over a dozen thousands rides daily.\n\nBy joining the team, you'll be helping building its technical vision and creating the best platform to run our services at scale. You'll be joining a growing team collaborating with the rest of the development organization.\n\nWe work day-to-day on automation in order to ensure reliability, scalability and velocity at Heetch. Our infrastructure is growing on a daily basis, with more than 160 micro services owned by 16 different teams and counting. One of our challenges is to provide help (design consulting, capacity planning, incident management, service development, monitoring, etc.) to other teams in order to spread common and best practices. We also develop, put in production and maintain services in order to ensure a maximum of independence and ownership to other teams.\n\nOur team's values\n\n\n* Move smart: We are rel="nofollow"> Engineering Blog and follow our twitter :) You can also have a look at our open-source projects and contributions here\n\n\n

See more jobs at Heetch

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Packet Fabric


Network Reliability Engineer

Network Reliability Engineer


Packet Fabric


sys admin

engineer

admin

sys admin

engineer

admin


👁 1,926 viewed | ✍️ 93 applied (5%)
This job post is archived and the position is probably filled. Please do not apply.
\nJob Description\n\nAs a network reliability engineer, you should definitely be the type that appreciates diversity in your day, and challenges outside of your comfort level! A typical day in the life of a PacketFabric network reliability engineer might include these types of activities:\n\n\n* Work with the network architects to automate router provisioning and upgrades across thousands of network devices, taking into account all sorts of annoying things and edge cases\n\n* Develop tools for network capacity planning, by working closely with network engineering, infrastructure, and procurement\n\n* Work on streamlining the maintenance and outage process, by getting things like many many ugly vendor emails into an orderly database\n\n* Write API’s and tools to manage and maintain the network overall\n\n* Research and implement additional ideas you may have to improve the product/platform\n\n\n\n\nSkills & Requirements\n\nThe right candidates will have an extreme abundance of hard core programming skills and be extremely well versed in various network protocols and network equipment. They will be comfortable handling orchestration tools and dealing with frustrating large data sets. You will also know how to sacrifice algorithm elegance, for getting it done on deadline, and know when it is time to refactor some code to improve latency in various situations. You don't even need to be reminded of safe/secure programming practices, because things as simple as session security are inherent to your nature. More specifics include:\n\n\n* A ridiculous amount of experience working in network environments to automate tasks, or other complex environments, such as industrial equipment\n\n* Second nature when working in a LAMP stack environment, armed with a command line\n\n* Loads of experience with Python, and solid OO programming paradigms\n\n* Experience with orchestration tools like Ansible and Jenkins\n\n* Good familiarity with basic network protocols including MPLS and BGP.\n\n* Good familiarity with various Layer 2 interconnect technologies, including but not limited to L2VPN and EVPN/VXLAN.\n\n* Previous work with netconf interactions to Cisco and Juniper hardware, other router APIs, open source configuration tools, or writing your own scripts for configuration\n\n* A huge plus for previous work in large scale networks\n\n* Never being afraid to venture boldly where none have gone before, and develop code where there are no previous libraries to draw from\n\n\n\n\nPreferred Experience\n\n\n* A huge plus if you have experience with network traffic modeling.\n\n\n

See more jobs at Packet Fabric

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

NetData

 

Senior Site Reliability Devops Engineer

Senior Site Reliability Devops Engineer  


NetData


devops

senior

sys admin

engineer

devops

senior

sys admin

engineer


👁 1,539 viewed | ✍️ 174 applied (11%)
This job post is archived and the position is probably filled. Please do not apply.
\nNetdata is looking for Senior Site Reliability / DevOps Engineers proficient in CI/CD methodologies, coupled with strong experience in software written in Javascript, Go, C, Python or other scripting languages, to join our distributed (remote) engineering team. \n\nAs a Senior SRE/DevOps engineer you will focus on supporting our netdata cloud offerings, augmenting our existing development infrastructure by implementing the automations necessary to catalyze further development of both our open-source project and our commercial offerings and last, but certainly not least, participating in the development of Netdata by making sure it's a first class citizen in various operating environments (e.g. orchestrated containers, IoT devices etc.)\n\nYour work will include building CI/CD pipelines, packaging, installation facilities and operational processes as well as developing custom solutions for our various teams and systems. As a Netdata SRE/DevOps engineer you will also be assisting engineers across our company, enabling them to provide world-class solutions for numerous platforms; as well as our community, open-source contributors and team-members with your deep knowledge of systems and troubleshooting skills.\n\nResponsibilities\n\n\n* Develop our automated CI/CD, packaging, deployment and execution environment infrastructure.\n\n* Develop automation tools to catalyze existing development or operational processes.\n\n* Evaluate, architect and develop technology options for our infrastructure and systems.\n\n* Troubleshoot, maintain, enhance and augment our platform; candidates will be expected to participate in an on-call rota.\n\n* Automate tasks wherever possible.\n\n* Stay up-to-date on emerging technologies.\n\n\n

See more jobs at NetData

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Luna


Senior Type System Engineer

Senior Type System Engineer


Luna


sys admin

senior

engineer

admin

sys admin

senior

engineer

admin


👁 1,491 viewed | ✍️ 86 applied (6%)
This job post is archived and the position is probably filled. Please do not apply.
\nSenior Type-System Engineer\nLuna is looking for a senior type-system engineer to help build the next generation interpreter and runtime for Luna, a project said by Singularity University to have the potential to change the lives of one-billion people. If you have strong technical skills and a passion for all things compiler, then this role could be the one for you.\n\nAs a type-system engineer you'll work as part of the compiler team to design and implement Luna's new type system, including its underlying theory, type-checker, and inference engine. This wok is _intrinsic_ to Luna's evolution, and will provide you with the opportunity to collaborate with a world-class team of engineers, community managers, and business developers (with experience at Bloomberg, GitHub, PayPal, to name a few), making your mark on Luna's future.\n\nWhat You'll Do\nAs a senior type-system engineer, you'll be working on the design and development of Luna's new type-system, in conjunction with the rest of the compiler team, to help support the language's evolution. This will involve:\n\n\n* Determining and formalising the theoretical underpinnings of the new type system in a way as to ensure its soundness.\n\n* Both theoretical and practical treatments of the theory behind Luna's type system.\n\n* Working with the broader compiler team to implement the type-checking and type-inference engines as part of the greater interpreter.\n\n* Using the type-system's information to improve the interpreter's functionality and performance, as well as how it interacts with the users.\n\n\n\n\nThe Skills We're Looking For\nWe have a few particular skills that we're looking for in this role:\n\n\n* Practical and rich experience writing code in a functional programming language such as Haskell or Scala, including experience with type-level programming techniques (3+ years).\n\n* Experience working with the theory behind powerful type systems, including row types, type-checking and type-inference algorithms, and dependently-typed systems.\n\n* Practical experience building real-world type-systems, including facilities for both type-checking and inference.\n\n* An awareness of the UX impacts of type-systems, and a willingness to minimise their often-intrusive nature.\n\n* Practical experience in building large and complex software systems.\n\n\n\n\nIt would be a big bonus if you had:\n\n\n* Experience writing Java and Scala code, as these will be used to implement the type-system.\n\n* Experience in writing comprehensive regression tests for both type-inference and type-checking systems.\n\n\n\n\nAvoid the confidence gap. You don't have to match all of the skills above to apply!\n\nWho You'll Work With\nYou'll be joining a distributed, multi-disciplinary team that includes people with skills spanning from compiler development to data-science. Though you'll have your area to work on, our internal culture is one of collaboration and communication, and input is always welcomed.\n\nWe firmly believe that only by working together, rather than putting our team members in their own boxes, can we create the best version of Luna that can be.\n\nThe Details\nAs part of the Luna team you'd be able to work from anywhere, whether that be at home, or on the go! We have team members distributed across the world, from San Francisco, to London, to Kraków. We welcome remote work and flexible schedules, or you can work from the Kraków office (or our planned SF office) if you'd like. We can provide competitive compensation and holiday, as well as the possibility\nof equity as time goes on.\n\nHow To Apply?\nSend us an email at [email protected], and tell us a little bit about yourself and why you think you'd be a good fit for the role! You can also tell us about:\n\n\n* Some of your past work or projects.\n\n* Why you'd like to work on Luna, and where you imagine Luna being in 5 years.\n\n* The most important features of a team that you'd like to work in.\n\n* Whether you take pride in your ability to communicate clearly and efficiently with your team.\n\n\n

See more jobs at Luna

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

UNOPS


System Administrator Ops Engineer

System Administrator Ops Engineer


UNOPS


sys admin

admin

engineer

sys admin

admin

engineer


👁 1,923 viewed | ✍️ 172 applied (9%)
This job post is archived and the position is probably filled. Please do not apply.
\nBackground Information - Job-specific\n\nThe Office for the Coordination of Humanitarian Affairs (OCHA) is seeking a part time Ops Engineer / System Administrator. OCHA currently maintains multiple web-based platforms that are used in humanitarian response. Although some of our brands have been used for over a decade, ongoing investment has kept our technology modern and relevant.\nThe successful candidate will work with our web-based development teams to maintain and grow the reliable infrastructure that is used by hundreds of thousands of humanitarian responders, supporting staff, and volunteers during sudden onset and complex emergencies. \n\nThe successful candidate must be based in the Americas’ time zones and provide support for four hours between 17:00 GMT and 22:00 GMT\n\nFunctional Responsibilities\n\nThe UN OCHA team operates as a collaboration between developers and operations (“DevOps”). You will work collaboratively with developers to ensure an excellent developer experience; site manager experience; and site visitor experience.\n\n\n* Support the process of planning, creating, and documenting scalable, secure, and highly available infrastructure for internal development teams.\n\n* Review and update documentation, diagrams, and training documents which outline best practices and standard operating procedures (“runbooks”) to empower developers to leverage existing infrastructure.\n\n* Contribute to the creation of automated and repeatable deployment systems including appropriate development, staging and production environments.\n\n* Collaborate with developers to help design appropriate requirements for systems, network, and application architecture\n\n* Contribute to the creation of disaster recovery plans including the development of incident response plans; if necessary, work with teams to implement plans in emergency situations\n\n* Support and maintain strong relationships with the engineering team and project managers\n\n* Be actively contactable through assigned working times and reachable, at a general knowledgeable level,in the case of a major emergency. Expectations are that you will rely on another part-term system     administrator as well the OCHA developers to ensure basic 24 hours coverage.\n\n\n\n\nContract type, level and duration\n\n\n* Contract type: ICA\n\n* Contract level: IICA-1/ICSC-8\n\n* Contract duration: 3 months with possible extension\n\n\n\n\nFor more details about the ICA contractual modality, please follow this link:\nhttps://www.unops.org/english/Opportunities/job-opportunities/what-we-offer/Pages/Individual-Contractor-Agreements.aspx \n\nEducation/Experience/Language requirements\n\nEducation:\n\n\n* A Bachelor’s degree in related fields is required. A Master’s degree in related field can substitute two years of the required relevant experience. \n\n* Secondary education with additional four (4) years of relevant experience may be accepted in lieu of a bachelor’s degree. A relevant technical certificate with additional experience may also be accepted. \n\n\n\n\n\nRequired Qualifications\n\n\n* At least 2 years of relevant experience is required.\n\n* Experience in a collaborative (“DevOps”) environment is critical. Developers will share your responsibility of ensuring app stability on a stable infrastructure.\n\n* Production experience with a multi-server, load balanced environment. Preference given to candidates with this experience in a Cloud infrastructure (AWS, Google Cloud, Azure, or equivalent).\n\n* Production experience with a configuration management tool deployed in a multi-server, load-balanced environment. Preference given to Ansible; experience with Chef, Puppet, or equivalent will also be considered.\n\n* Ability to clearly communicate technical information and with consideration and compassion for the technical abilities of the people they are communicating with\n\n\n\n\nDesired Skills\n\n\n* Production experience supporting or building web applications in at least one of: PHP, Python, Node.js.\n\n* Production experience with database optimisations for performance and availability. Preference given to MySQL / MariaDB; experience with Postgres, or equivalent will also be considered.\n\n* Production experience with build automation tools such as Jenkins or Bamboo\n\n* Demonstrable experience with Git, GitHub, GitLab, and/or Bitbucket\n\n* Demonstrable experience with shell scripting\n\n* Production experience with system administration for large scale Drupal deployments.\n\n* Experience with the components of a LAMP (Linux, Apache, MySQL, PHP) or LEMP (Linux, NGINX, MySQL, PHP) stack.\n\n* RHCE certification or experience with general administration of Debian-based Linux distributions.\n\n* Experience with configuring logging and monitoring tools; and taking action based on the reported alerts (ELK stack, Nagios, and equivalent tools)\n\n* Experience building, installing, and maintaining Linux-based packages for software deployment (RPM or .deb)\n\n* Experience with data migration projects.\n\n* Experience with multi-server hosting environments (N-Tier, sharding, scaling, failover)\n\n* Experience with caching & high availability configuration techniques\n\n* Experience building and deploying Docker containers, and Docker-housed applications\n\n* Ability to accurately explain and document software development and release processes\n\n\n\n\nLanguage Skills\n\n\n* Working-level English; with excellent written communication skills (this is a remote position, so most communications will be in writing via text-based chat; or email).\n\n\n\n\nCompetencies\n\n\n\n\nTreats all individuals with respect; responds sensitively to differences and encourages others to do the same. Upholds organizational and ethical norms. Maintains high standards of trustworthiness. Role model for diversity and inclusion.\n\n\n\n\n\n\n\n\n\nActs as a positive role model contributing to the team spirit. Collaborates and supports the development of others. For people managers only: Acts as positive leadership role model, motivates, directs and inspires others to succeed, utilizing appropriate leadership styles.\n\n\n\n\n\n\n\nDemonstrates understanding of the impact of own role on all partners and always puts the end beneficiary first. Builds and maintains strong external relationships and is a competent partner for others (if relevant to the role).\n\n\n\n\n\n\n\nEfficiently establishes an appropriate course of action for self and/or others to accomplish a goal. Actions lead to total task accomplishment through concern for quality in all areas. Sees opportunities and takes the initiative to act on them. Understands that responsible use of resources maximizes our impact on our beneficiaries.\n\n\n\n\n\n\n\nOpen to change and flexible in a fast paced environment. Effectively adapts own approach to suit changing circumstances or requirements. Reflects on experiences and modifies own behavior. Performance is consistent, even under pressure. Always pursues continuous improvements.\n\n\n\n\n\n\n\nEvaluates data and courses of action to reach logical, pragmatic decisions. Takes an unbiased, rational approach with calculated risks. Applies innovation and creativity to problem-solving.\n\n\n\n\n\n\n\nExpresses ideas or facts in a clear, concise and open manner. Communication indicates a consideration for the feelings and needs of others. Actively listens and proactively shares knowledge. Handles conflict effectively, by overcoming differences of opinion and finding common ground.\n\n\n\n\n\nAdditional Considerations\n\n\n* Please note that the closing date is midnight Copenhagen time\n\n* Applications received after the closing date will not be considered.\n\n* Only those candidates that are short-listed for interviews will be notified.\n\n* Qualified female candidates are strongly encouraged to apply.\n\n* The incumbent is responsible to abide by security policies, administrative instructions, plans and procedures of the UN Security Management System and that of UNOPS.  \n\n\n\n\nIt is the policy of UNOPS to conduct background checks on all potential recruits/interns. \nRecruitment/internship in UNOPS is contingent on the results of such checks.\n\nBackground Information - UNOPS\n\nUNOPS is an operational arm of the United Nations, supporting the successful implementation of its partners’ peacebuilding, humanitarian and development projects around the world. Our mission is to help people build better lives and countries achieve sustainable development.\n\nUNOPS areas of expertise cover infrastructure, procurement, project management, financial management and human resources.\n\nWorking with us\n\nUNOPS offers short- and long-term work opportunities in diverse and challenging environments across the globe. We are looking for creative, results-focused professionals with skills in a range of disciplines.\n\nDiversity\n\nWith over 4,000 UNOPS personnel and approximately 7,000 personnel recruited on behalf of UNOPS partners spread across 80 countries, our workforce represents a wide range of nationalities and cultures. We promote a balanced, diverse workforce — a strength that helps us better understand and address our partners’ needs, and continually strive to improve our gender balance through initiatives and policies that encourage recruitment of qualified female candidates.\n\nWork life harmonization\n\nUNOPS values its people and recognizes the importance of balancing professional and personal demands.\n\nSustainable Development Cluster (SDC)\n\nBased in New York, the Sustainable Development Cluster (SDC) supports diverse partners with their peacebuilding, humanitarian and development operations.\n\nThe SDC’s services include grants management, development and special initiatives support, and technology support to the UN and UN agencies.\n\nThe SDC is part of the New York Service Cluster that supports the United Nations Secretariat, as well as a broadening community of other New York-based United Nations organizations, bilateral and multilateral partners in the delivery of UNOPS mandate in project management, infrastructure management, and procurement management.

See more jobs at UNOPS

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Zapier


Site Reliability Engineer

Site Reliability Engineer


Zapier


sys admin

engineer

admin

sys admin

engineer

admin


👁 2,134 viewed | ✍️ 143 applied (7%)
This job post is archived and the position is probably filled. Please do not apply.
\nHi there!\n\nWe're looking for a Site Reliability Engineer to join the SRE team at Zapier. Zapier’s on a mission to make everyone more productive at work. Over 3 million professionals already use Zapier to save more time, but there are millions more to reach. As a member of the Zapier SRE team, you’ll build systems to allow us to scale task processing for millions of customers and enable engineering teams to ship hundreds of times a day without fear! \n\nIf you’re interested in launching your career at a fast-growing and profitable startup, then read on… \n\nWe know applying for and taking on a new job at any company requires a leap of faith. We want you to feel comfortable and excited to apply at Zapier. To help share a bit more about life at Zapier, here are a few resources in addition to the job description that can give you an inside look at what life is like at Zapier. Hopefully, you'll take the leap of faith and apply.\n\n\n* Our Commitment to Applicants\n\n* Culture and Values at Zapier\n\n* Zapier Guide to Remote Work\n\n* Zapier Code of Conduct\n\n* Diversity and Inclusivity at Zapier\n\n\n\n\nZapier is proud to be an equal opportunity workplace dedicated to pursuing and hiring a diverse workforce.\n\nAbout You\n\nYou’re a skilled Site Reliability Engineer. We’re looking for 3-5 years of experience in practicing infrastructure as code with Ansible and Terraform, working with Kubernetes clusters and implementing high availability / resilience patterns at scale. \n\nYou’re an excellent written communicator. We’re also a 100% remote team, and writing is our primary means of communication at Zapier. \n\nYou’re creative and resourceful. You try as many angles as possible to secure positive press mentions for clients and companies. You keep an eye out for media, speaking, and award opportunities as they arise and keep up on industry trends. \n\nYou believe relationships are critical to success. You have relationships with business and tech media, you have experience working with customers or partners, and you’re able to quickly build trust with cross-functional teams and external agencies.  \n\nYou’re solid at time management. You’ll juggle a variety fast-moving communications projects, and as a part of a distributed team, you’ll be trusted to work with minimal supervision. As a part of a growing company, you have an opportunity to make a big impact, and you’re keen to build processes that’ll make your job more efficient over time.\n\nThings You’ll Do\n\n\n* Infrastructure automation on EC2 with ansible, packer and terraform\n\n* Enable teams to ship faster by providing self-service tooling to work effectively with Kubernetes\n\n* Implement high availability / resilience patterns to handle surges in traffic and task processing load\n\n* Level up our continuous delivery to continue to enable engineering teams to ship features hundreds of times a day without fail\n\n* Have an enormous impact working closely with teams across the organization\n\n* As a part of our All Hands Support initiative, help customers have the best possible experience with Zapier\n\n\n\n\nAbout Zapier\n\nZapier has been helping people across the world automate the boring and tedious parts of their job. We do that by helping everyone connect the web applications they already use and love.\n\nWe believe that there are jobs a computer is best at doing and that there are jobs a human is best at doing. We want to empower businesses to create processes and systems that let computers do what they are best at doing and let humans do what they are best at doing.\n\nWe believe that with the right tools, you can have big impact with less hassle.\n\nWe believe in small teams. Small teams are fast and nimble. Small teams mean less bureaucracy and less management and more getting things done.\n\nWe believe in a safe, welcoming, and inclusive environment. All teammates at Zapier agree to a code of conduct.\n\nThe Whole Package\n\nLocation: Planet Earth\n\nOur distributed environment lets us work with the best people. You don't have to be located in the USA either. Some team members live in the United Kingdom, Thailand, India, Nigeria, Taiwan, Guatemala, New Zealand, Australia, and more! You just need the skills and drive to succeed in this role and the ability to work from anywhere.\n\nCompensation:\n\n\n* Competitive salary (we don't use remote as an excuse to pay less)\n\n* Great healthcare + dental + vision coverage*\n\n* Retirement plan with 4% company match*\n\n* Profit-sharing\n\n* 2 annual company retreats to awesome places\n\n* 14 weeks paid leave for new parents of biological or adopted children\n\n* Pick your own equipment. We'll set you up with whatever Apple laptop + monitor combo you want plus any software you need.\n\n* Unlimited vacation policy. Plus we require you to take at least 2 weeks off each year. We see most employees take 4-5 weeks off per year. This isn't a vague policy where unlimited vacation means no vacation.\n\n* Work with awesome companies around the world. We partner with great software companies all over the world and you'll constantly get to interact with people from these great companies\n\n\n\n\n*While we take care of our international folks as best we can, currently, healthcare and retirement plans are only available to US-based employees.\n\nHow to ApplyWe have a non-standard application process. To jump-start the process we ask a few questions we normally would ask at the start of an interview. This helps speed up the process and lets us get to know you a bit better right out of the gate. Please make sure to answer each question. After you apply, you are going to hear back from us, even if we don't seem like a good fit. In fact, throughout the process, we strive to make sure you never go more than seven days without hearing from us.\n\nZapier is an equal opportunity employer. We're excited to work with talented and empathetic people no matter their race, color, gender, sexual orientation, religion, national origin, physical or mental disability, or age. Our code of conduct provides a beacon for the kind of company we strive to be, and we celebrate our differences because those differences are what allow us to make a product that serves a global user base.

See more jobs at Zapier

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.


👁 7,159 viewed | ✍️ 1,375 applied (19%)
This job post is archived and the position is probably filled. Please do not apply.
Doximity is transforming the healthcare industry. Our mission is to help doctors be more productive, informed, and connected. As a software engineer, you'll work within cross-functional delivery teams alongside other engineers, designers, and product managers in building software to help improve healthcare.  \n\nOur team brings a diverse set of technical and cultural backgrounds and we like to think pragmatically in choosing the tools most appropriate for the job at hand.\n\n**About Us**\n* Here are some of the ways [we bring value to doctors](https://drive.google.com/file/d/1qimYh0mG3i1nTJe6jDCDepJt2i4o8MEB/view)\n* Our web applications are built primarily using Ruby, Rails, Javascript (Vue.js), and a bit of Golang\n* Our data engineering stack run on Python, MySQL, Spark, and Airflow\n* Our production application stack is hosted on AWS and we deploy to production on average 50 times per day\n* We have over 350 private repositories in Github containing our applications, forks of gems, our own internal gems, and [open-source projects](https://github.com/doximity)\n* We have worked as a distributed team for a long time; we're [currently about 65% distributed](https://blog.brunomiranda.com/building-a-distributed-engineering-team-85d281b9b1c)\n* Find out more information on the [Doximity engineering blog](https://engineering.doximity.com/)\n* Our [company core values](https://work.doximity.com/)\n* Our [recruiting process](https://engineering.doximity.com/articles/engineering-recruitment-process-doximity)\n* Our [product development cycle](https://engineering.doximity.com/articles/mofo-driven-product-development)\n* Our [on-boarding & mentorship process](https://engineering.doximity.com/articles/software-engineering-on-boarding-at-doximity)\n\n**Here's How You Will Make an Impact**\n* Improve the performance and scalability of services, optimize our REST and GraphQL APIs\n* Address security concerns and proficiently maintain our application stack\n* Troubleshoot issues across the whole stack, such as high-load, memory full, network issues and come up with temporary/long term solutions based on the root cause\n* Hands-on maintenance on our Ruby on Rails and Go (Golang) applications\n* Increase our automated test coverage and deployment infrastructure robustness \n* Manage infrastructure using Chef and Terraform\n* Active involvement in design, implementation, and maintenance of the development, staging, and production infrastructure and services your team is responsible for\n* Create concise postmortems in the event of an outage\n* Write and maintain run-books for other engineers to leverage\n* Ensure proper security, monitoring, alerting, and reporting for the applications your team is responsible for\n* Collaborate with other engineers to make sound infrastructure decisions, improve workflow, and deploy applications ready for production\n* Monitor capacity, cost and plan for upgrades\n* Participate in an on-call rotation\n\n**About you**\n* You are a Ruby engineer at heart, very familiar and passionate about the Rails ecosystem\n* You are knowledgeable of memory and CPU profiling tools to help adjust Ruby jobs and processes to use resources effectively\n* You have experience working with Terraform and Chef (or similar tooling) either in a DevOps or product support capacity\n* You have experience deploying, configuring, and maintaining NGINX\n* You are proficient with Unix, AWS, and Git\n* You are self-motivated and able to manage yourself and your own queue\n* You are a problem solver with a passion for simple, clean, and maintainable solutions\n* You agree that concise and effective written and verbal communication is a must for a successful team\n* You are able to maintain a minimum of 5 hours overlap with 9:30 to 5:30 PM Pacific time\n* You can dedicate about two weeks per year for travel to company events\n\n**Benefits**\n\nDoximity has industry leading benefits. For an updated list, see our career page\n\n**More info on Doximity**\nWe’re thrilled to be named the Fastest Growing Company in the Bay Area, and one of Fast Company’s Most Innovative Companies. Joining Doximity means being part of an incredibly talented and humble team. We work on amazing products that over 70% of US doctors (and over one million healthcare professionals) use to make their busy lives a little easier. We’re driven by the goal of improving inefficiencies in our $3.5 trillion U.S. healthcare system and love creating technology that has a real, meaningful impact on people’s lives. To learn more about our team, culture, and users, check out our careers page, company blog, and engineering blog. We’re growing steadily, and there’s plenty of opportunity for you to make an impact.\n\n*Doximity is proud to be an equal opportunity employer, and committed to providing employment opportunities regardless of race, religious creed, color, national origin, ancestry, physical disability, mental disability, medical condition, genetic information, marital status, sex, gender, gender identity, gender expression, pregnancy, childbirth and breastfeeding, age, sexual orientation, military or veteran status, or any other protected classification. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law.*\n\n#Location\n- North America

See more jobs at Doximity

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Clearcover


Site Reliability Engineer

Site Reliability Engineer


Clearcover


sys admin

engineer

admin

sys admin

engineer

admin


👁 1,539 viewed | ✍️ 119 applied (8%)
This job post is archived and the position is probably filled. Please do not apply.
\nWhat is a Site Reliability Engineer? \nClearcover is looking for an energetic, pragmatic, and highly motivated Site Reliability Engineer (SRE) to join our team. Those with backgrounds as Systems Engineers who love automation, or Software Developers who love infrastructure, are ideal for this role. The Site Reliability Engineer (SRE) role is an integral part of our Product and Technology organization, and requires a passion for technology and cloud architecture. Pragmatic and driven, you thrive in collaborative environments with high degrees of autonomy. If you are passionate about designing and automating reliable, scalable, and performant systems that empower engineers to deliver value to our customers, we'd love to talk to you!\n\nWhat will you do?\n\n\n* Develop, deploy, and operate our secure AWS infrastructure (EKS, S3, EC2, RDS, Lambda, etc).\n\n* Ensure the high availability, resiliency, performance, business continuity and compliance capabilities of our cloud services.\n\n* Build and enhance our observability tools, including Istio, Grafana, Prometheus, CloudWatch.\n\n* Define standards for our containerized environments and Kubernetes clusters hosted in AWS.\n\n* Work with our engineering teams to deploy and operate cloud services.\n\n* Help develop and operate our automation and continuous delivery systems.\n\n* Participate in on-call rotation, drive incident resolution, live troubleshooting and impact mitigation.\n\n\n\n\nWhat do you need?\n\n\n* 4+ years as a site reliability engineer, sysops engineer, or devops engineer.\n\n* Experience with cloud IaaS offerings in AWS.\n\n* Experience with automation/configuration management using Terraform, Ansible, or similar solutions.\n\n* Experience with cloud native platforms such as Kubernetes or ECS.\n\n* Experience with continuous integration/deployment frameworks such as Jenkins.\n\n* Experience with both SQL and NoSQL databases such as PostgreSQL, DynamoDB, Redis, MongoDB, Elasticsearch, or equivalent.\n\n* Experience with operational monitoring tools, particularly, Prometheus, SumoLogic, and AWS Cloudwatch.\n\n* An interest in designing, analyzing and troubleshooting large-scale distributed systems.\n\n* Well-versed with the entire software development lifecycle, devops, and SRE practices.\n\n\n\n\nNice to haves?\n\n\n* Ability to strike a balance between the needs of today with building towards a greater future\n\n* You use data driven development to build, scope and define features that have a measurable impact\n\n* Experienced with domain driven design to help explain and simplify complex problems\n\n\n\n\nWhat's in it for you?\n\n\n* Unlimited PTO, we hire adults\n\n* Equity for all employees, so you own a piece of the pie too\n\n* Dental and Vision, we've got you covered 100%\n\n* Medical, we cover 90% of your premium and contribute to your HSA and HRA (cha-ching)\n\n* We invest in your future by contributing 3% of your salary to a 401(K), even if you don't\n\n* Come to work pre-taxed through our FSA commuter benefits\n\n* and yes, we have unlimited LaCroix, beer, snacks and the occasional ice cream social\n\n\n

See more jobs at Clearcover

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Carbon Black


Principal Site Reliability Engineer

Principal Site Reliability Engineer


Carbon Black


sys admin

engineer

admin

sys admin

engineer

admin


👁 1,309 viewed | ✍️ 110 applied (8%)
This job post is archived and the position is probably filled. Please do not apply.
What You’ll Do\n\n\n* \nOwn the architecture and management of AWS infrastructure components\n\n\n* \nAutomate and manage the deployment of cloud-based services\n\n\n* \nShare responsibility for health, scalability and availability of our cloud and datacenter services\n\n\n* \nWork with configuration management tools in both Windows and Linux\n\n\n* \nEnsure cloud architecture meets scalability, availability and cost requirements\n\n\n* \nBe a mentor to team members on good operational practices\n\n\n\n\nWhat You’ll Bring\n\n\n* \nB.S. in Computer Science or equivalent experience\n\n\n* \nMinimum 3 years of experience managing AWS infrastructure\n\n\n* \nMinimum of 7 years of experience with systems engineering and software development\n\n\n* \nExpert understanding/experience of containerization services such as Docker/Kubernetes\n\n\n* \nExperience in open-source tools for monitoring, metrics, configuration and log management\n\n\n* \nSolid understanding/experience of web services, databases and relating infrastructure/architectures\n\n\n* \nSolid understanding of backup/restore best practices\n\n\n* \nStrong level of expertise programming in Python / C# / Java or equivalent language\n\n\n* \nExcellent Troubleshooting Skills\n\n\n* \nExperience supporting an enterprise-level SaaS environment\n\n\n* \nSecurity Experience a plus\n\n\n\n

See more jobs at Carbon Black

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

VividCortex Database Performance Monitoring


Site Reliability Engineer

Site Reliability Engineer


VividCortex Database Performance Monitoring


sys admin

engineer

admin

sys admin

engineer

admin


👁 1,380 viewed | ✍️ 99 applied (7%)
This job post is archived and the position is probably filled. Please do not apply.
\n**This position is a remote role only available to US citizens or permanent residents who are authorized to work without sponsorship currently residing in the United States of America** About VividCortex VividCortex provides deep database performance monitoring to drive speed, efficiency and savings. Our cloud-based SaaS platform offers full visibility into major open source databases – MySQL, PostgreSQL, Amazon Aurora, MongoDB, and Redis – at any scale without overhead. By giving entire engineering teams the ability to monitor database workload and query behavior, VividCortex empowers them to improve application speed, efficiency, and up-time. Founded in 2012, and headquartered in the Washington, DC metro area with remote teams in the US and abroad, our company’s growth continues to accelerate (#673 Inc. 5000). Hundreds of industry leaders like DraftKings, Etsy, GitHub, SendGrid, Shopify, and Yelp rely on VividCortex.  We know our team is our greatest strength so we support our people with excellent benefits including 401k, professional development assistance, flexible paid leave (vacation, parental, sick, etc.), and a health/wellness benefit. We enjoy getting together and giving back to the community through volunteer services. We believe in offering every employee the tools and opportunity to impact the business in a positive way. We care about inclusiveness and working with people who help us learn and grow. About The Role VividCortex is looking for a site reliability engineers to help us operate, troubleshoot, and improve the platform that ingests, secures, and analyzes the massive amounts of performance and other data we measure from our customers' database servers. Our platform is written in Go and hosted on the AWS cloud. It uses Kafka, Redis, and MySQL for data storage and analysis. We are a DevOps organization building a 12-factor microservices application; we practice small, fast cycles of rapid improvement and full exposure to the entire infrastructure, but we don't take anything to extremes.\n\nResponsibilities:\n\n\n\n\n* Make our platform operate and scale in a lights-out fashion, without manual intervention. Plan and build for resilience at 10x scale. Our goal is that systems should scale linearly with our customer growth, and the effort of maintaining the systems should scale sub-linearly.\n\n* Define, code, and execute "infrastructure as code" definitions of our systems and how they operate and interconnect.\n\n* Support the definition of the next generation of our platform.\n\n* Analyze and eliminate actual and potential threats to security, availability, and performance.\n\n* Deploy, observe, and operate the systems you build. Use ChatOps, VividCortex, Ansible, the Unix command line, and other tools to do this.\n\n* Help provide customer support and rotate through on-call duty.\n\n* Continually seek to understand and improve our security posture and practices; security is part of everyone's job here.\n\n* Contribute to a culture of blameless learning, responsibility, and accountability.\n\n* Collaborate as needed; work independently when needed. You must be self-managing. You must be present and online during your team's normal working hours, and attend and participate in team calls and the like.\n\n\n\n\n\n\nPreferred Qualifications:\n\n\n\n\n* You are collaborative and experienced in general development, deployment, and operation of modern API-powered web applications using continuous delivery and Git in a Unix/Linux environment.\n\n* Experience with a configuration management system such as Chef, Puppet, Ansible, or Salt.\n\n* Experience with immutable infrastructure and 12-factor architecture is helpful.\n\n* SaaS multitenant experience is a bonus.\n\n* Experience with enterprise security is a plus.\n\n* Experience with Go, Kafka, Redis, and MySQL is very beneficial. Experience with other storage technologies such as Cassandra, Hadoop, and the like are also helpful.\n\n* Bash scripting experience is strongly desirable.\n\n* Experience with project management is helpful.\n\n\n\n\n\n\nNote to Agencies and Recruiters: VividCortex has a strict company policy against engaging with unsolicited contact from agencies or recruiters.  Unsolicited resumes and leads are property of VividCortex and VividCortex explicitly denies that any information sent to VividCortex can be construed as consideration.

See more jobs at VividCortex Database Performance Monitoring

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Balena


Site Reliability Engineer

Site Reliability Engineer


Balena


sys admin

engineer

admin

sys admin

engineer

admin


👁 1,422 viewed | ✍️ 122 applied (9%)
This job post is archived and the position is probably filled. Please do not apply.
\nAbout being a Site Reliability Engineer at Balena\n\nBalena is looking for a Site Reliability engineer to work with the balena core services. Site Reliability engineers at Balena ensure that our platform is available, reliable, and efficient. They develop monitoring solution and disaster recovery plans, respond and investigate incidents, and work closely with the development team to facilitate frictionless deployments to production.\n\nWe're a growing company with opportunities to shape the future of our core system architecture and work to solve the good problems associated with scaling. As a company at the forefront of the emerging IoT sector, and one of the very few putting Docker on embedded devices, we move quickly and innovate aggressively to solve our problems in new and interesting ways.This will be a full-time role.\n\nYou will spend time on...\n\n\n* Defining and developing our monitoring systems\n\n* Designing and practicing disaster recovery plans\n\n* Scaling our infrastructure to meet the demand of hundreds of thousands of clients\n\n* Investigating and evaluating new technologies\n\n* Collaborating with the team to design internal tooling\n\n* Participating in on-call rotation\n\n\n\n\nREQUIREMENTS\n\nYou...\n\n\n\n* Take pride in your work and are passionate about good code\n\n* Are proficient in at least one mainstream programming language\n\n* Have deep knowledge of Linux, networking, and internet protocols\n\n* Are familiar with managing AWS infrastructure\n\n* Are an excellent communicator, fluent in English\n\n* Have a good internet line available so you can join a video call without trouble\n\n* Are comfortable taking on a project and pushing it to completion without too much management\n\n\n\n\nBENEFITS\n\n\n* Work with an extremely talented, diverse team\n\n* Equipment of your choice\n\n* Remote-friendly\n\n* Flexible working hours\n\n* Flexible vacation policy\n\n* Annual company gathering in an international location\n\n* We send you hardware for side projects!\n\n\n\n\nAbout working at balena\n\nWe come from 15+ countries, and we embrace a remote culture with flexible hours. To us, this means being highly productive while still maintaining a healthy work-life balance. You need to be able to work remotely, and have a dependable internet access available so you can join video calls.\n\nWe are an equal opportunity employer and value diverse backgrounds. We maintain a work environment in which team members are treated with respect at all times and in which thoughts and ideas can be shared openly.\n\nWe communicate proposals, discuss with others in the team and accept feedback if it makes the result better. We value the ability to learn, which is more important to us than knowledge of specific technologies. We know that learning fast means being outside our comfort zone, which is OK -- we'd rather grow than let our assumptions get in our way.\n\nTO APPLY\n\nWe're delighted to hear about you! Send us your CV, with a focus on what you can bring to the team.

See more jobs at Balena

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Trov


Devops System Engineer

Devops System Engineer


Trov


sys admin

devops

engineer

admin

sys admin

devops

engineer

admin


👁 786 viewed | ✍️ 35 applied (4%)
This job post is archived and the position is probably filled. Please do not apply.
\nYour main responsibilities will include:\n\nThe role requires you to build and support all cloud-based systems including, but not limited to, Windows and Linux based servers, networking, and security groups. You will assist our application engineers and automation experts as they design and develop infrastructure to improve resiliency, security, and data availability.\n\n\n* Design, implement, and maintain our secure, scalable, and available infrastructure.\n\n* Automate network and application infrastructure using configuration management  frameworks.\n\n* Design and implement innovations that improve software engineering velocity, infrastructure resiliency, security, and data availability.\n\n* Create and manage the continuous build and deployment systems.\n\n* Unify logging and exception framework to allow for faster bug diagnosis.\n\n* Work closely with developers and system administrators to debug application issues by evaluating application logs, stack traces, and system metrics.\n\n* Ensure availability of all systems to meet our internal and external SLA's.\n\n* Respond to outages with a timely and efficient resolution and participate in On-Call availability team in support of a 24x7 global production environment.\n\n\n\n\nYou have/are:\n\n\n* Strong code/scripting skills (C#/Java, Python, PowerShell, Bash, Ruby, etc)\n\n* Linux, OSX, and Windows operating systems\n\n* Systems architecture experience building production grade highly available deployments.\n\n* Experience working in a cloud environment – AWS, Azure\n\n* Versed in internet architectures including web, application, and database components such as IIS, MySQL, and MongoDB\n\n* Experience with continuous integration servers (preferably Jenkins) automation/configuration management using Chef, Puppet, Ansible or an equivalent and\n\n* Source Control systems (preferably git)\n\n* Strong problem solving and analytical skills\n\n* Ability to manage multiple projects, priorities and deadlines unsupervised.\n\n* Combination of education, training, and experience preferred.\n\n\n\n\nWhat we offer:\n\nPaid medical benefits for you and your family\n\nCompetitive salary and equity\n\nThe latest tech and tools to get the job done your way\n\nFlexible and remote working when needed\n\nUnlimited PTO policy, paid holidays, and your birthday off \n\nOther Details:\n\nThis is a full-time remote position (US or Canada), with a presence in the Danville, CA or New York City office four days a week if you’re living within 1 hour of either office.

See more jobs at Trov

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Sticker Mule


Software Engineer Site Reliability Infrastructure

Software Engineer Site Reliability Infrastructure


Sticker Mule


dev

sys admin

engineer

digital nomad

dev

sys admin

engineer

digital nomad


👁 317 viewed | ✍️ 1 applied (0%)
This job post is archived and the position is probably filled. Please do not apply.
What you'll do\n1. You'll design, build, and maintain the tools our software engineers use daily to develop, test, and deploy services.\n2. You'll work to improve the performance, reliability, and security of the Sticker Mule cloud infrastructure.\n3. You'll collaborate with other engineering teams and stakeholders to ensure we're always building the right solution.\n4. You'll share your expertise with other members of the team, review your peers' code, and mentor other engineers.\n5. You'll participate in periodic on-call duties.\n\nAbout you\n1. You have been a professional software engineer for 3+ years.\n2. You're highly skilled in two or more general-purpose languages, and one of those languages is Go.\n3. You have Linux roots, you contain with Docker, you schedule with Kubernetes.\n4. You have automated complex tasks with Bash.\n5. You have practical experience with more than one cloud provider, AWS and Google Cloud would be especially beneficial.\n6. You used logging, monitoring and distributed tracing systems.\n7. You have excellent written and verbal communication skills in English.\n\nWhy you should join\nWe believe high performing organizations include people from different backgrounds and experiences. We are committed to building a safe, supportive work environment where team members can bring their diverse perspectives.\n\nIn joining the Sticker Mule team, you'll have the opportunity to make a significant impact as part of a small, highly motivated team. We have a large variety of interesting technical problems, and we offer above-market compensation. We strive to provide a sensible balance between work and non-work, and we allow you to define your own schedule.\n\nHow the process works\n1. You’ll send us your application including samples of your code and writing. Alternatively, if you don’t have any samples or can’t provide them, you can do a take-home test.\nSelected candidates will be invited to begin our interview process:\n2. An introductory interview\n3. A technical discussion with other engineers, where we'll go through your technical background and the samples you provided. We might also ask you a few technical questions.\n4. A final meeting with our VP of HR.\n\nCompensation and benefits\n1. $90,000-$115,000+ depending on experience.\n2. Signing bonus.\n3. 4 weeks vacation. 

See more jobs at Sticker Mule

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Netdata Inc.


Senior Site Reliability / Devops Engineer

☝️
GMT-3 to GMT+5

Senior Site Reliability / Devops Engineer


Netdata Inc.

GMT-3 to GMT+5

devops

sre

senior

sys admin

devops

sre

senior

sys admin


👁 4,094 viewed | ✍️ 315 applied (8%)
This job post is archived and the position is probably filled. Please do not apply.
Netdata is looking for Senior Site Reliability / DevOps Engineers proficient in CI/CD methodologies, coupled with strong experience in software written in Javascript, Go, C, Python or other scripting languages, to join our distributed (remote) engineering team.\n\nAs a Senior SRE/DevOps engineer you will focus on supportring our netdata cloud offerings and augment our existing development infrastructure by implementing the automations necessary to catalyse further development of both our open-source project and our commerical offerings. This includes building upon our existing CI/CD, packaging, installation facilities and operational processes as well as developing custom solutions for our various teams and systems. As a Netdata SRE/DevOps engineer you will also be assisting engineers across our company, enabling them to provide world-class solutions for numerous platforms; as well as our community, open-source contributors and team-members with your deep knowledge of systems and troubleshooting skills.\n\n**Why join Netdata**\n* We are a team of industry veterans and senior engineers that prioritize performance and ease of use over anything else.\n* We embrace remote work and great work-life balance.\n* We are solving hard problems that affect thousands of organisations worldwide.\n* We are deeply committed to Open Source and love our community.\n* We deeply care about system performance.\n\n**When you join Netdata, you can expect**\n* A competitive salary.\n* A generous stock plan.\n* To join a venture-backed startup working with some of the most sophisticated investors of Silicon Valley.\n* To be part of our world-class team and interact with an amazing community.\n* To see first-hand how to grow and succeed in an engineering-first, open source-based company.\n* To find a culture that rewards doers.\n\n*Netdata is an Equal Opportunity Employer. We are committed to providing an inclusive work environment free of discrimination and harassment for everyone, regardless of race, color, religion, national or ethnic origin, sex, age, sexual orientation, gender identity, disability, sexual orientation, marital status, military service or other non-merit factor.*\n\n# Responsibilities\n * Develop our automated CI/CD, packaging, deployment and execution environment infrastructure.\n* Develop automation tools to catalyse existing development or operational processes.\n* Evaluate, architect and develop technology options for our infrastructure and systems.\n* Troubleshoot, maintain, enhance and augment our platform; candidates will be expected to participate in an on-call rota.\n* Automate tasks wherever possible.\n* Stay up-to-date on emerging technologies. \n\n# Requirements\n**Required experience**\n* A bachelor's degree in Computer Science or equivalent\n* 3+ years of experience on CI/CD tools (Travis, Gitlab, AWS, Azure, etc) and methodologies\n* Minimum 3 years of Linux systems development and/or administration.\n* Minimum 2 years of experience with at least one scripting language, coupled with related automation projects\n* Previous experience with cloud-based technologies and surrounding operational processes\n* Self motivated, conscientious, with a problem-solving, hands-on mindset.\n* Perfectionist where it matters, but also pragmatic, with effective time management skills.\n* Team player, eager to help.\n* Excellent analytical skills.\n* Excellent command of spoken and written English.\n \n\n**Preferred experience**\n* Minimum 2 years of Go, Javascript and C development experience in demanding environments.\n* Expert on Continuous Integration, with long experience in Test Automation\n* 5+ years of shell scripting experience, on at least 2 languages (BASH, python, perl, ruby, etc.)\n* Minimum 2 years of experience with Google Cloud app engine and surrounding operational processes\n* Experience on configuration management and tools to support it (Ansible, puppet, etc.)\n* Experience with monitoring solutions and service assurance in general.\n* A linux, cross-distribution artisan. A good amount of knowledge on windows system administration\n* Open source contributor\n* Agile Development Methodology\n\n#Location\n- GMT-3 to GMT+5

See more jobs at Netdata Inc.

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

O'Reilly Auto Parts


Site Reliability Engineer

Site Reliability Engineer


O'Reilly Auto Parts


sys admin

engineer

admin

sys admin

engineer

admin


👁 1,833 viewed | ✍️ 172 applied (9%)
This job post is archived and the position is probably filled. Please do not apply.
\nThe Site Reliability Engineer is responsible for the availability and performance the platforms and services of O’Reilly Auto Parts. Creates and defines monitoring and incident response tools and processes.\nThe Site Reliability Engineer will create a bridge between development and operations by applying a software engineering mindset to system administration. Time will be split between operations/on-call duties and developing systems and software that help increase site reliability and performance.\n\nESSENTIAL JOB FUNCTION:\n\n\n* Deploy methodologies for building and operating highly available and scalable services.Work closely with Network Operations Center to develop monitoring tools, analyze root cause of incidents, and improve the Network Operations Center’s ability to independently resolve issues.\n\n* Evaluate, build and modify automation for deploying and operating production services.\n\n* Provide leadership in reducing and resolving production incidents.\n\n* Proactively monitor and review application performance. Monitor specific metrics, set thresholds, and trigger alerts based on those thresholds.\n\n* Collect and analyze logging and diagnostic information.\n\n* Identify opportunities to improve all operations processes.\n\n* Facilitate effective transition of services into production ensuring that all requirements have been met in accordance with O’Reilly’s Change Management standards.\n\n* Properly document all incident responses.\n\n* Provide updates and documentation to runbooks and operational manuals.\n\n* Document mean time to recover (MTTR) and mean time to failure (MTTF).\n\n* Participate in on-call rotations.\n\n\n\n\nSKILLS/ EDUCATION/ KNOWLEDGE/ EXPERIENCE/ ABILITES:\n\nRequired: \n\n\n* Bachelor’s Degree or equivalent work experience.\n\n* 5+ years of professional experience in Site Reliability, Linux Systems Administration, DevOps, or Infrastructure Engineering.\n\n* Experience with programming languages including Java, JavaScript and SQL.\n\n* Experience with Shell Scripting such as Bash, Python or Ruby.\n\n* Familiarity with automation and configuration management tools and frameworks.\n\n* Excellent analytical and problem solving skills.\n\n* Strong written and verbal communication skills.\n\n* Must be well organized, detail oriented, and able to self-prioritize work.\n\n* Must exhibit a high degree of professionalism.\n\n* Composed urgency in stressful situations.\n\n\n\n\nDesired:\n\n\n* ITIL Foundations Certification.\n\n* CRE or CMRP Certifications.\n\n\n

See more jobs at O'Reilly Auto Parts

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Ghost

 

Senior Infrastructure Engineer

verified
🌏 Worldwide

Senior Infrastructure Engineer  


Ghost

🌏 Worldwide

devops

sysadmin

sys admin

senior

devops

sysadmin

sys admin

senior


👁 3,722 viewed | ✍️ 391 applied (11%)
This job post is archived and the position is probably filled. Please do not apply.
We're looking for a talented senior engineer to help build, manage, and scale our Ghost(Pro) PaaS infrastructure, serving over 500M requests/month. This is a key role working in a small team and reporting directly to Ghost's CTO.\n\n# Responsibilities\n All of our infrastructure and systems have gone through several iterations, but have ultimately been built by a small/scrappy team of passionate developers without significant prior sysadmin experience. Our platform these days is pretty solid, but it took us a long time and a lot of trial and error to get here. What we’re looking for now is someone who is comfortable and confident in leading our architecture and taking it to the next level.\n\nFor this position, we're explicitly looking for someone experienced (5+ years sysadmin experience, minimum) and confident in taking on a broad set of responsibilities managing, deploying and maintaining complex projects across several different environments.\n\nExtensive experience in systems management and automation is a must. Experience specifically relating to web hosting at scale, continuous integration, monitoring and performance management is a huge advantage. Previous remote work and startup experience is also very valuable.\n\nOur infrastructure is comprised of about 100 servers across two datacenters, running Ubuntu and managed with Saltstack, sitting behind a fairly deep CDN integration. Most common tech across our instances includes MariaDB, Nginx and Phusion Passenger, LXC, Gluster, and a lot of JavaScript. \n\n# Requirements\nThis role requires someone who is exceptional at clear, frequent communication, especially when identifying and responding to infrastructure failures, as well as…\n\n- 🎛 Analysing infrastructure requirements and optimisations based on app performance and user load scenarios\n- ⚙️ Database clustering and replication management\n- ☎️ Monitoring and on-call alert management\n- 🔑 Common security issues and mitigation strategies\n\nThis role would be well suited to someone in an existing ops team at a fast-paced technology company looking for a more senior position where they’re able to have more control and leadership of systems architecture across a company. There are many of opportunities for growth here as the team expands!\n\nWe don't mind where you're based or what hours you work, but this role does require reasonable working-hours overlap with the rest of our internal/ops engineering team in Europe, as well as availability to be on-call on a rotating schedule in the event of downtime.\n\nWe value diversity of all types at Ghost and our team is made up of a kind, thoughtful group of people with a wide range of backgrounds. We have as many people who speak German as we do English and our engineering team contains as many women as it does men. Some of us are single, others are married, while others are parents. We actively try to find people with different perspectives and experiences to the ones we already have.\n\n#Location\n- 🌏 Worldwide

See more jobs at Ghost

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Stack Overflow


Site Reliability Engineer

verified

Site Reliability Engineer


Stack Overflow


sys admin

engineer

admin

sys admin

engineer

admin


👁 1,765 viewed | ✍️ 164 applied (9%)
This job post is archived and the position is probably filled. Please do not apply.
\nCome join the SRE team at Stack Overflow!  As one of the top 50 websites by traffic volume worldwide, we hit some unique challenges. Recently we’ve launched Stack Overflow for Enterprise and Stack Overflow for Teams, allowing organizations to have a private experience on the platform they already know and love. The success of these new products requires us to rethink our infrastructure strategy for supporting on-prem, cloud, and remote deployments.\n\nWe’re looking for someone with Linux and Windows Server experience (3+ years). Experience with managing internet-facing services is a plus.  We don’t expect you to know everything about all of the technologies we use, so you’ll work with other members of the team to learn and develop your skills.\n\nAs an SRE, you’ll bring a developer mindset to system administration, always looking for ways to automate manual work and create repeatable, scalable systems and processes. We are wiki-centric and prefer to document and automate in small increments as we work.\n\nWe are a remote-first team with members in many timezones. Candidates that live near one of our data centers in Jersey City, NJ or Denver, Colorado will occasionally be visiting them in person to keep our infrastructure running.\n\nWhat you’ll do:\n\n\n* Maintain the services and infrastructure platform used by the Stack Overflow websites.\n\n* Be deeply involved in our move from .NET Framework to .NET Core and then to Linux containers in Kubernetes.\n\n* Lead an initiative to adopt monitoring-centric operations (KPI or Google SRE error budgets) for our applications and internal data services.\n\n* Be part of our on-call rotation (approximately 1 week out of 5).\n\n* Act as a subject matter expert around our IIS infrastructure and automation\n\n* (If you are located in NJ/NY) Occasionally visit our Jersey City datacenter when remote-hands are insufficient.\n\n\n\n\nTechnologies you’ll work with:\n\n\n* Our application stack is IIS, .NET Framework and Core, and Microsoft SQL Server on Windows; Redis, Elasticsearch, and HAProxy on Linux (CentOS)\n\n* Our control-plane is a mixture of Puppet for Windows and Linux, moving towards Kubernetes\n\n* Hardware platforms: Dell Servers and EqualLogic storage, Fortinet and Cisco network devices\n\n* In the future: We are in the middle of a multi-year move to Kubernetes.\n\n\n\n\nSome projects that we've recently completed or are working on:\n\n\n* Created an automated pipeline for SSL Certificates with Let’s Encrypt via Hashicorp Vault\n\n* Built our first Kubernetes clusters with associated CI/CD pipelines\n\n* Upgraded 14 production SQL servers without service downtime\n\n* Improved Windows automation by deploying Puppet and Chocolatey\n\n* Created a secure replica of our infrastructure for storing private Q&A data\n\n* Reinvented how DNS is managed\n\n* Implemented autonomous OS upgrades for both Windows and Linux servers\n\n* Upgraded hardware with zero downtime across a variety of services\n\n* Migrated to a new CDN\n\n\n\n\nSkills & Requirements\n\nWe’re looking for:\n\n\n* Experience working in a mixed Linux / Windows environment\n\n* A love of monitoring, and data-driven operations\n\n* A love of Infrastructure as Code\n\n* Experience with the HTTP protocol, load balancers, CDNs\n\n* A track record of taking on challenges and delivering thorough, stable, and maintainable systems\n\n* Strong written communication skills and a strong inclination to “document as you go”\n\n\n\n\nNot required, but please let us know if you have experience with:\n\n\n* Experience with Microsoft SQL Server administration and query tuning\n\n* Experience in security, or have worked in a SOC2 or PCI environment\n\n\n\n\nWhat you’ll get in return:\n\n\n* Flexible hours\n\n* 20 days paid vacation + holidays\n\n* Completely free health insurance - no copay, no premiums (US residents)\n\n* Generous parental leave (10-16 weeks at 100% pay), family care leave, and unlimited sick days\n\n* Employees will never be poked with a sharp stick\n\n\n\n\nWhen you work in our office… You’ll get your own private office in our headquarters in New York City, and enjoy additional benefits like free lunch every day prepared by our own in-house chefs, transportation reimbursement, and all the espresso you can drink.\n\nIf you want to work remote…. (US time zones) We’ll help you set up a great home office, with an ergonomic chair, standing desk, and any other equipment you need to do your job.

See more jobs at Stack Overflow

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Marketcircle

 

Infrastructure Reliability Engineer

Infrastructure Reliability Engineer  


Marketcircle


sys admin

engineer

admin

sys admin

engineer

admin


👁 1,406 viewed | ✍️ 173 applied (12%)
This job post is archived and the position is probably filled. Please do not apply.
Want to work from home? Want to make an impact by working for a small company that values autonomy and working collaboratively in a team to solve challenging problems? Love learning new technology? Then you'll fit right into the Marketcircle Team!\n\nAt Marketcircle, we value family and work-life balance. We need your help to ensure that our infrastructure runs smoothly and securely, day in, day out.\n\nOur applications are a bit different than your standard fare. We have native macOS and iOS apps that store data locally on the device and sync that data with our backend systems. Of course, we run standard things such as a REST API, CalDAV and CardDAV services etc...\n\nYou'll be working in a world with micro-services written mostly in Ruby, some in Python and a touch in Objective-C/Swift running in a *nix environment. We use tools such as Nomad, Consul, Vault, Docker, Kafka, Postgres, ElasticSearch, NGINX, Jira, Confluence, PagerDuty, GitHub, Travis and a few more.\n\nWe run our infrastructure on Digital Ocean with a sprinkle of AWS.\n\nWe have customers throughout the World and continue to grow. You'll help us scale our infrastructure to handle this growth in a steady, secure and reliable manner, helping keep our customers happy and everyone's work/life balance in check.\n\n\n\n\nAbout you:\n\n\n\n\n* You've written and deployed micro-services and are able to work with senior and junior developers\n\n* You've worked with Postgres and/or ElasticSearch\n\n* You've worked with Digital Ocean, AWS or Azure\n\n* You've worked with Docker, Nomad or Kubernetes\n\n* You've used source control tools such as Git\n\n* You've done some continuous integration work with the likes of Travis, Circle CI, Jenkins\n\n* You are familiar with Kanban and Lean process management\n\n* You are able to work in a team environment, believe that a healthy team is important and willing to work keep your team healthy\n\n* You are curious and always seeking to learn new things\n\n* You've been around the block and know that discipline, routines and standards are key to a healthy and secure infrastructure\n\n\n\n\n\n\n\n\nYour years of experience will be a determining factor in whether we consider you for a junior, intermediate or senior role.\n\n\nEligibility\nThis position is fully remote, however you must comply with the following:\n\n- Pass a police background check\n- Be a Canadian, U.S. or E.U. citizen\n\nMarketcircle Inc. is a young, fun and distributed tech company. We believe in the power of Kaizen (continuous learning), teamwork, creativity, ownership, and empathy. By embodying those core values we know we impact the lives of our customers, and each other. Our mission is to empower small business worldwide drives us to develop a native macOS and iOS app that helps thousands create organization of what would otherwise be chaos. Tired of a long commute to work? As long as you have reliable internet, and can work between 10:00AM - 3:00PM EST (core hours), you can work from anywhere! We expect results, not monkeys sitting in cubes for 10 hours a day! Though we are mostly remote, our team tries to meet up in the office every now and again tostados hare some laughs, build camaraderie and eat some good food! We also make it a point to do activities together, like axe throwing, escape rooms, evenings out, etc.

See more jobs at Marketcircle

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Tutuka


Lead Site Reliability Engineer Payments Wallet Space

Lead Site Reliability Engineer Payments Wallet Space


Tutuka


exec

sys admin

engineer

admin

exec

sys admin

engineer

admin


👁 1,631 viewed | ✍️ 130 applied (8%)
This job post is archived and the position is probably filled. Please do not apply.
\nWhat this job entails\n\nAs the Lead SRE at Tutuka you'll be working closely with entire technical team ensuring the reliability of enterprise-level, highly scalable, highly secure financial processing systems that power tens of millions of transactions and tying them to web, mobile and API interfaces that make it easy for people to issue, redeem and reconcile prepaid cards all over the world.\n\nWe already have a team of amazing developers that work out of our local offices in Johannesburg, South Africa as well as remotely across Europe and Southeast Asia, and now we need you to drive improvements in our reliability, scalability and efficiency.\n\nWhat you will be doing\n\nYou'll find every day an exciting challenge, helping our technical team transform a monolithic enterprise processing environment with bank-level security and 99.95% uptime, into a sleek, nimble, micro-service serverless processing environment with better than bank-level security and 99.99% uptime.\n\nIf it was easy, we would already have done it! This role may or may not involve the following:\n\n\n* Work closely with software engineering teams to improve availability, latency, performance, efficiency, monitoring, emergency response, and capacity planning of services\n\n* Across hybrid cloud environment of hosted data centre and AWS\n\n* Handle upgrades of infrastructure and services through automation\n\n* Identify, gathering, documenting and automating responses to key performance metrics, logs, and alerts\n\n* Find optimizations and other efficiencies to scale the application\n\n* Develop playbooks and tools to streamline processes and shorten problem resolution time\n\n* Maintain infrastructure as a code management process\n\n* Perform periodic on call duties\n\n\n\n\nSkills & requirements\n\nWe love taking on team members with a variety of skill levels, from intern to PhD. But there's no getting around the fact that we need this person to know what they're doing, and hit the ground running.\n\nYou should already be an SRE guru with:\n\n\n* Solid understanding of operational principles, such as capacity planning, monitoring and incident handling\n\n* Experience automating manual processes, leveraging cloud (preferably AWS) platforms\n\n* Telemetry, tracing, logging, and alerting best-practices\n\n* Experience implementing monitored and seamless deployment pipelines\n\n* Internet fundamentals. HTTP/s, DNS, TCP/IP, security-by-design, caching\n\n\n\n\nExtra kudos are awarded for:\n\n\n* JVM performance tuning\n\n* Experience in monitoring of cloud based systems\n\n* Knowledge of automated testing frameworks and methodologies\n\n* Experience with some scripted and compiled/virtual languages (for example JavaScript and Go/JAVA)\n\n\n\n\nIf you have no site reliability engineering experience, your application cannot be considered.

See more jobs at Tutuka

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

YouEarnedIt


Site Reliability Engineer Node.js

Site Reliability Engineer Node.js


YouEarnedIt


javascript

node js

sys admin

engineer

javascript

node js

sys admin

engineer


👁 1,858 viewed | ✍️ 173 applied (9%)
This job post is archived and the position is probably filled. Please do not apply.
Overview of the role:\nWe're looking for a Site Reliability Engineer with a passion for scaling and technical problem-solving to help us grow our SaaS platform in the cloud. You’ll have an understanding of Node applications (MEAN stack) You’ll help our applications get the proper love and care they deserve. You'll investigate, develop, automate, and communicate to get the job done.\n\nWhat awesome stuff you'll do:\n\n\nCollaborate with other engineers to help solve problems ranging from systems security to build automation\n Build tools to help developers to manage the applications in the SDLC\nWork closely with other engineers to solve technical challenges and ensure continued application scalability\nResearch, develop and deploy tools to manage each part of the stack\nBuild systems and tools to automate deployment pipelines\nDefine and own best practices for our engineering teams and assist them in engaging these processes\nInfluence our infrastructure direction with your ideas\nStay current with industry trends, systems, and practices and teach others to help them level up\n\n\nWhat you'll need to be successful:\n\n\nA strong desire to innovate, experiment, collaborate and learn\nHigh standards for quality and attention to detail\nExcellent problem-solving and analytical skills\nExcellent oral and written communication skills\nExperience deploying and maintaining a Node application\nYou’re a developer at heart and love to make tools to help other devs\nExperience with cloud concepts and experience applying them to an app\nExperience with application containerization (Docker)\nExperience with monitoring and alerting platforms and tools\n\n\nBonus points for:\n\n\nExperience with CircleCI, ECS, Kubernetes, GKE, Terraform, Spinnaker\nExperience with ElasticSearch, Redis, Memcached\nExperience with MongoDB, Postgres\n\n

See more jobs at YouEarnedIt

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Salesforce


Site Reliability Engineer Heroku

Site Reliability Engineer Heroku


Salesforce


sys admin

heroku

engineer

admin

sys admin

heroku

engineer

admin


👁 1,492 viewed | ✍️ 93 applied (6%)
This job post is archived and the position is probably filled. Please do not apply.
\nSite Reliability Engineer, Heroku\nLocation: US Remote\n*We are a highly distributed team looking for candidates comfortable working remotely.\nAbout Heroku SRE\nHeroku, a subsidiary of Salesforce, operates the world’s largest Platform As A Service (PaaS), continuously delivering millions of apps with a high volume of deploys per day. Heroku's vision is for developers to focus on their applications and leave operations to us.\nWe are writing our team charter and we're looking for engineers who are interested in joining that effort. This is not an established team - you will be among the first people to implement this job role at Heroku. Because the team isn't established, we'll be looking to you to help define how the team should get work done and how it will communicate with other teams in the engineering org, so it will help if you're interested in the human communication problems of engineering.\nWhat's this job like?\nThis job is open to people anywhere in North America (the United States and Canada). You can work at a Salesforce office or work from home. Because the team is just getting started, here's what we can say so far:\nCurrently, we're helping development teams to develop Service Level Objectives (SLOs) for parts of the platform where they don't currently exist and defining minimum standards for service health metrics\nNext, we will define the SRE Entrance process that development teams follow in order to hand off the operation of a production service to the SRE team\nThe team will be on call for multiple production services, once they have gone through the SRE Entrance process. This includes:\n\nResponding to pages generated by automated monitoring and alerting\nResponding to pages created manually by other engineers and support personnel\nJoining an incident response team as a Subject Matter Expert and working with other SMEs and an Incident Commander to resolve the issue (we'll train you for this)\n\nWhen the team is more established, our goal is a 50% focus on engineering activities. Likely projects include:\nAutomated data and service management tooling\nInstrumenting for observability for troubleshooting\nHardening for resilience in the face of operational events and customer behavior\n\nWho are you?\nWe’re looking for people who are interested in complex distributed systems- how they work, how they can work better, how we even know if they’re working at all. We need someone who's spent time working as a developer (writing code with a team to fix operational issues or build features), but who has also spent time on operational concerns (investigating production incidents, creating or updating monitoring and alerting plans for production systems, or investigating performance issues, for instance).\nYou don't need to have “SRE” in your job title in order to have appropriate skills for this position. You might come from a DevOps environment or have been one of a handful of engineers in a shop so small that everyone does a little of everything. The important thing is that you have experience in both writing code and maintaining systems, and that you're willing to do both of those things in the future. If you're stronger in one area than the other, that's okay.\nBe sure to read or skim the Site Reliability Engineering book, which we are modeling our team structure around.\nRequirements\nExperience with complex distributed systems and familiarity with how the internet and web applications work. You don’t have to have built a datacenter or run a large cloud service at a major provider, but you do need to have used cloud services. Running LAN infrastructure or doing client-side system administration is not enough for this role.\nWilling to join an on-call rotation that you would participate in defining.\nWilling to work on a distributed (currently all-remote) team spanning multiple time zones. None of us currently lives in the same place or works out of the San Francisco headquarters; all of us are experienced remote workers.\nComfortable reading and writing code with a team in at least one of Ruby, Go, Python, or Erlang. It's fine if you know more than one of those languages and/or other languages, but they are the four most important languages at Heroku. We need people who are comfortable with them, and open to switching between them.\nHow do I know if I should apply?\nIf you have experience with any of the following topics, you should apply!\nContainers and container management technologies such as lxc, Docker and Kubernetes\nExperience with AWS services like EC2, ELB, DynamoDB, S3 (or their Azure or GCP equivalents- OpenStack experience is fine too)\nDatabases and big data stores, especially Postgres or Kafka\nREST APIs\nLoad balancing technologies, including L4 or L7 routing and CDNs\nMonitoring, instrumentation, or observability\nStandard parts of a web app's stack, such as TCP/IP, DNS, HTTP, etc.\nCloud computing patterns (and how they're different than using hardware)\nInfrastructure as code (Terraform, Chef, Puppet, Ansible, CloudFormation, etc)

See more jobs at Salesforce

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Olo


Site Reliability Engineer

Site Reliability Engineer


Olo


sys admin

engineer

admin

sys admin

engineer

admin


👁 1,425 viewed | ✍️ 88 applied (6%)
This job post is archived and the position is probably filled. Please do not apply.
\nOlo looking for an experienced Site Reliability Engineer (SRE) to support our team of software and infrastructure engineers. This position is more than actively monitoring a dashboard - you’ll work with smart, passionate engineers dedicated to innovation and experimentation while also delivering amazing products.\n\nYou will partner with Engineering and Product Managers to shepherd reliability and availability aspects of product and technology initiatives and help us sharpen our execution skills as we deliver an exceptional platform. Your focus will be on championing whole team reliability while building and maintaining solutions.\n\nAt Olo, Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run at scale, distributed, fault-tolerant systems. As an SRE you will ensure that Olo's applications both internal and external have reliability and uptime appropriate to end users' needs and a feedback loop focused on improvement while keeping a watchful eye on capacity and performance.\n\nYou can work at Olo’s headquarters in New York City’s Financial District or remotely from anywhere in the U.S. In fact, more than two-thirds of our engineering team is remote.\n\nResponsibilities\n\n\n* Practice sustainable incident response (on call rotation) and postmortems.\n\n* Brainstorm, define, and build collaborative monitoring solutions with members across multiple Development, QA, and Infrastructure teams.\n\n* Contribute insights across teams to help us improve or re-architect observability of existing systems to support scale and extensibility.\n\n* Constantly re-evaluate our observability tooling to improve architecture, knowledge models, user experience, performance and stability.\n\n* Maintain services in a running state once they are live by measuring and monitoring availability, latency and overall system health.\n\n* Directly influence an engineering culture of reliability, observability, and availability.\n\n* Contribute to new and existing compliance initiatives.\n\n\n\n\n\n \n\nRequirements\n\n\n* Interest in Operations concepts & technologies. (Automating Ops tasks using Ansible, CloudFormation, Packer, Salt Stack, and Terraform. This is not an Ops position, but we are engineering solutions to automate Ops workflows.)\n\n* Experience with monitoring systems like PagerDuty, Datadog, Sumo Logic, Raygun, and NewRelic.\n\n* Working knowledge of at least one of Jenkins, TeamCity, Octopus, App Service Plan, CodePipeline, CircleCI, etc.\n\n* You've been in the trenches building highly scalable, efficient, and resilient systems.\n\n* Must have senior level experience in defining and participating in incident response at the Production level.\n\n* Understanding of platform level concerns, such as configuration management, canary deployments, etc.\n\n* Obsess about learning, and champion the newest technologies & tricks with others, in an effort to raise the technical IQ of the team.\n\n* Software Development experience\n\n* Legally able to work in the U.S.\n\n\n\n\n\nAbout Olo \nOlo is the on-demand interface for the restaurant industry, powering digital ordering and delivery for over 250 restaurant brands. Olo’s enterprise-grade software powers every stage of the digital restaurant transaction, from fully-branded user interfaces to the back-of-house order management features that keep the kitchen running smoothly. Orders from Olo are injected seamlessly into existing restaurant systems to help brands capture demand from on-demand channels such as branded website and apps, third-party marketplaces, social media channels, and personal assistant devices like the Amazon Echo. Olo is a pioneer in the industry, beginning with text message ordering on mobile feature phones in 2005. Today, millions of consumers use Olo to order ahead (SKIP THE LINE®) or get meals delivered from the restaurants they love. Customers include Applebee’s, Chili’s, Chipotle, Denny’s, Five Guys Burgers & Fries, Jamba Juice, Noodles & Company, Red Robin, Shake Shack, sweetgreen, Wingstop, and more.\n\nOlo is located at 26 Broadway in the historic Standard Oil Building, the former home of John D. Rockefeller.  We offer great benefits, such as 20 days of Paid Time Off, fully paid health, dental and vision care premiums, stock options, a generous parental leave plan, and perks like FitBits, rotating craft beers on tap in our kitchen, and food events featuring our clients' menu items (now you know why we give out FitBits!). Check out our culture map: https://www.olo.com/images/culture.jpg.\n\nWe encourage you to apply!\n\nAt Olo, we know a diverse and inclusive team not only makes our products better, but our workplace better. Many groups are consistently underrepresented across the tech sector and we are fully committed in doing our part to move the needle.\n\nOlo is an equal opportunity employer and diversity is highly valued at our company. All applicants receive consideration for employment. We do not discriminate on the basis of race, religion, color, national origin, gender identity, sexual orientation, pregnancy, age, marital status, veteran status, or disability status.\n\nIf you like what you read, hear, and/or know about Olo, and want to be a part of our team, please do not hesitate to apply! We are excited to hear from you!

See more jobs at Olo

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

New Context Services


Experienced Site Reliability Engineer

Experienced Site Reliability Engineer


New Context Services


sys admin

engineer

admin

sys admin

engineer

admin


👁 1,338 viewed | ✍️ 90 applied (7%)
This job post is archived and the position is probably filled. Please do not apply.
\nSite Reliability Engineer\n\nNew Context is a rapidly growing consulting company in the heart of downtown San Francisco. We specialize in Lean Security: an approach that leads organizations to build better, safer software through hands-on technical and management consulting.\n\nWe are a group of engineers who live and breathe Agile Infrastructure, Systems Automation, Cloud Orchestration, and Information & Application Security. As a New Context Site Reliability Engineer, you will provide technical leadership with a hands-on approach. Our clients look to us to guide them to a solution that makes sense for them, and you should expect to provide thought leadership, design, and implement that solution.\n\nExpect to heavily use Open Source software to take on challenges like delivery of highly secured containers, management of IoT devices or building Big Data ecosystems at petabyte scale and beyond. You will utilize our core methodologies - Agile, Lean, TDD and Pair Programming - along with your fluency in DevOps - to implement robust and reliable systems for our clients.\n\nYou will work with our clients and other New Context team members while working from the New Context office, at client sites, or from your home. We foster a tight-knit, highly-supportive environment where there are no stupid questions. Even if you may not know the answer immediately, you'll have the entire company supporting you via Slack, Zoom, or in-person. We also host a daily, all-company stand-up via Zoom, and a weekly company Retro, so you won't just be a name on an email.\n\nAt New Context, our core values are Humility, Integrity, Quality & Passion! Our employees live these values every single day.\n\nWho you are:\n\n\n* A seasoned technologist with 5+ years work experience in a DevOps, SRE, or Continuous Integration role;\n\n* Experienced in Open Source web technologies, especially in the areas of highly-available, secure systems;\n\n* Accustomed to implementing cloud-based solutions (AWS, Google Cloud, Azure) with significant work experience in public cloud technologies;\n\n* Have developed production-quality applications in an Agile environment;\n\n* Fluent in one or more high-level languages, ideally Ruby and/or Python;\n\n* Familiar with Infrastructure as Code (IaC) and automated server provisioning technologies;\n\n* Experienced as a technical lead on technical projects;\n\n* An excellent communicator, experienced working with external clients and customers and able to communicate productively with customers to explain technical aspects and project status;\n\n* Able to think on your feet and learn quickly on-the-job in order to meet the expectations of our clients;\n\n* A great teammate and a creative and independent thinker.\n\n\n\n\nBonus points if you are:\n\n\n* Comfortable as a technically hands-on Project Manager;\n\n* Experienced managing teams;\n\n* Happy and effective in a consulting role;\n\n* Familiar with: TCP/IP, firewall policy design, social engineering, intrusion detection, code auditing, forensic analysis;\n\n* A believer in automated tests and their role in software engineering;\n\n* Able to translate complex concepts to business customers\n\n\n\n\nTechnology we use:\n\nWe tailor solutions to our customers. You might work on projects using any of the following technologies:\n\n\n* Automation: Chef, Puppet, Docker, Ansible, Salt, Terraform, Automated Testing\n\n* Containerization Ecosystem: Docker, Mesosphere, Rancher, CoreOS, Kubernete\n\n* Cloud & Virtualization: AWS, Google Compute Engine, OpenStack, Cloudstack, kvm, libvirt\n\n* Tools: Jenkins, Atlassian Suite, Pivotal Tracker, Vagrant, Git, Packer\n\n* Monitoring: SysDig, DataDog, AppDynamics, New Relic, Sentry, Nagios, Prometheus\n\n* Databases/Datastores: Cassandra, Hadoop, Redis, postgres, MySQL\n\n* Security: Compliance standards, Application Security, Firewalls, OSSEC, Hashicorp Vault\n\n* Languages: Ruby, Python, Go, JavaScript\n\n\n\n\nAll applicants must be authorized to work in the U.S. We will not sponsor visas for this position.\n\nWe are committed to equal-employment principles, and we recognize the value of committed employees who feel they are being treated in an equitable and professional manner. We are passionate about finding ways to attract, develop and retain the talent and unique viewpoints needed to meet business objectives, and to recruit and employ highly qualified individuals representing the diverse communities in which we live, because we believe that this diversity results in conversations which stimulate new and innovative ideas.\n\nEmployment policies and decisions on employment and promotion are based on merit, qualifications, performance, and business needs. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

See more jobs at New Context Services

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Empire Flippers


Ruby on Rails Engineer

verified

Ruby on Rails Engineer


Empire Flippers


ruby

engineer

api's

ror

ruby

engineer

api's

ror


👁 2,567 viewed | ✍️ 129 applied (5%)
This job post is archived and the position is probably filled. Please do not apply.
The Ruby on Rails Engineer position is vital to the success of our company.\n\nYou’ll use your initiative in implementing API’s and integrations to address our business needs along with the rest of the engineering team. Both our clients and staff will be using the software you write. Our small and close knit engineering team currently consists of a UX specialist, 2 frontend engineers, one backend engineer, one WordPress engineer and several QA specialists. You’ll work closely with the team to implement solutions for all departments within Empire Flippers, be it compliance, customer support, sales and migrations. You’ll work closely with the frontend and WordPress engineers to ensure features are implemented correctly to the highest standard, and you’ll work closely with one more backend engineer to ensure scalability, speed, code cleanliness and readability.\n\nThe feature set will have already been decided on – it’s your responsibility to plough ahead with the implementation and to ensure the user experience is elevated to unprecedented levels and ultimately, close more deals.\n\n# Responsibilities\n We believe in hiring people that are a good fit for us culturally.\n\nA good fit is actually more important to us than the skill set since we will teach you everything you need to know.\n\nYou should have a few good years of experience under your belt, having implemented some complex, data driven applications. Your portfolio speaks louder than your words.\n\nYou should be a ninja with every component of our tech stack. You must have a complete working knowledge of RoR in API mode, SQL, Postgres, Sidekiq, Rspec, Git, Redis.\n\nExperience working with a wide range of 3rd party integrations. Our platform talks to many 3rd party applications, you should have experience building and maintaining such integrations in a test driven fashion.\n\nDev-ops/sysadmin skills. Experience with managing servers, maintaining hosting environments, being responsible for uptime and responsiveness, addressing bottlenecks, ensuring backups are kept safe and sound.\n\nYou need to have immaculate attention to detail. We need to hear you grunting and moaning if something doesn’t quite look or feel right, to the nearest code change and to the nearest hexadecimal color, to the point you become annoying to us. At times other developers may edit your code, you’ll be watching to ensure the code base remains readable, scalable and fast.\n\nBe a good communicator. It sounds very cliché, but you’ll immerse yourself in almost every department, you’ll be learning problems and presenting solutions, and also overseeing the implementation of those solutions too.\n\nA self-starter. We need to see some evidence that you’re able to get up every morning, bite the bullet and just get on with it, even if you’ve tried four coffee shops and none have decent wifi. You won’t have eyes looking over your shoulder on a day to day basis, you’ll be working in almost full autonomy, we’ll need to trust you to deliver the goods. We don’t believe in micro-management.\n\nThe following skills/experience would be a bonus, but not required:\n\nReact. Our client code is written in React. Being able to navigate the front-end code and patch things up would be a huge bonus.\n\nPHP/WordPress. We will be interfacing with WordPress significantly, being able to speak the same language would be great.\n\nDatabases. A comprehensive experience working with various types of SQL and noSQL databases would be very useful. MySQL, Postgres, DynamoDB, Cassandra, to name a few.\n\nCaching. The software we’re building needs to be fast and to remain fast as we scale, both in terms of traffic and database size. Having experience with Memcached, Redis, Varnish or experience with complicated CDN setups with many rules would be a plus. \n\n# Requirements\nHere is the sequence of events we use when hiring our Rails Engineer:\n\nYou record a YouTube video* explaining who you are and why you’re a good fit for the position, fill out an application, and submit it ASAP.\nThe deadline is the 1st of May 2019.\nWe review submissions and schedule interviews.\nSecond interviews are conducted, and a final decision is made.\nThe chosen candidate will begin in May. \n\n#Salary\n40-90K DOQ\n

See more jobs at Empire Flippers

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Invoca


Site Reliability Engineer

Site Reliability Engineer


Invoca


devops

sre

linux

chef

devops

sre

linux

chef


👁 2,171 viewed | ✍️ 32 applied (1%)
This job post is archived and the position is probably filled. Please do not apply.
Invoca offers an unusually valuable engineering experience. You will be part of a team of world-class Operations Engineers deploying code to our production SaaS platform and public cloud infrastructure multiple times per day. Our remote-first team is committed to upholding high standards via modern methodologies of agile software deployment, test-driven development, and DevOps.\n\n# Responsibilities\n * Work on solutions for challenging problems, including:\n * Highly available architecture\n * Large scale data warehouses\n * Scalable and reliable VoIP telephony\n* Exhibit an exceptional diligence to automate processes.\n* Practice sustainable incident response and blameless postmortems.\n* Participate in peer code reviews, design reviews, production on-call rotation, standups, retrospectives, mentoring 1-on-1s, root cause analyses, and more.\n* Focus on the whole production stack; building, maintaining, and monitoring systems that a wide range of internal and external customers are using.\n \n\n# Requirements\n* A systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.\n* Previous experience with a production-like environment, troubleshooting and supporting Linux operating systems and Internet-based applications\n* Demonstrable experience with a Public Cloud provider (e.g. AWS, GCP, Azure) and/or Containers (e.g. Docker, Kubernetes)\n* Hands-on experience with Configuration Management (e.g. Chef, Ansible, Puppet) and/or Infrastructure as Code (e.g. Terraform, CloudFormation).\n* A desire to create and write elegant, scalable, and maintainable tools and solutions.

See more jobs at Invoca

# How do you apply?\n\n This job post has been archived by the poster, which means they probably have enough applicants now. Please do not apply.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.


👁 1,936 viewed | ✍️ 72 applied (4%)
This job post is archived and the position is probably filled. Please do not apply.
Doximity is transforming the healthcare industry. Our mission is to help doctors be more productive, informed, and connected. As a software engineer, you'll work within cross-functional delivery teams alongside other engineers, designers, and product managers in building software to help improve healthcare.  \n\nOur team brings a diverse set of technical and cultural backgrounds and we like to think pragmatically in choosing the tools most appropriate for the job at hand.\n\n**About Us**\n\nHere are some of the ways we bring value to doctors\n* Our web applications are built primarily using Ruby, Rails, Javascript (Vue.js), and a bit of Golang\n* Our data engineering stack run on Python, MySQL, Spark, and Airflow\n* Our production application stack is hosted on AWS and we deploy to production on average 50 times per day\n* We have over 350 private repositories in Github containing our applications, forks of gems, our own internal gems, and open-source projects\n* We have worked as a distributed team for a long time; we're currently about 65% distributed\n* Find out more information on the Doximity engineering blog\n* Our[ company core values](https://work.doximity.com/)\n* Our [recruiting process](https://engineering.doximity.com/articles/engineering-recruitment-process-doximity)\n* Our [product development cycle](https://engineering.doximity.com/articles/mofo-driven-product-development)\n* Our [on-boarding & mentorship process](https://engineering.doximity.com/articles/software-engineering-on-boarding-at-doximity)\n\n**Here's How You Will Make an Impact**\n* Improve the performance and scalability of services, optimize our Rest and GraphQL APIs\n* Manage infrastructure using Chef and Terraform\n* Address security concerns and proficiently maintain our application stack\n* Active involvement in design, implementation, and maintenance of the development, staging, and production infrastructure and services your team is responsible for\n* Troubleshoot issues across the whole stack, such as high-load, memory full, network issues and come up with temporary/long term solutions based on the root cause\n* Create concise postmortems in the event of an outage\n* Write and maintain run-books for other engineers to leverage\n* Ensure proper security, monitoring, alerting, and reporting for the applications your team is responsible for\n* Collaborate with other engineers to make sound infrastructure decisions, improve workflow, and deploy applications ready for production\n* Hands-on maintenance on our Ruby on Rails and Go (Golang) applications\n* Monitor capacity, cost and plan for upgrades\n* Increase our automated test coverage and deployment infrastructure robustness \n* Participate in an on-call rotation\n\n**About you**\n* You are a problem solver with a passion for simple, clean, and maintainable solutions\n* You have extensive experience with Terraform and Chef (or equivalent)\n* You are knowledgeable of memory and CPU profiling tools to help adjust Ruby jobs and processes to use resources effectively\n* You have high familiarity with OOP and design principles to ensure well-architected services\n* You have significant experience deploying, configuring, and maintaining NGINX\n* You are proficient with Unix, AWS, and Git\n* You have experience writing automated tests and appreciate the benefit that tests offer\n* You are self-motivated and able to manage yourself and your own queue\n* You agree that concise and effective written and verbal communication is a must for a successful team\n* You have experience with web infrastructure, distributed systems, and performance optimizations\n* You are able to maintain a minimum of 5 hours overlap with 9:30 to 5:30 PM Pacific time\n* You can dedicate about two weeks per year for travel to company events\n\n**Benefits**\n\nDoximity has industry leading benefits. For an updated list, see our career page\n\n**More info on Doximity**\n\nWe’re thrilled to be named the Fastest Growing Company in the Bay Area, and one of Fast Company’s Most Innovative Companies. Joining Doximity means being part of an incredibly talented and humble team. We work on amazing products that over 70% of US doctors (and over one million healthcare professionals) use to make their busy lives a little easier. We’re driven by the goal of improving inefficiencies in our $3.5 trillion U.S. healthcare system and love creating technology that has a real, meaningful impact on people’s lives. To learn more about our team, culture, and users, check out our careers page, company blog, and engineering blog. We’re growing steadily, and there’s plenty of opportunity for you to make an impact.\n\n*Doximity is proud to be an equal opportunity employer, and committed to providing employment opportunities regardless of race, religious creed, color, national origin, ancestry, physical disability, mental disability, medical condition, genetic information, marital status, sex, gender, gender identity, gender expression, pregnancy, childbirth and breastfeeding, age, sexual orientation, military or veteran status, or any other protected classification. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law.*\n\n#Location\n- North America

See more jobs at Doximity

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Auth0


Principal Node.js Engineer Site Reliability

Principal Node.js Engineer Site Reliability


Auth0


javascript

node js

sys admin

engineer

javascript

node js

sys admin

engineer


👁 909 viewed | ✍️ 48 applied (5%)
This job post is archived and the position is probably filled. Please do not apply.
Auth0, a global leader in Identity-as-a-Service (IDaaS), provides thousands of enterprise customers with a Universal Identity Platform for their web, mobile, IoT, and internal applications. Its extensible platform seamlessly authenticates and secures more than 1.5B logins per month, making it loved by developers and trusted by global enterprises. Auth0 has raised more than $110 million to date and continues its global growth at a rapid pace. We are consistently recognized as a great place to work based our outstanding leadership and dedication to company culture, and are looking for the best people to join our incredible team spread across more than 35 countries!\n\nAuth0 gives companies simple, powerful and developer friendly building blocks so they can free up resources to focus on innovation. We strive to be the identity platform of choice for developers and Enterprises. We take our culture very seriously and are looking for people who are drawn to both our mission and our culture.\n\nOur platform is mainly composed of Node.js services that processes thousands of requests per second for customers all around the world. Scaling and improving our platform is crucial for us. The Site Reliability Teamis looking for Software Engineers that are experts on Node.js internals and building services with Node.js.\n\n\n\n\nYou are a good fit if you...\n\n\n\n\n* Have initiative and can "unblock" yourself to get things done.\n\n* Tend to deliver work incrementally to get feedback and iterate over solutions.\n\n* Can mentor junior people and pair with other teams: education is a very important part of this role.\n\n* Like to get your hands dirty by debugging and fixing issues in production.\n\n* Understand the real problems by reading between the lines and asking good questions.\n\n* Communicate well, take feedback in a positive way and are OK not always doing the most glamorous tasks.\n\n\n\n\n\n\n\n\n\n\nResponsbilities\n\n\n\n\n* Collaborate in designing and developing scalable, reliable Node.js services.\n\n* Determine and implement best practices for writing observable Node.js services:measuring and monitoring availability, latency and overall system health.\n\n* Debug, troubleshoot and fix Node.js issues in production services such as: memory leaks and event loop being blocked for long periods of time.\n\n* Collaborate with engineering teams to optimize services and implement reliability best practices.\n\n* Improve developer productivity by providing better debugging and performance tools.\n\n\n\n\n\n\n\n\n\n\nRequirements\n\n\n\n\n* Experience in designing, analyzing and troubleshooting large-scale Node.js distributed systems.\n\n* You have a systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.\n\n* You have a great ability to debug and optimize code and automate routine tasks.\n\n* You have designed applications and systems that scale, are resilient to failure, and are observable.\n\n* Timezone: we are giving preference to candidates located in GMT-8 to GMT+2.\n\n\n\n\n\n\n\n\n\n\nExtra Points\n\n\n\n\n\n\n* Experience with Amazon Web Services.\n\n* Experience with Linux.\n\n* Experience with MongoDB.\n\n* Experience working in a remote friendly, async environment.\n\n\n\n\n\n\n\n\n\n\n\nAuth0 is an Equal Employment Opportunity employer. Auth0 conducts all employment-related activities without regard to race, religion, color, national origin, age, sex, marital status, sexual orientation, disability, citizenship status, genetics, or status as a Vietnam-era special disabled and other covered veteran status, or any other characteristic protected by law. Auth0 participates in E-Verify and will confirm work authorization for candidates residing in the United States.

See more jobs at Auth0

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Heetch


Site Reliability Engineer

Site Reliability Engineer


Heetch


sys admin

engineer

admin

sys admin

engineer

admin


👁 763 viewed | ✍️ 43 applied (6%)
This job post is archived and the position is probably filled. Please do not apply.
\nDriver Growth @Heetch\n\nWe're a thoughtful, talented, full stack and distributed product team consisting of backend, mobile, frontend and QA engineers, as well as product managers and product designers. We're responsible for the acquisition, engagement, and retention of all our drivers .\n\nOur multi-disciplined team allows us to work autonomously across the realms of our scope. This means we own our roadmap entirely, and we empower each team member to contribute and influence what we work on and how. Our mission is quite simple; Deliver Driver happiness and ensure they get the optimum experience that they deserve. Drivers use and rely on the products we build every single day to earn a living. This is a responsibility that we hold dear and do not take for granted. \n\nSRE within Driver Growth\n\nOur infrastructure receives 2.5 millions of events per day and processes 100M of API requests. We also serve over a dozen thousand rides, have a Driver signup funnel with 50 separate Data fields and process hundred of gigabytes of log and interaction data daily. Our team owns upwards of 20 microservices on top of Elixir, Kafka and Docker, and are focussing our efforts on adding to this number as we extract from our legacy codebase.\n\nTo put it simply; The services we support and the code we produce are critical to the business. Be it a potential driver going through our acquisition funnel, an active driver entering our marketplace or a driver viewing their earnings and account details to name but a few, the impact our backend engineers have on the business as a whole is enormous.\n\nTeam Values \n• Transparency: we discuss everything openly within the team. Our speak up culture is strong. \n• Remote first: our team is fully distributed, and we work hard at that, but feel free to work from any of our offices in Paris, London, Milan, Bruxelles or Casablanca. \n• The courage to fail: we celebrate the wins, but more importantly we're not afraid to fail, we always learn and go again. \n• Team unity: no one is left behind. \n• Code quality: it's not software without tests.\n\nYour role \n\nIn this role, you'll be in charge of building the tools and systems that every backend engineer in the Driver Growth team uses to develop, scale, understand, and monitor their operations. You will dive deep into gnarly operational issues; from the software, systems, automation, and process perspectives, and, you will work with our production services throughout their entire life cycle, from design and architecture, through implementation, deployment, and sustaining operations.\n\n\nWhat will you do? \n\n• Build tools and infrastructure to make the team iterate faster without overthinking about the core infrastructure. \n• Partner with fellow backend engineers to architect and develop mission-critical systems that can stand the test of scale and availability, while limiting operational overhead. \n• Perform deep dives into both systemic and latent reliability issues; partner with software and SRE engineers across the organization to produce and roll out fixes. \n• Design, build & support systems to detect, alert and remediate or escalate on the team' platform. \n• Contribute to standardization efforts across multiple disciplines and services in conjunction with the Core SRE team. \n• Handle efficiencies in systems and processes: design, configuration management, performance tuning, monitoring, and root cause analysis. \n• Participate in an on-call rotation and contribute to needed escalation missions. \n\nWhat do you need? \n\n• Software Engineer background (+5 years). \n• Practical knowledge of various aspects of service design like messaging protocols & behavior, caching strategies and software design practices. \n• Solid understanding of systems and application design, including the operational trade-offs of various designs. \n• Excellent programming skills in Go, and an ability to pick up new programming languages. \n• Excellent written and social communication, and documentation skills in English. \n• Be adaptable and able to focus on the most straightforward, most efficient & reliable solutions. \n• Experience in the Linux environment and a deep understanding of its fundamentals and internals: filesystems and modern memory management, threads and processes, the user/kernel-space divide, networking. \n• Exposure to the AWS ecosystem. \n• Real world experience with Packer/Terraform. \n• Customer service skills and empathy to develop solutions that span multiple teams. \n• Work well with and be able to influence a myriad of personalities at all levels.\n\n Bonus \n• Experience building highly-available fault-tolerant distributed systems with microservices, including containerized architectures, application security, monitoring, and storage systems. \n• Experience with message brokers (such as RabbitMQ or Kafka). \n\nPerks \n\n• Stocks. \n• Paid conference attendance/travel. \n• Heetch credits. \n• A Spotify subscription. \n• Code retreats and company retreats. \n• Travel budget (visit your remote co-workers and our offices).

See more jobs at Heetch

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Skillshare


Senior Site Reliability Engineer

Senior Site Reliability Engineer


Skillshare


senior

sys admin

engineer

admin

senior

sys admin

engineer

admin


👁 896 viewed | ✍️ 30 applied (3%)
This job post is archived and the position is probably filled. Please do not apply.
\nAs a Senior Site Reliability Engineer at Skillshare, you’ll play a key role in balancing our current operations with building for the future. We’re scaling quickly and are excited to bring someone on board who can help us proactively tackle resulting challenges – both in the day-to-day operations, and anticipating those further out. This role is an exciting blend of both Infrastructure and DevOps, which means opportunity for impact across the board. We’ll look to your strategic expertise, reliable execution, and sound judgment to improve and maintain our infrastructure, along with creating increasingly smooth processes for our engineers as we grow the platform. You’ll be joining a team that’s passionate about technology, and helping pave the way for building products together that we’re proud of. We’re excited to meet you.\n\nWhat you’ll do:\n\n\n\n\n* Improve, monitor and maintain our infrastructure\n\n* Ensure site uptime and performance\n\n* Maintain and improve development and QA environments\n\n* Work with web developers to improve tooling for initiatives like unit testing, deployment processes, etc.\n\n* Proactively prep and train developers for improvements or updated workflows\n\n* Quickly and proactively resolve developer issues\n\n* Support the platform team in building new application platform on Node.js\n\n* Make strategic recommendations and improvements to our application and infrastructure security\n\n\n\n\n\n\nWhat you’ll need to be successful:\n\n\n\n\n* Experience building and supporting cloud-based web infrastructure with AWS\n\n* Docker experience (Kubernetes experience is a plus)\n\n* Continuous integration and deployment experience (preferably with CircleCI)\n\n* Relational databases and queueing systems knowledge (we use MySQL, Redshift, Redis)\n\n* Experience with application monitoring and alerting systems (we use New Relic and Datadog)\n\n* Understanding of web infrastructure: load balancing, high availability configurations, disaster recovery, DNS configuration, security best practices, etc.\n\n* Working knowledge of software engineering practices\n\n* Strong communication skills – you’re a natural collaborator and can report out to stakeholders of all levels\n\n* Ability to balance strategy and execution\n\n\n\n\n\n\nWhy you want this job:\n\n\n\n\n* Impact: you’ll play a key role in shaping the direction of our infrastructure and developer processes long-term\n\n* Growth: Our team is small, so you’ll have room to wear a lot of hats and take on more responsibility over time.\n\n* Our mission: We are building a learning ecosystem for the new economy and changing millions of lives for the better.\n\n* Our team: We have a passionate, smart team that is a lot of fun to work with.\n\n* Your life: We take pride in our flexibility. Need flexible hours, or work a day or two remotely? No problem. We trust you to do what you need to do.\n\n\n\n\n

See more jobs at Skillshare

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Leadfeeder


Site Reliability Engineer

Site Reliability Engineer


Leadfeeder


sys admin

engineer

admin

sys admin

engineer

admin


👁 752 viewed | ✍️ 23 applied (3%)
This job post is archived and the position is probably filled. Please do not apply.
\nWe are a fast-growing startup based in Helsinki determined to make Leadfeeder a big thing globally. We have a solid international business and our customers love what we do for them with Leadfeeder.\nWe are looking for a talented Site Reliability Engineer to join our team. You can either be based in Helsinki, Finland or work remotely (Europe).\n\n\nAs a Site Reliability Engineer you would\n * Work together with our other engineers to improve our automated cloud infrastructure on AWS.\n * Monitor and analyze the Leadfeeder infrastructure and applications with tools like New Relic, AWS CloudWatch, Prometheus and ELK-stack.\n * Automate technical operations: deployments, scaling, recovery, etc.\n * Analyze and improve system performance and reliability.\n \n\n\nBenefits\n\n * Competitive base pay\n * Possibility to work remotely for everyone\n * An interesting and growing field of business\n * Great support from your new colleagues\n * The chance to work with cool and exciting technologies\n * A chance to be part of the next Finnish success story\n * Cool office in central Helsinki. Including bike storage.\n * Home Internet subscription\n * Fun events with the whole crew\n \n

See more jobs at Leadfeeder

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Quimbee


Site Reliability Engineer

Site Reliability Engineer


Quimbee


sys admin

engineer

admin

sys admin

engineer

admin


👁 809 viewed | ✍️ 35 applied (4%)
This job post is archived and the position is probably filled. Please do not apply.
\nQuimbee is growing! Were looking to add a new full-time member to our core team. This position is 100% remote (U.S. only). All you need is an internet connection and a quiet place to work.\n\nWho We Are\n \nQuimbee is one of the most widely used e-learning platforms for law students in the United States. Simply put, our mission is to help law students get A's in their law school courses.\nSince 2007, Quimbee has helped over a hundred thousand law students prepare for classes and final exams. We provide law students with access to a comprehensive database of case summaries, video lessons, practice questions, a legal dictionary, and a growing library of content. We have become one of the most widely used and trusted sites for law students, serving both institutional clients, such as Yale University, American University, and University of Illinois, as well as thousands of individual law students.\nWe prefer a small and highly effective engineering team, so every new team member is vital to the success of the company.\nWho Were Looking For\nWe are looking for our first site-reliability engineer (SRE). As our SRE, you must have strong experience with Ruby on Rails based applications. Ideally, you're an experienced Ruby on Rails developer with a passion for operations tasks. Your focus will be on improving our deployment practices, maintaining, troubleshooting, documenting, and improving the systems that keep our Heroku hosted system running securely and smoothly with the least downtime possible. Eventually, we might also consider alternative hosting platforms in the future, and we expect you to help with that too. There will be a lot of monitoring, alerting, and prioritizing what is worth our attention and what's not. You're expected to investigate and mitigate single points of failure, performance bottlenecks, slow SQL queries, errors, or any other identified issues trying to solve them yourself or with the help of the other developers in the team.\nYou'll have the opportunity to help us define and shape processes, tools, and best practices in the context of our platform. You'll work closely with our team of developers to determine the current state of our platform as well as defining the future of it. Strong candidates will bring strong engineering and operations acumen, combined with the ability to move fast (and fix things).\nWe're looking for collaborative, detail-oriented people who are ready for a challenge. In this role, you'll be responsible for working on the critical task of ensuring our backend systems are rock solid and scalable. \nYoull join a small, 100% remote tech team. Your voice will be heard when we need to make new technical decisions as our product grows. We expect you to go beyond coding to give input on the product roadmap, design, and architecture.\nWe look for:\n * A Ruby developer. You have deep software engineering experience and are comfortable writing code in Ruby as well as at least one other programming language.\n * A DevOps advocate. You believe in the benefits of immutable infrastructure and understand what it takes to implement it from the operating-system level up to datacenter deployments.\n * A data-driven engineer. You know the difference between an MTTR and MTTD and have the skills necessary to optimize them.\n * A great process and code debugger. You feel comfortable leading robust and thorough root cause analysis (RCA) sessions to attack problems at their core and ensure they dont recur.\n * A self-starter. You take responsibility for projects from idea to completion, proactively seeking assistance as needed while guiding the work to successful outcomes.\n * A versatile engineer. You know what you dont know and feel comfortable learning new skills. Youre not ashamed of recognizing mistakes and take measures to avoid falling again.\n * A team player. You share code ownership as much as possible. You don't mind fixing other peoples code or stepping in to help a teammate.\n * A minimalist. You believe a new feature should be built only when the evidence supports it. Youre willing to push back when you believe this rule is being ignored or violated.\n * A great communicator. You communicate your ideas, feedback, and criticism thoroughly, clearly, and courteously. You believe theres no such thing as over-explaining or over-clarifying because thats how miscommunication is avoided.\n \n\n* A business-minded engineer. You have a deep understanding of the importance of building maintainable, efficient, clean code while balancing that with the urgency of the business needs.\n\n\nTask Examples\nWorking with us, you could be asked to (solo or as part of a team):\n * Create and maintain documentation about our platform and all the third-party services it depends on, defining a plan of development for failover mechanisms to improve our platform's resilience.\n * Investigate issues reported by our automated systems or our customer support or QA teams, determine impact and root cause, then prioritize and document them, and solve them yourself when possible or sync with our devs team to solve it.\n * Streamline our deployment process so that deployments are as smooth as possible both for our users as well as for our teams, considering the possibility of having to rollback.\n * Educate engineers throughout the company on how to ensure their projects meet our reliability, performance, and security requirements.\n * Reduce the server-side and front-end latency of our application to deliver a lightning-fast user experience.\n * Optimize our hosting bill by increasing throughput and resource efficiency, while planning capacity for the next two years of growth.\n * Determine and configure a core set of metrics and alerts to make sure our apps and servers are running smoothly and that we can react fast if something bad happens.\n * Develop and maintain performance and load tests.\n * Possible on-call responsibilities.\n \n\n\nBenefits\n\nWhat We Offer\n * Join a small team who loves what they do.\n * 100% remote work for unlimited flexibility.\n * A competitive salary.\n * Untracked paid time off and sick leave.\n * Healthcare coverage (including dental) for you and your family.\n * 401(k) with 3% company matching.\n \n

See more jobs at Quimbee

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

OMNITEC Solutions


System Engineer Web Services

System Engineer Web Services


OMNITEC Solutions


sys admin

web dev

engineer

admin

sys admin

web dev

engineer

admin


👁 1,933 viewed | ✍️ 42 applied (2%)
This job post is archived and the position is probably filled. Please do not apply.
\nSystem Engineer, Web Services\n\n\nAlternate Job Title: System Engineer, DevOps\n\n\n(job number:190002MD)\n\n\nWork Location: Fort Meade, MD, 20755\n\n\nCan you help design and manage the infrastructure DOD's primary public facing web sites? We have started our DevOps journey, and need you to help us reach the next level!\n\n\nJob Description\n\n\nOMNITEC Solutions, Inc., (http://www.omnitecinc.com) has an immediate employment opportunity for a strong Systems Engineer supporting a high profile web hosting infrastructure at our client site in Ft. Meade, Maryland.\n\n\nQuick Note #1: US citizenship and either an active clearance or a clearance that has been inactive less than 24 months; \n\n\nQuick Note #2: Partial remote/telecommute hours are available, but you must reside locally to the 20755 area \n\n\nCome be part of our continuing effort to understand and implement emerging next-generation best practices such as running a lean enterprise, version control, and managing Infrastructure as Code. Help us as we transition our infrastructure and processes into a cloud-based, highly scalable, and efficient environment.\n\n\nYou will work closely with other System/Network professionals, Web Developers, Database Administrators, Built/Test engineers and Help Desk professionals supporting hundreds of sites, including:\n\n\n\n* http://www.defense.gov;\n\n* http://www.af.mil/;\n\n* http://www.navy.mil/;\n\n* http://www.marines.mil/; and\n\n* http://www.usace.army.mil/.\n\n\n\n\n\nDuties and Responsibilities\n\n\n\n* Deploy and support the Windows Platform on physical hardware and virtual machines.\n\n* Deploy and support ASP.Net and PHP Web applications using technology such as IIS, ARR, Microsoft SQL Server and other web related technologies.\n\n* Support virtual infrastructure locally and within AWS cloud\n\n* Provide operational support such as server monitoring, backups, patching, and other duties as required.\n\n* Write PowerShell to automate redundant tasks.\n\n* Find novel solutions to difficult problems.\n\n* Constantly learn and improve.\n\n* Share knowledge and collaborate with your teammates.\n\n\n\n\n\nRequired Skills: \n\n\n\n* 5+ years relevant professional experience.\n\n* Experience with cloud platforms such as AWS or Azure\n\n* Experience with a virtualization platform such as VMWare\n\n* Strong analytical and troubleshooting skills.\n\n* Strong written/verbal communication skills. \n\n* Strong demonstrable knowledge of Microsoft Windows Server. \n\n* Scripting Skills (PowerShell strongly preferred).\n\n* Fundamental knowledge of networking and common protocols.\n\n* Understanding of IIS and related Web technologies. \n\n* Working knowledge of Microsoft SQL Servers\n\n* Security certification such as Security+ certification or higher or ability to immediately obtain.\n\n* US citizenship and either an active clearance or the ability to immediately obtain an Interim Secret is required.\n\n\n\n\n\nHelpful Skills\n\n\n\n* Experience with technologies such as: Github, Splunk, and JIRA/Confluence;\n\n* Knowledge of configuration management using tools such as Desired State, Configuration (DSC), Chef, or Puppet;\n\n* Experience with a CDN such as Akamai;\n\n* Helpful certifications include but are not limited to: MCSA, MCSE, Server+, CISSP, AWS Certified SysOps Administrator, AWS Certified DevOps Engineer, AWS Certified Solutions Architect.\n\n\n\n\n\nWhy should you apply?\n\n\n\n* Ability to work with emerging technologies;\n\n* Highly collaborative team environment;\n\n* Remote working opportunities;\n\n* Tuition/Certification reimbursement.\n\n\n\n\n\nPlease apply via the link on this posting or via: http://omnitecinc.applytojob.com/apply/\n\n\nOMNITEC Solutions, Inc., is an Equal Opportunity/Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, disability, gender, gender identity, marital status, national origin, race, religion, sexual orientation, veteran status or any other characteristic protected by law.

See more jobs at OMNITEC Solutions

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Amazee


System Engineer

System Engineer


Amazee


sys admin

engineer

admin

sys admin

engineer

admin


👁 829 viewed | ✍️ 22 applied (3%)
This job post is archived and the position is probably filled. Please do not apply.
\nWe are excited to introduce our latest job opening for amazee.io. We invite you to take a look at our website at http://amazee.io, read of the job description below, and to make contact with us if you find our offer exciting.\n\nTo apply, please upload a maximum one-page cover letter in PDF format. Your cover letter should showcase your communication style, introduce yourself, detail your relevant technical work experience, and highlighting anything you feel we should know before we make contact with you. Feel free to also upload a concise resume.\n\nThe company and the job\n\namazee.io provides a Drupal hosting Platform as a Service to clients worldwide. We excel in offering outstanding support and a local Docker-based development environment congruent to production servers, supporting production deployment to any server in any datacenter in the world.\n\nDo you like to support and share knowledge with other engineers? Do you automate processes rather than doing the same thing three times? Do you believe in DevOps as a culture? Do you want to contribute to an open-source project on a daily basis? Then we may have the perfect job for you.\n\namazee.io is looking for a System Engineer to join our tech team. You will be essential in ensuring the happiness of our clients, the stability, and the growth of our high-performance hosting environment. We are a lean, open, international, and fully remote distributed team. \n\nOur hosting environment is based on Docker and Kubernetes/OpenShift, operates in multiple locations all over the world, is completely open source, and currently serves over 500 million hits per month. We are constantly updating and improving the platform and need bright minds who are interested in helping us scale to build the world's best and fastest Drupal hosting environment.\n\nResponsibilities:\n\n\n* Client onboarding and training\n\n* Provide outstanding support to our clients via Slack and in E-Mail\n\n* Go the extra mile for our customers to provide trouble free deployments of their sites\n\n* Assist with ensuring the stability of our hosting environment\n\n* Specify, implement, document, and roll out features/updates\n\n* Support tech teams by being an advocate for and implementing DevOps Principles\n\n* Share the responsibility for frequent off hours “on call” duty\n\n\n\n\nRequirements:\n\n\n* You prefer to work during standard office hours in Australia (UTC+10 to UTC+8)\n\n* 2 years+ of System-Engineer/System-Admin experience\n\n* Working knowledge of Linux, Nginx, Varnish, MariaDB/MySQL\n\n* Working knowledge of Drupal and PHP\n\n* Talking to clients and providing training to technical client personnel doesn’t scare you\n\n* Communicating effectively via cat gifs and emojis\n\n* Experience with Docker\n\n* Basic knowledge of high availability and clustered servers\n\n* Basic knowledge of Mac OS X, Windows\n\n* You're able to self-organize, work independently, but also work as part of a globally distributed team\n\n* You are experienced with or are familiar with agile project methodologies and principles\n\n* Ability to analyze problems and keep calm and focused during outages\n\n* Very good at reading, writing, and communicating in English\n\n\n\n\nBonus Points:\n\n\n* Experience with OpenShift and/or Kubernetes\n\n* Experience with AWS and/or Azure\n\n* Experience with server security and system hardening\n\n* Experience with Ansible, Node.js, or Golang\n\n* Experience with performance optimization and load testing\n\n* You like to share your knowledge by speaking at conferences and meetups\n\n\n\n\nWe offer:\n\n\n* Work in a distributed team of creative professionals in a flat international organization \n\n* Opportunity to build a first-class open-source hosting environment  \n\n* Budget for tech equipment every two years\n\n* Annual budget for education and health & wellness\n\n* Attendance to events and conferences\n\n\n\n\nWork location:\n\nLocation is not that important, however a desire to work during business hours in the UTC+10 timezone is critical.\n\nIf you are excited by the opportunity to join an international hosting company and think you might be a good fit, we would love to hear from you!\n\n\nApplication instructions\n\nTo apply, please upload a maximum one-page cover letter in PDF format. Your cover letter should introduce yourself, detail your relevant technical work experience, and highlighting anything you feel we should know before we make contact with you. Feel free to also upload a concise resume.

See more jobs at Amazee

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.


👁 1,453 viewed | ✍️ 44 applied (3%)
This job post is archived and the position is probably filled. Please do not apply.
SwissBorg is a ambitious fintech building a 2nd layer crypto investment mobile app with an institutional grade back-end technology in order to democratize access to wealth management. Developed by a team of tech and financial experts, we are decentralized to the world with teams in Switzerkland, Toronto, Tokyo and London and operate as a meritocracy.\n\n\n\nWe recently completed our successful ICO (52MUSD raised) and are now following our community-driven roadmap, rapidly expanding and growing our workforce.\n\n\n\nWe are offering you the opportunity to join our team as Financial Scala engineer: if you like fast-paced environments, agile thinking and flexible work policy, this is your chance to apply!\n\n\n\nAdvantages\n\n* Freedom to create, build a research architecture and the company you always dreamed of\n\n* Grow in an environment with experts in crypto, investments, coding, AI, psychology, and business\n\n* Very competitive Salary and Bonus\n\n* Free lunches and unlimited drinks. (Coffee, loose leaf teas, draught beer)\n\n* Flexible work hours\n\n# Responsibilities\n * Write server-side Scala code to build a scalable infrastructure capable of handling digital cash and crypto-currency transactions for smartphone users all around the world.\n\n* Identify requirements for backend architecture design and API with the engineering team.\n\n* Integrate with multiple crypto exchanges, focusing on high scalability\n\n* Solve algorithmic challenging problems in the context of finance and distributed systems\n\n* Write rigorous automated tests and ensure code quality and documentation\n\n* Optimize the code for performance \n\n# Requirements\n* Expertise in Scala is a must\n\n* Experience with JVM technologies, RDBMs, cloud architectures. Cassandra and Kafka a plus\n\n* Knowledge about the CQRS/Event Sourcing architecture\n\n* Strong sense of ownership and entrepreneurial thinking - customer value should be your key driver\n\n* Flexible, proactive, organized, detail-oriented and entrepreneurial\n\n* Excellent English communication skills. French is a plus\n\n* Experience in the finance industry a plus

See more jobs at SwissBorg

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Numbrs Personal Finance AG


Security Engineer

Security Engineer


Numbrs Personal Finance AG


securitiy

cryptography

go

python

securitiy

cryptography

go

python


👁 2,710 viewed | ✍️ 91 applied (3%)
This job post is archived and the position is probably filled. Please do not apply.
At Numbrs, our engineers don’t just develop things – we have an impact. We change the way how people are managing their finances by building the best products and services for our users. \n\nNumbrs engineers are innovators, problem-solvers, and hard-workers who are building solutions in big data, mobile technology and much more. We look for professional, highly skilled engineers who evolve, adapt to change and thrive in a fast-paced, value-driven environment.\n\nJoin our dedicated technology team that builds massively scalable systems, designs low latency architecture solutions and leverages machine learning technology to turn financial data into action. Want to push the limit of personal finance management? Join Numbrs.\n\n**Job Description**\nYou will be a part of a team that is responsible for developing, releasing, monitoring and troubleshooting large scale micro-service based distributed systems with high transaction volume. You enjoy learning new things and are passionate about developing custom security tools, reviewing designs, code, performing in-depth security assessments of mobile apps, distributed backend systems and internal IT infrastructure. You are a great teammate who thrives in a dynamic environment with rapidly changing priorities.\n\n# Responsibilities\n **All candidates will have**\n* a Bachelor's or higher degree in technical field of study\n* a minimum of 3 years security work experience\n* experience with performing application code reviews, design reviews and penetration testing\n* experience in penetration testing web-based apps, mobile apps and back-end infrastructure\n* experience implementing modern cryptosystems\n* excellent knowledge with at least one modern programming language, such as Go, Java, C++, Python and Scala\n* excellent troubleshooting and creative problem-solving abilities\n* excellent written and oral communication and interpersonal skills\n\n**Ideally, candidates will also have**\n* experience with systems for automating deployment, scaling, and management of containerised applications, such as Kubernetes or Mesos\n* experience working with large scale distributed systems

See more jobs at Numbrs Personal Finance AG

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

MaxMind


Site Reliability Engineer Telecommuting Opportunity

Site Reliability Engineer Telecommuting Opportunity


MaxMind


golang

telecommuting

sys admin

engineer

golang

telecommuting

sys admin

engineer


👁 795 viewed | ✍️ 18 applied (2%)
This job post is archived and the position is probably filled. Please do not apply.
Waltham, United States - MaxMind (www.maxmind.com) is looking for a talented Site Reliability Engineer (SRE) to join our Engineering team. We help protect thousands of companies worldwide from fraud, screening over 2 billion online transactions each year, and we provide IP intelligenc...

See more jobs at MaxMind

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

MaxMind


Site Reliability Engineer

verified

Site Reliability Engineer


MaxMind


sys admin

engineer

admin

sys admin

engineer

admin


👁 730 viewed | ✍️ 4 applied (1%)
This job post is archived and the position is probably filled. Please do not apply.
\nMaxMind (www.maxmind.com) is looking for a talented Site Reliability Engineer (SRE) to join our Engineering team. We help protect thousands of companies worldwide from fraud, screening over 2 billion online transactions each year, and we provide IP intelligence data to thousands more. This work requires us to tackle formidable challenges and we want you to help.\n\nThe Position Overview\n\nDo you have SRE skills and are ready to collaborate with us in continuous improvement for scaling, performance and security to support our customers?  Do you want to contribute to the improvements and delivery of a highly available, fault tolerant, and secure customer facing SaaS solution for real-time fraud analysis and IP intelligence which can serve over 2 billion transactions per year?  Will you be a collaborator with peers and Product to define and contribute to the overall development of complex features, and to the success of MaxMind software products? Security is crucial for us - and we are looking for a team member who will help us continuously improve our solution and explore new technologies including Cloud services.\n\n\nAs a MaxMind SRE, you will utilize the best from DevOps and SRE methodologies to make a difference in defining broader architectural, design, and technical objectives of MaxMind, and achieving customer satisfaction by:\n\n\n* Building performant and scalable SaaS solutions and the tools to maintain them\n\n* Collaborating, mentoring,and advising to others\n\n* Offering ideas and suggestions to the improvement of the development tool set, technical direction, and software architecture\n\n* Identifying, triaging, and resolving system issues\n\n* Designing and developing software and tools\n\n* Researching changes in technologies, development environments, and tools including cloud services\n\n* Enabling and extending complex system monitoring\n\n* Updating configuration management and deployments\n\n* Supporting on call after hours in rotation with other members of the team\n\n\n\n\nMinimum Qualifications\n\n\n* 5+ years Experience in an Operations/SaaS Production focused Engineering team, including DevOps and Site Reliability Engineering (SRE), enabling Highly Available SaaS solutions processing web traffic\n\n* Experience building complex monitoring solutions to support identification of issues with high availability\n\n* Able to investigate and resolve issues with Linux performance and network latency/reachability\n\n* Significant experience with Linux systems administration (we use Ubuntu, but that's not essential).\n\n* Experience managing PostgreSQL, including streaming replication and backups\n\n* Programming experience - preferably in Go or Perl. Our code is mostly Ansible and Perl, but we're happy to hear from you if more familiar with other programming languages or configuration management software\n\n* Proficiency with configuration management tools like Puppet, Chef, Ansible.\n\n* Solid understanding of fundamental networking technologies.\n\n* Knowledge of best practices related to security, performance, and disaster recovery.\n\n* Experience with web server configuration, monitoring, trending, network design, high availability.\n\n* Experience with version control, preferably Git\n\n* Strong analytical and problem-solving skills, with logical and repeatable debugging and problem solving approaches\n\n* Ready to learn new things\n\n* Excellent written and verbal communication skills with ability to communicate clearly with partners and end users\n\n* Able to work with a geographically distributed team\n\n\n\n\nHighly Desired (or excited to learn)\n\n\n* Experience doing security audits, security compliance, or penetration testing\n\n* Experience with\n\n\n\n* HAProxy configuration\n\n* Docker, Kubernetes, or other container tools\n\n* ELK/Elastic Stack\n\n* Cloudflare\n\n* Open source technologies\n\n\n\n* Experience with cloud platforms and infrastructure tools, and moving services to a cloud platform\n\n\n\n\n\nOur Engineering Practices\n\nOur SIte Reliability Engineers are members of our Engineering team, working together to deliver to our customers’ success. At MaxMind, we are committed to security and the contributions of our SREs are integral to our work. To learn more about our commitment to security, visit https://www.maxmind.com/en/company/commitment-to-security. We have built a culture of peers, with highly developed practices and processes to work together remotely. To learn more about working at MaxMind, visit https://www.maxmind.com/en/company/working-at-maxmind.  \n\n\nWe use Linux, PostgreSQL, and Ansible to deliver our solution.  We use a wide variety of tools to manage and monitor our systems, including Nagios, Sensu, Grafana, and the Elastic/ELK stack. All work goes through internal code review on GitHub Enterprise.\n\n\n\nOur goal is to automate as much as possible. Our tools are written in Perl and Go. We also want to improve our coding practices for the sysadmin code we write, writing libraries and tests wherever possible instead of one-off scripts.\n\nWorking at MaxMind\n\nMaxMind is a casual, friendly, results-focused company of 45+ employees.  We are passionate about global health and development, as MaxMind and its founder gladly donate over 60% of corporate profits to charities (https://www.maxmind.com/en/corporate-giving).  We maintain a set of core, overlapping hours, but are flexible with specific start and end times and are understanding about appointments and life events. Our software team is largely comprised of telecommuters, so communication centers around video calls, group chat, and agile planning tools.\n\nOur salary range for Engineering hires begins at $100,000 and we value talent and experience. Everyone participates in a company performance-based bonus plan. MaxMind offers a $2,000 professional development budget and five days for professional development annually.\n\nIn addition to medical, dental, and vision coverage, we offer several other benefits in the US, including a 401k with employer contribution, Health Savings Account, Limited Purpose Flexible Spending Account, paid parental leave, and a public transit reimbursement. Please inquire about benefits in Canada.\n\n\nDiversity and Inclusion\n\nWe're committed to diversity and inclusion and are mindful of incorporating them into all aspects of our company. New ideas and perspectives come from diverse ways of seeing and thinking. MaxMind is sensitive to all individuals and viewpoints and believes everyone’s contributions and opinions are valuable assets to our team.\n\nWe hold regular diversity and inclusion meetings and conduct informative sessions on improving collaboration and communication. We value bringing individuals and different perspectives together within and across every department.\n\nWe encourage and sincerely welcome applications from candidates of color, women, queer candidates, candidates with family caregiving responsibilities, transgender candidates, and from other communities not well represented in the tech world.\n\nIf you have suggestions on how we may better promote or express our commitment to diversity and inclusion, we would love to hear from you. Please send your suggestions to [email protected]

See more jobs at MaxMind

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Sticker Mule


Junior Site Reliability Engineer

verified

Junior Site Reliability Engineer


Sticker Mule


sre

site reliability

aws

google cloud

sre

site reliability

aws

google cloud


👁 2,774 viewed | ✍️ 23 applied (1%)
This job post is archived and the position is probably filled. Please do not apply.
**About Sticker Mule**\n\n\n\nWe created Sticker Mule to be the best place to work and shop. That means making ordering fast, simple and fun while creating a stable, low stress and enjoyable place for talented people to work.\n\n\n\nWe're searching for more to join us as we look to build one of the Internet's best technical teams. Some of our current projects include migrating to a service architecture, inter-service communication with GCloud PubSub and GRPC, API Gateway based GraphQL, event sourcing persistence and CQRS, and manufacturing and artwork processing automation.\n\n\n\n[Watch a brief video to learn more\n\n](https://www.stickermule.com/about)\n\n\n\n\n\n**Why we enjoy working here**\n\n\n\n1. We work flexible hours with an asynchronous culture.\n\n2. We work at a sustainable pace without unreasonable external deadlines.\n\n3. Varied, interesting technical challenges to work on.\n\n4. Opportunities to make a large impact as part of a small, highly motivated team. \n\n\n\n# Responsibilities\n 1. Help design, build and maintain tools to develop, test and deploy services efficiently.\n\n2. Help improving the performance, reliability and security of the Sticker Mule cloud infrastructure.\n\n3. Learn how to implement CI/CD pipelines and debug production services. \n\n# Requirements\n1. You have interest in knowing not only how to write software, but also how to run it at scale.\n\n2. You have a minimum of 1 year of professional software development experience.\n\n3. You’re competent in one general purpose language, like Go, Ruby, or JavaScript.\n\n4. You like Linux, the command line and bash scripts.\n\n5. You have basic experience with one of AWS or Google Cloud.\n\n6. You used logging, monitoring and distributed tracing systems.\n\n7. You possess strong analytical and critical thinking skills.\n\n8. You have great written and verbal communication skills in English.\n\n\n\n\n\nApplicants will be sent a Hackerrank test within 1-3 days of applying. Test must be completed within 5 days.

See more jobs at Sticker Mule

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.


👁 2,519 viewed | ✍️ 22 applied (1%)
This job post is archived and the position is probably filled. Please do not apply.
CircleCI is seeking a Staff Site Reliability Engineer (SRE) to work closely with our Software Engineers to deliver and manage the high-performance and scalable infrastructure underlying our multi-tenant Cloud offering as well as our Server-installed, on-premises solution. You will not only have the chance to automate and optimize infrastructure through the construction of appropriate tooling, but you will help software engineers through the design phase to optimize their services for scale in our production environment.\n \nThe CircleCI SRE team is globally distributed and remote-friendly. We take advantage of multiple timezones to manage a platform for our global customer base.\n \n \nAbout CircleCI\nCircleCI is the best platform for software teams looking to rapidly build quality projects, at scale. Our intelligent continuous integration and delivery tools are simple yet powerful. Our aim is to provide the wisdom of a connected development ecosystem to every team member making technology decisions.\n \nWe run 12M+ builds a month on our platform for companies like Spotify, Kickstarter, Sony, and Coinbase. Over 25,000 organizations and 300,000 developers actively build, test, and deploy on CircleCI.  We’ve raised $59.5M in venture capital from Industry Ventures, Top Tier Capital, Scale Venture Partners, DFJ, Harrison Metal Capital, and Baseline Ventures.\n\nIf you’re interested in joining the team at CircleCI, please send a resumé and let us know why you’d be a great fit for our team. If you contribute to an open source project, write a blog, or have a presence on the web (Twitter, GitHub, LinkedIn, etc.) we would love to hear about it.\n \nWe care deeply about diversity and inclusivity. We’re hiring at all experience levels, and seek talented teammates from a wide variety of backgrounds and experiences who are equally committed to cultivating a work environment of respect and kindness. We carefully consider every applicant that takes the time to apply.\n\n# Responsibilities\n What you will do:\n\nDesign and deliver solutions to improve the availability, scalability, latency, and efficiency of CircleCI’s services.\nEngage in service capacity planning and demand forecasting, anticipating performance bottlenecks\nDiagnose and resolve production issues in conjunction with software engineering teams\nArchitect and implement shared infrastructure used by all services within the CircleCI platform, for both SaaS and on-prem configurations\nSupport and advise software engineering teams in the design of scalable services\nBuild and maintain tools for deployment, monitoring, and debugging\nPlan and execute disaster recovery drills\nParticipate in rotating on-call duties, including incident management\n \n\n# Requirements\nWhat will make you successful:\n\nExperience managing a container-based microservice architecture, including orchestration, service-discovery, monitoring, and debugging\nUnderstanding of standard networking protocols and components such as: TCP/IP, HTTP, DNS, ICMP, the OSI Model, Subnetting, and Load Balancing\nIn-depth knowledge of operating systems (processes, threads, IPC, concurrency, locks, mutexes, semaphores, etc.).\nProficiency in one or more of: C, C++, Java, Python, Go\nComprehensive knowledge of the internal workings of at least one of Postgres, Mongo, Redis\nSystematic problem solving approach, coupled with a strong sense of ownership and drive\nTrack-record of working cooperatively with software engineering teams\nFocus on security in the delivery of all levels of a system\nPassion for modern software development and operation, including agile, CI/CD, and infrastructure-as-code\nDesire to learn and grow\n6+ years of experience

See more jobs at CircleCI

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Stack Overflow


Azure Site Reliability Engineer

verified

Azure Site Reliability Engineer


Stack Overflow


sys admin

engineer

admin

sys admin

engineer

admin


👁 801 viewed | ✍️ 5 applied (1%)
This job post is archived and the position is probably filled. Please do not apply.
\nStack Overflow is growing fast, and our infrastructure needs just keep getting bigger. We’re looking for a Site Reliability Engineer to join our existing team of SREs and devs and help us grow the Microsoft Azure side of our infrastructure. As an SRE, you’ll bring a DevOps mindset to system administration, always looking for ways to automate manual work and create repeatable, scalable systems and processes.\n\nWe’re looking for someone with .NET ecosystem experience in an Azure environment (or general Windows Server / IIS experience) (3+ years), but we don’t expect you to know every other part of our stack coming in, so we’ll pair you with other members of the team to learn and develop your skills across our entire infrastructure (including our non-cloud Stackoverflow.com infrastructure).  We are a mixed Windows and Linux environment and expect this role to be strong in Windows but learn Linux as we move more infrastructure to it.\n\nWhat you’ll work on:\n\n\n* Help one of our newest products, hosted Stack Overflow Enterprise, grow to its first 1,000 customers and million users\n\n* Automate the manual steps remaining in deploying and upgrading Stack Overflow Enterprise customers on Azure\n\n* Work to improve our monitoring and alerting strategy for cloud solutions\n\n* Work to improve our security patching and compliance strategy for cloud solutions\n\n* Participate in creating an VM appliance version of our product\n\n* Participate in our on-call rotation\n\n\n\n\nOur ecosystem includes:\n\n\n* Microsoft Azure (Azure SQL, Microsoft SQL Server, Azure Automation, Azure AD)\n\n* Windows Server 2016 and IIS and .NET Core\n\n* Linux (we use CentOS)\n\n* PowerShell / DSC\n\n* Terraform / Go\n\n* Our toolchain includes: Git, GitHub Enterprise, TeamCity (CI), CentOS Linux, Puppet, .NET/C#, ElasticSearch, Redis, OctopusDeploy\n\n* In the future: Containers and Kubernetes\n\n\n\n\nSkills & Requirements\n\nWe’re looking for:\n\n\n* 3+ years of Windows Server experience (we run 2012R2 and 2016)\n\n* 3+ years of Azure experience or equivalent Amazon AWS, Google Cloud, Digital Ocean, etc.\n\n* PowerShell experience, and a developer’s mindset towards system administration (always looking to automate manual tasks)\n\n* Strong written communication skills and a strong inclination to “document as you go”\n\n* Linux experience in a mixed environment (we use mainly CentOS)\n\n* Some Microsoft SQL Server experience (Azure SQL a plus) or other SQL experience\n\n* Basic familiarity with: Networking, DNS, SSL certificates\n\n\n\n\nWe like to see:\n\n\n* Deep experience with Azure administration, debugging, and API use\n\n* Knowledge of programming beyond scripting (we use mainly C# and Go)\n\n* Experience working both on a team and on independent projects\n\n* Good communication and people skills\n\n\n\n\nWhat you’ll get in return:\n\n\n* Flexible hours\n\n* 20 days paid vacation + holidays\n\n* Completely free health insurance - no copay, no premiums (US residents)\n\n* Generous parental leave (10-16 weeks at 100% pay), family care leave, and unlimited sick days\n\n* Employees will never be poked with a sharp stick\n\n\n\n\nThis is a remote position… (US/Pacific time zone) While we are a remote-first team with team members all over the world, this position requires collaborating with people in Sydney and NYC, therefore you must be located in the US/Pacific or compatible time zone.

See more jobs at Stack Overflow

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

QuadPay


Site Reliability Engineer

Site Reliability Engineer


QuadPay


sys admin

engineer

admin

sys admin

engineer

admin


👁 838 viewed | ✍️ 3 applied (0%)
This job post is archived and the position is probably filled. Please do not apply.
\nQuadPay is an alternative payment provider that allows brands to give their customers the opportunity to split their purchases into 4 interest-free, automatic installments. The customer gets the product straight away and we pay the merchant upfront.\n\nAs a Site Reliability Engineer, youll help us scale a platform that processes millions of dollars of e-commerce transactions every day for some of the biggest e-commerce brands. \nAbout QuadPay\n \nCustomers love us because were a payment solution that suits their lifestyles. For the more than 50% of millennials that dont have a credit card, we help their cash go further with no strings attached, helping to make larger purchases accessible with responsible budgeting and cash flow management . Merchants love us because our platform dramatically increases customer loyalty, AoV, conversion and, most importantly, total sales.\n \nAt QuadPay, our goal is to transform the way shoppers pay for their purchases. We believe in choice and in giving shoppers the freedom and flexibility to manage their purchases in the way that best suits their finances.\nAs an SRE you will:\n * Work with our platform and application engineers to write code that scales, maintains and monitors our infrastructure\n * Automate everything and instrument everything! We believe in infrastructure as code and using data and evidence to guide our decisions\n * Create zero-downtime CI/CD deployment pipelines\n * Build tools to analyze application performance and debug production issues\n * Develop fault-tolerant, highly available and self-healing platforms\n * Participate in a blameless culture which focusses on process and technology\n \n\n\n\nWhile we'd prefer you to join us on-site in our New York office, however we do accept exceptional remote candidates. \n\n\n\nBenefits\n\n * Competitive Salary\n * Employee Share Scheme, which means all employees have a meaningful stake in the business\n * Generous leave entitlements\n * Generous staff referral program\n \n\n\n\n\n\nEQUAL OPPORTUNITY\nQuadPay is an equal opportunity employer. We are actively seeking to create a diverse work environment because teams are stronger with different perspectives and experiences.\nWe value a diverse workplace and encourage women, people of color, LGBTQIA individuals, people with disabilities, members of ethnic minorities, foreign-born residents, older members of society, and others from minority groups and diverse backgrounds to apply. We do not discriminate on the basis of race, gender, religion, color, national origin, sexual orientation, age, marital status, veteran status, or disability status. All employees and contractors of QuadPay are responsible for maintaining a work culture free from discrimination and harassment by treating others with kindness and respect.

See more jobs at QuadPay

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Isos Technology


Atlassian Site Reliability Engineer

Atlassian Site Reliability Engineer


Isos Technology


sys admin

engineer

admin

sys admin

engineer

admin


👁 738 viewed | ✍️ 5 applied (1%)
This job post is archived and the position is probably filled. Please do not apply.
\nAtlassian Site Reliability Engineer\n\nJob description\nIsos Technology is looking for an energetic and personable professional to join our team. The right candidate will be a team player who effectively builds relationships both inside and outside the organization. He/she is dedicated to meeting the expectations and requirements of our clients and the Isos team. This professional works directly with clients for the effective deployment of Isos Technology products and services. This individual is a self-starter who values team input and is willing to give input for the betterment of the team.\n\nIsos Technology is one of the largest Platinum/Enterprise Atlassian Solution Partners in the US. We are headquartered in Tempe, AZ but have offices across the US including Washington, D.C. We work with some of the largest companies in the world helping them to implement and maintain the Atlassian tools.\n\nIn addition to being everything Atlassian we focus heavily on our people and creating a culture that is fun, challenging and rewarding. In fact, we recently finished 6th on the Best Places to Work list and plan on moving up that list!\n\nThis position can be from anywhere in the US, from our Tempe, AZ location to Washington D.C. or anywhere in between. \n\nTechnical Requirements:\nAs an Atlassian Site Reliability Engineer you will function as the first-line of support for technical questions for Atlassian managed service and support engagements. You will have the following technical responsibilities:\n\n\n* Support highly available installations of Atlassian Server and Data Center products from both the user interface and system administration. The products include, but are not limited to: Jira, Confluence, BitBucket, Bamboo and FishEye/Crucible and Crowd.\n\n* Support existing application deployments, including upgrading and patching often via configuration management tools such as Ansible, Puppet, Chef, etc.\n\n* Provide best practices and guidance in both on premise and Cloud-based environments. May include work with CloudFormation or Terraform.\n\n* Maintain Atlassian tools in both Linux and Windows environments.\n\n* Set-up, maintain and tune relational databases, including, but not limited to: PostgreSQL, MySQL, SQL Server and Oracle.\n\n* Work with, configure and integrate with authentication / directory services like Active Directory (LDAP), Crowd, OAuth, Okta and SAML. \n\n* Maintain and use version control systems like Git, Subversion and Mercurial. \n\n* Write system administration scripts in Bash, PowerShell, Python or Ruby.\n\n* Work with and understand complex networking topologies and concepts.\n\n* Interpret, maintain and augment system and network monitoring system.\n\n* Use automated security testing platforms in order to audit and securing web-based applications.\n\n* Perform secondary support for end-user activities in the Atlassian tools, such as: create maintain workflows in Jira, author content in Confluence, create build plans in Bamboo, etc.\n\n\n\n\n\nTechnical Objectives:\nAs an Atlassian Site Reliability Engineer you have the following objectives:\n\n\n* Serve as Atlassian Tools Subject Matter Expert; advise clients on Atlassian, related technologies and Isos Atlassian Services best practices, guidelines and recommendations.\n\n* Isos Atlassian Platinum Partnership is dependent on having certified and accredited resources. You will acquire and maintain the required technical and sales certifications in your first calendar year of employment and every subsequent year thereafter.\n\n* Acquire and maintain at least one non-Atlassian professional accreditation or certification in your first calendar year of employment and every subsequent year thereafter. The certification should be relevant to Isos objectives and may include certifications on technologies or methodologies such as: AWS, Certified Scrum Master, Linux, etc. Isos will sponsor the exams, training and study materials to reach this objective.\n\n* Regularly contribute informative and engaging original content to be used in Isos Atlassian Support and Services marketing efforts in the form of white papers, blog posts and social media status updates.\n\n* Continue to build on Atlassian administration skills and technical skills to reach the same levels of competency as the Delivery team consultants.\n\n\n\n\n\nRequired Skills:\nTo function in the role of Atlassian Site Reliability Engineer, Isos requires you already have the following demonstrable and verifiable skills:\n\n\n* Ability to conduct yourself professionally and within Isos values.\n\n* Ability to effectively manage time and expectations with internal resources and clients.\n\n* Ability to manage multiple clients and issues simultaneously.\n\n* Ability to communicate effectively (both orally and in writing) for technical and business operations audiences .\n\n* Ability to function on a highly collaborative team of fellow engineers as well as front end application admins.\n\n* Minimum 2+ years' experience and/or expert knowledge of working in a technical operational environment that involves:\n\n\n* Configuring and working with various deployment methodologies, including "bare-metal", VM and Cloud (e.g AWS).\n\n* Configuring and scripting for various operating systems with a strong focus on Linux and/or Windows.\n\n* Experience with configuration management tools such as Ansible, Puppet, Chef, etc.\n\n* Working with, configuring and integrating with authentication / directory services like Active Directory (LDAP), Crowd, OAuth, Okta and SAML.\n\n* Supporting and working with relational databases, with a strong focus on MySQL and PostgreSQL.\n\n* Third-party and custom application installation, configuration and/or support.\n\n\n\n\n\n\n\n\n\nPreferred Skills:\n\n\n* Minimum 2+ years' experience and/or expert knowledge of Atlassian Jira and Confluence.\n\n\n* A combination of advanced or end-user knowledge of all other supported Atlassian products.\n\n\n\n\n\n* Minimum 2+ years' experience working in a consulting or customer service role.\n\n\n\n\n\nInterested?\nApply now via the application form. For more information about our organization, visit www.isotech.com.\n\nAgency calls are not appreciated.

See more jobs at Isos Technology

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Amazee


Senior System Engineer

Senior System Engineer


Amazee


sys admin

senior

engineer

admin

sys admin

senior

engineer

admin


👁 907 viewed | ✍️ 4 applied (0%)
This job post is archived and the position is probably filled. Please do not apply.
\namazee.io provides Drupal hosting Platform as a Service for clients worldwide. We excel in providing outstanding support, a local development environment based on Docker that is congruent to our production servers and the possibility to host on any server in any datacenter in the world.\n\nDo you loathe repetitive tasks? Do you spend hours developing automation processes rather than doing the same thing three times? Do you believe in DevOps as a culture? Do you want to be contribute to an open-source project on a daily basis? Then we may have the perfect job for you.\n\namazee.io is looking for a System Engineer to join our tech team. You will be essential to ensure stability and growth of our high-performance hosting environment. We are a lean, open, international and fully remote environment.\n\nOur hosting environment is based on Puppet, Docker and Kubernetes/OpenShift, operates in multiple locations all over the world, is completely open source and currently serves over 250 million hits per month. We are constantly updating and improving the platform and need bright minds who are interested in helping us scale to build the world's best and fastest Drupal hosting environment to join our team.\n\nResponsibilities:\n\n\n* Ensure stability of our hosting environment\n\n* Specify, implement, document, and roll out features/updates\n\n* Provide outstanding support to our clients via Chat and in E-Mail\n\n* Support technical teams by implementing DevOps Principles\n\n* Provide consistently high-quality support to clients for trouble-free deployments of their sites\n\n* Available for frequent “on call” duty\n\n\n\n\nRequirements:\n\n\n* You work during standard office hours in Europe (UTC+0 to UTC+3)\n\n* 3 years+ of System-Engineer/System-Admin experience\n\n* Working knowledge of Linux, Nginx, Varnish, MariaDB/MySQL\n\n* Communicating effectively via cat gifs and emojis\n\n* Experience with OpenShift and/or Kubernetes\n\n* Experience with Docker and Ansible\n\n* Experience with CI Systems like Jenkins\n\n* Experience with high availability and clustered servers\n\n* Basic knowledge of Mac OS X, Windows\n\n* You’ve heard of Drupal and PHP and it doesn’t scare you\n\n* You enjoy talking to clients and you're comfortable doing demos to people you don't know\n\n* You can work independently as well as part of a team\n\n* You are experienced with or are familiar with agile project methodologies and principles\n\n* Ability to analyze problems and keep calm and focused when troubleshooting outages\n\n* You are very good at reading, writing, and communicating in English\n\n\n\n\nBonus Points\n\n\n* Experience with server security and system hardening\n\n* Experience with Node.js or Golang\n\n* Experience with performance optimization and load testing\n\n* You like to share your knowledge by speaking at conferences and meetups\n\n\n\n\nWe offer:\n\n\n* Work in a distributed team of creative professionals in a flat international organization\n\n* Opportunity to build a first-class open-source hosting environment  \n\n* The budget for tech equipment every two years\n\n* An annual budget for education and health & wellness\n\n* Attendance to events and conferences\n\n\n

See more jobs at Amazee

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

Numbrs Personal Finance AG


Site Reliability Engineer

Site Reliability Engineer


Numbrs Personal Finance AG


go

kubernetes

amazon-web-services

docker

go

kubernetes

amazon-web-services

docker


👁 2,663 viewed | ✍️ 15 applied (1%)
This job post is archived and the position is probably filled. Please do not apply.
At Numbrs, our engineers don’t just develop things – we have an impact. We change the way how people are managing their finances by building the best products and services for our users. \n\nNumbrs engineers are innovators, problem-solvers, and hard-workers who are building solutions in big data, mobile technology and much more. We look for professional, highly skilled engineers who evolve, adapt to change and thrive in a fast-paced, value-driven environment.\n\nJoin our dedicated technology team that builds massively scalable systems, designs low latency architecture solutions and leverages machine learning technology to turn financial data into action. Want to push the limit of personal finance management? Join Numbrs.\n\n**Job Description**\nYou will be a part of a team that is responsible for developing, releasing, monitoring and troubleshooting large scale micro-service based distributed systems with high transaction volume. You enjoy learning new things and are passionate about developing new features, maintaining existing code, fixing bugs, and contributing to overall system design. You are a great teammate who thrives in a dynamic environment with rapidly changing priorities.\n\n**All candidates will have**\n* A Bachelor's or higher degree in technical field of study or equivalent work experience\n* Experience with systems for automating deployment, scaling, and management of containerised applications, such as Kubernetes or Mesos\n* A track record in automating operations and building large-scale systems\n* A good understanding of network and routing protocols (TCP/IP, DNS and others)\n* Excellent knowledge with at least one modern programming language, such as Go, Java, C++, Python or Scala\n* Excellent troubleshooting and creative problem-solving abilities\n* Excellent written and oral communication and interpersonal skills\n\n**Ideally, candidates will also have**\n* Experience deploying and supporting big data technologies, such as Kafka, Spark, Storm, Flink and Cassandra\n* Experience implementing, operating, and supporting open source tools for network and security monitoring and management on Linux/Unix platforms\n* Experience with encryption and cryptography standards

See more jobs at Numbrs Personal Finance AG

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.

New Context Services


Site Reliability Engineer

Site Reliability Engineer


New Context Services


sys admin

engineer

admin

sys admin

engineer

admin


👁 809 viewed | ✍️ 4 applied (0%)
This job post is archived and the position is probably filled. Please do not apply.
\nSite Reliability Engineer\nNew Context is a rapidly growing consulting company in the heart of downtown San Francisco. We specialize in Lean Security: an approach that leads organizations to build better software through hands-on technical and management consulting. We are a group of engineers who live and breathe Agile Infrastructure, Systems Automation, Cloud Orchestration, and Information Security.\n\nAs a New Context Site Reliability Engineer you will be expected to provide technical leadership with a hands-on approach. On a daily basis you will be interfacing with our clients and other New Context team members while working from the New Context office, at client sites, or from your home. Expect to heavily leverage open source software to tackle challenges like delivery of highly secured containers, management of IoT devices or building Big Data ecosystems at petabyte scale and beyond.\n\nWho you are:\n\nSeasoned Technical Veteran\n\n\n* 5 - 15+ years work experience in a DevOps, SRE, or Continuous Integration role\n\n* Experience with highly available and high-performance open source web technologies\n\n* Existing familiarity (or the eagerness to learn) Ruby and/or Python is helpful\n\n* Experience acting as a technical lead on technical project\n\n* Experience managing teams preferred\n\n* Experience acting as a technically hands on Project Manager preferred\n\n* Consulting experience preferred\n\n\n\nPossess a working knowledge of:\n\n\n* TCP/IP, firewall policy design, social engineering, intrusion detection, code auditing, forensic analysis\n\n* 5+ years experience with public cloud and automated server provisioning\n\n* Automated tests and their role in software engineering\n\n* Understanding of languages C, Perl, Python, and Ruby (some or all)\n\n* Web App development / deployment\n\n\n\nExcellent communication skills\n\n\n* Experience working with external clients and customers\n\n* Translate complex concepts to business customers\n\n\n\nTeam Player & Independent Thinker\n\n\n* You must be able to think on your feet, communicate constantly and professionally, and above all else meet the expectations of our clients.\n\n* Ability to communicate productively with customers to explain the technical aspects and project status.\n\n\n\nValue Driven & Integrity\n\n\n* At New Context, our core values are Humility, Integrity, Quality & Passion and this is lived by our employees every day!\n\n\n\n\nTechnology we use:\n\nAutomation\n\n\n* Chef, Puppet, Docker, Ansible, Salt, Terraform, Automated Testing\n\n\n\nContainerization Ecosystem\n\n\n* Docker, Mesosphere, Rancher, CoreOS, Kubernetes\n\n\n\nCloud & Virtualization\n\n\n* AWS, Google Compute Engine, OpenStack, Cloudstack, kvm, libvirt\n\n\n\nTools\n\n\n* Jenkins, Atlassian Suite, Pivotal Tracker, Vagrant, Git\n\n\n\nMonitoring\n\n\n* SysDig, Data Dog, AppDynamics, New Relic, Nagios, Zabbix\n\n\n\nDatabases/Datastores\n\n\n* Cassandra, Hadoop, Redis, postgresql, MySQL\n\n\n\nSecurity\n\n\n* Compliance standards, firewalls, scanners, OSSEC\n\n\n\nLanguages\n\n\n* Ruby, Python, GO\n\n\n\nOur Methodologies\n\n\n* Agile, Lean, DevOps, TDD, pair programming\n\n\n

See more jobs at New Context Services

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
Apply for this Job

👉 Please reference you found the job on Remote OK, this helps us get more companies to post here!

When applying for jobs, you should NEVER have to pay to apply. That is a scam! Posts that link to pages with "how to work online" are also scams. Don't use them or pay for them. Also always verify you're actually talking to the company in the job post and not an imposter. Scams in remote work are rampant, be careful! When clicking on the button to apply above, you will leave Remote OK and go to the job application page for that company outside this site. Remote OK accepts no liability or responsibility as a consequence of any reliance upon information on there (external sites) or here.