Remote Spark Jobs Open Startup
RSS
API
Remote HealthPost a job

find a remote job
work from anywhere

Browse 4+ Remote Spark Jobs in April 2021 at companies like Shopify, Andalus and working as a Senior Data Engineer, Staff Software Developer Data Platform or Senior Data Scientist. Last post

Join 91,988+ people and get a  email of all new remote Spark jobs

Subscribe
×

  Jobs

  People

πŸ‘‰ Hiring for a remote Spark position?

Post a job
on the πŸ† #1 remote jobs board
Remote Health by SafetyWing
Global health insurance for freelancers & remote workers
Remote Health by SafetyWing
Global health insurance for freelancers & remote workers
Advertise here

This week's remote Spark jobs

Shopify


verified
United States, Canada

Senior Data Scientist


Shopify

United States, Canada

remote data science role

 

senior data scientist

 

data science

 

microsoft azure

 

remote data science role

 

senior data scientist

 

data science

 

microsoft azure

 
**Company Description**\n\nShopify is now permanently remote and working towards a future that is digital by default. Learn more about what this can mean for you.\n\nAt Shopify, we build products that help entrepreneurs around the world start and grow their business. We’re the world’s fastest growing commerce platform with over 1 million merchants in more than 175 different countries, with solutions from point-of-sale and online commerce to financial, shipping logistics and marketing.\n\n**Job Description**\n\nData is a crucial part of Shopify’s mission to make commerce better for everyone. We organize and interpret petabytes of data to provide solutions for our merchants and stakeholders across the organization. From pipelines and schema design to machine learning products and decision support, data science at Shopify is a diverse role with many opportunities to positively impact our success. \n\nOur Data Scientists focus on pushing products and the business forward, with a focus on solving important problems rather than specific tools. We are looking for talented data scientists to help us better understand our merchants and buyers so we can help them on their journey.\n\n**Responsibilities:**\n\n* Proactively identify and champion projects that solve complex problems across multiple domains\n* Partner closely with product, engineering and other business leaders to influence product and program decisions with data\n* Apply specialized skills and fundamental data science methods (e.g. regression, survival analysis, segmentation, experimentation, and machine learning when needed) to inform improvements to our business\n* Design and implement end-to-end data pipelines: work closely with stakeholders to build instrumentation and define dimensional models, tables or schemas that support business processes\n* Build actionable KPIs, production-quality dashboards, informative deep dives, and scalable data products\n* Influence leadership to drive more data-informed decisions\n* Define and advance best practices within data science and product teams\n\n**Qualifications**\n\n* 4-6 years of commercial experience as a Data Scientist solving high impact business problems\n* Extensive experience with Python and software engineering fundamentals\n* Experience with applied statistics and quantitative modelling (e.g. regression, survival analysis, segmentation, experimentation, and machine learning when needed)\n* Demonstrated ability to translate analytical insights into clear recommendations and effectively communicate them to technical and non-technical stakeholders\n* Curiosity about the problem domain and an analytical approach\n* Strong sense of ownership and growth mindset\n \n**Experience with one or more:**\n\n* Deep understanding of advanced SQL techniques\n* Expertise with statistical techniques and their applications in business\n* Masterful data storytelling and strategic thinking\n* Deep understanding of dimensional modelling and scaling ETL pipelines\n* Experience launching productionized machine learning models at scale\n* Extensive domain experience in e-commerce, marketing or SaaS\n\n**Additional information**\n\nAt Shopify, we are committed to building and fostering an environment where our employees feel included, valued, and heard. Our belief is that a strong commitment to diversity and inclusion enables us to truly make commerce better for everyone. We strongly encourage applications from Indigenous people, racialized people, people with disabilities, people from gender and sexually diverse communities and/or people with intersectional identities. Please take a look at our 2019 Sustainability Report to learn more about Shopify's commitments.\n\n#Location\nUnited States, Canada


See more jobs at Shopify

# How do you apply?\n\n Click here to apply => https://smrtr.io/5njyK
Apply for this position

Shopify


verified
United States, Canada

Staff Software Developer Data Platform


Shopify

United States, Canada

staff software developer

 

data platform engineering

 

data engineering

 

spark

 

staff software developer

 

data platform engineering

 

data engineering

 

spark

 
**Company Description**\n\nShopify is the leading omni-channel commerce platform. Merchants use Shopify to design, set up, and manage their stores across multiple sales channels, including mobile, web, social media, marketplaces, brick-and-mortar locations, and pop-up shops. The platform also provides merchants with a powerful back-office and a single view of their business, from payments to shipping. The Shopify platform was engineered for reliability and scale, making enterprise-level technology available to businesses of all sizes. \n\n**Job Description**\n\nOur Data Platform Engineering group builds and maintains the platform that delivers accessible data to power decision-making at Shopify for over a million merchants. We’re hiring high-impact developers across teams:\n\n* The Engine group organizes all merchant and Shopify data into our data lake in highly-optimized formats for fast query processing, and maintaining the security and quality of our datasets.\n* The Analytics group leverages the Engine primitives to build and deliver simple and useful products that power scalable transformation of data at Shopify in batch, streaming, or for machine learning. This group is focused on making it really simple for our users to answer three questions: What happened in the past? What is happening now? And, what will happen in the future? \n* The Data Experiences group builds end-user experiences for experimentation, data discovery, and business intelligence reporting.\n* The Reliability group operates the data platform in a consistent and reliable manner. They build tools for other teams on Data Platform to leverage and encourage consistency as they champion reliability across the platform.\n\n**Qualifications**\n\n* An experienced technical leader with a proven track record of delivering impactful results.\n* Technical engineering background in one or more areas in the next section.\n* Experience with technical mentoring, coaching, and improving the technical output of the people around you.\n* Exceptional communication skills and ability to translate technical concepts into easy to understand language for our stakeholders. \n* Excitement for working with a remote team; you value collaborating on problems, asking questions, delivering feedback, and supporting others in their goals whether they are in your vicinity or entire cities apart.\n\n**A Staff Data Developer would typically have 6-10 years of experience in one or more of the following areas:**\n\n* Experience with the internals of a distributed compute engine (Spark, Presto, DBT, or Flink/Beam)\n* Experience in query optimization, resource allocation and management, and data lake performance (Presto, SQL)\n* Experience with cloud infrastructure (Google Cloud, Kubernetes, Terraform\n* Experience with security products and methods (Apache Ranger, Apache Knox, OAuth, IAM, Kerberos)\n* Experience deploying and scaling ML solutions using open-source frameworks (MLFlow, TFX, H2O, etc.)\n* Experience building full-stack applications (Ruby/Rails, React, TypeScript)\n* Background and practical experience in statistics and/or computational mathematics (Bayesian and Frequentist approaches, NumPy, PyMC3, etc.)\n* Modern Big-Data storage technologies (Iceberg, Hudi, Delta)\n\n**Additional information**\nAt Shopify, we are committed to building and fostering an environment where our employees feel included, valued, and heard. Our belief is that a strong commitment to diversity and inclusion enables us to truly make commerce better for everyone. We strongly encourage applications from Indigenous people, racialized people, people with disabilities, people from gender and sexually diverse communities and/or people with intersectional identities.\n\n#Location\nUnited States, Canada


See more jobs at Shopify

# How do you apply?\n\n Click here to apply: https://smrtr.io/5kR_7
Apply for this position

Shopify


verified
Canada, United States

Senior Data Engineer


Shopify

Canada, United States

senior data engineer

 

data engineering

 

data platform engineering

 

spark

 

senior data engineer

 

data engineering

 

data platform engineering

 

spark

 
**Company Description**\n\nShopify is the leading omni-channel commerce platform. Merchants use Shopify to design, set up, and manage their stores across multiple sales channels, including mobile, web, social media, marketplaces, brick-and-mortar locations, and pop-up shops. The platform also provides merchants with a powerful back-office and a single view of their business, from payments to shipping. The Shopify platform was engineered for reliability and scale, making enterprise-level technology available to businesses of all sizes. \n\n**Job Description**\n\nOur Data Platform Engineering group builds and maintains the platform that delivers accessible data to power decision-making at Shopify for over a million merchants. We’re hiring high-impact developers across teams:\n\n* The Engine group organizes all merchant and Shopify data into our data lake in highly-optimized formats for fast query processing, and maintaining the security + quality of our datasets.\n* The Analytics group builds products that leverage the Engine primitives to deliver simple and useful products that power scalable transformation of data at Shopify in batch, or streaming, or for machine learning. This group is focused on making it really simple for our users to answer three questions: What happened in the past? What is happening now? And, what will happen in the future? \n* The Data Experiences group builds end-user experiences for experimentation, data discovery, and business intelligence reporting.\n* The Reliability group operates the data platform efficiently in a consistent and reliable manner. They build tools for other teams at Data Platform to leverage to encourage consistency and they champion reliability across the platform.\n\n**Qualifications**\n\nWhile our teams value specialized skills, they've also got a lot in common. We're looking for a(n): \n\n* High-energy self-starter with experience and passion for data and big data scale processing. You enjoy working in fast-paced environments and love making an impact. \n* Exceptional communicator with the ability to translate technical concepts into easy to understand language for our stakeholders. \n* Excitement for working with a remote team; you value collaborating on problems, asking questions, delivering feedback, and supporting others in their goals whether they are in your vicinity or entire cities apart.\n* Solid software engineer: experienced in building and maintaining systems at scale.\n\n**A Senior Data Developer at Shopify typically has 4-6 years of experience in one or more of the following areas:**\n\n* Working with the internals of a distributed compute engine (Spark, Presto, DBT, or Flink/Beam)\n* Query optimization, resource allocation and management, and data lake performance (Presto, SQL) \n* Cloud infrastructure (Google Cloud, Kubernetes, Terraform)\n* Security products and methods (Apache Ranger, Apache Knox, OAuth, IAM, Kerberos)\n* Deploying and scaling ML solutions using open-source frameworks (MLFlow, TFX, H2O, etc.)\n* Building full-stack applications (Ruby/Rails, React, TypeScript)\n* Background and practical experience in statistics and/or computational mathematics (Bayesian and Frequentist approaches, NumPy, PyMC3, etc.)\n* Modern Big-Data storage technologies (Iceberg, Hudi, Delta)\n\n**Additional information**\n\nAt Shopify, we are committed to building and fostering an environment where our employees feel included, valued, and heard. Our belief is that a strong commitment to diversity and inclusion enables us to truly make commerce better for everyone. We strongly encourage applications from Indigenous people, racialized people, people with disabilities, people from gender and sexually diverse communities and/or people with intersectional identities.\n\n\n\n#Location\nCanada, United States


See more jobs at Shopify

# How do you apply?\n\n Click here to apply: https://smrtr.io/5kRRR
Apply for this position

Previous remote Spark jobs

Andalus


closed
🌏 Worldwide

Big Data Systems Engineer


Andalus

🌏 Worldwide

kubernetes

 

spark

 

infrastructure

 

big data

 

kubernetes

 

spark

 

infrastructure

 

big data

 
This job post is closed and the position is probably filled. Please do not apply.
Andalus is a start-up aiming to utilize the latest technologies to help clients solve their data needs. We are hackers by nature and like to think of innovative solutions to solving issues that have long been considered status-quo. We aim to empower our clients with state-of-art infrastructure components allowing them to be a data-enabled organization.\n\nYou will be working first-hand on developing the first major release of our Andalus platform targeted to empower important organizations in the MENA region.\n\n\n**Responsibilities:\n**\n* Design, build, and maintain the core data computation infrastructure used by Andalus\n* Debug issues across services and levels of the stack\n* Think and suggest the right hardware specs and components that would run well with the developed software and adhere to clients’ requirements \n* Develop systems that proactively capture the health status of various software and hardware components.\n* Build a great customer experience for people using your infrastructure\n\n\n**Qualifications:\n**\n* Familiar with the latest computation and storage architecture and components\n* Experience with managing distributed systems within computing clusters\n* Experience in configuration and maintenance of applications such as web servers, load balancers, relational databases, storage systems and messaging systems.\n* Knowledge of system design concepts\n* Ability to debug complex problems across the whole stack\n* Demonstrated understanding of container networking and security\n* Comfort working with network protocols, proxies and load balancers\n* Experience building highly available services \n* Experience with Kubernetes or other container orchestration systems\n* Technical writing skills\n* Interest in or experience with systems languages, such as Go\n* Strong communication skills and willingness to engage with your teammates in group problem-solving in a remote-work environment\n\n\n**Preferred Qualifications:\n**\n* Knowledge of data governance tooling and principles\n* Working experience in organizations with large databases\n* Dealt with data integrity and cleaning issues in the past\n* 2+ years of experience as a tech lead\n\n\n#Location\n🌏 Worldwide


See more jobs at Andalus

# How do you apply?\n\n This job post is older than 30 days and the position is probably filled. Try applying to jobs posted recently instead.
119ms