This job post is closed and the position is probably filled. Please do not apply. Work for Stripe and want to re-open this job? Use the edit link in the email when you posted the job!
As a platform company powering businesses all over the world, Stripe processes payments, runs marketplaces, detects fraud, helps entrepreneurs start an internet business from anywhere in the world. Stripeโs Data Infrastructure Engineers build the platform, tooling, and pipelines that manage that data.\n\nAt Stripe, decisions are driven by data. Because every record in our data warehouse can be vitally important for the businesses that use Stripe, weโre looking for people with a strong background in big data systems to help us build tools to scale while maintaining correct and complete data. Youโll be creating best in class libraries to help our users fully leverage open source frameworks like Spark and Scalding. Youโll be working with a variety of teams, some engineering and some business, to provide tooling and guidance to solve their data needs. Your work will allow teams to move faster, and ultimately help Stripe serve our customers more effectively.\n\n\n\n# Responsibilities\n
**You will:**\n* Create libraries and tooling that make distributed batch computation easy to create and test for all users across Stripe\n* Become expert in and contribute to open source frameworks such as Scalding, Spark to address issues our users at Stripe encounter\n* Create APIs to help teams materialize data models from production services into readily consumable formats for all downstream data consumption\n* Create libraries that enable engineers at Stripe to easily interact with various serialization frameworks (e.g. thrift, bson, protobuf)\n* Create observability tooling to help our users easily debug, understand, and tune their Spark / Scalding jobs\n* Leveraging batch computation frameworks and our workflow management platform (Airflow) to assist other teams in building out their data pipelines\n* Own and evolve the most critical upstream datasets \n\n# Requirements\n**Weโre looking for someone who has:**\n* A strong engineering background and are interested in data. Youโll be writing production Scala and Python code.\n* Experience developing and maintaining distributed systems built with open source tools.\n* Experience building libraries and tooling that provide beautiful abstractions to users\n* Experience optimizing the end-to-end performance of distributed systems.\n* Experience in writing and debugging ETL jobs using a distributed data framework (Spark/Hadoop MapReduce etcโฆ)\n\n**Nice to haves:**\n* Experience with Scala\n* Experience with Spark or Scalding\n* Experience with Airflow or other similar scheduling tools\n* Itโs not expected that youโll have deep expertise in every dimension above, but you should be interested in learning any of the areas that are less familiar.\n\n**Some things you might work on:**\n* Create libraries that enable engineers at Stripe to easily interact with various serialization frameworks (e.g. thrift, bson, protobuf)\n* Write a unified user data model that gives a complete view of our users across a varied set of products like Stripe Connect and Stripe Atlas\n* Continuing to lower the latency and bridge the gap between our production systems and our data warehouse by rethinking and optimizing our core data pipeline jobs\n* Pair with user teams to optimize and rewrite business critical batch processing jobs in Spark\n* Create robust and easy to use unit testing infrastructure for batch processing pipelines\n* Build a framework and tools to re-architect data pipelines to run more incrementally\n \n\nPlease mention the words **PREDICT TOILET MUFFIN** when applying to show you read the job post completely (#RMzUuMTczLjE3OC42MA==). This is a feature to avoid spam applicants. Companies can search these words to find applicants that read this and see they're human.\n\n \n\n#Salary and compensation\n
No salary data published by company so we estimated salary based on similar jobs related to Finance and Engineer jobs that are similar:\n\n
$75,000 — $120,000/year\n
\n\n#Location\nNorth America
# How do you apply?\n\nThis job post has been closed by the poster, which means they probably have enough applicants now. Please do not apply.