Senior Data Engineer
Creative Commons is working on a project to index every piece of openly licensed content online and make it searchable and discoverable through CC Search and the CC Catalog API. The Senior Data Engineer reports to the Director of Engineering and is responsible for building and maintaining the data that powers those products (the CC Catalog). This project will unite billions of records for openly licensed and public domain works and metadata, across multiple platforms, diverse media types, and a variety of user communities and partners. All the code we write is open source and we’re a 100% remote team.

Primary responsibilities

* Architect, build, and maintain the existing CC Catalog, including:
  * Ingesting content from new and existing sources of CC-licensed and public domain works.
  * Scaling the catalog to support billions of records and various media types.
  * Implementing resilient, distributed data solutions that operate robustly at web scale.
  * Automating data pipelines and workflows.
  * Collaborating with the Backend Software Engineer and Front End Engineer to support the smooth operation of the CC Catalog API and CC Search.
* Augment and improve the metadata associated with content indexed into the catalog using one or more of the following: machine learning, computer vision, OCR, data analysis, web crawling/scraping.
* Build an open source community around the CC Catalog, including:
  * Restructuring the code and workflows so that community contributors can identify new sources of content and add new data to the catalog.
  * Guiding new contributors and potentially participating in projects such as Google Summer of Code as a mentor.
  * Writing blog posts, maintaining documentation, reviewing pull requests, and responding to issues from the community.
* Collaborate with other outside communities, companies, and institutions to further Creative Commons’ mission.

Qualifications and requirements

* Demonstrated experience building and deploying large-scale data services, including database design and modeling, ETL processing, and performance optimization
* Proficiency with Python
* Proficiency with Apache Spark or similar tools
* Experience with cloud computing platforms (AWS or similar)
* Experience with Apache Airflow or other workflow management software
* Experience with machine learning or interest in picking it up
* Fluent in English
* Excellent written and verbal communication skills
* Ability to work independently, build good working relationships, and actively communicate, contribute, and speak up in a remote work structure
* Curiosity and a desire to keep learning
* Commitment to consumer privacy and security
* Nice to have (but not required):
  * Experience with contributing to or maintaining open source software
  * Experience with web crawling
  * Experience with Docker

Diversity & inclusion

We believe that diverse teams build better organizations and better services. Applications from qualified candidates from all backgrounds, including those from under-represented communities, are very welcome. Creative Commons works openly as part of a global community, guided by collaboratively developed codes of conduct and anti-harassment policies.

Work environment and location

Creative Commons is a fully distributed organization; we have no central office. This position is in a remote working environment and you can be anywhere in the world as long as you’re available for meetings between 2 PM and 8 PM UTC. You must have reasonable mobility for travel to twice-annual all-staff meetings and the CC Global Summit (a total of 3 trips per year). We provide a subsidy towards high-speed broadband access. Laptop/desktop computer and necessary resources are supplied.

Salary and benefits

Creative Commons is a leading non-profit employer, offering competitive salaries and benefits, including health and wellness plans, annual retirement contributions, and a positive, supportive work environment. The salary range for this position is $100,000 to $120,000 USD, commensurate with relevant skills, experience, and location.

How to apply

Please email your cover letter and resume as a single PDF to “[email protected]” with the subject line “Data Engineer / [Last Name].” Phone calls and messages will not be returned.