Software Engineer - Data Movement Platform
Software engineer building and maintaining scalable data movement infrastructure using Spark, Flink, and Airflow to support ML and analytics workloads processing 100B+ daily events.
Responsibilities
- Create and improve scalable, fault tolerant, self-serve data systems in collaboration with teammates and partner teams
- Refine and maintain data infrastructure technologies supporting ML and analytics workflows on data from hundreds of millions of users
- Own the Data Movement Platform enabling batch and stream data processing at Reddit
- Build new infrastructure for Spark, Flink, and Airflow technologies, including contributing to open source as needed
- Build automated solutions to minimize toil for data users and provide a declarative, self-service experience
- Share on-call responsibilities and support monitoring/alerting to improve reliability, scalability, latency, and efficiency of the Data Platform
Requirements
- 2+ years of software engineering experience in a production setting writing clean, maintainable, and well-tested code
- Proficient in object-oriented programming languages like Python, Scala, Go, or Java
- Demonstrated expertise in designing and implementing large-scale systems with proactive leadership as a self-starter
- Experience working with cloud services, Airflow, Kubernetes, CI/CD, Spark, Flink and/or modern cloud-based infrastructure
- Excellent communication skills for effective collaboration within a service-oriented team and broader organizational context
Nice-to-Haves
- Experience contributing to open source projects
- Background in data platform technologies and large-scale data processing systems
Software Engineer, Data Platform
Build and maintain data infrastructure processing petabytes of data. Own end-to-end projects for data ingestion, transformation, and serving systems. Requires 3+ years of software engineering experience.
Staff Analytics Engineer
Design and maintain a robust business data layer in dbt to enable trusted GTM sales analytics, reporting, data science, and AI capabilities. Requires 8+ years in analytics engineering with advanced SQL and dbt expertise.
Data Engineer
Own and extend customer data ingestion platform and large-scale pipelines powering AI workers. Build data lake, retrieval layer, and infrastructure for syncing, enriching, and querying customer data across CRMs and third-party systems.
Staff Software Engineer, Data Platform
Staff Software Engineer building and scaling high-volume, low-latency distributed data platform services and analytics infrastructure using Java, Kinesis, Flink, Snowflake, and Kubernetes. Requires 8+ years experience and U.S. Person status for FedRAMP access.