Software Engineer, Data Infrastructure

USA

Posted 12 days ago

Job Type

Full Time

Salary

$164k - $230k

Skills

Python

Scala

Summary

  • Mission/Vision: Empower communities by building innovative data infrastructure solutions.

  • Key Responsibilities: Develop and maintain Reddit's data infrastructure, manage BigQuery and Airflow, and create automation for data quality and governance.

  • Growth Opportunities: Engage in mentorship, work with large-scale systems, and participate in the complete data lifecycle.

Description

Reddit is a community of communities. It’s built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. With 100,000+ active communities and 82M+ daily active unique visitors, Reddit is one of the internet’s largest sources of information. For more information, visit .

The Data Infrastructure team is looking to hire a Software Engineer who is excited to work with production-facing data tools and support a growing business model for Reddit.

Our team sits at the center of Reddit, building self-service solutions that empower data science, ML, and engineering teams to produce and consume data from a petabyte-scale warehouse. In this space, we focus on higher-level orchestration of data tools that support the entire company, and on implementing mechanisms that allow customers to safely interact with data. Our current focus areas include:

  • Managing BigQuery + Airflow infrastructure for the entire company

  • Building opinionated guardrails to drive improvements in data quality, cost efficiency, and data governance

  • APIs and controllers that support our IAM, compute, and storage patterns

  • Software automation that connects our data services and surfaces metadata to downstream customers for discovery and data contract enforcement

  • Monitoring/alerting for our core systems and the mechanisms built on top

If you have a passion for building and maintaining high-quality code, want to improve how Reddit makes strategic decisions at the company level, and are excited about applying engineering best practices to one of the most powerful corpora of data in the world, then this is the team for you!

In your day-to-day, you can expect to:

  • Collaborate effectively with a team of proficient software engineers to develop and maintain the fundamental platform that powers Reddit's cutting-edge data warehouse infrastructure

  • Engage in the complete data lifecycle at Reddit, participating in the development process and working with one of the world's most extensive and richest datasets

  • Design, build, and deliver end-to-end data solutions to improve the reliability, scalability, latency, and efficiency of Reddit’s Data Platform

  • Implement automation for key elements of the development process, including data quality, alert management, and critical infrastructure operations

  • Share on-call responsibilities, including incident management, with the Data Warehouse team

  • Mentor fellow engineers on the team and contribute to knowledge sharing through training sessions and comprehensive documentation

Who you might be:

  • 4+ years of software engineering experience in a production setting writing clean, maintainable, and well-tested code

  • Proficiency in object-oriented programming languages such as Python and Scala, with expertise in SQL dialects such as BigQuery, SparkSQL, or Postgres

  • Demonstrated expertise in designing and implementing large-scale systems, diligently monitoring project progress, and showcasing proactive leadership as a self-starter on diverse projects

  • Experience working with cloud services, Terraform, Airflow, Kubernetes, CI/CD, and modern cloud-based infrastructure

  • Excellent communication skills tailored for effective collaboration within both a service-oriented team and the broader organizational context

Benefits:

  • Comprehensive Healthcare Benefits

  • 401k Matching

  • Workspace benefits for your home office

  • Personal & Professional development funds

  • Family Planning Support

  • Flexible Vacation (please use it!) & Reddit Global Wellness Days

  • 4+ months paid Parental Leave

  • Paid Volunteer time off

#LI-remote, #LI-JS5

Pay Transparency:

This job posting may span more than one career level.

In addition to base salary, this job is eligible to receive equity in the form of restricted stock units, and depending on the position offered, it may also be eligible to receive a commission. Additionally, Reddit offers a wide range of benefits to U.S.-based employees, including medical, dental, and vision insurance, 401(k) program with employer match, generous time off for vacation, and parental leave. To learn more, please visit .

To provide greater transparency to candidates, we share base pay ranges for all US-based job postings regardless of state. We set standard base pay ranges for all roles based on function, level, and country location, benchmarked against similar-stage growth companies. Final offer amounts are determined by multiple factors, including skills, depth of work experience, and relevant licenses/credentials, and may vary from the amounts listed below.

The base pay range for this position is:

$164,200—$229,900 USD

Reddit is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, please contact us at .

Perks

Healthcare benefits

401(k) Match

Stock Options

Paid Leave

Parental leave