Who we are
At CarGurus (NASDAQ: CARG), our mission is to give people the power to reach their destination. We started as a small team of developers determined to bring trust and transparency to car shopping. Since then, our history of innovation and go-to-market acceleration has driven industry-leading growth. In fact, we're the largest and fastest-growing automotive marketplace, and we've been profitable for over 15 years.
What we do
The market is evolving, and we are too, moving the entire automotive journey online and guiding our customers through every step. That includes everything from the sale of an old car to the financing, purchase, and delivery of a new one. Today, tens of millions of consumers visit CarGurus.com each month, and ~30,000 dealerships use our products. But they're not the only ones who love CarGurus-our employees do, too. We have a people-first culture that fosters kindness, collaboration, and innovation, and empowers our Gurus with tools to fuel their career growth. Disrupting a trillion-dollar industry requires fresh and diverse perspectives. Come join us for the ride!
Role Overview
As a Site Reliability Engineering (SRE) Manager at CarGurus, you will lead the Site Reliability and Observability team, ensuring our platform services and infrastructure are reliable, scalable, and continuously improving. This role is responsible for both technical guidance and people leadership-championing incident response, driving platform modernization initiatives, and fostering a culture of operational excellence and collaboration across the engineering organization. You will play a key role in shaping the reliability posture of CarGurus' core services as we continue to innovate and scale in a fast-paced environment.
What You'll Do
- Maintain and optimize on-call rotations for the SRE team and partner engineering platform teams, with a focus on high-priority services
- Oversee training, onboarding, and upskilling of engineers joining SRE on-call; ensure effective incident handling, documentation, and knowledge transfer
- Drive continual improvement in incident response processes, covering escalation, communication, postmortem actions, documentation, and success metrics
- Lead regular and ad hoc capacity planning with Agile development practices; prioritize projects, and allocate resources for critical initiatives
- Coordinate with partner teams for seamless project handoffs, dependency management, and coordinated rollouts
- Manage the selection, implementation, and enablement of monitoring and reliability tools
- Ensure on-call automation, alert routing, hygiene, and integrations (Slack, OpsGenie, IRM) are maintained and documented
- Lead disaster recovery and business continuity initiatives (scenario planning, documentation, and institutional knowledge transfer)
- Document ongoing stakeholder engagement and cross-team responsibilities, including external communications, and maintenance
`
What You'll Bring
- 6+ years of relevant experience in site reliability, 2+ years managing a site reliability team
- Bachelor's Degree in Computer Science or related field, or equivalent work experience
- Strong, proven background in site reliability engineering, software engineering, and production operations
- Hands-on experience with incident management, observability systems, automation, cloud infrastructure, and Kubernetes, front and backend systems
- Demonstrated ability to lead technical teams, foster cross-functional collaboration, and mentor engineers
- Excellent written and verbal communication skills, and experience coordinating initiatives across multiple teams
- Ability to drive continuous improvement and manage complex, cross-cutting projects in a highly dynamic environment
- Operational experience supporting high-availability, revenue-critical services at scale
Working at CarGurus
We reward our Gurus' curiosity and passion with best-in-class benefits and compensation, including equity for all employees, both when they start and as they continue to grow with us. Our career development and corporate giving programs, as well as our employee resource groups (ERGs) and communities, help people build connections while making an impact in personally meaningful ways. A flexible hybrid model and robust time off policies encourage work-life balance and individual well-being. Thoughtful perks like daily free lunch, a new car discount, meditation and fitness apps, commuting cost coverage, and more help our people create space for what matters most in their personal and professional lives.
We welcome all
CarGurus strives to be a place to which people can bring the ultimate expression of themselves and their potential-starting with our hiring process. We do not discriminate based on race, color, religion, national origin, age, sex, marital status, ancestry, physical or mental disability, veteran status, gender identity, or sexual orientation. We foster an inclusive environment that values people for their skills, experiences, and unique perspectives. That's why we hope you'll apply even if you don't check every box listed in the job description. We also encourage you to tell your recruiter if you require accommodations to participate in our hiring process due to a disability so we can provide the appropriate support. We want to know what only you can bring to CarGurus. #LI-Hybrid


