Back to jobs
R

Senior Software Engineer - Robinhood Command Center

Robinhood

Menlo Park, CA0 applicants
Full TimeSenior

Job Description

Join us in building the future of finance. Our mission is to democratize finance for all. An estimated $124 trillion of assets will be inherited by younger generations in the next two decades. The largest transfer of wealth in human history. If you鈥檙e ready to be at the epicenter of this historic cultural and financial shift, keep reading. About the team & role We are building an elite team, applying frontier technologies to the world鈥檚 biggest financial problems. We鈥檙e looking for bold thinkers. Sharp problem-solvers. Builders who are wired to make an impact. Robinhood isn鈥檛 a place for complacency, it鈥檚 where ambitious people do the best work of their careers. We鈥檙e a high-performing, fast-moving team with ethics at the center of everything we do. Expectations are high, and so are the rewards. The Robinhood Command Center (RCC) is a newly formed reliability team that serves as the front line for detecting, coordinating, and mitigating production incidents across Robinhood. As part of Robinhood鈥檚 broader reliability initiative, RCC works closely with product engineering, reliability, observability, infrastructure, and business teams to reduce customer impact and shorten incident duration. As a Senior Engineer, you will be part of the founding RCC team, helping define how Robinhood responds to and learns from incidents at scale. This is a highly visible role focused on incident leadership, operational excellence, and reliability tooling. You will not own product services or core infrastructure, but you will own the processes and tools that enable fast, high-quality incident response. This role is based in our Menlo Park, Califronia office, with in-person attendance expected at least 3 days per week. What you'll do: Serve as a senior technical leader driving the long-term reliability and observability strategy across Robinhood鈥檚 infrastructure Partner closely across many different types of engineers to raise the bar for operational excellence and incident response Lead incident mitigation efforts by coordinating service owners, facilitating time-sensitive decisions like rollbacks, traffic shifts, and maintaining a clear source of truth during active incidents Develop and maintain incident management processes and procedures to ensure timely resolution and minimize customer impact Own incident discovery at the company level by defining and maintaining global dashboards and alerts tied to critical user journeys (CUJs), availability, and business-impact metrics Own and evolve incident response tooling and processes, including education, a

Read original posting

Required Skills

RObservability
R

Robinhood