Staff Machine Learning Engineer, Agentic

Robinhood

Bellevue, WA; Menlo Park, CA

Full Time, Lead

Job Description

Join us in building the future of finance. Our mission is to democratize finance for all. An estimated $124 trillion of assets will be inherited by younger generations in the next two decades. The largest transfer of wealth in human history. If you’re ready to be at the epicenter of this historic cultural and financial shift, keep reading.

About the team + role

We are building an elite team, applying frontier technologies to the world’s biggest financial problems. We’re looking for bold thinkers. Sharp problem-solvers. Builders who are wired to make an impact. Robinhood isn’t a place for complacency, it’s where ambitious people do the best work of their careers. We’re a high-performing, fast-moving team with ethics at the center of everything we do. Expectations are high, and so are the rewards.

The Agentic AI team builds agentic AI systems that power intelligent, reliable customer experiences across Robinhood products. The team focuses on reducing the time to ship agents with fine-tuned models and while doing so enables other teams to build, evaluate, and improve their own agents. You will contribute to a culture grounded in first-principles thinking, high performance, and strong focus on customer outcomes!

As a Staff Machine Learning Engineer (IC6), you will define and uphold the quality bar for agentic systems across the organization. You will design evaluation frameworks, guide model selection, and partner with product, data science, and engineering teams to ensure systems meet clear standards for correctness, safety, latency, and user satisfaction. Your work will shape how agentic systems are built, evaluated, and improved across Robinhood!

This role is based in our Bellevue, WA or Menlo Park, CA office, with in-person attendance expected at least 3 days per week. At Robinhood, we believe in the power of in-person work to accelerate progress, spark innovation, and strengthen community. Our office experience is intentional, energizing, and designed to fully support high-performing teams.

What you’ll do

● Define and implement evaluation frameworks that measure agent performance, including task success, correctness, tool usage reliability, latency, safety, and user satisfaction

● Evaluate frontier and fine-tuned models across quality, latency, cost, and edge cases to determine appropriate use cases

● Par

Required Skills

R, Vue, Machine Learning
