Staff Site Reliability Engineer, Core AI Infrastructure

🇺🇸 Coinbase

Remote - USA | Remote
Full Time | Lead

Job Description

Ready to be pushed beyond what you think you’re capable of? At Coinbase, our mission is to increase economic freedom in the world. It’s a massive, ambitious opportunity that demands the best of us, every day, as we build the emerging onchain platform and, with it, the future global financial system.

To achieve our mission, we’re seeking a very specific candidate. We want someone who is passionate about our mission and who believes in the power of crypto and blockchain technology to update the financial system. We want someone who is eager to leave their mark on the world, who relishes the pressure and privilege of working with high-caliber colleagues, and who actively seeks feedback to keep leveling up. We want someone who will run towards, not away from, solving the company’s hardest problems. Our work culture is intense and isn’t for everyone. But if you want to build the future alongside others who excel in their disciplines and expect the same from you, there’s no better place to be.

While many roles at Coinbase are remote-first, we are not remote-only. In-person participation is required throughout the year. Team and company-wide offsites are held multiple times annually to foster collaboration, connection, and alignment. Attendance is expected and fully supported.

What you’ll be doing (i.e., job duties):

AI-Driven Innovation: Join a high-performing team of skilled engineers driving AI transformation at Coinbase. This role involves leading the development of scalable AI products with direct exposure to high-level executives, focusing on rapid ideation, execution, and delivering impactful solutions in a dynamic, incubator-style environment.
Partner with the Coinbase Infrastructure team to support and extend existing CI/CD frameworks for IT services, including enterprise network platforms
Partner with security and compliance to build surveillance tooling into deployment pipelines
Design and implement automation to streamline overall operational IT support workflows
Action Kubernetes deployment, implementation, and support
Build a technological roadmap based on product requirements
Participate in on-call to support the AWS service deployment pipeline
Promote DevSecOps mentality and establish best practices to ensure top-tier cloud security
Set and maintain a standard of excellence for technical documentation across IT engineering
Participate in an operational environment with strict SLAs and managed incident response and disaster recovery strategies
Facilitate incident response, conduct root cause analysis and blameless retrospectives
Define metrics and design/implement automation opportunities based on monitoring/observability
Developing and maintai

Required Skills

Scala, R, AWS, Kubernetes, CI/CD, Observability