Software Architect, Reliability Engineering

Twilio

Remote - US
Full Time, Lead

Job Description

Who we are

At Twilio, we’re shaping the future of communications, all from the comfort of our homes. We deliver innovative solutions to hundreds of thousands of businesses and empower millions of developers worldwide to craft personalized customer experiences. Our dedication to remote-first work and strong culture of connection and global inclusion mean that no matter your location, you’re part of a vibrant team with diverse experiences making a global impact each day. As we continue to revolutionize how the world interacts, we’re acquiring new skills and experiences that make work feel truly rewarding. Your career at Twilio is in your hands.

We use Artificial Intelligence (AI) to help make our hiring process efficient. That said, every hiring decision is made by real Twilions!

See yourself at Twilio

Join the team as Twilio’s next Reliability Architect.

About the job

As an Architect in SRE, you will drive the technical strategy, vision, and outcomes for Twilio’s Reliability Engineering organization. You will define and lead solutions and initiatives that ensure Twilio products are reliable worldwide, and you will define standards and guide engineering teams on best practices for designing, building, and operating resilient systems. This role is pivotal to Twilio’s commitment to operational excellence, scalability, and pragmatic, large-scale systems design in the cloud.

Responsibilities

In this role, you’ll:

Partner with senior technical leaders across Twilio to set and communicate the reliability strategy, translating business goals into measurable outcomes. Influence company-wide architectural decisions while balancing long-term vision with near-term and compliance needs.

Lead the design, implementation, and operation of scalable solutions and paved roads that enable reliable, high-traffic services. Influence company-wide architectural decisions to focus on availability, performance, resilience, and cost efficiency using Kubernetes, AWS, Terraform, and modern observability.

Ensure integrity and quality across the service lifecycle: design fault-tolerant architectures, incident response, disaster recovery, and capacity/cost management.

Collaborate with product and cross-functional teams to identify reliability risks and convert them into actionable designs, programs, and tooling.

Establish and champion reliability practices and drive

Required Skills

Go, Scala, R, AWS, Kubernetes, Terraform, SRE, Observability