Site Reliability Engineer (Mid / Senior) - Platform Infrastructure
Elastic
Job Description
Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale — unleashing the potential of businesses and people. The Elastic Search AI Platform, used by more than 50% of the Fortune 500, brings together the precision of search and the intelligence of AI to enable everyone to accelerate the results that matter. By taking advantage of all structured and unstructured data — securing and protecting private information more effectively — Elastic’s complete, cloud-based solutions for search, security, and observability help organizations deliver on the promise of AI. What is The Role: The Infrastructure team sits at the foundation of the Elastic product stack, enabling engineering efforts across the entire company. We are software developers who specialize in managing state — CI pipelines, cloud resources, and cross-team integrations. The Elastic Stack is built on our infrastructure, and we own everything needed to get it there. Beyond that, we act as internal software consultants, putting our own products to work for Elastic's business. What You Will Be Doing: Design and develop tooling that facilitates building, testing, and shipping the Elastic Stack Build and operate production services that power core aspects of the Elastic business, including downloads, Docker registry, maps service, and more Support internal adoption of the Elastic Stack for software development and analytics use cases What You Bring: Software Developer: You have a broad development background and are deeply proficient in at least one language. Our team uses Python, JavaScript, Clojure, and Haskell, but we work alongside engineers across the company using everything from Java to Go — the specific language matters less than the depth of your expertise. Site-Reliability Engineering: We are fundamentally an operations team. We solve problems with code, but our core mission is keeping things running. Experience in an SRE or equivalent role is a strong indicator of fit. Service-Oriented: You have multiple years of hands-on experience administering Linux systems, ideally at scale and in distributed environments. Experience helping operate a SaaS platform is a plus. Infrastructure-as-code: You're comfortable automating production systems collaboratively — treating configuration as code, managing it through version control, and working with tools such as Docker, Terraform, Puppet, Chef, Ansible, Salt, Packer, Kubernetes, or your own well-crafted shell scripts. Bonus Points: A drive to automate and monitor everything. If it can be automated, you'll find a way Experience building reusable software components; open source contributions (library, patch, documentation, or otherwise) are a bonus Comfort with a versioned, Git-based
Read original postingRequired Skills
Elastic