The Transactional Data Storage org builds data platforms which are at the core of Amazon’s consumer and digital businesses. Our OLTP storage platform, Sable, processes billions of transactions every day around the globe. We own multiple tier-1/mission-critical services fueling the growth of Amazon businesses to store customer orders, catalog and product information, shopping cart, Kindle device information, and hundreds of other business critical data-sets. The performance and availability of our platform is critical to the reliability of the Amazon brand. We run some of the largest fleets and the highest throughput services at Amazon. We partner with Amazon Web Services (AWS) to leverage technologies built for cloud customers, and to offer technologies we invent through AWS channels to the broader market.
We are looking for a passionate engineer who understands distributed systems at scale and is interested to learn about database systems from database engine design to operating a tier-1 flagship product. This engineer will work with PEs and other engineers to enhance the Control Plane which determines how the system scales to support millions of requests, how it recovers from disasters, how does it perform cross regions data movement and many other admin operations. This role will contribute to design, development, testing and operations for Sable control plane. An engineer in this role will have strong sense of ownership, high bar for operational excellence and proven record of solving hard technical problems from design to delivery.
Key job responsibilities
Design, develop, and operate distributed control plane services that manage scaling, disaster recovery, cross-region data movement, and administrative operations for Sable - Amazon's mission-critical OLTP storage platform.
Collaborate with Principal Engineers and senior engineers to define technical solutions for complex problems in areas such as observability, automated scaling, fleet management, and system resilience.
Own end-to-end delivery of features from design through deployment, including writing design documents, implementing production-quality code, and driving operational readiness.
Build and maintain monitoring, alarming, and automation to ensure the highest bar of operational excellence for tier-1 services processing billions of daily transactions.
Participate in on-call rotations, troubleshoot production issues, and drive root-cause analysis and corrective actions to prevent recurrence.
Write well-tested, maintainable code and contribute to code reviews that uphold team quality standards.
Identify opportunities to improve system reliability, performance, and efficiency, and drive those improvements from proposal to production.
- Experience (non-internship) in professional software development
- Experience programming with at least one software programming language
- Experience building complex software systems that have been successfully delivered to customers
- Experience designing or architecting (design patterns, reliability and scaling) of new and existing systems
- Experience with full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
- Bachelor's degree in computer science or equivalent
Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify and build. Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice (https://www.amazon.jobs/en/privacy_page) to know more about how we collect, use and transfer the personal data of our candidates.
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.