Support Engineer - Incident Management, AWS Incident Response (AIR)
DESCRIPTION
AWS Incident Response is at the heart of the high availability of Amazon Web Services. We make customer-impacting events shorter and less frequent by providing large-scale event and incident management. Our automated tooling quickly identifies the cause of an issue and helps mitigate its impact, and much of our engineering time is spent on projects to improve the tooling and automation. We also provide manual incident management for AWS and other Amazon groups, directing the resolution of issues with service teams and diving deep into those events to drive improvements to the tooling. It's an exciting time to join our team as we are rapidly growing and expanding our offerings.
As a Support Engineer on the team, you will lead projects and build processes to reduce the duration, frequency, and impact of issues within the AWS and Amazon infrastructure. You will also spend a portion of your time directing the resolution of high-visibility incidents by leading conference calls and teams across the globe. Using data learned from those incidents, you will drive further improvements into our automation, tooling, and processes so that the next event is shorter or avoided entirely. You will participate on project teams to expand use of our tooling to additional areas across Amazon. You'll also have the opportunity to grow your coding skills by taking on development projects matched to your ability level. If you're looking for a supportive team with great growth potential and an opportunity to make a huge impact, this is the team to join.
AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we’re looking for talented people who want to help.
You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
Key job responsibilities
- Drive the resolution of large-scale, customer-impacting issues as part of a team rotation, including some weekends and holidays
- Identify and troubleshoot recurring platform issues and own projects to drive improvements
- Participate in Agile sprints to evolve business processes and technologies
- Create and review documentation; design new standard operating procedures
- Mentor peers in your areas of technical and operational strength
- Lead projects and teams across the globe to drive operational improvements
A day in the life
A Support Engineer on the AWS Incident Response team has full visibility into all AWS products and services. There are limitless opportunities to learn as we work with internal AWS teams.
When on call, we provide incident management capabilities through conference calls and automation to support internal AWS teams during the response, diagnosis, and mitigation of large-scale events.
When not on call, we build processes and automation to help AWS experience fewer, shorter, and smaller customer-impacting incidents.
About the team
The AWS Incident Response (AIR) team is Amazon’s central defense against large-scale incidents and drives operational excellence across all of Amazon’s businesses. Our key offering to Amazon is best-in-class Incident Management. Our engineers are front-and-center in driving down event duration through experience in operational excellence, current best practices, and incident management tooling.
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
Why AWS?
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Inclusive Team Culture
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empowers us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
BASIC QUALIFICATIONS
- Experience troubleshooting and debugging technical systems
- Experience in agile/scrum or related collaborative workflows
- Experience troubleshooting and documenting findings
- 3+ years of technical support or related experience
PREFERRED QUALIFICATIONS
- Knowledge of UNIX / Linux operating systems
- Experience driving and managing large troubleshooting efforts or incidents
Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify and build. Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice (https://www.amazon.jobs/en/privacy_page) to know more about how we collect, use and transfer the personal data of our candidates.
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/content/en/how-we-hire/accommodations.