Data Center Engineering Operations Engineer, Data center engineering operation
DESCRIPTION
AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we’re looking for talented people who want to help.
You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
Amazon is expanding the Data Center management team in India. This position serves as the primary operational resource to support ADSIPL within its owned and operated Data Centers in India. This position will provide a central point of ownership and accountability for the overall ‘hands-on’ management of the Mechanical, Electrical (M&E) and across ADSIPL’s portfolio of Data Centers in India. This position will be responsible for the overall operation and maintenance of the critical infrastructure supporting IT operations within the Indian Data Center space. It will also include event management, incident management, problem management, change management, and cost/contract management. In addition, this will include the relationship management with the landlords, critical facility vendors, Data Center Construction team, Data Center Operations team, Technical Program Managers, Security team, and Logistics team in India. The position will require 24x7 on-call, scheduled weekend work support and rotational shift. The location for this job to be discussed, as there may be opportunities in several India locations.
Key job responsibilities
Following are the primary responsibilities (but not limited to) of Data Center Engineering Operations (DCEO) Engineer:
Operations and Maintenance:
· Ownership of all Data Center changes/events/incidents/problems from beginning to end as well as overseeing the completion of post-mortems, root cause analysis and follow-up resolution actions.
· Responsible for ensuring maintenance/ repairs of site-critical facility infrastructure or a Data Center are planned and executed to the best interest of the business.
· Responsible for Asset and Inventory management.
· Develop and maintain method statements, standard operating procedures, emergency response procedures, preventive maintenance programs, and all technical documentation.
· Ensure standardization and consistency with best-in-class operating practices. (Technical Writing Skills and Automation)
· Develop a complete, deep knowledge of the design intent, operational alternatives and contingency plans related to all Data Center systems.
· Manage the engineering aspects of the Data Centers related to financial and cost control, code and regulatory compliance, personnel management, staff training and development
· Health & Safety, local statutory requirements, environmental and energy management.
· Develop and deliver the regular engineering reports and ensure adherence to contracted deliverables including SLA’s and KPI’s.
· Communicate operating philosophies, technical information, objectives and expectations to Amazon personnel and to the vendor critical facilities management teams.
· Providing hands on facility support where required (e.g. installation of new equipment, decommissioning of equipment, replacement of faulty equipment, internal audits…etc.)
· Oversee technical compliance auditing and the effective and timely close out of corrective action plans. Perform annual operational reviews with a focus on compliance with the Amazon standards and all applicable regulatory requirements. (Audits).
· Manage the development and delivery of the portfolio of Energy/Environmental Management Programs. Keep abreast of Data Center industry innovation.
Incident and Emergency Response:
o Reviewing incident reports, documenting periodic trend summaries, and providing updates and recommended actions to management.
o Managing information flow during incidents while providing regular updates to management.
o Manage and coordinate with vendors to resolve any incidents during emergency situations. This may require to physically be dispatched on to site to investigate and resolve the issue.
o Ensure 24*7 shift operations in safe, secure manner without availability impact.
About the team
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
Why AWS?
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Inclusive Team Culture
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why flexible work hours and arrangements are part of our culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
BASIC QUALIFICATIONS
- 1+ years of data center or mission critical facilities (example: hospital, military facility, public safety facility, etc.) experience
- Bachelor's degree
- High school or equivalent diploma
- At least one years of experience in a technical field. An undergraduate degree(Degree or diploma) in a technical field (EE, Mech E, Industrial E);
- An excellent understanding on the nature of mission critical systems (Data Centers, Hospitals, Power plants, military facilities, etc.). ·
- The candidate needs to be a self-starter and independent worker. Ability to solve problems at their root, stepping back to understand the broader context. Previous vendor negotiation and management skills for Data Center and/or upgrade construction contracts. Ability to write and review accurate and complete support procedures, system documentation, and issue tracking entries. Shows good judgment and instincts in decision making under pressure. Ability to prioritize in complex, fast-paced environment. Proactively and continually improve his/her level of knowledge about Amazon’s business and relevant technologies. Able to demonstrate his/her ability to take ownership of technical issues brought to him/her by his/her customer base. If the candidate is unable to resolve certain issues by themselves, he/she should demonstrate a willingness to actively engage other support teams to drive it to resolution. An interest in work subject matter that ensures that the teams are kept abreast of all relevant industry standards changes and innovation practices.
PREFERRED QUALIFICATIONS
- 1+ years of electrical or mechanical experience
- At least one years of experience of Data Center operations and on-call support for Data Center facilities.
- An excellent understanding of the Electrical and systems in critical Data Center operations that include but not limited to utility substation feeds, transformers, switchgear, VFI Class UPS, DRUPS, PDUs, ATS, STS, SLA/VRLA batteries and associated systems, diesel/gas turbine generators and related fuel systems, Surge Suppression, Active Harmonic Filtering, battery monitoring systems, branch circuit monitoring systems, SCADA systems.
- An excellent understanding of the Mechanical systems in critical Data Center operations include but not limited to CRAC/CRAHs/AHUs, chillers, cooling towers, storage tanks, chemical system, heat exchangers, piping systems, pumps, valves, duct systems, fans, dampers. An excellent understanding of other facilities systems used in Data Centers and Mission critical facilities, including but not limited to fire detection and suppression systems, plumbing and drainage systems, Building Monitoring Systems, automatic control systems. An excellent understanding of design, procurement, suitability of application, testing and commissioning. Certifications/Accreditations that will be viewed positively: PMP; Prince2; ITIL v2/3; BICSI; ASHRAE, CDCP/S/E or equivalent