Data Centre Engineering Operations, BOM DCEO
This position within Amazon Data Services India Private Limited (ADSIPL) requires broad data centre knowledge with Subject Matter Expertise (SME) in as many specific fields as possible. The location for this job to be discussed, as there may be opportunities in several India locations.
This position serves as the primary operational resource to support ADSIPL within its owned and operated data centres in India. This position will provide a central point of ownership and accountability for the overall 'hands-on' management of the mechanical, electrical (M&E) and across ADSIPL's portfolio of data centres in India. This position will be responsible for the overall operation and maintenance of the critical infrastructure supporting IT operations within the Indian data centre space. It will also include event management, incident management, problem management, change management, and cost/contract management. In addition, this will include the relationship management with the landlords, critical facility vendors, data centre construction team, data centre operations team, technical programme managers, security team, and logistics team in India. The position will require 24x7 on-call, scheduled weekend work support and rotational shift.
Key Job Responsibilities
* Ownership of all data centre changes/events/incidents/problems from beginning to end as well as overseeing the completion of investigation, root cause analysis and follow-up resolution actions.
* Responsible for ensuring maintenance/repairs of a data centre (electrical/mechanical/fire suppression systems) are planned and executed to the best interest of the business.
* Responsible for asset and inventory management.
* Develop and maintain method statements, standard operating procedures, emergency response procedures, preventive maintenance programmes, and all technical documentation.
* Ensure standardisation and consistency with best-in-class operating practices. (Technical writing skills and automation)
* Develop and deliver the regular engineering reports and ensure adherence to contracted deliverables including SLAs and KPIs.
* Providing hands-on facility support where required (e.g. installation of new equipment, decommissioning of equipment, replacement of faulty equipment, internal audits…etc.)
* Oversee technical compliance auditing and the effective and timely close out of corrective action plans.
Incident and Emergency Response
* Managing information flow during incidents while providing regular updates to management.
* Reviewing incident reports, documenting periodic trend summaries, and providing updates and recommended actions to management.
About the Team
You'll be working with highly technical data centre teams including cross-functional ones looking after various aspects of a data centre, in order to maintain 100% uptime for the customers.
BASIC QUALIFICATIONS
* At least two years of experience of data centre operations and on-call support for data centre facilities.
* Degree in engineering field (electrical, mechanical, industrial, instrumentation).
* An excellent understanding of the nature of mission critical systems (data centres, hospitals, power plants, military facilities, etc.).
* The candidate needs to be a self-starter and independent worker.
* Ability to solve problems at their root, stepping back to understand the broader context.
* Ability to write and review accurate and complete support procedures, system documentation, and issue tracking entries.
* Ability to prioritise in a complex, fast-paced environment.
PREFERRED QUALIFICATIONS
* An excellent understanding of the electrical systems in critical data centre operations that include but not limited to utility substation feeds, transformers, switchgear, VFI Class UPS, DRUPS, PDUs, ATS, STS, SLA/VRLA batteries and associated systems, diesel/gas turbine generators and related fuel systems, surge suppression, active harmonic filtering, battery monitoring systems, branch circuit monitoring systems, SCADA systems.
* An excellent understanding of the mechanical systems in critical data centre operations include but not limited to CRAC/CRAHs/AHUs, chillers, cooling towers, storage tanks, chemical system, heat exchangers, piping systems, pumps, valves, duct systems, fans, dampers.
* An excellent understanding of other facilities systems used in data centres and mission critical facilities, including but not limited to fire detection and suppression systems, plumbing and drainage systems, building monitoring systems, automatic control systems.
* An excellent understanding of design, procurement, suitability of application, testing and commissioning. Certifications/accreditations that will be viewed positively: PMP; Prince2; ITIL v2/3; BICSI; ASHRAE, CDCP/S/E or equivalent.