THE ROLE
This role is a support engineer position within the Tesla IT Infrastructure Engineering & Operations department. The Sr. Incident Response Engineer coordinates with cross-functional engineering teams on incident response and management to maintain high availability for Tesla Manufacturing, Business Operations, and Customer Service & Experience. We help reduce the occurrence of incidents through efficient IT operations monitoring, effective risk analysis, and professional team collaboration.

The Tesla APAC Incident Response Center is a growing team of professionals from diverse backgrounds, offering you a fantastic development environment. This role is based at Gigafactory Shanghai, China, but supports Tesla's business globally, in view of the company's growing business and great mission.

Responsibilities

Independently lead incident response and management to minimize impact and ensure optimal response times. Develop incident response plans, conduct post-mortem analyses, and organize drills to enhance preparedness.
Drive IT service management projects. Establish/optimize SOPs to reduce inter-team communication barriers, promote technical knowledge sharing, and improve team incident response capabilities.
Monitor IT infrastructure and data center operations, including servers, networks, and applications. Analyze real-time stability metrics, mitigate risks, and deliver regular operational analysis reports.
Proactively enhance team efficiency through tool automation, process refinement, and adoption of industry best practices. Support daily operations and foster a culture of continuous improvement.
Oversee infrastructure changes to minimize risks, streamline approval workflows, and ensure compliance with change management protocols.

Requirements
Required Qualifications

Minimum 5 years of work experience with a related academic background (Information Technology, Software Engineering, Computer Science, etc.).
Deep understanding of IT infrastructure, such as networking, servers, virtualization, and storage. Hands-on experience is preferred.
Deep understanding of monitoring tools such as Grafana, Prometheus, or Splunk.
Experience with change management is preferred.
Fluent English, excellent communication skills, a strong sense of responsibility, a problem-solving mindset, and great teamwork.

Preferred Qualifications

Problem-solving experience as an IT infrastructure service administrator or in a support role.
Experience with AWS or other cloud infrastructure providers in a support or administrator role.
Hands-on experience with project, process, and change management; ITIL expertise.


IT Incident Response Engineer