THE ROLE
This role will be a support engineer within the Tesla IT Infrastructure Engineering & Operations department. The Sr. Incident Response Engineer will be coordinating with cross-functional engineering teams for Incident Response & Management in terms of the high availability to Tesla Manufacturing, Business Operations, Customer Service & Experience. We help to reduce the occurrence of incidents by using efficient IT Operation monitoring, effective risk analysis and professional team collaboration.
The Tesla APAC Incident Response Center is a growing team consist of professionals from diverse backgrounds, which will offer you a fantastic development environment. This role will be based on Giga Factory Shanghai, China but will provide support to Tesla Business globally considering of the growing business & great mission.
Responsibilities
- Independently lead incident response and management to minimize impact and ensure optimal response times. Develop incident response plans, conduct post-mortem analyses, and organize drills to enhance preparedness.
- Drive IT service management projects. Establish/optimize SOPs to reduce inter-team communication barriers, promote technical knowledge sharing, and improve team incident response capabilities.
- Monitor IT infrastructure and data center operations, including servers, networks, and applications. Analyze real-time stability metrics, mitigate risks, and deliver regular operational analysis reports.
- Proactively enhance team efficiency through tool automation, process refinement, and adoption of industry best practices. Support daily operations and foster a culture of continuous improvement.
- Oversee infrastructure changes to minimize risks, streamline approval workflows, and ensure compliance with change management protocols.
Requirements
Must Qualifications
- Minimum 5 years of working experience with related academic background(Information Technology, Software Engineering, Computer Science. etc.).
- Deep understanding of IT infrastructure knowledge base, such as Networking, Server, Visualization, Storage. Etc. Hands on experience is preferred.
- Deep understanding of monitoring tools such Grafana, Prometheus or Splunk.
- Experience with change management is preferred.
- Fluent English skills, excellent communication skills and strong sense of responsibility, sense of problem solving and great teamwork.
Preferred Qualifications
- Problem solving experience as IT infrastructure service administrator or support.
- Experience with AWS, or other cloud infrastructure providers as support or Admin.
- Hands on experience on Project, Process and Change Management, ITIL expert.