该职位来源于猎聘 About the team At Coupang, the Storage and Tools team builds and maintains high-performance, scalable, and reliable distributed SQL and NoSQL databases. We work on creating systems that efficiently store large amounts of data with zero downtime, enabling various teams across the company to use data effectively and flexibly. Our responsibilities include designing, implementing, testing, and documenting best practices for optimal database usage, ensuring robust, secure, and resilient production database infrastructure. Job Introduction
We are looking for talented SDEs who can translate business and functional requirements into concrete deliverables, build rapid prototypes or proofs of concepts, and invent new features, design systems, develop and deploy highly scalable and reliable distributed services. The responsibilities include:
- Collaborate with domain teams and TPMs to design distributed storage systems, establish deployment / scale plans, and monitor quality after product launch.
- Set up storage cloud infrastructure for large datasets (Clickhouse, Cassandra, Elasticsearch etc.), optimizing cluster performance for scalability and data availability.
- Evaluate and prepare new and upgraded storage products for company-wide use. You will document and share techniques, patterns, and best practices with domain teams, DBAs, and support teams.
- Migrate storage system between different technologies, ensuring seamless transitions.
- Research, design, implement, document, and maintain new and legacy systems, and create software services and tools to drive the software development process toward higher quality.
- Strong experience in building and managing EKS-based storage cloud solutions, e.g. Clickhouse or Vitess. Qualifications
- Bachelor's degree or higher in computer science, engineering, information systems, or a related technical field.
- 7+ years of experience in designing, developing, and managing large-scale software infrastructure.
- Proven experience with enterprise-scale storage system architecture, configuration, deployment, performance tuning, security, backup/recovery, data processing/consistency, and troubleshooting.
- Solid knowledge of DBMS concepts such as ACID, CAP, and consistency.
- Proficiency programming language on Java, Python, or Go preferred.
- Excellent written and verbal English communication skills. Preferential treatment
- Strong experience in developing and operating high-availability services in cloud environments (AWS, etc.) and with containers (Docker, Kubernetes, etc.).
- A team player and quick learner with the ability to quickly classify and troubleshoot complex problems.
- Strong ownership and passion for applying new technologies to real-world products.
- Experience building monitoring and visualization systems using Prometheus, Grafana, etc.