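After a period of hard-won experience, eBay Shanghai's SRE team has earned the recognition of eBay's global technology teams and entered a phase of rapid expansion. This year, drawing on our vast store of internal business and performance metrics and using machine learning, we will make breakthrough improvements to the resilience and stability of eBay's products and services. We need solid development skills, the ability to learn very quickly, and sharp thinking about complex problems. Even if you are new to the workforce with only one or two years of experience, you are just as welcome as long as you have a solid computer science foundation. If you are interested, send your résumé in both Chinese and English to wenli at ebay.com.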
Job Description
- Leverage billions of internal site metrics, the machine learning platform, and fault injection methodology to push the SLA to an even higher level.
- Work closely with software architects, developers, and QE from multiple product teams to develop innovative solutions that attain higher availability, scalability, and reliability.
- Jointly drive efforts to enhance and redesign products in pursuit of application resilience across multiple domains, including Payment, Framework, Cloud Platform, Data Services, etc.
- Be the technical and process leader for end-to-end site reliability solutions.
- Build the framework for the Business PD team to reveal potential resiliency challenges inside products and provide them with self-healing capability.
- Build tools based on modern open source technologies to detect and resolve site reliability issues.
Job Requirements
- Hands-on, demonstrable coding experience in either Java or Python.
- Ability to design a software product from both a Software Architecture and a Solution Architecture perspective.
- Understanding of microservices, networking, and load balancing in support of application resilience.
- Good at tracing application errors from logs to source code and solving performance and reliability challenges at many levels of a distributed system.
- Excellent analysis, design, and problem-solving skills are a must.
- Effective verbal and written communication skills are a must.
- BS degree in a technical field or equivalent work experience required.