Site Reliability Engineer
Sunnyvale, CA, United States
Site Reliability Engineer
Status : Full Time
Compensation : 120k to 145k
Hybrid Requirements : 3 days in office, 2 days remote
Lawrence Harvey has partnered with a leading Chinese fintech startup that is committed to democratizing payment services and empowering people and businesses to thrive in the global economy. They are a team of engineers dedicated to building cutting-edge, highly reliable, scalable, and high-throughput payment systems. This organization provides an opportunity to positively impact the world by developing exciting, large-scale payment systems with passionate engineers.
Role Overview:
Our North American Technology team is seeking a talented, creative, and passionate Site Reliability Engineer to help build an innovative payment system that addresses merchant and consumer needs. As a self-motivated and enthusiastic team member, you will collaborate with skilled peers in a dynamic environment to build high-performance, efficient payment authorization services that are scalable, configurable, and available. You will be encouraged to innovate and work swiftly to deliver solutions to our customers.
Responsibilities:
Automate Technical Operations: Design and develop solutions to automate the technical operations of large-scale systems, enhancing stability from a Software Development Lifecycle perspective.
Strengthen System Stability: Improve payment system stability through monitoring, logs, dashboards, and diagnostic tools. Conduct regular drills, develop remedy plans for quick service restoration, and respond to production issues across regions.
Evaluate System Performance: Define indicators to assess system performance and runtime to improve observability, aiding in system development and troubleshooting. Plan system capacities based on business expansion and scheduled promotions.
Analyze Production Cases: Investigate performance bottlenecks and other production issues to establish technical best practices and achieve a high-availability payment architecture.
Design and Implement Data Protection Plans: Design and set up new data centers (IDC) and implement data protection plans to meet standard requirements.
Qualifications:
Education: Bachelors or Masters degree in a Computer Science-related discipline, or equivalent practical experience.
Experience: At least 2 years of experience in the Internet industry is preferred.
Technical Knowledge: Solid understanding of Computer Science principles, including Operating Systems (Unix/Linux), Computer Storage, and Computer Networking.
Programming Skills: Software development experience in at least one programming language, such as Java or Python.
Problem-Solving Abilities: Strong ability to resolve system problems, coupled with good communication skills and a sense of ownership.
Relevant Experience: Familiarity with technologies such as Redis, MySQL, Nginx, Kubernetes, Docker/Containers, Function as a Service, RPC Framework, and Service Mesh.
#J-18808-Ljbffr