A leading B2B SaaS platform in the cross-border e-commerce sector is expanding its North America operations. We're seeking a Senior DevOps Engineer / Site Reliability Engineer (SRE) to architect and maintain our unified global O&M (operations and maintenance) platform.
This is a newly created role supporting our North America team. You'll work directly with our Middle Platform Director, Technical Experts, and CEO in a collaborative, remote-first environment. The role can be located anywhere in the US.
KEY RESPONSIBILITIES:
• Design, develop, and maintain unified operation and platform management systems covering resource management, monitoring & alerting, configuration management, and automated operation & maintenance
• Build and operate observability platforms and CI/CD pipelines; develop self-healing systems and automated incident response processes to realize intelligent O&M
• Establish DevOps standards and best practices; promote standardization of DevOps toolchains (technology selection, version management)
• Provide platform-level technical support for product and engineering teams; resolve complex system issues, reduce technical debt, and lead infrastructure and architecture upgrades
• Promote SRE concepts and engineering practices; organize technical sharing and training; build a reliability engineering system
• Conduct technical research and innovation; track cloud-native/DevOps industry trends; evaluate new technologies and drive continuous modernization of O&M platforms
REQUIRED QUALIFICATIONS:
• Currently residing in California or North Carolina, USA
• US Green Card or US Citizenship (work authorization; no sponsorship available)
• Fluent in Mandarin Chinese (working language; close collaboration with domestic R&D required)
• Bachelor's degree or above in Computer Science or related field
• 4-6 years of hands-on experience in DevOps/SRE/Platform Engineering
• Proficient in at least one major cloud platform (AWS/Azure/GCP) with deep understanding of VPC, EC2, EKS/K8s, RDS, IAM
• Proficient in Linux, networking, containers (Docker/Kubernetes), load balancing, and service governance
• Skilled in IaC (Infrastructure as Code) tools: Terraform, Ansible, Helm
• Experience building CI/CD pipelines: Jenkins, Argo CD, CodeBuild, etc.
• Familiar with monitoring/logging/tracing: Prometheus, Grafana, ELK, OpenTelemetry
• Proficient in at least one development/scripting language: Python, Shell, Go
• Excellent system design, analysis, and troubleshooting skills
• Strong cross-team communication and collaboration abilities
PREFERRED QUALIFICATIONS:
• Master's degree in Computer Science or related field
• Experience with global platforms, cross-border SRE, multi-cloud O&M
• Experience leading platform reconstruction, self-healing systems, or observability initiatives
• Go development, service mesh, chaos engineering, capacity planning experience
• Demonstrated success improving system availability, reducing incident rates, increasing automation
• Global technical vision and cross-cultural collaboration experience
• Results-oriented, self-driven, experienced in technical evangelism/sharing
COMPENSATION:
• Base Salary: $140,000 - $160,000 annually (top candidates may receive 5-10% upward adjustment)
• 401(k): Dollar-for-dollar match, up to 4% of salary
• Medical Insurance
• PTO: 12 days annually
• Social Security & Housing Fund: Contributed per US legal requirements
WORK ENVIRONMENT:
• Location: Silicon Valley, CA OR Raleigh, NC (homebase available)
• Department: Tech O&M Department
• Working Style: Remote-first
• Hours: 8 hours per day, weekends off
• Travel: No business travel required
• Expected Start: ASAP
INTERVIEW PROCESS:
• Round 1 (Online): Middle Platform Director + Technical Expert
• Round 2 (Online): Head of HR
• Round 3 (Online): CEO/Founder