Job Summary:
ByteDance is a company focused on inspiring creativity and enriching life through innovative products like TikTok. They are seeking a Software Engineer Graduate to assist in optimizing their cloud-native infrastructure for AI and LLM workloads, contributing to performance improvements and resource management within their Kubernetes-based systems.
Responsibilities:
• Assist in analyzing and supporting enhancements to Hyper-Scale AI Infrastructure platforms, focusing on improving performance, scalability, and resilience for both traditional workloads and large language model (LLM) applications.
• Contribute to performance optimization efforts for Kubernetes-based infrastructure, including monitoring pod lifecycle, tracking resource utilization, and analyzing system behavior under varying load conditions, while working closely with senior engineers to identify improvement opportunities.
• Lead small-scale development tasks related to resource management and scheduling in Kubernetes clusters, such as testing configuration updates, automating routine resource allocation workflows, or contributing to tooling for efficiency tracking.
• Engage actively in team discussions on AI infrastructure design and optimization strategies, leveraging academic knowledge and personal projects to propose fresh insights and potential solutions.
• Develop and maintain clear technical documentation, including runbooks, architecture diagrams, and process guides, to strengthen knowledge sharing and operational efficiency across the team.
Qualifications:
Required:
• Individuals who are completing or have recently completed a PhD degree in Software Development, Computer Science, Computer Engineering, or a related technical discipline. A strong publication record is a plus.
• Proficiency in at least one major programming language such as Python, Go, C++, Rust, or Java.
• Solid understanding of at least one of the following fields: Unix/Linux environments, distributed and parallel systems, high-performance networking systems, or large-scale software system development.
Preferred:
• Hands-on project experience with container and orchestration technologies such as Docker and Kubernetes through internships, coursework, or personal projects.
• Experience in developing or contributing to cloud-native open-source projects.
Company:
ByteDance is a technology company that develops content creation platforms and services. Founded in 2012, the company is headquartered in Beijing, China, and has more than 10,000 employees. The company is currently late stage.