Job Summary:
ByteDance is a leading technology company focused on inspiring creativity and enriching life. The Seed Speech team is seeking a Research Scientist to develop and scale speech foundation models, improve core capabilities such as speech recognition and synthesis, and explore interactive speech-based systems.
Responsibilities:
• Develop and scale speech foundation models for understanding and generation tasks.
• Design training pipelines including data construction, instruction tuning, and model alignment.
• Improve core capabilities such as speech recognition, synthesis, reasoning, and robustness.
• Optimize model architectures, training efficiency, and system performance.
• Explore natural and interactive interfaces for speech-based systems.
Qualifications:
Required:
• Currently pursuing a Bachelor's or Master's degree in computer science, mathematics, engineering, or a related field, with an expected graduation date in 2027 and the ability to commit to an onboarding date by the end of 2027.
• Excellent coding ability, data structures, and fundamental algorithm skills, proficient in C/C++ or Python, etc.
• Demonstrated interest or project experience in relevant areas.
Preferred:
• Experience in speech processing, audio modeling, or related areas through internships is preferred.
• Strong problem-solving and collaboration skills.
Company:
ByteDance is a technology company that develops content creation platforms and services. Founded in 2012, the company is headquartered in Beijing, CHN, with a team of 10001+ employees. The company is currently Late Stage.