Job Summary:
ByteDance is a global technology company known for its innovative products like TikTok and CapCut. They are seeking a Research Scientist to develop agent frameworks that continuously learn and improve through real-world interactions, focusing on self-evolving agent systems and large-scale log analytics.
Responsibilities:
• Research and develop agent frameworks that continuously learn and improve from execution traces, user feedback, and environmental signals.
• Build large-scale log analytics pipelines to extract quality signals, usage patterns, and actionable insights from model and agent invocation logs, driving data-informed system and model improvements.
• Explore and apply frontier techniques in LLM post-training, reasoning, and planning to enhance agent capabilities.
• Collaborate across algorithm research, platform engineering, and product teams to turn research ideas into production-grade systems at scale.
Qualifications:
Required:
• Ph.D. in Computer Science, Artificial Intelligence, Machine Learning, or a closely related discipline.
• Strong theoretical and practical foundation in machine learning, deep learning, reinforcement learning, or optimization.
• Research experience in at least one of the following areas: LLM-based agents, planning and reasoning, multi-agent systems, continual/lifelong learning, or LLM post-training (e.g., RLHF, DPO, GRPO, self-play).
• Strong programming skills in Python and proficiency with ML frameworks (e.g., PyTorch, TensorFlow, JAX).
• Publication record at top-tier venues (e.g., NeurIPS, ICML, ICLR, ACL, EMNLP, NAACL, AAAI, AAMAS, COLM).
• Strong problem-solving skills and ability to thrive in a fast-paced, collaborative environment.
Preferred:
• Publications in areas directly related to agent learning and adaptation, such as tool use, self-improvement, skill discovery, trajectory optimization, reward modeling, or agent evaluation.
• Research experience in LLM reasoning and planning, including chain-of-thought, tree/graph search, Monte Carlo methods, or inference-time compute scaling.
• Experience training or fine-tuning large language models, including supervised fine-tuning, preference optimization, or curriculum learning.
• Hands-on experience building or evaluating LLM-based agent systems (e.g., ReAct, function calling, code generation agents, or multi-agent orchestration).
• Familiarity with meta-learning, few-shot generalization, or transfer learning in the context of LLM-based systems.
• Experience with feedback-driven optimization loops, such as online learning, bandit methods, or evolutionary strategies applied to agent improvement.
• Strong interest in bridging frontier AI research with production-grade engineering — turning papers into systems that work at scale.
• Internship experience at technology companies or research organizations.
Company:
ByteDance is a technology company that develops content creation platforms and services. Founded in 2012, the company is headquartered in Beijing, CHN, with a team of 10001+ employees. The company is currently Late Stage.