Job Summary:
SambaNova is a leading company in generative AI technology, providing a full-stack platform optimized for enterprise and government organizations. The ML Features Solutions Engineer will focus on the development and optimization of core ML features for enterprise deployment, bridging the gap between ML research and product engineering to deliver high-quality, production-ready solutions.
Responsibilities:
• Design and implement core ML features including model optimization, quantization, and inference enhancements
• Optimize model performance for latency, throughput, and memory efficiency on SambaNova hardware
• Develop and improve features such as Function Calling, Structured Output, and JSON mode conformance
• Create end-to-end ML solutions that showcase platform capabilities and accelerate customer adoption
• Convert cutting-edge ML research into practical, deployable product features
• Establish benchmarks and quality standards for ML features in production environments
• Work with SDK team to ensure ML features are properly exposed and documented for developers
• Support enterprise customers implementing advanced ML features in their workflows
• Partner with ML research, platform engineering, and customer teams
Qualifications:
Required:
• Master’s degree or higher in Computer Science, Machine Learning, Electrical Engineering, or related field
• 5+ years of industry experience in ML engineering or applied ML research
• 3+ years of hands-on experience with large language models and transformer architectures
• Expert proficiency in Python and deep learning frameworks: PyTorch (required), TensorFlow, or JAX
• Experience with model optimization techniques: quantization, pruning, distillation, efficient inference
• Strong understanding of LLM inference optimization: KV cache, batching strategies, memory management
• Experience deploying ML models to production at scale
• Track record of translating research concepts into production features
Preferred:
• PhD in Machine Learning, NLP, or related field
• Experience with custom hardware acceleration (TPUs, custom ASICs)
• Hands-on experience with inference frameworks: vLLM, TensorRT-LLM, or similar
• Experience with function calling and tool use in LLMs
• Knowledge of structured generation and constrained decoding
• Experience with ML feature development in enterprise contexts
• Contributions to open-source ML projects
Company:
SambaNova is an AI hardware and software company that specializes in providing infrastructure for AI and machine learning applications. Founded in 2017, the company is headquartered in Palo Alto, USA, with a team of 201-500 employees. The company is currently Growth Stage.