Job Summary:
Qualcomm Technologies, Inc. is seeking a Senior AI Research Quantization Engineer to join their AI Research team. The role involves developing advanced algorithms for efficient generative AI and model optimization, collaborating with a multi-disciplinary team to enhance machine learning technology deployed in industry-leading devices.
Responsibilities:
• Algorithms research and development for efficient generative AI, LLM, LVM, Multi-modal, VLA
• Efficient inference algorithms, e.g. batching, KV caching, efficient attentions, long context, speculative decoding
• Advanced quantization algorithms for complex generative models, e.g., gradient/non-gradient based optimization, equivalent/non-equivalent transformation, automatic mixed precision, hardware in loop
• Model compression, lossy or lossless, structural and neural search
• Generative AI system prototyping
• Apply solutions toward system innovations for model efficiency advancement on device as well as in the cloud
• Python, Pytorch programming.
Qualifications:
Required:
• Bachelor's degree in Computer Science, Engineering, Information Systems, or related field and 2+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
• Master's degree in Computer Science, Engineering, Information Systems, or related field and 1+ year of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.
• PhD in Computer Science, Engineering, Information Systems, or related field.
Preferred:
• Master's degree in Computer Science, Engineering, Information Systems, or related field. PHD's degree is preferred.
• 2+ years of experience with Machine Learning algorithms or systems engineering or related work experience.
Company:
Qualcomm designs wireless technologies and semiconductors that power connectivity, communication, and smart devices. Founded in 1985, the company is headquartered in San Diego, USA, with a team of 10001+ employees. The company is currently Late Stage.