Unity
Unity

1 Unity Full Stack Developer Jobs Hiring Near You

Unity Jobs Information

What are the most popular categories at Unity?
Infographic showing various Full Stack Developer job openings at Unity in the United States as of June 2026, with employment types broken down into 100% Full Time. Highlights an 100% Physical job distribution.
Principal Machine Learning Engineer, Mobile AI Inference Optimization

Principal Machine Learning Engineer, Mobile AI Inference Optimization

Unity

Mountain View, CA • On-site

Full-time

Posted yesterday


Job description

Job Summary:
Unity is the world’s leading game engine, powering play for more than 3 billion consumers each month. They are seeking a Principal Machine Learning Engineer to lead the deployment of multi-modal AI models on mobile devices, focusing on optimizing performance and latency for AI-driven features in mobile games.
Responsibilities:
• Set the technical vision and roadmap for deploying multi-modal AI models to iOS and Android, spanning transformers, diffusion models, and JAPE-style generative architectures.
• Make authoritative decisions on model compression, quantization, pruning, and knowledge distillation strategies to meet mobile latency and memory budgets.
• Evaluate and select inference runtimes (e.g., CoreML, ONNX Runtime Mobile, TFLite, ExecuTorch) and drive adoption across the team.
• Own the end-to-end optimization pipeline: from model export and graph transformation to hardware-specific kernel tuning on NPU, GPU, and CPU.
• Collaborate directly with research scientists to translate novel model architectures into deployable, mobile-optimized implementations.
• Design scalable systems for multi-modal inference that process diverse inputs — images, text, primitives, and metadata — and produce pixel-level outputs with real-time performance.
• Pioneer new approaches to dynamic resolution, token reduction, and speculative decoding tailored to mobile constraints.
• Track and rapidly adopt breakthroughs in efficient diffusion (e.g., consistency models, flow matching) and efficient attention (e.g., FlashAttention, linear attention variants).
• Lead and mentor a team of ML engineers; define engineering best practices, code review standards, and on-device benchmarking methodology.
• Partner with platform engineers, product managers, and runtime teams to align ML capabilities with device SKU constraints and product roadmaps.
• Champion a culture of measurement: define KPIs for latency, accuracy, memory, and power consumption and ensure the team tracks them rigorously.
Qualifications:
Required:
• 8+ years in ML engineering, with at least 3 years focused on on-device / edge inference optimization.
• Proven production deployment of transformer-based models (e.g., ViT, LLaMA, Stable Diffusion) and/or JAPE-style generative architectures on mobile or embedded hardware.
• Hands-on expertise with CoreML, TFLite, ONNX Runtime, and/or ExecuTorch; deep understanding of operator fusion, memory layout, and runtime scheduling.
• Expert-level command of INT8/INT4/FP16 quantization, weight sharing, structured/unstructured pruning, and knowledge distillation.
• Strong understanding of mobile SoC architectures (Apple Neural Engine, Qualcomm Hexagon/Adreno, ARM Mali) and how to target each for peak throughput.
• Proficiency in C++ / Objective-C / Swift for runtime integration; solid Python for training-side tooling and export pipelines.
• Ability to read, implement, and extend ML research papers; familiarity with efficient attention, diffusion samplers, and multi-modal fusion techniques.
• Track record of technical leadership: setting direction, influencing cross-functional partners, and growing engineers.
Preferred:
• Experience shipping world-model or neural rendering pipelines (NeRF, 3DGS, or similar) on mobile.
• Contributions to open-source ML inference frameworks or mobile ML research publications.
• Familiarity with compiler stacks such as MLIR, TVM, or XLA for custom kernel generation.
• Background in real-time graphics or game engine pipelines (Metal, Vulkan, OpenGL ES).
Company:
Unity [NYSE: U] offers a suite of tools to create, market, and grow games and interactive experiences across all major platforms from mobile, PC, and console, to extended reality. Founded in 2004, the company is headquartered in San Francisco, USA, with a team of 5001-10000 employees. The company is currently Late Stage.