Evaluate, select, and drive adoption of WebGPU-targeted inference runtimes (ONNX Runtime Web, Transformers.js, WebLLM, TensorFlow.js) alongside native options (CoreML, ONNX Runtime, TFLite ...
Evaluate, select, and drive adoption of WebGPU-targeted inference runtimes (ONNX Runtime Web, Transformers.js, WebLLM, TensorFlow.js) alongside native options (CoreML, ONNX Runtime, TFLite ...
Evaluate, select, and drive adoption of WebGPU-targeted inference runtimes (ONNX Runtime Web, Transformers.js, WebLLM, TensorFlow.js) alongside native options (CoreML, ONNX Runtime, TFLite ...
Evaluate, select, and drive adoption of WebGPU-targeted inference runtimes (ONNX Runtime Web, Transformers.js, WebLLM, TensorFlow.js) alongside native options (CoreML, ONNX Runtime, TFLite ...
Senior Machine Learning Engineer, On-Device & Mobile AI Optimization
San Francisco, CA · On-site
$123K - $169K/yr
Work with WebGPU-targeted inference runtimes (ONNX Runtime Web, Transformers.js, WebLLM, TensorFlow.js) alongside native options (CoreML, ONNX Runtime, TFLite, ExecuTorch), and extend or build glue ...
Senior Machine Learning Engineer, On-Device & Mobile AI Optimization
San Francisco, CA · On-site
$123K - $169K/yr
Work with WebGPU-targeted inference runtimes (ONNX Runtime Web, Transformers.js, WebLLM, TensorFlow.js) alongside native options (CoreML, ONNX Runtime, TFLite, ExecuTorch), and extend or build glue ...
AI Full Stack Developer & Architect
San Jose, CA · On-site
$13K/mo
Implement machine learning models (using frameworks like PyTorch, TensorFlow, or scikit-learn) into ... Experience with modern JavaScript frameworks (React.js + Node.js,Next.js, Plotly Dash + FastAPI)
AI Full Stack Developer & Architect
San Jose, CA · On-site
$13K/mo
Implement machine learning models (using frameworks like PyTorch, TensorFlow, or scikit-learn) into ... Experience with modern JavaScript frameworks (React.js + Node.js,Next.js, Plotly Dash + FastAPI)
Implement machine learning models (using frameworks like PyTorch, TensorFlow, or scikit-learn) into ... Experience with modern JavaScript frameworks (React.js + Node.js,Next.js, Plotly Dash + FastAPI)
Quick apply
Implement machine learning models (using frameworks like PyTorch, TensorFlow, or scikit-learn) into ... Experience with modern JavaScript frameworks (React.js + Node.js,Next.js, Plotly Dash + FastAPI)
AI Full Stack Developer & Architect
San Jose, CA · On-site
$13K/mo
Implement machine learning models (using frameworks like PyTorch, TensorFlow, or scikit-learn) into ... Experience with modern JavaScript frameworks (React.js + Node.js,Next.js, Plotly Dash + FastAPI)
AI Full Stack Developer & Architect
San Jose, CA · On-site
$13K/mo
Implement machine learning models (using frameworks like PyTorch, TensorFlow, or scikit-learn) into ... Experience with modern JavaScript frameworks (React.js + Node.js,Next.js, Plotly Dash + FastAPI)
... TensorFlow, PyTorch, or similar frameworks * Experience with bias, fairness, and evaluation of LLM systems * Experience developing Microsoft Word add-ins using Office.js * Knowledge of JWT ...
Quick apply
... TensorFlow, PyTorch, or similar frameworks * Experience with bias, fairness, and evaluation of LLM systems * Experience developing Microsoft Word add-ins using Office.js * Knowledge of JWT ...
AI Architect
San Jose, CA · On-site
... js). Hands-on experience with machine learning frameworks and libraries, such as TensorFlow, PyTorch, Scikit-learn, or Hugging Face. Proficiency in developing and deploying AI/ML models in cloud ...
Quick apply
AI Architect
San Jose, CA · On-site
... js). Hands-on experience with machine learning frameworks and libraries, such as TensorFlow, PyTorch, Scikit-learn, or Hugging Face. Proficiency in developing and deploying AI/ML models in cloud ...
AI Architect
San Jose, CA · On-site
... js). Hands-on experience with machine learning frameworks and libraries, such as TensorFlow, PyTorch, Scikit-learn, or Hugging Face. Proficiency in developing and deploying AI/ML models in cloud ...
Quick apply
AI Architect
San Jose, CA · On-site
... js). Hands-on experience with machine learning frameworks and libraries, such as TensorFlow, PyTorch, Scikit-learn, or Hugging Face. Proficiency in developing and deploying AI/ML models in cloud ...
Full Stack Engineer
San Francisco, CA · On-site
$180K - $250K/yr
Create engaging and responsive user interfaces with TypeScript (React.js, Next.js, or Angular ... Experience integrating AI/ML models into production workflows, preferably with TensorFlow, PyTorch ...
Full Stack Engineer
San Francisco, CA · On-site
$180K - $250K/yr
Create engaging and responsive user interfaces with TypeScript (React.js, Next.js, or Angular ... Experience integrating AI/ML models into production workflows, preferably with TensorFlow, PyTorch ...
Full Stack Engineer
San Francisco, CA · On-site
$180K - $250K/yr
Create engaging and responsive user interfaces with TypeScript (React.js, Next.js, or Angular ... Experience integrating AI/ML models into production workflows, preferably with TensorFlow, PyTorch ...
Full Stack Engineer
San Francisco, CA · On-site
$180K - $250K/yr
Create engaging and responsive user interfaces with TypeScript (React.js, Next.js, or Angular ... Experience integrating AI/ML models into production workflows, preferably with TensorFlow, PyTorch ...
Job Title:Python Developer
Encino, CA · On-site
$52.75 - $72.50/hr
Exposure to data analysis, machine learning, or data science libraries (e.g., Pandas, NumPy, TensorFlow, PyTorch). * Understanding of front-end frameworks (React, Angular, Vue.js) is a plus. What We ...
Job Title:Python Developer
Encino, CA · On-site
$52.75 - $72.50/hr
Exposure to data analysis, machine learning, or data science libraries (e.g., Pandas, NumPy, TensorFlow, PyTorch). * Understanding of front-end frameworks (React, Angular, Vue.js) is a plus. What We ...
Full-Stack Engineer
San Francisco, CA · On-site
$140K - $200K/yr
... Tensorflow, & Google X). We recently raised $5M in seed funding from leading Silicon Valley ... js and NestJS. A critical part of the job is listening carefully to what customers ask for ...
Full-Stack Engineer
San Francisco, CA · On-site
$140K - $200K/yr
... Tensorflow, & Google X). We recently raised $5M in seed funding from leading Silicon Valley ... js and NestJS. A critical part of the job is listening carefully to what customers ask for ...
Expertise in JavaScript frameworks like Angular.js, Vue.js or React.js for building single-page ... Knowledge in Python and relevant libraries/frameworks such as TensorFlow, PyTorch, Hugging Face ...
Quick apply
Expertise in JavaScript frameworks like Angular.js, Vue.js or React.js for building single-page ... Knowledge in Python and relevant libraries/frameworks such as TensorFlow, PyTorch, Hugging Face ...
Sr Full Stack Engineer
Irvine, CA · On-site
Expertise in JavaScript frameworks like Angular.js, Vue.js or React.js for building single-page ... Knowledge in Python and relevant libraries/frameworks such as TensorFlow, PyTorch, Hugging Face ...
Sr Full Stack Engineer
Irvine, CA · On-site
Expertise in JavaScript frameworks like Angular.js, Vue.js or React.js for building single-page ... Knowledge in Python and relevant libraries/frameworks such as TensorFlow, PyTorch, Hugging Face ...
Expertise in JavaScript frameworks like Angular.js, Vue.js or React.js for building single-page ... Knowledge in Python and relevant libraries/frameworks such as TensorFlow, PyTorch, Hugging Face ...
Expertise in JavaScript frameworks like Angular.js, Vue.js or React.js for building single-page ... Knowledge in Python and relevant libraries/frameworks such as TensorFlow, PyTorch, Hugging Face ...
AI Platform Tech Lead
San Francisco, CA · On-site
PyTorch or TensorFlow * XGBoost / scikit-learn * MLflow / W&B * Feature stores * Model monitoring ... React / Next.js * TypeScript * Component systems * API integration Observability * Prometheus ...
AI Platform Tech Lead
San Francisco, CA · On-site
PyTorch or TensorFlow * XGBoost / scikit-learn * MLflow / W&B * Feature stores * Model monitoring ... React / Next.js * TypeScript * Component systems * API integration Observability * Prometheus ...
AI Platform Tech Lead
San Francisco, CA · On-site
PyTorch or TensorFlow * XGBoost / scikit-learn * MLflow / W&B * Feature stores * Model monitoring ... React / Next.js * TypeScript * Component systems * API integration Observability * Prometheus ...
AI Platform Tech Lead
San Francisco, CA · On-site
PyTorch or TensorFlow * XGBoost / scikit-learn * MLflow / W&B * Feature stores * Model monitoring ... React / Next.js * TypeScript * Component systems * API integration Observability * Prometheus ...
ServiceNow Platform Developer (AI / Now Assist Focus)
Santa Clara, CA · On-site
$63.50 - $87.25/hr
Integrate AI/ML services (OpenAI, Azure Cognitive Services, TensorFlow) into enterprise workflows ... React.js or Node.js development . Experience implementing AI-driven workflows or enterprise ...
Quick apply
ServiceNow Platform Developer (AI / Now Assist Focus)
Santa Clara, CA · On-site
$63.50 - $87.25/hr
Integrate AI/ML services (OpenAI, Azure Cognitive Services, TensorFlow) into enterprise workflows ... React.js or Node.js development . Experience implementing AI-driven workflows or enterprise ...
Software Engineer (Apps)
San Jose, CA · On-site
Work hands-on with modern frameworks (PyTorch, TensorFlow), tools (Python, C/C++, Node.js, Kafka ... Kafka, Kubernetes, Jenkins, RESTful APIs, JavaScript, Node.js. Salary Range: 106,600.00-142,100.00 ...
Software Engineer (Apps)
San Jose, CA · On-site
Work hands-on with modern frameworks (PyTorch, TensorFlow), tools (Python, C/C++, Node.js, Kafka ... Kafka, Kubernetes, Jenkins, RESTful APIs, JavaScript, Node.js. Salary Range: 106,600.00-142,100.00 ...
Tensorflow Js information
See California salary details
$10.91 - $17.79
11% of jobs
$18.94 is the 25th percentile. Wages below this are outliers.
$17.79 - $24.67
83% of jobs
$24.67 - $31.55
0% of jobs
$31.55 - $38.43
0% of jobs
$38.43 - $45.31
0% of jobs
$45.31 - $52.19
4% of jobs
$52.19 - $59.07
0% of jobs
$59.07 - $65.95
0% of jobs
$65.95 - $72.83
0% of jobs
$72.83 - $79.71
0% of jobs
$79.71 - $86.59
2% of jobs
$10
$26
$86
How much do tensorflow js jobs pay per hour?
Is ML a high paying job?
What jobs use TensorFlow?
Is TensorFlow still used in 2026?
What are some common projects or tasks a TensorFlow.js developer typically works on?
As a TensorFlow.js developer, you may work on projects such as building browser-based machine learning models, integrating real-time data predictions into web applications, or converting existing Python-trained models to run client-side. Day-to-day tasks often include designing user interfaces for model interaction, optimizing model performance within browser constraints, and collaborating closely with front-end and back-end teams to deliver seamless user experiences. This role is highly collaborative, and successful developers frequently communicate with product managers or data scientists to align technical implementation with business objectives. There is also significant opportunity to stay updated on the latest web ML trends and contribute to cross-functional innovation within your team.
What are the key skills and qualifications needed to thrive in the Tensorflow Js position, and why are they important?
To thrive in a TensorFlow.js role, you need strong JavaScript programming skills, an understanding of machine learning concepts, and experience developing web applications. Familiarity with TensorFlow.js libraries, browser-based coding environments, and version control systems like Git is highly beneficial. Excellent problem-solving abilities, collaborative teamwork, and effective communication skills help you succeed in fast-paced, multidisciplinary settings. These capabilities are essential for building interactive machine learning solutions that integrate seamlessly with modern web apps and meet real-world business needs.
What is a TensorFlow.js job?
A TensorFlow.js job typically involves developing, deploying, and optimizing machine learning models that run directly in the browser or on Node.js. Professionals in this role work with JavaScript, TensorFlow.js, and related web technologies to build AI-powered applications. Responsibilities may include training models, converting existing TensorFlow models to TensorFlow.js, and improving model performance for web-based environments.
Is TensorFlow still relevant?
- Internship Onboarding Specialist
- Work From Home Baldor Foods
- Internship Online Automation Testing
- Internship Remote Draftsman
- Internship Global Relocation
- Internship Entry Level Cisco Network Engineer
- Internship Internship Sustainable Fashion
- Internship Tableau Desktop Specialist
- Data Scientist Meteorology
- Internship Game Tester

Other
Medical, Life, Retirement, PTO
Posted 15 days ago
Job description
The opportunity
We are building the next generation of AI-driven game experiences, running generative models on-device, right where the players are - on phones, tablets, laptops, and desktops. Our games run inside a modern, browser-native runtime (built on technologies such as WebGPU and WebNN), so the models that power these experiences must be deployed and accelerated entirely within that runtime. As our Principal Engineer for On-Device AI Inference & Systems, you will be the foremost engineering authority on taking state-of-the-art multi-modal models (transformers and diffusion networks) and making them run fast, small, and reliably within that runtime, fully integrated into a production game engine.
This is a deeply hands-on, high-impact engineering role. You will own the inference and integration stack end-to-end - from the moment a trained checkpoint leaves research, through export, optimization, and kernel-level tuning, to a shipped feature running inside the engine at interactive frame rates within a fixed memory and power budget. You will set the engineering standards, drive the architecture of the runtime and integration layers, and mentor a team of senior and mid-level engineers. Your work directly determines the latency, quality, memory footprint, and battery profile of AI features experienced by players worldwide.
This role is for an engineer who is energized by the gap between a research model and a shipping, AI-based product. If you love profilers, frame captures, op-fusion, and shaving milliseconds and megabytes, this is your role.
What you'll be doing
- Inference & On-Device Optimization
- Own the end-to-end optimization pipeline: model export, graph transformation, operator fusion, memory-layout planning, and hardware-specific kernel tuning across NPU, mobile GPU, and desktop/laptop GPU.
- Make authoritative decisions on quantization (INT4/INT8/FP16), weight sharing, structured/unstructured pruning, and knowledge distillation to hit hard latency, memory, and power budgets - and validate them against quality bars.
- Drive low-level performance work: write and tune WebGPU compute shaders (WGSL) and, where relevant, native kernels (Metal, Vulkan/SPIR-V compute, D3D12, CUDA); profile with browser and platform tools (Chrome/Dawn GPU traces, PIX, Instruments/Metal System Trace, Snapdragon Profiler, Nsight, RenderDoc), and eliminate bottlenecks at the op and memory-bandwidth level.
- Apply efficiency techniques - dynamic resolution, token reduction, cross-frame caching/reuse, reduced-step diffusion samplers - as engineering levers to meet budgets on target SKUs.
- Runtime & Systems Integration
- Evaluate, select, and drive adoption of WebGPU-targeted inference runtimes (ONNX Runtime Web, Transformers.js, WebLLM, TensorFlow.js) alongside native options (CoreML, ONNX Runtime, TFLite, ExecuTorch) - and extend or build runtime/glue code where off-the-shelf options fall short of our diffusion workloads.
- Design and own the integration between the ML runtime and the game engine: real-time scheduling, threading, memory pooling, zero-copy buffer sharing between the inference path and the render path, and frame-budget management alongside the renderer.
- Architect inference systems that handle diverse inputs - images, text, primitives, metadata - and produce pixel-level outputs with real-time performance, robust to the messy realities of production (cold starts, thermal throttling, device fragmentation, backgrounding).
- Build the supporting engineering: model packaging and asset pipelines, on-device fallbacks and SKU-aware capability tiers, crash/quality telemetry, and automated on-device benchmarking in CI.
- Research Productionization
- Partner closely with research scientists to turn novel architectures into implementations that are deployable, debuggable, and fast on device.
- Provide the feedback loop back into research: surface hardware constraints, op-support gaps, and cost models early so model design and deployment converge.
- Track breakthroughs in efficient inference (efficient attention, distillation, reduced-step diffusion) and assess them pragmatically: what actually moves latency/memory/power on our target devices, and what is worth the engineering cost.
- Engineering Leadership
- Lead and mentor a team of engineers; set engineering best practices, code-review standards, performance-regression gates, and on-device benchmarking methodology.
- Champion a culture of measurement: define and enforce KPIs for latency, quality, memory, and power, and ensure they are tracked rigorously across the device matrix.
- Partner with platform engineers, product managers, and runtime teams to align ML capabilities with device-SKU constraints and product roadmaps.
What we're looking for
- 8+ years in software/ML engineering, with at least 4 years focused on on-device / edge inference or real-time, performance-critical systems.
- Proven production deployment of transformer- and/or diffusion-based models (e.g., ViT, Stable Diffusion) on mobile, desktop, or embedded hardware - shipped, not just prototyped.
- Hands-on experience deploying models through WebGPU - e.g., ONNX Runtime Web (WebGPU EP), Transformers.js, WebLLM, or TensorFlow.js - including writing/tuning WGSL compute shaders and working within WebGPU's adapter, device-limits, and binding model. Equivalent deep experience with a native GPU/compute API plus a clear path to WebGPU will also be considered.
- Hands-on expertise with at least one major inference runtime (ONNX Runtime / ORT Web, CoreML, TFLite, ExecuTorch) and deep understanding of operator fusion, memory layout, and runtime scheduling.
- Low-level performance engineering: strong command of at least one GPU/compute API - WebGPU/WGSL, Metal, Vulkan, D3D12, or CUDA - and the profiling tools to go with it. You can read a frame capture and a kernel trace and know where the time and memory go.
- Working knowledge of model-optimization techniques - quantization (INT4/INT8/FP16), weight sharing, pruning, and distillation - and the practical judgment to apply them to hit latency and memory budgets. You don't need to be a research expert in these methods; you need to use them effectively as engineering tools.
- Strong understanding of target hardware: mobile SoCs (Apple Neural Engine, Qualcomm Hexagon/Adreno, ARM Mali) and desktop/laptop GPUs (Apple Silicon, NVIDIA, AMD, Intel) - and how to target each for peak throughput.
- Proficiency in the core languages of a browser-native runtime - TypeScript/JavaScript and WGSL - plus solid Python for export pipelines and training-side tooling.
- Working fluency with the models you deploy - enough to read an architecture, modify it for deployment, and reason about accuracy trade-offs.
- Track record of technical leadership: setting engineering direction, influencing cross-functional partners, and growing engineers.
You might also have
- Experience shipping world-model, neural-rendering, or real-time generative pipelines (NeRF, 3DGS, real-time diffusion, or similar) on device.
- Deep game-engine or real-time-graphics background (Unity, Unreal, or a custom engine; Metal/Vulkan/D3D/OpenGL ES render pipelines) - especially integrating compute workloads alongside a renderer.
- Contributions to open-source ML inference frameworks, runtimes, or GPU/compute libraries - especially in the WebGPU ecosystem (Dawn, wgpu, ORT Web, Transformers.js, WebLLM).
- Familiarity with the WebGPU specification and its evolving compute features (subgroups, FP16/shader-f16, timestamp queries) and the trade-offs of running heavy diffusion workloads in the browser/web runtime.
- Familiarity with compiler stacks (MLIR, TVM, IREE, XLA) for custom kernel generation and graph optimization.
- Experience with on-device benchmarking infrastructure, performance-regression CI, and large device-farm matrices.
Additional information
- International relocation support is not available for this position
- Work visa/immigration sponsorship is not available for this position
Benefits
At Unity, we want our team members to thrive. We offer a wide range of benefits designed to support well-being and work-life balance.
Please note: Benefits eligibility, specific offerings, and coverage vary based on the country and employment status.
While specific benefits vary, here are some of the ways we strive to take care of our eligible team members globally: Comprehensive health, life, and disability insurance | Commute subsidy | Employee stock ownership | Competitive retirement/pension plans | Generous vacation and personal days | Support for new parents through leave and family-care programs | Office food snacks | Mental Health and Wellbeing programs and support | Employee Resource Groups | Global Employee Assistance Program | Training and development programs | Volunteering and donation matching program
Life at Unity
Unity [NYSE: U] is the world's leading game engine, powering play for more than 3 billion consumers each month. The top mobile games in the world, the most played PC indie titles, the most innovative console games, and virtually all of the top XR and Web Games are developed, deployed, and grown in Unity. Unity also enables teams across industries like automotive, manufacturing, and healthcare to design, simulate, and collaborate in 3D - closing the gap between ideas and reality. For more information, please visit www.unity.com.
Unity is a proud equal opportunity employer. We are committed to fostering an inclusive, innovative environment and celebrate our employees across age, race, color, ancestry, national origin, religion, disability, sex, gender identity or expression, sexual orientation, or any other protected status in accordance with applicable law. Our differences are strengths that enable us to support the growing and evolving needs of our customers, partners, and collaborators. If you have a disability that means there are preparations or accommodations we can make to help ensure you have a comfortable and positive interview experience, please fill out this form to let us know.
This position requires the incumbent to have a sufficient knowledge of English to have professional verbal and written exchanges in this language since the performance of the duties related to this position requires frequent and regular communication with colleagues and partners located worldwide and whose common language is English.
This posting is intended to fill an existing vacancy, and we are committed to providing applicants with updates throughout the hiring process in accordance with applicable law
Headhunters and recruitment agencies may not submit resumes/CVs through this website or directly to managers. Unity does not accept unsolicited headhunter and agency resumes. Unity will not pay fees to any third-party agency or company that does not have a signed agreement with Unity.
Your privacy is important to us. Please take a moment to review our Prospect Privacy Policy and Applicant Privacy Policy. Should you have any concerns about your privacy, please contact us at DPO@unity.com.
#DIR #LI-MC1