1

Openvino Jobs (NOW HIRING)

Senior Engineer (ML/AI)

$107K - $146.90K/yr

... OpenVINO targeting CPU, GPU, and NPU backends • Build pipelines for training, evaluation, benchmarking, and regression testing • Define and improve accuracy, latency, and resource metrics • ...

Senior Engineer (ML/AI)

$107K - $146.90K/yr

... OpenVINO targeting CPU, GPU, and NPU backends • Build pipelines for training, evaluation, benchmarking, and regression testing • Define and improve accuracy, latency, and resource metrics • ...

PyTorch, TensorFlow, ONNX Runtime, TVM, TensorRT, or OpenVINO. * Understanding of low-level hardware acceleration (e.g., SIMD, AVX, Tensor Cores, VNNI). * Familiarity with compiler optimizations for ...

Experience with ONNX, TensorRT, or OpenVINO for deployment. * Robotics middleware (ROS2). * SLAM, 3D perception, or sensor fusion (LiDAR, IMU). * Real-time or low-latency inference pipelines. Why ...

Experience with OpenCV, TensorRT, or OpenVINO for vision optimization. * Familiarity with ML frameworks like PyTorch or TensorFlow . * Knowledge of industrial protocols (MQTT, WebSockets) for real ...

Software Engineer - ML Infrastructure

San Francisco, CA · On-site

$203.80K - $241.50K/yr

Strong experience with ML frameworks (PyTorch, TensorFlow) and model optimization tools (TensorRT, ONNX Runtime, OpenVINO). * Deep understanding of computer vision architectures and their deployment ...

OR

$122.40K - $161.30K/yr

Prior experience with AI frameworks and engines, such as TensorRT, PyTorch, ONNX, OpenVINO, vLLM, or TRT-LLM. * Knowledge of GPU memory management, cache management, or high-performance networking.

Practical experience implementing CV ML models using deep learning inference engines such as OpenCV, TensorRT, TensorFlow/Keras/TensorFlow Lite, PyTorch, OpenVINO, and/or Qualcomm Neural Processing ...

Practical experience implementing CV ML models using deep learning inference engines such as OpenCV, TensorRT, TensorFlow/Keras/TensorFlow Lite, PyTorch, OpenVINO, and/or Qualcomm Neural Processing ...

next page

Showing results 1-20

Openvino information

What are the key skills and qualifications needed to thrive as an OpenVINO Developer, and why are they important?

To thrive as an OpenVINO Developer, you need a solid background in computer vision, deep learning, and proficiency in Python or C++, often supported by a degree in computer science or a related field. Familiarity with the OpenVINO toolkit, neural network optimization, and frameworks like TensorFlow or PyTorch is essential. Strong problem-solving skills, attention to detail, and effective communication help developers collaborate and innovate in deploying AI solutions. These skills ensure optimized AI performance on edge devices and successful integration of machine learning models into production environments.

What are some common challenges faced by professionals working with OpenVINO in deploying AI models to edge devices?

Professionals working with OpenVINO often encounter challenges related to optimizing and converting AI models to ensure they run efficiently on diverse edge hardware. These challenges include ensuring compatibility with various device architectures, minimizing inference latency, and handling limitations in memory and compute resources. Additionally, troubleshooting model conversion errors and maintaining accuracy during optimization are frequent tasks. Collaboration with hardware engineers and software developers is also essential to address performance bottlenecks and integrate solutions smoothly into production environments.

What is OpenVINO and what is it used for?

OpenVINO (Open Visual Inference and Neural Network Optimization) is a free toolkit developed by Intel to help developers optimize and deploy AI inference, particularly deep learning models, across Intel hardware such as CPUs, GPUs, VPUs, and FPGAs. It is mainly used for accelerating computer vision applications like image classification, object detection, and facial recognition. OpenVINO streamlines the process of converting and optimizing models from popular frameworks, making them faster and more efficient for edge and cloud deployments.

What is the difference between Openvino vs Computer Vision Engineer?

AspectOpenvinoComputer Vision Engineer
Required CredentialsKnowledge of AI frameworks, hardware optimization, programming skillsDegree in Computer Science, Electrical Engineering, or related fields; experience in AI and image processing
Work EnvironmentTech companies, AI development labs, hardware manufacturersResearch institutions, tech firms, startups, or industry-specific companies
Employer & Industry UsagePrimarily used in AI inference optimization, embedded systems, and hardware accelerationDeveloping and implementing computer vision algorithms for applications like surveillance, robotics, and autonomous vehicles

Openvino focuses on optimizing AI inference workloads and hardware acceleration, often requiring knowledge of AI frameworks and hardware. In contrast, a Computer Vision Engineer designs and develops vision algorithms, working across various industries. While both roles involve AI and image processing, Openvino is more specialized in deployment and optimization, whereas Computer Vision Engineers focus on algorithm development and application.

More about Openvino jobs
What cities are hiring for Openvino jobs? Cities with the most Openvino job openings:
What states have the most Openvino jobs? States with the most job openings for Openvino jobs include:
Infographic showing various Openvino job openings in the United States as of May 2026, with employment types broken down into 100% Full Time. Highlights an 81% Physical, and 19% Remote job distribution.

$107K - $146.90K/yr

Full-time

Posted 5 days ago


Job description

Job Summary:
Cephable is an innovative company focused on privacy-first, on-device AI solutions. They are seeking a Lead Machine Learning Engineer to advance their core ML systems, with responsibilities that include model development, optimization, and deployment for various applications.
Responsibilities:
• Design, train, fine-tune, and evaluate ML models for speech recognition, generative and reasoning models, and multimodal inference
• Adapt open-source and foundation models using Hugging Face and related tooling
• Translate research ideas into production-ready systems
• Optimize models for low-latency, low-power, offline execution
• Perform quantization, pruning, and distillation
• Deploy models via ONNX Runtime and OpenVINO targeting CPU, GPU, and NPU backends
• Build pipelines for training, evaluation, benchmarking, and regression testing
• Define and improve accuracy, latency, and resource metrics
• Partner with application and platform engineers to ensure seamless ML integration
• Communicate model performance, architectural decisions, and technical tradeoffs clearly to both technical and non-technical stakeholders
• Own Cephable’s ML architecture
• Set best practices and mentor team members
• Evaluate new tools, frameworks, and hardware
• Mentor engineers across the team on ML concepts and practices as the org grows
Qualifications:
Required:
• 4+ years of experience in machine learning or ML systems
• Strong PyTorch experience
• Hands-on experience with Hugging Face
• Production deployment using ONNX Runtime and/or OpenVINO
• Experience with acceleration frameworks like CUDA and GPU workflows
• Strong software engineering skills (Python, C++, or systems-level experience)
• Excellent communication skills — able to explain complex ML concepts, tradeoffs, and decisions clearly to engineers, product stakeholders, and non-technical partners alike
Preferred:
• Speech recognition or voice assistant experience
• LLMs, SLMs, or reasoning models
• Multimodal ML experience
• Edge or on-device AI background
• Experience with QNN, WinML, and CoreML
Company:
Cephable offers an ambient user interface platform that enables control of digital tools. Founded in 2023, the company is headquartered in Boston, USA, with a team of 11-50 employees. The company is currently Early Stage.