1

Machine Learning Ops Engineer Jobs in California

Sr. Machine Learning Ops Engineer

Los Angeles, CA · On-site

$140K - $179K/yr

They are seeking a Senior Machine Learning Ops Engineer to lead the design and maintenance of scalable infrastructure for ML model deployment and lifecycle management, with a focus on productionizing ...

Sr. Machine Learning Ops Engineer

Los Angeles, CA · On-site

$112K - $154K/yr

CIM Group is a community-focused real estate and infrastructure company seeking a Senior ML Ops Engineer to lead the design and maintenance of scalable infrastructure for ML model deployment and ...

ML Ops Engineer Concord CA Contract Cleint Round - Inperson Overview Tachyon Cortex Machine Learning AI team seeking a ML Ops Engineer to drive the full lifecycle of machine learning solutions. Key ...

AI/ML Ops Engineer Location - Pleasanton CA (Onsite) Job Type - Contract We are looking for a ... machine learning model deployment and management. Key Responsibilities: * Design, build, and ...

Infrastructure Engineer

San Francisco, CA · On-site

$126K - $166K/yr

They are seeking an Infrastructure Engineer to automate and manage their infrastructure ... Machine Learning Ops/Infrastructure Company : Chalk is a data platform for AI inference that ...

Adobe is looking for a Senior Machine Learning Engineer to help shape the future of agentic AI in ... Ops best practices, delivering high quality, production ready code. • Design and build ML ...

Analyze and profile machine learning models to identify performance bottlenecks and areas for optimization. Implement and apply model optimization techniques such as quantization, pruning ...

next page

Showing results 1-20

Machine Learning Ops Engineer information

See California salary details

$31.1K

$127.1K

$191K

How much do machine learning ops engineer jobs pay per year?

As of Jun 19, 2026, the average yearly pay for machine learning ops engineer in California is $127,083.00, according to ZipRecruiter salary data. Most workers in this role earn between $100,200.00 and $153,000.00 per year, depending on experience, location, and employer.

What is a Machine Learning Ops Engineer job?

A Machine Learning Ops Engineer (MLOps Engineer) focuses on deploying, monitoring, and maintaining machine learning models in production. They bridge the gap between data science and software engineering, ensuring models run efficiently, reliably, and at scale. Their responsibilities include automating workflows, managing infrastructure, and ensuring CI/CD pipelines for ML models. They work with tools like Kubernetes, Docker, and cloud platforms to streamline model deployment. Ultimately, an MLOps Engineer ensures that machine learning models are operationalized and continuously improved in a real-world environment.

What does a typical day look like for a Machine Learning Ops Engineer?

A typical day for a Machine Learning Ops Engineer involves collaborating with data scientists to streamline the deployment of models, building and maintaining scalable infrastructure on cloud services, and automating workflows with CI/CD tools. You may troubleshoot issues in production environments, monitor model performance, and implement solutions for model versioning and retraining. Often, you’ll work closely with software engineers, DevOps teams, and data analysts to ensure seamless integration of machine learning solutions into products. This cross-functional role keeps you engaged with cutting-edge technology and provides opportunities to influence both technical and business outcomes.

What are the key skills and qualifications needed to thrive in the Machine Learning Ops Engineer position, and why are they important?

To thrive as a Machine Learning Ops Engineer, you need a solid grasp of machine learning concepts, cloud platforms, software engineering, and DevOps practices, typically supported by a degree in computer science or a related field. Experience with tools like Docker, Kubernetes, TensorFlow, CI/CD pipelines, and certifications such as AWS Certified Machine Learning – Specialty are highly valuable. Strong problem-solving skills, communication, and the ability to work collaboratively across data science and engineering teams set top candidates apart. These skills ensure reliable deployment, scalability, and optimization of machine learning models in production environments.

What cities in California are hiring for Machine Learning Ops Engineer jobs? Cities in California with the most Machine Learning Ops Engineer job openings:
Infographic showing various Machine Learning Ops Engineer job openings in California as of June 2026, with employment types broken down into 6% Internship, 88% Full Time, 3% Contract, and 3% Nights. Highlights an 91% In-person, and 9% Remote job distribution, with an average salary of $127,083 per year, or $61.1 per hour.
Sr. Machine Learning Ops Engineer

Sr. Machine Learning Ops Engineer

CIM Group

Los Angeles, CA • On-site

$140K - $179K/yr

Full-time

Posted 6 days ago


Job description

Job Summary:
CIM Group is a community-focused real estate and infrastructure owner, operator, lender, and developer. They are seeking a Senior Machine Learning Ops Engineer to lead the design and maintenance of scalable infrastructure for ML model deployment and lifecycle management, with a focus on productionizing Generative AI solutions.
Responsibilities:
• Lead the design, implementation, and ongoing maintenance of scalable ML infrastructure on Databricks, including ML flow for experiment tracking, model registry, and model serving endpoints.
• Oversee the development of the ML Ops platform and automated pipelines for deploying, monitoring, and maintaining models within production environments.
• Implement robust solutions for model versioning, systematic retraining, and comprehensive artifact management using Databricks Unity Catalog for ML governance.
• Design and manage Databricks Feature Store for consistent feature engineering across training and inference pipelines.
• Architect and implement Retrieval-Augmented Generation (RAG) systems for document Q&A, enabling business teams to query fund documents, investor letters, and market research.
• Design, deploy, and manage vector database solutions (Databricks Vector Search, Pinecone, or similar) for semantic search and retrieval across enterprise documents.
• Lead LLM fine-tuning and customization initiatives, training models like Claude or open-source alternatives with CIM proprietary data while ensuring data privacy and compliance.
• Develop and optimize document processing pipelines including PDF parsing, chunking strategies, and embedding generation for RAG applications.
• Implement prompt engineering best practices and LLM evaluation frameworks to ensure output quality, relevance, and factual accuracy.
• Build guardrails and safety measures for GenAI applications, including hallucination detection, output validation, and source attribution.
• Design and implement extensive automation across the ML workflow, covering model training, testing, validation, and deployment using Databricks Workflows and Asset Bundles.
• Set up robust CI/CD pipelines for both traditional ML models and GenAI applications, leveraging GitHub Actions, Azure DevOps, or similar tools.
• Automate complex data and model workflows utilizing orchestration tools such as Airflow, Prefect, or Databricks Workflows.
• Implement comprehensive monitoring and alerting systems for real-time tracking of model performance, data quality, and GenAI output quality.
• Utilize specialized tools (Evidently AI, WhyLabs, Prometheus/Grafana) to proactively detect model drift, data quality anomalies, and RAG retrieval degradation.
• Develop evaluation frameworks for GenAI applications including relevance scoring, faithfulness metrics, and human feedback loops.
• Troubleshoot issues within production environments, including debugging model deployment failures, RAG retrieval issues, and LLM response quality problems.
• Build and maintain sophisticated feature stores on Databricks, ensuring precise alignment between training and inference data pipelines.
• Collaborate with data engineers and information architects to build robust ETL pipelines that feed into the Databricks Lakehouse.
• Design embedding pipelines and vector index management strategies for RAG applications, including incremental updates and versioning.
• Integrate robust security measures directly into ML Ops and GenAI pipelines, including access controls via Unity Catalog and data encryption.
• Implement Trustworthy AI guardrails addressing bias detection, explainability, prompt injection prevention, and responsible AI practices.
• Ensure GenAI applications handling sensitive fund and investor data comply with regulatory requirements and internal policies.
• Collaborate with Legal and Compliance to establish AI governance policies and audit trails for model decisions.
• Engage in extensive collaboration with data scientists, platform engineers, information architects, and DevOps teams to ensure seamless ML/AI integration.
• Partner with business teams (Fund Accounting, FP&A, Investor Relations, Sales, Investments) to identify high-value AI use cases and translate business needs into technical solutions.
• Communicate complex AI concepts in business terms, managing expectations and demonstrating ROI of ML/GenAI initiatives.
• Provide technical mentorship to team members, including refactoring data scientist code for production readiness.
Qualifications:
Required:
• Bachelor's or Master's degree in Computer Science, Engineering, Information Systems, or a related field.
• 7+ years of experience as an ML Ops Engineer, ML Engineer, or similar role with production deployment responsibility.
• Expert-level proficiency in Python, complemented by strong skills in Bash scripting.
• Extensive experience designing and implementing cloud solutions on Azure (required) or GCP.
• Deep expertise with Docker and Kubernetes for containerizing and orchestrating ML workloads.
• Hands-on experience with CI/CD tools such as GitHub Actions, Jenkins, GitLab CI, or Azure DevOps.
• Strong SQL proficiency and practical experience with Databricks platform.
• Experience with workflow orchestration tools (Airflow, Prefect, or Databricks Workflows) and monitoring tools (Prometheus, Grafana, Evidently AI).
• Demonstrated experience building and deploying RAG (Retrieval-Augmented Generation) systems in production environments.
• Hands-on experience with vector databases (Databricks Vector Search, Pinecone, Weaviate, Chroma, or Milvus).
• Experience with LLM APIs and frameworks (OpenAI, Anthropic Claude, LangChain, LlamaIndex).
• Understanding of embedding models, chunking strategies, and retrieval optimization techniques.
• Knowledge of prompt engineering best practices and LLM evaluation methodologies.
• Experience with ML flow for experiment tracking, model registry, and model serving.
• Familiarity with Databricks Feature Store and Unity Catalog for ML governance.
• Understanding of Delta Lake and Lakehouse architecture for ML data pipelines.
• Experience with Databricks Model Serving endpoints and inference optimization.
Preferred:
• Experience with LLM fine-tuning techniques (LoRA, QLoRA, full fine-tuning) on proprietary data.
• Familiarity with ML frameworks including TensorFlow, PyTorch, Scikit-learn, XGBoost.
• Experience with model serialization (ONNX) and inference optimization.
• Prior experience within financial services, fintech, or private equity sectors.
• Experience building ML/AI infrastructure from scratch in entrepreneurial environments.
• Relevant certifications: Azure AI Engineer Associate, Databricks ML Professional, Google Cloud ML Engineer.
Company:
CIM is a community-focused real estate and infrastructure owner, operator, lender and developer. Founded in 1994, the company is headquartered in Los Angeles, USA, with a team of 501-1000 employees. The company is currently Late Stage.