1

Observability Manager Jobs in California (NOW HIRING)

The Senior Manager, Observability Engineering will lead a team responsible for building, scaling, and operating observability systems, defining strategy and roadmap, and driving platform reliability ...

CoreWeave is seeking a Senior Manager, Observability Engineering to lead a team responsible for building, scaling, and operating observability systems across metrics, logs, traces, and telemetry ...

CoreWeave is seeking a Senior Manager, Observability Engineering to lead a team responsible for building, scaling, and operating observability systems across metrics, logs, traces, and telemetry ...

CoreWeave is seeking a Senior Manager, Observability Engineering to lead a team responsible for building, scaling, and operating observability systems across metrics, logs, traces, and telemetry ...

Senior Observability Engineer

Irvine, CA · On-site

$112K - $154K/yr

As an expert technology adviser and managed service provider with cross-platform certifications ... As a Senior Observability Engineer, you will be responsible for designing, implementing, migrating ...

Senior Observability Engineer

Irvine, CA · On-site

$112K - $154K/yr

As an expert technology adviser and managed service provider with cross-platform certifications ... As a Senior Observability Engineer, you will be responsible for designing, implementing, migrating ...

next page

Showing results 1-20

Observability Manager information

What is the difference between Observability Manager vs Site Reliability Engineer?

AspectObservability ManagerSite Reliability Engineer
CredentialsTypically requires experience in monitoring, logging, and cloud tools; certifications like AWS, Google Cloud, or Kubernetes are commonRequires strong background in systems engineering, scripting, and cloud platforms; certifications like AWS, GCP, or Linux are often preferred
Work EnvironmentFocuses on overseeing observability tools, data analysis, and team coordination in tech environmentsHands-on role involving system automation, incident response, and infrastructure reliability
Industry UsageUsed across tech companies to improve system visibility and performanceCommon in DevOps and SRE teams to ensure system reliability and uptime

The Observability Manager primarily oversees monitoring and logging strategies, ensuring system visibility, while the Site Reliability Engineer is more hands-on, focusing on automating infrastructure and maintaining system reliability. Both roles require technical expertise and often collaborate closely but differ in scope and daily responsibilities.

What are the most commonly searched types of Observability jobs in California? The most popular types of Observability jobs in California are:
What are popular job titles related to Observability Manager jobs in California? For Observability Manager jobs in California, the most frequently searched job titles are:
What job categories do people searching Observability Manager jobs in California look for? The top searched job categories for Observability Manager jobs in California are:
What cities in California are hiring for Observability Manager jobs? Cities in California with the most Observability Manager job openings:
Senior Manager, Observability

Senior Manager, Observability

CoreWeave

Sunnyvale, CA • On-site

Full-time

Posted 12 days ago


Job description

Job Summary:
CoreWeave is The Essential Cloud for AI™, delivering a platform that enables innovators to build and scale AI with confidence. The Senior Manager, Observability Engineering will lead a team responsible for building, scaling, and operating observability systems, defining strategy and roadmap, and driving platform reliability and performance improvements.
Responsibilities:
• Lead a team responsible for building, scaling, and operating observability systems across metrics, logs, traces, and telemetry pipelines.
• Define strategy and roadmap for observability platforms.
• Drive platform reliability and performance improvements.
• Guide architectural decisions across observability infrastructure.
• Partner closely with infrastructure, platform, security, and application engineering teams to improve instrumentation and production visibility.
Qualifications:
Required:
• 8+ years of software engineering experience with production systems at scale
• 4+ years of engineering management experience leading senior engineers and technical leads
• Experience building and operating observability platforms across logs, metrics, traces, and alerting in distributed systems
• Knowledge of reliability engineering concepts including SLOs, SLIs, incident management, error budgets, and fault-tolerant design
• Experience scaling telemetry systems including collection pipelines, storage backends, and query layers
• Experience with distributed systems, performance engineering, and trade-offs involving scale, resilience, and cost
• Experience partnering with infrastructure, security, and application engineering teams to drive platform adoption
• Experience hiring and managing engineering teams
Preferred:
• Experience with OpenTelemetry, Grafana, Prometheus-compatible systems, log aggregation, and distributed tracing tools
• Experience operating cloud-native infrastructure, including Kubernetes environments
• Experience supporting large-scale cloud, developer platforms, or AI/ML infrastructure
• Familiarity with capacity planning for high-ingest telemetry systems
• Experience scaling platforms in high-growth environments
Company:
CoreWeave provides cloud infrastructure services designed to support artificial intelligence and high-performance computing workloads. Founded in 2017, the company is headquartered in Livingston, USA, with a team of 1001-5000 employees. The company is currently Late Stage.