2

Remote Observability Jobs in California (NOW HIRING)

In this role, you will help redefine what is possible with observability and security data. As part ... Must be comfortable working in a high performance remote-first environment. Responsibilities

In this role, you will help redefine what is possible with observability and security data. As part ... Must be comfortable working in a high performance remote-first environment. Responsibilities

Staff Software Engineer (Remote)

Burlingame, CA ยท On-site +1

$195K - $230K/yr

In this role, you will help redefine what is possible with observability and security data. As part ... Must be comfortable working in a high performance remote-first environment. Responsibilities

SRE Engineer

San Francisco, CA ยท Remote

$67.25 - $89.25/hr

USA / REMOTE Key Responsibilities 1. SRE Fundamentals & Reliability Engineering Apply core SRE ... Observability Strategy & Tool Recommendation (Core Responsibility) Act as the central point of ...

Senior Site Reliability Engineer

San Diego, CA ยท Remote

$60.50 - $80.50/hr

... observability under a single, integrated operating model. Our work focuses on helping customers ... This is a remote, contract opportunity for a project Arctiq is delivering for a client. Candidates ...

This role sits at the intersection of observability, security intelligence, reporting, and platform ... For role with a flexible work location (office/hybrid or remote) The preferred locations for this ...

Senior Build Systems & Pipeline Engineer

San Jose, CA ยท Remote

$122K - $168K/yr

Improve build and test observability through metrics, telemetry, dashboards, alerting, tracing, and ... Enable scalable remote caching and distributed build execution strategies * Apply AI-assisted ...

next page

Showing results 1-20

Remote Observability information

What are some common challenges faced by professionals in a Remote Observability role, and how can they be addressed?

Professionals in Remote Observability often face challenges such as monitoring complex, distributed systems, ensuring reliable data collection, and quickly identifying the root causes of issues without physical access to infrastructure. To address these challenges, it's essential to implement robust monitoring tools, establish clear alerting thresholds, and maintain strong communication with development and operations teams. Regular knowledge-sharing sessions and continuous learning about new observability platforms can also help remote teams stay effective and proactive.

What is the difference between Remote Observability vs Remote Monitoring?

AspectRemote ObservabilityRemote Monitoring
FocusComprehensive system insights, including logs, metrics, and tracesTracking specific system metrics and alerts
ToolsOpenTelemetry, Grafana, JaegerNagios, Zabbix, Datadog
Work EnvironmentDevOps, SRE teams managing complex distributed systemsIT operations teams overseeing system health
CredentialsKnowledge of cloud platforms, scripting, and monitoring toolsBasic networking, system administration skills

Remote Observability provides a holistic view of system health through logs, metrics, and traces, enabling proactive troubleshooting. Remote Monitoring focuses on tracking specific metrics and alerts to detect issues. While both roles involve system oversight, observability offers deeper insights for complex environments, whereas monitoring emphasizes real-time alerts for system stability.

What are the key skills and qualifications needed to thrive as a Remote Observability Engineer, and why are they important?

To thrive as a Remote Observability Engineer, you need expertise in monitoring, logging, and tracing, typically supported by experience in systems administration or DevOps and a relevant technical degree. Familiarity with observability tools like Prometheus, Grafana, Datadog, ELK Stack, and cloud monitoring platforms, as well as certifications such as AWS Certified Cloud Practitioner or Google Professional Cloud DevOps Engineer, is highly valued. Strong analytical thinking, problem-solving, and effective communication are vital soft skills for diagnosing issues and collaborating with distributed teams. These skills and qualifications ensure reliable system performance, rapid incident response, and seamless user experiences in complex, cloud-based environments.

What is remote observability?

Remote observability refers to the ability to monitor, measure, and understand the state and performance of systems, applications, or infrastructure from a distance, typically using specialized tools and platforms. It is crucial for organizations that operate distributed or cloud-based environments, as it allows teams to detect issues, analyze metrics, and ensure reliability without needing physical access to the hardware. Remote observability often involves collecting logs, metrics, traces, and other telemetry data to provide a comprehensive view of system health and performance.
What are the most commonly searched types of Observability jobs in California? The most popular types of Observability jobs in California are:
What job categories do people searching Remote Observability jobs in California look for? The top searched job categories for Remote Observability jobs in California are:
What cities in California are hiring for Remote Observability jobs? Cities in California with the most Remote Observability job openings:
Staff Software Engineer, Observability

Staff Software Engineer, Observability

Pinterest

San Francisco, CA โ€ข On-site, Remote

Other

Posted 13 days ago


Job description

We're seeking an exceptional Staff Software Engineer to join our Observability team at Pinterest. This role combines deep technical expertise in distributed systems and data engineering with a product-oriented mindset to build world-class observability solutions that empower our engineering organization.ย As a Staff Engineer on the Observability team, you'll be responsible for designing and building the infrastructure and tools that provide visibility into Pinterest's large-scale distributed systems, helping thousands of engineers understand, debug, and optimize their services.

What you'll do:

  • Define and execute the observability roadmap, treating it as a product. Understand engineering team needs and translate them into technical solutions with measurable impact
  • Architect, build, and scale distributed observability infrastructure (metrics, logs, traces) to handle massive volumes across Pinterest's distributed systems
  • Build high-performance data pipelines and storage for real-time and historical telemetry analysis at Pinterest scale
  • Champion Best Practices: Establish observability standards and patterns across the organization, making it easy for teams to instrument their services and gain actionable insights
  • Technical Leadership: Mentor engineers, lead architectural reviews, and influence technical decisions across teams to improve overall system reliability and performance
  • Cross-functional Collaboration: Partner with SRE, Infrastructure, Product Engineering, and other teams to understand pain points and deliver solutions that improve developer productivity and system reliability
  • Innovation: Stay current with observability trends and technologies, evaluating and adopting cutting-edge tools and techniques to keep Pinterest at the forefront

What we're looking for:

  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience
  • Product Mindset: Demonstrated ability to work backwards from customer needs -understanding user needs, prioritizing features, measuring success, and iterating based on feedback. Experience building internal platforms or tools with strong adoption
  • Distributed Systems Expertise: 7+ years of experience designing and operating large-scale distributed systems with deep understanding of consistency, availability, scalability, and failure modes
  • Data Engineering Skills: Strong background in building data pipelines, working with time-series databases, columnar storage, stream processing (Kafka, Flink, etc.), and data modeling at scale
  • Observability Domain Knowledge: Hands-on experience with modern observability tools and practices including metrics, logging, tracing, and profiling. Familiarity with OpenTelemetry, Prometheus, Grafana, or similar technologies
  • Programming Proficiency: Expert-level coding skills in languages like Java, Python, Go, or Scala with ability to write production-quality code
  • Systems Thinking: Ability to see the big picture while managing complex technical details, balancing trade-offs between cost, performance, and reliability
  • Experience building observability platforms from the ground up or significantly scaling existing solutions
  • Familiarity with cloud-native architectures and technologies (Kubernetes, service mesh, etc.)
  • Track record of driving adoption of internal platforms through excellent documentation, UX, and developer advocacy
  • Experience with machine learning or anomaly detection applied to observability use cases
  • Strong communication skills with ability to influence stakeholders at all levels
  • Contributions to open-source observability projects, a plus

In-Office Requirement Statement:ย 

  • We let the type of work you do guide the collaboration style. That means we're not always working in an office, but we continue to gather for key moments of collaboration and connection.
  • This role will need to be in the office for in-person collaboration 1-2 times/quarter and therefore can be situated anywhere in the country.ย 

Relocation Statement:

  • This position is not eligible for relocation assistance. Visit our PinFlex page to learn more about our working model.

#LI-REMOTE

#LI-JT1