1

Executive Observability Engineer Jobs in Virginia

... observability, and service-level indicator frameworks supporting AI and machine learning model ... programming interfaces, reverse proxies, model zoo interfaces, and external provider integrations ...

DevSecOps Engineer

Herndon, VA · On-site

$104K - $166K/yr

Evaluate emerging technologies, pilot automation/observability tools, and codify best practices to ... Ability to produce governance artifacts, technical assessments, and executive briefings; excellent ...

DevSecOps Engineer

Herndon, VA · On-site

$104K - $166K/yr

Evaluate emerging technologies, pilot automation/observability tools, and codify best practices to ... Ability to produce governance artifacts, technical assessments, and executive briefings; excellent ...

Senior Staff Software Engineer

Tysons Corner, VA · On-site

$123K - $162K/yr

Raise reliability through observability, on call support, and capacity planning with operations ... Set the Protocols roadmap and communicate tradeoffs and decisions to executives, partners, and ...

New

next page

Showing results 1-20

Executive Observability Engineer information

What is the difference between Executive Observability Engineer vs Site Reliability Engineer?

AspectExecutive Observability EngineerSite Reliability Engineer
CredentialsTypically requires expertise in observability tools, monitoring, and cloud platformsRequires skills in systems engineering, coding, and infrastructure management
Work EnvironmentFocuses on designing observability solutions, analyzing system health, and strategic monitoringManages system reliability, automates deployment, and maintains infrastructure
Industry UsageUsed in tech companies emphasizing system visibility and performance analysisCommon in cloud services, SaaS, and large-scale web services

The Executive Observability Engineer primarily focuses on implementing and optimizing observability tools to ensure system health, while the Site Reliability Engineer concentrates on maintaining system reliability and automating infrastructure. Both roles require technical expertise but differ in their strategic versus operational focus.

What are some common challenges Executive Observability Engineers face when implementing organization-wide monitoring solutions?

Executive Observability Engineers often encounter challenges such as integrating diverse monitoring tools across legacy and modern systems, ensuring data consistency, and balancing comprehensive visibility with system performance. Coordinating with multiple teams to align observability goals and fostering a culture of proactive monitoring can also require strong communication and leadership skills. Successfully managing these complexities leads to more resilient infrastructure and improved incident response times.

What are Executive Observability Engineers?

Executive Observability Engineers are specialized IT professionals who design and manage systems that monitor, analyze, and optimize the health and performance of an organization's technology infrastructure. They focus on providing high-level visibility into applications, networks, and services, enabling executives to make data-driven decisions. These engineers implement observability tools, create dashboards, and generate reports that translate complex technical metrics into actionable business insights. Their work is crucial for ensuring system reliability, quick incident response, and continuous improvement across the organization.

What are the key skills and qualifications needed to thrive as an Executive Observability Engineer, and why are they important?

To thrive as an Executive Observability Engineer, you need deep expertise in performance monitoring, distributed systems, and troubleshooting, often supported by a degree in computer science or a related field. Familiarity with observability tools such as Datadog, New Relic, Prometheus, and advanced logging systems, as well as certifications like AWS Certified DevOps Engineer, is highly beneficial. Strong analytical thinking, communication, and leadership skills set top candidates apart in this role. These skills and qualities are crucial to proactively detect issues, optimize system performance, and drive strategic decision-making across complex technical environments.
What are the most commonly searched types of Observability Engineer jobs in Virginia? The most popular types of Observability Engineer jobs in Virginia are:
What are popular job titles related to Executive Observability Engineer jobs in Virginia? For Executive Observability Engineer jobs in Virginia, the most frequently searched job titles are:
What job categories do people searching Executive Observability Engineer jobs in Virginia look for? The top searched job categories for Executive Observability Engineer jobs in Virginia are:
What cities in Virginia are hiring for Executive Observability Engineer jobs? Cities in Virginia with the most Executive Observability Engineer job openings:
Senior AI Solutions Engineer - Customer Success and Deployment

Senior AI Solutions Engineer - Customer Success and Deployment

Oracle

Reston, VA • On-site

$57.50 - $74/hr

Full-time

Medical, Dental, Vision, Life, Retirement, PTO

Posted 15 days ago


Oracle rating

8.7

Company rating: 8.7 out of 10

Based on 146 frontline employees who took The Breakroom Quiz

43rd of 202 rated software companies


Job description


Senior AI Solutions Engineer - Customer Success and Deployment
Oracle Government, Defense & Intelligence is seeking a highly technical and customer-focused AI Solutions Engineer to serve as the primary technical interface between Oracle and strategic customers deploying Large Language Models (LLMs) on Oracle Cloud Infrastructure (OCI), including OCI Isolated Regions and sovereign environments.
This role combines deep AI/ML engineering expertise with customer engagement, solution architecture, performance optimization, and operational excellence. The successful candidate will work directly with customer technical teams, business stakeholders, Oracle engineering, product management, operations, and cloud infrastructure teams to ensure deployed AI solutions meet mission requirements, performance expectations, and operational objectives.
The ideal candidate can translate business goals into technical solutions, explain complex AI concepts to both executive and technical audiences, and act as a trusted advisor throughout deployment, testing, optimization, and production operations.
  • MUST possess or have the ability to obtain and maintain an active TS/SCI with FS poly
  • Full time in office position.

Responsibilities
Key Responsibilities
Customer Technical Leadership
  • Serve as the primary technical representative for Oracle during customer AI and Generative AI deployments.
  • Build trusted advisor relationships with customer engineering, operations, security, and leadership teams.
  • Translate customer mission requirements, business objectives, and operational constraints into scalable AI deployment strategies.
  • Communicate model capabilities, limitations, performance expectations, and technical tradeoffs to both technical and executive audiences.
AI Solution Deployment & Optimization
  • Support the deployment, validation, and optimization of Large Language Models (LLMs) running on Oracle GenAI Services and OCI infrastructure, including isolated and sovereign cloud environments.
  • Analyze and improve solution performance across throughput, latency, Time to First Token (TTFT), scalability, context utilization, resource efficiency, and overall user experience.
  • Guide customers through benchmarking, acceptance testing, production readiness, and operational optimization activities.
  • Recommend best practices for model selection, prompting strategies, Retrieval-Augmented Generation (RAG) architectures, and AI solution design.

Customer Validation & Advisory
  • Understand customer evaluation methodologies, benchmark frameworks, and acceptance criteria.
  • Interpret testing and benchmark results, explain performance outcomes, and provide recommendations for continuous improvement.
  • Evaluate model behavior across domain-specific and mission-critical use cases to ensure solutions align with customer objectives.

Cross-Functional Execution & Operatins
  • Partner with Oracle engineering, product management, cloud operations, networking, security, and support teams to deliver successful customer outcomes.
  • Drive resolution of complex deployment, performance, infrastructure, and operational challenges across multiple organizations and environments.
  • Analyze telemetry, observability data, and service metrics to troubleshoot issues, support incident investigations, and identify optimization opportunities.
  • Provide customer and field feedback to influence product direction, service improvements, and engineering roadmaps.

Required Qualifications
  • MUST possess or have the ability to obtain and maintain an active TS/SCI with FS poly

AI/ML Expertise
  • Strong understanding of Large Language Models (LLMs), Generative AI systems, inference architectures, and production AI application deployment.
  • Experience with prompt engineering, Retrieval-Augmented Generation (RAG), embedding models, vector databases, model evaluation methodologies, and model adaptation techniques.
  • Ability to explain AI model capabilities, limitations, risks, and expected behaviors to technical and non-technical stakeholders.

Cloud & Infrastructure
  • Experience with enterprise AI platforms such as Oracle GenAI Service, Azure OpenAI Service, Amazon Bedrock, Google Vertex AI, or similar technologies.
  • Strong understanding of cloud infrastructure, networking, security, distributed systems, and cloud-native architectures.
  • Familiarity with Kubernetes, containerized applications, and supporting production workloads in regulated, sovereign, government, or isolated cloud environments.
  • Experience presenting technical solutions to customer executives, architects, and engineering teams

API & Integration Skills
  • Experience integrating LLM services and APIs into enterprise applications and business workflows.
  • Familiarity with AI development frameworks and tooling such as LangChain, LlamaIndex, LiteLLM, OpenAI-compatible APIs, and agent frameworks.
  • Understanding of API management, authentication and authorization, token management, rate limiting, observability, and monitoring practices.

Performance Engineering, Troubleshooting & Operations
  • Experience analyzing and optimizing AI workload performance, including throughput, latency, concurrency, capacity planning, token consumption, and request lifecycle behavior.
  • Ability to diagnose and resolve issues across application, model, networking, infrastructure, and operational layers.
  • Experience using monitoring, observability, and operational analytics tools to support performance improvement, root cause analysis, and production operations.
  • Strong analytical, problem-solving, and cross-functional collaboration skills in complex technical environments.

Preferred Qualifications
  • Experience with OCI and Oracle Cloud technologies.
  • Experience supporting AI workloads in OCI Dedicated Region, OCI Isolated Region, government cloud, or sovereign cloud environments.
  • Knowledge of GPU infrastructure and AI inference platforms.
  • Familiarity with NVIDIA AI ecosystem technologies.
  • Experience conducting customer-facing architecture reviews and technical workshops.
  • Understanding of AI governance, security, compliance, and responsible AI principles.
  • Experience with benchmark analysis and model evaluation frameworks.
  • Background in Site Reliability Engineering (SRE), DevOps, Cloud Engineering, or AI Platform Engineering.

Critical Success Traits
  • Exceptional customer-facing communication skills.
  • Ability to bridge business objectives and technical implementation.
  • Comfortable operating in ambiguous, fast-moving environments.
  • Strong ownership mindset and bias toward action.
  • Ability to influence without direct authority across multiple organizations.
  • Capable of balancing customer advocacy with technical realism.
  • Skilled at expectation management and executive communication.
  • Trusted advisor mentality with a focus on long-term customer success.

What Success Looks Like
  • Customers successfully deploy and operationalize LLM solutions in OCI environments.
  • Customer expectations are aligned with model capabilities and operational realities.
  • Production systems achieve agreed-upon performance, reliability, and scalability targets.
  • Technical risks are identified early and mitigated proactively.
  • Oracle engineering teams receive actionable feedback that improves products and customer outcomes.
  • Customers view Oracle as a strategic AI partner and trusted advisor.

Come Join Us!
#LI-PA4
Qualifications
Disclaimer:
Certain U.S. based or U.S. customer or client-facing roles may be required to comply with applicable requirements, such as immunization/occupational health mandates, and/or drug testing requirements.
Range and benefit information provided in this posting are specific to the stated locations only
US: Hiring Range in USD from: $97,500 to $209,500 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC4
About Us
Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives.
True innovation starts when everyone is empowered to contribute. That's why we're committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling 1-888-404-2494 in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

What Oracle employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom


Oracle logo

About Oracle

Sourced by ZipRecruiter

An Oracle career can span industries, roles, Countries and cultures, giving you the opportunity to flourish in new roles and innovate, while blending work life in. Oracle has thrived through 40+ years of change by innovating and operating with integrity while delivering for the top companies in almost every industry. In order to nurture the talent that makes this happen, we are committed to an inclusive culture that celebrates and values diverse insights and perspectives, a workforce that inspires thought leadership and innovation. Oracle offers a highly competitive suite of Employee Benefits designed on the principles of parity, consistency, and affordability. The overall package includes certain core elements such as Medical, Life Insurance, access to Retirement Planning, and much more. We also encourage our employees to engage in the culture of giving back to the communities where we live and do business. At Oracle, we believe that innovation starts with diversity and inclusion and to create the future we need talent from various backgrounds, perspectives, and abilities. We ensure that individuals with disabilities are provided reasonable accommodation to successfully participate in the job application, interview process, and in potential roles. to perform crucial job functions. That's why we're committed to creating a workforce where all individuals can do their best work. It's when everyone's voice is heard and valued that we're inspired to go beyond what's been done before.

Industry

It services

Company size

10,000+ Employees

Headquarters location

Redwood City, CA, US

Year founded

1977

Social media