$102K - $139.60K/yr
This role is fully remote-friendly, with team members distributed across the US and Canada ... Identify and resolve bottlenecks in data, compute, orchestration, and observability layers * Mentor ...
US or Canada Remote Responsibilities * Lead architecture and delivery for major ML platform ... Guide the design of model deployment, inference services, monitoring, and observability for ...
This role is fully remote-friendly, with team members distributed across the US and Canada ... Define scalable approaches for model deployment, inference services, monitoring, and observability ...
Birmingham, AL · Remote
$49.50 - $67.75/hr
Advance monitoring, observability, and incident readiness across applications and infrastructure ... Remote first work environment * Choice of a HDHP or PPO Medical plan, we pay 100% of the premium ...
Montgomery, AL · On-site +1
$197.40K - $232K/yr
Remote Department Engineering Compensation: $197.4K - $232K • Offers Equity At Confluent, we are ... Improve service reliability and operations by defining SLOs/SLAs, strengthening observability, and ...
Montgomery, AL · On-site +1
$133.50K - $179K/yr
The Position We're looking to hire a Principal Software Engineer (.NET + Data) to join our team ... Experience implementing observability practices using tools such as Datadog, Prometheus, CloudWatch ...
Huntsville, AL · On-site +1
$53.25 - $71/hr
This position can be performed remote from anywhere, but may require up to 15% travel. As a skilled ... Experience with monitoring and observability tools (CloudWatch, Prometheus, Grafana, Datadog)
AL · On-site +1
$120K/yr
Huntsville, AL/Remote Salary*: 120,000+ *Dependent upon qualifications Summit 7 is here to rise ... Monitor system health and performance using observability stacks such as Prometheus, Grafana, and ...
Montgomery, AL · Remote
$106.30K - $139.50K/yr
... observability. For more information, visit www.enterprisedb.com Candidate Note ... This role is 100% remote for candidates based in EST or CST only We are looking for a confident ...
Huntsville, AL · On-site +1
$140K - $220K/yr
... observability. Education/Qualifications Minimum Requirements: * Must be a U.S. citizen and be ... Remote
Huntsville, AL · Remote
$140K - $220K/yr
... observability. Education/Qualifications Minimum Requirements: * Must be a U.S. citizen and be ... Remote Employment Type: FULL_TIME
Huntsville, AL · Remote
$140K - $220K/yr
... and observability. Minimum Requirements: * Must be a U.S. citizen and be willing to obtain and ... Remote
Montgomery, AL · Remote
$107.30K - $145.90K/yr
... observability. For more information, visit www.enterprisedb.com Candidate Note ... This role is 100% remote for candidates based in EST or CST only We are looking for a confident ...
Huntsville, AL · On-site +1
$55 - $73.50/hr
This position can be performed remote from anywhere, but may require up to 15% travel. As a skilled ... Experience with monitoring and observability tools (CloudWatch, Prometheus, Grafana, Datadog)
Huntsville, AL · On-site +1
Posting Type Remote/Hybrid Job Overview Relativity is a private equity-backed, legal data ... Engineering Culture: Continuous delivery, strong observability, and end-to-end ownership.
Huntsville, AL · On-site +1
... days remote Position Description: This position focuses on AI/ML and data execution within the ... observability platforms About PingWind PingWind is focused on delivering outstanding services to ...
The DevSecOps Engineer, Mid designs infrastructure-as-code patterns and observability practices to ... S. citizenship as required for this remote federal IT position. Preferred Qualifications
Montgomery, AL · On-site +1
Remote, United States Date Posted: May 5, 2026 Employment Type: Intern Job ID: R-1950 Description ... AI-powered observability tools on AWS (CloudWatch, New Relic, DataDog, etc.) * Contribute to ...
| Aspect | Remote Observability Engineer | Site Reliability Engineer |
|---|---|---|
| Credentials | Knowledge of monitoring tools, scripting, cloud platforms | Same as Observability Engineer, plus SRE certifications often preferred |
| Work Environment | Focus on monitoring, logging, and tracing systems remotely | Broader scope including system reliability, incident response, and automation |
| Industry Usage | Primarily in tech, SaaS, cloud services | Widely in tech, finance, and large-scale online services |
The Remote Observability Engineer specializes in monitoring and analyzing system performance remotely, working mainly with telemetry such as logs, metrics, and traces. The Site Reliability Engineer has a broader remit that covers overall system reliability, automation, and incident management. Both roles draw on similar technical skills, but SREs often carry additional responsibility for system resilience and scalability.
$102K - $139.60K/yr
Full-time
Posted 21 days ago
9.5
Based on 5 frontline employees who took The Breakroom Quiz
5th of 184 rated software companies
Job Requisition ID #
POSITION OVERVIEW
The work we do at Autodesk touches nearly every person on the planet. By creating software tools for making buildings, machines, and even the latest movies, we influence and empower some of the most creative people in the world to solve problems that matter.
Autodesk is seeking a Senior ML Engineer, ML Systems and Infrastructure to design and scale the systems that enable machine learning across research and product development. You will help build the infrastructure behind large-scale data pipelines, distributed training systems, evaluation frameworks, and production ML workflows that support foundation models and ML-powered product features.
This role is ideal for an engineer who is deeply interested in scalable systems and production-grade ML infrastructure. You will operate independently across multiple parts of the stack and help define strong engineering practices for reliability, performance, and maintainability.
This role is fully remote-friendly, with team members distributed across the US and Canada.
Location: US or Canada Remote, East Coast
RESPONSIBILITIES
Design and build scalable systems for ML training, evaluation, deployment, and monitoring
Develop and improve data pipelines that process large-scale structured and semi-structured technical datasets
Optimize distributed workflows for performance, reliability, resource utilization, and cost efficiency
Build platform capabilities such as experiment tracking, model versioning, checkpointing, reproducibility, and observability
Contribute to model deployment, inference services, and production monitoring workflows
Improve data quality, lineage, provenance, and operational transparency across ML pipelines
Contribute to architecture and design discussions across the team
Identify and resolve bottlenecks in data, compute, orchestration, and observability layers
Mentor engineers through code reviews, design guidance, and knowledge sharing
Collaborate closely with researchers, product engineers, and platform partners to turn ML workflows into robust engineering systems
MINIMUM QUALIFICATIONS
Bachelor's or Master's degree in Computer Science, Engineering, or a related field, or equivalent industry experience
At least 3 to 4 years of industry experience building and operating production software, ML systems, distributed infrastructure, or large-scale data pipelines
Strong experience in software engineering, distributed systems, backend systems, or ML infrastructure
Strong proficiency in Python and experience delivering production-quality systems
Experience designing and operating scalable data or compute pipelines
Experience with cloud platforms such as AWS, Azure, or GCP
Familiarity with containers, CI/CD, observability, and release quality practices
Ability to independently drive technical execution on complex work with limited oversight
PREFERRED QUALIFICATIONS
Experience building data pipelines for large-scale structured and semi-structured technical datasets
Experience with data lineage, provenance, governance, and responsible data usage in ML systems
Experience with distributed data processing and orchestration systems such as Ray, Airflow, Spark, or similar platforms
Experience with model deployment, inference services, monitoring, and observability for production ML systems
Experience building ML-ready representations for geometry, graph, hierarchical, or multimodal data
Experience with distributed ML frameworks such as PyTorch, Lightning, DeepSpeed, FSDP, Megatron, or similar
Familiarity with AEC workflows, design data, BIM/CAD formats, or Autodesk products
THE IDEAL CANDIDATE
Thinks like a systems engineer and executes like a strong software developer
Can balance short-term delivery with long-term platform health
Brings strong technical judgment and ownership
Improves team effectiveness through mentoring and engineering rigor
Enjoys solving scaling, performance, and reliability challenges
At Autodesk, we're building a diverse workplace and an inclusive culture to give more people the chance to imagine, design, and make a better world. Autodesk is proud to be an equal opportunity employer and considers all qualified applicants for employment without regard to race, color, religion, age, sex, sexual orientation, gender, gender identity, national origin, disability, veteran status or any other legally protected characteristic. We also consider for employment all qualified applicants regardless of criminal histories, consistent with applicable law.
Are you an existing contractor or consultant with Autodesk? Please search for open jobs and apply internally (not on this external site). If you have any questions or require support, contact Autodesk Careers.
Sourced by ZipRecruiter
Autodesk is changing how the world is designed and made. Our technology spans architecture, engineering, construction, product design, manufacturing, media, and entertainment, empowering innovators everywhere to solve challenges big and small. From greener buildings to smarter products to more mesmerizing blockbusters, Autodesk software helps our customers to design and make a better world for all. For more information visit autodesk.com or follow @autodesk.
Industry: Software development
Company size: 10,000+ Employees
Headquarters: San Rafael, CA, US
Founded: 1982