Job Title Software Engineer III - AI/ML Platform Operations - Remote Requisition Number R7739 ... Ensure platform reliability, scalability, performance, security, and operational readiness for ...
Job Title Software Engineer III - AI/ML Platform Operations - Remote Requisition Number R7739 ... Ensure platform reliability, scalability, performance, security, and operational readiness for ...
Job Title Software Engineer III - AI/ML Platform Operations - Remote Requisition Number R7739 ... Ensure platform reliability, scalability, performance, security, and operational readiness for ...
Job Title Software Engineer III - AI/ML Platform Operations - Remote Requisition Number R7739 ... Ensure platform reliability, scalability, performance, security, and operational readiness for ...
2223 Principal Software Engineer
Cincinnati, OH · Remote
$154K - $172K/yr
REMOTE EST/CST Years of Experience: 10+ What You'll Do Newline™ is an enterprise-grade embedded ... Improve platform reliability through code changes, automation, observability, and better system ...
Quick apply
2223 Principal Software Engineer
Cincinnati, OH · Remote
$154K - $172K/yr
REMOTE EST/CST Years of Experience: 10+ What You'll Do Newline™ is an enterprise-grade embedded ... Improve platform reliability through code changes, automation, observability, and better system ...
Engineer, Observability (Remote)
Columbus, OH · On-site +1
$55 - $73.25/hr
At Abercrombie & Fitch the Engineer, Observability is a member of the Site Reliability team which ... IS AN EQUAL OPPORTUNITY EMPLOYER This role allows for remote work across the U.S.. Therefore, in ...
Engineer, Observability (Remote)
Columbus, OH · On-site +1
$55 - $73.25/hr
At Abercrombie & Fitch the Engineer, Observability is a member of the Site Reliability team which ... IS AN EQUAL OPPORTUNITY EMPLOYER This role allows for remote work across the U.S.. Therefore, in ...
Azure/AWS AI services, containerized ML workloads (Docker/Kubernetes), event-driven architecture (Kafka), APIs, IaC, and observability/SRE practices. * Data architecture for AI: feature engineering ...
Azure/AWS AI services, containerized ML workloads (Docker/Kubernetes), event-driven architecture (Kafka), APIs, IaC, and observability/SRE practices. * Data architecture for AI: feature engineering ...
Azure/AWS AI services, containerized ML workloads (Docker/Kubernetes), event-driven architecture (Kafka), APIs, IaC, and observability/SRE practices. * Data architecture for AI: feature engineering ...
Azure/AWS AI services, containerized ML workloads (Docker/Kubernetes), event-driven architecture (Kafka), APIs, IaC, and observability/SRE practices. * Data architecture for AI: feature engineering ...
Condition Monitoring Engineer
Columbus, OH · Remote
$90K - $110K/yr
... customers' reliability engineers. In this role, you will: * Collect and analyze vibration and ... Accompany hardware installation team and recommend placement of machine condition sensing units.
Quick apply
Condition Monitoring Engineer
Columbus, OH · Remote
$90K - $110K/yr
... customers' reliability engineers. In this role, you will: * Collect and analyze vibration and ... Accompany hardware installation team and recommend placement of machine condition sensing units.
... performance, reliability, scalability, and capacity. Leads the configuration of hardware and ... Remote Actions * Dashboards & Investigations * Campaigns & Alerts * Application Experience ...
... performance, reliability, scalability, and capacity. Leads the configuration of hardware and ... Remote Actions * Dashboards & Investigations * Campaigns & Alerts * Application Experience ...
Platform Engineer II - AWS, Kubernetes & Golang
Cincinnati, OH · On-site +1
$91K - $138K/yr
This position is not eligible for remote work. Ready to take your career global? Make your mark at ... 6 AWS regions, ensuring reliability, security, and performance for payment services
Platform Engineer II - AWS, Kubernetes & Golang
Cincinnati, OH · On-site +1
$91K - $138K/yr
This position is not eligible for remote work. Ready to take your career global? Make your mark at ... 6 AWS regions, ensuring reliability, security, and performance for payment services
Enterprise Account Executive
Cleveland, OH · On-site +1
Komodor is an AI-powered SRE platform that helps engineering teams proactively manage reliability ... This role is remote from Ohio** Core mission: * Close business to meet and exceed monthly ...
Enterprise Account Executive
Cleveland, OH · On-site +1
Komodor is an AI-powered SRE platform that helps engineering teams proactively manage reliability ... This role is remote from Ohio** Core mission: * Close business to meet and exceed monthly ...
Senior Cloud Services Engineer - Plex
Mayfield Heights, OH · Remote
$56.75 - $75.75/hr
Prior experience in SRE or Platform Engineering roles. * Degree in Computer Science or related area ... You may be just the right person for this or other roles. #LI-Remote #LI-LifeAtROK #LI-MG4 We are ...
Senior Cloud Services Engineer - Plex
Mayfield Heights, OH · Remote
$56.75 - $75.75/hr
Prior experience in SRE or Platform Engineering roles. * Degree in Computer Science or related area ... You may be just the right person for this or other roles. #LI-Remote #LI-LifeAtROK #LI-MG4 We are ...
Integration Engineer
Beavercreek, OH · On-site +1
$61K - $141K/yr
Remote Work: Yes Job Number: R0242093 Location: Beavercreek,OH,US Share job via: Share Integration ... Ability to work across the hardware or software boundary to enable reliable device discovery, data ...
Integration Engineer
Beavercreek, OH · On-site +1
$61K - $141K/yr
Remote Work: Yes Job Number: R0242093 Location: Beavercreek,OH,US Share job via: Share Integration ... Ability to work across the hardware or software boundary to enable reliable device discovery, data ...
DevOps Engineer III
Beavercreek, OH · Remote
$49 - $67/hr
Coordinate with onsite and remote teams to validate configurations and ensure handoff readiness ... reliability * Support container image movement and lifecycle management using tools such as Skopeo ...
Quick apply
DevOps Engineer III
Beavercreek, OH · Remote
$49 - $67/hr
Coordinate with onsite and remote teams to validate configurations and ensure handoff readiness ... reliability * Support container image movement and lifecycle management using tools such as Skopeo ...
2211 ServiceNow Software Engineer IV
Cincinnati, OH · On-site +1
$142K - $158K/yr
Cincinnati, OH (Madisonville) - Remote/Hybrid candidates considered Years of Experience: 7-15+ TOP ... Required Skills & Experience * 7+ years in ITSM, Change Enablement, Platform/SRE Ops, or Security ...
Quick apply
2211 ServiceNow Software Engineer IV
Cincinnati, OH · On-site +1
$142K - $158K/yr
Cincinnati, OH (Madisonville) - Remote/Hybrid candidates considered Years of Experience: 7-15+ TOP ... Required Skills & Experience * 7+ years in ITSM, Change Enablement, Platform/SRE Ops, or Security ...
DevOps Engineer III
Beavercreek, OH · Remote
$49 - $67/hr
Coordinate with onsite and remote teams to validate configurations and ensure handoff readiness ... reliability * Support container image movement and lifecycle management using tools such as Skopeo ...
DevOps Engineer III
Beavercreek, OH · Remote
$49 - $67/hr
Coordinate with onsite and remote teams to validate configurations and ensure handoff readiness ... reliability * Support container image movement and lifecycle management using tools such as Skopeo ...
DevOps Engineer IV
Beavercreek, OH · Remote
$49 - $67/hr
Coordinate with onsite and remote teams to validate configurations and ensure handoff readiness ... reliability * Support container image movement and lifecycle management using tools such as Skopeo ...
Quick apply
DevOps Engineer IV
Beavercreek, OH · Remote
$49 - $67/hr
Coordinate with onsite and remote teams to validate configurations and ensure handoff readiness ... reliability * Support container image movement and lifecycle management using tools such as Skopeo ...
DevOps Engineer IV
Beavercreek, OH · Remote
$49 - $67/hr
Coordinate with onsite and remote teams to validate configurations and ensure handoff readiness ... reliability * Support container image movement and lifecycle management using tools such as Skopeo ...
DevOps Engineer IV
Beavercreek, OH · Remote
$49 - $67/hr
Coordinate with onsite and remote teams to validate configurations and ensure handoff readiness ... reliability * Support container image movement and lifecycle management using tools such as Skopeo ...
Senior DevOps Engineer
Beavercreek, OH · Remote
$126K - $162K/yr
Coordinate with onsite and remote teams to validate configurations and ensure handoff readiness ... deployment reliability. * Support container image movement and lifecycle management using tools ...
Senior DevOps Engineer
Beavercreek, OH · Remote
$126K - $162K/yr
Coordinate with onsite and remote teams to validate configurations and ensure handoff readiness ... deployment reliability. * Support container image movement and lifecycle management using tools ...
Senior DevOps Engineer
Beavercreek, OH · On-site +1
$126K - $162K/yr
Coordinate with onsite and remote teams to validate configurations and ensure handoff readiness ... deployment reliability. * Support container image movement and lifecycle management using tools ...
Senior DevOps Engineer
Beavercreek, OH · On-site +1
$126K - $162K/yr
Coordinate with onsite and remote teams to validate configurations and ensure handoff readiness ... deployment reliability. * Support container image movement and lifecycle management using tools ...
Senior DevOps Engineer
Beavercreek, OH · Remote
$126K - $162K/yr
Coordinate with onsite and remote teams to validate configurations and ensure handoff readiness ... deployment reliability. * Support container image movement and lifecycle management using tools ...
Senior DevOps Engineer
Beavercreek, OH · Remote
$126K - $162K/yr
Coordinate with onsite and remote teams to validate configurations and ensure handoff readiness ... deployment reliability. * Support container image movement and lifecycle management using tools ...
Remote Hardware Reliability Engineer information
What is a Remote Hardware Reliability Engineer?
What is the difference between Remote Hardware Reliability Engineer vs Remote Hardware Test Engineer?
| Aspect | Remote Hardware Reliability Engineer | Remote Hardware Test Engineer |
|---|---|---|
| Credentials | Engineering degree, certifications in reliability or hardware engineering | Engineering degree, certifications in testing or quality assurance |
| Work Environment | Designing reliability strategies, analyzing failure data, improving hardware durability | Developing and executing testing procedures, validating hardware performance |
| Industry Usage | Manufacturers, tech companies, aerospace, automotive | Manufacturers, consumer electronics, hardware development firms |
While both roles focus on hardware, the Remote Hardware Reliability Engineer emphasizes ensuring long-term durability and reliability through analysis and design improvements. In contrast, the Remote Hardware Test Engineer concentrates on testing hardware components to verify performance and quality before deployment.
What are the key skills and qualifications needed to thrive as a Remote Hardware Reliability Engineer, and why are they important?
How does a Remote Hardware Reliability Engineer collaborate effectively with cross-functional teams while working offsite?
Full-time
Retirement
Posted 3 days ago
Job description
External candidates: In order for your application to be correctly processed please sign-in before you apply
Internal candidates: Please go to Workday and click "Find Jobs" link under Career
Thank you for considering opportunities with us!
Job Title
Software Engineer III - AI/ML Platform Operations - RemoteRequisition Number
R7739 Software Engineer III - AI/ML Platform Operations - Remote (Open)Location
Arizona - Home TeleworkersAdditional Locations
Job Information
CSAA Insurance Group (CSAA IG), a AAA insurer, is one of the leading personal lines property and casualty insurance groups in the United States. Here, every employee shapes our mission. We build innovative, human-centered solutions that help AAA members prevent, prepare for, and recover from life's uncertainties. You will join a collaborative, inclusive culture where your strengths have room to grow and your ideas can drive real impact. Step into a role where you can contribute to our shared success through meaningful work.
We are actively hiring for a Software Engineer - AI/ML Platform Operations - Remote
Your Role: We are seeking a Software Engineer - AI/ML Platform Operations to lead the operational excellence, reliability, and support of our enterprise AI and data platforms. This role is responsible for ensuring the stability, scalability, observability, governance, and operational readiness of AI/ML solutions that power critical business capabilities.
This is not a traditional software application development role. While strong software engineering skills are essential, the primary focus is on AI platform operations, MLOps, automation, reliability engineering, deployment support, observability, governance, and continuous improvement of enterprise AI capabilities.
Your Work: You will work across a modern technology ecosystem that includes Palantir Foundry, AWS Bedrock, Amazon SageMaker, cloud-native services, and emerging Generative AI technologies. You will partner with Data Engineering, Data Science, Architecture, Infrastructure, Security, and Product teams to support production AI workloads and enable the successful adoption of AI capabilities across the organization.
AI Platform Operations & ReliabilityProvide technical leadership for AI/ML platforms including Palantir, AWS Bedrock, Amazon SageMaker, and related cloud-native technologies.
Ensure platform reliability, scalability, performance, security, and operational readiness for production AI workloads.
Support deployment, monitoring, maintenance, and lifecycle management of AI/ML solutions and Generative AI services.
Establish operational standards, support models, service-level objectives (SLOs), and platform governance practices.
Design and implement automation, monitoring, observability, and operational tooling to improve platform reliability and efficiency.
Develop and maintain dashboards, health metrics, alerts, logging frameworks, and operational runbooks.
Enhance CI/CD pipelines, deployment automation, infrastructure-as-code, and model release processes.
Implement best practices for MLOps, model monitoring, model lifecycle management, and AI operational governance.
Serve as a senior escalation point for complex production issues involving AI platforms, machine learning workloads, cloud infrastructure, and data integrations.
Lead root cause analysis efforts and drive corrective and preventive actions to improve platform stability.
Solve performance, availability, deployment, and integration issues across AI and data ecosystems.
Partner with engineering and business teams to rapidly restore service and minimize operational risk.
Provide mentorship, technical guidance, and operational expertise to engineers and platform teams.
Influence platform strategy, architecture decisions, operational processes, and technology adoption.
Collaborate with team members to align platform capabilities with business priorities and AI adoption goals.
Communicate complex technical concepts effectively to both technical and non-technical audiences.
Remain current with advancements in AI/ML, Generative AI, cloud technologies, platform engineering, and reliability practices.
Identify opportunities to improve operational efficiency, governance, security, and developer experience.
Champion modern engineering practices including automation, observability, DevOps, Site Reliability Engineering (SRE), and AI Operations (AIOps).
3+ years of progressive experience in software engineering, platform engineering, cloud operations, MLOps, DevOps, or related technical disciplines.
Bachelor's degree in Computer Science, Engineering, Information Technology, or a related field, or equivalent practical experience.
Experience supporting production cloud-based applications and services in AWS environments.
Strong experience with software engineering and automation using languages such as Python, Java, JavaScript/TypeScript, or Node.js.
Experience with CI/CD, build, integration, and deployment tools such as Jenkins, Maven, GitHub Actions, or equivalent.
Experience with cloud-native services including compute, storage, networking, databases, and serverless architectures.
Experience building and maintaining operational monitoring, observability, and alerting solutions.
Strong troubleshooting, incident response, and root cause analysis skills.
Excellent communication, collaboration, and technical leadership capabilities.
What would make us excited about you?
Experience with AI/ML platforms such as Palantir Foundry, Amazon SageMaker, AWS Bedrock, Databricks, or similar ecosystems.
Experience supporting Generative AI applications, LLM-based solutions, prompt orchestration frameworks, and Retrieval-Augmented Generation (RAG) architectures.
Knowledge of MLOps practices including model deployment, monitoring, governance, experimentation, and lifecycle management.
Experience with observability and monitoring platforms such as Datadog, Splunk, Grafana, Prometheus, CloudWatch, or OpenTelemetry.
Familiarity with AI governance, responsible AI principles, model risk management, and operational controls.
Relevant cloud, AI/ML, DevOps, or platform engineering certifications
Actively shapes our company culture (e.g., participating in employee resource groups, volunteering, etc.)
Lives into cultural norms (e.g., willing to have cameras when it matters: helping onboard new team members, building relationships, etc.)
Travels as needed for role, including divisional / team meetings and other in-person meetings
Fulfills business needs, which may include investing extra time, helping other teams, etc
Please note we are hiring for this role remote anywhere in the United States with the following exceptions: Hawaii and Alaska.
Why Choose a Career at CSAA IG?
At CSAA IG, we are a mission-driven organization proudly committed to empowering our members, our employees, and our communities to thrive.
Recognition: We offer a total compensation package, annual bonus eligibility for most roles, 401(k) with a company match, and so much more! Read more about what we offer and what it is like to be a part of our dynamic team at https://careers.csaainsurance.aaa.com/us/en/benefits.
Career Growth: We believe in growth for everyone. Here at CSAA IG, leaders and mentors partner with employees to align interests, unlock development opportunities, and support longterm success.
Flexible Workplace: We embrace a remote-first culture through our Flexible Workplace. Most employees hold Home-Flex roles, working primarily from home, often with the flexibility to work from various locations including CSAA offices. Our flexible workplace empowers you to balance remote work with intentional inperson moments that deepen connection and collaboration.
Inclusion and Belonging: An inclusive and welcoming workplace is the cornerstone of our success. By fostering an environment where people feel valued and heard, we deepen our ability to understand and meet the unique needs of our members. This strengthens innovation and enhances our products and services, giving us a competitive edge in the market.
Sustainability: As climate change leads to more frequent and severe weather events, we are taking bold action to build more resilient communities and reduce our environmental impact. Submit your application to be considered. We communicate via email, so check your inbox and/or your spam folder to ensure you don't miss important updates from us.
CSAA is committed to providing reasonable accommodations to qualified applicants and employees with disabilities or other limitations. If you would like to request an accommodation to participate in the job application or interview process, please contact TalentAcquistion@csaa.com
If you apply and are selected to continue in the recruiting process, we will schedule a preliminary call with you to discuss the role and will disclose during that call the available salary/hourly rate range based on your location. Factors used to determine the actual salary offered may include location, experience, or education.
CSAA does not provide visa sponsorship for this role. Applicants must have authorization to work indefinitely in the US. Please do not apply for this role if at any time (now or in the future) you will need immigration support (i.e., H-1B, TN, STEM OPT Training Plans, etc.).
CSAA Insurance Group is an equal opportunity employer.
#LI-SB1
.
The national average salary range for this position is $105,345.00-$117,050.00. However, we have a location-based compensation structure. Our salary ranges vary and are calculated based on work location. The starting pay range for this position across all the states we hire in is $105,345.00-$140,550.00. This role also includes an opportunity for a company-wide annual discretionary bonus, through our Annual Incentive Plan (AIP), of up to 8% of eligible pay.This job posting will be unposted on Wed, 8 Jul 2026.