WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that ... Deep-learning frameworks andinference engines (PyTorch, TensorFlow, JAX, Triton, vLLM) * HPC ...
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that ... Deep-learning frameworks andinference engines (PyTorch, TensorFlow, JAX, Triton, vLLM) * HPC ...
Senior Data Scientist - Business Analytics - AI
San Jose, CA · On-site
$109K/yr
Build, test, and refine machine learning models to improve forecasting accuracy and predictive ... AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all ...
Senior Data Scientist - Business Analytics - AI
San Jose, CA · On-site
$109K/yr
Build, test, and refine machine learning models to improve forecasting accuracy and predictive ... AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all ...
... Machine Learning, or a related field. LOCATION: * San Jose, CA or Bellevue, WA preferred. May consider other US markets within proximity of US AMD offices. #LI-MV1 #HYBRID Benefits offered are ...
... Machine Learning, or a related field. LOCATION: * San Jose, CA or Bellevue, WA preferred. May consider other US markets within proximity of US AMD offices. #LI-MV1 #HYBRID Benefits offered are ...
... Machine Learning, or a related field. LOCATION: * San Jose, CA or Bellevue, WA preferred. May consider other US markets within proximity of US AMD offices. #LI-MV1 #HYBRID Benefits offered are ...
... Machine Learning, or a related field. LOCATION: * San Jose, CA or Bellevue, WA preferred. May consider other US markets within proximity of US AMD offices. #LI-MV1 #HYBRID Benefits offered are ...
Model Implementation Engineer
San Francisco, CA · On-site
$165K - $220K/yr
Backed by multi-million-dollar funding and direct sponsorship from AMD with hands-on support from ... Maintain and evolve a large-scale library of modern machine learning models, including but not ...
Model Implementation Engineer
San Francisco, CA · On-site
$165K - $220K/yr
Backed by multi-million-dollar funding and direct sponsorship from AMD with hands-on support from ... Maintain and evolve a large-scale library of modern machine learning models, including but not ...
Senior Research Scientist
San Francisco, CA · On-site
$190K - $250K/yr
Backed by multi-million-dollar funding and direct sponsorship from AMD with hands-on support from ... What you'll do * Lead research in advanced machine learning areas such as LLMs, generative AI ...
Senior Research Scientist
San Francisco, CA · On-site
$190K - $250K/yr
Backed by multi-million-dollar funding and direct sponsorship from AMD with hands-on support from ... What you'll do * Lead research in advanced machine learning areas such as LLMs, generative AI ...
... Machine Learning (ML) training and serving benchmarks. • Use the benchmarks to identify ... NVIDIA/AMD architectures through low-level programming, performance modeling, and bottlenecks ...
... Machine Learning (ML) training and serving benchmarks. • Use the benchmarks to identify ... NVIDIA/AMD architectures through low-level programming, performance modeling, and bottlenecks ...
... machine-learning frameworks (e.g., PyTorch, TensorFlow) for high-throughput and scalable inference Kernel & Inference Frameworks * Strong background in NVIDIA, AMD, or similar GPU architectures and ...
... machine-learning frameworks (e.g., PyTorch, TensorFlow) for high-throughput and scalable inference Kernel & Inference Frameworks * Strong background in NVIDIA, AMD, or similar GPU architectures and ...
AI Infrastructure Engineer
San Jose, CA · On-site
$143K/yr
Understanding of machine learning frameworks (PyTorch, vLLM, SGLang, etc.). #LI-G11 #LI-HYBRID Benefits offered are described: AMD benefits at a glance. AMD does not accept unsolicited resumes from ...
AI Infrastructure Engineer
San Jose, CA · On-site
$143K/yr
Understanding of machine learning frameworks (PyTorch, vLLM, SGLang, etc.). #LI-G11 #LI-HYBRID Benefits offered are described: AMD benefits at a glance. AMD does not accept unsolicited resumes from ...
Senior Software Development Engineer - LLM Inference Framework
Santa Clara, CA · On-site
$144K - $190K/yr
... machine-learning frameworks (e.g., PyTorch, TensorFlow) for high-throughput and scalable inference Kernel & Inference Frameworks * Strong background in NVIDIA, AMD, or similar GPU architectures and ...
Senior Software Development Engineer - LLM Inference Framework
Santa Clara, CA · On-site
$144K - $190K/yr
... machine-learning frameworks (e.g., PyTorch, TensorFlow) for high-throughput and scalable inference Kernel & Inference Frameworks * Strong background in NVIDIA, AMD, or similar GPU architectures and ...
Senior Product Manager, GPU Compilers & Kernels
Santa Clara, CA · Hybrid
$149K - $197K/yr
AMD's Data Center GPU organization builds high-performance GPUs for largescale machine learning. The Triton compiler and kernel teams are central to unlocking performance, stability, and usability of ...
Senior Product Manager, GPU Compilers & Kernels
Santa Clara, CA · Hybrid
$149K - $197K/yr
AMD's Data Center GPU organization builds high-performance GPUs for largescale machine learning. The Triton compiler and kernel teams are central to unlocking performance, stability, and usability of ...
AI Infrastructure Engineer
San Jose, CA · Hybrid
$126K - $165K/yr
Understanding of machine learning frameworks (PyTorch, vLLM, SGLang, etc.). #LI-G11 #LI-HYBRID Benefits offered are described: AMD benefits at a glance. AMD does not accept unsolicited resumes from ...
AI Infrastructure Engineer
San Jose, CA · Hybrid
$126K - $165K/yr
Understanding of machine learning frameworks (PyTorch, vLLM, SGLang, etc.). #LI-G11 #LI-HYBRID Benefits offered are described: AMD benefits at a glance. AMD does not accept unsolicited resumes from ...
Technical Program Manager - Graphics
San Diego, CA · Hybrid
$136K - $177K/yr
The position offers deep exposure to industryleading Graphics, GPU Compute, Machine Learning, and ... AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all ...
Technical Program Manager - Graphics
San Diego, CA · Hybrid
$136K - $177K/yr
The position offers deep exposure to industryleading Graphics, GPU Compute, Machine Learning, and ... AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all ...
Sr. Fellow, ML Workload Performance
San Jose, CA · On-site
$255K/yr
... for Machine Learning workload performance and optimization across frameworks and model ... Represent AMD in external technical forums, benchmarks, and customer engagements. • Communicate ...
Sr. Fellow, ML Workload Performance
San Jose, CA · On-site
$255K/yr
... for Machine Learning workload performance and optimization across frameworks and model ... Represent AMD in external technical forums, benchmarks, and customer engagements. • Communicate ...
... for Machine Learning workload performance and optimization across frameworks and model ... Represent AMD in external technical forums, benchmarks, and customer engagements. · Communicate ...
... for Machine Learning workload performance and optimization across frameworks and model ... Represent AMD in external technical forums, benchmarks, and customer engagements. · Communicate ...
Senior AI Serving Engineer, Backend
San Francisco, CA · On-site
$190K - $250K/yr
Backed by multi-million-dollar funding and direct sponsorship from AMD with hands-on support from ... or machine learning systems. * Strong proficiency in C++/Python/Go/Rust * Experience with ...
Senior AI Serving Engineer, Backend
San Francisco, CA · On-site
$190K - $250K/yr
Backed by multi-million-dollar funding and direct sponsorship from AMD with hands-on support from ... or machine learning systems. * Strong proficiency in C++/Python/Go/Rust * Experience with ...
LLM Training Engineer
San Francisco, CA · On-site
$155K - $220K/yr
Backed by multi-million-dollar funding and direct sponsorship from AMD with hands-on support from ... MS or PhD in Computer Science, Machine Learning, AI, Mathematics, or related field Benefits include
LLM Training Engineer
San Francisco, CA · On-site
$155K - $220K/yr
Backed by multi-million-dollar funding and direct sponsorship from AMD with hands-on support from ... MS or PhD in Computer Science, Machine Learning, AI, Mathematics, or related field Benefits include
Senior HPC & GPU Infrastructure Engineer
San Francisco, CA · On-site
$150K - $220K/yr
... machine learning workflows. This role spans everything from hands-on Linux systems engineering and ... Strong expertise with NVIDIA (H100/B200) or AMD (MI325x/MI355x) GPUs, including driver and kernel ...
Senior HPC & GPU Infrastructure Engineer
San Francisco, CA · On-site
$150K - $220K/yr
... machine learning workflows. This role spans everything from hands-on Linux systems engineering and ... Strong expertise with NVIDIA (H100/B200) or AMD (MI325x/MI355x) GPUs, including driver and kernel ...
Sr. Network Engineer/Rack Solution
$137K - $156K/yr
... Learning and Machine Learning * 8+ years of Linux/networking debugging/testing or relevant ... Familiar with Intel/AMD/NVIDIA development tool kits such as CUDA, oneAPI, ROCm 2. Relevant ...
Sr. Network Engineer/Rack Solution
$137K - $156K/yr
... Learning and Machine Learning * 8+ years of Linux/networking debugging/testing or relevant ... Familiar with Intel/AMD/NVIDIA development tool kits such as CUDA, oneAPI, ROCm 2. Relevant ...
Sr. Network Engineer/Rack Solution
San Jose, CA · On-site
$137K - $156K/yr
... AMD EPYC processors, encompassing functionality, compatibility, performance, stress, and ... Machine Learning • 8+ years of Linux/networking debugging/testing or relevant experience ...
Sr. Network Engineer/Rack Solution
San Jose, CA · On-site
$137K - $156K/yr
... AMD EPYC processors, encompassing functionality, compatibility, performance, stress, and ... Machine Learning • 8+ years of Linux/networking debugging/testing or relevant experience ...
Amd Machine Learning information
See California salary details
$13.76 - $15.29
3% of jobs
$15.29 - $16.82
7% of jobs
$16.82 - $18.35
6% of jobs
$19.50 is the 25th percentile. Wages below this are outliers.
$18.35 - $19.88
11% of jobs
$19.88 - $21.42
13% of jobs
The median wage is $22.23 / hr.
$21.42 - $22.95
18% of jobs
$22.95 - $24.48
16% of jobs
$24.57 is the 75th percentile. Wages above this are outliers.
$24.48 - $26.01
9% of jobs
$26.01 - $27.54
6% of jobs
$27.54 - $29.07
7% of jobs
$29.07 - $30.60
3% of jobs
$13
$22
$30
How much do amd machine learning jobs pay per hour?
What are some common challenges faced by machine learning engineers at AMD, and how can applicants prepare to address them?
What are AMD machine learning engineers?
What is the difference between Amd Machine Learning vs Data Scientist?
| Aspect | Amd Machine Learning | Data Scientist |
|---|---|---|
| Required Credentials | Bachelor's or higher in CS, ML, or related fields; certifications like AWS, Azure | Bachelor's or higher in CS, Statistics, or related fields; certifications like SAS, Python |
| Work Environment | Tech companies, R&D labs, AI startups | Business analytics, finance, healthcare, tech firms |
| Industry Usage | Developing ML models, algorithms, AI solutions | Data analysis, insights, predictive modeling |
| Common Search/Comparison | Yes | Yes |
While both roles involve working with data and algorithms, Amd Machine Learning focuses on developing machine learning models and AI solutions, often requiring specialized technical skills. Data Scientists analyze data to generate insights and support decision-making, with a broader scope that includes statistical analysis. Both roles are vital in tech-driven industries but differ in their primary focus and skill sets.
Which 3 jobs will survive AI?
What are the key skills and qualifications needed to thrive as an AMD Machine Learning Engineer, and why are they important?
- Deep Learning Engineer
- Machine Learning Engineer
- Machine Engineer
- Staff Software Engineer Machine Learning
- Senior Machine Learning Engineer Biotech
- Cloud Computing Developer
- Volunteer Junior Machine Learning Engineer
- Machine Learning Engineer Biotech
- Machine Learning Engineer Apprenticeship
- Senior Machine Learning Researcher
$158K - $212K/yr
Full-time
Posted 28 days ago
Advanced Micro Devices rating
8.4
Based on 7 frontline employees who took The Breakroom Quiz
23rd of 139 rated electronics manufacturers
Job description
WHAT YOU DO AT AMD CHANGES EVERYTHING
At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
Aboutthe Role
Weare seeking aPrincipal Software Quality Engineertoserve as the senior technical leader forROCm software validationacrosscompute workloads and server-class systems. Inthis individual-contributor leadership role, you will definehowAMD provesROCm is ready to ship— from unit andcomponenttesting, through full-stack workload validation, to multi-node system-level qualification on AMD Instinct™ GPU platforms. Youwill set the technical direction for validation strategy, build and evolve the test infrastructure thatgates everyROCm release, and personally drive the hardestdebugging, characterization, and qualification problems. Your work directly determines thequality bar experienced by hyperscalers, OEMs, sovereign-AI customers, and the open-source community runningROCm inproduction.
What You Will Do
Ownthe end-to-end validation architecturefor ROCm — unit, integration, framework, workload, performance, stress, stability, scale-out, and system-leveltest layers — across multiple GPU generations and server platforms.
Definerelease-qualification gates andexit criteriaforROCm software releases (functional coverage, performance regressions, stability hours, scale targets, RAS criteria) anddrive the org to meet them.
Lead system-level testing for server nodes— multi-GPU topologies, PCIe/InfinityFabric/xGMI, BMC/IPMI, thermal/power, firmware interactions, and multi-node fabric(Ethernet/InfiniBand/UALink) bring-up andvalidation.
Drive compute workload validation and characterization— LLM training andinference(PyTorch, vLLM, Triton, JAX), recommender systems, scientific HPC kernels, MLPerf-class benchmarks— establishing reproducible methodology, baselines, and regression tracking.
Architect thetest infrastructure— distributed test runners, GitHub Actions/ Jenkins / internal CI fleets, hardware lab orchestration, resultdatalakes, flaky-test detection, bisectionautomation, and self-servicedeveloper pre-submit pipelines.
Champion modern, agile quality engineering— shift-left testing, test pyramids, contract testing betweenlayers, hermetic test environments, deterministic reproducers, and continuous validation intrunk.
Setthe bar for GitHub-based quality workflows— PR gatingpolicy, requiredchecks, code-coverage standards, bug-bashandtriage cadences, and disciplined issue management acrossROCm/*repositories and partner upstream projects.
Lead complex escalationdebug— partner with development, hardware, firmware, and customer-facing teams to root-cause the hardest multi-day, multi-node, multi-component failures andconvert findings into durable test coverage.
Influence the roadmap— work with product management, silicon, platform, and softwarearchitecture to ensure validation readiness fornext-generation Instinct GPUs and serverplatformsbeforetape-inmilestones and silicon arrival.
Mentor and elevateSenior and Staff validation engineers, SDETs, and SQA leads; raise the technical bar through designreview, code review, and written guidance.
RepresentROCm validation externally— strategic customerengagements, OEM qualification programs, and open-source community quality initiatives.
Minimum Qualifications
Strongl softwareengineering experience withastrong validation, SDET, or quality-engineering focus, including5+ years in a senior IC role(Staff/Principal/PMTS or equivalent) leading validation of complex systems software.
BS/MS/PhDin Computer Science, Computer Engineering, orrelated discipline (or equivalent demonstrated experience).
Expert-levelPythonfortest automation and infrastructure; strongC++for debugging, and extending productioncode paths undertest.
Deep, demonstrable validation experience inat least twoof the following domains:
GPU compute software stacks(ROCm, CUDA, oneAPI, SYCL)
Deep-learning frameworks andinference engines (PyTorch, TensorFlow, JAX, Triton, vLLM)
HPC/ parallel runtimes andcommunication libraries (MPI, RCCL/NCCL, UCX, Libfabric)
Linux kernel, GPU drivers, or accelerator firmware
Distributed systems and large-scale cluster software
System-level validation forserver-class compute nodes— multi-GPU, multi-node, fabric-attached environments — including stress/stability, soak, fault-injection, and RAS testing.
Proven, hands-on experience workingefficiently in an agenticAI engineering environment— daily, productionuseofLLM-based coding agents(e.g., Cursor, Claude Code, Copilot Workspace, Codex-class agents) andorchestration frameworks forrealengineering work, withdemonstrableproductivity, quality, or coverage gains attributable to thoseworkflows. Comfort designing prompts, tool/MCP integrations, evaluation harnesses, and guardrails for autonomous and semi-autonomous agents.
Hands-on experience defining and shippingrelease qualification programsfor software consumedby hyperscalers, OEMs, or otherTier-1 customers.
Mastery ofGitHub atscaleforquality engineering — PR gating, GitHub Actions, self-hosted runners, requiredstatuschecks, releasetagging, and open-source contribution andtriage norms.
Strong commandofmodern, agile software developmentpractices— trunk-based development, CI/CD, shift-left testing, observability, feature flags, andincremental delivery— applied specifically to validation organizations.
Excellent written and verbal communication — able to author crisp test plans, qualification reports, RFCs, and post-mortems, and to influence development teams without authority.
Preferred Qualifications
Direct contributions to validation, CI, or test infrastructure forROCm,PyTorch,LLVM,Triton,vLLM, or comparable upstream open-source projects.
Demonstrated leadership inagentic-AI adoption— builtor rolled out agent-based workflows across an engineering team (e.g., autonomous test generation, AI-driven log/triage pipelines, multi-agent debugsystems, MCP serverdesign, retrieval-augmented engineering knowledge bases) with measurable outcomes.
Experience operating or validatinglarge GPU clusters (256+ GPUs)— fabric bring-up, cluster health monitoring, and fleet-level diagnostics.
Familiarity withTraining/Inference/HPC industry-standard benchmark methodologies andsubmissions.
Backgroundin performance validation: roofline analysis, profiler tooling (rocprof, Omniperf, Nsight-class), regression detection
Experience withfaultinjection, RAS, telemetry, and long-haul stabilityprograms for accelerator platforms.
Familiarity with hardware lab automation: BMC/IPMI/Redfish, PDU control, serial-console capture, automated re-imaging, and topology-aware test scheduling.
Prior experience standing up validation forpre-silicon / emulation / first-silicon bring-upof accelerators.
Why This Role
ROCm powers AIand HPC workloads onAMD Instinct GPUs atthe largest scale inthe industry. The quality of every ROCm release is felt acrossmillions of GPUs in production — and the validation organization iswhatstandsbetween "code complete" and "customerready." AsPrincipal MTS for ROCm Validation, you will define thatbar, build the systems thatenforce it, and personally lead the toughest qualification problems on AMD's moststrategicplatforms.
#LI-TC1
#Hybrid
AMD is an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.
Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.
AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here.
This posting is for an existing vacancy.
Qualifications:Benefits offered are described: AMD benefits at a glance.
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.
AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here.
This posting is for an existing vacancy.
Education:UNAVAILABLEEmployment Type: FULL_TIMEAbout Advanced Micro Devices
Sourced by ZipRecruiter
Industry
Computer and electronic product manufacturing
Company size
5,001 - 10,000 Employees
Headquarters location
Sunnyvale, CA, US
Year founded
1969