1

Internship Inference Jobs (NOW HIRING)

OR · On-site

$122K - $161K/yr

The compiler must deliver leading inference performance, fast build time, reduced memory footprints ... A track record of success in mentoring early-career engineers and interns is a bonus. * Track ...

OR · On-site

$122K - $161K/yr

The compiler must deliver leading inference performance, fast build time, reduced memory footprints ... A track record of success in mentoring early-career engineers and interns is a bonus. * Track ...

next page

Showing results 1-20

Internship Inference information

See salary details

$9

$17

$23

How much do internship inference jobs pay per hour?

As of Jun 6, 2026, the average hourly pay for internship inference in the United States is $17.31, according to ZipRecruiter salary data. Most workers in this role earn between $14.42 and $19.23 per hour, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as an Inference Intern, and why are they important?

To thrive as an Inference Intern, you generally need a strong background in machine learning, statistics, and programming, often supported by coursework or a degree in computer science or related fields. Familiarity with frameworks like TensorFlow or PyTorch, experience with model deployment tools, and knowledge of cloud platforms such as AWS or GCP are commonly required. Strong analytical thinking, problem-solving abilities, and effective communication help interns contribute meaningfully to research and team projects. These skills are crucial for successfully developing, testing, and deploying inference models in real-world applications.

What is an Internship Inference?

An Internship Inference typically refers to the process of drawing conclusions or gaining insights from the experiences and performance of interns during their internship period. This may involve evaluating an intern's skills, adaptability, and contributions to assess their suitability for future roles or projects. Companies use internship inference to inform hiring decisions, provide feedback, and improve internship programs. The process can also help interns understand their strengths and areas for development.

What types of projects and responsibilities can I expect during an Internship in Inference, and how do these experiences contribute to professional growth?

As an intern focusing on inference, you will typically work on projects involving the deployment, optimization, and evaluation of machine learning models, often supporting a research or engineering team. Responsibilities may include running model benchmarks, improving inference speed or accuracy, and assisting with integration of models into production environments. These tasks provide hands-on experience with real-world data and infrastructure, allowing you to develop technical skills and collaborate closely with data scientists and engineers. Such exposure not only enhances your understanding of applied machine learning but also builds a strong foundation for future roles in AI and data science.
Research Intern - Systems For Efficient AI

Research Intern - Systems For Efficient AI

Microsoft Corporation

Redmond, WA • On-site

$8K - $14K/mo

Internship

This job post has expired today. Applications are no longer accepted.


Microsoft rating

8.6

Company rating: 8.6 out of 10

Based on 125 frontline employees who took The Breakroom Quiz

47th of 186 rated software companies


Job description

Overview

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.

AI workloads are growing at an unprecedented pace, and inference has become one of the most critical challenges in modern computing. Large-scale models demand massive compute resources, and the diversity of hardware across cloud and edge adds complexity. Achieving low latency and high throughput while controlling cost requires rethinking the entire inference stack-from algorithms to infrastructure.

Within our Systems Innovation research group (https://www.microsoft.com/en-us/research/group/systems-innovation/) , we pursue a full stack approach towards AI inference. We closely collaborate with multiple research teams and product groups across the globe. Some of the research problems we are currently working on (https://www.microsoft.com/en-us/research/group/m365-research/) are related to request scheduling/batching mechanisms, KV caching optimizations, LLM inference optimizations, and GPU fleet orchestration.

We are looking for Research Interns to help advance the state of the art of systems for efficient AI. The ideal candidate will have background in systems for AI, including end-to-end AI inference pipelines, request scheduling and batching mechanisms, performance optimizations for AI inference, and KV caching mechanisms.

Responsibilities

Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world's best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.

Qualifications

Required Qualifications

  • Accepted or currently enrolled in a PhD program in Computer Science, Software Engineering, Electrical Engineering, or a related STEM field.??

Other Requirements

  • Research Interns are expected to be physically located in their manager's Microsoft worksite location for the duration of their internship.
  • In addition to the qualifications below, you'll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter.

Preferred Qualifications

  • Experience with LLM architectures, systems for LLM inference, and/or AI hardware.
  • Experience with GPUs and understanding of CUDA/ROCm frameworks.
  • Experience with computer systems and/or networks.
  • Experience in conducting research and writing peer-reviewed publications.
  • Proficient written and verbal communication skills. ?
  • Be able to work in a cross-functional and multi-disciplinary setting across research and product.??
  • Proficient software development skills, preferably in C++ and Python.

The base pay range for this internship is USD $6,710 - $13,270 per month. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $8,760 - $14,360 per month.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-intern-pay

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations. (https://careers.microsoft.com/v2/global/en/accessibility.html)


What Microsoft employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom


Microsoft logo

About Microsoft

Sourced by ZipRecruiter

Our infrastructure is comprised of a large global portfolio of more than 100 datacenters and 1 million servers. Our foundation is built upon and managed by a team of subject matter experts working to support services for more than 1 billion customers and 20 million businesses in over 90 countries worldwide. With environmental sustainability and optimization at the forefront of our datacenter design and operations, we continue to grow and evolve as we meet the ever-changing business demands that hold Microsoft as a world-class cloud provider.

Industry

Computer and computer peripheral equipment and software wholesalers

Company size

10,000+ Employees

Headquarters location

Redmond, WA, US

Year founded

1975

Social media