vLLM Jobs in Indiana (NOW HIRING)

Gen AI Engineer

LLM serving platforms (vLLM, Text Generation Inference, FastAPI); Model quantization for LLMs (GPTQ, AWQ, bitsandbytes); GPU memory optimization techniques (tensor parallelism, pipeline parallelism)

Elevance Health

Gen AI Engineer

Indianapolis, IN · Hybrid

Deloitte

Research Engineer - Post-Training & Small Language Models (SLMs), Healthcare AI

Indianapolis, IN

Optimize inference performance - latency, throughput, quantization, and deployment efficiency - for production, including frameworks such as vLLM, TensorRT-LLM, or TGI. Small language models & open ...

vLLM information

How does an engineer working with vLLM typically collaborate with data scientists and product teams during model deployment?

Engineers who work with vLLM, an open-source inference and serving engine for large language models, work closely with data scientists to understand each model's requirements and fine-tuning needs. They are often responsible for integrating these models into production systems and for keeping them scalable and efficient. Collaboration with product teams is crucial to align model capabilities with user needs and to troubleshoot problems that appear in real-world use. Frequent communication and agile workflows are common, because updates or optimizations may be needed quickly in response to feedback from both teams.

What is vLLM and what do engineers who work with it do?

vLLM is an open-source, high-throughput inference and serving engine for large language models (LLMs); it is a piece of software, not a job title. Engineers who work with it use optimized inference to deploy LLMs in production environments faster and more efficiently. Their responsibilities often include integrating LLMs into applications, optimizing model performance, and ensuring scalability for real-time use cases. They may also collaborate with data scientists and other engineers to manage resources and streamline AI workflows.

What is the difference between a vLLM-focused engineer and a Data Analyst?

Aspect	vLLM-focused engineer	Data Analyst
Required Credentials	Typically requires knowledge of machine learning, AI, and programming languages like Python or R	Requires skills in statistics, Excel, SQL, and data visualization tools
Work Environment	Often in tech companies, research labs, or AI-focused teams	Commonly in business, finance, healthcare, and marketing sectors
Industry Usage	Emerging role in AI and machine learning projects	Established role in data-driven decision making

The main difference lies in focus and skill set. Engineers who work with vLLM specialize in deploying and serving AI and machine learning models, often in tech environments, while Data Analysts focus on interpreting data to inform business decisions. Both roles require analytical skills, but vLLM-related roles demand programming and AI expertise, whereas Data Analyst roles emphasize statistical analysis and data visualization.

What are the key skills and qualifications needed to thrive as a Machine Learning Engineer working with vLLM, and why are they important?

To thrive as a Machine Learning Engineer specializing in vLLM (a high-throughput LLM inference library), you need a strong understanding of machine learning principles, deep learning frameworks, and experience with Python programming. Familiarity with tools like PyTorch, CUDA, distributed computing, and cloud platforms, as well as relevant certifications in ML or data engineering, is highly valuable. Strong problem-solving, collaboration, and communication skills are essential for optimizing model performance and integrating with cross-functional teams. These capabilities ensure effective deployment and scaling of large language models, driving innovation and efficiency in AI applications.

vLLM jobs near you

Gen AI Engineer

Elevance Health

Indianapolis, IN • Hybrid

Full-time

Medical, Dental, Vision, Life, Retirement, PTO

Posted 26 days ago

Elevance Health rating

7.7

Based on 348 frontline employees who took The Breakroom Quiz

198th of 299 rated insurance

Job description

Anticipated End Date: 2026-07-24

Position Title: Gen AI Engineer

Job Description:

Gen AI Engineer

Location: This role requires associates to be in-office 1 - 2 days per week, fostering collaboration and connectivity, while providing flexibility to support productivity and work-life balance. This approach combines structured office engagement with the autonomy of virtual work, promoting a dynamic and adaptable workplace. Alternate locations may be considered if candidates reside within a commuting distance from an office.

Please note that per our policy on hybrid/virtual work, candidates not within a reasonable commuting distance from the posting location(s) will not be considered for employment, unless an accommodation is granted as required by law.

PLEASE NOTE: This position is not eligible for current or future visa sponsorship.

The Gen AI Engineer is responsible for analyzing and modeling organizational data for the Artificial Intelligence (AI) function to draw business insights, which can be used to make business decisions.

How You Will Make an Impact:

Apply data extraction, transformation, and loading techniques to connect large data sets from a variety of sources.
Define LLM development and fine-tuning strategies, best practices, and standards to enhance AI/ML model deployment and monitoring efficiency.
Develop roadmap and strategy for NLP, LLM, and Gen AI model development and lifecycle implementation.
Design and develop custom ML, Gen AI, NLP, and LLM models for batch and stream processing-based AI/ML pipelines, including data ingestion, preprocessing modules, search and retrieval, Retrieval Augmented Generation (RAG), and NLP/LLM model development, and ensure the end-to-end solution meets all technical and business requirements and SLA specifications.
Work closely with the MLOps team to create and maintain robust evaluation solutions and tools to evaluate model performance, accuracy, consistency, and reliability during development and UAT.
Identify and implement model optimizations to improve system efficiency.
Collaborate closely with MLOps, product teams, business stakeholders, machine learning engineers, and software engineers to deploy machine learning models into production environments, ensuring smooth integration, reliability, and scalability.
Ensure the use of standards, governance, and best practices in ML model development, and adherence to model and data governance standards.

Minimum Requirements:

Requires a Bachelor's degree in a highly quantitative field (Computer Science, Machine Learning, Operational Research, Statistics, Mathematics, etc.) or equivalent degree and 4 or more years of experience; or any combination of education and experience in configuration management, which would provide an equivalent background.

Preferred Skills, Capabilities, and Experiences:

Advanced Python proficiency.
4+ years of professional hands-on experience leveraging large sets of structured and unstructured data to develop data-driven tactical and strategic analytics and insights using ML, NLP, and computer vision solutions.
Demonstrated 4+ years hands-on experience with Python, SQL, Hugging Face, TensorFlow, Keras, PyTorch, and Spark.
Experience with GCP/AWS cloud platforms.
Strong knowledge of and measurable hands-on experience with developing or tuning Large Language Models (LLM) and Generative AI (GAI).
Experience with NLP, LLMs (extractive and generative), fine-tuning and LLM model development.
Experience developing and optimizing high-quality prompts for NLP applications.
Excellent written & verbal communication and stakeholder management skills.
4+ years of project leadership experience, including Agile project management and the Scaled Agile Framework (SAFe).
LLM Infrastructure & Deployment: LLM serving platforms (vLLM, Text Generation Inference, FastAPI); Model quantization for LLMs (GPTQ, AWQ, bitsandbytes); GPU memory optimization techniques (tensor parallelism, pipeline parallelism); LLM caching strategies for inference optimization; RAG architecture design and implementation.
Advanced cloud infrastructure (AWS EKS/ECS, GCP GKE, Azure AKS) knowledge.
Containerization strategies for ML workloads; Canary deployments for ML models.

Job Level: Non-Management Exempt

Workshift: 1st Shift (United States of America)

Job Family: IFT > Artificial Intelligence

Please be advised that Elevance Health only accepts resumes for compensation from agencies that have a signed agreement with Elevance Health. Any unsolicited resumes, including those submitted to hiring managers, are deemed to be the property of Elevance Health.

Who We Are

Elevance Health is a health company dedicated to improving lives and communities - and making healthcare simpler. We are a Fortune 25 company with a longstanding history in the healthcare industry, looking for leaders at all levels of the organization who are passionate about making an impact on our members and the communities we serve.

How We Work

At Elevance Health, we are creating a culture that is designed to advance our strategy but will also lead to personal and professional growth for our associates. Our values and behaviors are the root of our culture. They are how we achieve our strategy, power our business outcomes and drive our shared success - for our consumers, our associates, our communities and our business.

We offer a range of market-competitive total rewards that include merit increases, paid holidays, Paid Time Off, and incentive bonus programs (unless covered by a collective bargaining agreement), medical, dental, vision, short and long term disability benefits, 401(k) +match, stock purchase plan, life insurance, wellness programs and financial education resources, to name a few.

Elevance Health operates in a Hybrid Workforce Strategy. Unless specified as primarily virtual by the hiring manager, associates are required to work at an Elevance Health location at least once per week, and potentially several times per week. Specific requirements and expectations for time onsite will be discussed as part of the hiring process.

The health of our associates and communities is a top priority for Elevance Health. We require all new candidates in certain patient/member-facing roles to become vaccinated against COVID-19 and Influenza. If you are not vaccinated, your offer will be rescinded unless you provide an acceptable explanation. Elevance Health will also follow all relevant federal, state and local laws.

Elevance Health is an Equal Employment Opportunity employer, and all qualified applicants will receive consideration for employment without regard to age, citizenship status, color, creed, disability, ethnicity, genetic information, gender (including gender identity and gender expression), marital status, national origin, race, religion, sex, sexual orientation, veteran status or any other status or condition protected by applicable federal, state, or local laws. Applicants who require accommodation to participate in the job application process should submit the following form: Accessibility Accommodation Request Form and a member of the team will be in contact. Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws, including, but not limited to, the Los Angeles County Fair Chance Ordinance and the California Fair Chance Act.

Prospective employees required to be screened under Florida law should review the education and awareness resources at HB531 | Florida Agency for Health Care Administration.

NOTE: Workday keeps job postings active through 11:59:59 PM on the day before the listed end date. Example: If the end date is 3/13, the posting will automatically come down on 3/12 at 11:59:59 PM. In other words - the job is posted until 3/13, not through 3/13.

About Elevance Health

Sourced by ZipRecruiter

Elevance Health is a health company dedicated to improving lives and communities - and making healthcare simpler. A Fortune 20 company with a longstanding history in the healthcare industry, we are looking for leaders at all levels of the organization who are passionate about making an impact on our members and the communities we serve. You will thrive in a complex and collaborative environment where you take action and ownership to solve problems and lead change. Do you want to be part of a larger purpose and an evolving, high-performance culture that empowers you to make an impact?

Industry: Health care and social assistance

Company size: 10,000+ Employees

Headquarters location: Indianapolis, IN, US

Year founded: 2004

Website: elevancehealth.com

What Elevance Health employees say

Pay

Most people get paid breaks

Most people get paid when they’re sick

The job rarely spills into unpaid time

Benefits

Sick days use up paid time off

Most people say they can afford the health insurance

Most people get paid time off

Hours and flexibility

Less than 4 weeks notice of work schedule

Most people don’t worry about their hours

Only some people can choose their shifts

Workplace

Most people feel treated with respect

Most people get breaks without interruption

Most people are stressed out
