1

Vllm Jobs in Indiana (NOW HIRING)

Python web/API development (FastAPI, Flask, Django) Local AI model stacks (vLLM, LiteLLM, Ollama); reverse proxies (Caddy, Nginx, Traefik); vector databases (pgvector, Qdrant, Milvus, Weaviate) LLM ...

LLM serving platforms (vLLM, Text Generation Inference, FastAPI); Model quantization for LLMs (GPTQ, AWQ, bitsandbytes); GPU memory optimization techniques (tensor parallelism, pipeline parallelism)

LLM serving platforms (vLLM, Text Generation Inference, FastAPI); Model quantization for LLMs (GPTQ, AWQ, bitsandbytes); GPU memory optimization techniques (tensor parallelism, pipeline parallelism)

Vllm information

How does a VLLM (Very Large Language Model) Engineer typically collaborate with data scientists and product teams during model deployment?

VLLM Engineers work closely with data scientists to understand the specific requirements and fine-tuning needs of large-scale language models. They are often responsible for integrating these models into production systems, ensuring scalability and efficiency. Collaboration with product teams is crucial to align model capabilities with user needs and to troubleshoot real-world application challenges. Frequent communication and agile workflows are common, as updates or optimizations may be needed rapidly based on feedback from both teams.

What is a VLLM and what do they do?

VLLM stands for 'Virtual Large Language Model.' In the context of AI development, VLLM professionals work with optimized inference engines for large language models, enabling faster and more efficient deployment of AI models in production environments. Their responsibilities often include integrating LLMs into applications, optimizing model performance, and ensuring scalability for real-time use cases. They may also collaborate with data scientists and engineers to manage resources and streamline AI workflows.

What is the difference between Vllm vs Data Analyst?

AspectVllmData Analyst
Required CredentialsTypically requires knowledge of machine learning, AI, and programming languages like Python or RRequires skills in statistics, Excel, SQL, and data visualization tools
Work EnvironmentOften in tech companies, research labs, or AI-focused teamsCommonly in business, finance, healthcare, and marketing sectors
Industry UsageEmerging role in AI and machine learning projectsEstablished role in data-driven decision making
Common Search/ComparisonVllm vs Data Analyst

The main difference between Vllm and Data Analyst lies in their focus and skill set. Vllm professionals specialize in AI and machine learning models, often working in tech environments, while Data Analysts focus on interpreting data to inform business decisions. Both roles require analytical skills, but Vllm roles demand programming and AI expertise, whereas Data Analysts emphasize statistical analysis and data visualization.

What are the key skills and qualifications needed to thrive as a Machine Learning Engineer working with vLLM, and why are they important?

To thrive as a Machine Learning Engineer specializing in vLLM (a high-throughput LLM inference library), you need a strong understanding of machine learning principles, deep learning frameworks, and experience with Python programming. Familiarity with tools like PyTorch, CUDA, distributed computing, and cloud platforms, as well as relevant certifications in ML or data engineering, is highly valuable. Strong problem-solving, collaboration, and communication skills are essential for optimizing model performance and integrating with cross-functional teams. These capabilities ensure effective deployment and scaling of large language models, driving innovation and efficiency in AI applications.

Artificial Intelligence Specialist

L3HHCM20

Fort Wayne, IN • On-site, Remote

$109K - $203K/yr

Other

Medical, Retirement, PTO

Posted 11 days ago


Job description

Job Title: Artificial Intelligence Specialist

Job Code: 38585

Job Location: Palm Bay, FL OR Fort Wayne, IN OR Rochester, NY OR Chantilly, VA OR Waco, TX OR Camden, NJ OR Colorado Springs, CO OR Greenville, TX OR Herndon, VA

Job Schedule: 9/80 (Every other Friday off)

Job Description:

L3Harris is seeking an AI Specialist to support the Space and Mission Systems (SMS) segment AI team serving all SMS sectors. The AI Specialist will design, deploy, and maintain AI-enabled applications both in unclassified and secure containerized environments. This is an ideal opportunity for an engineer pursuing AI/ML application engineering, platform engineering, and DevSecOps in a mission-critical defense environment.

Qualifications:

Ability to obtain and maintain a DoD Secret clearance is required
Bachelor's Degree in Computer Science, Computer Engineering, Electrical Engineering, Systems Engineering, or related technical field and a minimum of 6 years of prior relevant experience; Or, Graduate Degree with a minimum of 4 years of prior related experience
4+ years of experience in at least one programming language (Python preferred)

Preferred Additional Skills:

Working knowledge of Linux (RHEL/AlmaLinux preferred; Ubuntu acceptable)
Hands-on experience with containerization (Docker/Podman), Git (Bitbucket/GitLab), and basic SQL / relational databases (PostgreSQL, MySQL, etc.)
Familiarity with at least one major cloud provider (AWS or Azure) and Agile tools (Jira, Confluence)
Ability to comply with export-controlled (ITAR/EAR) and Controlled Unclassified Information (CUI) handling requirements
LLMs and Generative AI; RAG patterns and agentic frameworks (LangGraph); Python web/API development (FastAPI, Flask, Django)
Local AI model stacks (vLLM, LiteLLM, Ollama); reverse proxies (Caddy, Nginx, Traefik); vector databases (pgvector, Qdrant, Milvus, Weaviate)
LLM evaluation tooling (RAGAS, DeepEval, promptfoo); observability (LangSmith, Phoenix); coursework or project experience in machine learning, NLP, or deep learning
GitOps/DevSecOps concepts and toolchains; CI/CD authoring (Bitbucket Pipelines, GitLab CI); JFrog Artifactory; Microsoft Entra ID app registrations and Graph API in GCC High
Secure software development in regulated/mission-critical environments; familiarity with the NIST AI Risk Management Framework (AI RMF) or OWASP Top 10 for LLM Applications
Experience deploying AI in classified or highly regulated environments, including familiarity with AI ATO processes, data governance requirements, and secure AI infrastructure (on-prem, air-gapped, or GovCloud)
Direct experience with AI-assisted software development tools and agentic coding frameworks (e.g., Claude Code, GitHub Copilot, Cursor, Codex) including measuring productivity outcomes
Background in defense, aerospace, or adjacent mission-critical industries, with experience navigating the unique constraints of delivering AI in program-of-record environments

In compliance with pay transparency requirements, the salary range for this role in California, Massachusetts, New Jersey, Washington, and the Greater D.C, Denver, or NYC areas is $109,500-$203,500. The salary range for this role in Colorado state, Hawaii, Illinois, Maryland, Minnesota, New York state, and Vermont is $95,000-$177,000. This is not a guarantee of compensation or salary, as final offer amount may vary based on factors including but not limited to experience and geographic location. L3Harris also offers a variety of benefits, including health and disability insurance, 401(k) match, flexible spending accounts, EAP, education assistance, parental leave, paid time off, and company-paid holidays. The specific programs and options available to an employee may vary depending on date of hire, schedule type, and the applicability of collective bargaining agreements.

The application window for this requisition is anticipated to close September 01, 2026.

#LI-CG1