1

Gpu Programming Jobs in Indiana (NOW HIRING)

Lead Engineer

Indianapolis, IN · On-site

$97.90K - $129K/yr

... GPU-accelerated, cloud-native LLM serving platform using containerized microservices deployed on Kubernetes. · Design systems that support low-latency, high-throughput inference while maintaining ...

The region supports growing demand for GPU computing infrastructure. About Introl Introl stands ... Our network of 1,000+ field engineers operates globally, tackling the most complex deployments in ...

Sr. Platform Developer

Fishers, IN · On-site

$51 - $67.50/hr

Build and support GPU-accelerated and edge-to-cloud workflows leveraging NVIDIA technologies ... Improve developer experience through tooling, CI/CD pipelines, observability, and automation

Be Your Best - You will learn about new technologies, AI/ML based HPC, large scale GPU clustering ... as a Linux OS/ Platform Engineer * Demonstrated experience leading a global large-scale ...

$100.10K - $131.30K/yr

Provides developer-friendly tools and practices that make ML operations efficient-because ... Familiarity with ML workflows and GPU infrastructure management-you understand what researchers ...

Hands-on experience in HPC platforms, including knowledge of accelerators (e.g., GPU), HPC ... Strong programming and scripting skills in languages such as Python or Bash. Basic Requirements:

Hands-on experience in HPC platforms, including knowledge of accelerators (e.g., GPU), HPC ... Strong programming and scripting skills in languages such as Python or Bash. Basic Requirements:

$195K/yr

As a Senior Software Research Engineer, you will work at the intersection of computer vision ... Comfortable with GPU-accelerated workflows. Some experience in integrating LLM frontier models with ...

$180K - $300K/yr

Nvidia GPU drivers, and operators * OTel, Prometheus What We're Looking For * Experience building or operating large-scale training platforms * Worked with large scale compute clusters (GPUs)

next page

Showing results 1-20

Gpu Programming information

See Indiana salary details

$31.4K

$61.8K

$90.9K

How much do gpu programming jobs pay per year?

As of May 30, 2026, the average yearly pay for gpu programming in Indiana is $61,827.00, according to ZipRecruiter salary data. Most workers in this role earn between $48,100.00 and $76,100.00 per year, depending on experience, location, and employer.

What is a GPU Programming job?

A GPU Programming job involves writing and optimizing code to run on Graphics Processing Units (GPUs) for parallel computing tasks. This role is commonly found in fields like machine learning, scientific computing, gaming, and data analytics. GPU programmers use languages such as CUDA, OpenCL, or Vulkan to accelerate computations and improve performance. They work closely with software engineers and data scientists to optimize algorithms for high-performance applications.

What are the key skills and qualifications needed to thrive in the Gpu Programming position, and why are they important?

To excel in GPU Programming, you need a strong background in parallel computing concepts, mathematics, and proficiency in languages such as CUDA, OpenCL, or DirectX/OpenGL, often supported by a degree in computer science, engineering, or a related field. Familiarity with NVIDIA and AMD GPU development tools, performance profilers, and possibly certifications like NVIDIA's Deep Learning Institute courses are valuable. Teamwork, effective communication, and strong problem-solving abilities are essential soft skills in this field. These competencies enable efficient development, optimization, and integration of high-performance GPU code in real-world applications.

What types of projects or applications do GPU Programmers commonly work on?

GPU Programmers are often involved in developing or optimizing software for high-performance applications such as machine learning, scientific simulations, real-time rendering in gaming and visualization, and video/image processing tools. Their daily work may include collaborating with software engineers, data scientists, and hardware teams to create efficient, scalable parallel algorithms that leverage GPU capabilities. The role frequently requires problem-solving to maximize computational efficiency and troubleshooting complex performance bottlenecks. By working across multidisciplinary teams, GPU Programmers help deliver robust solutions for data-intensive problems in areas like healthcare, finance, automotive technology, and entertainment.
What are the most commonly searched types of Gpu Programming jobs in Indiana? The most popular types of Gpu Programming jobs in Indiana are:
Infographic showing various Gpu Programming job openings in Indiana as of May 2026, with employment types broken down into 1% Internship, 97% Full Time, 1% Temporary, and 1% Nights. Highlights an 34% Physical, 2% Hybrid, and 64% Remote job distribution, with an average salary of $61,827 per year, or $29.7 per hour.

Lead Engineer

INFOSYS NOVA HOLDINGS LLC

Indianapolis, IN • On-site

$97.90K - $129K/yr

Full-time

Posted 5 days ago


Job description


Applicants must be authorized to work for ANY employer in the U.S. We are unable to sponsor or take over sponsorship of an employment Visa at this time.


Location: Indianapolis, IN (onsite 5 days a week)


Role Summary

This role is responsible for the architecture, development, and productionization of an enterprise-scale Generative AI platform designed to host, manage, and operationalize fine-tuned and open-source Large Language Models (LLMs) in highly regulated environments. The platform enables secure, performant, and compliant AI inference across internal enterprise applications, with an initial focus on pharmaceutical and life sciences use cases.

The engineer will operate at the intersection of distributed systems engineering, applied machine learning infrastructure, AI security, and MLOps, translating experimental NLP and generative AI workflows into robust, observable, and governable production services.

---

Core Responsibilities

LLM Platform Architecture & Systems Engineering

· Architect and implement a GPU-accelerated, cloud-native LLM serving platform using containerized microservices deployed on Kubernetes.

· Design systems that support low-latency, high-throughput inference while maintaining fault tolerance, horizontal scalability, and isolation across dev, test, and production clusters.

· Abstract infrastructure primitives to expose self-service model lifecycle APIs for data scientists and ML engineers.

Model Hosting, Fine-Tuning & Lifecycle Management

· Deploy and manage fine-tuned and parameter-efficient LLMs using techniques such as PEFT and LoRA.

· Implement end-to-end model versioning, promotion, rollback, and deprecation workflows.

· Support integration of multiple LLM backends (open-source and commercial) behind standardized inference interfaces.

AI Safety, Security & Runtime Guardrails

· Engineer real-time request/response inspection pipelines to analyze user prompts and model outputs for:

o Prompt injection

o Data exfiltration

o Hallucination risk

o Policy and compliance violations

· Implement multi-layer security controls embedded at ingress, orchestration, and model-serving layers.

· Ensure all model interactions are traceable, auditable, and reproducible.

Advanced Prompting, RAG & Model Evaluation

· Build and operationalize retrieval-augmented generation (RAG) pipelines integrating LLMs with enterprise document repositories and vector search backends.

· Standardize prompt engineering frameworks, contextual grounding strategies, and evaluation methodologies.

· Enable enterprise use cases including contextual Q&A, semantic search, summarization, redaction, and knowledge extraction.

Distributed Orchestration & Workflow Management

· Use workflow orchestration frameworks (e.g., Temporal.io) to manage long-running, stateful AI pipelines, including inference orchestration, evaluation, and post-processing.

· Implement asynchronous, event-driven AI workflows using gRPC-based service communication.

Infrastructure Automation & MLOps

· Standardize infrastructure provisioning using Infrastructure-as-Code (IaC) principles to ensure deterministic, repeatable deployments.

· Automate CI/CD pipelines for model artifacts, prompts, and platform services.

· Enable dynamic resource allocation, GPU scheduling, and zero/low-downtime upgrades.

Observability, Monitoring & Reliability Engineering

· Design and implement observability pipelines collecting:

o Model latency and throughput

o Token usage and cost metrics

o Security violations and guardrail triggers

o Drift, degradation, and anomalous behavior

· Establish Service Level Objectives (SLOs) and reliability targets for LLM inference services.

· Enable proactive debugging, capacity planning, and performance optimization.

Enterprise Governance & Access Control

· Integrate the platform with internal policy enforcement systems, IAM, and role-based access controls (RBAC).

· Ensure generative outputs comply with enterprise governance frameworks, regulatory requirements, and ethical guidelines.

· Maintain detailed audit logs to support compliance and validation in regulated environments.

Framework Reusability & Cross-Functional Enablement

· Develop reusable platform components enabling collaboration across data science, DevOps, and product teams.

· Provide standardized interfaces and SDKs for downstream applications to consume AI services.

· Serve as a technical bridge between AI research experimentation and enterprise-grade production systems.

---

Technical Requirements

· Strong experience designing and operating distributed cloud-native systems.

· Hands-on expertise deploying LLMs in production with performance, scalability, and security constraints.

· Deep understanding of container orchestration (Kubernetes) and GPU-enabled workloads.

· Experience implementing real-time inference services, API gateways.

· Proven ability to design systems meeting compliance, auditability, and governance requirements.

---

Preferred Experience

· Experience in pharmaceutical, healthcare, or highly regulated enterprise environments.

· Exposure to AI security, prompt-risk mitigation, and regulated AI deployment.

· Experience translating NLP research and generative modeling techniques into production platforms.

· Strong collaboration skills with data scientists, ML engineers, SREs, and product teams.

---

Technology Stack

· Languages: Python, Go

· AI / ML: LLMs (AWS Bedrock, Azure OpenAI, Google Vertex ), Prompt Engineering, RAG, PEFT, LoRA

· Platform & Infrastructure: Kubernetes, GPU acceleration, Infrastructure-as-Code

· Distributed Systems: gRPC, Temporal.io, Argo, Flux

· Storage: S3 / S3-compatible object storage

· AI Tooling: OpenAI Agents SDK, Langgraph

· Observability: Prometheus, Grafana

· Libraries: Redis, Langchain, FastAPI


Kaleidoscope, an Infosys Company, is an equal opportunity employer, and all qualified applicants will receive consideration without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, spouse of protected veteran, or disability.