1

Llm Developer Jobs in Raleigh, NC (NOW HIRING)

AI/ML Tech Lead

Raleigh, NC · On-site

$140K - $180K/yr

Familiarity with AI/ML concepts, especially LLMs, prompt engineering, and AI agents. Understanding of LangChain, Agentic AI, or similar LLM orchestration frameworks . Experience integrating with AI ...

AI/ML Lead Architect

Raleigh, NC · Hybrid

$53.75 - $73.75/hr

Familiarity with AI/ML concepts, especially LLMs, prompt engineering, and AI agents. Understanding of LangChain, Agentic AI, or similar LLM orchestration frameworks . Experience integrating with AI ...

... Developer to join their AI Engineering team. The role involves architecting and developing enterprise-grade AI systems using Python and modern LLM frameworks while mentoring engineers and driving ...

Serve as a go-to authority on AI/LLM security for senior engineering and product leadership ... Mentor the next generation of security engineers and raise the bar across the team What We're ...

... LLM-based solutions), and refine algorithms to meet business needs. • Review plan for smooth ... DevOps and IaC tooling (GitHub Actions, Jenkins, Terraform, Helm, Kubernetes) and awareness of ...

Mentor and grow a multidisciplinary team of LLM-focused Data Scientists and ML Engineers. * Drive cross-functional collaboration with Legal SMEs, Data Engineers, Product Managers, and Design.

next page

Showing results 1-20

Llm Developer information

See Raleigh, NC salary details

$24

$48

$77

How much do llm developer jobs pay per hour?

As of May 30, 2026, the average hourly pay for llm developer in Raleigh, NC is $48.76, according to ZipRecruiter salary data. Most workers in this role earn between $38.32 and $59.13 per hour, depending on experience, location, and employer.

What does an LLM Developer do?

An LLM Developer designs, fine-tunes, and implements large language models (LLMs) for various applications, such as chatbots, content generation, and AI-driven tools. They work with machine learning frameworks, optimize model performance, and ensure efficient deployment. This role requires expertise in natural language processing (NLP), deep learning, and programming languages like Python.

What are the key skills and qualifications needed to thrive in the Llm Developer position, and why are they important?

To excel as an LLM Developer, you need strong expertise in natural language processing (NLP), deep learning frameworks, and programming languages such as Python, typically supported by a degree in computer science or a related field. Familiarity with machine learning libraries (like TensorFlow or PyTorch), cloud computing platforms, and experience with prompt engineering or fine-tuning large language models is crucial. Excellent problem-solving abilities, collaboration, and effective communication skills help you design solutions and work efficiently within multidisciplinary teams. These qualifications are essential for successfully building, deploying, and optimizing large language models that drive impactful AI applications.

What are the typical daily tasks and responsibilities of an LLM Developer?

As an LLM Developer, your daily responsibilities often include designing, fine-tuning, and evaluating large language models to meet specific application needs. You may work on tasks such as data preprocessing, model training, performance benchmarking, and error analysis, frequently collaborating with data scientists, research engineers, and product managers. Keeping up to date with the latest advancements in NLP and integrating new techniques into production models is also a key part of the role. These tasks are usually performed in a team-oriented environment where clear communication and iterative experimentation are highly valued.

What is the salary of LLM engineer?

The salary of an LLM (Large Language Model) engineer typically ranges from $100,000 to $180,000 annually, depending on experience, location, and company size. Senior roles or those with specialized skills in deep learning and NLP can earn higher compensation, often exceeding $200,000 with bonuses and stock options.
What are popular job titles related to Llm Developer jobs in Raleigh, NC? For Llm Developer jobs in Raleigh, NC, the most frequently searched job titles are:
What job categories do people searching Llm Developer jobs in Raleigh, NC look for? The top searched job categories for Llm Developer jobs in Raleigh, NC are:
What cities near Raleigh, NC are hiring for Llm Developer jobs? Cities near Raleigh, NC with the most Llm Developer job openings:
Forward Deployed Engineer- Agentic AI

Forward Deployed Engineer- Agentic AI

Deloitte

Raleigh, NC

Other

Posted 9 days ago


Deloitte rating

8.1

Company rating: 8.1 out of 10

Based on 86 frontline employees who took The Breakroom Quiz

59th of 138 rated financial services


Job description

At Deloitte, Forward Deployed Engineers (FDE) don't just build AI solutions, they help clients turn AI ambition into enterprise-scale impact, pairing leading class engineering with pod-based delivery and vertical expertise. If you thrive at the intersection of product, engineering, problem-solving, and client impact, this role puts you at the forefront of AI transformations.

Work you'll do

As an Agentic AI FDE, you will design, build, and operationalize LLM-powered systems that can reason, plan, retrieve information, use tools, and execute multi-step workflows reliably. You will work on the "thinking layer" of AI systems: agent architecture, tool orchestration, memory and context management, retrieval pipelines, evaluation, and observability. You will help shape how complex domain knowledge is transformed into usable AI behavior, with a high bar for precision, traceability, and maintainability. Additional responsibilities include:

Client Engagement

Embed with clients to identify business needs and translate high-value GenAI use cases into solutions.

Partner with leaders, product owners, architects, and engineers to align priorities and delivery.

Lead working sessions to shape solutions and drive client outcomes.

Prototype and deliver working AI solutions using industry expertise and emerging capabilities.

Contribute independently within an FDE pod while mentoring newer team members.

Solution Engineering

Build AI-enabled solutions, agentic platforms, and workflows across enterprise AI platforms.

Develop scalable AI engineering patterns, tool-use approaches, and human-in-the-loop controls.

Apply architecture decisions that balance quality, safety, latency, cost, and model risk.

Deliver production-quality code using strong practices in testing, CI/CD, logging, versioning, and documentation.

Design extensible functionality, support sprint sizing, and align solutions with senior team members.

Contribute reusable assets including code, prompt libraries, runbooks, and reference implementations.

Strong understanding of memory and context management, including context windows, retrieval-driven context assembly, persistent memory, and high-signal token selection.

Deep understanding of how LLMs behave in practice, including strengths, failure modes, hallucination risks, reasoning limitations, latency/cost trade-offs, and evaluation methods. 

Experience with Python and modern software engineering practices, including testing, CI/CD, version control, and API integration.

The team

AI & Engineering leverages cutting-edge engineering capabilities to build, deploy, and operate integrated/verticalized sector solutions in software, data, AI, network, and hybrid cloud infrastructure. These solutions are powered by engineering for business advantage, transforming mission-critical operations. We enable clients to stay ahead with the latest advancements by transforming engineering teams and modernizing technology & data platforms. Our delivery models are tailored to meet each client's unique requirements.

Required qualifications

Bachelor's degree (or equivalent) in Computer Science, Data Science or Engineering.

3+ years of experience in software engineering, data engineering, data science, or analytics engineering.

1+ years of hands-on experience building production-grade applications with LLMs, including prompt design, tool use, structured outputs, error handling, and model behavior tuning.

1+ years of experience with LangChain and especially LangGraph for orchestrating complex LLM workflows and agent behavior.

1+ years of experience designing and optimizing RAG systems end to end, including indexing, retrieval, reranking, grounding, and evaluation.

1+ years of experience with memory and context management, including context windows, retrieval-driven context assembly, persistent memory, and high-signal token selection.

Deep understanding of how LLMs behave in practice, including strengths, failure modes, hallucination risks, reasoning limitations, latency/cost trade-offs, and evaluation methods.

2+ years of experience with Python and modern software engineering practices, including testing, CI/CD, version control, and API integration.

1+ years of hands-on experience building and deploying GenAI/LLM-powered solutions in client or production environments

1+ years of experience leading project workstreams/engagements and translating business problems into AI solutions

1+ years of experience building reliable, maintainable, and well-documented code

Ability to travel 50%, on average, based on the work you do and the clients and industries/sectors you serve

Limited immigration sponsorship may be available

Preferred qualifications

Experience with cloud environments (AWS, Azure, and/or Google Cloud) and common platform services (storage, compute, IAM, networking)

Demonstrated ability to work directly alongside client technical teams and program stakeholders in fast-paced, ambiguous delivery environments

Data engineering experience with Spark, Airflow/dbt, streaming, data modeling or ML/data science background feature engineering, experimentation or model evaluation

Experience with MLOps/LLMOps practices: evaluation frameworks, model monitoring, and prompt management

Experience integrating LLM solutions with enterprise systems via APIs, microservices, or event-driven architectures

Strong understanding of memory and context management, including context windows, retrieval-driven context assembly, persistent memory, and high-signal token selection. 

Deep understanding of how LLMs behave in practice, including strengths, failure modes, hallucination risks, reasoning limitations, latency/cost trade-offs, and evaluation methods. 

Experience with Python and modern software engineering practices, including testing, CI/CD, version control, and API integration.

Experience operating within hybrid onshore/offshore teams

Familiarity with security, privacy, and compliance considerations

The wage range for this role takes into account the wide range of factors that are considered in making compensation decisions including but not limited to skill sets; experience and training; licensure and certifications; and other business and organizational needs. The disclosed range estimate has not been adjusted for the applicable geographic differential associated with the location at which the position may be filled. At Deloitte, it is not typical for an individual to be hired at or near the top of the range for their role and compensation decisions are dependent on the facts and circumstances of each case. A reasonable estimate of the current range is $134,500 to $265,100.

You may also be eligible to participate in a discretionary annual incentive program, subject to the rules governing the program, whereby an award, if any, depends on various factors, including, without limitation, individual and organizational performance.


Qualifications:

At Deloitte, Forward Deployed Engineers (FDE) don't just build AI solutions, they help clients turn AI ambition into enterprise-scale impact, pairing leading class engineering with pod-based delivery and vertical expertise. If you thrive at the intersection of product, engineering, problem-solving, and client impact, this role puts you at the forefront of AI transformations.

Work you'll do

As an Agentic AI FDE, you will design, build, and operationalize LLM-powered systems that can reason, plan, retrieve information, use tools, and execute multi-step workflows reliably. You will work on the "thinking layer" of AI systems: agent architecture, tool orchestration, memory and context management, retrieval pipelines, evaluation, and observability. You will help shape how complex domain knowledge is transformed into usable AI behavior, with a high bar for precision, traceability, and maintainability. Additional responsibilities include:

Client Engagement

Embed with clients to identify business needs and translate high-value GenAI use cases into solutions.

Partner with leaders, product owners, architects, and engineers to align priorities and delivery.

Lead working sessions to shape solutions and drive client outcomes.

Prototype and deliver working AI solutions using industry expertise and emerging capabilities.

Contribute independently within an FDE pod while mentoring newer team members.

Solution Engineering

Build AI-enabled solutions, agentic platforms, and workflows across enterprise AI platforms.

Develop scalable AI engineering patterns, tool-use approaches, and human-in-the-loop controls.

Apply architecture decisions that balance quality, safety, latency, cost, and model risk.

Deliver production-quality code using strong practices in testing, CI/CD, logging, versioning, and documentation.

Design extensible functionality, support sprint sizing, and align solutions with senior team members.

Contribute reusable assets including code, prompt libraries, runbooks, and reference implementations.

Strong understanding of memory and context management, including context windows, retrieval-driven context assembly, persistent memory, and high-signal token selection.

Deep understanding of how LLMs behave in practice, including strengths, failure modes, hallucination risks, reasoning limitations, latency/cost trade-offs, and evaluation methods. 

Experience with Python and modern software engineering practices, including testing, CI/CD, version control, and API integration.

The team

AI & Engineering leverages cutting-edge engineering capabilities to build, deploy, and operate integrated/verticalized sector solutions in software, data, AI, network, and hybrid cloud infrastructure. These solutions are powered by engineering for business advantage, transforming mission-critical operations. We enable clients to stay ahead with the latest advancements by transforming engineering teams and modernizing technology & data platforms. Our delivery models are tailored to meet each client's unique requirements.

Required qualifications

Bachelor's degree (or equivalent) in Computer Science, Data Science or Engineering.

3+ years of experience in software engineering, data engineering, data science, or analytics engineering.

1+ years of hands-on experience building production-grade applications with LLMs, including prompt design, tool use, structured outputs, error handling, and model behavior tuning.

1+ years of experience with LangChain and especially LangGraph for orchestrating complex LLM workflows and agent behavior.

1+ years of experience designing and optimizing RAG systems end to end, including indexing, retrieval, reranking, grounding, and evaluation.

1+ years of experience with memory and context management, including context windows, retrieval-driven context assembly, persistent memory, and high-signal token selection.

Deep understanding of how LLMs behave in practice, including strengths, failure modes, hallucination risks, reasoning limitations, latency/cost trade-offs, and evaluation methods.

2+ years of experience with Python and modern software engineering practices, including testing, CI/CD, version control, and API integration.

1+ years of hands-on experience building and deploying GenAI/LLM-powered solutions in client or production environments

1+ years of experience leading project workstreams/engagements and translating business problems into AI solutions

1+ years of experience building reliable, maintainable, and well-documented code

Ability to travel 50%, on average, based on the work you do and the clients and industries/sectors you serve

Limited immigration sponsorship may be available

Preferred qualifications

Experience with cloud environments (AWS, Azure, and/or Google Cloud) and common platform services (storage, compute, IAM, networking)

Demonstrated ability to work directly alongside client technical teams and program stakeholders in fast-paced, ambiguous delivery environments

Data engineering experience with Spark, Airflow/dbt, streaming, data modeling or ML/data science background feature engineering, experimentation or model evaluation

Experience with MLOps/LLMOps practices: evaluation frameworks, model monitoring, and prompt management

Experience integrating LLM solutions with enterprise systems via APIs, microservices, or event-driven architectures

Strong understanding of memory and context management, including context windows, retrieval-driven context assembly, persistent memory, and high-signal token selection. 

Deep understanding of how LLMs behave in practice, including strengths, failure modes, hallucination risks, reasoning limitations, latency/cost trade-offs, and evaluation methods. 

Experience with Python and modern software engineering practices, including testing, CI/CD, version control, and API integration.

Experience operating within hybrid onshore/offshore teams

Familiarity with security, privacy, and compliance considerations

The wage range for this role takes into account the wide range of factors that are considered in making compensation decisions including but not limited to skill sets; experience and training; licensure and certifications; and other business and organizational needs. The disclosed range estimate has not been adjusted for the applicable geographic differential associated with the location at which the position may be filled. At Deloitte, it is not typical for an individual to be hired at or near the top of the range for their role and compensation decisions are dependent on the facts and circumstances of each case. A reasonable estimate of the current range is $134,500 to $265,100.

You may also be eligible to participate in a discretionary annual incentive program, subject to the rules governing the program, whereby an award, if any, depends on various factors, including, without limitation, individual and organizational performance.


Education:Bachelor's DegreeEmployment Type:

What Deloitte employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom