AI Application Engineer
We are currently seeking an AI Application Engineer to join our team in Santa Clara, California (US-CA). AI Application Engineer to support the development and delivery of next-generation AI-powered applications built on client infrastructure. This role will focus on production-grade LLM application engineering, RAG quality, prompt engineering, AI safety, and orchestration of complex multi-step AI pipelines.
Responsibilities:
- Design, develop, and optimize production-grade LLM-powered applications.
- Own AI quality, RAG accuracy, prompt engineering, and AI safety across multiple applications.
- Develop and maintain multi-step LLM orchestration pipelines using LangChain, LlamaIndex, or custom frameworks.
- Implement and optimize RAG pipelines including chunking strategies, embedding selection, reranking, and hybrid search.
- Design multi-turn conversational AI experiences with context management and session memory
- Integrate NVIDIA technologies including NIM, NeMo, NeMoGuardrails, and Riva into enterprise AI applications.
- Build automated evaluation pipelines for model quality, hallucination detection, regression testing, and release gating.
- Perform latency profiling and optimization across multi-step LLM call chains.
- Implement AI safety guardrails including prompt injection prevention, jailbreak mitigation, and topical control.
- Collaborate with globally distributed engineering and product teams to deliver scalable AI solutions.
- Support deployment, monitoring, and continuous improvement of AI applications in production environments.
Qualifications:
- 4+ years of software engineering experience with at least 2 years focused on production LLM application development.
- 4+ years of experience with Python for AI/ML application development and async programming.
- 3+ years of experience with multi-step LLM orchestration frameworks such as LangChain or LlamaIndex.
- 3+ Years of Experience designing and optimizing RAG pipelines and retrieval systems.
- 3+ Years of Experience with vector databases, similarity search tuning, and reranking techniques.
Position can pay between 130-170K (USD) range annually depending on skills match & suitability.
About NTT DATA:
NTT DATA is a $30 billion business and technology services leader, serving 75% of the Fortune Global 100. We are committed to accelerating client success and positively impacting society through responsible innovation. We are one of the world's leading AI and digital infrastructure providers, with unmatched capabilities in enterprise-scale AI, cloud, security, connectivity, data centers and application services. Our consulting and industry solutions help organizations and society move confidently and sustainably into the digital future. As a Global Top Employer, we have experts in more than 50 countries. We also offer clients access to a robust ecosystem of innovation centers as well as established and start-up partners. NTT DATA is a part of NTT Group, which invests over $3 billion each year in R&D. Whenever possible, we hire locally to NTT DATA offices or client sites. This ensures we can provide timely and effective support tailored to each client's needs. While many positions offer remote or hybrid work options, these arrangements are subject to change based on client requirements. For employees near an NTT DATA office or client site, in-office attendance may be required for meetings or events, depending on business needs. At NTT DATA, we are committed to staying flexible and meeting the evolving needs of both our clients and employees.