Senior AI/ML Platform Engineer
Position Overview
We are looking for an experienced Senior AI/ML Platform Engineer to lead the development and deployment of scalable artificial intelligence and machine learning solutions. This role is ideal for a hands-on engineer who enjoys building production-ready systems, solving complex technical challenges, and delivering AI-driven capabilities across cloud-native environments.
The successful candidate will bring a strong software engineering background, deep expertise in machine learning platforms, and practical experience operationalizing AI solutions from concept to production.
Location: Vienna VA (We will consider Remote candidates within US Mainland on EST)
Required Qualifications
- Must be a US Citizen/ Permanent Resident able to be employed as a Fulltime Employee. Due to the nature of the contract we cannot work any with layers / vendors or offer this role to anyone who needs sponsorships or has a temporary work visa.
- Bachelor's degree in Computer Science, Engineering, or a related technical field (or equivalent experience)
- 10+ years of experience in software engineering, cloud engineering, or related IT disciplines
- Minimum 3 years of hands-on experience developing and implementing AI/ML solutions
- At least 4 years of experience working within AWS environments
- Demonstrated success deploying machine learning models and AI applications into production
- Strong software development skills with the ability to design, build, test, and maintain end-to-end solutions
Technical Expertise
- AI/ML Technologies
- Experience with machine learning and deep learning frameworks such as TensorFlow, PyTorch, and Keras
- Knowledge of Large Language Models (LLMs), prompt engineering techniques, and NLP-based solutions
- Experience designing, training, and optimizing machine learning models for production use
- Software Development
- Strong proficiency in Python and Java
- Experience developing RESTful APIs and ML inference services using FastAPI
- Solid understanding of software engineering best practices, testing, and performance optimization
- Cloud & Infrastructure
- Hands-on expertise with AWS services including Lambda, EC2, S3, DynamoDB, API Gateway, ECS/Fargate, and IoT Core
- Experience implementing Infrastructure as Code using Terraform
- Strong understanding of containerization and orchestration technologies including Docker and Kubernetes
- Knowledge of distributed systems and cloud-native architectures
Additional Experience
- Experience integrating edge devices and cloud-based systems
- Proven ability to build and support scalable, resilient AI platforms
- Strong troubleshooting and system optimization skills
Key Responsibilities
- Design, develop, and deploy enterprise-scale AI and machine learning applications
- Build and maintain robust ML pipelines that support model training, deployment, monitoring, and lifecycle management
- Develop intelligent applications leveraging LLMs, generative AI, and NLP technologies
- Create and support high-performance inference services and APIs for production environments
- Automate infrastructure provisioning and platform management using Terraform and AWS services
- Deploy and manage containerized workloads using Kubernetes and Docker
- Collaborate with product, engineering, and data teams to deliver business-critical AI solutions
- Monitor system performance, troubleshoot issues, and continuously improve reliability and scalability
- Take ownership of solutions throughout the full development lifecycle, from implementation through production support
Preferred Candidate Profile
We are seeking a builder who thrives in a hands-on engineering environment and has a proven track record of delivering production-grade AI/ML systems. The ideal candidate combines strong software engineering fundamentals with practical experience deploying modern AI technologies at scale.