Job Summary:
Waystar is a company dedicated to simplifying healthcare payments through innovative technology. They are seeking a Senior Machine Learning Engineer to develop robust AI systems utilizing Language Models and agentic architectures, focusing on the entire ML pipeline from data extraction to deployment.
Responsibilities:
• Design, implement, and optimize robust pipelines for ingesting, parsing, and extracting structured information from complex documents (leveraging OCR, document layout analysis, Named Entity Recognition (NER), and Relationship Extraction (RE)).
• Develop rich, nested JSON schemas for representing structured data and ensure scalable storage.
• Generate and manage high-quality vector embeddings for efficient retrieval-augmented generation (RAG) within a Vector Database.
• Research, select, and experiment with appropriate open-source Language Models (Large & Small) (e.g., Phi-3, Mistral, Llama, Nemotron-H families) for specialized tasks.
• Design and execute efficient fine-tuning strategies (e.g., LoRA, QLoRA, full fine-tuning) on curated, domain-specific datasets to achieve precise performance for tasks like coverage determination, code lookups, and policy rule application.
• Explore and implement knowledge distillation techniques to transfer capabilities from larger models to smaller, more efficient LMs.
• Build and maintain the core agentic framework, including the orchestrator that intelligently routes queries and coordinates interactions between various specialized LM tools.
• Develop and integrate "tools" (specialized LMs and external APIs) that perform atomic medical necessity tasks, ensuring strict behavioral alignment and structured outputs.
• Deploy, manage, and monitor LMs and agentic components on Google Cloud Platform (GCP) using services like Vertex AI, GKE, Cloud Functions, and Cloud Run.
• Implement robust MLOps practices for continuous integration, continuous delivery (CI/CD), model versioning, and performance monitoring (latency, throughput, accuracy).
• Establish effective feedback loops from end-user interactions and system logs to identify areas for model improvement.
• Curate and expand training datasets, ensuring data privacy (PHI/PII masking) and legal compliance.
• Stay abreast of the latest research in LMs, agentic AI, NLP, and document understanding, applying relevant advancements to our system.
• Work closely with subject matter experts, product managers, and other engineers to translate complex requirements into technical solutions and evaluate system performance.
Qualifications:
Required:
• Bachelor's or Master's degree in Computer Science, Machine Learning, Artificial Intelligence, or a related quantitative field.
• 3+ years of professional experience in Machine Learning Engineering, with a strong focus on NLP.
• Proven experience with Language Models (LMs), including model selection, fine-tuning, and deployment.
• Strong proficiency in Python and familiarity with ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face Transformers).
• Solid understanding and hands-on experience with core NLP techniques and architectures, especially Transformers.
• Experience with cloud platforms, particularly Google Cloud Platform (GCP), including services like Vertex AI, Cloud Storage, and compute services.
• Familiarity with MLOps principles and tools for model serving, monitoring, and pipeline automation.
• Excellent problem-solving skills, attention to detail, and ability to work independently and collaboratively.
• Active use of artificial intelligence (AI) tools, techniques, and platforms to streamline workflows, enhance performance, improve decision-making, and drive innovation across business functions.
• Curiosity and adaptability in exploring emerging AI technologies, with a mindset for continuous learning and experimentation.
Preferred:
• Hands-on experience building or contributing to agentic AI systems or multi-agent frameworks.
• Direct experience with document processing technologies such as OCR, layout parsing, Document AI, or custom information extraction from unstructured text.
• Experience with Vector Databases (e.g., pgvector, Pinecone, Weaviate, Qdrant) and RAG architectures.
• Exposure to the healthcare domain, particularly understanding medical terminology, CPT/ICD codes, or regulatory documents.
Company:
Waystar is a technology platform that provides healthcare revenue cycle management solutions. Founded in 2017, the company is headquartered in Louisville, USA, and has a team of 1,001 to 5,000 employees. It is currently a late-stage company.