Job Summary:
RevSpring, a company focused on providing innovative data solutions, is seeking a Senior Software Engineer specializing in Big Data and data foundations. The role involves designing and optimizing data pipelines, developing backend services, and ensuring data performance and quality in a healthcare context.
Responsibilities:
• Collaborate and Innovate: Partner with product managers, data engineers, and business leaders to translate complex product and data requirements into scalable, reliable data pipelines and the search experiences they power.
• Architect Data Pipelines: Design, build, and optimize large-scale distributed batch and streaming pipelines (using Apache Airflow, Apache Beam/Dataflow, and DBT on BigQuery) to ingest, model, and transform high-volume healthcare data into clean, well-tested, query-ready datasets and search indices.
• Build Data Models & Backend Services: Develop resilient Python services and DBT models that power data delivery and self-service analytics, including Model Context Protocol (MCP) servers that expose curated data and tooling to downstream and AI consumers, and integrate with external REST/SOAP APIs and third-party data sources.
• Optimize Data & Search Performance: Deeply tune pipeline throughput, data warehouse performance, and search indexing, optimizing BigQuery cost and query performance and Elasticsearch index design to ensure data freshness, relevance, and scalability across high-volume datasets.
• Drive Engineering Excellence: Write clean, maintainable, well-tested code and lead by example through rigorous code reviews, architectural and data-modeling design discussions, and mentoring, driving a culture of high-quality software and trustworthy data.
• Pioneer New Technologies: Stay at the forefront of modern data engineering, the analytics-engineering ecosystem (e.g., DBT, BigQuery), and information retrieval, proactively applying these advancements to strengthen our data platform and the products it powers.
Qualifications:
Required:
• Proven experience designing and orchestrating large-scale ETL/ELT pipelines using Apache Beam/Google Cloud Dataflow (or similar) and DBT, built on modern cloud data warehouses.
• 4+ years of experience working with relational databases and analytical data warehouses, with deep, advanced SQL skills and solid data-modeling fundamentals (e.g., dimensional and normalized modeling).
• Working experience with search indexing and Elasticsearch, including index management, mappings, and building and maintaining search indices from pipeline output.
• Experience building scalable Python services and high-performance data APIs, including developing Model Context Protocol (MCP) servers that expose data and tooling to downstream and AI consumers.
• Strong understanding of containerization (Docker), CI/CD methodologies (e.g., GitHub Actions), Git, Infrastructure as Code (e.g., Terraform/Pulumi), and managing services within cloud platforms.
• Familiarity with healthcare data standards (e.g., NPPES/NPI registries, NUCC Provider Taxonomy, machine-readable files (MRFs) for cost transparency, and FHIR).
• Experience with data quality and pipeline testing frameworks (e.g., dbt tests, Great Expectations) and streaming/event ingestion (e.g., Pub/Sub, Kafka).
• Experience integrating graph-based data and healthcare taxonomy ontologies to enrich datasets and search query context.
• Experience with observability and logging platforms (e.g., DataDog) for monitoring pipeline health and data freshness.
• Bachelor's Degree
• 5+ years of professional experience with Python, with strong software-engineering fundamentals (testing, code review, design).
• 3+ years of experience with Java or another JVM language is also highly desired, particularly for Beam/Dataflow.
• Ability to read, analyze, and interpret general business periodicals, professional journals, technical procedures, or governmental regulations.
• Ability to write reports, business correspondence, and procedure manuals.
• Ability to effectively present information and respond to questions from a variety of both internal and external sources.
Preferred:
• BigQuery experience is a plus.
• Familiarity with hybrid (BM25 + semantic/vector) search is a plus.
• 3+ years of GCP experience preferred.
Company:
RevSpring is a provider of revenue cycle technology services offering data analytics and multi-channel customer communications. Founded in 1997, the company is headquartered in Wixom, USA, with a team of 501-1000 employees. The company is currently late stage.