Feast or Tecton, point-in-time joins, online/offline consistency * Vector databases: Pgvector or Qdrant in production -- index tuning, ANN search, embedding upsert pipelines * RAG data fundamentals ...
Quick apply
Feast or Tecton, point-in-time joins, online/offline consistency * Vector databases: Pgvector or Qdrant in production -- index tuning, ANN search, embedding upsert pipelines * RAG data fundamentals ...
Quick apply
Feast or Tecton, point-in-time joins, online/offline consistency * Vector databases: Pgvector or Qdrant in production -- index tuning, ANN search, embedding upsert pipelines * RAG data fundamentals ...
New York, NY · On-site
$63 - $84.50/hr
Feast, Tecton) and their role in supporting both real-time and batch ML use cases * Experience with ML observability tooling, including drift detection, prediction monitoring, feature freshness ...
New York, NY · On-site
$63 - $84.50/hr
Feast, Tecton) and their role in supporting both real-time and batch ML use cases * Experience with ML observability tooling, including drift detection, prediction monitoring, feature freshness ...
$108K - $147K/yr
Experience implementing or managing a Feature Store (e.g., Feast, Tecton). * Familiarity with Data Versioning Control tools (e.g., DVC, LakeFS). * Published research or conference papers in data ...
$108K - $147K/yr
Experience implementing or managing a Feature Store (e.g., Feast, Tecton). * Familiarity with Data Versioning Control tools (e.g., DVC, LakeFS). * Published research or conference papers in data ...
Zionsville, IN · On-site
Experience with feature stores (SageMaker Feature Store, Feast, Tecton, or similar); designing feature pipelines for both batch and real-time serving * Experiment Tracking & Registry: MLflow, Weights ...
Zionsville, IN · On-site
Experience with feature stores (SageMaker Feature Store, Feast, Tecton, or similar); designing feature pipelines for both batch and real-time serving * Experiment Tracking & Registry: MLflow, Weights ...
Tampa, FL · On-site
$141K - $212K/yr
Experience in building CI CD pipeline and single click deployment -Tecton, Harness, Lightspeed, Openshift * Ability to drive engineering deliveries and handle multiple concurrent initiatives.
Tampa, FL · On-site
$141K - $212K/yr
Experience in building CI CD pipeline and single click deployment -Tecton, Harness, Lightspeed, Openshift * Ability to drive engineering deliveries and handle multiple concurrent initiatives.
OR · Remote
$105K - $143K/yr
Experience implementing or managing a Feature Store (e.g., Feast, Tecton). * Familiarity with Data Versioning Control tools (e.g., DVC, LakeFS). * Published research or conference papers in data ...
OR · Remote
$105K - $143K/yr
Experience implementing or managing a Feature Store (e.g., Feast, Tecton). * Familiarity with Data Versioning Control tools (e.g., DVC, LakeFS). * Published research or conference papers in data ...
Mountain View, CA · On-site
$194K - $262K/yr
Strong software engineering fundamentals: experience contributing to and maintaining shared ML libraries, feature stores, or feature engineering frameworks (e.g., featlib, feat-layer, Feast, Tecton ...
Mountain View, CA · On-site
$194K - $262K/yr
Strong software engineering fundamentals: experience contributing to and maintaining shared ML libraries, feature stores, or feature engineering frameworks (e.g., featlib, feat-layer, Feast, Tecton ...
El Segundo, CA · On-site
$143K - $192K/yr
Feast or Tecton, point-in-time joins, online/offline consistency * Vector databases: Pgvector or Qdrant in production - index tuning, ANN search, embedding upsert pipelines * RAG data fundamentals ...
El Segundo, CA · On-site
$143K - $192K/yr
Feast or Tecton, point-in-time joins, online/offline consistency * Vector databases: Pgvector or Qdrant in production - index tuning, ANN search, embedding upsert pipelines * RAG data fundamentals ...
San Mateo, CA · On-site
$123K - $168K/yr
Experience with feature stores (Feast, Tecton, or custom) * Experience with real-time / streaming feature engineering * Experience with LLM-augmented retrieval or hybrid retrieval architectures * E ...
San Mateo, CA · On-site
$123K - $168K/yr
Experience with feature stores (Feast, Tecton, or custom) * Experience with real-time / streaming feature engineering * Experience with LLM-augmented retrieval or hybrid retrieval architectures * E ...
San Mateo, CA · On-site +1
$123K - $168K/yr
Experience with feature stores (Feast, Tecton, or custom) * Experience with real-time / streaming feature engineering * Experience with LLM-augmented retrieval or hybrid retrieval architectures * E ...
San Mateo, CA · On-site +1
$123K - $168K/yr
Experience with feature stores (Feast, Tecton, or custom) * Experience with real-time / streaming feature engineering * Experience with LLM-augmented retrieval or hybrid retrieval architectures * E ...
El Segundo, CA · On-site
$143K - $192K/yr
Feast or Tecton, point-in-time joins, online/offline consistency * Vector databases: Pgvector or Qdrant in production - index tuning, ANN search, embedding upsert pipelines * RAG data fundamentals ...
El Segundo, CA · On-site
$143K - $192K/yr
Feast or Tecton, point-in-time joins, online/offline consistency * Vector databases: Pgvector or Qdrant in production - index tuning, ANN search, embedding upsert pipelines * RAG data fundamentals ...
$119K - $163K/yr
Exposure to feature stores (Feast, Tecton) and workflow tools (Airflow, Argo). * Familiarity with regulatory considerations (model auditability, interpretability, data privacy laws such as CCPA/GDPR)
$119K - $163K/yr
Exposure to feature stores (Feast, Tecton) and workflow tools (Airflow, Argo). * Familiarity with regulatory considerations (model auditability, interpretability, data privacy laws such as CCPA/GDPR)
East Hanover, NJ · Hybrid
$194K - $361K/yr
Expert knowledge of feature store technologies (Feast, Tecton, SageMaker Feature Store, Databricks Feature Store). * Deep expertise in modern data platforms optimized for ML workloads (Databricks ...
East Hanover, NJ · Hybrid
$194K - $361K/yr
Expert knowledge of feature store technologies (Feast, Tecton, SageMaker Feature Store, Databricks Feature Store). * Deep expertise in modern data platforms optimized for ML workloads (Databricks ...
East Hanover, NJ · Hybrid
$194K - $361K/yr
Expert knowledge of feature store technologies (Feast, Tecton, SageMaker Feature Store, Databricks Feature Store). * Deep expertise in modern data platforms optimized for ML workloads (Databricks ...
East Hanover, NJ · Hybrid
$194K - $361K/yr
Expert knowledge of feature store technologies (Feast, Tecton, SageMaker Feature Store, Databricks Feature Store). * Deep expertise in modern data platforms optimized for ML workloads (Databricks ...
Zionsville, IN · On-site
Experience with feature stores (SageMaker Feature Store, Feast, Tecton, or similar); designing feature pipelines for both batch and real-time serving * Experiment Tracking & Registry: MLflow, Weights ...
Zionsville, IN · On-site
Experience with feature stores (SageMaker Feature Store, Feast, Tecton, or similar); designing feature pipelines for both batch and real-time serving * Experiment Tracking & Registry: MLflow, Weights ...
| Aspect | Tecton | Data Engineer |
|---|---|---|
| Primary Role | Builds and manages feature stores for machine learning models | Designs, develops, and maintains data pipelines and infrastructure |
| Skills & Certifications | Machine learning, data engineering, cloud platforms, SQL | Data pipeline tools, SQL, Python, cloud services |
| Work Environment | Collaborates with data scientists and ML teams | Works with data engineers, analysts, and software teams |
| Industry Usage | Used in organizations deploying ML models | Used across data-driven companies for data infrastructure |
While both Tecton and Data Engineers work with data infrastructure, Tecton specializes in building feature stores for machine learning applications, whereas Data Engineers focus on creating data pipelines and managing data infrastructure for various business needs. The roles often overlap but serve different core functions within data teams.

$143K - $192K/yr
Full-time
Posted just now
ABOUT SAVIYNT
Saviynt is a leader in identity security, delivering an AI-powered platform that governs and secures access to applications, data, and business processes for global enterprises and government institutions. Built for the AI era, Saviynt helps organizations move faster — securely and compliantly.
ABOUT THE ROLE
You set the architectural direction for how training data flows, evolves, and is governed across the AI Platform. You define the standards ML engineers and scientists build on, and ensure every training signal is tenant-isolated, PII-free, and traceable from source to model.
WHAT YOU'LL OWN
AI Data Lake on GCS: bucket layout, raw → silver → gold tier separation, CMEK encryption, lifecycle rules
Batch pipelines: Spark on Dataproc for TB-scale feature backfills, Iceberg compaction, and daily S3→GCS incremental sync
Streaming pipelines: Apache Beam on Dataflow for sub-5-min CDC ingestion with exactly-once semantics and PII assertion gates
Schema registry: Avro / Protobuf schema versioning, compatibility modes, and migration playbooks for safe schema evolution
Orchestration: Flyte as primary DAG layer — task authoring standards, domain isolation, retry policies, DataCatalog memoization; evaluate Kubeflow Pipelines where relevant
Multi-tenancy: strict per-tenant GCS prefix isolation, quota policies, and cross-tenant contamination validation
Data Anonymizer and Data Labeler microservices: strip PII and attach ML labels before signals leave each customer environment
Feature store: Feast offline (GCS Parquet) and online (Redis) with point-in-time correctness and < 0.1% consistency SLA
Vector database: operate Pgvector (Cloud SQL) for POC and Qdrant on GKE for production-scale embedding storage; design index strategies (IVFFlat, HNSW) and manage ANN query latency SLAs
RAG data pipeline: build embedding generation pipelines that chunk, encode, and upsert document embeddings into the vector store; own the data refresh cadence and staleness SLAs for retrieval context
Service APIs: expose data platform services (feature serving, embedding upsert, schema validation) over HTTPS with mTLS and gRPC where low-latency streaming is required
Synthetic data pipelines for dev/staging where real customer data is not permitted
Data quality gates: Great Expectations / dbt checks as Flyte tasks, blocking on schema and PII-absence failures
YOU'LL THRIVE HERE IF YOU HAVE
8+ years of data engineering at production scale across multiple companies
Demonstrated principal impact: platform standards you defined adopted org-wide, or major cross-team pipeline/schema migrations you led
Data lake ownership (essential): you have designed and operated a production data lake end-to-end — storage layout, partitioning strategy, tiered retention (hot/warm/cold), table format (Iceberg or Delta Lake), compaction, and access control; not just consumed one
Deep Spark (PySpark / Scala): executor tuning, shuffle diagnosis, Iceberg table maintenance
Hands-on Beam / Dataflow: windowing, exactly-once, side inputs, autoscaling
Schema registry experience: Protobuf / Avro compatibility rules, breaking-change migrations in production
Orchestration at scale: Flyte, Kubeflow Pipelines, Airflow, or Prefect — operated in production, ideally benchmarked two
Multi-tenant data architecture: per-tenant isolation as a hard requirement, not a post-hoc concern
Feature store operations: Feast or Tecton, point-in-time joins, online/offline consistency
Vector databases: Pgvector or Qdrant in production — index tuning, ANN search, embedding upsert pipelines
RAG data fundamentals: chunking strategies, embedding model selection, retrieval quality evaluation, and context freshness management
API transport: gRPC and HTTPS/mTLS for service-to-service communication; comfortable defining proto contracts and managing certificate lifecycle
Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience or equivalent military experience
NICE TO HAVE
Differential privacy or k-anonymity for ML training datasets
Open source contributions: Feast, Great Expectations, Apache Beam, or dbt
Familiarity with IAM / access governance data: entitlements, provisioning events, access graphs
Iceberg or Delta Lake at petabyte scale
WHY JOIN SAVIYNT
Work on a large-scale, Kubernetes-based SaaS platform
Solve challenging cloud and reliability problems at scale
Collaborate with strong engineers in a reliability-focused culture
Competitive compensation, benefits, and growth opportunities
SECURITY & COMPLIANCE
This role requires adherence to Saviynt's information security and privacy policies, including annual security training.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses and identifying potential inconsistencies or verification signals in application materials based on available information. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
Sourced by ZipRecruiter
Software development
501 - 1,000 Employees
Los Angeles, CA, US
2010