Job Description Data Architect Location: Menlo Park, CA (Remote) Experience: 12+ Years Job Description Role Overview: We are seeking a visionary to architect a Self-Healing, Autonomous Data Fabric. You will replace legacy ETL with a nervous system where metadata is active, governance is computational, and data sharing is zero-copy. Mandatory Skills: - Active Metadata: Experience building closed-loop automation (e.g., metadata-triggered autonomous schema repair)
- Semantic Engineering: Mastery of RDF, OWL, and SHACL for ontology-first modeling and SPARQL reasoning. - Production-level Open Policy Agent (OPA)/ Policy-as-Code(Zero-Trust) for dynamic, context-aware access control. Other Technical Skills: - Advanced Privacy: Implementation of Homomorphic Encryption (FHE) or SMPC for analytics on encrypted PII.
- Zero-Copy Architecture: Expertise in Delta Sharing for cross-cloud analytics without egress. - Compute: Trino (GraalVM), StarRocks, DuckDB (WASM). - Orchestration: Dagster, Airflow (Provider-level).
- Semantic Layer: Stardog, Apache Jena, GraphQL Federation. - System Languages: Rust, Clojure, or Java. Note Education M.S./Ph.D
in Computer Science (Formal Methods/Logic) or Computational Mathematics.