1

Data Streaming Jobs in Texas (NOW HIRING)

Data Engineer - Dallas, TX

Dallas, TX

$113K - $136K/yr

Real-time Data Streaming: Build low-latency data streams (using Kafka or Flink) to provide agents with "fresh" data, enabling them to act on real-time market or operational changes. * Evaluation ...

AWS Data Architect (Remote)

Irving, TX · On-site +1

$62.25 - $81.50/hr

Design, implement and evolve event driven and real time data streaming architectures. * Define and enforce best practices for data access, storage and processing across Analytical and operational use ...

Real-time Data Streaming: Build low-latency data streams (using Kafka or Flink) to provide agents with "fresh" data, enabling them to act on real-time market or operational changes. * Evaluation ...

Real-time Data Streaming: Build low-latency data streams (using Kafka or Flink) to provide agents with "fresh" data, enabling them to act on real-time market or operational changes. * Evaluation ...

Design and lead real-time data streaming solutions leveraging. * Apache Flink on AWS Cloud, enabling scalable, low-latency, and fault-tolerant data processing platforms for business-critical use ...

Data Architect | Onsite | Dallas/Charlotte

Dallas, TX · On-site

$63 - $81/hr

Enable real-time analytics through integration of data streaming and event-driven architectures. * Support AI/ML model development by providing high-quality, accessible datasets. Collaboration and ...

Staff Software Engineer I

Austin, TX · On-site

$235K - $277K/yr

One Data Streaming Platform. About the Role: The Infrastructure team at Confluent is responsible for building and operating the foundation that powers Confluent Cloud. Our mission is to design ...

Renewals Manager

Austin, TX · On-site

$105K/yr

Built on a multi-modal data streaming engine, Redpanda empowers agentic applications that reason and act in real-time with speed, autonomy, and precision. Global leaders including Activision Blizzard ...

React Developer

Houston, TX

$99K - $115K/yr

Real-Time Streaming: Build and optimize UI components driven by continuous, high-throughput data streams via WebSockets and internal real-time data services. * Performance Engineering: Optimize ...

next page

Showing results 1-20

Data Streaming information

What are typical daily responsibilities for someone working in Data Streaming?

Professionals in Data Streaming typically design, develop, and maintain real-time data pipelines, ensuring continuous and reliable data flow across systems. They monitor the health of streaming platforms, troubleshoot latency or processing issues, and optimize performance for scalability. Collaboration with data engineering, analytics, and DevOps teams is common to ensure integration and alignment with business needs. Regular activities may also involve updating system configurations, deploying new features, and maintaining data security and compliance throughout the streaming infrastructure.

What are the key skills and qualifications needed to thrive in the Data Streaming position, and why are they important?

To excel in Data Streaming roles, candidates need a solid understanding of distributed computing, real-time data processing, and programming in languages such as Java, Scala, or Python. Experience with tools like Apache Kafka, Apache Flink, Spark Streaming, and relevant cloud platforms, along with certifications such as Confluent Certified Developer, are highly valuable. Strong analytical thinking, problem-solving skills, and the ability to communicate complex concepts clearly are important soft skills. Mastery of these skills enables professionals to efficiently handle large-scale streaming data workloads and ensure reliable, low-latency data pipelines in dynamic business environments.

What is a Data Streaming job?

A Data Streaming job involves managing real-time data pipelines to process, analyze, and deliver continuous data flows. Professionals in this role work with technologies like Apache Kafka, Apache Flink, or Spark Streaming to ensure seamless data movement and low-latency processing. They design scalable architectures that handle large volumes of events for use cases like real-time analytics, monitoring, and AI-driven decision-making.

What are the most commonly searched types of Data Streaming jobs in Texas? The most popular types of Data Streaming jobs in Texas are:
What cities in Texas are hiring for Data Streaming jobs? Cities in Texas with the most Data Streaming job openings:
Infographic showing various Data Streaming job openings in Texas as of June 2026, with employment types broken down into 62% Full Time, and 38% Contract. Highlights an 75% In-person, and 25% Remote job distribution.
Data Engineer - Dallas, TX

Data Engineer - Dallas, TX

Photon

Dallas, TX

$113K - $136K/yr

Other

Medical, Dental, Vision, Retirement, PTO

Posted 6 days ago


Job description

We are seeking a Data Engineer who will be responsible for the "Ingestion-to-Insight" pipeline that allows autonomous agents to access, search, and reason over vast amounts of proprietary and public data.

Your role is critical: you will design the RAG (Retrieval-Augmented Generation) architectures and data pipelines that ensure our agents have the right context at the right time to make accurate decisions.

Key Responsibilities

  • AI-Ready Data Pipelines: Design and implement scalable ETL/ELT pipelines that process both structured (SQL, logs) and unstructured (PDFs, emails, docs) data specifically for LLM consumption.
  • Vector Database Management: Architect and optimize Vector Databases (e.g., Pinecone, Weaviate, Milvus, or Qdrant) to ensure high-speed, relevant similarity searches for agentic retrieval.
  • Chunking & Embedding Strategies: Collaborate with AI Engineers to optimize data chunking strategies and embedding models to improve the "recall" and "precision" of the agent's knowledge retrieval.
  • Data Quality for AI: Develop automated "Data Cleaning" workflows to remove noise, PII (Personally Identifiable Information), and toxicity from training/context datasets.
  • Metadata Engineering: Enrich raw data with advanced metadata tagging to help agents filter and prioritize information during multi-step reasoning tasks.
  • Real-time Data Streaming: Build low-latency data streams (using Kafka or Flink) to provide agents with "fresh" data, enabling them to act on real-time market or operational changes.
  • Evaluation Frameworks: Construct "Gold Datasets" and versioned data snapshots to help the team benchmark agent performance over time.

Required Skills & Qualifications

  • Experience: 4+ years in Data Engineering, with at least 1 year focusing on data for LLMs or AI/ML applications.
  • Python Mastery: Deep expertise in Python (Pandas, Pydantic, FastAPI) for data manipulation and API integration.
  • Data Tooling: Strong experience with modern data stack tools (e.g., dbt, Airflow, Dagster, Snowflake, or Databricks).
  • Vector Expertise: Hands-on experience with at least one major Vector Database and knowledge of similarity search algorithms (HNSW, Cosine Similarity).
  • Search Knowledge: Familiarity with hybrid search techniques (combining semantic search with traditional keyword search like Elasticsearch/BM25).
  • Cloud Infrastructure: Proficiency in managing data workloads on AWS, Azure, or GCP.

Preferred Qualifications

  • Experience with LlamaIndex or LangChain for data ingestion.
  • Knowledge of Graph Databases (e.g., Neo4j) to help agents understand complex relationships between data points.
  • Familiarity with "Data-Centric AI" principles-prioritizing data quality over model size.

Compensation, Benefits and Duration

Minimum Compensation: USD  38,000
Maximum Compensation: USD 133,000
Compensation is based on actual experience and qualifications of the candidate. The above is a reasonable and a good faith estimate for the role.
Medical, vision, and dental benefits, 401k retirement plan, variable pay/incentives, paid time off, and paid holidays are available for full time employees.
This position is not available for independent contractors
No applications will be considered if received more than 120 days after the date of this post