1

Data Annotation Engineer Jobs in Texas (NOW HIRING)

... data extraction from scanned TIF documents. You will architect and implement computer vision ... Collaborate with software engineering teams to integrate trained models into Java/Python-based ...

Delivery Lead

Dallas, TX · Remote

$110K - $140K/yr

... data creation to annotation to delivery. We design and create datasets from scratch, recruit and ... Partner with Product and Engineering to evolve internal tooling, automation, and operational ...

Delivery Lead

Austin, TX · Remote

$110K - $140K/yr

... data creation to annotation to delivery. We design and create datasets from scratch, recruit and ... Partner with Product and Engineering to evolve internal tooling, automation, and operational ...

Construct end-to-end pipelines for large-scale data ingestion, cleaning, annotation, and feature engineering. Leverage internal compliance datasets and proprietary labeling tools to ensure high ...

next page

Showing results 1-20

Data Annotation Engineer information

See Texas salary details

$48K

$137.4K

$183.5K

How much do data annotation engineer jobs pay per year?

As of Jun 21, 2026, the average yearly pay for data annotation engineer in Texas is $137,383.00, according to ZipRecruiter salary data. Most workers in this role earn between $78,300.00 and $182,600.00 per year, depending on experience, location, and employer.

What are the main challenges faced by Data Annotation Engineers in their daily work?

One of the main challenges Data Annotation Engineers face is ensuring consistent accuracy and quality in labeling large and often complex datasets. Attention to detail is critical, as even small errors can significantly affect machine learning model performance. Additionally, engineers must frequently adapt to evolving annotation guidelines and emerging data types, which requires ongoing learning and flexibility. Collaboration with data scientists and project managers is common to clarify requirements and resolve ambiguities, making strong communication skills essential for success.

What are the key skills and qualifications needed to thrive in the Data Annotation Engineer position, and why are they important?

To thrive as a Data Annotation Engineer, you need a strong background in data analysis, attention to detail, and familiarity with annotation processes, often supported by a degree in computer science or a related field. Proficiency with annotation tools like Labelbox, CVAT, or VIA, and understanding of data formats used in machine learning, is commonly required. Excellent communication, collaboration, and organizational skills help you effectively manage projects and cooperate with cross-functional teams. These abilities are crucial for delivering high-quality labeled data, which directly impacts the performance of AI and machine learning models.

Is data annotation real or fake?

Data annotation is a real and essential process in machine learning where human annotators label data such as images, text, or audio to train AI models. Data annotation engineers perform this work using specialized tools and quality standards to ensure accurate and reliable datasets.

What is a data annotation engineer?

A data annotation engineer is a professional responsible for labeling and annotating data, such as images, text, or videos, to train machine learning models. They often use specialized tools and follow guidelines to ensure data quality and accuracy, supporting AI development and data-driven applications.

How hard is it to get a job with data annotation tech?

Getting a job as a Data Annotation Engineer typically requires basic computer skills, attention to detail, and familiarity with annotation tools or platforms. Entry-level positions are often accessible with minimal formal education, but having knowledge of machine learning concepts or experience with data labeling can improve job prospects.

Does data annotation really pay you?

Data annotation engineers are typically paid for their work, often earning hourly wages or project-based fees depending on the employer or platform. Compensation varies based on experience, skill level, and the complexity of annotation tasks, which may involve using tools like labeling software or AI platforms.

What is a Data Annotation Engineer job?

A Data Annotation Engineer is responsible for labeling and annotating data—such as text, images, audio, or video—to train machine learning models. They ensure that data is accurately categorized and structured to improve model performance. This role often involves using specialized annotation tools, following detailed guidelines, and working closely with data scientists and AI teams. Data Annotation Engineers play a crucial role in the development of AI applications by providing high-quality labeled datasets for supervised learning.

What are popular job titles related to Data Annotation Engineer jobs in Texas? For Data Annotation Engineer jobs in Texas, the most frequently searched job titles are:
What job categories do people searching Data Annotation Engineer jobs in Texas look for? The top searched job categories for Data Annotation Engineer jobs in Texas are:
What cities in Texas are hiring for Data Annotation Engineer jobs? Cities in Texas with the most Data Annotation Engineer job openings:
Infographic showing various Data Annotation Engineer job openings in Texas as of June 2026, with employment types broken down into 68% Full Time, 8% Part Time, and 24% Contract. Highlights an 83% In-person, and 17% Remote job distribution, with an average salary of $137,383 per year, or $66 per hour.
Applied AI/ML Lead

Other

Posted 15 days ago


JPMorgan Chase & Co. rating

8.1

Company rating: 8.1 out of 10

Based on 470 frontline employees who took The Breakroom Quiz

46th of 141 rated banks


Job description

Applied AI/ML Lead

As Applied AI/ML Lead within Commercial & Investment Bank with the Healthcare Provider team, you will lead the design, development, and production deployment of AI/ML solutions focused on image classification, text categorization, and data extraction from scanned TIF documents. You will architect and implement computer vision pipelines leveraging CRNN architectures for document type identification, page-level categorization, and visual feature extraction.

Job Responsibilities

  • Lead the design, development, and production deployment of AI/ML solutions focused on image classification, text categorization, and data extraction from scanned TIF documents and evaluate and explore additional models and architectures to continuously improve classification accuracy, extraction quality, and processing efficiency.
  • Drive the development and fine-tuning of models for document understanding, text categorization, named entity recognition, and semantic understanding and combine visual layout information, textual content, and spatial relationships to extract structured data from complex scanned documents, while enabling automated categorization and metadata tagging of OCR-extracted text.
  • Lead the integration and optimization of OCR technology and generative AI capabilities into the document processing pipeline, ensuring high-accuracy text extraction from scanned TIF images across diverse document types, layouts, fonts, and quality levels. Leverage Amazon Bedrock to explore foundation model capabilities for intelligent document understanding, classification, document summarization, and augmenting traditional extraction pipelines.
  • Architect and implement scalable ML training and inference pipelines using AWS SageMaker, managing model training, hyperparameter tuning, distributed training for large vision models, and real-time/batch inference endpoint deployment. Collaborate with software engineering teams to integrate trained models into Java/Python-based microservices deployed on AWS EKS, ensuring low-latency, high-throughput inference for production document processing workloads.
  • Establish robust MLOps practices and annotation workflows, including model versioning, automated retraining triggers, A/B testing of model variants, drift detection on document distributions, and comprehensive performance monitoring dashboards and design and manage labeling strategies for training data, ensuring high-quality ground truth datasets for image classification, text categorization, and document extraction tasks.
  • Build and manage a team of ML engineers and applied scientists, fostering a culture of experimentation, rapid prototyping, and rigorous evaluation of model performance against business KPIs.

Required Qualifications, Capabilities, and Skills

  • Bachelor's degree or MS or PhD in quantitative discipline, e.g. Computer Science, Mathematics, Operations Research, Data Science.
  • 7+ years of experience in applied ML/AI roles with at least 2+ years leading teams or large-scale ML initiatives
  • Advanced proficiency in Python and enterprise languages, with deep experience in PyTorch, TensorFlow, Hugging Face Transformers, OpenCV, and Pillow for model development and image processing. Proficiency in Java and/or Groovy for integrating ML capabilities into backend services and enterprise application ecosystems. Familiarity with Oracle databases for feature extraction, training data retrieval, and integration with ML workflows.
  • Deep expertise in computer vision and NLP models, with hands-on experience implementing and fine-tuning CRNN-based architectures for image classification and feature extraction. Strong experience with multimodal document understanding combining text, layout, and image features. Proficiency in transformer-based NLP models for text categorization, sequence labeling, named entity recognition, and semantic analysis of OCR-extracted content.
  • Practical experience with OCR technologies and image preprocessing, for text extraction from scanned documents, with an understanding of OCR accuracy optimization, preprocessing techniques, and post-processing correction. Experience with image preprocessing for scanned documents in TIF format, including multi-page handling, resolution normalization, deskewing, binarization, and noise removal.
  • Deep hands-on experience with AWS SageMaker and Amazon Bedrock, including end-to-end ML workflows such as training jobs, processing pipelines, model registry, distributed training, and real-time/batch inference endpoints. Practical experience leveraging foundation models, prompt engineering, and building generative AI-augmented document processing solutions. Experience deploying and scaling ML models as containerized microservices on AWS EKS using Docker and Kubernetes, with expertise in optimizing GPU-based inference workloads.
  • Strong knowledge of MLOps tools and practices, including MLflow, SageMaker Pipelines, or equivalent platforms for experiment tracking, pipeline automation, and model lifecycle management. Excellent leadership and communication skills with the ability to present complex technical concepts to senior leadership and non-technical audiences.

Preferred Qualifications, Capabilities, and Skills

  • Domain expertise in the healthcare industry
  • Experience in applied ML/AI roles in document processing, computer vision, or NLP domains

What JPMorgan Chase & Co. employees say

Pay

Benefits

Hours and flexibility

Workplace

Get the full story on Breakroom