... data extraction from scanned TIF documents. You will architect and implement computer vision ... Collaborate with software engineering teams to integrate trained models into Java/Python-based ...
... data extraction from scanned TIF documents. You will architect and implement computer vision ... Collaborate with software engineering teams to integrate trained models into Java/Python-based ...
Delivery Lead
Dallas, TX · Remote
$110K - $140K/yr
... data creation to annotation to delivery. We design and create datasets from scratch, recruit and ... Partner with Product and Engineering to evolve internal tooling, automation, and operational ...
Quick apply
Delivery Lead
Dallas, TX · Remote
$110K - $140K/yr
... data creation to annotation to delivery. We design and create datasets from scratch, recruit and ... Partner with Product and Engineering to evolve internal tooling, automation, and operational ...
Delivery Lead
Austin, TX · Remote
$110K - $140K/yr
... data creation to annotation to delivery. We design and create datasets from scratch, recruit and ... Partner with Product and Engineering to evolve internal tooling, automation, and operational ...
Quick apply
Delivery Lead
Austin, TX · Remote
$110K - $140K/yr
... data creation to annotation to delivery. We design and create datasets from scratch, recruit and ... Partner with Product and Engineering to evolve internal tooling, automation, and operational ...
Rust Software Engineering Intern - Fall
$97K - $132K/yr
You'll work across everything from low-level systems and data synchronization, to high-performance ... annotation, and measurement * Develop production-grade Rust software targeting both native and ...
Rust Software Engineering Intern - Fall
$97K - $132K/yr
You'll work across everything from low-level systems and data synchronization, to high-performance ... annotation, and measurement * Develop production-grade Rust software targeting both native and ...
Linework, layering, and annotation standards * Ability to interpret: * Field notes * Survey data * Engineering and pipeline plans * Strong attention to detail and production accuracy * Effective ...
Quick apply
Linework, layering, and annotation standards * Ability to interpret: * Field notes * Survey data * Engineering and pipeline plans * Strong attention to detail and production accuracy * Effective ...
Linework, layering, and annotation standards * Ability to interpret: * Field notes * Survey data * Engineering and pipeline plans * Strong attention to detail and production accuracy * Effective ...
Linework, layering, and annotation standards * Ability to interpret: * Field notes * Survey data * Engineering and pipeline plans * Strong attention to detail and production accuracy * Effective ...
Linework, layering, and annotation standards * Ability to interpret: * Field notes * Survey data * Engineering and pipeline plans * Strong attention to detail and production accuracy * Effective ...
Linework, layering, and annotation standards * Ability to interpret: * Field notes * Survey data * Engineering and pipeline plans * Strong attention to detail and production accuracy * Effective ...
Strong understanding of responsible AI use in engineering workflows, including data sensitivity ... J2EE , Core Java including Java 17 with annotation, enum, collections, streaming, multithreading ...
Strong understanding of responsible AI use in engineering workflows, including data sensitivity ... J2EE , Core Java including Java 17 with annotation, enum, collections, streaming, multithreading ...
Strong understanding of responsible AI use in engineering workflows, including data sensitivity ... J2EE , Core Java including Java 17 with annotation, enum, collections, streaming, multithreading ...
Strong understanding of responsible AI use in engineering workflows, including data sensitivity ... J2EE , Core Java including Java 17 with annotation, enum, collections, streaming, multithreading ...
Designer (Civil3D)
Austin, TX · On-site +1
Our engineers, scientists, technical and project employees serve our clients from offices across ... Experience using/incorporating land or aerial surveying/mapping data and files. (required)
Designer (Civil3D)
Austin, TX · On-site +1
Our engineers, scientists, technical and project employees serve our clients from offices across ... Experience using/incorporating land or aerial surveying/mapping data and files. (required)
Strong understanding of responsible AI use in engineering workflows, including data sensitivity ... J2EE , Core Java including Java 17 with annotation, enum, collections, streaming, multithreading ...
Strong understanding of responsible AI use in engineering workflows, including data sensitivity ... J2EE , Core Java including Java 17 with annotation, enum, collections, streaming, multithreading ...
Senior Applied Scientist, Document Understanding
$85K - $117K/yr
... annotation and synthetic data * Drive independent technical decisions on chunking strategy ... Partner with engineering on delivery, reliability, and scale across multiple product lines
Senior Applied Scientist, Document Understanding
$85K - $117K/yr
... annotation and synthetic data * Drive independent technical decisions on chunking strategy ... Partner with engineering on delivery, reliability, and scale across multiple product lines
... annotation and synthetic data * Own technical decisions on architecture, chunking strategy, classification approach, and knowledge extraction methods * Partner with engineering on delivery ...
... annotation and synthetic data * Own technical decisions on architecture, chunking strategy, classification approach, and knowledge extraction methods * Partner with engineering on delivery ...
Sr Analog Physical Design Engineer | Kilby Labs
$95K - $130K/yr
... annotation support. 9. Mentor and guide contract layout engineers in India, review their work and ... data center, personal electronics and communications equipment. At our core, we have a passion to ...
Sr Analog Physical Design Engineer | Kilby Labs
$95K - $130K/yr
... annotation support. 9. Mentor and guide contract layout engineers in India, review their work and ... data center, personal electronics and communications equipment. At our core, we have a passion to ...
Sr Analog Physical Design Engineer | Kilby Labs
Dallas, TX · On-site
$95K - $130K/yr
... annotation support. 9. Mentor and guide contract layout engineers in India, review their work and ... data center, personal electronics and communications equipment. At our core, we have a passion to ...
Sr Analog Physical Design Engineer | Kilby Labs
Dallas, TX · On-site
$95K - $130K/yr
... annotation support. 9. Mentor and guide contract layout engineers in India, review their work and ... data center, personal electronics and communications equipment. At our core, we have a passion to ...
Senior AI Underwriter
Austin, TX · On-site
$100K - $120K/yr
... data-driven outcomes across our book of business ... What You'll Do * Partner closely with AI researchers and engineers throughout the full model ...
Senior AI Underwriter
Austin, TX · On-site
$100K - $120K/yr
... data-driven outcomes across our book of business ... What You'll Do * Partner closely with AI researchers and engineers throughout the full model ...
Senior AI Underwriter
Austin, TX · On-site +1
$100K - $120K/yr
... data-driven outcomes across our book of business ... What You'll Do * Partner closely with AI researchers and engineers throughout the full model ...
Senior AI Underwriter
Austin, TX · On-site +1
$100K - $120K/yr
... data-driven outcomes across our book of business ... What You'll Do * Partner closely with AI researchers and engineers throughout the full model ...
LLM Engineering & NLU Design: Manage complex intent taxonomies capable of handling 200+ unique ... Conduct Return-on-Investment (ROI) analysis of AI-powered solutions and interpret telemetry data to ...
LLM Engineering & NLU Design: Manage complex intent taxonomies capable of handling 200+ unique ... Conduct Return-on-Investment (ROI) analysis of AI-powered solutions and interpret telemetry data to ...
Flag ambiguous, defective, or unusual images for review by quality inspectors, engineers, or data ... Prior experience with image annotation, data labeling, or QA work * Familiarity with labeling tools ...
Flag ambiguous, defective, or unusual images for review by quality inspectors, engineers, or data ... Prior experience with image annotation, data labeling, or QA work * Familiarity with labeling tools ...
Applied Researcher 1
Austin, TX · On-site
Construct end-to-end pipelines for large-scale data ingestion, cleaning, annotation, and feature engineering. Leverage internal compliance datasets and proprietary labeling tools to ensure high ...
Applied Researcher 1
Austin, TX · On-site
Construct end-to-end pipelines for large-scale data ingestion, cleaning, annotation, and feature engineering. Leverage internal compliance datasets and proprietary labeling tools to ensure high ...
Data Annotation Engineer information
See Texas salary details
$48K - $60.3K
2% of jobs
$60.3K - $72.6K
9% of jobs
$81.2K is the 25th percentile. Wages below this are outliers.
$72.6K - $85K
20% of jobs
$85K - $97.3K
4% of jobs
$97.3K - $109.6K
4% of jobs
$109.6K - $121.9K
1% of jobs
$121.9K - $134.2K
0% of jobs
$134.2K - $146.6K
0% of jobs
The median wage is $152.7K / yr.
$146.6K - $158.9K
18% of jobs
$158.9K - $171.2K
0% of jobs
$176K is the 75th percentile. Wages above this are outliers.
$171.2K - $183.5K
41% of jobs
$48K
$137.4K
$183.5K
How much do data annotation engineer jobs pay per year?
What are the main challenges faced by Data Annotation Engineers in their daily work?
One of the main challenges Data Annotation Engineers face is ensuring consistent accuracy and quality in labeling large and often complex datasets. Attention to detail is critical, as even small errors can significantly affect machine learning model performance. Additionally, engineers must frequently adapt to evolving annotation guidelines and emerging data types, which requires ongoing learning and flexibility. Collaboration with data scientists and project managers is common to clarify requirements and resolve ambiguities, making strong communication skills essential for success.
What are the key skills and qualifications needed to thrive in the Data Annotation Engineer position, and why are they important?
To thrive as a Data Annotation Engineer, you need a strong background in data analysis, attention to detail, and familiarity with annotation processes, often supported by a degree in computer science or a related field. Proficiency with annotation tools like Labelbox, CVAT, or VIA, and understanding of data formats used in machine learning, is commonly required. Excellent communication, collaboration, and organizational skills help you effectively manage projects and cooperate with cross-functional teams. These abilities are crucial for delivering high-quality labeled data, which directly impacts the performance of AI and machine learning models.
Is data annotation real or fake?
What is a data annotation engineer?
How hard is it to get a job with data annotation tech?
Does data annotation really pay you?
What is a Data Annotation Engineer job?
A Data Annotation Engineer is responsible for labeling and annotating data—such as text, images, audio, or video—to train machine learning models. They ensure that data is accurately categorized and structured to improve model performance. This role often involves using specialized annotation tools, following detailed guidelines, and working closely with data scientists and AI teams. Data Annotation Engineers play a crucial role in the development of AI applications by providing high-quality labeled datasets for supervised learning.

JPMorgan Chase & Co. rating
8.1
Based on 470 frontline employees who took The Breakroom Quiz
46th of 141 rated banks
Job description
As Applied AI/ML Lead within Commercial & Investment Bank with the Healthcare Provider team, you will lead the design, development, and production deployment of AI/ML solutions focused on image classification, text categorization, and data extraction from scanned TIF documents. You will architect and implement computer vision pipelines leveraging CRNN architectures for document type identification, page-level categorization, and visual feature extraction.
Job Responsibilities
- Lead the design, development, and production deployment of AI/ML solutions focused on image classification, text categorization, and data extraction from scanned TIF documents and evaluate and explore additional models and architectures to continuously improve classification accuracy, extraction quality, and processing efficiency.
- Drive the development and fine-tuning of models for document understanding, text categorization, named entity recognition, and semantic understanding and combine visual layout information, textual content, and spatial relationships to extract structured data from complex scanned documents, while enabling automated categorization and metadata tagging of OCR-extracted text.
- Lead the integration and optimization of OCR technology and generative AI capabilities into the document processing pipeline, ensuring high-accuracy text extraction from scanned TIF images across diverse document types, layouts, fonts, and quality levels. Leverage Amazon Bedrock to explore foundation model capabilities for intelligent document understanding, classification, document summarization, and augmenting traditional extraction pipelines.
- Architect and implement scalable ML training and inference pipelines using AWS SageMaker, managing model training, hyperparameter tuning, distributed training for large vision models, and real-time/batch inference endpoint deployment. Collaborate with software engineering teams to integrate trained models into Java/Python-based microservices deployed on AWS EKS, ensuring low-latency, high-throughput inference for production document processing workloads.
- Establish robust MLOps practices and annotation workflows, including model versioning, automated retraining triggers, A/B testing of model variants, drift detection on document distributions, and comprehensive performance monitoring dashboards and design and manage labeling strategies for training data, ensuring high-quality ground truth datasets for image classification, text categorization, and document extraction tasks.
- Build and manage a team of ML engineers and applied scientists, fostering a culture of experimentation, rapid prototyping, and rigorous evaluation of model performance against business KPIs.
Required Qualifications, Capabilities, and Skills
- Bachelor's degree or MS or PhD in quantitative discipline, e.g. Computer Science, Mathematics, Operations Research, Data Science.
- 7+ years of experience in applied ML/AI roles with at least 2+ years leading teams or large-scale ML initiatives
- Advanced proficiency in Python and enterprise languages, with deep experience in PyTorch, TensorFlow, Hugging Face Transformers, OpenCV, and Pillow for model development and image processing. Proficiency in Java and/or Groovy for integrating ML capabilities into backend services and enterprise application ecosystems. Familiarity with Oracle databases for feature extraction, training data retrieval, and integration with ML workflows.
- Deep expertise in computer vision and NLP models, with hands-on experience implementing and fine-tuning CRNN-based architectures for image classification and feature extraction. Strong experience with multimodal document understanding combining text, layout, and image features. Proficiency in transformer-based NLP models for text categorization, sequence labeling, named entity recognition, and semantic analysis of OCR-extracted content.
- Practical experience with OCR technologies and image preprocessing, for text extraction from scanned documents, with an understanding of OCR accuracy optimization, preprocessing techniques, and post-processing correction. Experience with image preprocessing for scanned documents in TIF format, including multi-page handling, resolution normalization, deskewing, binarization, and noise removal.
- Deep hands-on experience with AWS SageMaker and Amazon Bedrock, including end-to-end ML workflows such as training jobs, processing pipelines, model registry, distributed training, and real-time/batch inference endpoints. Practical experience leveraging foundation models, prompt engineering, and building generative AI-augmented document processing solutions. Experience deploying and scaling ML models as containerized microservices on AWS EKS using Docker and Kubernetes, with expertise in optimizing GPU-based inference workloads.
- Strong knowledge of MLOps tools and practices, including MLflow, SageMaker Pipelines, or equivalent platforms for experiment tracking, pipeline automation, and model lifecycle management. Excellent leadership and communication skills with the ability to present complex technical concepts to senior leadership and non-technical audiences.
Preferred Qualifications, Capabilities, and Skills
- Domain expertise in the healthcare industry
- Experience in applied ML/AI roles in document processing, computer vision, or NLP domains
What JPMorgan Chase & Co. employees say
Pay
Benefits
Hours and flexibility
Workplace
Get the full story on Breakroom
About JPMorgan Chase & Co
Sourced by ZipRecruiter
Industry
Finance and insurance and banking and credit intermediation
Company size
10,000+ Employees
Headquarters location
New York, NY, US