... such as text classification, entity extraction, unstructured data extraction, document ... CV/ML, with direct responsibility for building and managing labeling programs. * Hands-on ...
... such as text classification, entity extraction, unstructured data extraction, document ... CV/ML, with direct responsibility for building and managing labeling programs. * Hands-on ...
... text, audio, sensor data, metadata, etc. Experience with data labeling platforms. * Strong ... Track, facilitate, and direct priorities for several concurrent annotation efforts. Track key ...
... text, audio, sensor data, metadata, etc. Experience with data labeling platforms. * Strong ... Track, facilitate, and direct priorities for several concurrent annotation efforts. Track key ...
$168K - $210K/yr
The Opportunity As Senior Director, Client Accounts for our AI Data Foundry practice, you will lead ... text, audio, video, image, geo, and 3D data at any scale and complexity. Our data annotation ...
$168K - $210K/yr
The Opportunity As Senior Director, Client Accounts for our AI Data Foundry practice, you will lead ... text, audio, video, image, geo, and 3D data at any scale and complexity. Our data annotation ...
Senior Manager of Quality Assurance, AIML Data Operations
Cupertino, CA · On-site
$214K - $356.60K/yr
... data annotation tasks across multiple data types (text, audio, image, video, and multimodal ... Work with the Director of Data Operations to align QA strategy with broader organizational ...
Senior Manager of Quality Assurance, AIML Data Operations
Cupertino, CA · On-site
$214K - $356.60K/yr
... data annotation tasks across multiple data types (text, audio, image, video, and multimodal ... Work with the Director of Data Operations to align QA strategy with broader organizational ...
Machine Learning Engineer, Data
San Francisco, CA · On-site
$134.90K - $162K/yr
Our text-to-speech models are purpose-built for high-volume conversational deployments, engineered ... End-to-end audio annotation pipeline : Currently some stages exist as prototypes; productionizing ...
Machine Learning Engineer, Data
San Francisco, CA · On-site
$134.90K - $162K/yr
Our text-to-speech models are purpose-built for high-volume conversational deployments, engineered ... End-to-end audio annotation pipeline : Currently some stages exist as prototypes; productionizing ...
VP, AI Solutions
$300K/yr
Design and optimize multi-modal workflows (text, audio, video, image, and sensor data). * Drive pre ... Direct experience with leading LLM providers , delivering solutions for data labeling, annotation ...
VP, AI Solutions
$300K/yr
Design and optimize multi-modal workflows (text, audio, video, image, and sensor data). * Drive pre ... Direct experience with leading LLM providers , delivering solutions for data labeling, annotation ...
Data Operations Engineer
Mountain View, CA · On-site
$136.30K - $163.60K/yr
... direct interaction with datasets. • Professional proficiency in Mandarin Chinese and English is ... datasets (text, image, video, audio, or 3D). • Familiarity with data annotation, labeling ...
Data Operations Engineer
Mountain View, CA · On-site
$136.30K - $163.60K/yr
... direct interaction with datasets. • Professional proficiency in Mandarin Chinese and English is ... datasets (text, image, video, audio, or 3D). • Familiarity with data annotation, labeling ...
... and annotation schemas, that inform model training, DCO reward signals, and creative tooling ... Maintain current, deep fluency across tools including text-to-image (Midjourney, Adobe Firefly ...
... and annotation schemas, that inform model training, DCO reward signals, and creative tooling ... Maintain current, deep fluency across tools including text-to-image (Midjourney, Adobe Firefly ...
... and annotation schemas, that inform model training, DCO reward signals, and creative tooling ... Maintain current, deep fluency across tools including text-to-image (Midjourney, Adobe Firefly ...
... and annotation schemas, that inform model training, DCO reward signals, and creative tooling ... Maintain current, deep fluency across tools including text-to-image (Midjourney, Adobe Firefly ...
Maintain current, deep fluency across tools including text-to-image (Midjourney, Adobe Firefly ... annotation schemas, or training data curation • Familiarity with aesthetic moderation, content ...
Maintain current, deep fluency across tools including text-to-image (Midjourney, Adobe Firefly ... annotation schemas, or training data curation • Familiarity with aesthetic moderation, content ...
... direct influence over both technical direction and business outcomes. In this role, your focus will ... Training custom models to automate critical annotation workflows for audio, video, and text data ...
... direct influence over both technical direction and business outcomes. In this role, your focus will ... Training custom models to automate critical annotation workflows for audio, video, and text data ...
... direct influence over both technical direction and business outcomes.In this role, your focus will ... Training custom models to automate critical annotation workflows for audio, video, and text data.3. ...
... direct influence over both technical direction and business outcomes.In this role, your focus will ... Training custom models to automate critical annotation workflows for audio, video, and text data.3. ...
... a direct line into the evolution of Liquid's multimodal post-training stack. If you care about ... text pair curation, annotation pipelines, and synthetic data generation for visual tasks. * Run ...
... a direct line into the evolution of Liquid's multimodal post-training stack. If you care about ... text pair curation, annotation pipelines, and synthetic data generation for visual tasks. * Run ...
Language Engineer, Artificial General Intelligence - Data Services
Boston, MA · On-site
$124.40K - $149.40K/yr
... annotation, dialog/semantic schemas, and automatic processing of large datasets. You will play a ... Criminal history may have a direct, adverse, and negative relationship with some of the material ...
Language Engineer, Artificial General Intelligence - Data Services
Boston, MA · On-site
$124.40K - $149.40K/yr
... annotation, dialog/semantic schemas, and automatic processing of large datasets. You will play a ... Criminal history may have a direct, adverse, and negative relationship with some of the material ...
Freelancer - GenAI Experts
New York, NY · On-site +1
... Image, Text-to-Video, Multi-Modal models, and AI Agents. Analyze outputs, categorize ... Support data annotation, curation, and quality control processes * Summarize findings into ...
Freelancer - GenAI Experts
New York, NY · On-site +1
... Image, Text-to-Video, Multi-Modal models, and AI Agents. Analyze outputs, categorize ... Support data annotation, curation, and quality control processes * Summarize findings into ...
Language Engineer, Artificial General Intelligence - Data Services
Boston, MA · On-site
$124.40K - $149.40K/yr
... annotation, dialog/semantic schemas, and automatic processing of large datasets. You will play a ... Criminal history may have a direct, adverse, and negative relationship with some of the material ...
Language Engineer, Artificial General Intelligence - Data Services
Boston, MA · On-site
$124.40K - $149.40K/yr
... annotation, dialog/semantic schemas, and automatic processing of large datasets. You will play a ... Criminal history may have a direct, adverse, and negative relationship with some of the material ...
Language Engineer, Artificial General Intelligence - Data Services
Boston, MA · On-site
$124.40K - $149.40K/yr
... annotation workflow development - Master's or higher degree in a relevant field (Computational ... Criminal history may have a direct, adverse, and negative relationship with some of the material ...
Language Engineer, Artificial General Intelligence - Data Services
Boston, MA · On-site
$124.40K - $149.40K/yr
... annotation workflow development - Master's or higher degree in a relevant field (Computational ... Criminal history may have a direct, adverse, and negative relationship with some of the material ...
Language Engineer, Artificial General Intelligence - Data Services
Boston, MA · On-site
$124.40K - $149.40K/yr
... annotation workflow development - Master's or higher degree in a relevant field (Computational ... Criminal history may have a direct, adverse, and negative relationship with some of the material ...
Language Engineer, Artificial General Intelligence - Data Services
Boston, MA · On-site
$124.40K - $149.40K/yr
... annotation workflow development - Master's or higher degree in a relevant field (Computational ... Criminal history may have a direct, adverse, and negative relationship with some of the material ...
Language Engineer, Artificial General Intelligence - Data Services
Boston, MA · On-site
$124.40K - $149.40K/yr
... annotation workflow development - Experience with language annotation and other forms of data ... Criminal history may have a direct, adverse, and negative relationship with some of the material ...
Language Engineer, Artificial General Intelligence - Data Services
Boston, MA · On-site
$124.40K - $149.40K/yr
... annotation workflow development - Experience with language annotation and other forms of data ... Criminal history may have a direct, adverse, and negative relationship with some of the material ...
Language Engineer, Artificial General Intelligence - Data Services
Boston, MA · On-site
$124.40K - $149.40K/yr
... annotation workflow development - Experience with language annotation and other forms of data ... Criminal history may have a direct, adverse, and negative relationship with some of the material ...
Language Engineer, Artificial General Intelligence - Data Services
Boston, MA · On-site
$124.40K - $149.40K/yr
... annotation workflow development - Experience with language annotation and other forms of data ... Criminal history may have a direct, adverse, and negative relationship with some of the material ...
Director Text Annotation information
Other
Posted 24 days ago
Job description
The Fitch Group Emerging technology AI group is seeking a Data Annotation AI Specialist to be part of a team that will be dedicated to build and support Generative AI, Machine learning, Deep Learning and Data science solutions across the organization. The position could be based out of our Chicago or NY offices. We are seeking a Data Annotation AI Specialist to lead the evaluation, selection, and onboarding of a data annotation platform, and to establish best-in-class annotation workflows for our NLP and CV initiatives. This role will bridge product, data science, MLOps, and compliance to ensure high-quality labeled datasets that accelerate model development for tasks such as text classification, entity extraction, unstructured data extraction, document summarization, and prompt/response curation.
What We Offer:
- This will be a high impact role with significant visibility where the candidate will work on some flagship Fitch products
- The candidate will have an excellent opportunity to work in the cutting-edge field of AI, NLP, Computer vision and MLOPs/LLMOps
- Fitch promotes an excellent work culture and is known for providing a good work life balance
We'll Count on You To:
- Platform Evaluation and Onboarding:
- Assess and compare data annotation platforms (e.g., Labelbox, Prodigy, Snorkel, Scale AI, SuperAnnotate, LightTag, custom open-source stacks) against business and technical requirements.
- Lead proof-of-concept trials; define evaluation criteria (quality, throughput, cost, security, privacy, compliance, UI/UX, workflow features, integrations, auditability).
- Drive vendor due diligence, security reviews, and coordinate procurement/contracting with Legal, Security, and Procurement.
- Plan and execute platform deployment, integrations (SSO, data lakes, MLOps pipelines), and role-based access controls.
- Workflow and Taxonomy Design:
- Collaborate with NLP and CV scientists and product owners to define labeling taxonomies, guidelines, and rubrics for tasks such as NER, data extraction, intent classification, topic modeling, toxicity/BI risk tagging, and document QA.
- Establish annotation protocols, inter-annotator agreement measures (IAA), and quality gates; design multi-pass review processes and adjudication steps.
- Develop gold standards and calibration sets; maintain versioning and change management of label schemas.
- Quality Management:
- Implement QA metrics and dashboards (precision/recall on labeled subsets, IAA, disagreement analysis, drift detection, sampling strategies).
- Design active learning and human-in-the-loop strategies to continually improve data quality and labeling efficiency.
- Conduct audits, bias checks, and error analyses; enforce data governance and documentation (data sheets, model cards inputs).
- Operations and Scale:
- Build and manage a hybrid workforce model (in-house annotators, expert reviewers, external vendors) including training, SLAs, throughput planning, and budget tracking.
- Create training materials and onboarding programs for annotators, SMEs, and reviewers; run calibration sessions and periodic refreshers.
- Optimize throughput and cost with workflow automation, pre-labeling, heuristics, and annotation tooling features.
- Integration and MLOps:
- Integrate the annotation platform with data pipelines, model training loops, experiment tracking, and storage (e.g., Databricks, Snowflake, AWS/GCP/Azure, MLflow).
- Implement programmatic interfaces (APIs/SDKs) for data ingestion/export, schema management, and reproducibility.
- Collaborate on dataset curation, splitting strategies, and governance (PII handling, encryption, retention policies).
What You Need to Have:
- 4–7+ years of experience in data annotation, data operations, or applied NLP/CV/ML, with direct responsibility for building and managing labeling programs.
- Hands-on experience with annotation platforms and workflows for NLP tasks; familiarity with enterprise deployment considerations (SSO, RBAC, audit, SOC2).
- Strong understanding of NLP and CV techniques: tokenization, embeddings, NER, text classification, sentiment, summarization, prompt engineering, and evaluation.
- Proficiency in Python and data tooling (Pandas, spaCy, Hugging Face, NLTK); experience using APIs/SDKs to automate annotation and active learning loops.
- Experience defining label taxonomies, guidelines, and measuring IAA; practical knowledge of QA methodologies and error/bias analysis.
- Familiarity with cloud platforms (AWS/GCP/Azure), data governance, and secure data handling.
- Excellent communication skills; ability to collaborate with data scientists, product managers, engineers, SMEs, and vendors.
What Would Make You Stand Out:
- Experience with large language model (LLM) data curation, RLHF/RLAIF pipelines, and prompt/response quality evaluation.
- Background in financial services, risk analytics, or regulated industries with strong compliance requirements.
- Prior experience building hybrid annotation teams and managing external vendors.
- Knowledge of annotation for multilingual NLP and document-heavy workflows (PDF parsing, OCR)