Remote Entry Level Data Annotation Jobs in New York

Research Crawling Engineer

New York, NY · Remote

$100K - $130K/yr

Location: Remote - Must have a 6 hour overlap with EST Remote | Full-time Compensation: $100K ... Develop robust pipelines for data cleaning, deduplication, filtering, and normalization. * Build ...

Generative AI Analyst

New York, NY · On-site +1

$50K - $60K/yr

We seek an entry-level AI Analyst to join our team to research, prototype and implement AI ... make data-driven recommendations for improvements Qualifications Bachelor's degree in Computer ...

Generative AI Analyst

New York, NY · On-site +1

$50K - $60K/yr

We seek an entry-level AI Analyst to join our team to research, prototype and implement AI ... Data Science, or related field ● 0-2 years of professional experience (personal projects ...

Remote Entry Level Data Annotation information

What are the key skills and qualifications needed to thrive as a Remote Entry Level Data Annotation specialist, and why are they important?

To thrive as a Remote Entry Level Data Annotation specialist, you need strong attention to detail, basic computer literacy, and a high school diploma or equivalent. Familiarity with annotation platforms, data labeling tools, and sometimes basic spreadsheet software is typically required. Effective time management, communication, and the ability to follow precise instructions help candidates excel in this role. These skills ensure accurate, high-quality data labeling, which is crucial for training reliable machine learning models.

What is the difference between Remote Entry Level Data Annotation vs Remote Data Labeling Specialist?

Aspect | Remote Entry Level Data Annotation | Remote Data Labeling Specialist
Credentials | Basic computer skills, attention to detail | Similar, often no formal certifications required
Work Environment | Remote, flexible hours | Remote, often part-time or freelance
Industry Usage | AI, machine learning, tech companies | AI, autonomous vehicles, tech firms
Search Intent | Entry-level data annotation jobs | Data labeling roles for AI projects

Remote Entry Level Data Annotation and Remote Data Labeling Specialist roles are similar in credentials and work environment, both serving AI and machine learning industries. The main difference lies in terminology used by employers and job seekers, with 'Data Labeling Specialist' often emphasizing more specialized tasks. Both roles are suitable for individuals seeking remote, entry-level positions in data preparation for AI applications.

What are some common challenges faced by remote entry-level data annotators, and how can they be managed?

Remote entry-level data annotators often encounter challenges related to maintaining focus and productivity while working independently, as well as ensuring consistency and accuracy in their annotations. To manage these challenges, it's helpful to establish a clear daily routine, set up a dedicated workspace, and actively communicate with team leads or supervisors for guidance. Utilizing collaboration tools and regularly reviewing project guidelines can also help maintain annotation quality and keep workflow on track.

What is a Remote Entry Level Data Annotation job?

A Remote Entry Level Data Annotation job involves labeling, tagging, or categorizing data such as images, text, audio, or video from a remote location. These roles are crucial for training machine learning algorithms, as annotated data helps improve the accuracy of artificial intelligence models. Entry-level positions typically require attention to detail and basic computer skills but do not usually require prior experience or specialized knowledge. Working remotely allows you to perform these tasks from home or any location with internet access.
Infographic showing various Remote Entry Level Data Annotation job openings in New York as of June 2026, with employment types broken down into 92% Full Time, 4% Part Time, and 4% Temporary. Highlights a 97% Physical, 1% Hybrid, and 2% Remote job distribution.

Research Crawling Engineer

MLabs

New York, NY • Remote

$100K - $130K/yr

Other

Posted 27 days ago


Job description

Location: Remote - Must have a 6 hour overlap with EST

Remote | Full-time

Compensation: $100K - $130K

We are hiring on behalf of our client, a technical infrastructure firm specializing in the delivery of massive-scale web data to organizations developing advanced artificial intelligence models. The organization supports high-capacity bandwidth-sharing networks and operates a distributed crawler capable of accessing high-quality public web data at a global scale. Additionally, the team has engineered sophisticated pipelines for the ingestion, segmentation, and annotation of billions of multimedia files, facilitating dataset creation for frontier research labs.

The organization operates as a lean, technical team that prioritizes speed and direct execution. As a Research Crawling Engineer, the successful candidate will design and operate large-scale web data acquisition systems. This role encompasses distributed systems, scraping infrastructure, and data pipelines, focusing on providing high-quality inputs for research and model development.

Key Responsibilities

  • Construct and maintain large-scale web crawlers across diverse domains.
  • Design high-throughput, fault-tolerant systems for data collection, managing volumes ranging from millions to billions of URLs per day.
  • Navigate anti-bot systems, rate limits, and dynamic, JavaScript-heavy websites.
  • Develop robust pipelines for data cleaning, deduplication, filtering, and normalization.
  • Build and maintain datasets specifically structured for research and machine learning model training.
  • Monitor and optimize crawl performance, coverage, and data quality through rapid iteration.
  • Collaborate with research teams to ensure data collection efforts align with modeling requirements.
  • Optimize infrastructure to ensure cost-efficiency, low latency, and reliability.

Requirements

  • Extensive programming experience in one or more of the following: Go, Rust, Python, Java, or C++.
  • Proven experience in building web crawlers or large-scale data pipelines.
  • Solid understanding of HTTP, networking protocols, and browser behavior.
  • Familiarity with distributed systems and parallel processing techniques.
  • Experience handling large datasets, ideally at the terabyte to petabyte scale.
  • Demonstrated ability to debug and maintain systems within unstable or adversarial environments.

Preferred Qualifications

  • Experience with NLP pipelines or dataset curation for machine learning.
  • Familiarity with LLM pre-training data or retrieval systems.
  • Practical experience with headless browsers (e.g., Playwright, Puppeteer, or Chrome DevTools Protocol).
  • Knowledge of proxy systems, IP rotation, and large-scale request orchestration.
  • Background in data quality evaluation or benchmarking.
  • Experience running workloads on cloud or bare-metal infrastructure.

Benefits

  • Impactful Opportunity: Contribute to the development of a web-scale crawler and knowledge graph at the forefront of AI data accessibility.
  • High-Performance Culture: Join a lean, low-ego team that prioritizes high output and professional growth.
  • Remote Work: This position is part of a fully remote team, offering flexibility and autonomy.
  • Competitive Compensation: A package including a competitive salary, comprehensive benefits, and equity, commensurate with experience and the ability to operate at scale.

Interview Process

  1. Recruiter Coordination Call
  2. Hiring Manager Interview
  3. Founder / CEO Interview
  4. Secondary Executive Interview
  5. Final Interview


Due to the high volume of applications we anticipate, we regret that we are unable to provide individual feedback to all candidates. If you do not hear back from us within 4 weeks of your application, please assume that you have not been successful on this occasion. We genuinely appreciate your interest and wish you the best in your job search.

Commitment to Equality and Accessibility:

At MLabs, we are committed to offering equal opportunities to all candidates. We ensure no discrimination, provide accessible job adverts, and supply information in accessible formats. Our goal is to foster a diverse, inclusive workplace with equal opportunities for all. If you need any reasonable adjustments during any part of the hiring process, or would like to see the job advert in an accessible format, please let us know at the earliest opportunity by emailing human-resources@mlabs.city.

MLabs Ltd collects and processes the personal information you provide such as your contact details, work history, resume, and other relevant data for recruitment purposes only. This information is managed securely in accordance with MLabs Ltd's Privacy Policy and Information Security Policy, and in compliance with applicable data protection laws. Your data may be shared only with clients and trusted partners where necessary for recruitment purposes. You may request the deletion of your data or withdraw your consent at any time by contacting legal@mlabs.city.