2

Full Time Image Annotation Jobs in Washington (NOW HIRING)

Full Time Image Annotation information

See Washington salary details

$860

$2.4K

$3.5K

How much do full time image annotation jobs pay per week?

As of Jun 6, 2026, the average weekly pay for full time image annotation in Washington is $2,397.46, according to ZipRecruiter salary data. Most workers in this role earn between $1,763.46 and $2,984.62 per week, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive in the Full Time Image Annotation position, and why are they important?

To thrive as a Full Time Image Annotation professional, you need strong attention to detail, familiarity with visual data, and typically at least a high school diploma or equivalent. Experience with annotation platforms, image labeling software, and sometimes basic knowledge of programming languages or AI concepts can be advantageous. Excellent concentration, time management, and the ability to work both independently and collaboratively are valuable soft skills. These competencies ensure accurate, consistent data labeling, which is crucial for developing reliable machine learning and AI systems.

What is a Full Time Image Annotation job?

A Full Time Image Annotation job involves labeling or tagging images to help train machine learning models. Annotators use specialized tools to draw bounding boxes, classify objects, or mark key points within images. This process improves artificial intelligence systems in areas like autonomous driving, medical imaging, and facial recognition. The role requires attention to detail, accuracy, and sometimes familiarity with specific domain guidelines. It is typically performed in-office or remotely, depending on the employer.

What types of images and annotation tasks can I expect in a Full Time Image Annotation role?

In a Full Time Image Annotation position, you may work with a broad variety of image types, including photographs, satellite images, medical scans, or video frames, depending on the project and industry. Typical annotation tasks include drawing bounding boxes, segmenting objects, labeling regions, or identifying specific features for use in machine learning datasets. Many teams work on tight deadlines and coordinate through digital platforms, so accuracy and consistency are key performance metrics. You may collaborate with quality assurance specialists and data scientists to ensure the annotated data meets required standards. Over time, proficiency in advanced annotation techniques can lead to opportunities for quality control, project management, or more technical roles within the AI data pipeline.

What are the most commonly searched types of Image Annotation jobs in Washington? The most popular types of Image Annotation jobs in Washington are:
What are popular job titles related to Full Time Image Annotation jobs in Washington? For Full Time Image Annotation jobs in Washington, the most frequently searched job titles are:
What cities in Washington are hiring for Full Time Image Annotation jobs? Cities in Washington with the most Full Time Image Annotation job openings:
Machine Learning Engineer - Geospatial (TS/SCI)

Machine Learning Engineer - Geospatial (TS/SCI)

LaunchCode

Springfield, VA • On-site

$175K - $250K/yr

Full-time

Posted 3 days ago


Job description

Description
Title: AI/Machine Learning Engineer - Vision Language Models / Multimodal AI (NGA)
Location: Springfield or Herndon, VA (onsite)
Clearance: TS/SCI (CI Poly preferred)
Position Type: Full-Time, Direct Hire
Pay: $175,000 to $250,000 for an SME
Company: The name of our partner organization will be disclosed during the interview process. This is not a direct role with LaunchCode; it is a position through LaunchCode, working with one of our partner companies.
Disclaimer: We are unable to provide work sponsorship for this role
Overview:
We're hiring a AI/Machine Learning Engineer with strong experience in multimodal AI and large-scale model training to support advanced vision-language initiatives in a secure government environment. This role will focus on fine-tuning Vision Language Models (VLMs) on domain-specific geospatial imagery, building scalable AWS training infrastructure, and developing evaluation frameworks for image understanding and spatial reasoning. Ideal candidates will have deep experience with PyTorch, HuggingFace, distributed training, and computer vision, along with the ability to optimize and deploy multimodal models in mission-critical environments.
Huge plus for candidates who have hands-on experience taking multimodal models such as CLIP, LLaVA, Qwen-VL, or similar Vision Language Models and fine-tuning them on classified or mission-specific imagery datasets. The ideal candidate can build the AWS infrastructure needed to train and scale these models, evaluate performance improvements across real-world use cases, and deploy solutions into secure government or air-gapped environments.
Key Responsibilities:
  • Design and execute fine-tuning pipelines for Vision Language Models (VLMs) using domain-specific imagery datasets
  • Handle data preprocessing, training orchestration, and hyperparameter optimization for multimodal models
  • Build evaluation frameworks for image understanding, visual question answering, and spatial reasoning tasks
  • Develop scalable AWS-based ML infrastructure using SageMaker and GPU-enabled EC2 for distributed training
  • Create data pipelines for curating, annotating, and transforming geospatial imagery into model-ready datasets
  • Partner with applied scientists and architects on model architecture improvements, LoRA/QLoRA strategies, and inference optimization

Required Qualifications:
  • Active TS/SCI with CI Poly
  • 5+ years of machine learning engineering experience focused on deep learning
  • 1+ year of hands-on experience fine-tuning foundation models (LLMs or VLMs)
  • Experience with LoRA, QLoRA, adapters, supervised fine-tuning, instruction tuning, and RLHF/DPO
  • 4+ years of advanced Python development for ML workloads
  • Strong PyTorch and HuggingFace experience (Transformers, PEFT, Datasets, Accelerate)
  • Experience with distributed training frameworks such as DeepSpeed, FSDP, or Megatron
  • 3+ years working with computer vision or multimodal models
  • Familiarity with vision transformer architectures (ViT, CLIP, LLaVA, etc.)
  • Experience processing and augmenting image datasets at scale
  • 3+ years with AWS ML infrastructure including SageMaker, EC2 GPU environments, and S3
  • Experience with ML evaluation pipelines, benchmarking, metrics, and result analysis
  • Strong software engineering fundamentals including version control, testing, and CI/CD

Preferred Qualifications:
  • 2+ years working with geospatial or remote sensing imagery
  • Experience with EO or SAR satellite imagery
  • Understanding of geospatial metadata, coordinate systems, and imagery preprocessing
  • Experience with model quantization / inference optimization (vLLM, TensorRT, ONNX)
  • MLOps tooling experience (MLflow, Weights & Biases, SageMaker Experiments)
  • Familiarity with annotation tools and active learning workflows
  • Containerized ML experience with Docker / ECR / ECS / EKS
  • Experience supporting ATO processes and NIST 800-53 compliance
  • Experience deploying in air-gapped/disconnected environments
  • Familiarity with multimodal evaluation benchmarks (MMMU, MMBench, GQA)
  • Publications or contributions in computer vision, multimodal AI, or VLMs
  • Synthetic data generation experience for training augmentation