Experience with image understanding tasks such as semantic segmentation, scene recognition, image captioning, visual question answering, image aesthetics, or image retrieval.Strong fundamental ...
Experience with image understanding tasks such as semantic segmentation, scene recognition, image captioning, visual question answering, image aesthetics, or image retrieval.Strong fundamental ...
... image captioning, visual question answering, image aesthetics, or image retrieval.\nStrong fundamental software engineering background A track record of creative problem-solving - taking an ambiguous ...
... image captioning, visual question answering, image aesthetics, or image retrieval.\nStrong fundamental software engineering background A track record of creative problem-solving - taking an ambiguous ...
... image captioning, question answering, language models, etc. Learn more about our innovative research: The Atlanta Area base salary range for this full-time position is $137,500-$168,200, which can ...
... image captioning, question answering, language models, etc. Learn more about our innovative research: The Atlanta Area base salary range for this full-time position is $137,500-$168,200, which can ...
Audio, image, or text applications - Source separation, text-to-speech, music synthesis, image segmentation, image captioning, question answering, language models, etc. Learn more about our ...
Audio, image, or text applications - Source separation, text-to-speech, music synthesis, image segmentation, image captioning, question answering, language models, etc. Learn more about our ...
Experience with image understanding tasks such as semantic segmentation, scene recognition, image captioning, visual question answering, image aesthetics, or image retrieval. Strong fundamental ...
Experience with image understanding tasks such as semantic segmentation, scene recognition, image captioning, visual question answering, image aesthetics, or image retrieval. Strong fundamental ...
Image Library Editor (Volunteer)
Los Angeles, CA · On-site +1
Optional but helpful: a few examples of image editing, photo sourcing, metadata, captioning, or other relevant visual work If you cannot upload your materials, email them to [email protected]. Please ...
Image Library Editor (Volunteer)
Los Angeles, CA · On-site +1
Optional but helpful: a few examples of image editing, photo sourcing, metadata, captioning, or other relevant visual work If you cannot upload your materials, email them to [email protected]. Please ...
Shipped a product feature backed by a VLM (e.g., image captioning, document understanding) - including handling inference latency, cost-per-call tradeoffs, and degraded-mode fallbacks * Shipped AI ...
Shipped a product feature backed by a VLM (e.g., image captioning, document understanding) - including handling inference latency, cost-per-call tradeoffs, and degraded-mode fallbacks * Shipped AI ...
Shipped a product feature backed by a VLM (e.g., image captioning, document understanding) - including handling inference latency, cost-per-call tradeoffs, and degraded-mode fallbacks * Shipped AI ...
Shipped a product feature backed by a VLM (e.g., image captioning, document understanding) - including handling inference latency, cost-per-call tradeoffs, and degraded-mode fallbacks * Shipped AI ...
Shipped a product feature backed by a VLM (e.g., image captioning, document understanding) - including handling inference latency, cost-per-call tradeoffs, and degraded-mode fallbacks * Shipped AI ...
Shipped a product feature backed by a VLM (e.g., image captioning, document understanding) - including handling inference latency, cost-per-call tradeoffs, and degraded-mode fallbacks * Shipped AI ...
Photographer (Part-Time)
New York, NY · On-site
Editing & Captioning: Assist the editing team with editing and captioning images from photographers on assignment in real time, as well as post assignment * Image Review & Management: Review images ...
Photographer (Part-Time)
New York, NY · On-site
Editing & Captioning: Assist the editing team with editing and captioning images from photographers on assignment in real time, as well as post assignment * Image Review & Management: Review images ...
Photographer (Part-Time)
New York, NY · On-site
Editing & Captioning: Assist the editing team with editing and captioning images from photographers on assignment in real time, as well as post assignment * Image Review & Management: Review images ...
Photographer (Part-Time)
New York, NY · On-site
Editing & Captioning: Assist the editing team with editing and captioning images from photographers on assignment in real time, as well as post assignment * Image Review & Management: Review images ...
... captioning, and in-depth data studies, particularly for visual and audio data. • Design evaluation frameworks, metrics, benchmarks, evals, and reward models tailored to image/video/audio quality ...
... captioning, and in-depth data studies, particularly for visual and audio data. • Design evaluation frameworks, metrics, benchmarks, evals, and reward models tailored to image/video/audio quality ...
... captioning, and in-depth data studies, particularly for visual and audio data. • Design evaluation frameworks, metrics, benchmarks, evals, and reward models tailored to image/video/audio quality ...
... captioning, and in-depth data studies, particularly for visual and audio data. • Design evaluation frameworks, metrics, benchmarks, evals, and reward models tailored to image/video/audio quality ...
... captioning, and in-depth data studies, particularly for visual and audio data. • Design evaluation frameworks, metrics, benchmarks, evals, and reward models tailored to image/video/audio quality ...
... captioning, and in-depth data studies, particularly for visual and audio data. • Design evaluation frameworks, metrics, benchmarks, evals, and reward models tailored to image/video/audio quality ...
Member of Technical Staff - Imagine Model
$180K - $440K/yr
... captioning, and in-depth data studies, particularly for visual and audio data. * Design evaluation frameworks, metrics, benchmarks, evals, and reward models tailored to image/video/audio quality and ...
Member of Technical Staff - Imagine Model
$180K - $440K/yr
... captioning, and in-depth data studies, particularly for visual and audio data. * Design evaluation frameworks, metrics, benchmarks, evals, and reward models tailored to image/video/audio quality and ...
... captioning, and in-depth data studies, particularly for visual and audio data. • Design evaluation frameworks, metrics, benchmarks, evals, and reward models tailored to image/video/audio quality ...
... captioning, and in-depth data studies, particularly for visual and audio data. • Design evaluation frameworks, metrics, benchmarks, evals, and reward models tailored to image/video/audio quality ...
Experience with multimodal and vision-language models for image understanding, captioning, or visual analysis. * Experience with cloud-based AI infrastructure for training, fine-tuning, and serving ...
Experience with multimodal and vision-language models for image understanding, captioning, or visual analysis. * Experience with cloud-based AI infrastructure for training, fine-tuning, and serving ...
Experience with multimodal and vision-language models for image understanding, captioning, or visual analysis. * Experience with cloud-based AI infrastructure for training, fine-tuning, and serving ...
Experience with multimodal and vision-language models for image understanding, captioning, or visual analysis. * Experience with cloud-based AI infrastructure for training, fine-tuning, and serving ...
Experience with multimodal and vision-language models for image understanding, captioning, or visual analysis. * Experience with cloud-based AI infrastructure for training, fine-tuning, and serving ...
Experience with multimodal and vision-language models for image understanding, captioning, or visual analysis. * Experience with cloud-based AI infrastructure for training, fine-tuning, and serving ...
Member of Technical Staff - Imagine Model
$180K - $440K/yr
... captioning, and in-depth data studies, particularly for visual and audio data. * Design evaluation frameworks, metrics, benchmarks, evals, and reward models tailored to image/video/audio quality and ...
Quick apply
Member of Technical Staff - Imagine Model
$180K - $440K/yr
... captioning, and in-depth data studies, particularly for visual and audio data. * Design evaluation frameworks, metrics, benchmarks, evals, and reward models tailored to image/video/audio quality and ...
Image Captioning information
See salary details
$19.95 - $24.43
1% of jobs
$24.43 - $28.91
2% of jobs
$28.91 - $33.39
6% of jobs
$33.39 - $37.87
15% of jobs
$38.10 is the 25th percentile. Wages below this are outliers.
$37.87 - $42.35
21% of jobs
The median wage is $43.84 / hr.
$42.35 - $46.83
16% of jobs
$50.75 is the 75th percentile. Wages above this are outliers.
$46.83 - $51.31
17% of jobs
$51.31 - $55.79
10% of jobs
$55.79 - $60.27
4% of jobs
$60.27 - $64.75
6% of jobs
$64.75 - $69.23
2% of jobs
$19
$46
$69
How much do image captioning jobs pay per hour?
What are the key skills and qualifications needed to thrive in the Image Captioning position, and why are they important?
To thrive in an Image Captioning role, you need strong attention to detail, language proficiency, and an ability to interpret visual content accurately. Familiarity with digital annotation tools, content management systems, or image labeling platforms is often required. Exceptional communication and time management skills help you handle large volumes of images and collaborate with team members or editors. These abilities ensure captions are clear, contextually relevant, and consistently meet quality and deadline standards.
What are the typical responsibilities of someone working in image captioning?
Professionals in image captioning are primarily responsible for examining photos, graphics, or other visual data and crafting concise, accurate, and contextually appropriate captions. This process often involves using specialized software to annotate or tag images, ensuring consistency with style guidelines, and collaborating with editors, data teams, or project managers to align with project objectives. Daily tasks may also include reviewing and revising captions based on feedback, managing large batches of content, and maintaining organization within digital asset systems. The role is detail-oriented and can be performed individually or as part of a larger content or machine learning team depending on the employer.
What is an Image Captioning job?
An Image Captioning job involves generating descriptive text for images using artificial intelligence or human expertise. Professionals in this field work with machine learning models, datasets, and natural language processing to create accurate and contextually relevant captions. This role is essential for improving accessibility, content organization, and searchability of visual media. It is commonly used in applications like social media, e-commerce, and automated reporting.
Full-time
Posted 19 days ago
Apple rating
8.1
Based on 661 frontline employees who took The Breakroom Quiz
6th of 30 rated technology retailers
Job description
As a Machine Learning Engineer on the Creative Foundations team, you will pioneer novel approaches to image understanding - designing architectures, training strategies, and intelligent systems that push the boundaries of what our camera and photo experiences can do. You'll continuously survey state-of-the-art research, rapidly prototype high-potential ideas, and translate them into shippable features - while also leveraging model introspection and interpretability techniques to deeply understand why models behave the way they do and guide decisions accordingly. You'll collaborate across disciplines with product designers, software engineers, and aesthetic science researchers in an environment that values diverse perspectives, research rigor, and agility in an ever-evolving ML landscape.
MS or PhD in Computer Science, Machine Learning, Artificial Intelligence, Electrical Engineering, Applied Mathematics, Statistics, or a related field - or equivalent practical experience demonstrating deep ML expertise.Experience in machine learning, computer vision, or a related field (academic or industry), with a strong portfolio of building and shipping models or publishing research.Deep understanding of modern ML architectures and techniques - including (but not limited to) transformers, diffusion models, contrastive learning, multi-modal models, and efficient neural network design and optimization.Proficiency in ML frameworks such as PyTorch, and comfort working across the full model lifecycle from research exploration using large-scale data to production deployment.Experience with image understanding tasks such as semantic segmentation, scene recognition, image captioning, visual question answering, image aesthetics, or image retrieval.Strong fundamental software engineering background
A track record of creative problem-solving - taking an ambiguous challenge and finding an elegant, sometimes unconventional, ML-driven solution.A genuine passion for pushing the boundaries of what's possible with machine learning and a deep curiosity for how intelligent systems can transform everyday experiences.Published research at top-tier venues (CVPR, ICCV, ECCV, NeurIPS, ICML, SIGGRAPH, etc.) is valued - but so is a strong portfolio of impactful shipped features or open-source contributions.Comfort navigating ambiguity and working in a fast-moving R&D environment where the problem definition evolves alongside the solution.A personal connection to photography or visual storytelling - whether through a creative practice, a deep appreciation for the craft, or simply an obsession with what makes a great image.Specific computer vision experience in the areas of Semantic Image Understanding, Diffusion for Image Generation, Style Transfer, Computational Photography, Image Enhancement (Super-Resolution, Eenoising, etc.), Aesthetic Quality Assessment, Personalization (Few-Shot Adaptation)
About Apple
Sourced by ZipRecruiter
Imagine what you could do here! At Apple, new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. Dynamic, intelligent people and inspiring, innovative technologies are the norm here. The people who work here have reinvented entire industries with all Apple Hardware products. The same real passion for innovation that goes into our products also applies to our practices strengthening our dedication to leave the world better than we found it.
Industry
Computer and electronic product manufacturing
Company size
10,000+ Employees
Headquarters location
Cupertino, CA, US
Year founded
1976