Familiarity with open-source LLMs and multimodal models (HuggingFace, LangChain, LlamaIndex, etc.) * Strong foundations in cloud-native development Preferred Experience * Experience with document ...
Deep Learning Research Intern
San Jose, CA · On-site
$18 - $59/hr
Proficiency with PyTorch; experience with HuggingFace or similar frameworks is a plus * Solid Python programming skills * Research experience with publications in top conferences or journals ...
Content Lead
San Francisco, CA · On-site +1
$130K - $180K/yr
We've raised $11M from top funds, founders, and senior leaders at OpenAI, Anthropic, HuggingFace, Mistral, DeepMind, Datadog, Sentry, and others * We process over one hundred million API calls every ...
Senior Software Engineer - NIM Platform SDK and Framework
Santa Clara, CA · On-site
$143K - $189K/yr
... HuggingFace, S3, GCS) - parallel transfers, integrity verification, and seamless multi-cloud operability. • Implement the model profile and manifest system that ensures NIMs are optimized for every ...
On-device ML Infrastructure Engineer, Compiler & Runtime, Graphics, Games & ML
Cupertino, CA · On-site
Experience with open source machine learning models (Mistral, Phi, Gemma, Huggingface, etc). Experience with any compiler stack (MLIR/LLVM/TVM/...). Experience with any ML authoring framework (PyTorch ...
Senior Software Engineer, Quantized Inference
Santa Clara, CA · On-site
$143K - $189K/yr
Own model export pipelines (ModelOpt, Megatron-LM HuggingFace), ensuring quantized checkpoints serialize correctly for downstream serving * Build prototypes and benchmarking harnesses to evaluate ...
Head of Enterprise
San Francisco, CA · On-site
$150/hr
... Huggingface) and many others. Your Role Own and scale our enterprise revenue engine. Build the GTM motion that brings compute, RL infrastructure, and post-training services to AI labs, research orgs ...
Sr. Applied Machine Learning Engineer - Search
$205K - $260K/yr
Huggingface, Spacy, Scikit-learn, Pytorch). * Strong communication skills and the ability to collaborate effectively with cross-functional teams * Comfortable supporting production systems and ...
Sr. Applied Machine Learning Engineer - Search
San Francisco, CA · On-site +1
$205K - $260K/yr
Huggingface, Spacy, Scikit-learn, Pytorch). * Strong communication skills and the ability to collaborate effectively with cross-functional teams * Comfortable supporting production systems and ...
Deep proficiency in Python and modern ML frameworks (PyTorch, HuggingFace, Tensorflow, OpenAI Gym/Gymnasium or similar) * Experience with LLMs in production: prompt engineering, structured outputs ...
Member of Technical Staff - Full Stack Software Engineer
San Francisco, CA · On-site +1
$150/hr
... Huggingface), Emad Mostaque (Stability AI), and many others. Role Impact This is a generalist software engineering role focused on building the product surface of Prime Intellect - the developer ...
Associate Manager Machine Learning
Irvine, CA · Hybrid
$134K - $158K/yr
Strong programming skills in Python and ML tooling (e.g., PyTorch, HuggingFace, ONNX, MLflow). * Experience optimizing model latency and integrating ML with backend infrastructure. Preferred ...
2026 PhD Applied Scientist Intern (Commerce, Trust, Safety and Support), United States
San Francisco, CA · On-site
... HuggingFace Transformers, LangChain) and Generative AI APIs (OpenAI, Google). Basic Qualifications * Current Ph.D. student majoring in Operations Research, Mathematics, Computer Science, Statistics ...
Member of Technical Staff - Sandbox Platform
San Francisco, CA · On-site
$150/hr
... Huggingface), Emad Mostaque (Stability AI) and many others. Role Impact This is a hybrid role spanning both our infrastructure layers and developer platform. You'll work on two key areas: * The ...
Principal AI Engineer - Frontier Data
San Francisco, CA · On-site
$159K - $213K/yr
PyTorch, LangChain, LlamaIndex, Pinecone/Weaviate, OpenAI/Anthropic APIs, HuggingFace. * Infra (expertise in all is not required): Kubernetes, Terraform, AWS/GCP. Compensation 232,000 - 260,000 base ...
Strong programming skills in Python and Scala; experience with ML libraries such as TensorFlow, PyTorch, HuggingFace, and Scikit-learn. * Hands-on experience with full ML model lifecycle: from ...
AI Architect for Software Development
$260K - $280K/yr
Expertise in Python, PyTorch, and modern AI frameworks (HuggingFace, vLLM, LangChain). * Experience building robust RAG pipelines (embedding optimization, retrieval metrics). * Experience designing ...
Member of Technical Staff - GPU Infrastructure
San Francisco, CA · On-site +1
$150/hr
... Huggingface), Emad Mostaque (Stability AI) and many others. Core Technical Responsibilities This customer-facing role combines deep technical expertise with hands-on implementation. You'll be ...
Research Engineer - Distributed Training
San Francisco, CA · On-site +1
$150/hr
... Huggingface), Emad Mostaque (Stability AI) and many others. If you're excited about the opportunity to build the foundation for the future of decentralized AI and create a platform that empowers ...
Huggingface information
$8.71 - $13.42: 16% of jobs
$13.42 - $18.13: 29% of jobs
$18.13 - $22.83: 19% of jobs
$22.83 - $27.54: 12% of jobs
$27.54 - $32.25: 8% of jobs
$32.25 - $36.96: 5% of jobs
$36.96 - $41.66: 4% of jobs
$41.66 - $46.37: 2% of jobs
$46.37 - $51.08: 2% of jobs
$51.08 - $55.79: 1% of jobs
$55.79 - $60.50: 1% of jobs
$14.86 is the 25th percentile. Wages below this are outliers.
The median wage is $19.30 / hr.
$27.01 is the 75th percentile. Wages above this are outliers.
How much do Hugging Face jobs pay per hour?
What are the key skills and qualifications needed to thrive in a Hugging Face position, and why are they important?
To thrive in a role at Hugging Face, you typically need strong skills in machine learning, natural language processing (NLP), and software development, supported by a relevant degree in computer science or a related field. Familiarity with frameworks like PyTorch or TensorFlow, plus experience using version control systems such as Git, are often required; open-source contributions and cloud platform knowledge are a plus. Excellent communication, collaborative teamwork, and problem-solving abilities help candidates stand out in this dynamic, innovation-driven environment. These strengths are crucial because they enable individuals to develop high-impact AI tools, work effectively in interdisciplinary teams, and contribute to open-source communities.
What does a typical day look like for an engineer working at Hugging Face?
As an engineer at Hugging Face, your day typically involves collaborating with team members to design, develop, and improve state-of-the-art machine learning models and tools, with a strong focus on open-source NLP projects. You’ll participate in code reviews, experiment with new technologies, engage with the community through forums or GitHub, and help support user questions or issues. Expect a fast-paced, collaborative environment where cross-functional teamwork with product managers, researchers, and other engineers is common. The work is project-driven, with plenty of opportunities to contribute ideas, learn from experts, and advance your technical skills.
What is a Hugging Face job?
A Hugging Face job typically refers to a role at Hugging Face, a company specializing in machine learning and natural language processing (NLP). Employees at Hugging Face work on developing and maintaining open-source AI tools, including the popular Transformers library. Roles range from research and engineering to product and community development, often focusing on advancing state-of-the-art AI models.
Other
Medical
Posted 24 days ago
Job description
We're seeking an Agent Engineer to design and build agentic features in our platform, including document understanding, advanced RAG, and customer support automation. In this role, you will develop not only the agent components themselves, but also the Friendli Agent API, which serves as the core developer interface for building and extending agent applications. You will also build agent applications as production-ready examples of how agents can solve real-world problems.
These applications will be primarily written in Python and will serve as reference implementations for our customers and community. We are looking for a hands-on engineer who is passionate about building agent systems and making AI easy for developers to adopt. The ideal candidate is comfortable creating agent applications that showcase what is possible, is curious about and experienced with open-source models, and enjoys turning them into reliable, high-impact features.
Key Responsibilities
- Design, build, and maintain agent APIs and applications that deliver document understanding and other high-value features
- Evaluate and integrate open-source models to power production-ready agent features where possible
- Develop reference agent applications to showcase workflows and accelerate customer adoption
- Collaborate with backend and infrastructure teams to integrate agents with deployment, orchestration, and monitoring systems
- Ensure APIs are robust, developer-friendly, and enterprise-ready through strong design principles and documentation
- Continuously improve the reliability, scalability, and performance of agent features in production
Qualifications
- 3+ years of experience in software engineering, preferably in backend, ML systems, or API development
- Bachelor's or Master's degree in Computer Science, Computer Engineering, or equivalent
- Strong programming skills in Python; experience with various Python frameworks
- Solid understanding of LLM workflows, agent patterns, or tool invocation systems
- Experience designing and delivering production APIs
- Familiarity with open-source LLMs and multimodal models (HuggingFace, LangChain, LlamaIndex, etc.)
- Strong foundations in cloud-native development
Preferred Experience
- Experience with document understanding pipelines (e.g., OCR, RAG, summarization, structured extraction)
- Familiarity with Kubernetes or container orchestration in production
- Experience building or contributing to agent frameworks, SDKs, or CLIs
- Experience working in a startup or fast-paced environment with ownership and ambiguity
- Passion for developer experience and enabling AI adoption
Benefits
- Flexible working hours
- Daily lunch and dinner provided; unlimited snacks and beverages
- Supportive and highly collaborative work environment
- Health check-up support and top-tier equipment/hardware support
- A front-row seat to the generative AI infrastructure revolution
- Competitive compensation, startup equity, health insurance, and other benefits
About FriendliAI
FriendliAI is building the world's best AI inference platform that makes large language and multi-modal models fast, efficient, and deployable at scale. We power high-throughput, low-latency AI workloads for organizations worldwide and integrate directly with Hugging Face, giving developers instant access to over 500,000 open-source models.
We are a small, fast-moving team doing work that matters at one of the most exciting moments in the history of technology. With our world-class inference engine, we are building a platform that the AI industry can actually rely on.