2

Remote Data Infrastructure Jobs (NOW HIRING)

While this is a remote position, we are a global company and are looking for applicants located in ... Design, build, and operate critical data infrastructure platforms (lakehouse, replication ...

While this is a remote position, we are a global company and are looking for applicants located in ... Design, build, and operate critical data infrastructure platforms (lakehouse, replication ...

The Data Warehouse Infrastructure team is responsible for the foundational big data infrastructure ... This position is US - Remote Eligible. The role may include occasional work at an Airbnb office or ...

Data Platform Engineer

Union City, CA · Remote

$130.40K - $156.60K/yr

Remote, USA Job Type: Full-Time Experience Required: 5 to 8 years Eligibility: Male/Female both can ... Build and improve data platform infrastructure and services * Support orchestration, storage, and ...

$59 - $76/hr

Remote Fulltime Must Have Technical/Functional Skills • Experience in Data Engineering, Data ... ML Data Infrastructure Roles & Responsibilities • Candidate should have 15+ years of IT ...

next page

Showing results 1-20

Remote Data Infrastructure information

What are the key skills and qualifications needed to thrive as a Remote Data Infrastructure Engineer, and why are they important?

To excel as a Remote Data Infrastructure Engineer, you need a strong background in computer science, data architecture, and experience with cloud platforms such as AWS, Azure, or Google Cloud. Familiarity with tools like Terraform, Kubernetes, and data pipeline technologies, as well as relevant certifications (e.g., AWS Certified Solutions Architect), is typically required. Strong problem-solving abilities, clear communication, and self-motivation are essential soft skills for remote collaboration and troubleshooting. These competencies ensure reliable, scalable data systems and effective teamwork across distributed environments.

What are some common challenges faced by professionals working in remote data infrastructure roles?

Professionals in remote data infrastructure roles often encounter challenges such as ensuring seamless communication across distributed teams, maintaining high availability and performance of data systems, and managing security risks associated with remote access. Coordinating with colleagues across different time zones can require flexibility in scheduling and proactive communication. Additionally, remote data infrastructure engineers must stay up-to-date with evolving cloud technologies and best practices to effectively support scalable, reliable, and secure data architectures.

What is remote data infrastructure?

Remote data infrastructure refers to the systems, tools, and processes that enable organizations to collect, store, manage, and analyze data from remote locations, often via cloud-based platforms. This infrastructure allows teams to access and work with data securely from anywhere, supporting distributed work environments and scalable data solutions. It typically involves cloud storage, data pipelines, databases, and security protocols tailored for remote accessibility. Remote data infrastructure is essential for businesses that operate in multiple locations or have remote teams.
More about Remote Data Infrastructure jobs
What cities are hiring for Remote Data Infrastructure jobs? Cities with the most Remote Data Infrastructure job openings:
What are the most commonly searched types of Data Infrastructure jobs? The most popular types of Data Infrastructure jobs are:
What states have the most Remote Data Infrastructure jobs? States with the most job openings for Remote Data Infrastructure jobs include:
Infographic showing various Remote Data Infrastructure job openings in the United States as of May 2026, with employment types broken down into 1% Internship, 91% Full Time, 2% Part Time, 1% Temporary, and 5% Contract. Highlights an 83% Physical, 6% Hybrid, and 11% Remote job distribution.
Data/Infrastructure Advocate Engineer - US Remote

Data/Infrastructure Advocate Engineer - US Remote

Hugging Face

New York, NY • Remote

$117.20K - $140.70K/yr

Full-time

Medical, Dental, Vision, PTO

Posted 23 days ago


Job description

At Hugging Face, we’re on a journey to democratize good AI. We are building the fastest growing platform for AI builders with over 5 million users & 100k organizations who collectively shared over 1M models, 300k datasets & 300k apps. Our open-source libraries have more than 400k+ stars on Github.

About the Role

As our first Data/Infrastructure Advocate Engineer, you’ll bridge the gap between cutting-edge data infrastructure and the global community of data engineers, researchers, and developers. You’ll champion Xet storage on the Hugging Face Hub, empowering users to efficiently store, version, and collaborate on large-scale datasets. This role is for someone who thrives at the intersection of technical depth (storage, Parquet, deduplication) and community advocacy—helping define the future of open data workflows.

You’ll collaborate with teams like Datasets, Hub, and Infrastructure to shape how developers interact with data on our platform, and inspire a community to build better, faster, and more scalable data pipelines.

Your Main Missions:

    • Grow and nurture the open-source data/infra community—launch initiatives, collaborate with data-focused groups, and organize events or challenges. Engage with communities like Apache Parquet, Open Tables Formats, and data engineering forums to promote best practices and Hugging Face tools.
    • Promote the Hugging Face Hub as the go-to platform for data storage, versioning, and collaboration—curate and showcase datasets, benchmarks, and tools like Xet.
    • Highlight use cases like efficient large dataset updates, Parquet editing, and deduplication to demonstrate the Hub’s value for data workflows.
    • Create demos, benchmarks, and tools (e.g., Colab notebooks) to illustrate best practices for data storage and versioning.bExperiment with Xet, Parquet, and other data formats to showcase their potential for ML and data engineering.
    • Produce high-quality tutorials, blog posts, and videos that make complex topics accessible.
    • Share insights on storage optimization, dataset versioning, and deduplication to empower developers.
    • Actively participate in online communities (Discord, GitHub, forums) to highlight contributions, answer questions, and foster collaboration.
    • Ensure datasets and tools released on the Hub are well-documented, with clear examples, benchmarks, and use cases.
About you

You’re a great fit if you:

  • Have strong technical skills in Python, data libraries (e.g., pandas, pyarrow, huggingface/datasets), and storage systems (Parquet, Open Table Formats, S3).
  • Are a hands-on builder who loves experimenting with data tools, storage optimization, and dataset versioning.
  • Can clearly explain complex topics (e.g., deduplication, compression, Parquet editing) through writing, demos, or talks.
  • Are active in developer communities (GitHub, Discord, forums) and passionate about open source and knowledge sharing.
  • Thrive in fast-moving environments and enjoy building in public to inspire others.

If you're interested in joining us but don't tick every box above, we still encourage you to apply! We're building a diverse team whose skills, experiences, and backgrounds complement one another. We're happy to consider where you might be able to make the biggest impact.

More about Hugging Face

We are actively working to build a culture that values diversity, equity, and inclusivity. We are intentionally building a workplace where you feel respected and supported—regardless of who you are or where you come from. We believe this is foundational to building a great company and community, as well as the future of machine learning more broadly. Hugging Face is an equal opportunity employer, and we do not discriminate based on race, ethnicity, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or ability status.

We value development. You will work with some of the smartest people in our industry. We are an organization that has a bias for impact and is always challenging ourselves to grow continuously. We provide all employees with reimbursement for relevant conferences, training, and education.

We care about your well-being. We offer flexible working hours and remote options. We offer health, dental, and vision benefits for employees and their dependents. We also offer parental leave and flexible paid time off.

We support our employees wherever they are. While we have office spaces in NYC and Paris, we're very distributed, and all remote employees have the opportunity to visit our offices. If needed, we'll also outfit your workstation to ensure you succeed.

We want our teammates to be shareholders. All employees have company equity as part of their compensation package. If we succeed in becoming a category-defining platform in machine learning and artificial intelligence, everyone enjoys the upside.