1

Software Engineer Data Infrastructure Opensearch Jobs

$88K - $106K/yr

Our partner is looking for a Software Engineer, Data Infrastructure & Acquisition based in Netherlands. This role sits at the intersection of software engineering, data infrastructure, and applied AI ...

Software Engineer, Data Infrastructure

New York, NY · On-site

$125K - $150K/yr

As a Software Engineer, Data Infrastructure, you will: * Work directly on petabyte-scale storage infrastructure, and the networking and performance challenges that come with it. * Collaborate daily ...

Software Engineer, Data Infrastructure

San Francisco, CA · On-site

$134K - $162K/yr

Identify and drive cost optimization opportunities across data processing, compute infrastructure, and storage. * Collaborate with AI researchers, data scientists, product engineers, and business ...

Software Engineer, Data Infrastructure

San Francisco, CA · On-site +1

$134K - $162K/yr

Identify and drive cost optimization opportunities across data processing, compute infrastructure, and storage. * Collaborate with AI researchers, data scientists, product engineers, and business ...

next page

Showing results 1-20

Software Engineer Data Infrastructure Opensearch information

See salary details

$44.5K

$129.7K

$177.5K

How much do software engineer data infrastructure opensearch jobs pay per year?

As of Jun 29, 2026, the average yearly pay for software engineer data infrastructure opensearch in the United States is $129,716.00, according to ZipRecruiter salary data. Most workers in this role earn between $114,500.00 and $137,500.00 per year, depending on experience, location, and employer.

What are the typical challenges faced by a Software Engineer working on Data Infrastructure with OpenSearch?

Software Engineers in Data Infrastructure roles focusing on OpenSearch often encounter challenges such as optimizing search performance at scale, ensuring high availability and fault tolerance, and managing data ingestion pipelines. They must also address issues related to data consistency, security, and fine-tuning cluster configurations for specific workloads. Collaboration with DevOps, data engineers, and application developers is key to maintaining seamless integration and reliable operation of search capabilities across the organization.

What does a Software Engineer Data Infrastructure Opensearch do?

A Software Engineer Data Infrastructure specializing in Opensearch is responsible for designing, building, and maintaining scalable data systems that leverage Opensearch for search, analytics, and data indexing. They work on optimizing data pipelines, ensuring high availability, and improving the performance of large-scale distributed systems. Their role typically includes collaborating with data engineers, DevOps teams, and software developers to integrate Opensearch into company infrastructure, manage cluster operations, and implement best practices for data security and reliability.

What are the key skills and qualifications needed to thrive as a Software Engineer Data Infrastructure Opensearch, and why are they important?

To thrive as a Software Engineer Data Infrastructure Opensearch, you need a deep understanding of distributed systems, data indexing, and search algorithms, typically supported by a degree in computer science or a related field. Experience with Opensearch or Elasticsearch, as well as proficiency in languages like Java or Python and familiarity with cloud platforms, are commonly required, along with knowledge of CI/CD tools. Strong problem-solving abilities, effective teamwork, and clear communication set outstanding candidates apart. These skills ensure reliable, scalable data infrastructure and enable efficient handling of large-scale search and analytics workloads.
What job categories do people searching Software Engineer Data Infrastructure Opensearch jobs look for? The top searched job categories for Software Engineer Data Infrastructure Opensearch jobs are:

Software Engineer, Data Infrastructure & Acquisition

Jobgether

Remote

$88K - $106K/yr

Full-time

Posted 7 days ago


Key responsibilities

  • Build, maintain, and scale data ingestion and acquisition systems that support AI model training and product development.

  • Design and extend cloud-based infrastructure and optimize data pipelines for efficient processing of high-volume datasets.

  • Identify and integrate new external data sources, including audio and web-based datasets, into production pipelines.


Job description

This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a Software Engineer, Data Infrastructure & Acquisition based in Netherlands.

This role sits at the intersection of software engineering, data infrastructure, and applied AI, focusing on building and scaling the systems that power large-scale dataset acquisition for next-generation machine learning models. You will work in a fully distributed environment alongside engineers, researchers, and product leaders to design robust ingestion pipelines capable of handling massive, high-quality audio and text datasets. The work directly impacts how data is collected, processed, and transformed into training-ready assets that fuel AI innovation. You'll contribute to improving the cost, scale, and efficiency of data systems while helping define the roadmap for dataset development. The environment is fast-moving, highly collaborative, and deeply technical, with strong ownership and autonomy. This is a chance to shape foundational infrastructure used by millions of users globally.

Accountabilities

You will be responsible for building, maintaining, and scaling large-scale data ingestion and acquisition systems that support AI model training and product development. You will design and extend cloud-based infrastructure, optimize data pipelines, and ensure efficient processing of high-volume datasets across distributed systems. You will collaborate closely with AI scientists and engineering teams to improve data quality, reduce cost, and increase throughput for training workflows. You will also identify and integrate new external data sources, including audio and web-based datasets, into production pipelines. Additionally, you will help define dataset strategy and contribute to architectural decisions that support long-term scalability and reliability of infrastructure systems.

  • Build and maintain scalable data ingestion and processing pipelines
  • Extend cloud infrastructure (GCP) using Infrastructure-as-Code tools
  • Identify and integrate new data sources into acquisition systems
  • Collaborate with research and AI teams to improve dataset quality and efficiency
  • Optimize systems for cost, throughput, and reliability at scale
  • Contribute to architecture and roadmap decisions for data infrastructure
Requirements

The ideal candidate brings strong software engineering experience with a focus on distributed systems, data infrastructure, or backend engineering in production environments. You should have hands-on experience with Python and Linux-based development workflows, along with strong familiarity with cloud platforms such as GCP and infrastructure-as-code tools like Terraform. Experience with Docker, large-scale data pipelines, or web crawling systems is highly valuable. You are comfortable working in fast-paced, ambiguous environments and can manage multiple priorities effectively. Strong communication skills and the ability to collaborate across technical and research-driven teams are essential. A background in Computer Science or a related technical field is expected, along with a proven ability to build reliable and scalable systems.

  • 5+ years of software engineering experience
  • Strong proficiency in Python and Linux environments (bash scripting)
  • Experience with GCP and Infrastructure-as-Code (Terraform preferred)
  • Hands-on experience with Docker and cloud-native development
  • Exposure to large-scale data pipelines or web crawling systems (preferred)
  • Strong problem-solving and system design skills
  • Excellent communication and cross-functional collaboration abilities
  • Degree in Computer Science or related technical field (BS/MS/PhD)
Benefits
  • Competitive base salary with bonus and equity opportunities
  • Fully remote, distributed-first work environment
  • High-impact role working on AI systems used at global scale
  • Opportunity to shape foundational data infrastructure for ML models
  • Collaborative, engineering-driven culture with strong autonomy
  • Access to cutting-edge AI and data engineering technologies
  • Fast-paced environment with ownership over meaningful technical problems
  • Work on a product that improves accessibility and learning experiences worldwide
How Jobgether works:
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!
 Why Apply Through Jobgether? 
 
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.
 
 
#LI-CL1
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses and identifying potential inconsistencies or verification signals in application materials based on available information. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
apply for this job