1

Flex Cuda Programmer Jobs (NOW HIRING)

next page

Showing results 1-20

Flex Cuda Programmer information

See salary details

$12

$39

$68

How much do flex cuda programmer jobs pay per hour?

As of May 31, 2026, the average hourly pay for flex cuda programmer in the United States is $39.54, according to ZipRecruiter salary data. Most workers in this role earn between $25.72 and $51.44 per hour, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Flex CUDA Programmer, and why are they important?

A Flex CUDA Programmer needs strong proficiency in C/C++ programming, deep knowledge of parallel computing concepts, and experience with NVIDIA CUDA architecture, usually supported by a degree in computer science or a related field. Familiarity with CUDA Toolkit, GPU profiling tools, and version control systems is essential, and certifications in GPU programming can be advantageous. Problem-solving ability, attention to detail, and effective teamwork are critical soft skills that distinguish top performers in this role. These skills and qualifications are vital to efficiently develop, optimize, and troubleshoot high-performance GPU-accelerated applications.

What are some common challenges Flex CUDA Programmers face when optimizing code for GPU architectures?

Flex CUDA Programmers often encounter challenges such as managing memory efficiently between the host and device, optimizing parallelization to avoid thread divergence, and ensuring that kernels are optimized for the specific GPU architecture. Debugging and profiling code can be more complex than traditional CPU programming due to the parallel nature of GPUs and the need to minimize data transfer overhead. Collaborating with data scientists, engineers, and other developers is also crucial, as integrating CUDA-accelerated code often requires close communication across multidisciplinary teams.

What is a Flex CUDA Programmer?

A Flex CUDA Programmer is a software developer who specializes in programming with NVIDIA's CUDA (Compute Unified Device Architecture) platform, specifically for applications that may require flexible or dynamic GPU computing solutions. These programmers write code that leverages the parallel processing power of GPUs to accelerate tasks such as scientific simulations, machine learning, or data processing. They often work in fields where performance and efficiency are critical, optimizing algorithms to run efficiently on GPU hardware. Their expertise includes C/C++ programming, CUDA libraries, and understanding of GPU architectures.

What is the difference between Flex Cuda Programmer vs CUDA Developer?

AspectFlex Cuda ProgrammerCUDA Developer
Required CredentialsKnowledge of CUDA, programming skills in C/C++, experience with GPU computingSimilar credentials, often including CUDA certification and C/C++ expertise
Work EnvironmentEmbedded systems, high-performance computing, hardware integrationSoftware development, simulation, and optimization in GPU environments
Industry UsageEmbedded systems, automotive, aerospace, scientific researchResearch labs, tech companies, gaming, AI development

Both roles focus on GPU programming with CUDA, requiring similar technical skills and certifications. Flex Cuda Programmers often work in embedded and specialized hardware environments, while CUDA Developers may focus more on software applications and algorithm optimization. The choice depends on the industry and project focus.

More about Flex Cuda Programmer jobs
What cities are hiring for Flex Cuda Programmer jobs? Cities with the most Flex Cuda Programmer job openings:
What are the most commonly searched types of Cuda Programmer jobs? The most popular types of Cuda Programmer jobs are:
What states have the most Flex Cuda Programmer jobs? States with the most job openings for Flex Cuda Programmer jobs include:
What job categories do people searching Flex Cuda Programmer jobs look for? The top searched job categories for Flex Cuda Programmer jobs are:
Infographic showing various Flex Cuda Programmer job openings in the United States as of May 2026, with employment types broken down into 80% Full Time, 3% Part Time, and 17% Contract. Highlights an 93% Physical, 3% Hybrid, and 4% Remote job distribution, with an average salary of $82,234 per year, or $39.5 per hour.

Principal AI/ML Engineer (Large Language Model) (TS/SCI) {S}

ARKA Group

King Of Prussia, PA โ€ข Remote

Full-time

Medical, Dental, Vision, Life, Retirement, PTO

Posted 15 hours ago


Job description

ARKA Group L.P. (โ€œARKAโ€) is an advanced technologies company serving the U.S. military, intelligence community, and commercial space industry delivering next-generation solutions to support the national security space enterprise. Built on more than six decades of excellence, ARKA brings modern approaches and a culture of innovation to the challenges of today.

Join the ARKA team to learn how Beyond Begins Here. Discover your next career opportunity now!

Position Overview:

The Principal AI/ML Engineer will support the development of AI/ML algorithms in a multitude of disciplines from object detection/classification, natural language processing, reinforcement learning, and large language models.

We offer generous relocation benefits for eligible candidates.

In support of work/life balance, many positions are available for a flexible schedule within the pay period.ย  Ask us about the opportunity for flex scheduling if thatโ€™s of interest to you.ย 

Responsibilities:

Lead and mentor a multidisciplined team consisting of developers and researchers to implement machine learning algorithms to solve a broad set of challenges for our various customers

  • Apply Large Language Models (LLMs) to a variety of applications within remote sensing such as tasking collections, identifying gaps in collection plans, analyzing patterns of life, and more.
  • Fine tune foundation models and building adaptors for new applications (llama factory, PEFT)
  • Apply retrieval augmented generation (RAG) techniques to data to populate and query vector databases (e.g. Weaviate)
  • Build custom applications with LLM frameworks such as LangChain, DSPy
  • Deploy LLM solutions across cloud-based and local resources using kubernetes (llama.ccp, vllm etc)
  • Analyze large multi-domain datasets such as images, text and/or graph data, to identify statistically relevant features to build models that provide analysts with actionable data
  • Review relevant publications to understand and apply cutting edge concepts to defense and commercial applications
  • Interface with both internal and external leadership to communicate technical status

Required Qualifications:

  • BS in machine learning, computer science, mathematics, or related fields.
  • 10+ years of experience, preferably in software development or as a data scientist with 2+ years of building LLM applications using some of the following:
    • Fine-tuning foundational models
      • Steering Techniques (e.g Sparse auto encoders, representation tuning)
      • Building adapters to use foundational models (e.g. PEFT, llama factory)
    • Prompt engineering techniques / Inference time techniques (e.g. chain of thought, tree of thoughts, etc.)
    • Using Retrieval Augmented Generation techniques to populate and query vector databases (e.g. Weaviate, pinecone)
    • Using LLM Frameworks (e.g. LangChain, DSPy)
    • Using AI APIs ( e.g AWS Bedrock, OpenAI)
    • Using LLM deployment frameworks (eg llama.cpp, vllm, tgi)
    • Developing UIs with ReAct
  • Experience leading an interdisciplinary team of researchers and software developers and working with a program manager to define project scope and schedule to ensure we meet project milestones as defined by our customers
  • Experience with Python and data science / machine learning libraries (e.g. PyTorch, TensorFlow, Keras, OpenCV, NumPy, Pandas, Polars, scikit-learn, etc.)
  • Active TS/SCI U.S. Government Security Clearance

Preferred Qualifications:

  • MS or PhD in machine learning, computer science, mathematics, or related fields.
  • Experience leading an interdisciplinary team of researchers and software developers
  • Experience with any of the following Computer Vision domains:
    • Large Language Models and experience identifying ways to incorporate them into new areas and applications
    • Applying Transformer-based architectures to domains in other areas outside of Natural Language Processing (NLP) such as computer vision
    • Object detection algorithms such as YOLO and Faster-RCNN
    • Natural Language Processing algorithms such as BERT
    • Generative Adversarial Networks and Variational Autoencoders
    • Reinforcement learning and familiarity with Gymnasium Gym, RLlib, and Stable Baselines
    • Applying clustering algorithms and/or deep neural networks to real life problems
    • Implementing tracking and pattern-of-life algorithms
  • Experience with Machine Learning libraries and frameworks such as HuggingFace and LangChain
  • Experience with Computer Vision libraries such as OpenCV, Nerfstudio, FiftyOne, etc.
  • Experience with Linux
  • Familiarity with using AWS cloud computing resources such as EC2, S3, Lambda, etc.
  • Experience with any of the following additional languages: Java, C++, Rust, Go, and/or C#
  • Experience implementing algorithms on the GPU in Python or C++ using CUDA and other CUDA libraries
  • Experience with implementing tracking and pattern-of-life algorithms
  • Experience in application deployment, virtualization, and containerization (e.g. Podman, Docker, Kubernetes, Rancher)
  • Experience working with various Remote Sensing datasets (e.g. EO/OPIR/SAR images, passive RF, etc.)
  • Experience shaping and writing proposals

Location: King of Prussia, PA

Situated less than an hour outside of Philadelphia and hosting the largest mall on the east coast, King of Prussia offers the urban feel sought in the city, while also giving opportunities to experience the beauty and history found only in Pennsylvania.

What We Offer:

  • Comprehensive medical/vision/dental insurance packages
  • Company contributions to qualified HSA accounts
  • 401k retirement plan with industry leading company contributions
  • 3 weeks of vacation accrual per year plus time off for sick leave and unscheduled life events
  • 13 paid holidays
  • Upfront tuition assistance for approved degree programs
  • Annual bonus program based on company and employee performance
  • Company paid life insurance, AD&D, Short-Term and Long-Term disability insurance
  • 4 weeks paid Parental Leave
  • Employee assistance program (EAP)

EHS/Environmental Requirements:

This job operates in a professional office environment. While performing the duties of this job, the employee routinely is required to use hands to keyboard, communicate, listen to, and interpret instructions and remain stationary for extended periods of the time. This would require the ability to move around the campus and occasionally move/lift items weighing up to 25 lbs.ย  Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions of the job.

Applicants are invited to apply for a reasonable accommodation to perform the essential duties of the job. To apply, send a request to staffing@arka.org or contact 203-797-5000 and press 2 for Human Resources.

ITC & Security Clearance Requirements:

This position requires an active TS/SCI U.S. Government Security Clearance.

Visa Restrictions:

No visa sponsorship is available for this position.

Pre-employment Screenings:

Employment with any ARKA companies in the U.S. is contingent upon satisfactory completion of several pre-employment requirements to include a credit check, background check, and drug screen.