RadixArk

47 RadixArk Jobs Hiring Near You

Job Summary: RadixArk is an infrastructure-first company focused on building world-class open systems for AI inference and training. They are seeking a Performance Engineer to enhance the ...

This is a rare opportunity to define what financial rigor looks like at RadixArk - and to build it yourself. We move fast, operate with a sense of urgency, and expect the same from everyone on the ...

AI Infra Resident (1-Year Program)

Palo Alto, CA · On-site

$116.40K - $148.70K/yr

About the Role: RadixArk is launching a full-time, paid, 1-year residency program for aspiring AI infrastructure engineers. You'll rotate across inference, training, kernels, compilers, and cluster ...

About the Role: We're looking for a Head of Business Development to build the BD function at RadixArk from the ground up. The BD team is the institutional memory of this company - maintaining active ...

About the Role: RadixArk is building the infrastructure layer for frontier AI systems - unified inference, training, and evaluation stacks powering next-generation LLM applications at scale. Our ...

About the Role: RadixArk is seeking experienced product-focused engineers to join our team in building the developer-facing surfaces of our inference and training infrastructure. As a Software ...

About RadixArk: RadixArk is an infrastructure-first company built by engineers who've shipped production AI systems, created SGLang (20K+ GitHub stars, the fastest open LLM serving engine), and ...

RadixArk is an infrastructure-first company focused on democratizing AI infrastructure. They are seeking a Product Manager to define and drive the product roadmap for inference and training ...

RadixArk Jobs Information

What are the most popular cities for RadixArk jobs?
What are the most popular states for RadixArk jobs?
What are the most popular job types at RadixArk?
Infographic showing job openings at RadixArk in the United States as of May 2026, with employment types broken down into 97% Full Time and 3% Contract. Highlights a 100% Physical job distribution.

Performance Engineer

RadixArk

Palo Alto, CA • On-site

Full-time

This job post expired today. Applications are no longer accepted.


Job description

Job Summary:
RadixArk is an infrastructure-first company focused on building world-class open systems for AI inference and training. They are seeking a Performance Engineer to enhance the performance of their AI systems across various environments, ensuring optimal usability, affordability, and reliability in production.
Responsibilities:
• Analyze and improve performance across SGLang, Miles, and RadixArk production deployments
• Benchmark LLM inference and training workloads across GPUs, TPUs, and cloud environments
• Optimize latency, throughput, memory usage, batching, scheduling, routing, and GPU utilization
• Investigate performance regressions in real customer environments
• Work closely with kernel, runtime, distributed systems, and product engineers
• Build internal tooling for profiling, tracing, benchmarking, and regression detection
• Translate customer workload characteristics into concrete performance tuning strategies
• Help define performance metrics that matter commercially, including cost-per-token and serving efficiency
• Partner with customers and cloud partners on deep technical evaluations
• Contribute performance insights back to open-source SGLang and Miles
Qualifications:
Required:
• Strong systems engineering background, especially in performance-critical software
• Experience with GPU systems, distributed systems, inference serving, ML runtimes, or high-performance computing
• Familiarity with profiling tools, performance debugging, tracing, and benchmark methodology
• Comfort working with Python and C++
• Understanding of LLM inference concepts such as batching, KV cache, prefill/decode, speculative decoding, MoE, long context, and P99 latency
• Ability to debug messy real-world performance issues across software, hardware, and infrastructure layers
• Strong communication skills: you should be able to explain performance tradeoffs to both engineers and customers
Preferred:
• Experience with CUDA, Triton, Pallas, ROCm, XLA, or kernel-level optimization is a strong plus
• Prior experience with production AI infrastructure, cloud GPU environments, or open-source ML systems is a plus
Company:
RadixArk focuses on developing infrastructure for AI inference and training systems. Founded in 2025, the company is headquartered in San Francisco, USA, with a team of 11-50 employees. The company is currently at an early stage.