Prefix Jobs (NOW HIRING)

Configure and validate BGP peering, route filtering (prefix-lists/route-maps), and community-based policy control * Support integration and troubleshooting with Type I encryption devices (e.g ...

Care Giver

Sacramento, CA · On-site

$16 - $18/hr

Benefits: * Very Available Manager * Small and Supportive Team * Flexible Time Off Request *In Advance* * Meals Included * 401(k) matching * Free food & snacks * Training & development Seeking ...

CNC Machinist III

Morgan Hill, CA · On-site

$32 - $39/hr

Our company name, Minimatics, is derived from the Greek suffix -matos, meaning "willing to perform" and the Greek prefix -mini, which means to make smaller. Thus, we have always distinguished ...

Caregiver

Sacramento, CA · On-site

$16.50 - $18/hr

Benefits/Perks * 10 hour shift (four days a week to reach 40 hours) Job Summary Seeking someone Local to the Arden Arcade Area of Sacramento. Looking for someone who lives locally. The Job: To work ...

Caregiver

Sacramento, CA · On-site

$16.50 - $18/hr

Benefits: * 401(k) matching * Free food & snacks * Opportunity for advancement * Training & development * Vision insurance Benefits/Perks * 10 hour shift (four days a week to reach 40 hours) Job ...

Prefix information

How much do prefix jobs pay per hour?

As of Jun 9, 2026, the average hourly pay for prefix jobs in the United States is $26.34, according to ZipRecruiter salary data. Most workers in these roles earn between $15.14 and $30.77 per hour, depending on experience, location, and employer.

What are Prefix jobs?

Prefix jobs typically refer to roles that involve the use, management, or analysis of prefixes in various fields such as linguistics, computer science, or telecommunications. In linguistics, a prefix job could involve studying how prefixes alter word meanings. In computer science and IT, it might involve managing network prefixes or working with prefix-based algorithms. The specific duties depend on the industry, but generally, these jobs require attention to detail and a solid understanding of how prefixes function in the relevant context.

What are some common challenges a Prefix operator may face when working in a fast-paced manufacturing environment?

Prefix operators in manufacturing often manage multiple machines or processes simultaneously, which can lead to challenges such as maintaining quality control under time pressure and quickly troubleshooting equipment issues. Adapting to frequent changes in production schedules or product specifications also requires strong attention to detail and flexibility. Effective communication with team members and supervisors is essential to ensure smooth workflow and minimize downtime.
Principal GenAI Inference Optimization Engineer

AMD

San Jose, CA • On-site

Full-time

Posted 15 days ago


Advanced Micro Devices rating: 8.4 out of 10, based on 7 frontline employees who took The Breakroom Quiz (24th of 139 rated electronics manufacturers)


Job description

Job Summary:
AMD builds products that enhance computing experiences across domains including AI and data centers. The company is seeking a Principal GenAI Inference Optimization Engineer to improve the performance and efficiency of generative AI inference workloads on AMD GPU platforms, optimizing latency and throughput for large-scale models.
Responsibilities:
• Optimize performance of GenAI inference workloads on AMD GPU platforms across single-node and distributed environments.
• Improve latency, throughput, and cost efficiency for LLM and multimodal model serving in production.
• Analyze and resolve bottlenecks across compute, memory, and communication (e.g., kernel efficiency, KV-cache usage, memory bandwidth, scheduling).
• Contribute to cross-stack optimizations spanning kernels, runtimes, communication libraries, and inference/serving frameworks (e.g., vLLM, SGLang, Triton, or similar systems).
• Implement and evaluate inference optimization techniques such as batching strategies, quantization, prefix caching, and speculative decoding.
• Support development and optimization of scalable serving systems, including request scheduling and resource utilization.
• Develop and use profiling, benchmarking, and performance analysis tools for inference workloads.
• Collaborate with hardware, compiler, and framework teams to improve overall system performance.
• Contribute to internal tools and, where applicable, open-source projects for inference optimization on AMD platforms.
• Document best practices and contribute to performance guidelines for GenAI deployment.
Qualifications:
Required:
• Strong technical contributor with expertise in GenAI inference optimization, GPU performance, and large-scale serving systems.
• Solid understanding of GPU architecture, memory systems, and communication patterns.
• Ability to improve inference efficiency.
• Comfortable working across multiple layers—from kernels and runtimes to frameworks and serving systems.
• Ability to independently drive optimization efforts while collaborating with cross-functional teams.
• B.S., M.S. or Ph.D. in Computer Science, Computer Engineering, or a related field preferred, or equivalent industry experience.
Preferred:
• Strong understanding of GPU architecture and performance fundamentals (compute, memory hierarchy, interconnects such as PCIe/Infinity Fabric/RDMA).
• Experience with GenAI inference optimization techniques (e.g., quantization, KV-cache optimization, batching).
• Hands-on experience with inference/serving frameworks such as vLLM, SGLang, Triton, TensorRT-LLM, or similar.
• Experience working on LLM or multimodal inference workloads.
• Familiarity with distributed systems and serving architectures.
• Experience with ML frameworks (PyTorch, JAX, or TensorFlow), especially for inference.
• Proficiency in Python and at least one systems language (C++/CUDA/HIP).
• Experience with profiling, debugging, and performance tuning tools.
• Ability to work collaboratively across teams and deliver impactful optimizations.
Company:
Advanced Micro Devices (AMD) is a semiconductor company that designs and develops graphics units, processors, and media solutions. Founded in 1969, the company is headquartered in Santa Clara, California, USA, and has more than 10,000 employees. It is classified as a late-stage company.