Deep Learning Quantization Jobs in Texas (NOW HIRING)

RE/SPEC Inc.

Senior Software Developer (Contractor) RITM1788244

Austin, TX · On-site

$54 - $71.25/hr

We bring deep technical knowledge, real-world experience, and a commitment to work that matters. If ... Build and maintain production-grade machine learning pipelines and model deployment frameworks.

RE/SPEC Inc.

Senior Software Developer (Contractor) RITM1788244

Austin, TX · Remote

$54 - $71.25/hr

CCS INC

Lead Gen AI Engineer

Plano, TX

$85 - $110/hr

Deep understanding of LLMs, embeddings, vector databases (e.g., FAISS, Pinecone, Weaviate ... Use techniques like quantization, distillation, and caching to improve efficiency.

Ambiq Micro, Inc.

Sr. Staff Edge AI Applied Machine Learning Engineer

Austin, TX · On-site

$121K - $160K/yr

Scope Ambiq is seeking an experienced Edge AI Applied ML Engineer with deep experience in audio and ... Apply model efficiency techniques: quantization, compression, pruning, and structured ...

Invoca

Senior ML Engineer

Austin, TX · On-site +1

$103K - $142K/yr

Advanced Python and deep learning proficiency (PyTorch, HuggingFace Transformers, spaCy ... models via quantization, batching, and throughput tuning * Proficiency with inference ...

Photon

Sr Data Scientist - Gen AI ML - New York / Jersey City

Irving, TX

The ideal candidate blends deep machine learning expertise with modern software engineering ... Knowledge of model fine-tuning techniques and local LLM quantization/hosting. Familiarity with ...

Photon

Expert Data Scientist - Gen AI ML - Tampa/Irving/ Mississauga

Irving, TX

eStaffLLC

Senior Software Developer Specialist

Austin, TX

$54 - $71.25/hr

... deep expertise in AI/ML development to join our Austin client's team ... In this role, you will design, build, and deploy production-grade machine learning systems that ...

Photon

Sr Data Scientist - Gen AI ML - Irving

Irving, TX · On-site

Photon

Data Scientist - Gen AI ML - Tampa/Irving/ Mississauga

Irving, TX

Photon

Data Scientist - Gen AI ML - Tampa/Irving/ Mississauga

Irving, TX · On-site

Photon

Sr Data Scientist - Gen AI ML - Irving

Irving, TX

Photon

Sr Data Scientist - Gen AI ML - New York / Jersey City

Irving, TX

Photon

Sr Data Scientist - Gen AI ML - Irving

Irving, TX · On-site

Photon

Expert Data Scientist - Gen AI ML - Tampa/Irving/ Mississauga

Irving, TX · On-site

Photon

Data Scientist - Gen AI ML - Tampa/Irving/ Mississauga

Irving, TX

Photon

Architect - Gen AI ML - New York / Jersey City

Irving, TX · On-site

Photon

Sr Data Scientist - Gen AI ML - Tampa/Irving/ Mississauga

Irving, TX · On-site

Photon

Sr Data Scientist - Gen AI ML - Tampa/Irving/ Mississauga

Irving, TX · On-site

Photon

Architect - Gen AI ML - New York / Jersey City

Irving, TX · On-site

Showing results 1-20

Deep Learning Quantization Jobs in Texas

Deep Learning Quantization information

What are the key skills and qualifications needed to thrive as a Deep Learning Quantization Engineer, and why are they important?

To excel as a Deep Learning Quantization Engineer, you need a strong background in machine learning, applied mathematics, and computer science, usually supported by an advanced degree in a related field. Familiarity with deep learning frameworks (such as TensorFlow or PyTorch), quantization toolkits, and hardware acceleration platforms is crucial. Analytical thinking, problem-solving, and clear technical communication are standout soft skills in this role. These abilities are essential for efficiently optimizing models for deployment on resource-constrained hardware while maintaining accuracy and performance.

What is the difference between a Deep Learning Quantization Engineer and a Machine Learning Engineer?

Aspect	Deep Learning Quantization	Machine Learning Engineer
Required Credentials	Advanced degrees in AI, Computer Science, or related fields; knowledge of neural networks	Bachelor's or Master's in CS, Data Science, or related fields; programming skills
Work Environment	Research labs, AI development teams, hardware optimization settings	Software development teams, data-driven projects, product-focused environments
Industry Usage	AI hardware optimization, model deployment, edge computing	Model development, data analysis, software solutions across industries

Deep learning quantization roles focus on reducing model size and improving inference speed through techniques like weight and activation quantization, often for hardware or embedded systems. Machine Learning Engineers develop, implement, and optimize machine learning models for various applications. While both roles require knowledge of AI and programming, deep learning quantization roles are more specialized in model optimization techniques, whereas Machine Learning Engineers work broadly on model development and deployment.

What is deep learning quantization?

Deep learning quantization is the process of reducing the precision of the numbers used to represent a neural network's parameters, activations, or both. By converting the typical 32-bit floating-point values to lower bit-width formats such as 16-bit floating point or 8-bit integers, quantization reduces a model's memory footprint (an 8-bit value takes a quarter of the space of a 32-bit one) and its computational requirements. This technique helps deploy models efficiently on edge devices and mobile hardware while maintaining acceptable accuracy levels. Quantization is widely used in model optimization for faster inference and lower power consumption.
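As a rough illustration of the idea, the sketch below quantizes a small list of floating-point weights to 8-bit integers and converts them back. It assumes symmetric, per-tensor quantization, which is only one of several schemes in use; the helper names `quantize_int8` and `dequantize` and the sample weights are hypothetical and are not taken from any particular framework.

```python
def quantize_int8(values):
    """Symmetric per-tensor quantization of floats to the int8 range (illustrative helper)."""
    max_abs = max(abs(v) for v in values)
    # The scale maps the largest magnitude onto 127; use 1.0 for an all-zero tensor.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest integer step and clamp to [-127, 127].
    q = [max(-127, min(127, round(v / scale))) for v in values]
    return q, scale


def dequantize(q, scale):
    """Map int8 codes back to approximate float values."""
    return [x * scale for x in q]


# Hypothetical weights, chosen only for illustration.
weights = [0.5, -1.27, 0.003, 1.0]
q, scale = quantize_int8(weights)   # q is [50, -127, 0, 100], scale is about 0.01
restored = dequantize(q, scale)     # close to the originals, but 0.003 becomes 0.0
```

Each stored value now needs one byte instead of four, at the cost of a round-trip error of at most half a quantization step (`scale / 2`) per weight. Small values such as 0.003 are the ones most visibly affected.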

What are some common challenges faced when implementing deep learning quantization in production environments?

One of the main challenges in implementing deep learning quantization is balancing model accuracy with computational efficiency, as quantization can sometimes lead to a drop in model performance. Additionally, ensuring hardware compatibility and optimizing for different devices (such as CPUs, GPUs, or edge devices) can require extensive testing and tuning. Collaboration with data scientists, software engineers, and hardware specialists is often essential to successfully deploy quantized models at scale. Staying updated with the latest quantization techniques and frameworks is also important for overcoming these challenges.
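The accuracy-versus-efficiency trade-off can be made concrete with a small experiment. The sketch below measures the mean round-trip error of symmetric quantization at several bit widths on synthetic, normally distributed values. The function name `quantization_error`, the random data, and the chosen bit widths are assumptions for illustration only; error on real model weights, and its effect on task accuracy, will differ.

```python
import random


def quantization_error(values, bits):
    """Mean absolute round-trip error of symmetric quantization at a given bit width."""
    qmax = 2 ** (bits - 1) - 1          # e.g. 127 for 8 bits, 7 for 4 bits
    max_abs = max(abs(v) for v in values)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Quantize, clamp, and immediately dequantize each value.
    restored = [max(-qmax, min(qmax, round(v / scale))) * scale for v in values]
    return sum(abs(a - b) for a, b in zip(values, restored)) / len(values)


# Synthetic stand-in for a weight tensor (seeded for repeatability).
rng = random.Random(0)
weights = [rng.gauss(0.0, 1.0) for _ in range(1000)]

errors = {bits: quantization_error(weights, bits) for bits in (8, 4, 2)}
for bits, err in errors.items():
    print(f"{bits}-bit: mean absolute error {err:.4f}")
```

Fewer bits mean a smaller model but a coarser grid, so the error grows as the bit width shrinks. In practice this kind of measurement would be followed by evaluating the quantized model on held-out data for each target device.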

Senior Software Developer (Contractor) RITM1788244

RE/SPEC Inc.

Austin, TX • On-site

$54 - $71.25/hr

Full-time

This job post has expired today. Applications are no longer accepted.

Job description

Company Description
Big challenges need bold thinkers.
If you're someone who sees problems as opportunities, you'll thrive here. RESPEC is 100% employee-owned, which means we take ownership of every challenge. Here, your ideas drive real solutions. Since 1969, we've tackled complex challenges in energy transition, infrastructure resilience, digital transformation, and sustainability.
At RESPEC, you'll work alongside clients to take on critical problems. Depending on your expertise, you might design infrastructure in remote locations, develop renewable energy solutions for global projects, or apply data-driven technology to improve mining and water systems.
We bring deep technical knowledge, real-world experience, and a commitment to work that matters. If you're looking for a place where your contributions have real impact, you'll fit right in.
We do not accept unsolicited resumes from third-party recruiters.
Job Description
RESPEC is seeking an experienced Software Developer Specialist to support a major transportation technology initiative for our government client in Austin, Texas. This role focuses on designing, developing, deploying, and optimizing advanced Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision, and cloud-based solutions that support large-scale operational and business objectives.
The ideal candidate will bring deep expertise across AI/ML engineering, cloud platforms, MLOps, DevOps, and production-grade software development while collaborating with technical and business stakeholders in a highly visible public-sector environment.
Responsibilities:

Design, develop, test, and deploy scalable AI/ML solutions in cloud environments.
Build and maintain production-grade machine learning pipelines and model deployment frameworks.
Develop applications leveraging Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and transformer-based architectures.
Create and optimize NLP solutions, recommendation systems, forecasting models, and anomaly detection systems.
Design and implement computer vision solutions for real-time and large-scale data processing.
Develop and maintain MLOps workflows, model monitoring, and automated retraining processes.
Build and manage CI/CD pipelines supporting AI and software delivery.
Containerize and deploy applications using Docker and Kubernetes.
Collaborate with cross-functional teams to gather requirements and translate business needs into technical solutions.
Optimize model performance through quantization, pruning, distillation, and distributed training techniques.
Work with structured, unstructured, vector, and spatial datasets to support analytics and predictive modeling initiatives.
Document solutions, architectures, and deployment processes according to client standards.
Participate in technical reviews, troubleshooting, and ongoing operational support.

Qualifications
8+ Years of Experience Required:
Cloud Platforms & AI Infrastructure

AWS, Microsoft Azure, Google Cloud Platform (GCP), or Oracle Cloud Infrastructure (OCI)
Deploying and managing machine learning workloads in cloud environments
Utilizing AI/ML services across major cloud providers

DevOps & Platform Engineering

Ansible
Docker
Kubernetes
CI/CD implementation and automation

Database Technologies

SQL databases including PostgreSQL and MySQL
NoSQL databases
Vector databases

Automation & Scripting

Bash scripting
PowerShell scripting

CI/CD Tools

Azure DevOps
GitHub Actions
Jenkins
Comparable enterprise CI/CD platforms

3+ Years of Experience Required:
Python Development

Production-level Python application development
Building scalable backend and AI-driven solutions

Natural Language Processing & Large Language Models

Transformer architectures
Retrieval-Augmented Generation (RAG)
Fine-tuning models
Prompt engineering
LLM application development

Time Series Analytics

Forecasting
Sequential modeling
Anomaly detection
Real-time monitoring systems

Recommendation Systems

Collaborative filtering
Ranking algorithms
Personalization engines
Content recommendation platforms

MLOps

MLflow
Weights & Biases
Kubeflow
Apache Airflow
Similar MLOps platforms

Distributed AI Training

Multi-GPU environments
Multi-node training
Data parallelism
Large-scale model training

Computer Vision

PyTorch
TensorFlow
OpenCV
YOLO
Object detection
Image segmentation
Real-time inference systems

Feature Engineering

Feature stores such as Feast or Tecton
Advanced feature engineering methodologies

Model Optimization

Quantization
Pruning
Knowledge distillation

Alternative/Open-Source LLM Platforms

Ollama
Hugging Face
Other non-frontier/open-source model ecosystems

2+ Years of Experience Required:
Production AI/ML Delivery

Demonstrated experience building and deploying at least 2-3 machine learning models used by real-world users in production environments

Preferred Qualifications:
Candidates with one or more of the following qualifications will receive additional consideration:

GIS and spatial data analysis experience
Transportation, logistics, or smart-city technology experience
Computer vision applications involving infrastructure, roadway, or vehicle-related data
Public-sector data governance, compliance, and security experience
Unreal Engine experience
Digital twin implementation experience
Google Maps Cesium API experience
Polygonflow Dash experience

Additional Information
Schedule:
• Monday through Friday
• 8:00 AM to 5:00 PM Central Time
• State holidays observed per client schedule
On-Site Requirement:
• Minimum of 4 days per week on-site in Austin, Texas
• Remote work flexibility is limited and subject to client approval
Important:
Candidates must be able to reliably commute to the Austin office throughout the engagement.
Work Authorization
Applicants must be legally authorized to work in the United States throughout the duration of the engagement.
Background Screening
Selected candidates must successfully complete required background investigations before beginning work, including:
• Criminal history review
• State and county-level checks
• Sex offender registry review
• Additional client-required screenings if applicable
Employment Conditions
• Overtime may occasionally be required and must receive prior client approval.
• Candidates may be asked to support occasional evening, weekend, or holiday activities based on project demands.
• Time reporting must comply with client-established procedures and systems.
Candidate Considerations Before Applying
Please review the following carefully before applying:
• Relocation assistance is not specified.
• Candidates must be available to start near the anticipated project start date.
• Extended absences may impact project eligibility.
• Background screening is mandatory.
• Only candidates authorized to work in the United States will be considered.
• Compensation is subject to client-established limits and final approval.
If you are passionate about applying advanced AI, machine learning, computer vision, and cloud technologies to impactful public-sector initiatives, we encourage you to apply and join RESPEC's growing government technology practice.
All your information will be kept confidential according to EEO guidelines.

About RESPEC

Sourced by ZipRecruiter

Industry

Business management consulting

Company size

201 - 500 Employees

Headquarters location

Rapid City, SD, US

Year founded

1969

Website

respec.com
