QA Lead - AI Systems & Models Testing Quality Assurance Artificial Intelligence Contract Position ... Proficiency applying generative AI evaluation metrics and establishing quality thresholds ...
QA Lead - AI Systems & Models Testing Quality Assurance Artificial Intelligence Contract Position ... Proficiency applying generative AI evaluation metrics and establishing quality thresholds ...
QA Lead AI Systems & Models Testing Quality Assurance Artificial Intelligence Contract Position ... Proficiency applying generative AI evaluation metrics and establishing quality thresholds ...
Quick apply
QA Lead AI Systems & Models Testing Quality Assurance Artificial Intelligence Contract Position ... Proficiency applying generative AI evaluation metrics and establishing quality thresholds ...
Generative AI (GenAI) | Prompt Engineer
Montreal, QC ยท On-site +1
... MEDFAR's generative AI features -- most notably CoeurWay, our AI-powered clinical scribe, and ... testing for prompt changes, and documentation of known limitations. * Design and implement new ...
Quick apply
Generative AI (GenAI) | Prompt Engineer
Montreal, QC ยท On-site +1
... MEDFAR's generative AI features -- most notably CoeurWay, our AI-powered clinical scribe, and ... testing for prompt changes, and documentation of known limitations. * Design and implement new ...
AI Developer
Montreal, QC ยท On-site
Lead the architecture, design, and delivery of Generative AI solutions-including multi-step ... testing, reliability, reusability, and secure-by-design delivery * Define evaluation frameworks ...
AI Developer
Montreal, QC ยท On-site
Lead the architecture, design, and delivery of Generative AI solutions-including multi-step ... testing, reliability, reusability, and secure-by-design delivery * Define evaluation frameworks ...
Senior Applied AI Engineer
Montreal, QC ยท On-site
The ideal candidate will combine deep expertise in Generative AI systems engineering with strong ... Hands-on experience with Python, FastAPI / Flask, async workflows, APIs, testing frameworks, and CI ...
Senior Applied AI Engineer
Montreal, QC ยท On-site
The ideal candidate will combine deep expertise in Generative AI systems engineering with strong ... Hands-on experience with Python, FastAPI / Flask, async workflows, APIs, testing frameworks, and CI ...
Applied AI Engineer
Montreal, QC ยท On-site
This role is focused on Generative AI engineering and agentic systems, including single-agent and ... Hands-on experience with Python, FastAPI / Flask, async workflows, APIs, testing frameworks, and CI ...
Applied AI Engineer
Montreal, QC ยท On-site
This role is focused on Generative AI engineering and agentic systems, including single-agent and ... Hands-on experience with Python, FastAPI / Flask, async workflows, APIs, testing frameworks, and CI ...
AI & Generative AI Enablement * Develop and integrate AI and Generative AI capabilities into ... Apply software engineering best practices, including clean code, automated testing, secure coding ...
AI & Generative AI Enablement * Develop and integrate AI and Generative AI capabilities into ... Apply software engineering best practices, including clean code, automated testing, secure coding ...
AI & Generative AI Enablement * Develop and integrate AI and Generative AI capabilities into ... Apply software engineering best practices, including clean code, automated testing, secure coding ...
AI & Generative AI Enablement * Develop and integrate AI and Generative AI capabilities into ... Apply software engineering best practices, including clean code, automated testing, secure coding ...
AI & Generative AI Enablement * Develop and integrate AI and Generative AI capabilities into ... Apply software engineering best practices, including clean code, automated testing, secure coding ...
AI & Generative AI Enablement * Develop and integrate AI and Generative AI capabilities into ... Apply software engineering best practices, including clean code, automated testing, secure coding ...
AI & Generative AI Enablement * Develop and integrate AI and Generative AI capabilities into ... Apply software engineering best practices, including clean code, automated testing, secure coding ...
AI & Generative AI Enablement * Develop and integrate AI and Generative AI capabilities into ... Apply software engineering best practices, including clean code, automated testing, secure coding ...
Data Platform Lead
Montreal, QC ยท On-site
Experimentation & Continuous Improvement o Design and analyze experiments (e.g., A/B testing) to ... Experience across traditional ML and generative AI use cases. Strong understanding of feature ...
Data Platform Lead
Montreal, QC ยท On-site
Experimentation & Continuous Improvement o Design and analyze experiments (e.g., A/B testing) to ... Experience across traditional ML and generative AI use cases. Strong understanding of feature ...
We are seeking apassionate, collaborative senior GenAIprofessional to help shape how Generative AI ... Apply strong engineering discipline, including testing, version control, monitoring, and iteration ...
We are seeking apassionate, collaborative senior GenAIprofessional to help shape how Generative AI ... Apply strong engineering discipline, including testing, version control, monitoring, and iteration ...
There AI Core group pioneers' platforms across Generative AI, AI Agents, RAG, Knowledge Bases, Data ... testing, deployment, and operations) * 5-10 years of Python expertise, including advanced features ...
Quick apply
There AI Core group pioneers' platforms across Generative AI, AI Agents, RAG, Knowledge Bases, Data ... testing, deployment, and operations) * 5-10 years of Python expertise, including advanced features ...
Stay Current on AI Trends & Innovations - Monitor developments in generative AI space and bring ... Certification in software testing (e.g., ISTQB). Experience with cloud-based testing environments.
Stay Current on AI Trends & Innovations - Monitor developments in generative AI space and bring ... Certification in software testing (e.g., ISTQB). Experience with cloud-based testing environments.
You'll define the GEO and AI content development and distribution strategy, architect the operating ... Run continuous testing across prompts, formats, and content pillars to improve model and business ...
You'll define the GEO and AI content development and distribution strategy, architect the operating ... Run continuous testing across prompts, formats, and content pillars to improve model and business ...
You'll define the GEO and AI content development and distribution strategy, architect the operating ... Run continuous testing across prompts, formats, and content pillars to improve model and business ...
You'll define the GEO and AI content development and distribution strategy, architect the operating ... Run continuous testing across prompts, formats, and content pillars to improve model and business ...
... testing, release, and sustainment * Strong communication skills with the ability to engage business ... Exposure to AI agent tools and generative AI capabilities such as Copilot Studio, Azure AI, Azure ...
New
... testing, release, and sustainment * Strong communication skills with the ability to engage business ... Exposure to AI agent tools and generative AI capabilities such as Copilot Studio, Azure AI, Azure ...
New
... testing, release, and sustainment * Strong communication skills with the ability to engage business ... Exposure to AI agent tools and generative AI capabilities such as Copilot Studio, Azure AI, Azure ...
New
... testing, release, and sustainment * Strong communication skills with the ability to engage business ... Exposure to AI agent tools and generative AI capabilities such as Copilot Studio, Azure AI, Azure ...
New
... testing, release, and sustainment * Strong communication skills with the ability to engage business ... Exposure to AI agent tools and generative AI capabilities such as Copilot Studio, Azure AI, Azure ...
New
... testing, release, and sustainment * Strong communication skills with the ability to engage business ... Exposure to AI agent tools and generative AI capabilities such as Copilot Studio, Azure AI, Azure ...
New
... testing, release, and sustainment * Strong communication skills with the ability to engage business ... Exposure to AI agent tools and generative AI capabilities such as Copilot Studio, Azure AI, Azure ...
New
... testing, release, and sustainment * Strong communication skills with the ability to engage business ... Exposure to AI agent tools and generative AI capabilities such as Copilot Studio, Azure AI, Azure ...
New
Generative Ai Testing information
What are the key skills and qualifications needed to thrive as a Generative AI Testing Specialist, and why are they important?
What are some common challenges faced when testing generative AI models, and how can I prepare to address them in this role?
What is Generative AI Testing?
What is the difference between Generative Ai Testing vs Data Scientist?
| Aspect | Generative Ai Testing | Data Scientist |
|---|---|---|
| Required Credentials | Knowledge of AI models, testing tools, programming skills | Statistics, programming, data analysis certifications |
| Work Environment | AI development teams, testing labs, tech companies | Research labs, tech firms, finance, healthcare |
| Employer & Industry Usage | AI product testing, quality assurance in tech | Data analysis, predictive modeling across industries |
Generative Ai Testing focuses on evaluating and validating AI-generated content and models, ensuring quality and accuracy. Data Scientists analyze data, build models, and derive insights. While both roles require programming and AI knowledge, Generative Ai Testing emphasizes testing processes, whereas Data Scientists focus on data analysis and model development.

Contractor
Posted 24 days ago
Job description
QA Lead - AI Systems & Models Testing
Quality Assurance Artificial Intelligence Contract Position
Contract
Montreal, QC
AI / ML Testing
LLM / RAG / LangChain
ABOUT THE ROLE
We are seeking an experienced QA Lead with deep expertise in AI systems testing to join our team on a contract basis in Montreal, Quebec. This role sits at the intersection of quality engineering and artificial intelligence, requiring hands-on proficiency in LLM behavior analysis, RAG pipeline validation, and modern AI orchestration frameworks. You will own the end-to-end test strategy for complex AI products and help define quality standards in a rapidly evolving space.
MUST-HAVE SKILLS
- Proven QA leadership experience designing and executing test strategies for AI/ML systems or LLM-powered applications.
- Strong understanding of LLM internals: tokenization, embeddings, attention mechanisms, and inference behavior to anticipate and diagnose failure modes.
- Hands-on experience with prompt engineering - constructing effective prompts, detecting hallucinations, and evaluating outputs across accuracy, tone, coherence, and bias dimensions.
- Experience testing RAG pipelines and knowledge base integrations, including validation of data quality and retrieval accuracy as they impact model outputs.
- Familiarity with vector database mechanics: similarity search thresholds, embedding drift, near-duplicate documents, and sparse vs. dense embeddings.
- Practical experience with LangChain and/or LangGraph - able to read chain/graph construction code, identify failure points, and write test harnesses.
- Ability to validate MCP (Model Context Protocol) integration points, including tool availability and error-handling scenarios.
- Proficiency applying generative AI evaluation metrics and establishing quality thresholds appropriate for production AI systems.
- Excellent written and verbal communication in English; bilingualism (English/French) is a plus for the Montreal market.
NICE-TO-HAVE SKILLS
- Experience with bias detection and safety testing frameworks for AI systems.
- Exposure to performance and scalability testing of vector databases under high load.
- Familiarity with CI/CD pipelines for ML model deployment and automated regression testing.
- Knowledge of responsible AI principles and AI governance frameworks.
- Contributions to or experience with open-source AI testing or evaluation tooling (e.g., DeepEval, Ragas, PromptFlow).
- Background in data engineering or data quality practices relevant to AI pipeline inputs.
- Cloud platform experience (AWS, Azure, or GCP) in the context of deploying or testing AI workloads.
KEY RESPONSIBILITIES
- Lead design and execution of comprehensive test strategies across AI systems, including prompt evaluation, output quality assessment, and bias/safety analysis.
- Develop and maintain test harnesses for LangChain and LangGraph-based applications; review chain and graph construction code to proactively surface integration risks.
- Validate RAG pipeline integrity - data ingestion, chunking, retrieval accuracy, and embedding consistency - and define edge-case coverage for vector database interactions.
- Establish and track generative AI quality metrics and thresholds; report on model output quality across multiple evaluation dimensions.
- Collaborate with ML engineers, data scientists, and product teams to embed quality practices throughout the AI development lifecycle.
- Document test findings clearly for both technical and non-technical stakeholders.
Contract position based in Montreal, Quebec, Canada On-site / Hybrid
Employment Type: CONTRACTOR