Senior QA Automation Engineer - AI Validation

ai71 • United Arab Emirates

Visa Sponsorship Relocation

Apply

AI Summary

The Senior QA Automation Engineer will lead the validation and verification strategies for ai71's AI transformation. The role involves defining automated testing frameworks to validate AI behaviors against Ground Truth datasets. The ideal candidate will have experience in QA Automation, AI evaluation, and performance engineering.

Key Highlights

Lead validation and verification strategies for AI transformation

Define automated testing frameworks for AI behaviors

Ensure AI agents meet strict reliability standards

Key Responsibilities

Architect automated frameworks to evaluate Generative AI outputs

Implement automated metrics to verify Retrieval-Augmented Generation pipelines

Design regression suites to monitor prompt drift

Technical Skills Required

Python Pytest Selenium Playwright Requests DeepEval TruLens Locust K6 SQL Great Expectations

Benefits & Perks

100% tax-free salary and benefits package

Comprehensive relocation assistance

Visa sponsorship

Nice to Have

Background in Defense, Aerospace, or highly regulated industries

Familiarity with IV&V processes

Job Description

The Mission

AI71 is seeking a Senior QA Automation Engineer to lead the validation and verification strategies for ai71's AI transformation. In this role, you will define "what good looks like" for non-deterministic AI systems, ensuring that Large Language Models (LLMs) and predictive engines meet the strict reliability standards required for the defense and enterprise sectors.

You will act as the bridge between Agile development and formal Systems Engineering. Your mandate is to build automated testing frameworks that validate AI behaviors against "Ground Truth" datasets and ensure our AI agents pass rigorous Test Readiness Reviews (TRR) and Functional Configuration Audits (FCA)

Key Responsibilities What You'll Bring

AI & LLM Validation
Non-Deterministic Testing: Architect automated frameworks to evaluate Generative AI outputs for hallucination, consistency, and factual accuracy against "Gold Standard" datasets.
RAG Evaluation: Implement automated metrics (e.g., RAGAS, faithfulness, answer relevance) to verify that Retrieval-Augmented Generation pipelines accurately cite technical and regulatory documentation.
Prompt Regression: Design regression suites to monitor "prompt drift," ensuring model updates do not degrade the quality of AI-generated engineering documents.
Integration & System Verification
Enterprise Integration: Build robust tests to validate data consistency between AI agents and critical systems (e.g., SAP S/4HANA, Ariba), ensuring the integrity of Bill of Materials (BOM) and financial data.

Looking to advance your Development & Programming career with relocation support? Explore Development & Programming Jobs with Relocation Packages that include comprehensive packages to help you move and settle in your new role.

Performance Benchmarking: Design tests to validate latency and throughput for forecasting models and risk-scoring engines using tools like Locust, JMeter, or K6.
API & Security Validation: Automate testing of secure API gateways, verifying Role-Based Access Control (RBAC) and PII redaction logic before data reaches AI models.
Governance & Traceability
V-Model Alignment: Map automated test cases to "System Requirements" to create digital evidence for formal Verification and Validation (V&V) reports.
Stage Gate Compliance: Prepare "Test Readiness" packages for formal reviews, providing quantitative evidence that systems are stable enough to move from MVP to Production.
Defect Lifecycle Management: Manage the feedback loop between Requirements Quality Assistants and development teams, tracing AI logic defects back to specific model versions.

Technical Requirements

Discover our full range of relocation jobs with comprehensive support packages to help you relocate and settle in your new location.

Core Automation: Expert proficiency in Python (Pytest) and standard libraries (Selenium/Playwright, Requests).
AI Evaluation: Hands-on experience with LLM evaluation frameworks (e.g., DeepEval, TruLens) and "Ground Truth" dataset management.
Performance Engineering: Proficiency in crafting Performance Test Plans and implementations (Locust, K6, etc.).
Data Validation: Expertise in SQL and data quality tools (e.g., Great Expectations) for Data Lakehouses and Vector Databases.
CI/CD & DevOps: Strong experience integrating quality gates into GitLab CI/CD pipelines.
Engineering Practices: Deep understanding of modern QE practices, including Shift Left, Test Pyramid, and Mono-repo architectures.

Professional Qualifications

Interested in relocating to United Arab Emirates? Check out our comprehensive Relocation Jobs in United Arab Emirates page with detailed relocation packages and benefits.

Experience: 5+ years in QA Automation, with 2+ years focused on complex data-driven applications, ML models, or AI agents.
Domain Expertise: Background in Defense, Aerospace, or highly regulated industries is a strong plus. Familiarity with IV&V processes is highly desirable.
Analytical Mindset: Ability to define pass/fail criteria for probabilistic systems and communicate "Confidence Levels" to engineering leadership.

Why AI71?

Final Line of Defense: You define whether an AI agent is "trusted" to negotiate a contract or design a critical system component in a safety-critical environment.
Tax-Free Compensation: A market-leading, 100% tax-free salary and benefits package.
Full Relocation Support: Comprehensive assistance including flights, housing support, and visa sponsorship for you and your family to Abu Dhabi.
Pioneering Work: Set the global standard for how defense organizations validate intelligent systems using world-leading models.

Job Overview

Posted Date Jun 04, 2026

Employment Type Full-time

Experience Level Not Applicable

Location United Arab Emirates

Category Programming

Company ai71

Mentioned Skills

Industries

Similar Jobs

Explore other opportunities that match your interests

Senior Software Engineer (Backend)

Programming

•

9h ago

Visa Sponsorship Relocation Remote

Job Type Full-time

Experience Level Mid-Senior level

ventureone

United Arab Emirates

Senior/Principal Software Engineer - Fullstack AI-Assisted Legal Drafting Platform

Programming

•

2d ago

Visa Sponsorship Relocation Remote

Job Type Full-time

Experience Level Mid-Senior level

laine

United Arab Emirates

Talent Acquisition Specialist (Remote, Commission-Based)

Programming

•

1w ago

Visa Sponsorship Relocation Remote

Job Type Full-time

Experience Level Mid-Senior level

whitecarrot.ai

United Arab Emirates

Senior QA Automation Engineer - AI Validation

Key Highlights

Key Responsibilities

Technical Skills Required

Benefits & Perks

Nice to Have

Job Description

Job Overview

Mentioned Skills

Industries

Similar Jobs

Senior Software Engineer (Backend)

ventureone

Senior/Principal Software Engineer - Fullstack AI-Assisted Legal Drafting Platform

laine

Talent Acquisition Specialist (Remote, Commission-Based)

whitecarrot.ai

Subscribe our newsletter