AI / Machine LearningOnsite

Agent Evaluation Engineer

Mindrift - Company

New South Wales, Australia, 🇦🇺 AustraliaFreelance - Mid level (2-4 years)0 applicantsCloses Jun 14, 2026

Salary

CHECK DESCRIPTION

Apply for this job

Job description

Role overview

Mindrift is seeking a freelance Agent Evaluation Engineer to work on project-based AI evaluation and improvement initiatives. This role involves testing AI systems, analyzing performance metrics, and collaborating with tech companies to enhance AI capabilities. Participation is project-based, not permanent employment.

Candidates must submit a CV in English and specify their English proficiency level. The position requires a mid-level understanding of AI/ML principles and evaluation methodologies.

Responsibilities

Evaluate and test AI agent performance across multiple metrics
Collaborate with development teams to identify system improvements
Design and implement evaluation frameworks for AI models

Requirements

2-4 years of experience in AI/ML evaluation or testing
Proficiency in Python and data analysis tools
Strong understanding of AI system performance metrics

Benefits

Flexible freelance project opportunities
Work with leading technology companies on cutting-edge AI projects

Keywords

AIMachine LearningAgent EvaluationPythonTestingData AnalysisAI SystemsEvaluation Frameworks