Onlano
AI / Machine Learning | Onsite

Agent Evaluation Engineer

Mindrift - Company

New South Wales, Australia | Freelance | Mid level (2-4 years) | 0 applicants | Closes Jun 14, 2026

Salary

See job description

Job description

Role overview

Mindrift is seeking a freelance Agent Evaluation Engineer to work on AI evaluation and improvement initiatives. The role involves testing AI systems, analyzing performance metrics, and collaborating with tech companies to enhance AI capabilities. Participation is project-based and does not constitute permanent employment.

Candidates must submit a CV in English and specify their English proficiency level. The position requires a mid-level understanding of AI/ML principles and evaluation methodologies.

Responsibilities

  • Evaluate and test AI agent performance across multiple metrics
  • Collaborate with development teams to identify system improvements
  • Design and implement evaluation frameworks for AI models

Requirements

  • 2-4 years of experience in AI/ML evaluation or testing
  • Proficiency in Python and data analysis tools
  • Strong understanding of AI system performance metrics

Benefits

  • Flexible freelance project opportunities
  • Work with leading technology companies on cutting-edge AI projects

Keywords

AI, Machine Learning, Agent Evaluation, Python, Testing, Data Analysis, AI Systems, Evaluation Frameworks