AI / Machine LearningRemote

Freelance Agent Evaluation Engineer

Mindrift - Company

UK, 🇬🇧 United KingdomFreelance - Mid level (2-4 years)0 applicantsCloses Jul 23, 2026

Salary

GBP 69,357 - 69,357 / year

Apply for this job

Job description

Job details

Location: UK
Work mode: Remote
Employment type: Freelance (Not an internship)
Salary: GBP 69,357 per year

Role overview

Mindrift is seeking a skilled Freelance Agent Evaluation Engineer based in the UK to help refine the next generation of AI coding agents. In this role, you will focus on testing and improving AI systems by creating complex, real-world developer tasks and defining rigorous evaluation criteria to measure model performance accurately.

Job details

This is a Freelance, project-based position located in the UK. This role is Remote and is not an internship. The offered salary is 69,357 GBP. Candidates must provide a CV in English and state their proficiency level.

Responsibilities

Develop challenging real-world coding tasks to evaluate AI agent capabilities.
Establish clear and objective evaluation criteria for AI-generated code.
Analyze model outputs to identify failures in logic, syntax, or efficiency.
Contribute to the creation of high-quality datasets for AI model training.
Collaborate with technical teams to improve the accuracy of coding agents.

Requirements

Proven experience in software development with strong coding proficiency.
Ability to design complex technical scenarios for AI testing.
High level of proficiency in written and spoken English.
Strong analytical skills to evaluate the quality of AI-generated solutions.
Experience with AI/ML evaluation frameworks is a plus.

Benefits

Flexible remote work environment.
Opportunity to work with leading global tech companies.
Engagement in cutting-edge AI development projects.

Keywords

AI EvaluationLLM TestingCoding AgentsSoftware EngineeringDataset CreationQuality AssurancePythonAI Training