Job description
Job details
- Location: London, UK
- Work mode: Remote
- Employment type: Freelance (Not an internship)
- Salary: GBP 79,921 per year
Role overview
Mindrift is seeking a Freelance Agent Evaluation Engineer in London, UK to work on project-based AI opportunities. This remote freelance role focuses on building datasets to evaluate AI coding agents and their performance on real-world developer tasks. The position offers ยฃ79,921 and involves creating challenging evaluation tasks and criteria for leading tech companies' AI systems.
Job details
This is a freelance, project-based opportunity based in London, UK with remote work options. The role involves testing, evaluating, and improving AI systems through structured task creation and assessment. Candidates must submit their CV in English and indicate their English proficiency level. Salary is ยฃ79,921 GBP. This is not permanent employment but project-based engagement with Mindrift.
Responsibilities
- Create challenging tasks to evaluate AI coding agent performance
- Develop evaluation criteria for real-world developer scenarios
- Test and assess AI systems for leading tech companies
- Build datasets for AI model evaluation and improvement
- Document task requirements and evaluation methodologies
Requirements
- Strong English proficiency (written and spoken)
- Experience with AI systems testing or evaluation
- Understanding of software development workflows
- Ability to design realistic developer task scenarios
- Analytical skills for assessing AI model outputs
Benefits
- Flexible project-based work
- Remote work opportunity
- Competitive freelance rate of ยฃ79,921
- Work with leading tech companies