Job description
Role overview
Mindrift is seeking a freelance Agent Evaluation Engineer for project-based AI evaluation and improvement work. The role involves testing AI systems, analyzing performance metrics, and collaborating with tech companies to enhance AI capabilities. Engagement is per project and does not constitute permanent employment.
Candidates must submit a CV in English and specify their English proficiency level. The position requires a mid-level understanding of AI/ML principles and evaluation methodologies.
Responsibilities
- Evaluate and test AI agent performance across multiple metrics
- Collaborate with development teams to identify system improvements
- Design and implement evaluation frameworks for AI models
Requirements
- 2 to 4 years of experience in AI/ML evaluation or testing
- Proficiency in Python and data analysis tools
- Strong understanding of AI system performance metrics
Benefits
- Flexible freelance project opportunities
- Work with leading technology companies on cutting-edge AI projects
Keywords
AI, Machine Learning, Agent Evaluation, Python, Testing, Data Analysis, AI Systems, Evaluation Frameworks