About Us
Based in San Francisco, California, Turing is the world's leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems.
Project Overview
As a Software Engineering evaluator, you will create cutting-edge datasets for training, benchmarking, and advancing large language models, collaborating closely with researchers. This includes curating code examples, providing precise solutions, and making corrections in Python, JavaScript (including React), C/C++, Java, Rust, and Go; evaluating and refining AI-generated code for efficiency, scalability, and reliability; and working with cross-functional teams to enhance enterprise-level AI-driven coding solutions.
Responsibilities
- Work on AI model training initiatives by curating code examples, building solutions, and correcting code in Python, JavaScript (including React), C/C++, Java, Rust, and Go
- Evaluate and refine AI-generated code to ensure that it is efficient, scalable, and reliable
- Collaborate with cross-functional teams to improve AI-driven coding solutions and measure them against industry performance benchmarks
- Build agents that can verify code quality and identify recurring error patterns
- Form hypotheses about individual steps of the software engineering cycle and evaluate model capabilities on each of them
- Design mechanisms that can automatically verify a solution to a software engineering task
Requirements
- 5+ years of software engineering experience, including 2+ years of continuous full-time experience at a top-tier product company
- Strong expertise in building full-stack applications and deploying scalable, production-grade software using modern languages and tools
- Deep understanding of software architecture, design, development, debugging, and code quality/review assessment
- Excellent oral and written communication skills, including the ability to write clear, structured evaluation rationales