This role is for one of our clients. Our client is one of the world's fastest-growing AI companies, pushing the boundaries of AI-assisted software development. Their mission is to empower the next generation of AI systems to reason about and work with real-world software repositories. You'll be working at the intersection of software engineering, open-source ecosystems, and frontier AI.
Our client is building high-quality evaluation and training datasets to improve how Large Language Models (LLMs) interact with realistic software engineering tasks. A key focus of this project is curating verifiable software engineering challenges from public GitHub repository histories using a human-in-the-loop process.