AI Research Lead Evaluation

5 days ago


London, United Kingdom GSM Conference Services Full time

Department: Technology
Team: AI
Location: London with hybrid ways of working
Position type: Short Term Contract (Inside IR35) until end of Dec 2026, with potential to extend

What the hiring manager says

As the AI Research Lead, you will be at the forefront of developing and maintaining the GSMA's Open Telco AI benchmarks and working with members on evaluating low-resource language models. This role is critical to ensuring our evaluation pipelines are robust and transparent, directly impacting the quality and reliability of our Members' AI solutions. You'll have the opportunity to collaborate with leading telecom operators and frontier AI ecosystem partners, making a tangible impact on industry best practice and innovation.

About the Team

You'll join a dynamic, cross-functional team dedicated to advancing AI capabilities in the telecom sector. Our team is growing rapidly, with a culture of collaboration and technical excellence. We value curiosity, initiative, and a drive to set new benchmarks for the industry.

About the role

You will own and maintain the evaluation pipeline for open telco benchmarks, designing and implementing new benchmarks in collaboration with telecom operator members and AI partners. You'll lead the integration and benchmarking of new models, including member-submitted LLMs, and guide the expansion of benchmarking to include AI agents and diverse architectures. You'll also provide technical support for members building low-resource language LLM initiatives. Success in this role means delivering robust, transparent benchmarking processes and supporting the community in adopting best practices.

About You

You are passionate about machine learning model evaluation and have demonstrable experience with open-source evaluation frameworks (such as HELM or lm-eval-harness). You thrive in collaborative environments, working with technical partners and cross-functional stakeholders. Your experience developing or evaluating local language LLMs gives you a unique perspective on linguistic, cultural, and resource constraints. You are adept at managing versioned pipelines and benchmarking reports, and you bring an understanding of telco or enterprise AI.

About your skills

You'll possess:

  • Strong experience in machine learning model evaluation and benchmarks.
  • Familiarity with open-source evaluation frameworks (e.g. HELM, lm-eval-harness).
  • Experience collaborating with technical partners and cross-functional stakeholders.
  • Demonstrated ability to manage versioned pipelines and benchmarking reports.
  • Understanding of telco or enterprise AI use cases (a strong plus).
  • Proven experience developing or evaluating local language LLMs, with an understanding of unique linguistic, cultural, and resource constraints (a strong plus).
  • Communication, Analysis, Project Management, Innovation, Stakeholder Management.

We strive to offer a meaningful and inclusive application experience for all candidates. Should you require any accommodations or adjustments due to a disability or for any other reason during the hiring process, please contact us with your request.

Contract type: Short term Contractor
Worker type: Contingent Worker

What We Offer

Working at the GSMA offers you unparalleled access to the mobile industry. We offer a chance to truly shape the direction of mobile, whatever your role. By joining the GSMA, you will be exposed to a fast-paced, rapidly evolving environment, working on global solutions and genuinely fascinating, industry-changing projects, in a stimulating and dynamic environment designed to enable you to flourish.

In addition to architect-designed offices and competitive compensation, our benefits include fantastic learning & development opportunities, generous holiday allowances, four additional days off for professional development, and many others.

To learn more about the GSMA, visit our career site, our LinkedIn page, and our Twitter page.

Being You at the GSMA

We care deeply about diversity, equity, and inclusivity, and aspire to be the best at it. Your well-being and work/life balance are important, so flexi-time and remote working are available to all staff. We're keen to ensure everyone is equal, represented, and connected, so we particularly encourage applications from all demographics. The success of the GSMA, year on year, will continue to be contributed to by people from all walks of life.

GSMA Values

Our values not only drive our culture, they shape how we work and interact inside and outside our global organisation.

Passionately driven
We approach everything we do with unparalleled capability, tenacity, and commitment, knowing that the challenging scale, pace, and complexity of our work is what leads to its world-changing impact.

Insightful leaders
We continually develop and engage our expertise, insight, and creativity so that we're always ready to respond to the changing landscape with authority, agility, and nuance.

Stronger together
We lean on each other so the industry can lean on us, embracing our diversity by actively seeking out perspectives and skill sets beyond our own, fuelling each other's successes, and constantly asking how we can help.

Underpinning our values is our collective mindset to show up purposefully as good human beings every day, in every situation. When we're at our best, we are collaborative, considerate, and compassionate to others, and we create a safe space for one another to thrive, assuming positive intent in our colleagues. And if we aren't at our best and the pressure is on, we feel free to be ourselves but still remain curious and lean into the tough stuff, and we are always respectful to others and accountable for the part we play.

Key Skills: Administrative Skills, Facilities Management, Biotechnology, Creative Production, Design And Estimation, Architecture
Employment Type: Full-Time
Experience: years
Vacancy: 1