Challenge #HackathonSomosNLP 2026: INCLUDE Exams

How to participate in this challenge and help improve the cultural knowledge of language models


Look for multiple-choice exams from your country to evaluate LLMs’ knowledge. Prioritize exams in languages other than Spanish and/or focused on cultural topics (e.g. history, literature). We will use these questions and answers to extend the open INCLUDE benchmark.

April 9 - May 31 (EXTENDED) | max 1 point

Participate now!

🌎 You can contribute exams from any country regardless of where you’re from or live — check the “Prioridad países” sheet for priorities.

✨ Incentives (numbers refer to questions with their corresponding answers):

  • Per team:
    • 100 questions in total = 0.5 points
    • 200 questions in total = 1 point
    • 200 questions per team = also required to access the 500 USD in Cohere API credits for the main challenge
  • Per person:
    • Every 100 questions = 50 USD in GPU credits or books (your choice)
    • 300 questions per person = invitation to the global project Slack and co-authorship of the INCLUDE v2 paper led by EPFL
  • NOTE: Exams must meet the requirements!
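
As a rough illustration of the thresholds above, here is a minimal Python sketch. The function names are hypothetical, and it assumes that counts between thresholds earn the lower tier (e.g. 150 team questions still count as 0.5 points), which the rules above do not state explicitly.

```python
def team_rewards(team_questions: int) -> dict:
    """Team-level incentives, assuming simple thresholds at 100 and 200."""
    if team_questions >= 200:
        points = 1.0  # maximum for this challenge
    elif team_questions >= 100:
        points = 0.5
    else:
        points = 0.0
    return {
        "points": points,
        # 200 questions is also the requirement for the Cohere API credits.
        "meets_cohere_requirement": team_questions >= 200,
    }


def person_rewards(person_questions: int) -> dict:
    """Per-person incentives: 50 USD for every full block of 100 questions."""
    return {
        "credit_usd": 50 * (person_questions // 100),
        "slack_and_coauthorship": person_questions >= 300,
    }


print(team_rewards(150))
print(person_rewards(320))
```

In all cases the counts only include questions from exams that meet the requirements listed in the protocol.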

Resources:


Protocol for collecting multilingual exams

Below, we present the protocol for participating in the INCLUDE project, focused on collecting multilingual exams.

1. Search for exams

Check that the exam meets the following requirements:

  • Not proprietary. If the license restricts commercial use but allows redistribution for research purposes, we can use the exam. If the license is unknown, include the exam anyway.

  • It’s a multiple-choice exam with 4 options per question.

  • Contains the answers, with only one correct answer per question.

  • The exam topic must be related to a country’s culture (e.g. history, literature) or cover regional information (e.g. driver’s license). Exams in the exact or natural sciences (e.g. mathematics, physics) are not valid.

  • Prioritize exams in languages native to LATAM or co-official in Spain.

  • Spanish-language exams from the following countries are also valid:

    PRIORITY              NO*
    Puerto Rico           Spain
    Dominican Republic    Chile
    Costa Rica
    Panama
    Nicaragua
    Guatemala
    El Salvador
    Equatorial Guinea
    Honduras
    Cuba
    Bolivia
    Colombia
    Paraguay
    Uruguay
    Venezuela

*Unless it’s an exam with a very strong cultural or regional component. In that case, ask first on Discord. Either way, we still recommend looking for exams from the priority countries.

Ideas for finding exams:

  • Language exams
  • Naturalization exams
  • Driving theory exams
  • University entrance or university exams
  • Primary or secondary school exams
  • Professional qualifying exams (law, medicine, psychology, etc.)
  • Questions from “Who Wants to Be a Millionaire?”-style shows
  • Questions from Trivial Pursuit-type games
  • Self-assessment tests in textbooks

Remember: it doesn’t have to be a digitized exam — you can also scan books or take photos of documents.

2. Add exams to the spreadsheet

When you find an exam, save its URL/name/article/source documentation and add it to the spreadsheet.

Include the following:

  • Your name
  • Your Discord name
  • Exam name (as detailed as possible)
  • Language and country of origin of the exam
  • Exam domain (e.g. Literature, Law, Driving, etc.)
  • Exam level
  • Number of questions
  • Exam source (URL if available online, book name or URL to the PDF document in your Drive, etc.)
  • Original format (e.g. PDF, web page, textbook, etc.)

3. Process the exams

Once you’ve found an exam, convert each of its questions to the expected format. Example JSON:

{
  "language": "es",
  "country": "España",
  "exam_name": "Examen final de Historia de España de Secundaria 2017",
  "source": "https://url-of-the-exam",
  "license": "CC-BY-SA",
  "level": "University entrance",
  "category_en": "History",
  "category_original_lang": "Historia",
  "original_question_num": 1,
  "question": "¿En cuál de los siguientes años comenzó la Guerra Civil?",
  "options": [ "1936", "1937", "1938", "1939" ],
  "answer": 0
}
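
Before submitting, it can help to check each record against the requirements from step 1. The following is a minimal Python sketch, not an official validator: the function name `validate_record` is hypothetical, the list of required fields is taken from the example above, and it assumes that "answer" is a 0-based index into "options" (consistent with the example, where 0 points to "1936").

```python
REQUIRED_FIELDS = {
    "language", "country", "exam_name", "source", "license", "level",
    "category_en", "category_original_lang", "original_question_num",
    "question", "options", "answer",
}


def validate_record(record: dict) -> list[str]:
    """Return a list of problems found in one question record (empty if none)."""
    problems = []

    missing = sorted(REQUIRED_FIELDS - record.keys())
    if missing:
        problems.append("missing fields: " + ", ".join(missing))

    question = record.get("question")
    if not isinstance(question, str) or not question.strip():
        problems.append("'question' must be non-empty text")

    # Requirement: multiple choice with 4 options per question.
    options = record.get("options")
    if not isinstance(options, list) or len(options) != 4:
        problems.append("'options' must be a list of exactly 4 choices")

    # Requirement: exactly one correct answer.
    # Assumption: it is stored as a 0-based index into 'options'.
    answer = record.get("answer")
    if isinstance(answer, bool) or not isinstance(answer, int) or not 0 <= answer < 4:
        problems.append("'answer' must be an integer index from 0 to 3")

    return problems


example = {
    "language": "es",
    "country": "España",
    "exam_name": "Examen final de Historia de España de Secundaria 2017",
    "source": "https://url-of-the-exam",
    "license": "CC-BY-SA",
    "level": "University entrance",
    "category_en": "History",
    "category_original_lang": "Historia",
    "original_question_num": 1,
    "question": "¿En cuál de los siguientes años comenzó la Guerra Civil?",
    "options": ["1936", "1937", "1938", "1939"],
    "answer": 0,
}

print(validate_record(example))  # prints []
```

A check like this only catches format problems; whether the exam topic and license are acceptable still has to be reviewed by hand.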

Team

Many thanks to:

  • EPFL: prizes and global team organization
  • The team: María Grandury and Angelika Romanou