Our 2025 hackathon has ended

Discover the final projects and the talks delivered

María Grandury · Jul 31, 2025 · 6min

The fourth edition of the SomosNLP hackathon has come to an end — what an experience!

Hackathon 2025 Poster

🚀 Projects

The focus of this hackathon was the generation of open resources for evaluating and improving the cultural adequacy of LLMs for Ibero-American countries.

Curious to see the projects developed during the SomosNLP 2025 Hackathon? Here they are!

🎦 The presentation videos are available in this YouTube playlist along with the workshops and expert talks held during the hackathon.

🤗 All resources are available on the Hugging Face Hub: hf.co/somosnlp-hackathon-2025

We hope you enjoy them and that many applications emerge using these new open resources 💛

📚 Cultural knowledge benchmark: INCLUDE

This challenge consisted of collecting multiple-choice exams and extracting their questions to build a large benchmark for evaluating LLMs on regional knowledge.

In total, we collected more than 38,000 questions from 23 countries 🔥

In particular, we obtained more than 1,000 questions for each of México, Colombia, Perú, Argentina, Bolivia, España and Ecuador.

Thank you so much for your effort!

The people who extracted the most questions were...
Rank | Name | Questions extracted
🥇 | Francisco-Javier Rodrigo-Ginés | 4599
🥈 | Pablo Carrera | 2830 *
🥉 | Alfonso Amayuelas | 2300
4 | Naira Paola Arnez Jordan | 1581
5 | Oscar Cumbicus | 1280
6 | Jorge Vallego | 927
7 | Juan Calderón | 902 *
8 | Reewos Talla | 608 *
9 | Carlos Arriaga | 598
10 | Andrea Parra | 577
11 | Jorge Téllez | 561 *
12 | Susana Zhou | 560
13 | Enrique Paiva | 502
14 | David Quispe | 449 *
15 | Gonzalo Martínez | 436
16 | Guido Ivetta | 393
17 | Javier Conde | 377
18 | Fabian Perez | 372
19 | Andrés Sebastian | 370
20 | Gerardo Huerta | 353
21 | Marcos J. Gómez | 348
22 | David Nazareno Campo | 303
23 | Roverico | 303 *
24 | Henry Mantilla | 302
25 | Constanza Jeldres | 300
26 | Rasel Agüero Fernández | 300
27 | Rosabel F. Medina Sarmiento | 300
28 | Adrián Sáez | 227 *
29 | Gabriela Palomeque | 120

The table shows the number of questions each participant extracted (not collected). An asterisk means the participant must confirm the license of some exams before compensation can be paid. Everyone with more than 300 questions will be a co-author of the INCLUDE paper.

📚 Cultural knowledge benchmark: BLEND

In this challenge, participants answered questions about their own country to extend the open BLEND benchmark, which evaluates the cultural knowledge of LLMs.

The countries with the highest participation were España, México, Chile, Cuba, and Perú. Great work! 👏

The annotation space is still open — join in!

📚 Stereotype validation

This challenge consisted of collecting and validating stereotypes about different nationalities. In total, we obtained nearly 1,000 stereotypes that will help us mitigate biases in LLMs.

The people who validated the most stereotypes were...
Rank | Discord ID | Stereotypes validated
🥇 | bea esparcia | 126
🥈 | neovalleltd | 122
🥉 | dreamripper1 | 85
4 | andres_seba | 70
5 | alexis_castillo | 68
6 | elena w. | 57
7 | alebravo | 30
8 | jedzill4 | 27
9 | gonznm | 24
10 | agumeister | 21
11 | adriszmar | 20
12 | jorge.vallego | 14
13 | jorgeav | 13
14 | maria isabel ll | 12
15 | clauvallory | 5
16 | dramos7 | 5
17 | enpaiva93 | 3
18 | lucase#5596 | 3
19 | alvaro8gb | 2
20 | mcdaqc | 2
21 | xat. | 2
22 | freddyalfonsoboulton | 1
23 | roverico | 1
24 | valaery | 1
25 | yee51 | 1

📚 Preference dataset

This challenge consisted of designing prompts that evaluated cultural adequacy for each country, followed by choosing the best response in an LLM Arena.

🤗 The dataset with the prompt collection is available on Hugging Face: hf.co/datasets/somosnlp-hackathon-2025/dataset-preferencias-dpo-v0
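If you want to explore the prompts yourself, here is a minimal sketch of one way to load the dataset in Python. It assumes the Hugging Face `datasets` library is installed and the repository is publicly readable; the helper names `repo_id_from_url` and `load_preferences` are made up for this example, and no particular split or column names are assumed.

```python
from urllib.parse import urlparse


def repo_id_from_url(url: str) -> str:
    """Turn a short Hub dataset link into the 'owner/name' repository id."""
    # Accept links with or without a scheme, e.g. "hf.co/datasets/owner/name".
    parsed = urlparse(url if "://" in url else f"https://{url}")
    parts = [p for p in parsed.path.split("/") if p]
    if len(parts) != 3 or parts[0] != "datasets":
        raise ValueError(f"Not a Hub dataset link: {url}")
    return f"{parts[1]}/{parts[2]}"


def load_preferences(url: str):
    """Download the dataset (needs network access and `pip install datasets`)."""
    # Imported lazily so the link parsing above also works without the library.
    from datasets import load_dataset

    # Without a `split` argument, this returns every split the repository has.
    return load_dataset(repo_id_from_url(url))


REPO_ID = repo_id_from_url(
    "hf.co/datasets/somosnlp-hackathon-2025/dataset-preferencias-dpo-v0"
)
```

Calling `load_preferences(...)` with the link above and printing the result should list whichever splits and columns the dataset actually contains.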

The countries with the highest participation were Colombia, Chile, España, Perú, Paraguay, Nicaragua, and México.

The people who contributed the most prompts were...
Rank | Discord ID | Preferences
🥇 | rasel3132 | 430
🥈 | bel21093 | 206
🥉 | conilinguist | 196
4 | roverico | 164
5 | pablo.ce | 153
6 | steminism | 133
7 | andres_seba | 120
8 | mcdaqc | 118
9 | susanazhou | 111
10 | enpaiva93 | 107
11 | dreamripper1 | 83
12 | bea esparcia | 80
13 | angustias22 | 63
14 | henry mantilla | 58
15 | luceldasilva | 58
16 | fabianpp | 50
17 | alvaro8gb | 42
18 | ghuerta170 | 35
19 | edmenciab | 30
20 | adriszmar | 22
21 | diegoacheve | 21
22 | danielcavilla | 19
23 | helenpy | 19
24 | gonzalo_40146 | 8

The number of preferences is the number of prompts that each participant submitted to the Arena and for which they voted on the best LLM-generated response. It may differ from the number of prompts each team designed and uploaded to the Hugging Face dataset, since not every prompt was necessarily submitted to the Arena.

And the three best corpora were… 🥁🥁🥁

  • 🥇 TralaleloTralala-MemeAlign
  • 🥈 IberoTales
  • 🥉 HoCV-COL

Congratulations to the finalist teams (in alphabetical order):

  • 👏 Comida Colombia + Ecuador
  • 👏 Cresia
  • 👏 Equipo LeIA
  • 👏 Falsos Amigos
  • 👏 Refranero Afro-Cubano
  • 👏 Sabiduría Popular Castellana
  • 👏 Think Paraguayo

Congratulations to all the teams!

🎁 Prizes and next steps

  • In August, we will share more information about honorable mentions and contact all teams to deliver the corresponding prizes.
  • If you have any questions about the point count, don’t hesitate to ask. The email-Discord ID mapping was done with the data from the registration form.
  • If you want to keep contributing to the mini challenges and take a more active role in the papers we are going to write, let us know in the #compare-tu-proyecto channel and we will invite you to the corresponding private channels.
  • If you expressed interest in the submission form in publishing a paper about your project, we will contact you in September for the mentoring sessions. In the meantime, you can start writing up your experiments in article format (introduction/motivation, methodology, results, and analysis).

💛 Thank you so much and see you next time!