#HackathonSomosNLP 2025

Let's enable the creation of LLMs aligned with the culture of LATAM and the Iberian Peninsula.


There are 600M Spanish-speakers and 265M Portuguese-speakers in the world. These are the main languages in 29 countries, each with a rich culture. Although LLMs present increasingly better multilingual capabilities, are they really multicultural? Join now the #HackathonSomosNLP, the largest open-source NLP hackathon in Spanish and Portuguese! 🚀

In past editions, we had in total more than 1500 participants from 30 different countries, we surpassed 20,000 views of our events, learned from 20 speakers, and developed 50 projects related to the UN’s Sustainable Development Goals, demonstrating the potential of NLP to address social challenges. We’re back for that and much more! 💪

In this fourth edition, we are going to create open-source resources to evaluate and improve the cultural adequacy of LLMs with respect to each of the countries of LATAM and the Iberian Peninsula.

The best part? EVERYONE can collaborate! 🎉

Here are all the links, keep reading for more information.

GIF Hackathon #Somos600M

🚀 How you can collaborate

Click on each of the following options to learn more:

💻 Create a language model aligned with your culture

By joining this hackathon, you will have the opportunity to develop and apply your knowledge in LLM training to create quality and inclusive models in your language. You will have access to state-of-the-art model APIs, the possibility to win prizes, participate in raffles, attend talks, workshops and mentoring sessions, publish a paper… Sign up now!

Each participating team (1-5 people) will generate a dataset, align an LLM, and create a demo to share their great work with the community. It’s also possible to contribute only to the dataset.

At SomosNLP, we want to encourage you to participate regardless of your current knowledge. We will organize practical workshops and mentoring sessions so that both research institute groups and undergraduate student groups can participate, all projects add up!

To ensure everyone starts with the same conditions, we will make the rules public on April 1st.

💻 Register now!
💡 Attend talks by experts

At SomosNLP, we believe that training is also a way to collaborate with the future of NLP in Spanish. During the Tuesdays of April, various keynotes will be given by professionals in the world of Natural Language Processing. These events are free and open to everyone.

And until April arrives? The recordings of previous talks are available!

🙌 Sponsor this wonderful event

SomosNLP is a non-profit community, we seek donations, prizes, and visibility to achieve our ambitious goals and bring language models closer to the Spanish-speaking world. All help is welcome, discover how you can support our mission by offering visibility, vouchers, and donations. We count on you!

🙌 Sponsor the hackathon
📣 Help us spread the word

Help us spread the word about the event in your network so this initiative reaches more people, all support is welcome! Additionally, after 4 publications, we will add your logo to the website in the “Community Sponsorships” section.

📣 Spread the word
🔊 Propose a talk (in Spanish or Portuguese)

We invite people from academia or industry, experts and passionate about AI and particularly NLP, to share their knowledge and advances. Read the suggested topics and send us your proposal!

🔊 Propose a talk
🧑‍🏫 Offer mentoring (ES, PT, EN)

Share your experience and knowledge by supporting participating teams in creating quality databases and training a good LLM. You can provide one-time or continuous mentoring. Think about your strengths and offer mentoring!

🧑‍🏫 Offer mentoring

💡 Talks and mentorship sessions

You will have the opportunity to learn from leaders in academia and industry, keep posted as we will announce new speakers and mentors!

👏 Acknowledgments

Thank you very much for your time and for supporting us so that our initiative can reach further. Let’s make language models more inclusive!

🚀 Organized by

SomosNLP

CENIA

Universidad Politécnica de Madrid

💎 Platinum

Cohere For AI

🥇 Gold

Hugging Face

🥈 Silver

UPM - Eunomia

MistralAI

🌟 Community

Saturdays AI
DiverTLes
Grupo de Ingeniería Lingüística
Proyecto ILENIA
Sociedad Española de Procesamiento de Lenguaje Natural (SEPLN)
LatinX in AI
Mujeres Tech
Instituto de Ingeniería del Conocimiento
AI TINKERERS

🤗 Connect!

To stay up to date with all events and progress: