#HackathonSomosNLP 2025

We are going to drive the creation of language models aligned with the culture of LATAM countries and the Iberian Peninsula.


There are 600M Spanish speakers and 265M Portuguese speakers in the world. Spanish and Portuguese are the main languages in 29 countries, each with a great cultural richness. Although language models show increasing multilingual capabilities, are they truly multicultural? Join the #HackathonSomosNLP now, the largest open-source hackathon for Natural Language Processing in Spanish and Portuguese 🚀

In previous editions we had a total of more than 1500 participants from 30 different countries, exceeded 20,000 views of our events, learned from 20 speakers, and developed 50 projects related to the UN Sustainable Development Goals, demonstrating the potential of NLP to address social challenges. We are back for all that and much more! 💪

In this fourth edition we will focus on creating resources that allow us to evaluate and improve the cultural adequacy of large language models for each of the LATAM countries and the Iberian Peninsula.

The best part? EVERYONE can contribute! 🎉

Here are the links to all the forms, keep reading for more information.

GIF Hackathon #Somos600M

(In Portuguese, in English)

🚀 How you can contribute

Click on each of the following options for more information:

💻 Create a language model aligned with your culture

By joining this hackathon you will have the opportunity to develop and apply your LLM training knowledge to create quality and inclusive models in your language. You will have access to state-of-the-art model APIs, the chance to win prizes, participate in raffles, attend talks, workshops and mentoring sessions, publish a paper… Sign up now!

Each participating team (1-5 people) will generate a dataset, align an LLM, and create a demo to share their great work with the community. It is also possible to contribute only to the dataset.

At SomosNLP we want to encourage you to participate regardless of your current knowledge. We will organize practical workshops and mentoring sessions so that both research institute groups and undergraduate student groups can participate – all projects count!

💡 Attend talks by specialists

At SomosNLP we believe that training yourself is also a way to contribute to the future of NLP in Spanish. During the Tuesdays of April, various keynotes will take place given by professionals from the world of Natural Language Processing. These events are free and open to everyone.

And until April arrives? Recordings of previous talks are available!

💻 Register now
🧑‍🏫 Offer a mentorship

Share your experience and knowledge by supporting participating teams in creating quality databases and training a good LLM. You can provide a one-time or ongoing mentorship. Think about your strengths and offer a mentorship!

🧑‍🏫 Offer a mentorship
🙌 Sponsor this wonderful event

SomosNLP is a non-profit community, we seek donations, prizes, and visibility to achieve our ambitious goals and bring language models closer to the Spanish-speaking world. All help is welcome, discover how you can support our mission by offering visibility, vouchers, and donations. We count on you!

🙌 Sponsor the hackathon
📣 Help us spread the word

Help us spread the word about the event in your network so that this initiative reaches more people – all support is welcome! Additionally, from 4 publications onward we will add your logo to the website in the “Community Sponsorships” section.

📣 Spread the word
🤗 Join the team

You can contribute by creating content, support resources (e.g., tutorials), writing articles, or researching Cultural NLP.

🤗 Join the team

💡 Talks and mentorships

You will have the opportunity to learn from leaders in academia and industry – we will be announcing new talks and mentorships!

👏 Acknowledgments

Thank you so much for your time and for supporting us so that our initiative reaches further. Let’s make language models more inclusive!

🚀 Organized by

SomosNLPCENIAUniversidad Politécnica de Madrid

💎 Platinum

Cohere For AI

🥇 Gold

Hugging Face

🥈 Silver

UPM - EunomiaMistralAI

🌟 Community

DiverTLes
Grupo de Ingeniería Lingüística
Proyecto ILENIA
Sociedad Española de Procesamiento de Lenguaje Natural (SEPLN)
LatinX in AI
Mujeres Tech
Instituto de Ingeniería del Conocimiento
AI TINKERERS
Piango Solutions

🤗 Connect!

To stay up to date with all events and updates: