Hackathon SomosNLP 2024: #Somos600M

Let's create a corpus that represents the 600M Spanish-speaking people and standardize how to evaluate our LLMs.


There are 600M Spanish speakers. Let’s give this widely spoken and rich language fair representation in the field of NLP. Participate in the SomosNLP Hackathon, an international online competition whose main objective is to create open NLP resources in Spanish and co-official languages.

The best part? EVERYONE can collaborate! 🎉

🚀 Our goals

The democratization of NLP in Spanish is our main goal at SomosNLP, and we believe that one of the best ways to move towards this goal is by promoting the creation of open NLP resources in our language.

In past editions, we welcomed more than 1,000 participants from 30 countries, surpassed 20,000 views of our events, learned from 20 speakers, and developed 50 projects related to the UN’s Sustainable Development Goals, demonstrating the potential of NLP to address social challenges. We’re back for all that and much more! 💪

In this third edition, we join the revolution of LLMs and continue setting high-impact goals:

  1. 🌎 Create the largest high-quality instruction corpus that represents the different varieties of Spanish and allows us to train inclusive models.
  2. ✅ Create the first public leaderboard of LLMs in Spanish, which will allow us to standardize how we evaluate and compare models in Spanish and co-official languages.

Join the largest open-source Natural Language Processing hackathon in Spanish now! 🚀

How can you collaborate?

💻 Participate in the hackathon

By joining this hackathon, you will have the opportunity to collaborate in creating quality and inclusive LLMs in your language. Apply your knowledge to overcome the challenges of each stage of your LLM’s development: corpus creation, training, and evaluation.

Each participating team (1-5 people) will generate an instruction corpus, train their LLM, and create a demo to share their great work with the community.

At SomosNLP, we want to encourage you to participate regardless of your current knowledge. We will organize practical workshops and mentoring sessions so that both research institute groups and undergraduate student groups can participate. Every project counts!

💡 Attend specialist talks

At SomosNLP, we believe that training is also a way to contribute to the future of NLP in Spanish. Every Tuesday in March, professionals from the world of Natural Language Processing will give keynotes. These events are free and open to everyone.

And until March arrives? The recordings of previous talks are available on our YouTube channel! This is a great opportunity to learn from experts and get inspired for your own projects.

🤗 Join the team organizing it

Being part of the organizing team is a unique experience that allows you to contribute directly to the success of the hackathon. You will work closely with experts in the field, learn about the latest trends in NLP, and help create an inclusive and diverse community.

If you are interested in joining the organizing team, please fill out the form below, and we will contact you with more information.

🔊 Propose a talk

Do you have expertise in NLP or a related field? Share your knowledge with the community by proposing a talk for the hackathon. This is a great way to contribute to the education of participants and the growth of NLP in Spanish.

Please fill out the form with your proposal, and we will get back to you with more details.

🧑‍🏫 Offer mentoring

Mentors play a crucial role in guiding teams through the hackathon process, from ideation to implementation. If you have experience in NLP and want to help teams succeed, consider becoming a mentor.

Your guidance can make a significant difference in the outcome of projects and the learning experience of participants.

📚 Donate a dataset

Datasets are the foundation of NLP projects. By donating a dataset, you contribute valuable resources that can help teams develop innovative solutions. If you have a dataset that could be useful for the hackathon, please consider donating it.

Your contribution will be acknowledged, and you will be helping to advance NLP in Spanish.

🙌 Sponsor this wonderful event

Your support as a sponsor will help us make the hackathon a success and contribute to the development of NLP in Spanish. Sponsors have the opportunity to gain visibility in the NLP community, connect with talented individuals, and demonstrate their commitment to advancing technology in Spanish-speaking countries.

For more information on sponsorship opportunities, please visit our website or contact us directly.


👏 Acknowledgments

Thank you very much for your time and for supporting us so that our initiative can reach further. Let’s democratize NLP in Spanish!

Gold Sponsors

Argilla

Hugging Face

Instituto de Ingeniería del Conocimiento

Calamo&Cran

LenguajeNatural.AI

Impulse Data & AI Conference

Universidad de Puerto Rico

Yamato

Community Sponsors

AlexFocus

Mujeres Tech

Proyecto ILENIA

Sociedad Española para el Procesamiento del Lenguaje Natural

DiverTLes

Saturdays AI

Women Tech Global Conference

Spain AI

Big Onion

Universidad Nacional de Loja

🤗 Information

To stay up to date with all events and progress: