MSA Team Wins 2022 Data 4 Good Case Competition

By Shelley Wunder-Smith

Four students from the MSA program won first place in Purdue University’s 2022 Data 4 Good Case Competition, surpassing more than 150 other teams.

The four team members (all MSA 23) used data to solve a problem related to image captioning in multiple languages.

“We were excited about using data to solve the problem of businesses’ online visibility in other countries where English is not the native language,” Vaddi explained. “Prior research has shown that businesses advertising in these local languages often have poor online responses in the international market. This makes it difficult for them to perform well, economically speaking.”

The teams participating in the competition were asked to employ a data-based approach using image-captioning models to generate advertisement captions in three target languages: Hausa (spoken in West and Central Africa), Thai (the primary language of Thailand), and Kyrgyz (the national language of Kyrgyzstan).

The MSA team created a model using Meta’s NLLB translator and OpenAI’s Multilingual CLIP model, both of which were state-of-the-art when the competition took place. They also used a number of Hugging Face models. (Hugging Face is an open-source platform that enables developers to collaborate around machine learning and AI.)

“The judges seemed impressed with our approach and our use of pretrained models, instead of building models from scratch,” Vaddi noted. “The case competition helped us build greater confidence in our skills as data scientists.”

Visit the Data 4 Good Case Competition site for information on the 2024 challenge.