Shared Task on Code-mixed Less-resourced Sentiment Analysis for Indo-Aryan Languages - SIGUL

18 Jan 2024


Code-mixing, the dynamic interplay of multiple languages within a single
discourse, is a widespread linguistic phenomenon in multilingual
societies. It is particularly intriguing when it occurs between closely
related languages.
We invite you to participate in our shared task at the WILDRE workshop,
which is co-located with LREC-COLING 2024. This shared task addresses the
complexities of code-mixed data from less-resourced similar languages for
sentiment analysis. We will provide annotated data for the following
code-mixed languages:
1. Magahi-Hindi-English
2. Bangla-English-Hindi
3. Hindi-English
The evaluation will be conducted in two tracks:
*A. Track 1:* Given training and validation data, determine a comment's
polarity (positive, negative, neutral, or mixed) in the same code-mixed
setting.
1. Hindi-English
2. Magahi-Hindi-English
3. Bangla-English
4. All language pairs combined (1+2+3)
*B. Track 2:* Given unlabelled test data for code-mixed Maithili
(Maithili-Hindi-English), leverage any or all of the Track 1 training
datasets to determine the sentiment of a comment in the target language.
Important Links:
- Registration Link: https://forms.gle/HVRK1W1hHqBwtgpu6
- WILDRE Workshop Link: http://sanskrit.jnu.ac.in/conf/wildre7/index.jsp
- GitHub: https://github.com/wildre-workshop/wildre-7_code-mixed-sentiment-analysis
Important Dates:
- Dec 22, 2023: Registration
- Jan 10, 2024: Training and Validation Dataset Release [to get the data,
  please register]
- Feb 15, 2024: Test Set Release
- Feb 23, 2024: System Submission Due
- Feb 29, 2024: System Results
- March 15, 2024: System Description Paper Due
- March 28, 2024: Notification of Paper Acceptance