Call for participation IberLEF 2023 Task - FinancES. Financial Targeted Sentiment Analysis in Spanish - Corpora

13 Feb 2023


      CALL FOR PARTICIPATION
IBERLEF 2023 Task - FinancES. Financial Targeted Sentiment Analysis in
Spanish
Held as part of IberLEF 2023 (https://sites.google.com/view/iberlef-2023),
a shared evaluation campaign for Natural Language Processing (NLP) systems
in Spanish and other Iberian languages
September 26th, 2023, Jaén
Codalab link: https://codalab.lisn.upsaclay.fr/competitions/10052
Dear All,
We invite researchers and students to participate in the shared task
FinancES: Financial Targeted Sentiment Analysis in Spanish, held as part of
IberLEF 2023, a shared evaluation campaign for Natural Language Processing
(NLP) systems in Spanish and other Iberian languages.
This shared task aims to explore targeted sentiment analysis in the
financial domain. Specifically, the approach adopted here is grounded in
the field of microeconomics. In this regard, Bowles (2004) explains the
role of economic agents, that is to say, individuals or organizations
impacting the economy. The author states that the main microeconomic agents
in the capital market are consumers (households/individuals), companies
(firms), governments, and central banks. Consequently, in order to develop
a sentiment analysis method that considers different viewpoints, three
perspectives are included: (1) the economic target of the news item; (2)
companies, as an individual economic agent; and (3) consumers, as an
individual economic agent. The target is the sector to which the economic
fact applies, and companies produce the goods and services that
households/individuals consume. From each of these three viewpoints, the
impact of the news item on the target and on the economic agents is labeled
as positive, negative, or neutral. Two tasks are proposed. The first
combines the challenges of aspect-term extraction, for identifying the
target entity in the text, and aspect-based sentiment classification, for
determining the sentiment polarity towards the target. The second is
devoted to assessing the impact of a news headline on the other two
economic agents, namely companies and consumers.
Participants will be provided with development, development_test, training
and test datasets in Spanish. The dataset for this task is composed of news
headlines written in Spanish collected from digital newspapers specialized
in economic, financial and political news. The dataset is labeled with the
target entity and the sentiment polarity on three dimensions: target,
companies, and consumers. That is, given a headline, it has been manually
classified as positive, neutral, or negative for three specific entities:
(1) target entity (i.e., the specific company or asset where the economic
fact applies), (2) companies (i.e., the entities producing the goods and
services that others consume), and (3) consumers (i.e.,
households/individuals). Each headline was annotated by three members of
the organizing committee. In case of disagreement, the annotators discussed
the case and, if no agreement was reached, the headline was discarded. In a
first step, we compiled about 14k headlines; very short headlines and those
that did not specify a target entity were filtered out. The final dataset
is composed of 8k-10k news headlines. For the shared tasks, training and
test sets will be released with an 80%-20% split.
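As a rough illustration of the annotation scheme and the 80%-20% split
described above, the following Python sketch shows one possible in-memory
layout for a labeled headline. The class name, field names, and the example
headline are assumptions made for illustration only; they are not the
official column names or data of the released corpus.

```python
import random
from dataclasses import dataclass


@dataclass
class LabeledHeadline:
    """Hypothetical record layout; field names are NOT the official ones."""
    text: str                 # the news headline in Spanish
    target: str               # target entity mentioned in the headline
    target_sentiment: str     # "positive", "neutral" or "negative"
    companies_sentiment: str  # impact on companies
    consumers_sentiment: str  # impact on consumers


def split_80_20(items, seed=0):
    """Shuffle a copy of the items and return (train, test) in an 80%-20% split."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    cut = len(shuffled) * 4 // 5
    return shuffled[:cut], shuffled[cut:]


# Invented example, only to show the three sentiment dimensions side by side.
example = LabeledHeadline(
    text="La empresa X anuncia una subida de precios",
    target="empresa X",
    target_sentiment="positive",
    companies_sentiment="neutral",
    consumers_sentiment="negative",
)

train, test = split_80_20(range(10))
print(len(train), len(test))
```

The official training and test sets are released by the organizers, so a
split like this would only be useful for local experiments on the
development data.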
To facilitate participation in the competition, a Google Colab notebook is
also provided. The notebook shows how to load the development dataset and
how to train a baseline for both tasks, based on spaCy for identifying the
main economic target and on a Bag-of-Words (BoW) model with linear
regression for the polarity of each dimension. It also shows how to
calculate the final F1-score of each task and how to generate the final
submission file. To download the data and the notebook and to participate,
go to
https://codalab.lisn.upsaclay.fr/competitions/10052#participate.
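For readers who want to check a polarity classifier locally, the following
is a minimal sketch of an F1 computation over the three polarity labels. It
assumes macro-averaging across the labels, which is an assumption on our
part here; the exact metric used for the ranking is the one defined in the
notebook and on the Codalab page. The function name macro_f1 is hypothetical.

```python
def macro_f1(gold, predicted, labels=("positive", "neutral", "negative")):
    """Macro-averaged F1 over the given labels (assumed metric, see lead-in)."""
    scores = []
    for label in labels:
        pairs = list(zip(gold, predicted))
        tp = sum(1 for g, p in pairs if g == label and p == label)
        fp = sum(1 for g, p in pairs if g != label and p == label)
        fn = sum(1 for g, p in pairs if g == label and p != label)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        if precision + recall:
            scores.append(2 * precision * recall / (precision + recall))
        else:
            # A label that is never predicted correctly contributes 0.
            scores.append(0.0)
    return sum(scores) / len(scores)


gold = ["positive", "neutral", "negative", "positive"]
pred = ["positive", "neutral", "positive", "positive"]
print(round(macro_f1(gold, pred), 3))
```

The same function can be applied separately to the target, companies, and
consumers dimensions, since each one carries its own polarity label.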
Best regards,
The FinancES 2023 organizing committee
References
- Bowles, S. (2004). Microeconomics: Behavior, institutions, and
  evolution. Princeton University Press.
Important dates
- Release of development corpora: Feb 13, 2023
- Release of training corpora: Mar 13, 2023
- Release of test corpora and start of evaluation campaign: Apr 17, 2023
- End of evaluation campaign (deadline for runs submission): May 3, 2023
- Publication of official results: May 5, 2023
- Paper submission: May 28, 2023
- Review notification: Jun 16, 2023
- Camera ready submission: Jul 6, 2023
- IberLEF Workshop (SEPLN 2023): Sep 26, 2023
- Publication of proceedings: Sep ??, 2023
Organizing committee
- José Antonio García-Díaz (UMUTeam, Universidad de Murcia)
- Ángela Almela Sánchez-Lafuente (UMUTeam, Universidad de Murcia)
- Francisco García-Sánchez (UMUTeam, Universidad de Murcia)
- Gema Alcaraz-Mármol (UMUTeam, Universidad de Castilla La Mancha)
- María José Marín (UMUTeam, Universidad de Murcia)
- Rafael Valencia-García (UMUTeam, Universidad de Murcia)