[Apologies for cross-posting]
*************************
Call for Participation
*************************
Task: *Homotransphobia Detection in Italian (HODI)* at EVALITA 2023
<https://www.evalita.it/campaigns/evalita-2023/>
Info: https://hodi-evalita.github.io/
Final Workshop: 7th - 8th September 2023, Parma, Italy
*Registration is required to obtain data and participate in the shared
task.*
To register, follow the instructions here:
https://hodi-evalita.github.io/how_to_participate/
-----------------------------------------------------
🌈 The HODI Shared Task 🌈
-----------------------------------------------------
We invite researchers to participate in the first shared task on
homotransphobia detection in Italian (HODI). Despite the NLP community’s
interest in hate speech detection datasets and models, very few studies
have covered homotransphobia. This is a concern, given the target-oriented
nature of hate speech: recent studies have revealed that hate speech
detection methods cannot be generalized across multiple kinds of hate speech targets.
HODI is organized according to two main subtasks:
** Subtask A - Homotransphobia detection:** the objective is to detect if a
text is homotransphobic or not.
** Subtask B - Explainability:** the objective is to extract the rationales
of the classification models trained for Subtask A.
Further details on the task, data, and evaluation are available at the task
website: https://hodi-evalita.github.io/
-----------------------
Important Dates
-----------------------
- 7 Feb 2023: Training data available (training period starts)
- 2 May 2023: Test data available
- 9 May 2023: System results due to organizers
- 30 May 2023: Results notification to participants
- 14 Jun 2023: Technical report due to organizers
- 10 Jul 2023: Reviews to participants (peer reviews)
- 25 Jul 2023: Camera ready due to organizers
- 7-8 Sep 2023: EVALITA Workshop
----------------
Organizers
----------------
Debora Nozza, Bocconi University
Greta Damo, Bocconi University
Alessandra Teresa Cignarella, University of Turin
Tommaso Caselli, University of Groningen
Viviana Patti, University of Turin
--
Tommaso Caselli, Ph.D.
Senior Assistant Professor in Computational Semantics
Faculty of Arts, Rijksuniversiteit Groningen
The Netherlands
----------------------------
https://xs4all.academia.edu/TommasoCaselli
https://www.researchgate.net/profile/Tommaso_Caselli
Twitter: @tommaso_caselli
IberLEF 2023 Task ClinAIS: Automatic Identification of Sections in Clinical Documents
Website: https://ixa2.si.ehu.eus/clinais/
ClinAIS will be organized as part of IberLEF 2023, at the SEPLN 2023 Conference (Jaén, September 2023).
The ClinAIS task presented at IberLEF 2023 aims to tackle the problem of automatic identification of sections in unstructured Spanish clinical documents. The task is focused on identifying 7 predefined medical sections: Present Illness, Derived from/to, Past Medical History, Family history, Exploration, Treatment and Evolution in ECNs, mainly progress notes.
The successful resolution of this task will enable the improvement of higher level applications that can extract valuable, actionable information from clinical documents, such as medical entity recognition, patient cohort retrieval, and temporal relation extraction. This will ultimately improve patient care and clinical decision-making.
IMPORTANT DATES
- March 2023 Release of Train + Dev Sets and Evaluation Library
- April 2023 Release of Test and Background Set
- May 2023 Submission of Results
- May 2023 System Paper Submission Deadline
- June 2023 Notification to Authors
- June 2023 Camera Ready Submission Deadline
- September 2023 Publication of Proceedings.
- September 2023 IberLEF within SEPLN 2023.
ORGANIZERS
IOMED
María Vivó NLP Data Scientist
Paula Chocrón NLP Engineer & Researcher
Gabriel de Maeztu Co-founder & CTO at IOMED
HiTZ Center
Iker de la Iglesia Research Scientist
Aitziber Atutxa Professor at UPV/EHU
Koldo Gojenola Professor at UPV/EHU
Esther Miranda Technical Staff
Contact: ixa.iomed-clinais(a)ehu.es
Registration: https://ixa2.si.ehu.eus/clinais/registration
Dear colleagues,
The Field Matters 2023 workshop has extended its submission deadline: we accept papers until February 23.
The Second Workshop on NLP Applications to Field Linguistics (Field Matters 2023) will take place at EACL 2023 (https://2023.eacl.org/) in Dubrovnik, Croatia on May 5 or 6 (online participants are also welcome).
We accept papers on the following topics:
- Application of NLP to field linguistics workflow;
- Transfer learning for under-resourced language processing;
- The use of fieldwork data to build NLP systems;
- Modeling morphology and syntax of typologically diverse languages in the low-resource setting;
- Speech processing for under-resourced languages;
- Computational analysis of field linguistics datasets;
- Using technology for preserving culture via language;
- Improving ways of interaction with Indigenous communities;
- Machine-readable field linguistic datasets.
You can find more information on the submission process and format requirements on our website: https://field-matters.github.io/cfp2023
Follow us on Twitter for updates: https://twitter.com/field_matters
Best regards,
Anna Postnikova
Field Matters workshop organizing committee
Applications are invited for a 1-year research scholarship in English Linguistics at the Department of Foreign Languages and Literatures, University of Verona, within the project “Interconnected Nord-Est Innovation Ecosystem (iNEST)”, financed by NextGenerationEU in the context of the National Recovery and Resilience Plan (NRRP).
The project entails the compilation and analysis of a corpus of English-language texts promoting and describing prominent urban and extra-urban destinations in the Veneto region of Italy, as well as the creation of a set of guidelines for tourism promotion. Applicants are expected to have experience in corpus linguistics, tourism discourse and discourse analysis, among other areas.
The closing date for applications is March 6th.
For more information about the research project, requirements and application process, you can find the complete call at this link:
https://www.univr.it/en/job-vacancies/assegnisti-di-ricerca/assegni-di-rice…
Best wishes,
Valeria Franceschi
----------------------------------------
Valeria Franceschi
Temporary Assistant Professor - English Language and Translation (L-LIN/12)
Department of Foreign Languages and Literatures
University of Verona
CALL FOR PARTICIPATION
IberLEF 2023 Task - PoliticEs: Political ideology detection in Spanish texts
Held as part of the evaluation forum IberLEF 2023
<https://sites.google.com/view/iberlef-2023> in the XXXIX edition of the
International Conference of the Spanish Society for Natural Language
Processing (SEPLN 2023 <http://sepln2023.sepln.org/en/home/>)
September 26, 2023. Jaén, Andalusia, Spain
Codalab link: https://codalab.lisn.upsaclay.fr/competitions/10173
Dear All,
We are inviting researchers and students to participate in the shared
task PoliticEs 2023: Political ideology detection in Spanish texts, held
as part of IberLEF 2023, the shared evaluation campaign for Natural
Language Processing systems in Spanish and other Iberian languages,
co-located with the SEPLN 2023 Conference.
The goal of this task is to extract political ideology information from
Spanish texts. For this, an automatic document classification task on
clusters of texts is proposed. It consists of extracting the self-assigned
gender and profession as demographic traits, and the political ideology as
a psychographic trait from a set of texts written in Spanish from several
authors that share those traits. Political ideology is considered as a
binary and as a multiclass problem. The PoliticEs 2023 shared task is based
on a previous task named PoliticEs 2022 presented at IberLEF 2022
(García-Díaz et al., 2022b), where the dataset was an extension of the
PoliCorpus 2020 dataset (García-Díaz et al., 2022a). The novelty of this
year is that, to prevent legal and ethical issues, participants will not
profile individual users but will instead work with clusters of texts
written by different users who share the same traits.
The participants will be provided with development, development_test,
training and test datasets in Spanish from an extension of the PoliCorpus
2020 (García-Díaz et al., 2022a) and the corpus used for the PoliticEs 2022
shared task (García-Díaz et al., 2022b). The dataset was collected between
2020 and 2022 from the Twitter accounts of politicians, political
journalists and celebrities in Spain using the UMUCorpusClassifier
(García-Díaz et al., 2020). We automatically created clusters of texts by
mixing some of the extracted tweets, in order to avoid the ethical and
privacy issues of author profiling on Twitter. Each cluster is composed
of 80 tweets written by different users that share all the traits under
evaluation. We labeled each cluster with the self-assigned gender (male,
female), profession (politician, celebrity, journalist) and political
spectrum on two axes: binary (left, right) and multiclass (left,
moderate_left, moderate_right, right). Moreover, Twitter mentions of
the politicians were anonymised by replacing them with the token @user, and
mentions of other Twitter accounts were encoded as @user as well. Other
entities, such as political party references, were replaced with the
@political_party token. Consequently, the traits of a text cannot be guessed
trivially by reading a user's name and searching for information about them
on the Internet. The dataset is composed of approximately 2800 different
clusters.
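The anonymisation and clustering steps described above can be illustrated with a minimal Python sketch. It is not the organizers' pipeline: the function names, the mention pattern, the example party name and the trait tuple layout are all assumptions made for illustration, and the sketch does not check that the tweets in a cluster come from different users.

```python
import re
from collections import defaultdict

# Assumed pattern for a Twitter mention: "@" followed by word characters.
MENTION = re.compile(r"@\w+")

def anonymise(text, parties=()):
    """Replace account mentions with @user and listed party names with @political_party."""
    # Mentions are replaced first so the @political_party token is not rewritten to @user.
    text = MENTION.sub("@user", text)
    for party in parties:
        text = re.sub(re.escape(party), "@political_party", text, flags=re.IGNORECASE)
    return text

def build_clusters(tweets, size=80):
    """Group (text, traits) pairs by trait tuple and cut each group into full clusters."""
    by_traits = defaultdict(list)
    for text, traits in tweets:
        by_traits[traits].append(text)
    clusters = []
    for traits, texts in by_traits.items():
        # Only complete clusters of `size` tweets are kept; leftovers are dropped.
        for start in range(0, len(texts) - size + 1, size):
            clusters.append({"traits": traits, "tweets": texts[start:start + size]})
    return clusters

# Hypothetical usage: "PartidoX" is an invented party name.
print(anonymise("Gracias @Juan_Perez por apoyar al PartidoX", parties=["PartidoX"]))
```

Here a trait tuple would be something like (gender, profession, binary ideology, multiclass ideology), so every tweet in a cluster carries the same labels.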
Finally, in order to facilitate participation in the competition, a
notebook with two baselines will be provided. The first one will be based
on BoW and the second one will be based on Transformers. To download the
data, the notebook and participate, go to
https://codalab.lisn.upsaclay.fr/competitions/10173.
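To give an idea of what a bag-of-words approach to this task might look like, here is a minimal Python sketch. It is not the baseline from the organizers' notebook: the nearest-centroid classifier, the whitespace tokenisation, the function names and the toy training texts are all assumptions chosen to keep the example self-contained.

```python
import math
from collections import Counter

def bow(text):
    """Bag of words: lowercase, whitespace-split token counts."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def train_centroids(examples):
    """Sum the bag-of-words counts of all training texts per label."""
    centroids = {}
    for text, label in examples:
        centroids.setdefault(label, Counter()).update(bow(text))
    return centroids

def predict(centroids, text):
    """Return the label whose centroid is most similar to the text."""
    vec = bow(text)
    return max(centroids, key=lambda label: cosine(vec, centroids[label]))

# Toy, invented examples; a real run would concatenate the tweets of each cluster.
train = [
    ("impuestos bajos libertad mercado", "right"),
    ("sanidad pública derechos sociales", "left"),
]
model = train_centroids(train)
print(predict(model, "más sanidad pública"))
```

The same functions could be trained once per trait (gender, profession, binary ideology, multiclass ideology), since each cluster carries one label for each.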
Best regards,
The PoliticEs 2023 organizing committee
References
- García-Díaz, J. A., Almela, Á., Alcaraz-Mármol, G., & Valencia-García,
  R. (2020). UMUCorpusClassifier: Compilation and evaluation of linguistic
  corpus for Natural Language Processing tasks. Procesamiento del Lenguaje
  Natural, 65, 139-142.
- García-Díaz, J. A., Colomo-Palacios, R., & Valencia-García, R. (2022a).
  Psychographic traits identification based on political ideology: An author
  analysis study on Spanish politicians’ tweets posted in 2020. Future
  Generation Computer Systems, 130(1), 59-74.
- García-Díaz, J. A., Jiménez Zafra, S. M., Martín Valdivia, M. T.,
  García-Sánchez, F., Ureña López, L. A., & Valencia García, R. (2022b).
  Overview of PoliticEs 2022: Spanish Author Profiling for Political
  Ideology. Procesamiento del Lenguaje Natural, 69, 265-272.
Important dates
- Release of development corpora: Feb 13, 2023
- Release of training corpora: Mar 13, 2023
- Release of test corpora and start of evaluation campaign: Apr 17, 2023
- End of evaluation campaign (deadline for runs submission): May 3, 2023
- Publication of official results: May 5, 2023
- Paper submission: May 29, 2023
- Review notification: Jun 17, 2023
- Camera ready submission: Jun 27, 2023
- IberLEF Workshop (SEPLN 2023): Sep 26, 2023 (Jaén, Andalusia, Spain)
- Publication of proceedings: Sep ??, 2023
Organizing committee
- José Antonio García-Díaz (UMUTeam, Universidad de Murcia)
- Salud María Jiménez-Zafra (SINAI, Universidad de Jaén)
- María-Teresa Martín Valdivia (SINAI, Universidad de Jaén)
- Francisco García-Sánchez (UMUTeam, Universidad de Murcia)
- L. Alfonso Ureña-López (SINAI, Universidad de Jaén)
- Rafael Valencia-García (UMUTeam, Universidad de Murcia)
Salud María Jiménez Zafra
sjzafra(a)ujaen.es
Universidad de Jaén
Grupo de Investigación SINAI <http://sinai.ujaen.es/> | Departamento de
Informática
EPS Jaén, Edificio A3, Despacho 219
Campus Las Lagunillas s/n 23071 - Jaén | +34 953212992
*** First Call for Papers ***
*** The 7th Workshop on Online Abuse and Harms (WOAH) ***
Website: https://www.workshopononlineabuse.com/
Important Dates
--------
- Submission due: May 2, 2023
- ARR reviewed submission due: May 22, 2023
- Notification of acceptance: May 26, 2023
- Camera-ready papers due: June 2, 2023
- Workshop: July 13, 2023
All deadlines are 11:59 PM AoE time.
Overview
--------
The Workshop on Online Abuse and Harms (WOAH) invites paper submissions from a wide range of fields, including natural language processing, machine learning, computational social sciences, law, politics, psychology, sociology and cultural studies. We explicitly encourage interdisciplinary submissions, technical as well as non-technical submissions, and submissions that focus on under-resourced languages. We also invite non-archival submissions and civil society reports.
The topics covered by WOAH include, but are not limited to:
- New models or methods for detecting abusive and harmful online content;
- Biases and limitations of existing detection models or datasets for abusive and harmful online content, particularly those in commercial use;
- New datasets and taxonomies for online abuse and harms;
- Dynamics of online abuse and harms, as well as their impact on different communities;
- Social, legal, and ethical implications of detecting, monitoring and moderating online abuse.
In addition, we invite submissions related to the theme for this seventh edition of WOAH, which will be *subjectivity and disagreement in abusive language data*. Hate speech and other forms of abuse are highly subjective. By choosing this theme, we want to encourage submissions that analyse, address or make use of this subjectivity. To match the theme and complement thematic submissions, we have invited a strong lineup of relevant speakers.
Submission Guidelines
--------
Submission is electronic, using the Softconf START conference management system.
Submission link: https://softconf.com/acl2023/WOAH/
The workshop will accept three types of papers.
1) Academic Papers (long and short): Long papers of up to 8 pages, excluding references, and short papers of up to 4 pages, excluding references. Unlimited pages for references and appendices. Accepted papers will be given an additional page of content to address reviewer comments. Previously published papers cannot be accepted.
2) Non-Archival Submissions: Up to 2 pages, excluding references, to summarise and showcase in-progress work and work published elsewhere.
3) Civil Society Reports: Non-archival submissions, with a minimum of 2 pages and no upper limit. Can include work published elsewhere.
All submissions must use the official ACL 2023 style files. Submissions that do not conform to the required styles, including paper size, margin width, and font size restrictions, will be rejected without review. All submissions should adhere to the workshop policies https://www.workshopononlineabuse.com/policies.html.
All submissions, except for civil society reports, must be fully anonymised. Self-references that reveal the author's identity, e.g., "We previously showed (Smith, 1991) ...", should be avoided. Instead, use citations such as "Smith previously showed (Smith, 1991) ...".
Following the ACL 2023 guidelines, we believe that it is also important to discuss the limitations of your work, in addition to its strengths. The “Limitations” section will appear at the end of the paper, after the discussion/conclusions section and before the references, and will not count towards the page limit.
Multiple Submissions Policy
--------
The workshop allows for multiple submissions.
Papers that have been or will be presented at other venues may only be presented as non-archival. Papers that are presented at the main conference (ACL 2023) can be presented at the workshop as non-archival.
Organizers
--------
Yi-Ling Chung, The Alan Turing Institute
Aida Mostafazadeh Davani, Google
Debora Nozza, Bocconi University
Paul Röttger, University of Oxford
Zeerak Talat, Digital Democracies Institute, Simon Fraser University
Please send any questions about the workshop to organizers(a)workshopononlineabuse.com
The Natural Language Processing Section at the Department of Computer Science, Faculty of Science at University of Copenhagen is offering a PhD scholarship in Explainable Natural Language Understanding, as well as a postdoc position in Human-Centered Explainable Fact Checking with a start date of 1 September 2023. The application deadline is 1 March 2023.
Applications for the positions can be submitted here: https://jobportal.ku.dk/phd/?show=158207 (PhD position); https://jobportal.ku.dk/videnskabelige-stillinger/?show=158206 (postdoc position).
The Natural Language Processing Section provides a strong, international and diverse environment for research within core as well as emerging topics in natural language processing, natural language understanding, computational linguistics and multi-modal language processing. It is housed within the main Science Campus, which is centrally located in Copenhagen. The section came into effect on 1 January 2021 as a spin-off from the Machine Learning section, to which it still maintains close ties. Further information about research at the Department is available here: https://di.ku.dk/english/research/. The successful candidate will join Isabelle Augenstein’s Natural Language Understanding research group (http://www.copenlu.com/). The Natural Language Processing research environment at the University of Copenhagen is internationally leading, as evidenced, for example, by its ranking of 2nd in Europe according to CSRankings.
The positions are offered in the context of an ERC Starting Grant held by Isabelle Augenstein on ‘Explainable and Robust Automatic Fact Checking (ExplainYourself)’. The ERC Starting Grant is a highly competitive funding program by the European Research Council to support the most talented early-career scientists in Europe with funding for a period of 5 years for blue-skies research to build up or expand their research groups.
More information about the project can also be found at: http://www.copenlu.com/talk/2022_11_erc/
Informal enquiries about the positions can be made to Professor Isabelle Augenstein, Department of Computer Science, University of Copenhagen, e-mail: augenstein@di.ku.dk.
———
Isabelle Augenstein, PhD, Dr. Scient.
Full Professor
Head of the NLP Section
Department of Computer Science
University of Copenhagen
Universitetsparken 1, 2100 Copenhagen, Denmark
http://isabelleaugenstein.github.io/
The Natural Language Processing Section at the Department of Computer Science, Faculty of Science at University of Copenhagen is offering a PhD scholarship in Fair and Accountable Natural Language Processing, with a start date of 1 September 2023. The application deadline is 28 February 2023. Applications for the position can be submitted here: https://candidate.hr-manager.net/ApplicationInit.aspx?cid=1307&ProjectId=15…
The PhD fellowship is offered in the context of a project supported by the Carlsberg Foundation on employer images in job ads led by Pia Ingold and co-led by Isabelle Augenstein (https://www.carlsbergfondet.dk/da/Forskningsaktiviteter/Bevillingsstatistik…). The project team will further include one postdoctoral researcher (to be hired at the Department of Psychology) as well as external partners. The project will comprise studies using methods from experimental psychology, as well as analyses of two existing big datasets on job ads (one in Danish, one in German) using Natural Language Processing. The role of the PhD student to be recruited in this call will be to research fair and accountable Natural Language Processing methods, which can be used to understand what influences the employer images that organisations project in job ads.
Informal enquiries about the position can be made to Professor Isabelle Augenstein, Department of Computer Science, University of Copenhagen, e-mail: augenstein@di.ku.dk.
Call for Papers: 1st International Workshop on Disinformation and Toxic Content Analysis (DiTox 2023), September 13th, 2023
https://ditox.ait.ac.at/
In conjunction with the 4th biennial conference on Language, Data and Knowledge (LDK 2023) to be held in Vienna, Austria.
The spread of misinformation and disinformation not only affects people's perceptions and beliefs, but can also have a direct impact on democratic institutions, critical infrastructure, and lives and families. Most critically, it raises the more fundamental issue of what sources of information can be trusted at all, potentially calling into question our relationship of trust with traditional media. Because of these profoundly harmful effects, disinformation is seen as one of the most pressing problems of our time.
The weak definition of the research task of disinformation analysis and detection, as well as the enormous range in terms of the heterogeneity and multimodality of the data involved, make this an exceptionally challenging field of research. The complexity ranges from media tampering detection to text content analysis to large-scale information fusion to analyze disinformation trends. Maintaining a comprehensive overview is equally difficult.
The overall goal of this workshop is therefore to provide insights on how approaches from different domains can be used to address disinformation at a technical level, including AI/ML-based methods, visual analytics, and visualization approaches, as well as interdisciplinary approaches inspired by the social sciences (i.e., computational social science). To this end, we invite task-specific contributions, as well as large-scale integration approaches, demo and project presentations, to provide a comprehensive overview of the current state of the art in countering disinformation.
Topics:
Full Paper Submissions:
- Machine and Deep learning methods for disinformation (e.g., analysis, detection)
- Visual analytics and visualization approaches for disinformation
- Social network analysis (e.g., key actors, distribution patterns) including visualization approaches
- Graph algorithms for disinformation identification
- Natural language processing methods (e.g., content evaluation, toxicity, radicalization)
- AI-supported fact checking and detection of disinformation campaigns
- Identification of fabricated and manipulated content (e.g., deep fakes, generated text)
- Community detection and characterization in social networks (e.g., conspiracy theories, echo chambers)
- Bots characterization and detection
- Multimodal fake content detection
- Recommendation systems and disinformation
- AI uses, practices and tools in fact-checking journalism
- Qualitative and quantitative studies on disinformation
- Ethics and law in disinformation
Demo and Project Presentation (Short Paper Track, Poster Presentation):
- Demo presentations (e.g., fact checking tools, disinformation detection tools)
- Project platform presentations
- Project presentations
Important Dates:
- Paper submission: May 21st, 2023
- Notification: June 20th, 2023
- Camera-ready submission deadline: July 9th, 2023
- DiTox workshop: September 13th, 2023
Submission:
Submissions can be in the form of Long papers (9-12 pages) and Short papers (4-6 pages). All submission lengths are given including references. Accepted submissions will be published by ACL in an open-access conference proceedings volume, free of charge for authors. The reviewing process is single-blind; submissions should not be anonymised. The workshop will be hybrid (face-to-face and remote). At least one author of each accepted paper must register to present the paper at the workshop (either remotely or on-site). There will be no registration fee administered for participating in LDK 2023. Papers should be submitted via OpenReview at the following address: https://openreview.net/group?id=LDK/2023/Conference
Second Call for participation - shared task on Multilingual Grammatical Error Detection (MultiGED-2023) on Czech, English, German, Italian and Swedish
Official website for the shared task: https://github.com/spraakbanken/multiged-2023
UPDATE: Additional data for English (REALEC) is now available on the github page<https://github.com/spraakbanken/multiged-2023/tree/main/english> and on Codalab<https://codalab.lisn.upsaclay.fr/competitions/9784>.
The Computational SLA<https://spraakbanken.gu.se/en/compsla> working group invites you to participate in the first shared task on Multilingual Grammatical Error Detection, MultiGED-2023, which includes five languages: Czech, English, German, Italian and Swedish.
The aim of this shared task is to detect tokens in need of correction across five different languages, labeling them as either correct ("c") or incorrect ("i"), i.e. performing binary classification at the token level. You can work on one of the provided languages or any combination of languages.
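As an illustration of the token-level labelling described above, the following Python sketch scores a predicted label sequence against a gold one, treating "i" as the positive class. This is only a sketch under stated assumptions: the function name, the choice of an F-beta score with beta = 0.5 and the example labels are invented here, and the official metric and file format are the ones defined on the task page.

```python
def token_scores(gold, pred, positive="i", beta=0.5):
    """Token-level precision, recall and F-beta for the positive ("incorrect") label."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must contain one label per token")
    pairs = list(zip(gold, pred))
    tp = sum(g == positive and p == positive for g, p in pairs)
    fp = sum(g != positive and p == positive for g, p in pairs)
    fn = sum(g == positive and p != positive for g, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    b2 = beta * beta
    denom = b2 * precision + recall
    f_beta = (1 + b2) * precision * recall / denom if denom else 0.0
    return precision, recall, f_beta

# Hypothetical five-token sentence: one label per token, "c" correct, "i" incorrect.
gold = ["c", "i", "i", "c", "c"]
pred = ["c", "i", "c", "i", "c"]
print(token_scores(gold, pred))
```

In this example one erroneous token is found, one is missed and one correct token is flagged by mistake, so precision, recall and the F-score all come out at 0.5.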
More details about the task: https://github.com/spraakbanken/multiged-2023
The shared task is part of the NLP4CALL workshop<https://spraakbanken.gu.se/en/research/themes/icall/nlp4call-workshop-serie…>, which will take place on 22 May 2023, co-located with the NoDaLiDa conference<https://www.nodalida2023.fo/> to be held in the Faroe Islands. Accepted papers with systems descriptions will be published in the workshop proceedings and double-published through the ACL anthology.
Timeline:
* 23 January, 2023 - first call for participation. Training and validation data released, CodaLab opens for team registrations.
* 14 February, 2023 - second call/reminder
* 27 February, 2023 - test data released
* 03 March, 2023 - system submission deadline (system output)
* 10 March, 2023 - results announced
* 03 April, 2023 - paper submission deadline with system descriptions. We encourage you to share models, code, fact sheets, extra data, etc. with the community through github or other repositories on paper publication.
* 21 April, 2023 - paper reviews sent to the authors
* 01 May, 2023 - camera-ready deadline
* 22 May, 2023 - presentations of the systems at NLP4CALL workshop
To register for/express interest in the shared task, please fill in this form<https://forms.gle/DgwTNmTCQhsmrbxq6>.
To ask questions and to get important information and updates about the shared task, please join the MultiGED-2023 Google Group<https://groups.google.com/g/multiged-2023>.
Official system evaluation will be carried out on CodaLab<https://codalab.lisn.upsaclay.fr/competitions/9784>.
Organizers:
* Elena Volodina<https://spraakbanken.gu.se/en/about/staff/elena>, University of Gothenburg, Sweden
* Chris Bryant<https://www.cst.cam.ac.uk/people/cjb255>, University of Cambridge, UK
* Andrew Caines<https://www.cl.cam.ac.uk/~apc38/>, University of Cambridge, UK
* Orphee De Clercq<https://research.flw.ugent.be/nl/orphee.declercq>, Ghent University, Belgium
* Jennifer-Carmen Frey<https://www.eurac.edu/en/people/jennifer-carmen-frey>, EURAC Research, Italy
* Elizaveta Ershova, JetBrains, Cyprus
* Alexandr Rosen<http://utkl.ff.cuni.cz/~rosen/>, Charles University, Czech Republic
* Olga Vinogradova, Independent researcher, Israel
Please feel free to forward this call to those who might be interested.
___________________
Elena Volodina, PhD, Docent
https://spraakbanken.gu.se/en/about/staff/elena
Life is like a mirror. Smile at it and it smiles back at you.
Peace Pilgrim