The WNUT Workshop will be collocated with EACL 2024 (Malta). The website for
the workshop is at:
http://noisy-text.github.io/
The WNUT workshop focuses on core NLP tasks (e.g., POS/NER tagging and
translation; not computational social science) over user-generated text, such
as that found on social media, web forums, online reviews, digital health
records, or language learner essays.
We seek submissions of long and short papers on original and unpublished work
(same format and page limit as EACL main conference). All accepted
submissions will be presented as posters. Additionally, selected submissions
will be presented orally. There will be best paper awards for both short and
long papers.
Topics of interest include but are not limited to:
* NLP of noisy text, e.g. POS, NER tagging, parsing
* Text normalization and error correction
* Paraphrase identification and semantic similarity of short text or noisy
text
* Extracting user demographics, profiles, and major life events
* Machine translation and Multilingual NLP over noisy text
* Information extraction from noisy text, global and regional trend
detection, and event extraction
* Colloquial language, e.g. idiom detection
* Domain adaptation to user-generated text
* Detecting rumors, contradictory information, sarcasm and humor on social
media
* Sentiment analysis
* Temporal aspects of user-generated content (resolving time expressions,
concept drift, etc.)
* Representing and mining language variation in user-generated content
* Processing of automatically generated data
* Robustness to Noise, both Natural and Adversarial
[IMPORTANT DATES]
* Submission Deadline: December 18, 2023 (anywhere on Earth; dual-submission
allowed)
* Acceptance Notification: January 20, 2024
* Camera-Ready Deadline: January 30, 2024
* Workshop Day: March 21/22, 2024
[INVITED SPEAKERS]
* Su Lin Blodgett
* Jennifer Foster
[ORGANIZERS]
* Tim Baldwin (University of Melbourne)
* Wei Xu (Georgia Institute of Technology)
* Alan Ritter (Georgia Institute of Technology)
* Rob van der Goot (IT University of Copenhagen)
* Max Müller-Eberstein (IT University of Copenhagen)
[SUBMISSION]
Submissions should conform to the ACL style guidelines. Long and short paper
submissions must be anonymized. Please submit your papers via OpenReview:
https://openreview.net/group?id=eacl.org/EACL/2024/Workshop/WNUT
Second Call for workshop papers and Shared Task participation: the 6th
workshop on Challenges and Applications of Automated Extraction of
Socio-political Events from Text - CASE @ EACL 2024
************************************************************************************
URL: https://emw.ku.edu.tr/case-2024/
Paper submission deadline: 18 December 2023
Paper acceptance notification: 20 January 2024
Paper camera-ready: 30 January 2024
Workshop dates: 21-22 March 2024
Softconf page of the workshop: https://softconf.com/eacl2024/CASE-2024/
************************************************************************************
We invite contributions from researchers in computer science, NLP, ML, DL,
AI, socio-political sciences, conflict analysis and forecasting, peace
studies, as well as computational social science scholars involved in the
collection and utilization of socio-political event data. This includes
(but is not limited to) the following topics:
1) Extracting events and their arguments such as time and location in and
beyond a sentence or document, event coreference resolution.
2) Research in NLP technologies in relation to event detection: geocoding,
temporal reasoning, argument structure detection, syntactic and semantic
analysis of event structures, text classification for event type
detection, learning event-related lexica, event co-reference resolution,
fake news analysis, and others with a focus on real or potential event
detection applications.
3) New datasets, training data collection, and annotation for event
information.
4) Event-event relations, e.g., subevents, main events, spatio-temporal
relations, causal relations.
5) Event dataset evaluation in light of reliability and validity metrics.
6) Defining, populating, and facilitating event schemas and ontologies.
7) Automated tools and pipelines for event collection related tasks.
8) Lexical, syntactic, semantic, discursive, and pragmatic aspects of event
manifestation.
9) Methodologies for development, evaluation, and analysis of event
datasets.
10) Applications of event databases, e.g. early warning, conflict
prediction, policymaking.
11) Estimating what is missing in event datasets using internal and
external information.
12) Detection of new and emerging SPE types, e.g. creative protests.
13) Release of new event datasets.
14) Bias and fairness of the sources and event datasets.
15) Ethics, misinformation, privacy, and fairness concerns pertaining to
event datasets.
16) Copyright issues on event dataset creation, dissemination, and sharing.
17) Cross-lingual, multilingual and multimodal aspects in event analysis.
18) Resources and approaches related to contentious politics around climate
change.
**** Shared tasks ****
We invite the community to participate in the shared task we organize and
consider working on data from our previous shared tasks in the scope of the
CASE workshop @ EACL 2024 (https://emw.ku.edu.tr/case-2024/).
Recent & Active Shared task:
*T1: Climate Activism Stance and Hate Event Detection*
*Codalab Link:* https://codalab.lisn.upsaclay.fr/competitions/16206
Registration: In order to register for the shared task, please send a
request in Codalab. The organizers will approve requests on a daily basis.
*GitHub Page:* https://github.com/therealthapa/case2024-climate
Previous shared tasks, whose data can be used for regular papers (no
official competition; please see the regular paper submission timeline):
PT1: Multilingual Protest News Detection
Contact person: Ali Hürriyetoğlu (ali.hurriyetoglu(a)gmail.com)
Github: https://github.com/emerging-welfare/case-2022-multilingual-event
PT2: Event Causality identification
Contact person: Fiona Anting Tan (tan.f(a)u.nus.edu)
Github: https://github.com/tanfiona/CausalNewsCorpus
PT3: Multimodal Hate Speech Event Detection
Contact person: Surendrabikram Thapa (surendrabikram(a)vt.edu)
Codalab page: https://codalab.lisn.upsaclay.fr/competitions/16203
Github: https://github.com/therealthapa/case2023_task4
Note: The organizers follow a specific timeline. Please see the Codalab
page.
*** Keynotes ***
We will continue our tradition of inviting keynote speakers from both
social and computational sciences. The up-to-date list of keynote speakers
will be announced soon.
*** Submission guidelines ***
This call solicits short and long papers reporting original and unpublished
research on the topics listed above. The papers should emphasize obtained
results rather than intended work and should indicate clearly the state of
completion of the reported results. The page limits and content structure
announced on the ACL ARR page (https://aclrollingreview.org/cfp) should be
followed for both short and long papers.
Papers should be submitted on the START page of the workshop
(https://softconf.com/eacl2024/CASE-2024/) in PDF format, in compliance with
the ACL author guidelines for publications
(https://acl-org.github.io/ACLPUB/formatting.html). The templates can be
found at https://github.com/acl-org/acl-style-files.
The reviewing process will be double-blind and papers should not include
the authors’ names and affiliations. Each submission will be reviewed by at
least three members of the program committee. The workshop proceedings will
be published in the ACL Anthology.
*** Workshop organizers ***
Ali Hürriyetoğlu, KNAW Humanities Cluster, the Netherlands,
ali.hurriyetoglu(a)gmail.com
Hristo Tanev, European Commission, Joint Research Centre (EU JRC), Italy
Erdem Yörük, Koc University, Turkey
Jatin Bedi, Thapar Institute of Engineering and Technology, Patiala, India
Surendrabikram Thapa, Virginia Tech, the USA
S. Angel Deborah, SSN College of Engineering, India
S. Rajalakshmi, SSN College of Engineering, India
Onur Uca, Mersin University, Turkey
Mark Lee, School of Computer Science, University of Birmingham, United
Kingdom
Francielle Vargas, University of São Paulo, Brazil
Farhana Ferdousi Liza, University of East Anglia, the United Kingdom
Shruti Kulkarni, Oak Ridge National Laboratory, United States
Vivek Kumar, University of the Bundeswehr Munich, Germany
Milena Slavcheva, IICT, Bulgarian Academy of Sciences, Bulgaria
Guneet Singh Kohli, Thapar University, India
Vanni Zavarella, University of Cagliari, Italy
Two open positions: deadline Dec. 10th 12.00 CET
(1) Research Fellowship - Assegno di Ricerca I Fascia (Requirements: Master's Degree) - https://pica.cineca.it/uniroma2/f1-2023-0098/
(2) Post-doc Research Fellowship - Assegno di Ricerca II Fascia (Requirements: Ph.D. Degree) - https://pica.cineca.it/uniroma2/f2-2023-0026/
If you want to apply, use the above links and, possibly, inform fabio.massimo.zanzotto(a)uniroma2.it.
As you may know, we offer:
- an uncompetitive salary
- no extra-benefits
- no clear career path
Yet, YOU can help us shape ways schools may integrate these disruptive LLMs to prepare "biological brains" for the vibrating future.
These positions are within an Italian Research Project of National Interest (PRIN): "Class-tAIs: Artificial Intelligence and multi-brain connectivity as a buddy to Enhancing Competencies in students"
Positions will start early next year.
Lab: Human-centric Art at the University of Rome Tor Vergata (Italy)
Follow us on our newly established X account: @HumanCentricArt
https://twitter.com/HumanCentricArt
Dear colleagues,
My group is currently seeking to fill two new positions (details below).
I would really appreciate it if you could share this with potentially
interested candidates or lists.
Best regards,
Martin
*Context And Mission*
The Natural Language Processing for Biomedical Information Analysis
(NLP4BIA) group at BSC is an internationally renowned research group
working on the development of NLP, language technology, and text-mining
solutions applied primarily to biomedical and clinical data. It is a highly
interdisciplinary team, funded through competitive European and National
projects requiring the implementation of natural language processing and
advanced AI solutions making use of diverse technologies, including
Transformers and recent advances in Large Language Models (LLM) to improve
healthcare data analysis.
*Position 1*
*Reference:* 514_23_LS_NLPBIA_RE1
*Job title:* Junior Research Engineer - NLP for Biomedical Information
Analysis (RE1)
*URL:* https://www.bsc.es/join-us/job-opportunities/51423lsnlpbiare1
The NLP4BIA group at BSC is looking for a Software/Data Engineer or Full
Stack Developer with an interest in learning technical aspects of Natural
Language Processing, AI, and Language Models. The ideal candidate will be
responsible for advancing a cutting-edge NLP platform, leveraging the use
of state-of-the-art language technologies and NLP resources. This role
involves collaboration with hospitals, research teams, and experts on both
national and international scales to drive innovation in the field.
*Key Duties*
- Software Development: Create backend, frontend, and web services,
along with web-based demos for NLP tools.
- NLP platform Integration: Integrate existing NLP components into the
CogStack platform for processing clinical and biomedical content.
- Data Processing and Ingestion: Develop scripts for the ingestion,
cleaning, and formatting of text data to make it appropriate for neural
architectures.
- Data Management: Organize and maintain data repositories in alignment
with group and project requirements.
- Documentation and Reporting: Create technical reports and project
documentation in both English and Spanish.
- Automated Data Annotation: Use existing state-of-the-art NLP tools to
annotate data, improving operational efficiency autonomously.
In this role, the candidate will collaborate closely with the AI/NLP
experts of the team to define the technical requirements for running and
deploying NLP components, and will work with them on writing research
proposals and contributing to scientific papers. Further duties include
working with external teams to provide technical support related to tools
used for data annotation and platform deployment, ensuring seamless project
execution and interdisciplinary collaboration.
*Position 2*
*Reference:* 515_23_LS_NLPBIA_RE2
*Job title:* Research Engineer - NLP for Biomedical Information Analysis
(RE2)
*URL:* https://www.bsc.es/join-us/job-opportunities/51523lsnlpbiare2
The NLP4BIA-BSC is looking for a Research Engineer with experience in
Language Technologies and Deep Learning. The candidate will be involved in
technical work related to international projects, being part of a team of
researchers working on topics related to multilingual information
extraction in the clinical field, including Named-Entity Recognition,
Entity Linking and Language Modeling. The candidate will have the
opportunity to advance the state of the art of cross-lingual biomedical NLP
methods by working in a multidisciplinary environment alongside linguists,
medical experts, and other engineers.
*Key Duties*
- NLP model development: Development of multilingual information
extraction models in the biomedical field, including mention extraction and
linking of terms to controlled terminologies. Pre-training of cross-lingual
large language models for healthcare.
- Technical project coordination: Coordinate technical contributions
from different partners in technological projects.
- Documentation and Reporting: Create technical reports and project
documentation in both English and Spanish.
- Scientific writing: Collaborate in drafting technical research
proposals and writing scientific papers.
*About BSC*
The Barcelona Supercomputing Center - Centro Nacional de Supercomputación
(BSC-CNS) is the leading supercomputing center in Spain. It houses
MareNostrum, one of the most powerful supercomputers in Europe, was a
founding and hosting member of the former European HPC infrastructure PRACE
(Partnership for Advanced Computing in Europe), and is now hosting entity
for EuroHPC JU, the Joint Undertaking that leads large-scale investments
and HPC provision in Europe. The mission of BSC is to research, develop and
manage information technologies in order to facilitate scientific progress.
BSC combines HPC service provision and R&D into both computer and
computational science (life, earth and engineering sciences) under one
roof, and currently has over 900 staff from 55 countries.
2nd Call for abstracts UniDive 2nd general meeting, University of Naples
L'Orientale, Italy, 7-9 February 2024
UniDive <https://www.cost.eu/actions/CA21167/> is a COST action, i.e. a
scientific network, dedicated to universality, diversity and idiosyncrasy
in language technology. It is structured around 4 Working Groups:
- WG1: Corpus annotation
- WG2: Lexicon-corpus interface
- WG3: Multilingual and cross-lingual language technology
- WG4: Quantifying and promoting diversity
The second general meeting
<https://unidive.lisn.upsaclay.fr/doku.php?id=meetings:general_meetings:2nd_…>
of the action will take place on February 7-9, 2024 at the University of
Naples L'Orientale <http://www.unior.it/> in Italy.
We invite UniDive WG members to submit abstract proposals related to the
scientific program of the WGs.
Proposals may describe diverse types of contributions, according to 3
different tracks:
- Planned work
- Work in progress
- Complete work, also previously published
A proposal should be anonymous, written in English, and submitted in pdf only.
It should include (on the title page) the list of the relevant WGs, but in
the submission form, only one WG can be selected as the main one. It should
not exceed 2 pages, including figures and tables (bibliographic references
may go beyond the 2-page limit). If linguistic examples from languages
other than English are included, those should be glossed and translated
into English, and an extra half page is allowed for this purpose.
For the sake of uniformity and easing the reviewers' effort, we encourage
authors to use the Overleaf Latex template
<https://www.overleaf.com/read/yqbpxcbjmjjw>. Other formats (not
necessarily Latex-based) can also be used, provided that they conform to
the following specifications: A4 paper, 11pt font, 1in margins.
The submission link is:
https://openreview.net/group?id=UniDive/2024/General_Meeting
The reviewing process is double-blind. The selection of proposals will be
done by the UniDive Program Committee according to the following criteria:
- relevance to UniDive and the work program of its Working Groups (see pp.
18-20 of the Memorandum of Understanding),
- clarity,
- diversity of the languages covered by the workshop program.
The selected proposals will be presented at the 2nd UniDive general meeting
as posters and/or oral presentations.
At least one author per selected proposal will be reimbursed for their
travel and stay.
Important dates
- 26 October 2023: Call for abstracts
- 24 November 2023: Submission deadline
- 15 December 2023: Notification of acceptance
- 20 December 2023: Communication of the names of the presenters
- 12 January 2024: Final versions of abstracts
- 7-9 February 2024: UniDive 2nd general meeting
The time zone for all deadlines is anywhere on Earth (UTC-12). Due to the
tight schedule, no extension of the submission deadline is foreseen.
Program Chairs
- Victoria Bobicev, Technical University of Moldova (Moldova)
- Johanna Monti, University of Naples L'Orientale (Italy)
- Ranka Stanković, University of Belgrade (Serbia)
Dear All,
We are looking for PhD applicants for our Computational Social Science Group (https://css.cs.ut.ee/)! If you are interested, please write to us, and if you know someone who might be interested, please spread the word!
We are interested in topics related to misinformation, media bias, fairness and explainability in computational journalism and online social media.
We are also open to other topics if they overlap with the interests of our group.
Why join us?
====================
You'll be a part of Estonia's esteemed Institute of Computer Science (https://cs.ut.ee/en), University of Tartu, which is in the modern Delta Centre (https://delta.ut.ee/en/), a beacon of technological innovation and research excellence.
Relevant Information
====================
The gross salary is 2000 euros per month for four years. We will support research-related travel, and there are no tuition fees for PhD students.
A master's degree is required. Additionally, candidates with good implementation skills and previous experience in large-scale dataset analysis, text mining, and machine learning algorithms will be preferred.
Please send an email with any queries or to express your interest to contact(a)css.cs.ut.ee, roshni.chakraborty(a)ut.ee, or rajesh.sharma(a)ut.ee!
Kind Regards,
Roshni Chakraborty,
Assistant Professor,
Institute of Computer Science
University of Tartu, Estonia
Apologies for cross posting
The Fourth Workshop on Speech, Vision, and Language Technologies for
Dravidian Languages (DravidianLangTech-2024) at EACL 2024
Link: https://sites.google.com/view/dravidianlangtech-2024/home
The development of technology increases our internet use, and most of the
global languages have adapted themselves to the digital era. However, there
are many regional, under-resourced languages that face challenges as they
still lack developments in language technology. One such language family is
the Dravidian family of languages. Dravidian languages
<https://en.wikipedia.org/wiki/Dravidian_languages> are primarily spoken in
south India and Sri Lanka. Pockets of speakers are found in Nepal,
Pakistan, Malaysia, other parts of India and elsewhere in the world. The
Dravidian languages, which are 4,500 years old and spoken by millions of
speakers, are under-resourced in speech and natural language processing.
The Dravidian languages are divided into four groups: South, South-Central,
Central, and North groups. Dravidian morphology is agglutinating and
exclusively suffixal. Syntactically, Dravidian languages are head-final and
left-branching. They are free-constituent order languages. To improve
access to and production of information for monolingual speakers of
Dravidian languages, it is necessary to have speech and language
technologies. The aim of these workshops is to save the Dravidian languages
from extinction in technology. This is the fourth workshop on speech and
language technologies for Dravidian languages.
The broader objectives of DravidianLangTech-2024 will be
- To investigate challenges related to speech and language resource
creation for Dravidian languages.
- To promote research in speech and language technology in Dravidian
languages.
- To adopt appropriate language technology models which suit Dravidian
languages.
- To provide opportunities for researchers from the Dravidian language
community from around the world to collaborate with other researchers.
Our workshop theme focuses on being more inclusive and providing a platform
for researchers to create Language Technologies (LT) of a more inclusive
nature. We hope that through these engagements we can develop LT tools to
be more inclusive of everyone, including marginalized people.
Call for Papers
DravidianLangTech-2024 welcomes theoretical and practical paper submissions
on any Dravidian language (Tamil, Kannada, Malayalam, Telugu, Tulu,
Allar, Aranadan, Attapadya, Kurumba, Badaga, Beary, Betta Kurumba,
Bharia, Bishavan, Brahui, Chenchu, Duruwa, Eravallan, Gondi,
Holiya, Irula, Jeseri, Kadar, Kaikadi, Kalanadi, Kanikkaran,
Khiwar, Kodava, Kolami, Konda, Koraga, Kota, Koya, Kurambhag
Paharia, Kui, Kumbaran, Kunduvadi, Kurichiya, Kurukh, Kurumba, Kuvi,
Madiya, Mala Malasar, Malankuravan, Malapandaram, Malasar, Malto,
Manda, Muduga, Mullu Kurumba, Muria, Muthuvan, Naiki, Ollari, Paliyan,
Paniya, Pardhan, Pathiya, Pattapu, Pengo, Ravula, Sholaga, Thachanadan,
Toda, Wayanad Chetti, and Yerukala) that contributes to research in
language processing, speech technologies or resources for the same. We will
particularly encourage studies that address either practical application or
improving resources for a given language in the field.
We invite submissions on topics that include, but are not limited to, the
following:
- Code-mixing/ Code-switching
- Cognitive Modeling and Psycholinguistics
- Computer-assisted language learning (CALL)
- Corpus development, tools, analysis and evaluation
- COVID-19 alert, NLP Applications for Emergency Situations and Crisis
Management
- Equality, Diversity, and Inclusion
- Fake News, Spam, and Rumor Detection
- Hate speech detection and Offensive Language Detection
- Lexicons and Machine-readable dictionaries
- Linguistic Theories, Phonology, Morphological analysis, Syntax and
Semantics
- Machine Translation, Sentiment Analysis, and Text summarization
- Multimodal Analysis: Image Captioning and Video Captioning
- Speech technology and Automatic Speech Recognition
Important dates
- Workshop paper due: *December 12, 2023*
- Direct Submission deadline (pre-reviewed ARR & main conference):
January 17, 2024
- Notification of acceptance: January 15, 2024
- Camera-ready papers due: January 25, 2024
- Workshop dates: March 21-22, 2024
Submission Link:
https://openreview.net/group?id=eacl.org/EACL/2024/Workshop/DravidianLangTe…
with regards,
Dr. Bharathi Raja Chakravarthi,
Assistant Professor / Lecturer-above-the-bar
School of Computer Science, University of Galway, Ireland
Insight SFI Research Centre for Data Analytics, Data Science Institute,
University of Galway, Ireland
E-mail: bharathiraja.akr(a)gmail.com , bharathi.raja(a)universityofgalway.ie
<bharathiraja.asokachakravarthi(a)universityofgalway.ie>
Google Scholar: https://scholar.google.com/citations?user=irCl028AAAAJ&hl=en
Website:
https://www.universityofgalway.ie/our-research/people/bharathirajaasokachak…
Breaking ground: Discussing the present and the future of Data-driven learning – Online seminar
December 4, 2023, 10:00 - 13:00
This online seminar aims to provide a stage for new voices to share their experiences of DDL research and their vision for its development. It is the result of a collaboration between ATILF/University of Lorraine and the University of Murcia to bring together innovative research from current and future specialists in DDL. The webinar will take place on 4th December 2023 and will consist of a 2-hour session in which early-career researchers will present their novel takes on DDL, followed by a 1-hour roundtable with experienced researchers (Alex Boulton, U. Lorraine; Elisa Corino, U. Torino; and Pascual Pérez-Paredes, U. Murcia) and everyone involved in the webinar.
Click below to register. It’s free.
https://umurcia.zoom.us/webinar/register/WN_DuBwB0RnSUGzGfsxJ2i8jw#/registr…
For speakers and more details:
https://www.um.es/languagecorpora/breaking-ground/
Pascual Pérez-Paredes
https://webs.um.es/pascualf/<https://webs.um.es/pascualf/miwiki/doku.php>
Assistant/Associate Professor in Natural Language Processing
IT University of Copenhagen
Application deadline: 26 November 2023
The computer science department at the IT University of Copenhagen is recruiting a new faculty member to join the natural language processing group.
Candidates from any background and all areas of natural language processing are very welcome to apply!
ITU is a small university with a motivated and engaged student community in Copenhagen, a city that consistently ranks as one of the most livable places in the world.
If you have any questions, don’t hesitate to get in touch with me:
chrha(a)itu.dk
More information and application link in the official job posting:
https://candidate.hr-manager.net/ApplicationInit.aspx?cid=119&ProjectId=181…
Website of the NLP group at ITU:
https://nlpnorth.github.io/
--
Christian Hardmeier, Associate Professor – https://christianhardmeier.rax.ch/
IT University of Copenhagen, Department of Computer Science
***Second Call for Papers***
**Overview**
Submission page: https://softconf.com/eacl2024/LAW-XVIII/
Website: https://sigann.github.io/LAW-XVIII-2024/
E-mail: law-xviii-2024(a)googlegroups.com
**Workshop Description**
LAW-XVIII will be the 18th annual meeting endorsed by the ACL Special
Interest Group for Annotation (SIGANN). It will take place in March 2024 at
EACL in St. Julians, Malta.
Linguistic annotation of natural language corpora is the backbone of
supervised methods in both statistical and neural natural language
processing. Annotated corpora are also a major supporting source of
information for unsupervised methods, multitask learning, and evaluation of
both NLP tools and theories about language within and outside of linguistics.
The LAW-XVIII will provide a forum for presentation and discussion of
innovative research on all aspects of linguistic annotation, including
creation/evaluation of annotation schemes, methods for automatic and manual
annotation, use and evaluation of annotation software and frameworks,
representation of linguistic data and annotations, semi-supervised “human
in the loop” methods of annotation, crowd-sourcing approaches, and more.
The LAW will also provide a forum for annotation researchers to work towards
standardization, best practices, and interoperability of annotation
information and software.
In line with the EACL main conference, LAW will be hybrid, allowing both
in-person and virtual presentations.
**Special Theme**
The special theme of LAW-XVIII is “Annotation in the Age of Large Language
Models (LLMs).” In addition to LAW’s general topics, we specifically
invite submissions on the following topics:
- Comparison of linguistically annotated datasets vs. datasets created using
large language models. Potential topics include:
- Comparison of models that have been trained on the respective datasets
- Impact of data size of manually annotated resources already available prior
to dataset creation with LLMs
- Is synthetic dataset creation a viable option for non-standard domains,
e.g., the medical domain, where expert knowledge is required?
- Non-performance-related considerations of manual vs. synthetic dataset
creation (e.g., explainability)
- Impact and prevention of test dataset contamination in LLM training
- Usefulness of LLMs for linguistic research (in relation to annotation).
- Any other topics related to the special theme.
**Submissions**
We accept both direct submissions and commitments from ACL Rolling Review (ARR).
We welcome submissions of long and short papers, posters, and demonstrations
relating to the special theme or any aspect of linguistic annotation,
including:
- Annotation procedures
- Innovative automated and manual strategies for annotation
- Machine learning and knowledge-based methods for automation of corpus
annotation
- Creation, maintenance, and interactive exploration of annotation structures
and annotated data
- Annotation evaluation
- Inter-annotator agreement and other evaluation metrics and strategies
- Qualitative evaluation of linguistic representations
- Innovative means to evaluate annotation quality
- Annotation access and use
- Representation formats/structures for annotations of different phenomena,
especially annotations at multiple levels, and means to explore/manipulate
them
- Linguistic considerations for merging annotations of distinct phenomena
- Annotation schemes, guidelines and standards
- New and innovative annotation schemes, comparison of annotation schemes
- Methodologies and resources for annotation scheme development
- Best practices for annotation procedures and/or development and
documentation of annotation schemes
- Interoperability of annotation formats and/or frameworks among different
systems as well as different tasks, frameworks, modalities, and languages
- Results from the application and evaluation of standards for linguistic
annotation
- Annotation software and frameworks
- Development, evaluation and/or innovative use of annotation software
frameworks
Submissions should report original and unpublished research on topics of
interest to the workshop. We also invite substantiated position papers, in
particular with regard to our special theme. Accepted papers are expected to
be presented at the workshop and will be published in the workshop
proceedings. They should emphasize obtained results rather than intended
work, and should indicate clearly the state of completion of the reported
results.
A paper accepted for presentation at the workshop must not be or have been
presented at any other meeting with publicly available proceedings.
Long/short paper submissions must use the official ACL style templates. Long
papers must not exceed eight (8) pages of content. Short papers and
demonstration papers must not exceed four (4) pages of content. References do
not count against these limits.
Note: The supplementary material does not count towards page limit and should
not be included in the paper, but should be submitted separately using the
appropriate field on the submission website. All submissions must be in PDF
format.
Reviewing of papers will be double-blind. Therefore, the paper must not
include the authors’ names and affiliations or self-references that reveal
the authors’ identity--e.g., "We previously showed (Smith, 1991) ..."
should be replaced with citations such as "Smith (1991) previously showed
...". Papers that do not conform to these requirements will be rejected
without review.
Authors of papers that have been or will be submitted to other meetings or
publications must provide this information to the workshop co-chairs
(law-xviii-2024(a)googlegroups.com). Authors of accepted papers must notify
the program chairs within 10 days of acceptance if the paper is withdrawn for
any reason.
We follow previous and current ACL policy to establish an anonymity period
(from submission to author notification) during which non-anonymous posting
of preprints is not allowed. Also included in that policy are instructions to
reviewers to not rate papers down for not citing recent preprints. Authors
are asked to cite published versions of papers instead of preprint versions
when possible.
Papers can be submitted at https://softconf.com/eacl2024/LAW-XVIII/.
If you have any questions, please feel free to contact the program co-chairs
via e-mail or check the workshop website
(https://sigann.github.io/LAW-XVIII-2024/) for updates.
**Dates**
(All submission deadlines are 11:59 p.m. UTC-12:00 “anywhere on Earth”)
Anonymity period starts: November 18, 2023
Submission of long and short papers: December 18, 2023
ARR Commitment deadline: January 17, 2024
Notification of acceptance: January 20, 2024
Camera-ready papers due: January 30, 2024
Workshop: March 21 or 22, 2024
**Workshop Organizers**
Manfred Stede (Program Co-Chair)
Sophie Henning (Program Co-Chair)
Amir Zeldes (ACL SIGANN President)
Ines Rehbein (ACL SIGANN Secretary)