LT4HALA 2024 -- deadline extension -- Third Workshop on Language Technologies for Historical and Ancient LAnguages @ LREC COLING 2024
The Third Workshop on Language Technologies for Historical and Ancient LAnguages (LT4HALA 2024) will be held on May 25th in Torino (Italy), co-located with LREC-COLING 2024. This one-day workshop seeks to bring together scholars, who are developing and/or are using Language Technologies (LTs) for historically attested languages, so to foster cross-fertilization between the Computational Linguistics community and the areas in the Humanities dealing with historical linguistic data, e.g. historians, philologists, linguists, archaeologists and literary scholars.
*
Submission deadline: 26th February 2024 **NEW DEADLINE: 1st March 2024**
Website: https://circse.github.io/LT4HALA/2024/
Submission page: https://softconf.com/lrec-coling2024/lt4hala2024/
------------------------------------------------------------------------------
MentalRiskES at IberLEF 2024: Call for Participation
Website: https://sites.google.com/view/mentalriskes2024
-------------------------------------------------------------------------------
MentalRiskES describes the second edition of a novel task on early risk
identification of mental disorders in Spanish comments from social media
sources. The first edition took place last year in the IberLEF evaluation
forum as part of the SEPLN 2023. The task was resolved as an online
problem, that is, the participants had to detect a potential risk as early
as possible in a continuous stream of data. Therefore, the performance not
only depended on the accuracy of the systems but also on how fast the
problem was detected. These dynamics are reflected in the design of the
tasks and the metrics used to evaluate participants. For this second
edition, we propose three novel tasks, the first subtask is about
detection disorder, the second subtask consists of detecting the context
that may be associated with the disorder, and the third subtask is about
suicidal ideation detection.
We would like to invite you to participate in the following tasks:
1. Disorders detection (multi-class classification)
2. Disorder contextualization (fine-grained classification)
3. Suicidal ideation detection (binary classification)
Find out more at https://sites.google.com/view/mentalriskes2024.
MentalRiskES 2024 is part of the IberLEF Workshop and will be held in
conjunction with the SEPLN 2024 conference in Valladolid (Spain).
-------------------------------------------------------------------------------
Important Dates
-------------------------------------------------------------------------------
Feb 16th Registration open
Feb 21st Release of trial corpora (trial server available)
Mar 20th Release of training corpora
Mar 29th Registration closed
Apr 8th Release of test corpora and start of the evaluation
campaign (test server available and trial submissions closed)
Apr 12th End of evaluation campaign (deadline for submission
of runs)
Apr 18th Publication of official results and release of test
gold labels
May 10th Deadline for paper submission
May 31st Acceptance notification
Jun 17th Camera-ready submission deadline
July 11th Final camera-ready submission deadline (to IberLEF
organisers)
Please reach out to the organizers at MentalRiskEs@IberLEF2024.
The MentalRiskES 2024 organizing committee.
--
M. Dolores Molina González
-----------------------------------------------------------
Mas informacion sobre listas de correo en la Univ. de Jaen
http://www.ujaen.es/sci/redes/listas/
-----------------------------------------------------------
*** Second Call for Papers ***
We invite paper submissions to the 8th Workshop on Online Abuse and Harms (WOAH), which will take place on June 20/21 at NAACL 2024.
Website: https://www.workshopononlineabuse.com/cfp.html
Join our WOAH community Slack channel<https://hatespeechdet-47d7560.slack.com/join/shared_invite/zt-2a8d96j4z-gkN…>!
Important Dates
Submission due: March 10, 2024
ARR reviewed submission due: April 7, 2024
Notification of acceptance: April 14, 2024
Camera-ready papers due: April 24, 2024
Workshop: June 20/21, 2024
Overview
Digital technologies have brought many benefits for society, transforming how people connect, communicate and interact with each other. However, they have also enabled abusive and harmful content such as hate speech and harassment to reach large audiences, and for their negative effects to be amplified. The sheer amount of content shared online means that abuse and harm can only be tackled at scale with the help of computational tools. However, detecting and moderating online abuse and harms is a difficult task, with many technical, social, legal and ethical challenges. The Workshop on Online Abuse and Harms invites paper submissions from a wide range of fields, including natural language processing, machine learning, computational social sciences, law, politics, psychology, sociology and cultural studies. We explicitly encourage interdisciplinary submissions, technical as well as non-technical submissions, and submissions that focus on under-resourced languages. We also invite non-archival submissions and civil society reports.
The topics covered by WOAH include, but are not limited to:
* New models or methods for detecting abusive and harmful online content, including misinformation;
* Biases and limitations of existing detection models or datasets for abusive and harmful online content, particularly those in commercial use;
* New datasets and taxonomies for online abuse and harms;
* New evaluation metrics and procedures for the detection of harmful content;
* Dynamics of online abuse and harms, as well as their impact on different communities
* Social, legal, and ethical implications of detecting, monitoring and moderating online abuse
In addition, we invite submissions related to the theme for this eighth edition of WOAH, which will be online harms in the age of large language models. Highly capable Large Language Models (LLMs) are now widely deployed and easily accessible by millions across the globe. Without proper safeguards, these LLMs will readily follow malicious instructions and generate toxic content. Even the safest LLMs can be exploited by bad actors for harmful purposes. With this theme, we invite submissions that explore the implications of LLMs for the creation, dissemination and detection of harmful online content. We are interested in how to stop LLMs from following malicious instructions and generating toxic content, but also how they could be used to improve content moderation and enable countermeasures like personalised counterspeech. To support our theme, we have invited an interdisciplinary line-up of high-profile speakers across academia, industry and public policy.
Submission
Submission is electronic, using the Softconf START conference management system.
Submission link: https://softconf.com/naacl2024/WOAH2024/manager/scmd.cgi?scmd=submitPaperCu…
The workshop will accept three types of papers.
* Academic Papers (long and short): Long papers of up to 8 pages, excluding references, and short papers of up to 4 pages, excluding references. Unlimited pages for references and appendices. Accepted papers will be given an additional page of content to address reviewer comments. Previously published papers cannot be accepted.
* Non-Archival Submissions: Up to 2 pages, excluding references, to summarise and showcase in-progress work and work published elsewhere.
* Civil Society Reports: Non-archival submissions, with a minimum of 2 pages and no upper limit. Can include work published elsewhere.
Format and styling
All submissions must use the official ACL two-column format, using the supplied official style files. The templates can be downloaded in Style Files and Formatting<https://github.com/acl-org/acl-style-files>.
Please send any questions about the workshop to organizers(a)workshopononlineabuse.com<mailto:organizers@workshopononlineabuse.com>
Organisers
Paul Röttger, Bocconi University
Yi-Ling Chung, The Alan Turing Institute
Debora Nozza, Bocconi University
Aida Mostafazadeh Davani, Google Research
Agostina Calabrese, University of Edinburgh
Flor Miriam Plaza-del-Arco, Bocconi University
Zeerak Talat, MBZUAI
The Alan Turing Institute is a limited liability company, registered in England with registered number 09512457. Our registered office is at British Library, 96 Euston Road, London, England, NW1 2DB. We are also a charity registered in England with charity number 1162533. This email and any attachments are confidential and may be legally privileged. If you have received it in error, you are on notice of its status. If you have received this message in error, please send it back to us, and immediately and permanently delete it. Do not use, copy or disclose the information contained in this message or in any attachment. DISCLAIMER: Although The Alan Turing Institute has taken reasonable precautions to ensure no viruses are present in this email, The Alan Turing Institute cannot accept responsibility for any loss or damage sustained as a result of computer viruses and the recipient must ensure that the email (and attachments) are virus free. While we take care to protect our systems from virus attacks and other harmful events, we give no warranty that this message (including attachments) is free of any virus or other harmful matter, and we accept no responsibility for any loss or damage resulting from the recipient receiving, opening or using it. E-mail transmission cannot be guaranteed to be secure or error-free as information could be intercepted, corrupted, lost, destroyed, arrive late or be incomplete. If you think someone may have interfered with this email, please contact the Alan Turing Institute by telephone only and speak to the person dealing with your matter or the Accounts Department. Fraudsters are increasingly targeting organisations and their affiliates, often requesting funds to be transferred to a different bank account. The Alan Turing's bank details are contained within our terms of engagement. If you receive a suspicious or unexpected email from us, or purporting to have been sent on our behalf, particularly containing different bank details, please do not reply to the email, click on any links, open any attachments, nor comply with any instructions contained within it, but contact our Accounts department by telephone. Our Transparency Notice found here - https://www.turing.ac.uk/transparency-notice sets out how and why we collect, store, use and share your personal data and it explains your rights and how to raise concerns with us.
Dear colleagues,
We are glad to share the second Call for Paper of the Joint Workshop of the
7th Financial Technology and Natural Language Processing (FinNLP) and the
5th Knowledge Discovery from Unstructured Data in Financial Services (KDF)
at LREC-COLING-2024. This workshop aims at discussing NLP for financial
applications and knowledge discovery from financial data. *The submission
deadline is March 1st, 2024, and here is the Submission System:
https://softconf.com/lrec-coling2024/finnlp-kdf2024/
<https://softconf.com/lrec-coling2024/finnlp-kdf2024/>*
In addition to the main track, we will have two keynote talks provided by
Dr. James Zhang (Ant Group) and Prof. Diyi Yang (Stanford). Besides, we
also have a shared task related to ESG (environmental, social, and
corporate governance). Please visit our website for more details:
https://sites.google.com/nlg.csie.ntu.edu.tw/finnlp-kdf-2024/home
We look forward to receiving your submissions and welcoming you to join
FinNLP-KDF.
Best Regards,
FinNLP-KDF organizers
*Call for Paper - FinNLP-KDF*
*Topics of Interest*We invite submissions of original contributions on
methods, theories, applications, and systems on artificial intelligence,
machine learning, natural language processing & understanding, big data,
statistical learning, data analytics, and deep learning, with a focus on
knowledge discovery in the financial services domain. The scope of the
workshop includes, but is not limited to, the following areas:
- Representation learning, and distributed representation learning and
encoding in natural language processing for financial document
- Language modeling on financial corpora including tabular and numerical
data, and multi-modal modeling; large language models (LLMs) and
applications for finance
- Graph representation learning, mining learning on graph structures
from financial data
- Multi-source knowledge integration and fusion, and knowledge alignment
and integration from heterogeneous data
- Synthetic or genuine financial datasets and benchmarks for baseline
models
- Transfer learning applications for financial data, knowledge
distillation as a method for compression of pre-trained models or
adaptation to financial datasets
- Search and question answering systems designed for financial corpora
- Event discovery from alternative data and impact on organization
equity price
- Environmental, social, governance (ESG) event discovery, evaluation,
and impact assessment
*Submission Details*
- Submission Deadline: March 1st, 2024
- Paper Notification: March 20th
- Camera-Ready Deadline: March 25th
- FinNLP-KDF-2024: May 20, 2024
Accepted papers proceedings will be published at ACL Anthology.
*Format*
- The LREC-COLING template MUST be used for your submission(s).
- Long Paper: May consist of up to 8 pages of content, plus unlimited
pages for references and appendix.
- Short Paper and Demo Paper: May consist of up to 4 pages of content,
plus unlimited references and appendix.
*Policies*
- The reviewing process will be double-blind for Long and Short Paper,
and single-blind for Demo Paper. Submissions must be in electronic form
using the FinNLP-KDF-2024 paper submission link above.
- At least one author of each accepted paper should register and present
their work (either online or in-person) in FinNLP-KDF-2024. Papers with “No
Show” may be redacted. Authors will be required to agree to this
requirement at the time of submission.
- When submitting a paper from the START page, authors will be asked to
provide essential information about resources (in a broad sense, i.e. also
technologies, standards, evaluation kits, etc.) that have been used for the
work described in the paper or are a new result of your research. Moreover,
ELRA encourages all LREC-COLING authors to share the described LRs (data,
tools, services, etc.) to enable their reuse and replicability of
experiments (including evaluation ones).
(Apologies for cross-postings)
�
*** The GUM Corpus - Release 10.0.0 ***
*** Georgetown University Multilayer corpus ***
�
Corpling@GU <https://gucorpling.org/corpling/> is happy to announce the first release of series 10 of the Georgetown University Multilayer corpus (GUM V10.0.0):
�
https://gucorpling.org/gum/
�
New in this version:
�
- 4 new genres with 22 new documents: (total tokens: 228,399)
- Courtroom transcripts
- Essays
- Letters (on paper, not e-mails)
- Podcasts
- Expansions to the discourse annotation layer
- Enhanced RST parses with additional, non-projective tree-breaking relations (multiple relations per node)
- Complete signaling annotation including discourse markers and other discourse signals following the Signaling Corpus
- PDTB-style connective annotation and DISRPT style relation classification data
- Morphological segmentation following UniMorph
- Annotation of select constructions based on Construction Grammar (e.g. resultatives, NPN, causal-excess)
- Many corrections to all annotation layers
�
GUM is an open source corpus of richly annotated English texts from 16 genres: academic, bio, courtroom, conversation, essay, fiction, interview, letters, news, podcasts, speeches, textbooks, travel, vlogs, how-to and Reddit forum discussions. The corpus is created by students as part of the Computational Linguistics curriculum at Georgetown University and is available under Creative Commons licenses.
�
This is the first version of GUM series 10, containing roughly 228K tokens annotated for:
�
- Multiple POS tags (100% manual gold PTB, extended PTB, converted CLAWS5 and UPOS) and UD morphological features
- Manually corrected lemmatization and morphological segmentation
- Sentence segmentation and rough speech act (manual)
- Document structure using TEI tags (paragraphs, headings, figures, captions etc., all manual)
- Constituent and dependency syntax (manually corrected Universal Dependencies, and PTB parses from gold tags with function labels and enhanced dependencies)
- Information status (given-active/inactive, accessible-inferable/common ground/aggregate, and new)
- Entity type, salience and coreference annotation (including non-named entities, singletons, appositions, cataphora and several types of bridging), as well as Centering Theory annotations
- Entity linking (Wikification) of all named entities with Wikipedia articles, including their non-named and pronominal mentions
- Discourse parses in enhanced Rhetorical Structure Theory (eRST) and discourse dependencies
- Discourse signal annotations classified into 9 major and 45 minor types indicating how the presence of a relation is marked (based on the Signaling Corpus scheme)
- Abstractive summaries for each document (two summaries per document in the test set)
�
Note on Reddit data: token text is not contained in the release but can be downloaded with an included script.
�
For more information and to search or download the corpus online, see the corpus website <https://gucorpling.org/gum/> .
�
Best wishes,
The GUM team
�
PS – if you like GUM, check out our ‘extreme genre test set’ GENTLE <https://github.com/gucorpling/gentle/> , and the larger, automatically annotated AMALGUM <https://github.com/gucorpling/amalgum/> corpus!
�
* Extended deadline for submissions: February 29th, 2024 (original deadline: February 20th, 2024)
-------
Call for Papers
We are very pleased to share our second call for papers for our workshop on Reference, Framing, and Perspective co-located with LREC-COLING 2024.
* Workshop website: https://cltl.github.io/reference-framing-perspective/
* When: Saturday, May 25th, 20204
* Where: Torino, Italy (co-located with LREC-COLING 2024)
* Deadline for submissions: February 29th, 2024
* Paper types: regular papers (short and long) and extended abstracts
* Paper submission link: https://softconf.com/lrec-coling2024/reference-framing-perspective2024/user…
* Deadline for camera-ready papers: March 29th, 2024
* Shared dataset: https://github.com/cltl/rfp_corpus_collection
When something happens in the world, we have access to an unlimited range of ways (from lexical choices to specific syntactic structures) to refer to the same real-world event. We can chose to express information explicitly or imply it. Variations in reference may convey radically different perspectives. This process of making reference to something by adopting a specific perspective is also known as framing. Although previous work in this area is present (see Ali and Hassan (2022)’s survey for an overview), there is a lack of a unitary framework and only few targeted datasets (Chen et al., 2019) and tools based on Large Language Models exist (Minnema et al., 2022). In this workshop, we propose to adopt Frame Semantics (Fillmore, 1968, 1985, 2006) as a unifying theoretical framework and analysis method to understand the choices made in linguistic references to events. The semantic frames (expressed by predicates and roles) we choose give rise to our understanding, or framing, of an event. We aim to bring together different research communities interested in lexical and syntactic variation, referential grounding, frame semantics, and perspectives. We believe that there is significant overlap within the goals and interests of these communities, but not necessarily the common ground to enable collaborative work.
Referentially Grounded Shared Dataset
One way to study variation in framing is to conduct contrastive analyses of texts reporting on the same real-world event. Such an analysis can help to reveal the extent of variation in framing and possibly give rise to the underlying factors that lead to different choices in framing the same event. We collected such a corpus about the Eurovision Song Festival and make it available as a Shared Dataset for the Workshop. The purpose of this corpus is to enable exploratory analyses, facilitate discussion among participants, and, last but not least, make our workshop a real working workshop.
The corpus is composed of news articles reporting on the Eurovision Song Contest that took place in Rotterdam, the Netherlands (canceled in 2020 and held in 2021). The news articles have been collected using the structured data-to-text approach (Vossen et al., 2018). The corpus contains news articles in multiple languages. We invite participants to submit short and targeted analyses using the data (extended abstracts to be discussed in a hands-on data session). Participants are also free to use the data in regular contributions.
Regular contributions:
We aim to lay the groundwork for such efforts. We invite contributions (regular long papers of 8 pages or short papers of 4 pages) targeting any of the following - non-exhaustive - list of topics:
* Theoretical models of framing and perspective
* Annotation frameworks for framing and perspectives
* Computational models of framing and perspective
* Approaches for creating and analyzing referentially grounded datasets (containing different perspectives, written at different points in time, written in different languages)
* Approaches for and analyses of texts about contested and divisive events triggering different opinions and perspectives
* Analyses of and methods for analyzing (diachronic) lexical variation and framing
* Language resources for reference, frames, and perspectives
* Approaches and tools to compare claims of sources
* Frames as expressions of bias in the representation of social groups
* User interface for the visualization of multiple perspectives
Extended abstracts:
We invite extended abstracts (1,500 words maximum) about small analyses or experiments conducted on our Shared Data. The abstracts will be non-archival and discussed in a dedicated data session.
Invited speakers:
Maria Antoniak
Vered Shwartz
Organizers:
Pia Sommerauer, Tommaso Caselli, Malvina Nissim, Levi Remijnse, Piek Vossen
*Final CfP: Joint Workshop on Multiword Expressions and Universal
Dependencies (MWE-UD 2024)*
Co-located with LREC-COLING 2024
Organized and funded by SIGLEX (the Special Interest Group on the Lexicon
of the Association for Computational Linguistics <http://www.siglex.org/>),
SIGLEX-MWE (SIGLEX Multiword Expressions Section
<https://multiword.org/organization/constitution.html>), Universal
Dependencies <https://universaldependencies.org>, and UniDive
<https://unidive.lisn.upsaclay.fr/doku.php?id=wg1:wg1> COST Action CA21167.
Venue: Torino, Italy and online
Workshop Date: May 25, 2024
Workshop Webpage: https://multiword.org/mweud2024/
Paper submission (extended): Mar 3, 2024
Submission link:
https://softconf.com/lrec-coling2024/mwe-ud2024/user/scmd.cgi?scmd=submitPa…
ARR commitment: Mar 25, 2024
ARR commitment link:
https://softconf.com/lrec-coling2024/mwe-ud2024/user/scmd.cgi?scmd=submitPa…
Invited speakers:
Natalia Levshina, Radboud University
Harish Tayyar Madabushi, University of Bath
We are pleased to announce that the multiword expressions (MWE) and
Universal Dependencies (UD) research communities are joining forces in 2024
to organize a joint workshop. This is a timely collaboration because the
two communities clearly have overlapping interests. For instance, while UD
has several dependency relations that can be used to annotate MWEs, both
annotation guidelines (i.e. is syntactic irregularity and inflexibility or
semantic non-compositionality the leading criterion?) and annotation
practice (both across treebanks for a single language and across languages)
for these relations can be improved (Schneider and Zeldes, 2021). The
PARSEME MWE-annotated corpora for 26 languages build on UD annotated
corpora (Savary et al., 2023). Both communities share an interest in
developing guidelines, data-sets, and tools that can be applied to a wide
range of typologically diverse languages, raising fundamental questions
about tokenization, lemmatization, and morphological decomposition of
tokens. Proposals for harmonizing annotation practices between what has
been achieved in PARSEME and UD and expanding PARSEME MWE annotation to
non-verbal MWEs are also central to the recently started UniDive COST
action (CA21167).
The workshop invites submissions of original research on MWE, UD, and the
interplay of both. In particular, the following topics are especially
relevant:
-
Sensitivity of LLMs to MWE and syntactic dependencies. Studies along the
lines of Manning et al. (2020) (UD), Nedumpozhimana and Kelleher (2021),
Garcia et al. (2021), Fakharian and Cook (2021), Moreau et al. (2018)
(MWE), and others on the question to what extent LLMs make use of syntactic
dependencies or are capable of detecting MWEs and capturing their
semantics.
-
Applicability of UD and MWE annotation and discovery for low-resource
and typologically diverse languages and language varieties. Both UD and
PARSEME aim at universal applicability across a wide range of languages.
Much theoretical, computational, and empirical work concentrates on
high-resource languages however. Applying these frameworks to typologically
diverse languages may lead one to reconsider the notion of token, word, and
morphological segmentation, and to reassess the notion of MWE for languages
that feature compounding or incorporation (Baldwin et al., 2021;
Haspelmath, 2023).
-
Case studies. Studies on the consistency, coverage or universal
applicability of MWE annotation in the UD or PARSEME frameworks, as well as
studies on automatic detection and interpretation of MWEs in corpora.
-
MWE and UD processing to enhance end-user applications. MWEs have gained
particular attention in end-user applications, including MT (Zaninello and
Birch, 2020; Han et al., 2021), simplification (Kochmar et al., 2020),
language learning and assessment (Paquot et al., 2019; Christiansen and
Arnon, 2017), social media mining (Maisto et al., 2017), and abusive
language detection (Zampieri et al., 2020; Caselli et al., 2020). We
believe that it is crucial to extend and deepen these first attempts to
integrate and evaluate MWE technology in these and further end-user
applications.
-
Testing developed systems on the latest dataset versions. Authors are
also encouraged to submit papers that test the developed systems using the
recent UD 2.13 and/or PARSEME 1.3 releases.
Organizational Details: Grant, Format, Publications
-
UniDive members with accepted papers may be eligible for travel
reimbursement.
-
If you are based in an underrepresented country or work on low-resource
languages and have an accepted paper, you may be eligible for an ACL-SIGLEX
travel grant.
-
The workshop follows LREC-COLING’s hybrid online/onsite format.
-
Workshop proceedings will be published in the ACL Anthology.
-
The workshop follows LREC/COLING’s anti-harassment policy
<https://coling2022.org/policy>.
Submission Instructions
The workshop invites two types of submissions:
-
archival submissions that present substantially original research in
both long paper format (8 pages + references) and short paper format (4
pages + references)
-
non-archival submissions of abstracts describing relevant research
presented/published elsewhere which will not be included in the MWE-UD
proceedings.
Papers should be submitted via the workshop’s START submission page
<https://softconf.com/lrec-coling2024/mwe-ud2024/user/scmd.cgi?scmd=submitPa…>.
Please choose the appropriate submission format (archival/non-archival).
Submissions must follow the LREC-COLING 2024 stylesheet
<https://lrec-coling-2024.org/authors-kit/>.
When submitting a paper from the START page, authors will be asked to
provide essential information about resources (in a broad sense, i.e. also
technologies, standards, evaluation kits, etc.) that have been used for the
work described in the paper or are a new result of your research. Moreover,
ELRA encourages all LREC-COLING authors to share the described LRs (data,
tools, services, etc.) to enable their reuse and replicability of
experiments (including evaluation ones).
Archival papers with existing reviews from ACL Rolling Review will also be
considered. Submission is via the START commitment form
<https://softconf.com/lrec-coling2024/mwe-ud2024/user/scmd.cgi?scmd=submitPa…>.
A paper may not be simultaneously under review through ARR and MWE-UD. A
paper that has or will receive reviews through ARR may not be submitted for
review to MWE-UD.
Important Dates
Paper submission (extended): Mar 3, 2024
ARR paper commitment: Mar 25, 2024
Notification of acceptance: Apr 1, 2024
Camera ready papers due: Apr 8, 2024
Workshop: May 25, 2024
All deadlines are at 23:59 UTC-12 (Anywhere on Earth).
Organizing Committee
Archna Bhatia, Gosse Bouma, Kilian Evang, Marcos Garcia, Voula Giouli,
Lifeng Han, Joakim Nivre, Alexandre Rademaker, A. Seza Doğruöz
For any inquiries contact the Organizing Committee at
mweud2024-organizers(a)uni-duesseldorf.de.
*13th Joint Conference on Lexical and Computational Semantics (*SEM 2024)*
Home: https://sites.google.com/view/starsem2024
*Co-located with NAACL 2024*
https://2024.naacl.org/
Mexico City, Mexico
June 16–21, 2024
* Important Dates*
Direct submission deadline: extended to * Feb 29, 2024 *
Date of notification: Apr 22, 2024
Camera ready deadline: May 5, 2024
*SEM workshop: Jun 20-21, 2024
Submission link: https://openreview.net/group?id=aclweb.org/StarSEM/2024/Conference
*Invited Speakers*
Dr. Greg Durrett (The University of Texas at Austin) https://www.cs.utexas.edu/~gdurrett/#
Prof. Heng Ji (University of Illinois Urbana-Champaign) https://cs.illinois.edu/about/people/faculty/hengji
*SEM brings together researchers interested in the semantics of (many and diverse!) natural languages and its computational modeling. The conference embraces data-driven, neural, and probabilistic approaches, as well as symbolic approaches and everything in between; practical applications as well as theoretical contributions are welcome. The long-term goal of *SEM is to provide a stable forum for the growing number of NLP researchers working on all aspects of semantics of (many and diverse!) natural languages.
*Topics of Interest*
* Lexical semantics and word representations
* Compositional semantics and sentence representations
* Statistical, machine learning, and deep learning methods in semantic tasks
* Multilingual and cross-lingual semantics
* Word sense disambiguation and induction
* Semantic parsing, and syntax-semantics interface
* Frame semantics and semantic role labeling
* Textual inference, textual entailment, and question answering
* Formal approaches to semantics
* Extraction of events and of causal and temporal relations
* Entity linking, pronouns and coreference
* Discourse, pragmatics, and dialogue
* Machine reading
* Extra-propositional aspects of meaning
* Multiword and idiomatic expressions
* Metaphor, irony, and humor
* Knowledge mining and acquisition
* Common sense reasoning
* Language generation
* Semantics in NLP applications: sentiment analysis, abusive language detection, summarization, fact-checking, etc.
* Multidisciplinary research on semantics
* Grounding and multimodal semantics
* Psycholinguistcs
* Interpretability and Explainability
* Human semantic processing
* Semantic annotation, evaluation, and resources
* Ethical aspects and bias in semantic representations
Event: 11th Workshop on the Representation and Processing of Sign Languages (sign-lang@LREC 2024)
NEW Deadline: 29 February 2024
Website: https://www.sign-lang.uni-hamburg.de/lrec2024/
Submission page: https://softconf.com/lrec-coling2024/signlang2024/
CALL FOR PAPERS
In response to several requests, we are happy to announce that the deadline for the sign-lang@lREC workshop has been extended to 29 February (11:59PM UTC-12:00 “anywhere on Earth”). Submissions are invited for a full day workshop on sign language resources, to take place on 20 May 2024 as a satellite event of LREC-COLING 2024 in Turin, Italy.
During the past years, a number of large-scale sign language corpus projects have started. Some have already been completed, but many more projects are about to start. At the same time, sign language technologies are maturing and are promising to support the time-consuming basic annotation. The workshop aims at bringing together those researchers who already work with multimodal sign language corpora (and those who see the need for empirical underpinnings of their current research) with those who develop sign language technologies. It provides the platform to compare competing approaches.
As sign language resource technologies build to a large extent on methodologies and tools used in the language resource community in general, but add very specific perspectives (e.g. no writing system established, use of video as data source) and works with a different modality of human language, sign language research is able to feed back to the language resource community at large. At the same time, as the raw data are in the visual domain, the field naturally bridges into Computer Vision. Thus, researchers use Machine Learning methods on both visual and linguistic data.
We invite submissions of papers to be presented either on stage (20 minutes plus 10 minutes discussion) or as posters (with or without demonstrations) on the following topics:
2024 SPECIAL TOPIC: EVALUATION OF SIGN LANGUAGE RESOURCES
With the field maturing, it becomes an urgent issue to assess the quality of sign language resources for a large variety of tasks. We invite contributions on both automatic and human-based evaluation procedures for all kinds of sign language resources and tools.
GENERAL ISSUES ON SIGN LANGUAGE CORPORA AND TOOLS
• Avatar technology as a tool in sign language corpora and corpus data feeding into advances in avatar technology
• Experiences in building sign language corpora
• Elicitation methodology appropriate for corpus collection
• Proposals for standards for linguistic annotation or for metadata descriptions
• Experiences from linguistic research using corpora
• Use of (parallel) corpora and lexicons in translation studies
• Language documentation and long-term accessibility for sign language data
• Annotation and visualization Tools
• Linking corpora and lexicons and integrated presentation of corpus and dictionary contents
• “Internet as a corpus” for sign languages
• Sign language corpus mining
• Crowd and community sourcing for corpus work
• Multi-lingual sign language resources and connecting sign language resources to language resources for spoken languages
• FAIR, CARE and OpenScience for sign language data
In the tradition of LREC, oral/signed presentations and poster presentations (with or without demonstrations) have equal status, and authors are encouraged to suggest the presentation format best suited to communicate their ideas. Papers (4-8 pages) of all accepted submissions to this workshop will be published as workshop proceedings published on the conference website – independent of whether you have a poster or an oral/signed presentation. The workshop does not differentiate between long, short, or position papers.
Please submit your paper through the LREC START system (https://softconf.com/lrec-coling2024/signlang2024/) not later than 29 February 2024, indicating whether you prefer an oral/signed presentation, a poster presentation or a poster presentation with demo. Unlike the main conference, the workshop will be reviewed single-blind, so submissions SHOULD NOT BE ANONYMOUS.
ATTENTION Please note that you are expected to submit the full paper, not an extended abstract as in previous years!
IMPORTANT DATES
• Deadline for submissions (NEW): 29 February 2024 (11:59PM UTC-12:00 “anywhere on Earth”)
• Notification of acceptance: 22 March, 2024
• Early bird registration ends: tbd
• Camera ready version of the paper (for both oral/signed presentations and posters): 8 April 2024
• Submission of slides for interpreters' preparation (oral/signed presentations only): 10 May 2024
• This workshop: 20 May 2024
• LREC main conference: 22–24 May 2024
• LREC workshops 20, 21 & 25 May 2024
The King's DRIVE-Health Centre for Doctoral Training in Data-Driven Health has 15 fully funded four year studentships starting in October 2024. Many of the available projects are NLP related, working on real-world electronic health record text in collaboration with UK NHS hospitals and clinicians. You will be joining a leading health NLP research centre, working on established datasets, and using state-of-the-art compute infrastructure.
For full details see the following projects at https://www.drive-health.org.uk/
projects 17, 27, 50, 53, 55, 78, 79, 85, 86, 87
Application deadline: Sunday 10th March at 23:59 UTC+0
--
Dr. Angus Roberts
Training lead, DRIVE-Health Centre for Doctoral Training
King’s College London
Box PO 80, De Crespigny Park, LONDON SE5 8AF