*** Apologies for cross-posting ***
++ DEADLINE EXTENSION & LAST CALL FOR PAPERS ++
****************************************************************************
Eighth International Workshop on Narrative Extraction from Texts (Text2Story'25)
Held in conjunction with the 47th European Conference on Information Retrieval (ECIR'25)
April 10th, 2025 – Lucca, Italy
Website: https://text2story25.inesctec.pt
****************************************************************************
++ Important Dates ++
- Submission Deadline: February 7th, 2025 January 24th, 2025
- Acceptance Notification: March 3rd, 2025
- Camera-ready copies: March 17th, 2025
- Workshop: April 10th, 2025
++ Overview ++
For seven years, the Text2Story Workshop series has fostered a vibrant community dedicated to understanding narrative structure in text, resulting in significant contributions to the field and developing a shared understanding of the challenges in this domain. While traditional methods have yielded valuable insights, the advent of Transformers and LLMs have ignited a new wave of interest in narrative understanding. In the eighth edition of the Text2Story workshop, we propose to go deeper into the role of LLMs in narrative understanding exploring the issues involved in using LLMs to unravel narrative structures, while also examining the characteristics of narratives generated by LLMs. By fostering dialogue on these emerging areas, we aim to identify the wide-ranging issues related to the narrative extraction task and continue the workshop's tradition of driving innovation in narrative understanding research.
++ List of Topics ++
Research works submitted to the workshop should advance the scientific understanding of all aspects of narrative extraction from texts. This includes, but is not limited to, topics such as narrative information extraction, formal representation of narratives, narrative analysis and generation, development of datasets and evaluation protocols, as well as ethics and bias in narratives, and narrative applications. We encourage the submission of high-quality and original submissions covering the following topics and contributions focused on low and medium-resource languages.
Narrative Information Extraction
- Identification of Participants, Events and Temporal Expressions
- Identification of Participants, Events and Temporal Expressions
- Temporal Reasoning and Ordering of Events
- Causality Detection
- Big Data Applied to Narrative Extraction
- LLMs for Narrative Extraction
Narrative Representation
- Annotation Protocols
- Narrative Representation Models
- Lexical, Syntactic, and Semantic Ambiguity in Narrative Representation
- LLM-learned Representation
Narrative Analysis and Generation
- Discourse and Argument Structure Analysis
- Narrative analysis of LLM generated text
- Multilingual and Cross-lingual Narrative Analysis
- Story Evolution and Shift Detection
- Automatic Timeline Generation
- Generative Language Models for Narrative Generation
Datasets and Evaluation Protocol
- Evaluating LLM-Generated Narratives
- Evaluation of Multimodal Narrative Models
- Annotated datasets
- Narrative Resources
- Using LLMs for Data Creation and Augmentation
Ethics and Bias in Narratives
- Identifying and Mitigating Bias in Generated Narratives
- Ethical and Fair Narrative Generation
- Misinformation and Fact Checking
- Bias in LLM-generated narratives
Narrative Applications
- Narrative-focused Search in Text Collections
- Narrative Summarization
- Narrative Q&A
- Multimodal Narrative Summarization
- Multimodal Narrative-focused Search
- Sentiment and Opinion Detection in Narratives
- Social Media Narratives
- Narrative Text Simplification
- Narrative-based Text Anonymization
- Personalization and Recommendation of Narratives
- Storyline Visualization (including multimodal) and Narrative Structures
++ Objectives ++
Overall, the workshop has the following main objectives: (1) raise awareness within the Information Retrieval (IR) community regarding the challenges posed by narrative extraction and comprehension; (2) bridge the gap and foster connections between academic research, practitioners, and industrial applications; (3) discuss new methods, recent advances, and emerging challenges; (4) share experiences from research projects, case studies, and scientific outcomes structured around fundamental research questions related to narrative understanding; (5) identify dimensions that might be influenced by the automation of the narrative process; (6) highlight tested hypotheses that did not result in the expected outcomes
++ Submission Guidelines ++
We expect contributions from researchers on all aspects of narrative extraction, representation, analysis, and generation. This includes the extraction and formal representation of events, their temporal and causal relationships, and methods for temporal reasoning and ordering. Submissions focusing on narrative comprehension, such as the analysis of generated narratives, are also highly encouraged. Additionally, we welcome innovative approaches to presenting narrative information, including automatic timeline generation, multi-modal narrative summarization, and narrative visualization. Research addressing misinformation and the verification of extracted facts, evaluation methodologies, and the development of annotated datasets, annotation schemas, and evaluation metrics is particularly valued. Finally, we are especially interested in submissions that focus on low and medium-resource languages, as well as multilingual and cross-lingual narrative analysis.
Building on these themes, several pressing questions emerge within the field, offering valuable guidance for authors in shaping their submissions.How can we better integrate multimodal content - combining text, images, videos, and audio - into cohesive narratives? What strategies can reliably extract or generate accurate narratives from large, multi-genre, and multi-lingual datasets? How can systems dynamically adapt to real-time shifts in narratives as the volume of generated content grows? What methodologies can effectively annotate data and evaluate novel approaches, for complex tasks such as visualization but also for characterization of multi-lingual narratives? How can we guarantee the explainability, interpretability, and coherence of narratives across diverse domains and languages? To what extent can novel approaches be generalized to new tasks, genres, and languages with minimal effort? What ethical safeguards are essential to ensure that narrative extraction systems are not misused for propaganda or manipulation? How can challenges posed by ambiguous or contradictory information within narratives be addressed through innovative methods? What role do cultural and contextual nuances play in narrative extraction, and how can these be effectively incorporated into automated systems to ensure greater inclusivity? How can collaboration between human annotators and automated systems be optimized to achieve more accurate, nuanced narrative understanding? How can systems generate concise, evidence-backed explanations to justify the dominant narrative while remaining grounded in the source text?
-> Full papers (up to 8 pages + references): Original and high-quality unpublished contributions to the theory and practical aspects of the narrative extraction task. Full papers should introduce existing approaches, describe the methodology and the experiments conducted in detail. Negative result papers to highlight tested hypotheses that did not get the expected outcome are also welcomed.
-> Short papers (up to 5 pages + references): Unpublished short papers describing work in progress; position papers introducing a new point of view, a research vision or a reasoned opinion on the workshop topics; and dissemination papers describing project ideas, ongoing research lines, case studies or summarized versions of previously published papers in high-quality conferences/journals that is worthwhile sharing with the Text2Story community, but where novelty is not a fundamental issue.
-> Demos | Resource Papers (up to 5 pages + references): Unpublished papers presenting research/industrial demos; papers describing important resources (datasets or software packages) to the text2story community;
Papers submitted to Text2Story 2025 should be original work and different from papers that have been previously published, accepted for publication, or that are under review at other venues. Exceptions to this rule are "dissemination papers". Pre-prints submitted to ArXiv are eligible.
All papers will be refereed through a double-blind peer-review process by at least two members of the programme committee. The accepted papers will appear in the proceedings published at CEUR workshop proceedings (indexed in Scopus and DBLP) as long as they don't conflict with previous publication rights.
++ Invited Speakers ++
Sara Tonelli, Fondazione Bruno Kessler, Italy
Title:
Revisiting frames for event extraction in the Digital Humanities
Abstract:
Frame Semantics as a cognitive linguistic theory was first formalised by Charles Fillmore around 50 years ago. Since then, it has been adapted to different application scenarios as a framework to support event-based information extraction. But what is the role of frames in the era of generative AI? In this talk I will present some recent research works in which frame semantics has been tailored to support digital humanities research. In particular, we explored the use of frames to extract sensory information from historical archives and capture shifts in perception over time. Frame-based event extraction has also been investigated as a way to navigate news collections, build narratives from event chains and present the same event from different points of view.
Bio:
Sara Tonelli is the head of the Digital Humanities research group at Fondazione Bruno Kessler, Trento (Italy) and holds a Phd in Language Sciences from Università Ca' Foscari, Venice. Between 2021 and 2024 she served as Liaison Representative of the ACL Special Interest Group on Language Technologies for the Socio-Economic Sciences and Humanities (SIGHUM) and she is currently part of the board of the Italian Association for Computational Linguistics (AILC). In the last years, she has served as area chair and senior area chair for major *ACL conferences in tracks related to cultural analytics, social media analysis, digital humanities and offensive language detection. She has also participated in different EU-funded projects around disinformation, computational social science and cultural heritage and was scientific coordinator of the KID ACTIONS European project (2021-2022), aimed at addressing cyberbullying among children and adolescents through interactive education and gamification. Her research interests focus on understanding how people communicate on social media and what dynamics are involved in online attacks, as well as what kind of biases can affect this analysis. She is also interested in using NLP to extract information from digital archives to address historical and cultural heritage research questions.
++ Organizing committee ++
Ricardo Campos (INESC TEC; University of Beira Interior, Covilhã, Portugal)
Alípio M. Jorge (INESC TEC; University of Porto, Portugal)
Adam Jatowt (University of Innsbruck, Austria)
Sumit Bhatia (Media and Data Science Research Lab, Adobe)
Marina Litvak (Shamoon Academic College of Engineering, Israel)
++ Proceedings Chair ++
João Paulo Cordeiro (NOVA Lincs & University of Beira Interior, Covilhã, Portugal)
Conceição Rocha (INESC TEC, Portugal)
++ Web and Dissemination Chair ++
Hugo Sousa (INESC TEC & University of Porto, Portugal)
Behrooz Mansouri (University of Maine, USA)
++ Program Committee ++
Abhai Singh (Amazon)
Ali Salehi (University at Buffalo)
Arian Pasquali (Faktion AI)
Andreas Spitz (University of Konstanz)
Antoine Doucet (Université de La Rochelle)
António Horta Branco (University of Lisbon)
Bart Gajderowicz (University of Toronto)
Behrooz Mansouri (Rochester Institute of Technology)
Brenda Santana (Federal University of Rio Grande do Sul)
Brucce dos Santos (Computational Intelligence Laboratory (LABIC) - ICMC/USP)
Bruno Martins (IST & INESC-ID, University of Lisbon)
David Semedo (Universidade NOVA de Lisboa)
Dennis Aumiller (Cohere)
Dhruv Gupta (Norwegian University of Science and Technology)
Evelin Amorim (INESC TEC)
Sérgio Matos (University of Aveiro)
Florian Boudin (Nantes University)
Henrique Lopes Cardoso (LIACC & University of Porto)
Irina Rabaev (Shamoon College of Engineering)
Ismail Altingovde (Middle East Technical University)
Junbo Huang (University of Hamburg)
Jakub Piskorski (Polish Academy of Sciences)
João Paulo Cordeiro (Nova lincs & University of Beira Interior)
Jin Zhao (Brandeis University)
Luca Cagliero (Politecnico di Torino)
Ludovic Moncla (INSA Lyon)
Luis Filipe Cunha (INESC TEC & University of Minho)
Marc Finlayson (Florida International University)
Marc Spaniol (Université de Caen Normandie)
Moreno La Quatra (Kore University of Enna)
Nianwen Xue (Brandeis University)
Nuno Guimarães (INESC TEC & University of Porto)
Paulo Quaresma (Universidade de Évora)
Paul Rayson (Lancaster University)
Purificação Silvano (CLUP & University of Porto)
Ross Purves (University of Zurich)
Sérgio Nunes (INESC TEC & University of Porto)
Sriharsh Bhyravajjula (University of Washington)
Udo Kruschwitz (University of Regensburg)
Valentina Bartalesi (ISTI-CNR, Italy)
Yangyang Chen (Brandeis University)
++ Contacts ++
Website: https://text2story25.inesctec.pt
For general inquiries regarding the workshop, reach the organizers at: text2story2025(a)easychair.org
**** We apologize for the multiple copies of this email. In case you are
already registered to the next webinar, you do not need to register
again. ****
------------------------------------------------------------------------
Dear colleague,
We are happy to announce the next webinar in the Language Technology
webinar series organized by the HiTZ Chair of AI< (https://hitz.eus).
You can check the videos of previous webinars and the schedule for
upcoming webinars here: http://www.hitz.eus/webinars
Next webinar:
*Speaker:* Sebastian Ruder (Meta)
*Title:* Multilingual LLM Evaluation in Practical Settings
*Date: * Thursday, February 6, 2025 - 15:00 CET
*Summary:* Large language models (LLMs) are increasingly used in a
variety of applications across the globe but do not provide equal
utility across languages. In this talk, I will discuss multilingual
evaluation of LLMs in two practical settings: conversational
instruction-following and usage of quantized models. For the first part,
I will focus on a specific aspect of multilingual conversational ability
where errors result in a jarring user experience: generating text in the
user’s desired language. I will describe a new benchmark and evaluation
of a range of LLMs. We find that even the strongest models exhibit
language confusion, i.e., they fail to consistently respond in the
correct language. I will discuss what affects language confusion, how to
mitigate it, and potential extensions. In the second part, I will
discuss the first evaluation study of quantized multilingual LLMs across
languages. We find that automatic metrics severely underestimate the
negative impact of quantization and that human evaluation—which has been
neglected by prior studies—is key to revealing harmful effects. Overall,
I highlight limitations of multilingual LLMs and challenges of
real-world multilingual evaluation.
*Bio:* Sebastian Ruder is a research scientist at Meta based in Berlin,
Germany where he works on improving evaluation and benchmarking of large
language models (LLMs). He previously led the Multilinguality team at
Cohere with the objective to improve the multilingual capabilities of
Cohere's LLMs. Before that he was a research scientist at Google
DeepMind. He completed his PhD in Natural Language Processing (NLP) at
the Insight Research Centre for Data Analytics, while working as a
research scientist at Dublin-based text analytics startup AYLIEN.
Previously, he studied Computational Linguistics at the University of
Heidelberg, Germany and at Trinity College, Dublin.
*
Upcoming webinars:*
· Christian Herff (Thursday, March 6, 2025)
· Emanuele Bugliarello (Thursday, April 3, 2025)
· André F. T. Martins (Thursday, May 8, 2025)
If you are interested in participating, please complete this
registration form: http://www.hitz.eus/webinar_izenematea
If you cannot attend this seminar, but you want to be informed of the
following HiTZ webinars, please complete this registration form instead:
http://www.hitz.eus/webinar_info
Best wishes,
HiTZ Zentroa
P.S: HiTZ will not grant any type of certificate for attendance at these
webinars.
Hi,
We are pleased to announce that the submission deadline for the Insights 2025 Workshop (May 3-4, 2025, co-located with NAACL 2025) has been extended to February 10, 2025 (AoE).
If you are working on unexpected or negative results in NLP research, we encourage you to submit your work. This workshop provides a venue for highlighting methodological challenges, limitations of current approaches, and insights that can guide the community toward more rigorous research practices.
* New Submission Deadline: February 10, 2025 (AoE)
* Submission Portal: https://softconf.com/naacl2025/Insights2025
We invite:
- Short papers (up to 4 pages + references & appendices)
- 1-2 page non-archival abstracts for work published elsewhere
For full details, please refer to https://insights-workshop.github.io/2025/cfp/
We appreciate your contributions and look forward to your submissions!
Best regards,
Insights 2025 Organizing Committee
3rd and FINAL Call for Abstracts!
Never been to a NARNiHS Research Incubator?!?
Take advantage of the newly extended abstract submission deadline to join us for this year's opportunity to brainstorm your cutting-edge work with us!
***********************************
2025 NARNiHS Research Incubator
North American Research Network in Historical Sociolinguistics
7th edition
***********************************
==> 01-03 May 2025 -- entirely online!
==> FINAL Submission Deadline
==> 03 February 2025, 11:59 PM (U.S. Eastern Time)
The 2025 NARNiHS Research Incubator is an entirely online event (with **free** registration). This event offers an opportunity for scholars in historical sociolinguistics from all over the world to participate in cutting edge research without the limitations imposed by international travel. We encourage our fellow historical sociolinguists and scholars from related fields in our global scholarly community to join us online for our Research Incubator this spring.
FINAL abstract submission deadline: 03 February 2025, 11:59 PM (U.S. Eastern Time)
Abstract submission online: https://easyabs.linguistlist.org/conference/25_NARNiHS_Incubator/
The North American Research Network in Historical Sociolinguistics (NARNiHS) is accepting abstracts for its 2025 NARNiHS Research Incubator. The 7th edition of this inclusive NARNiHS event seeks to provide a collaborative environment where presenters bring work that is in-progress, exploratory, proof-of-concept, or prototyping. The incubator's audience actively participates in workshopping these new ideas, brainstorming along with the presenter to forge scholarly paths and develop research solutions. We see the NARNiHS Research Incubator as a place for testing and pushing boundaries; developing new theories, methods, models, and tools in historical sociolinguistics; seeking feedback from peers; and engaging in productive assessment of fledgling ideas and nascent projects.
Successful abstracts for this research incubator environment will demonstrate thorough grounding in historical sociolinguistics, scientific rigor in the formulation of research questions, and promise for rich discussion of ideas.
NARNiHS welcomes papers in all areas of historical sociolinguistics, which is understood as the application/development of sociolinguistic theories, methods, and models for the study of historical language variation and change over time, or more broadly, the study of the interaction of language and society in historical periods and from historical perspectives. Thus, a wide range of linguistic areas, subdisciplines, and methodologies easily find their place within the field, and we encourage submission of abstracts that reflect this broad scope.
We are soliciting abstracts for **25-minute presentations**. Presenters will have the entire 25 minutes for their presentations, with discussion happening in the "incubation session" at the end of each panel. Abstracts should be **no more than one page** (not including examples and references, see below).
Abstracts will be accepted until 03 February 2025 -- late abstracts will not be considered.
Successful abstracts will be explicit about which theoretical frameworks, methodological protocols, and analytical strategies are being applied or critiqued. Data sources and examples should be sufficiently (if briefly) presented, so as to allow reviewers a full understanding of the scope and claims of the research. Please note that **the connection of your research to the field of historical sociolinguistics should be explicitly outlined** in your abstract. Failure to adhere to these criteria will likely result in rejection of the abstract.
To encourage maximum exchange of ideas in the incubation environment, an hour-long discussion with the audience -- led by specialists -- will follow each thematic panel and will encompass specific feedback on three papers as well as emergent considerations of overarching questions of theory, methods, and models. To facilitate such incubation, authors will be required to submit a draft of their presentation materials for distribution to the panel discussants and the other presenters a few days prior to the start of the conference.
Abstract Content Requirements:
1) Abstracts should be explicit about which theoretical frameworks, methodological protocols, and analytical strategies are being applied or critiqued.
2) Data sources and examples should be sufficiently (if briefly) presented, so as to allow reviewers a full understanding of the scope and claims of the research.
3) The connection of your research to the field of historical sociolinguistics should be explicitly outlined.
Abstract Format Guidelines:
1) Abstracts must be submitted in PDF format.
2) Abstracts must fit on one standard 8.5x11 inch page, with margins no smaller than 1 inch and a font style and size no smaller than Times New Roman 12 point. All additional content (visualizations, trees, tables, figures, captions, examples, and references) must fit on a single (1) additional page. No exceptions to these requirements are allowed; abstracts exceeding these limits will be rejected without review.
3) Anonymize your abstract. We realize that sometimes complete anonymity is not attainable, but there is a difference between the nature of the research creating an inability to anonymize and careless non-anonymizing (in citations, references, file names, etc.). Be sure to anonymize your PDF file (you may do so in Adobe Acrobat Reader by clicking on "File", then "Properties", removing your name if it appears in the "Author" line of the "Description" tab, and re-saving the file before submission). Do not use your name when saving your PDF (e.g. Smith_Abstract.pdf); file names will not be automatically anonymized by the EasyAbs system. Rather, use non-identifying information in your file name (e.g. HistSoc4Lyfe.pdf). Your name should only appear in the online form accompanying your abstract submission. Papers that are not sufficiently anonymized wherever possible will be rejected without review.
General Conference Requirements:
1) Abstracts must be submitted electronically, using the following link: https://easyabs.linguistlist.org/conference/25_NARNiHS_Incubator/
2) Papers must be delivered as projected in the abstract or represent bona fide developments of the same research.
3) Authors are expected to virtually attend the conference and present their own papers.
4) Presentations will be delivered via Zoom. Technical details and instructions regarding the platform will be sent to authors in due time.
Please contact us at NARNiHistSoc(a)gmail.com with any questions.
[Apologies for cross-postings]
********************************************************************************
Final Call for Papers
21st Workshop on Multiword Expressions (MWE 2025)
Organized, sponsored and endorsed by SIGLEX, the Special Interest Group on
the Lexicon of the ACL
Full-day workshop collocated with NAACL 2025, Albuquerque, New Mexico,
U.S.A., May 3 or 4, 2025
Hybrid (on-site & on-line)
Submission deadline: February 13, 2025
MWE 2025 website: <https://multiword.org/mwe2022/>
https://multiword.org/mwe2025/
********************************************************************************
Multiword expressions (MWEs), i.e., word combinations that exhibit lexical,
syntactic, semantic, pragmatic, and/or statistical idiosyncrasies (Baldwin
and Kim, 2010), such as “by and large”, “hot dog”, “make a decision” and
“break one's leg” are still a pain in the neck for Natural Language
Processing (NLP). The notion encompasses closely related phenomena: idioms,
compounds, light-verb constructions, phrasal verbs, rhetorical figures,
collocations, institutionalized phrases, etc. Given their irregular nature,
MWEs often pose complex problems in linguistic modeling (e.g. annotation),
NLP tasks (e.g. parsing), and end-user applications (e.g. natural language
understanding and Machine Translation), hence still representing an open
issue for computational linguistics (Constant et al., 2017).
For more than two decades, modelling and processing MWEs for NLP has been
the topic of the MWE workshop organised by the MWE section
<https://multiword.org/> of ACL-SIGLEX <http://www.siglex.org/> in
conjunction with major NLP conferences since 2003. Impressive progress has
been made in the field, but our understanding of MWEs still requires much
research considering their need and usefulness in NLP applications. This is
also relevant to domain-specific NLP pipelines that need to tackle
terminologies most often realised as MWEs. Following previous years, for
this 21st edition of the workshop, we identified the following topics on
which contributions are particularly encouraged:
-
MWE processing to enhance end-user applications. MWEs gained particular
attention in end-user applications, including Machine Translation (MT)
(Zaninello and Birch, 2020), simplification (Kochmar et al., 2020),
language learning and assessment (Paquot et al., 2020), social media mining
(Pelosi et al., 2017), and abusive language detection (Zampieri et al.
2020). We believe that it is crucial to extend and deepen these first
attempts to integrate and evaluate MWE technology in these and further
end-user applications.
-
MWE processing and identification in the general language, as well as in
specialized languages and domains: Multiword terminology extraction from
domain-specific corpora (Lossio-Ventura et al, 2014) is of particular
importance to various applications, such as MT (Semmar and Laib, 2017), or
for the identification and monitoring of neologisms and technical jargon
(Chatzitheodorou and Kappatos, 2021).
-
MWE processing in low-resource languages: The PARSEME shared tasks (2017
<https://multiword.sourceforge.net/PHITE.php?sitesig=CONF&page=CONF_05_MWE_2…>,
2018
<https://multiword.sourceforge.net/PHITE.php?sitesig=CONF&page=CONF_04_LAW-M…>,
2020
<https://multiword.sourceforge.net/PHITE.php?sitesig=CONF&page=CONF_02_MWE-L…>)
among others, have fostered significant progress in MWE identification,
providing datasets that include low-resource languages, evaluation
measures, and tools that now allow fully integrating MWE identification
into end-user applications. There are continuous efforts in this direction
(Diaz Hernandez, 2024) and a few of them have also explored methods for the
automatic interpretation of MWEs (Bhatia et al., 2018), and their
processing in low-resource languages (Eder et al., 2021). Resource creation
and sharing should be pursued in parallel with the development of
multilingual benchmarks for MWE identification (Savary et al., 2023).
-
MWE identification and interpretation in LLMs: Most current MWE
processing is limited to their identification and detection using
pre-trained language models, but we still lack understanding about how MWEs
are represented and dealt with therein (Garcia et al., 2021), how to better
model the compositionality of MWEs from semantics (Phelps et al., 2024).
Now that NLP has shifted towards end-to-end neural models like BERT,
capable of solving complex tasks with little or no intermediary linguistic
symbols, questions arise about the extent to which MWEs should be
implicitly or explicitly modelled (Shwartz and Dagan, 2019).
-
New and enhanced representation of MWEs in language resources and
computational models of compositionality as gold standards for formative
intrinsic evaluation.
Through this workshop, we will bring together and encourage researchers in
various NLP subfields to submit their MWE-related research, We also intend
to consolidate the converging results of previous joint workshops LAW-MWE-CxG
2018 <http://multiword.sourceforge.net/lawmwecxg2018/>, MWE-WN 2019
<http://multiword.sourceforge.net/mwewn2019/> and MWE-LEX 2020
<http://multiword.sourceforge.net/mwelex2020/>, the joint MWE-WOAH panel in
2021 <https://multiword.org/mwe2021/#program>, the MWE-SIGUL 2022 joint
session <https://multiword.org/mwe2022/>, and the MWE-UD 2024
<https://multiword.org/mweud2024/>, extending our scope to MWEs in
e-lexicons, and WordNets, MWE annotation, as well as grammatical
constructions. Correspondingly, we call for papers on research related (but
not limited) to MWEs and constructions in:
-
Computationally-applicable theoretical work in psycholinguistics and
corpus linguistics;
-
Annotation (expert, crowdsourcing, automatic) and representation in
resources such as corpora, treebanks, e-lexicons, WordNets, constructions
(also for low-resource languages);
-
Processing in syntactic and semantic frameworks (e.g. CCG, CxG, HPSG,
LFG, TAG, UD, etc.);
-
Discovery and identification methods, including for specialized
languages and domains such as clinical or biomedical NLP;
-
Interpretation of MWEs and understanding of text containing them;
-
Language acquisition, language learning, and non-standard language (e.g.
tweets, speech);
-
Evaluation of annotation and processing techniques;
-
Retrospective comparative analyses from the PARSEME shared tasks;
-
Processing for end-user applications (e.g. MT, NLU, summarisation,
language learning, etc.);
-
Implicit and explicit representation in pre-trained language models and
end-user applications;
-
Evaluation and probing of pre-trained language models;
-
Resources and tools (e.g. lexicons, identifiers) and their integration
into end-user applications;
-
Multiword terminology extraction;
-
Adaptation and transfer of annotations and related resources to new
languages and domains including low-resource ones.
Submission formats:
The workshop invites two types of submissions:
-
archival submissions that present substantially original research in
both long paper format (8 pages + references) and short paper format (4
pages + references).
-
non-archival submissions of abstracts describing relevant research
presented/published elsewhere which will not be included in the MWE
proceedings.
Paper submission and templates
Papers should be submitted via the workshop's submission page
<https://openreview.net/group?id=aclweb.org/NAACL/2025/Workshop/MWE> (
https://openreview.net/group?id=aclweb.org/NAACL/2025/Workshop/MWE). Please
choose the appropriate submission format (archival/non-archival). Archival
papers with existing reviews will also be accepted through the ACL Rolling
Review. Submissions must follow the ACL stylesheet
<https://github.com/acl-org/acl-style-files>.
Important Dates
Paper Submission Deadline: February 13, 2025
Notification of acceptance: March 8 2025
Camera-ready papers due: March 17, 2025
Workshop: May 3 or 4, 2025
All deadlines are at 23:59 UTC-12 (Anywhere on Earth).
Organizing Committee
Verginica Barbu Mititelu, Voula Giouli, Grazina Korvel, A. Seza Doğruöz,
Alexandre Rademaker, Atul Kr. Ojha, Mathieu Constant
Anti-harassment policy
The workshop follows the ACL anti-harassment policy
<https://www.aclweb.org/adminwiki/index.php?title=Anti-Harassment_Policy>.
Contact
For any inquiries regarding the workshop, please send an email to the
Organizing Committee at <mweworkshop2023(a)googlegroups.com>
mwe2025workshop(a)gmail.com.
SEMANTiCS 2025 - Call for Workshops and Tutorials
21st International Conference on Semantic Systems
Vienna, Austria
September 03-05, 2025
Important Dates for Workshops:
-
*Proposals WS Deadline: March 22, 2025 (11:59 pm, AoE)*
-
*Notification of Acceptance: 29, 2025 (11:59 pm, AoE)*
Important Dates for Tutorials (and other meetings, e.g. seminars,
show-cases, etc., without call for papers):
-
*Proposals Tutorial Deadline: June 11, 2025 (11:59 pm, AoE)*
-
*Notification of Acceptance: June 18, 2025 (11:59 pm, AoE)*
*Submission via Easychair on
https://easychair.org/conferences/?conf=semantics2025
<https://easychair.org/conferences/?conf=semantics2025>*
*SEMANTiCS Workshops and Tutorials*
SEMANTiCS 2025 is a major venue for research and industrial innovation and
features a workshop and tutorial program addressing the diverse practical
interests of its audience. This program is intended to offer a rich
diversity of topics to conference attendees and local participants seeking
to pick up new skills and stay up-to-date regarding the latest developments
in the community. We encourage submissions of proposals on all topics in
the general areas of SEMANTiCS 2025 and proposals bridging or introducing
new perspectives and/or challenges in these areas. Workshops and tutorials
may incorporate panel discussions, lightning talks, meetings, networking or
hands-on sessions, hackathons and other practical formats where applicable.
Rooms for business or project meetings are available upon request as well.
*Scope and Goals*
Workshops and tutorials at SEMANTiCS 2025 allow your organization or
project to advance and promote your topics and gain increased visibility.
The workshops and tutorials will be announced on the SEMANTiCS website, and
they will be seen by all participants. SEMANTiCS 2025 workshops and
tutorials can be incubators for industrial and scientific communities that
form and share a particular research and development agenda, and they will
provide a forum for presenting contributions and findings to a diverse and
knowledgeable community. Furthermore, the event can be used as a
dissemination activity in the scope of large research projects or as a
closed format for research/commercial project consortia meetings.
*Proceedings*
Workshop papers will be published in the SEMANTiCS side event proceedings
through CEUR. Side events proceedings will include posters & demos and
contributions from workshops.
*Setup and Requirements*
SEMANTiCS 2025 workshops and tutorials may be either half or full-day long.
Workshops and tutorials take place on the days before and/or after the main
SEMANTiCS 2025 EU conference (03th of September 2025). Further details will
be communicated in due time.
Organizers of workshops and tutorials will be granted three free tickets
(only for the workshop & tutorial day) for organization purposes or
keynotes. Participants of workshops and tutorials will only be charged a
reduced fee to cover the basic costs.
Workshop and tutorial proposals must include the following information:
-
outline of the *themes and goals of the event*, including a title and a
brief abstract (less than 200 words) intended for the SEMANTiCS 2025
website.
-
a statement addressing *why the event is important*, *why the event is
timely*, and how it is relevant to SEMANTiCS 2025 and the field of
Semantic Web. For the tutorials, why the presenters are qualified for a
high-quality introduction to the topic.
-
*related workshops and conferences*, i.e., specifying if this is a
continuation of a workshop series or a new workshop. Please provide
information about past versions (in any) and other related workshops
(including URLs and submission/acceptance counts, if available).
-
a statement addressing the *quality assurance criteria* that will be
used by the event organizers to select the papers for the workshops and the
presenters for the tutorials (e.g., peer review or review/evaluation by
event organizers). If a peer review process is chosen as a quality
assurance criterion for the workshops, the organizers will be responsible
for their own reviewing process. Workshop organizers will be responsible
also for their own publicity (e.g., website, timelines and call for papers)
and proceedings production.
-
*structure of the event* and plans for generating and stimulating
discussion; how will the interaction be organized in case of a hybrid event.
-
expected *number of event participants* and (in case of previously held
events) number of registered attendees and website for previous editions of
the event
-
a description of the intended *audience* and the expected learning
*outcomes.*
-
desired *prerequisite* knowledge of the audience.
-
proposed *duration of the event* (i.e., half or full day), different
sessions if applicable (final time slot will be assigned in accordance with
the SEMANTiCS program).
-
any *equipment*, room capacity, or other logistic constraints.
-
full *contact information* of all organizers of the event and main
contact person; a brief description of each *organizer's background*,
including relevant past experience in organizing events.
Proposals for workshop and tutorial proposals must be submitted via
Easychair: *https://easychair.org/conferences/?conf=semantics2025*
<https://easychair.org/conferences/?conf=semantics2025> (max 4 pages)
*Important Dates*
Important Dates for Workshops:
-
*Proposals WS Deadline: 22, 2025 (11:59 pm, AoE)*
-
*Notification of Acceptance: March 29, 2025 (11:59 pm, AoE)*
-
*Workshop website is online: April 15th, 2025*
*Suggested* dates for Workshop organizers (with Call for Papers)
-
*Submission WS papers Deadline: June 14, 2025 (11:59 pm, AoE)*
-
*Notification of Acceptance: July 05, 2025 (11:59 pm, AoE)*
Important Dates for Tutorials (and other meetings, e.g. seminars,
show-cases, etc., without call for papers):
-
*Proposals Tutorial Deadline: June 11, 2025 (11:59 pm, AoE)*
-
*Notification of Acceptance: June 18, 2025 (11:59 pm, AoE)*
*Review and Evaluation Criteria*
Workshop and tutorial proposals will be reviewed by the SEMANTiCS 2025
Workshop Chairs, as well as by the SEMANTiCS 2025 organizing committee,
according to the following criteria:
-
The potential to advance the state of Semantic Web research and practice
-
The quality assurance criteria proposed by the organizers to select
high-quality papers for workshops and presenters for tutorials
-
The organizers' experience and ability to lead a successful event
-
Timeliness and expected interest in the event topics
-
The balance and synergy between all SEMANTiCS 2025 events
*Topics of interest include (but are not limited to):*
-
Web Semantics & Linked (Open) Data
-
Enterprise Knowledge Graphs, Graph Data Management
-
Machine Learning Techniques for/using Knowledge Graphs (e.g.
reinforcement learning, deep learning, data mining and knowledge discovery)
-
Interplay between Large Language Models, generative AI and Knowledge
Graphs (e.g., Retrieval Augmented Generation)
-
Knowledge Management (e.g. acquisition, capture, extraction, authoring,
integration, publication)
-
Terminology, Thesaurus & Ontology Management, Ontology engineering
-
Reasoning, Rules, and Policies
-
Natural Language Processing for/using Knowledge Graphs (e.g. entity
linking and resolution using target knowledge such as Wikidata and DBpedia,
foundation models)
-
Crowdsourcing for/using Knowledge Graphs
-
Data Quality Management and Assurance
-
Mathematical Foundation of Knowledge-aware AI
-
Multimodal Knowledge Graphs
-
Semantics in Data Science
-
Semantics in Blockchain environments
-
Trust, Data Privacy, and Security with Semantic Technologies
-
IoT, Stream Processing, dealing with temporal data
-
Conversational AI and Dialogue Systems
-
Provenance and Data Change Tracking
-
Semantic Interoperability (via mapping, crosswalks, standards, etc.)
-
Linked Data storage, triple stores, graph databases
-
Robust and scalable management, querying and analysis of semantics and
data
-
User interfaces for the Semantic Web & its management
-
Explainable and Interoperable AI
-
Decentralised and Federated Knowledge Graphs (e.g., Federated querying,
link traversal)
-
Application of Semantically-Enriched and AI-based Approaches, such as,
but not limited to:
-
Knowledge Graphs in Bioinformatics, Medical AI and Preventive Healthcare
-
Clinical Use Case of semantic-enabled AI-based Approaches
-
AI for Environmental Challenges
-
Semantics in Scholarly Communication and Scientific Knowledge Graphs
-
AI and LOD within GLAM (galleries, libraries, archives, and museums)
institutions
-
Knowledge Graphs & hybrid AI for predictive maintenance and Industry
4.0/5.0
-
Digital Humanities and Cultural Heritage
-
LegalTech, AI Safety, EU AI Act
-
Economics of Data, Data Services, and Data Ecosystems
We especially invite contributions that illustrate the applicability of the
topics mentioned above for industrial purposes and/or illustrate the
business relevance of their contribution for specific industries. Workshop
proposals on *emerging themes* and *open challenges* for the topics listed
above are encouraged.
In case you have additional questions concerning the submission process,
please do not hesitate to contact us via Easychair.
We are looking forward to your contribution!
*Workshop & Tutorial Chairs:*
-
Daniel Garijo, Universidad Politécnica de Madrid, Spain (email:
daniel.garijo(a)upm.es)
-
David Chaves-Fraga, Universidade de Santiago de Compostela Spain (email:
david.chaves(a)usc.es)
Kind Regards,
On behalf of the organising committee.
=========================
Dr. Kossi Amouzouvi
ScaDS.AI Dresden/Leipzig, TU Dresden
--
DISCLAIMER: The contents of this email and any attachments are
confidential. They are intended for the named recipient(s) only. If you
have received this email by mistake, please notify the sender immediately
and you are herewith notified that the contents are legally privileged and
that you do not have permission to disclose the contents to anyone, make
copies thereof, retain or distribute or act upon it by any means,
electronically, digitally or in print. The views expressed in this
communication may be of a personal nature and not be representative of
AIMS-NEI and/or any of its Centres or Initiatives.
The Reading Concordances in the 21st Century project (RC21) is inviting you to two events on concordance reading at FAU in Erlangen, Germany.
Both events are **free**, but places are limited – please register early.
Thu 20 & Fri 21 March 2025 – RC21 Symposium with poster session
Join us for a two-day symposium with talks by guest speakers and the project team. The talks will cover methodology, theory, and applications of concordance reading.
The symposium includes a poster session, where participants can present their own research.
Poster submissions are open until 31 January 2025.
The following speakers will be presenting:
Laurence Anthony<https://www.dhss.phil.fau.eu/person/laurence-anthony-ph-d-2/>, Waseda University, Tokyo, Japan
Nathan Dykes<https://www.dhss.phil.fau.eu/person/nathan-dykes/>, Friedrich-Alexander-Universität, Erlangen-Nürnberg, Germany
Stephanie Evert<https://www.linguistik.phil.fau.de/person/prof-dr-stephanie-evert/>, Friedrich-Alexander-Universität, Erlangen-Nürnberg, Germany
Susan Hunston<https://www.birmingham.ac.uk/staff/profiles/elal/hunston-susan>, University of Birmingham, United Kingdom
Marc Kupietz<https://www.ids-mannheim.de/digspra/personal/kupietz/>, Leibniz Institute for the German Language (IDS), Mannheim, Germany
Michaela Mahlberg<https://michaelamahlberg.com/>, Friedrich-Alexander-Universität, Erlangen-Nürnberg, Germany
Alexander Piperski<https://www.linguistik.phil.fau.de/person/alexander-piperski/>, Friedrich-Alexander-Universität, Erlangen-Nürnberg, Germany
Patricia Ronan<https://islk.kuwi.tu-dortmund.de/ronan/>, Technical University Dortmund, Germany
Charlotte Taylor<https://profiles.sussex.ac.uk/p329327-charlotte-taylor>, University of Sussex, England
Yukio Tono<https://www.tufs.ac.jp/research/researcher/people/english/tono_yukio.html>, Tokyo University of Foreign Studies, Japan
Valentin Werner<https://www.uni-bamberg.de/eng-ling/personen/werner/>, University of Bamberg, Germany
Viola Wiegand<https://www.stir.ac.uk/people/1844219>, University of Stirling, UK
Wed 19 March 2025 – RC21 Training Day
The Concordance Reading Training Day, which precedes the symposium, includes hands-on sessions on different aspects of concordance reading. Participants need to bring their own devices.
Deadline for registration: 19 February 2025
Deadline for submission of poster abstracts: 31 January 2025
The events are part of the Reading Concordances in the 21st Century project jointly funded by the AHRC and the DFG.
For more information and registration details:
https://www.dhss.phil.fau.eu/reading-concordances-in-the-21st-century-rc21/…
The RC21 team is looking forward to welcoming you to FAU!
Stephanie Evert, Michaela Mahlberg, Nathan Dykes, Sasha Piperski
--
Nathan Dykes
Friedrich-Alexander-Universität Erlangen-Nürnberg
Department Digital Humanities and Social Studies
Werner-von-Siemens-Str. 61
90152 Erlangen
The Survey of English Usage at University College London will be running the 12th Summer School in English Corpus Linguistics online from 25-27 June 2025.
This Summer School is an accessible and inspiring introductory course in English Corpus Linguistics for students of linguistics and students of the English language.
The course will be taught online over three days in the morning (UK time). The course consists of theoretical and practical sessions.
Students are expected to have a basic knowledge of concepts in linguistics, especially grammar.
Places are limited. Be sure to book early to get the early bird rate.
For students in full-time education the course fee includes a free copy of either the ICE-GB Corpus (https://www.ucl.ac.uk/english-usage/projects/ice-gb) or the DCPSE Corpus (https://www.ucl.ac.uk/english-usage/projects/dcpse), with the associated exploration software ICECUP.
For more information about the course, provisional timetable and how to apply, see:
https://www.ucl.ac.uk/english-usage/summer-school<https://www.ucl.ac.uk/english-usage/summer-school/>
Prof. Bas Aarts
Department of English Language and Literature
UCL
Bluesky: @englishgrammar.bsky.social
Substack: https://basaarts.substack.com
Grammarianism Blog: http://bit.ly/1d1zKzN
Continuous Professional Development and INSET courses for teachers: https://bit.ly/39qnKIH
Note: I respect your work/life balance. If I send you an email outside of your normal working hours there is no expectation that you will read or respond to the message at that time.
Dear all,
As every year, the ESRC Centre for Corpus Approaches to Social Science (CASS), Lancaster University offers a free in-person training event: Lancaster Summer schools in corpus linguistics. This year, the summer schools run from 16 to 20 June 2025.
* As part of our innovation programme, we offer a new Corpus linguistics for language testing and assessment <https://wp.lancs.ac.uk/corpussummerschools/language-testing-with-corpus-lin…> stream in addition to
* Corpus linguistics for analysis of language, discourse and society<https://wp.lancs.ac.uk/corpussummerschools/language-discourse-society/>
More info/application form are available here: https://wp.lancs.ac.uk/corpussummerschools/
Please feel to share with postgraduate students and other researchers who might benefit from the training. Please apply early because places are limited.
Best,
Vaclav
Professor Vaclav Brezina
Professor in Corpus Linguistics
Department of Linguistics and English Language
ESRC Centre for Corpus Approaches to Social Science
Faculty of Arts and Social Sciences, Lancaster University
Lancaster, LA1 4YD
Office: County South, room C05
T: +44 (0)1524 510828
[cid:image001.jpg@01DB723D.978AD030]@vaclavbrezina
[cid:image002.jpg@01DB723D.978AD030]<http://www.lancaster.ac.uk/arts-and-social-sciences/about-us/people/vaclav-…>
Apologies for cross-posting.
---------------------------------------------------------------------------
*The Eighth Workshop on Technologies for Machine Translation of
Low-Resource Languages (LoResMT 2025)*
*https://www.loresmt.org/ <https://www.loresmt.org/>*
*@ NAACL 2025 (May 3–4, 2025)*
*Albuquerque, New Mexico, U.S.A.*
*SUBMISSION*
*
<https://openreview.net/group?id=aclweb.org/ACL/2024/Workshop/LoResMT>https://openreview.net/group?id=aclweb.org/NAACL/2025/Workshop/LoResMT
<https://openreview.net/group?id=aclweb.org/NAACL/2025/Workshop/LoResMT>*
*TIMELINE*
*Paper submission due:* *February 13, 2025* (Anywhere on Earth)
*Pre-reviewed (ARR) submission deadline:* *February 27, 2025*
*Notification of acceptance:* March 8, 2025
*Camera-ready papers due:* March 17, 2025 (Anywhere on Earth)
*Pre-recorded video due (hard deadline):* April 8, 2025
*Workshop dates at NAACL 2025:* May 3–4, 2025
*SCOPE*
Based on the success of past low-resource machine translation (MT)
workshops at AMTA 2018, MT Summit 2019, AACL-IJCNLP 2020, AMTA 2021, COLING
2022, EACL 2023, ACL 2024, we introduce LoResMT 2025 workshop at NAACL
2025. The workshop provides a discussion panel for researchers working on
MT systems/methods for low-resource and under-represented languages in
general. We would like to help review/overview the state of MT for
low-resource languages and define the most important directions. We also
solicit papers dedicated to supplementary NLP tools that are used in any
language and especially in low-resource languages. Overview papers of these
NLP tools are very welcome. It will be beneficial if the evaluations of
these tools in research papers include their impact on the quality of MT
output.
*TOPICS*
We are highly interested in (1) original research papers, (2)
review/opinion papers, and (3) online systems on the topics below; however,
we welcome all novel ideas that cover research on low-resource languages.
- Neural machine translation (NMT) for low-resource languages
- Use of LLMs (large language models) for low-resource MT systems
- COVID-related corpora, their translations and corresponding NLP/MT systems
- Work that presents online systems for practical use by native speakers
- Word tokenizers/de-tokenizers for specific languages
- Word/morpheme segmenters for specific languages
- Alignment/Re-ordering tools for specific language pairs
- Use of morphology analyzers and/or morpheme segmenters in MT
- Multilingual/cross-lingual NLP tools for MT
- Corpora creation and curation technologies for low-resource languages
- Review of available parallel corpora for low-resource languages
- Research and review papers on MT methods for low-resource languages
- MT systems/methods (e.g. rule-based, SMT, NMT) for low-resource languages
- Pivot MT for low-resource languages
- Zero-shot MT for low-resource languages
- Fast building of MT systems for low-resource languages
- Re-usability of existing MT systems for low-resource languages
- Machine translation for language preservation
*SUBMISSION INFORMATION*
We are soliciting two types of submissions: (1) research, review, and
position papers and (2) system demonstration papers. For research, review
and position papers, the length of each paper should be at least four (4)
and not exceed eight (8) pages, plus unlimited pages for references. For
system demonstration papers, the limit is four (4) pages. Submissions
should be formatted according to the official ACL style templates
(Overleaf). Please refer to the NAACL submission guideline for further
information <https://2025.naacl.org/calls/papers/#paper-submission-details>.
Accepted papers will be published at ACL Anthology in the NAACL 2025 and
will be presented at the conference.
Submissions must be anonymized and should be done using the provided
submission system. Scientific papers that have been or will be submitted to
other venues must be declared as such and must be withdrawn from the other
venues if accepted and published at LoResMT. The review will be
double-blind. Authors of an accepted paper should present their paper in
person at NAACL 2025. Papers should be submitted in PDF to the LoResMT Open
Review
<https://openreview.net/group?id=aclweb.org/NAACL/2025/Workshop/LoResMT>.
We would like to encourage authors to cite papers written in ANY language
that are related to the topics, as long as both original bibliographic
items and their corresponding English translations are provided.
Registration is handled by the main conference (https://2025.naacl.org/).
*ORGANIZING COMMITTEE (LISTED ALPHABETICALLY)*
Atul Kr. Ojha, University of Galway
Chao-Hong Liu, Potamu Research Ltd
Ekaterina Vylomova, University of Melbourne, Australia
Jonathan Washington, Swarthmore College
Nathaniel Oco, National University (Philippines)
Flammie Pirinen, UiT The Arctic University of Norway, Tromsø
Xiaobing Zhao, Minzu University of China
*PROGRAM COMMITTEE (LISTED ALPHABETICALLY)*
Abigail Walsh, ADAPT Centre, Dublin City University, Ireland
Alberto Poncelas, Rakuten, Singapore
Ali Hatami, University of Galway
Alina Karakanta, Fondazione Bruno Kessler (FBK), University of Trento
Anna Currey, AWS AI Labs
Aswarth Abhilash Dara, Walmart Global Technology
Atul Kr. Ojha, University of Galway & Panlingua Language Processing LLP
Bogdan Babych, Heidelberg University
Chao-hong Liu, Potamu Research Ltd
Constantine Lignos, Brandeis University, USA
Daan van Esch, Google
Dana Moukheiber, Massachusetts Institute of Technology
Ekaterina Vylomova, University of Melbourne, Australia
Eleni Metheniti, CLLE-CNRS and IRIT-CNRS
Flammie Pirinen, UiT Norgga árktalaš universitehta
Gaurav Negi, University of Galway
Jinliang Lu, Institute of automation, Chinese Academy of Sciences
John Philip McCrae, University of Galway
Jonathan Washington, Swarthmore College
Koel Dutta Chowdhury, Saarland University
Majid Latifi, UPC University
Maria Art Antonette Clariño, University of the Philippines Los Baños
Milind Agarwal, George Mason University
Mathias Müller, University of Zurich
Nathaniel Oco, De La Salle University
Pavel Rychlý, Masaryk University and Lexical Computing
Pengwei Li, Meta
Rashid Ahmad, International Institute of Information Technology, Hyderabad
Rico Sennrich, University of Zurich
Santanu Pal, Wipro
Sangjee Dondrub, Qinghai Normal University
Sardana Ivanova, University of Helsinki
Sourabrata Mukherjee, Charles University
Thepchai Supnithi, National Electronics and Computer Technology Center
Timothee Mickus, University of Helsinki
Valentin Malykh, Huawei Noah’s Ark lab and Kazan Federal University
Wen Lai, LMU Munich
Xuebo Liu, Harbin Institute of Technolgy, Shenzhen
Yalemisew Abgaz, Dublin City University
Yasmin Moslem, Bering Lab
Zhanibek Kozhirbayev, National Laboratory Astana, Nazarbayev University
*CONTACT*
Please email loresmt(a)googlegroups.com if you have any
questions/comments/suggestions.