The WNUT Workshop will be collocated with EACL 2024 (Malta). The website for
the workshop is at:
http://noisy-text.github.io/
The WNUT workshop focuses on core NLP tasks (e.g., POS/NER tagging and
translation; not computational social science) over user-generated text, such
as that found on social media, web forums, online reviews, digital health
records, or language learner essays.
We seek submissions of long and short papers on original and unpublished work
(same format and page limit as EACL main conference). All accepted
submissions will be presented as posters. Additionally, selected submissions
will be presented orally. There will be best paper awards for both short and
long papers.
Topics of interest include but are not limited to:
* NLP of noisy text, e.g. POS, NER tagging, Parsing
* Text normalization and error correction
* Paraphrase identification and semantic similarity of short text or noisy
text
* Extracting user demographics, profiles, and major life events
* Machine translation and Multilingual NLP over noisy text
* Information extraction from noisy text, global and regional trend
detection, and event extraction
* Colloquial language, e.g. idiom detection
* Domain adaptation to user-generated text
* Detecting rumors, contradictory information, sarcasm and humor on social
media
* Sentiment analysis
* Temporal aspects of user-generated content (resolving time expressions,
concept drift, etc...)
* Representing and mining language variation in user-generated content
* Processing of automatically generated data
* Robustness to Noise, both Natural and Adversarial
[IMPORTANT DATES]
* Submission Deadline: December 18, 2023 (anytime on earth; dual-submission
allowed)
* Acceptance Notification: January 20, 2024
* Camera-Ready Deadline: january 30, 2024
* Workshop Day: March 21/22, 2024
[INVITED SPEAKERS]
* Su Lin Blodgett
* Jennifer Foster
[ORGANIZERS]
* Tim Baldwin (University of Melbourne)
* Wei Xu (Georgia Institute of Technology)
* Alan Ritter (Georgia Institute of Technology)
* Rob van der Goot (IT University of Copenhagen)
* Max Müller-Eberstein (IT University of Copenhagen)
[SUBMISSION]
Submissions should conform to the ACL style guidelines. Long and short paper
submissions must be anonymized. Please submit your papers via OpenReview:
https://openreview.net/group?id=eacl.org/EACL/2024/Workshop/WNUT
*** Ph.D. Award: First Call for Applications ***
36th International Conference on Advanced Information Systems Engineering
(CAiSE'24)
June 3-7, 2024, 5* St. Raphael Resort and Marina, Limassol, Cyprus
https://cyprusconferences.org/caise2024/
(*** Submission Deadline: 1st March, 2024 AoE ***)
The deadline to apply for the CAiSE 2024 PhD Award is March 1st 2024. The conditions to
apply are:
• having participated as an author in a previous CAiSE Doctoral Consortium or at a main
CAiSE Event: either the main conference, the CAiSE Forum, EMMSAD, or BPMDS;
• having successfully defended the PhD thesis in the last two years (i.e., since January 2022).
The application must be submitted electronically to the PhD Awards track of CAiSE 2024 via
EasyChair <https://easychair.org/conferences/?conf=caise2024> . The application must be
a single PDF file containing:
• a short cover letter that includes the list of PhD committee members,
• a support letter from the thesis advisor,
• the candidate's defended PhD thesis,
• the candidate’s CV.
About the PhD Award
The CAiSE PhD Award 2024 is granted annually to an outstanding recent PhD thesis in the
field of Information Systems Engineering.
The award is co-sponsored by the CAiSE Steering Committee and Springer. It consists of a
certificate, free full registration (5 days) to the next two editions of the CAiSE conference,
and a book voucher for a free selection worth EUR 500 from Springer’s printed books
collection. In addition, the selected thesis will be recommended for publication as a
monograph in the LNBIP series published by Springer, provided that Springer’s publication
conditions are met.
The PhD theses submitted for the award will be reviewed by a standing committee of senior
members selected from the CAiSE Advisory Committee, the CAiSE Steering Committee, and
the CAiSE Program Committee.
Award Chair
Professor Andreas L Opdahl, University of Bergen, Norway
Key Dates
• Submission of application: 1st March, 2024 (AoE)
• Notification: 15th April, 2024
Past Recipents
• 2023: Anna Bernasconi, PhD from Politecninco Milano (Italy), thesis title “Model, Integrate,
Search... Repeat: a Sound Approach to Building Integrated Repositories of Genomic Data” (link to the forthcoming monograph: https://link.springer.com/book/9783031449062)
• 2022: Volodymyr Leno, PhD from University of Melbourne (Australia), thesis title
“Robotic Process Mining: Accelerating the adoption of Robotic Process Automation” (link to the thesis: https://minerva-access.unimelb.edu.au/bitstream/handle/
11343/297274/98f9efca-4dd2-eb11-94dc-0050568d0279_manuscript.pdf)
• 2021: Orlenys Lopez Pintado, PhD from University of Tartu (Estonia), thesis title
“Collaborative Business Process Execution on the Block Chain: the Caterpillar System” (link to the thesis: https://dspace.ut.ee/items/1e09072c-5442-463a-b8c6-0425951cb90b)
• 2020: Steven Mertens, PhD from Ghent University (Belgium), thesis title “Enabling process
management for loosely framed knowledge-intensive processes” (link to the published monograph: https://www.springer.com/gp/book/9783030661922)
• 2019: Giovanni Meroni, PhD from Politecnico di Milano (Italy), thesis title “Artifact-driven
business process monitoring” (link to the published monograph: https://www.springer.com/gp/book/9783030324117)
• 2018: Wei Wang, PhD from University of Queensland (Australia) thesis title “Integrated
Modeling of Business Processes and Business Rules” (link to the published monograph: https://www.springer.com/gp/book/9783030118082)
• 2017: Marcela Ruiz, PhD from the Universidad Politécnica de Valencia (Spain), thesis title
“TraceME: A Traceability-Based Method for Conceptual Model Evolution” (link to the published monograph: https://www.springer.com/gp/book/9783319897158)
• 2016: Le Minh Sang Tran, PhD from University of Trento (Italy), thesis title “Managing the
Uncertainty of the Evolution of Requirements Models” (testimony of the 2016 CAiSE PhD Award winner: https://www.youtube.com/watch?v=q-vvlH66lC4)
* We apologize if you receive multiple copies of this CFP *
For the online version of this Call, visit:
https://easychair.org/cfp/3rdDHandNLP
===============
3rd DHandNLP
Third Workshop on Digital Humanities and Natural Language Processing
Co-located with PROPOR 2024
14-15 March 2024, Universidade de Santiago de Compostela, Galizia, Spain
Website: https://sites.google.com/view/dhandnlp-propor
Submission deadline: 20 January 2024 (23:59 GMT)
Submission link: https://easychair.org/conferences/?conf=3rddhandnlp
3rd DHandNLP is a one-day workshop during PROPOR - 14-15 March 2024
*Workshop description*
Digital humanities (DH) stand at the intersection of computing and the
humanities, involving
collaborative transdisciplinary research. While current DH practice already
shows an impressive
array of new digital tools and methods for the study of the humanities, we
believe that natural
language processing techniques and experience can significantly enhance the
field, while DH
can also bring new testbeds and problems for the NLP community.
As shown in the previous workshops, there is an increasing set of
researchers in the processing
of Portuguese who are interested in this active collaboration, and we
believe that we should
cater for a forum which may join the two communities, DH and NLP,
showcasing several
different aspects allowed by this cross-fertilization.
The 3rdDHandNLP welcomes papers stemming from humanities that deal with
language, such
as philosophy, history, geography, law, philology, linguistics, or
literature, and that can benefit
from a digital approach or enhanced with computational linguistics methods
or techniques, be it
by using large sets of (written or spoken) textual data or by developing
applications for an
increasingly digital world.
We also welcome papers that use “traditional” DH tools or techniques, such
as topic modeling,
and papers that use standard NLP tools that were already applied in
different DH contexts, such
as named entity recognition, document clustering and classification,
sentiment analysis,
dialect/language identification and linked data.
*Main workshop topics*
- Digital philology, critical editions production and textual criticism
- Lexicometrics, lexicology and lexicography
- Visualization or sonification of large textual bodies in specific domains
- Computational stylometry, authorship attribution and profiling
- Distant reading of literature
- Construction of historical thesauri
Finally, we are especially interested in approaches that deal with
historical material, involving
not only historical linguistics but historical lexicology, corpus
processing and their multilingual
analysis.
*SUBMISSION GUIDELINES*
All papers must be anonymous, original and not simultaneously submitted to
another journal or
conference. They must strictly adhere to the submission templates of the
main conference.
We welcome submissions of:
- Short papers, consisting of up to 4 pages of content, plus unlimited
pages of references
- Full papers, consisting of up to 8 pages of content, plus unlimited pages
of references
Kind regards,
Maria José B. Finatto and Leonardo Zilio (on behalf of the organising
committee)
The PROPOR 2024 demonstration program committee invites submissions for
demonstrations. Following the spirit of previous PROPOR editions, the
demonstration track aims at bringing together academia and industry,
creating a forum where more than written or spoken descriptions of research
are available. Thus, demos should allow attendees to try and test them
during their presentation in a dedicated session that will provide a more
informal and interactive setting. Products, systems, or tools are examples
of acceptable demos. Both early-research prototypes and mature systems may
also be considered.
*Important dates:*
Demos Submission: January 10 2024
Notification of acceptance or rejection: February 21 2024
Camera-ready demo paper: February 28 2024
Conference: March 14 and 15 2024
*Topics:*
The areas of interest include all topics related to theoretical and applied
issues of written and spoken Portuguese and Galician, such as, but not
limited to, the same topics as for the conference paper submission:
Natural language processing tasks (e.g. parsing, word sense disambiguation,
coreference resolution)
Natural language processing applications (e.g. question answering,
subtitling, summarization, sentiment analysis)
Natural language generation
Information extraction and information retrieval
Speech technologies (e.g. spoken language generation, speech and speaker
recognition, spoken language understanding)
Speech applications (e.g. spoken language interfaces, dialogue systems,
speech-to-speech translation)
Resources, standardization and evaluation (e.g. corpora, ontologies,
lexicons, grammars)
NLP-oriented linguistic description or theoretical analysis
Distributional semantics and language modeling
Portuguese language varieties and dialect processing (including the
language varieties of Angola, Brazil, Cape Verde, East Timor, Galicia,
Guinea-Bissau, Macau, Mozambique, Portugal, São Tomé, and Principe)
Multilingual studies, methods, applications, and resources including
Portuguese/Galician
The systems may be of the following kinds:
Natural Language Processing systems or system components
Application systems using language technology components
Software tools for computational linguistics research
Software for demonstration or evaluation
Development tools
*Submissions:*
Submissions should consist of a non-anonymous brief description document of
up to three pages of content, including references. Developers must outline
the main characteristics of their system/product/tool, provide sufficient
details to allow its evaluation, and give information on how they plan to
demonstrate it. Developers are encouraged to focus their description on the
relevance of the computational processing component of Portuguese or
Galician in the proposed system.
Submissions should be written in English. At submission time, only PDF
format is accepted. For the final versions, authors of accepted papers will
be given one extra content page to take the reviews into account. Authors
of accepted papers will be requested to send the source files for the
production of the proceedings.
Submissions must be sent via EasyChair (
https://easychair.org/my/conference?conf=propor2024) — please select the
track: PROPOR2024 Demo Paper.
All submitted papers must conform to the official ACL style guidelines. ACL
provides style files for LaTeX and Microsoft Word that meet these
requirements. They can be found at:
LaTeX styelesheet:
https://github.com/acl-org/acl-style-files/tree/master/latex
MS Word stylesheet:
https://github.com/acl-org/acl-style-files/tree/master/word
Publication:
Accepted demo papers are expected to be published by ACL as a volume in ACL
Anthology (https://aclanthology.org/) as part of the PROPOR 2024
proceedings. They will be available online. To ensure publication, at least
one author of each accepted paper must complete an adequate registration
for PROPOR 2024 by the early registration deadline.
*Presentation format:*
Accepted demos will be presented at a designated demo session with an
optional accompanying poster. Developers should make sure they could run
their demos properly. Thus, it is the authors’ responsibility to provide
the necessary technical conditions (i.e. equipment) for the demo at the
conference. Note that the local organizers will not provide any hardware or
software. Free high-speed Internet access will be available.
There will be a best demo award for the best-presented project.
Further details on the date, time, and instructions of the demonstration
session(s) will be determined and provided at a later date.
*Demo chairs:*
Marlo Souza (Universidade Federal da Bahia, Brazil)
Iria de-Dios-Flores (Universidade de Santiago de Compostela, Spain)
--
*Iria de-Dios-Flores (PhD)*
*https://sites.google.com/view/iriadediosflores/
<https://sites.google.com/view/iriadediosflores/>*
We are very pleased to share our first call for papers for our workshop on Reference, Framing, and Perspective co-located with LREC-COLING 2024.
* Workshop website: https://cltl.github.io/reference-framing-perspective/
* When: Saturday, May 25th, 20204
* Where: Torino, Italy (co-located with LREC-COLING 2024)
* Deadline for submissions: February (details tba)
* Paper submission link: tba
* Deadline for camera-ready papers: beginning of April 2024 (details tba)
When something happens in the world, we have access to an unlimited range of ways (from lexical choices to specific syntactic structures) to refer to the same real-world event. Variations in reference may convey radically different perspectives. This process of making reference to something by adopting a specific perspective is also known as framing. Although previous work in is this area is present (see Ali and Hassan (2022)’s survey for an overview), there is a lack of unitary framework and only few targeted datasets (Chen et al., 2019) and tools based on Large Language Models exist (Minnema et al., 2022). In this workshop, we propose to adopt Frame Semantics (Fillmore, 1968, 1985, 2006) as a unifying theoretical framework and analysis method to understand the choices made in linguistic references to events. The semantic frames (expressed by predicates and roles) we choose give rise to our understanding, or framing, of an event. We aim to bring together different research communities interested in lexical and syntactic variation, referential grounding, frame semantics, and perspectives. We believe that there is significant overlap within the goals and interests of these communities, but not the necessary common ground to enable collaborative work.
Shared dataset:
To facilitate discussion among participants and to make this a real working workshop, we make available a shared corpus. The corpus is composed of news articles reporting on the 2020/2021 Eurovision Song Contest (canceled in 2020 and held in 2021) that took place in Rotterdam (the Netherlands). The news articles have been collected using the structured data-to-text approach (Vossen et al., 2018). At this point, the corpus contains texts in English and Dutch. We are extending it to a range of other European languages. We invite participants to submit short and targeted analyses using the data (extended abstracts to be discussed in a hands-on data session). Participants are also free to use the data in regular contributions. More information about the corpus will be released soon.
Regular contributions:
We aim to lay the groundwork for such efforts. We invite contributions (regular long papers of 8 pages or short papers of 4 pages) targeting any of the following - non-exhaustive - list of topics:
* Theoretical models of framing and perspective
* Annotation frameworks for framing and perspectives
* Computational models of framing and perspective
* Approaches for creating and analyzing referentially grounded datasets (containing different perspectives, written at different points in time, written in different languages)
* Approaches for and analyses of texts about contested and divisive events triggering different opinions and perspectives
* Analyses of and methods for analyzing (diachronic) lexical variation and framing
* Language resources for reference, frames, and perspectives
* Approaches and tools to compare claims of sources
* Frames as expressions of bias in the representation of social groups
* User interface for the visualization of multiple perspectives
Extended abstracts:
We invite extended abstracts (1,500 words maximum) about small analyses or experiments conducted on our Shared Data. The abstracts will be non-archival and discussed in a dedicated data session.
Invited speakers:
Maria Antoniak
Vered Shwartz
Organizers:
Pia Sommerauer, Tommaso Caselli, Malvina Nissim, Levi Remijnse, Piek Vossen
***Third Call for Papers***
**Overview**
- Submission of long and short papers: December 18, 2023 (no deadline
extension possible)
- Submission page: https://softconf.com/eacl2024/LAW-XVIII/
- Website: https://sigann.github.io/LAW-XVIII-2024/
**Workshop Description**
LAW-XVIII will be the 18th annual meeting endorsed by the ACL Special
Interest Group for Annotation (SIGANN). It will take place in March 2024 at
EACL in St. Julians, Malta.
Linguistic annotation of natural language corpora is the backbone of
supervised methods in both statistical and neural natural language
processing. Annotated corpora are also a major supporting source of
information for unsupervised methods, multitask learning, and evaluation of
both NLP tools and theories about language within and outside of
linguistics. The LAW-XVIII will provide a forum for presentation and
discussion of innovative research on all aspects of linguistic annotation,
including creation/evaluation of annotation schemes, methods for automatic
and manual annotation, use and evaluation of annotation software and
frameworks, representation of linguistic data and annotations,
semi-supervised “human in the loop” methods of annotation, crowd-sourcing
approaches, and more.
The LAW will also provide a forum for annotation researchers to work
towards standardization, best practices, and interoperability of annotation
information and software.
In line with the EACL main conference, LAW will be hybrid, allowing both
in-person and virtual presentations.
**Special Theme**
The special theme of LAW-XVIII is “Annotation in the Age of Large Language
Models (LLMs).” In addition to LAW’s general topics, we specifically invite
submissions on the following topics:
- Comparison of linguistically annotated datasets vs. datasets created
using large language models. Potential topics include:
- Comparison of models that have been trained on the respective datasets
- Impact of data size of manually annotated resources already available
prior to dataset creation with LLMs
- Is synthetic dataset creation a viable option for non-standard domains,
e.g., the medical domain, where expert knowledge is required?
- Non-performance-related considerations of manual vs. synthetic dataset
creation (e.g., explainability)
- Impact and prevention of test dataset contamination in LLM training
- Usefulness of LLMs for linguistic research (in relation to annotation).
- Any other topics related to the special theme.
**Submissions**
We accept both direct submissions and commitments from ACL Rolling Review
(ARR).
We welcome submissions of long and short papers, posters, and
demonstrations relating to the special theme or any aspect of linguistic
annotation, including:
- Annotation procedures
- Innovative automated and manual strategies for annotation
- Machine learning and knowledge-based methods for automation of corpus
annotation
- Creation, maintenance, and interactive exploration of annotation
structures and annotated data
- Annotation evaluation
- Inter-annotator agreement and other evaluation metrics and strategies
- Qualitative evaluation of linguistic representations
- Innovative means to evaluate annotation quality
- Annotation access and use
- Representation formats/structures for annotations of different phenomena,
especially annotations at multiple levels, and means to explore/manipulate
them
- Linguistic considerations for merging annotations of distinct phenomena
- Annotation schemes, guidelines and standards
- New and innovative annotation schemes, comparison of annotation schemes
- Methodologies and resources for annotation scheme development
- Best practices for annotation procedures and/or development and
documentation of annotation schemes
- Interoperability of annotation formats and/or frameworks among different
systems as well as different tasks, frameworks, modalities, and languages
- Results from the application and evaluation of standards for linguistic
annotation
- Annotation software and frameworks
- Development, evaluation and/or innovative use of annotation software
frameworks
Submissions should report original and unpublished research on topics of
interest to the workshop. We also invite substantiated position papers, in
particular with regard to our special theme. Accepted papers are expected
to be presented at the workshop and will be published in the workshop
proceedings. They should emphasize obtained results rather than intended
work, and should indicate clearly the state of completion of the reported
results.
A paper accepted for presentation at the workshop must not be or have been
presented at any other meeting with publicly available proceedings.
Long/short paper submissions must use the official ACL style templates.
Long papers must not exceed eight (8) pages of content. Short papers and
demonstration papers must not exceed four (4) pages of content. References
do not count against these limits.
Note: The supplementary material does not count towards page limit and
should not be included in the paper, but should be submitted separately
using the appropriate field on the submission website. All submissions must
be in PDF format.
Reviewing of papers will be double-blind. Therefore, the paper must not
include the authors' names and affiliations or self-references that reveal
the authors’ identity--e.g., "We previously showed (Smith, 1991) ..."
should be replaced with citations such as "Smith (1991) previously showed
...". Papers that do not conform to these requirements will be rejected
without review.
Authors of papers that have been or will be submitted to other meetings or
publications must provide this information to the workshop co-chairs (
law-xviii-2024(a)googlegroups.com). Authors of accepted papers must notify
the program chairs within 10 days of acceptance if the paper is withdrawn
for any reason.
We follow previous and current ACL policy to establish an anonymity period
(from submission to author notification) during which non-anonymous posting
of preprints is not allowed. Also included in that policy are instructions
to reviewers to not rate papers down for not citing recent preprints.
Authors are asked to cite published versions of papers instead of preprint
versions when possible.
Papers can be submitted at https://softconf.com/eacl2024/LAW-XVIII/.
If you have any questions, please feel free to contact the program
co-chairs via e-mail or check the workshop website (
https://sigann.github.io/LAW-XVIII-2024/) for updates.
**Dates**
(All submission deadlines are 11:59 p.m. UTC-12:00 “anywhere on Earth” and
will not be extended)
Anonymity period starts: November 18, 2023
Submission of long and short papers: December 18, 2023
ARR Commitment deadline: January 17, 2024
Notification of acceptance: January 20, 2024
Camera-ready papers due: January 30, 2024
Workshop: March 21 or 22, 2024
**Workshop Organizers**
Manfred Stede (Program Co-Chair)
Sophie Henning (Program Co-Chair)
Amir Zeldes (ACL SIGANN President)
Ines Rehbein (ACL SIGANN Secretary)
Apologies for cross-posting.
-------------------------
2nd Workshop on Resources and Technologies for Indigenous, Endangered and
Lesser-resourced Languages in Eurasia (EURALI) @ LREC-COLING 2024
Date: 20-25 May, 2024
Venue: Lingotto Conference Centre - Torino (Italia)
Main website: https://sites.google.com/view/eurali/
<https://sites.google.com/view/eurali/>
LREC-COLING 2024 website: https://lrec-coling-2024.org/
——————————————————————————————————
Workshop overview and objectives
This workshop will focus on the development of language technology
resources and tools for indigenous, endangered and lesser-resourced
languages on the Eurasian continent.
In a media-centric world where language technology allows people to break
cultural and language barriers, it is important that speakers of endangered
and indigenous languages can be empowered to use this technology to
continue to share their knowledge and culture with the world. With the hope
of bridging this gap, the goal of this workshop is to increase visibility
and promote research for lesser-resourced and underrepresented languages in
Europe and Asia. Through collaboration between NLP researchers, language
experts and linguists working for the benefit of endangered languages in
these communities, we aim to create language technology resources that will
help to preserve and revive these languages for future generations.
Furthermore, the workshop aims to promote the emergence of new methods that
benefit linguists, for instance for automation of analysis and validation
processes, field linguists, the facilitation of data collection and
analysis processes, and computational linguists by developing new
techniques necessary for linguistic analysis, development of supervised or
weakly supervised methods for the analysis of poorly written or
undocumented languages.
The main objective of the workshop is to create basic resources and develop
tools for Eurasiatic languages, including but not limited to the following
topics:
-
identifying languages and variants spoken in these regions
-
creation of language resources and applications, e.g. sentiment
analysis, named entity recognition, and syntactic parsing
-
standardization for endangered languages
-
automatic identification and classification of lexical variation and
language varieties
-
adaptation of fundamental NLP tools for these languages, e.g.,
morphological analysis, taggers and parsers
-
reusability of language resources in NLP applications, e.g. machine
translation, and POS tagging
-
machine translation between closely related languages
evaluation of language resources and tools when applied to lesser-resourced
languages in the same language families
-
corpora, resources, and tools for closely related languages
-
linguistic and textual similarities among languages in Eurasia
-
digitalization of endangered languages
-
challenges in the creation of language resources and tools from
linguistic perspectives (which includes any perspective formal theory)
Submissions
We are seeking submissions under the following category:
Full papers: 8 pages+unlimited reference
Short papers (work in progress): 4 pages+unlimited reference
Posters (innovative ideas/proposals, a research idea of students): 4
pages+unlimited reference
Demo (of working online/standalone systems): 2 pages
Papers must describe original, completed or in progress, and unpublished
work. The accepted papers will be given up for full/short paper and poster
in the workshop proceedings and will be presented as an oral presentation
or poster.
Papers should be formatted according to the LREC-COLING style sheet (
https://lrec-coling-2024.org/authors-kit/), which is provided on the
LREC-COLING 2024 website(https://lrec-coling-2024.org/). Please submit
papers in PDF format to the START account (the submission link will be
available soon). For further information on this initiative, please refer
to the https://sites.google.com/view/eurali/.
Important Dates (tentative)
February 23, 2024: Paper submissions due
March 22, 2024: Paper notification of acceptance
May 20-25, 2024: Workshop
Workshop Chair:
Atul Kr. Ojha, Sina Ahmadi,
Chao-Hong Liu, Potamu Research Ltd, Dublin (Ireland)
John P. McCrae, University of Galway, Galway (Ireland)
Theodorus Fransen, Università Cattolica del Sacro Cuore, Milan (Italy)
Silvie Cinkovà, Charles University, Prague (Czech Republic)
Programme Committee (to be updated):
Abigail Walsh*, Dublin City University, Dublin (Ireland)
Agata Savary, University of Paris-Saclay, Paris-Saclay (France)
A. Seza Doğruöz, Ghent University, Ghent (Belgium)
Alina Karakanta, University of Leiden, Leiden (Netherlands)
Alina Wróblewska, Institute of Computer Science, Jana Kazimierza, Warszawa
(Poland)
Akanksha Bansal, Panlingua, Delhi (India)
Anabela Barreiro*, INESC-ID, Lisboa (Portugal)
Atul Kr. Ojha, University of Galway, Galway (Ireland) & Panlingua, (India)
Bharathi Raja Chakravarthi, University of Galway, Galway (Ireland)
Bogdan Babych, Heidelberg University, Heidelberg (Germany)
Chao-Hong Liu, Potamu Research Ltd, Dublin (Ireland)
Daan van Esch, Google, Amsterdam (Netherlands)
Daniel Zeman, Charles University, Prague (Czech Republic)
Deepak Alok, IIT-Delhi, Delhi (India)
Dorothee Beermann, Norwegian University of Science and Technology,
Trøndelag (Norway)
Esha Banerjee, J.P. Morgan, Bengaluru (India)
Ekaterina Vylomova, University of Melbourne, Melbourne (Australia)
George Rehm, GmbH, Berlin (Germany)
Jamal Abdul Nasir, University of Galway, Galway (Ireland)
Joakim Nivre, Uppsala University, (Sweden)
John P. McCrae, University of Galway, (Ireland)
Jonathan Washington, Swarthmore College, Swarthmore (USA)
Joseph Mariani, LIMSI-CNRS, Pairs (France)
Kaja Dobrovoljc, University of Ljubljana, Ljubljana (Slovenia)
Katharina Kann*, University of Colorado at Boulder, USA
Kevin Patrick Scannell, Cadhan Aonair, LLC, Missouri (USA)
Khalid Choukri, ELDA/ELRA, Paris (France)
Marie-Catherine de Marneffe, UCLouvainCollège Léon Durpiez, (Belgium)
Massimo Monaglia, University of Florence, (Italy)
Nicoletta Calzolari, CNR-ILC, (Italy)
Olesea Caftanatov, Vladimir Andrunachievici Institute of Mathematics and
Computer Science, Chişinău (Moldova)
Richard Sproat, Google, Tokyo (Japan)
Rico Sennrich, University of Zurich, Zurich (Switzerland)
Ritesh Kumar, Agra University, Agra (India)
Saliha Muradoglu, Australian National University, Canberra (Australia)
Silvie Cinkovà, Charles University, Prague (Czech Republic)
Sina Ahmadi, George Mason University, (USA)
Stella Markantonatou, Athena RC, Athens (Greece)
Sourabrata Mukherjee, Charles University, Prague (Czech Republic)
Sunipa Dev, Google, Washington (USA)
Theodorus Fransen, Università Cattolica del Sacro Cuore, Milan (Italy)
Valentin Malykh, MTS AI / ITMO University
Verginica Barbu Mititelu, Research Institute for Artificial Intelligence,
Bucharest (Romania)
Voula Giouli, Institute for Language and Speech Processing, Athens (Greece)
First International Conference on Natural Language Processing
and Artificial Intelligence for Cyber Security
(NLPAICS’2024)
Lancaster University, Lancaster, United Kingdom
29-30 July 2024
https://www.nlpaics.com
First Call for Papers
Recent advances in Natural Language Processing (NLP), Deep Learning and Large Language Models (LLMs) have resulted in improved performance of applications. . In particular, there has been a growing interest in employing AI methods in different Cyber Security applications.
In today's digital world, Cyber Security has emerged as a heightened priority for both individual users and organisations. As the volume of online information grows exponentially, traditional security approaches often struggle to identify and prevent evolving security threats. The inadequacy of conventional security frameworks highlights the need for innovative solutions that can effectively navigate the complex digital landscape for ensuring robust security. NLP and AI in Cyber Security have vast potential to significantly enhance threat detection and mitigation by fostering the development of advanced security systems for autonomous identification, assessment, and response to security threats in real-time. Recognising this challenge and the capabilities of NLP and AI approaches to fortify Cyber Security systems, the First International Conference on Natural Language Processing (NLP) and Artificial Intelligence (AI) for Cyber Security (NLPAICS’2024) serves as a gathering place for researchers in NLP and AI methods for Cyber Security. We invite contributions that present the latest NLP and AI solutions for mitigating risks in processing digital information.
Conference topics
The conference invites submissions on a broad range of topics related to the employment of NLP and AI (and in general, language studies and models) for Cyber Security including but not limited to:
Societal and Human Security and Safety
• Content Legitimacy and Quality
o Detection and mitigation of hate speech and offensive language
o Fake news, deepfakes, misinformation and disinformation
o Detection of machine generated language in multimodal context (text, speech and gesture)
o Trust and credibility of online information
• User Security and Safety
o Cyberbullying and identification of internet offenders
o Monitoring extremist fora
o Suicide prevention
o Clickbait and scam detection
o Fake profile detection in online social networks
• Technical Measures and Solutions
o Social engineering identification, phishing detection
o NLP for risk assessment
o Controlled languages for safe messages
o Prevention of malicious use of ai models
o Forensic linguistics
• Human Factors in Cyber Security
Speech Technology and Multimodal Investigations for Cyber Security
• Voice-based security: Analysis of voice recordings or transcripts for security threats
• Detection of machine generated language in multimodal context (text, speech and gesture)
• NLP and biometrics in multimodal context
Data and Software Security
• Cryptography
• Digital forensics
• Malware detection, obfuscation
• Models for documentation
• NLP for data privacy and leakage prevention (DLP)
• Addressing dataset “poisoning” attacks
Human-Centric Security and Support
• Natural language understanding for chatbots: NLP-powered chatbots for user support and security incident reporting
• User behaviour analysis: analysing user-generated text data (e.g., chat logs and emails) to detect insider threats or unusual behaviour
• Human supervision of technology for Cyber Security
Anomaly Detection and Threat Intelligence
• Text-Based Anomaly Detection
o Identification of unusual or suspicious patterns in logs, incident reports or other textual data
o Detecting deviations from normal behaviour in system logs or network traffic
• Threat Intelligence Analysis
o Processing and analysing threat intelligence reports, news, articles and blogs on latest Cyber Security threats
o Extracting key information and indicators of compromise (IoCs) from unstructured text
Systems and Infrastructure Security
• Systems Security
o Anti-reverse engineering for protecting privacy and anonymity
o Identification and mitigation of side-channel attacks
o Authentication and access control
o Enterprise-level mitigation
o NLP for software vulnerability detection
• Malware Detection through Code Analysis
o Analysing code and scripts for malware
o Detection using NLP to identify patterns indicative of malicious code
Financial Cyber Security
• Financial fraud detection
• Financial risk detection
• Algorithmic trading security
• Secure online banking
• Risk management in finance
• Financial text analytics
Ethics, Bias, and Legislation in Cyber Security
• Ethical and Legal Issues
o Digital privacy and identity management
o The ethics of NLP and speech technology
o Explainability of NLP and speech technology tools
o Legislation against malicious use of AI
o Regulatory issues
• Bias and Security
o Bias in Large Language Models (LLMs)
o Bias in security related datasets and annotations
Datasets and resources for Cyber Security Applications
Specialised Security Applications and Open Topics
• Intelligence applications
• Emerging and innovative applications in Cyber Security
Submissions and Publication
NLPAICS welcomes high-quality submissions in English, which can take two forms:
• Regular long papers: These can be up to eight (8) pages long, presenting substantial, original, completed, and unpublished work.
• Short papers: These can be up to four (4) pages long and are suitable for describing small, focused contributions, negative results, system demonstrations, etc.
Note that the page limits mentioned above exclude additional pages for references, ethical considerations, conflict-of-interest statements, as well as data and code availability statements.
Papers must be anonymised to support double-blind reviewing.
Please submit your work as pdf using the following link: https://softconf.com/n/nlpaics2024/
Submission templates can be accessed here: LaTeX Overleaf, LaTeX , MS Office
Accepted papers, including both long and short papers, will be published as part of the same e-proceedings to be uploaded on ACL Anthology.
Important dates
• Submissions due: 5 April 2024
• Reviewing process: 25 April-31 May 2024
• Notification of acceptance: 5 June 2024
• Camera-ready due: 20 June 2024
• Conference: 29-30 July 2024
Venue
The First International Conference on Natural Language Processing and Artificial Intelligence for Cyber Security (NLPAICS’2024) will take place at Lancaster University and is organised by the Lancaster University UCREL NLP research group.
Organisation
• Conference Chair
o Ruslan Mitkov (Lancaster University)
• Conference Programme Chairs
o Cengiz Acartürk (Jagiellonian University)
o Matthew Bradbury (Lancaster University)
o Mo El-Haj (Lancaster University)
o Paul Rayson (Lancaster University)
• Sponsorship Chair
o Saad Ezzini (Lancaster University)
• Publicity Chair
o Tharindu Ranasinghe (Aston University)
• Publication Chair
o Ignatius Ezeani (Lancaster University)
Contact
For further information please refer to the conference website (nlpaics.com) for regular updates. General queries can be emailed to info(a)nlpaics.com
First CfP: Joint Workshop on Multiword Expressions and Universal
Dependencies (MWE-UD 2024)
Co-located with LREC-COLING 2024
Torino, Italy and online
May 25, 2024
Workshop Webpage: https://multiword.org/mweud2024/
We are pleased to announce that the multiword expressions (MWE) and
Universal Dependencies (UD) research communities are joining forces in 2024
to organize a joint workshop. This is a timely collaboration because the
two communities clearly have overlapping interests. For instance, while UD
has several dependency relations that can be used to annotate MWEs, both
annotation guidelines (i.e. is syntactic irregularity and inflexibility or
semantic non-compositionality the leading criterion?) and annotation
practice (both across treebanks for a single language and across languages)
for these relations can be improved (Schneider and Zeldes, 2021). The
PARSEME MWE-annotated corpora for 26 languages build on UD annotated
corpora (Savary et al., 2023). Both communities share an interest in
developing guidelines, data-sets, and tools that can be applied to a wide
range of typologically diverse languages, raising fundamental questions
about tokenization, lemmatization, and morphological decomposition of
tokens. Proposals for harmonizing annotation practices between what has
been achieved in PARSEME and UD and expanding PARSEME MWE annotation to
non-verbal MWEs are also central to the recently started UniDive COST
action (CA21167) <https://unidive.lisn.upsaclay.fr/doku.php?id=start>.
The workshop invites submissions of original research on MWE, UD, and the
interplay of both. In particular, the following topics are especially
relevant:
-
Sensitivity of LLMs to MWE and syntactic dependencies. Studies along the
lines of Manning et al. (2020) (UD), Nedumpozhimana and Kelleher (2021),
Garcia et al. (2021), Fakharian and Cook (2021), Moreau et al. (2018)
(MWE), and others on the question to what extent LLMs make use of syntactic
dependencies or are capable of detecting MWEs and capturing their
semantics.
-
Applicability of UD and MWE annotation and discovery for low-resource
and typologically diverse languages and language varieties. Both UD and
PARSEME aim at universal applicability across a wide range of languages.
Much theoretical, computational, and empirical work concentrates on
high-resource languages however. Applying these frameworks to typologically
diverse languages may lead one to reconsider the notion of token, word, and
morphological segmentation, and to reassess the notion of MWE for languages
that feature compounding or incorporation (Baldwin et al., 2021;
Haspelmath, 2023).
-
Case studies. Studies on the consistency, coverage or universal
applicability of MWE annotation in the UD or PARSEME frameworks, as well as
studies on automatic detection and interpretation of MWEs in corpora.
-
MWE and UD processing to enhance end-user applications. MWEs have gained
particular attention in end-user applications, including MT (Zaninello and
Birch, 2020; Han et al., 2021), simplification (Kochmar et al., 2020),
language learning and assessment (Paquot et al., 2019; Christiansen and
Arnon, 2017), social media mining (Maisto et al., 2017), and abusive
language detection (Zampieri et al., 2020; Caselli et al., 2020). We
believe that it is crucial to extend and deepen these first attempts to
integrate and evaluate MWE technology in these and further end-user
applications.
-
Testing developed systems on the latest dataset versions. Authors are
also encouraged to submit papers that test the developed systems using the
recent UD 2.13 and/or PARSEME 1.3 releases.
Organizational Details
-
The workshop is sponsored by ACL-SIGLEX <https://siglex.org/> and UniDive
<https://unidive.lisn.upsaclay.fr/doku.php?id=start>.
-
UniDive members with accepted papers may be eligible for travel
reimbursement.
-
If you are based in an underrepresented country or work on low-resource
languages and have an accepted paper, you may be eligible for an ACL-SIGLEX
travel grant of up to 500 USD.
-
The workshop follows LREC-COLING’s hybrid online/onsite format.
-
Workshop proceedings will be published in the ACL Anthology.
-
The workshop follows the ACL anti-harassment policy
<https://www.aclweb.org/adminwiki/index.php/Anti-Harassment_Policy>.
Submission Instructions
The workshop invites two types of submissions:
-
archival submissions that present substantially original research in
both long paper format (8 pages + references) and short paper format (4
pages + references)
-
non-archival submissions of abstracts describing relevant research
presented/published elsewhere which will not be included in the MWE-UD
proceedings.
Papers should be submitted via the workshop’s START submission page (link
will be provided once available). Please choose the appropriate submission
format (archival/non-archival). Submissions must follow the LREC-COLING
2024 stylesheet <https://lrec-coling-2024.org/authors-kit/>.
When submitting a paper from the START page, authors will be asked to
provide essential information about resources (in a broad sense, i.e. also
technologies, standards, evaluation kits, etc.) that have been used for the
work described in the paper or are a new result of your research. Moreover,
ELRA encourages all LREC-COLING authors to share the described LRs (data,
tools, services, etc.) to enable their reuse and replicability of
experiments (including evaluation ones)
Archival papers with existing reviews from ACL Rolling Review will also be
considered. A paper may not be simultaneously under review through ARR and
MWE-UD. A paper that has or will receive reviews through ARR may not be
submitted for review to MWE-UD.
Important Dates (Tentative)
Paper submission: Feb 25, 2024
ARR paper commitment: Mar 25, 2024
Notification of acceptance: Apr 1, 2024
Camera ready papers due: Apr 8, 2024
Workshop: May 25, 2024
All deadlines are at 23:59 UTC-12 (Anywhere on Earth).
Organizing Committee
Archna Bhatia, Gosse Bouma, Kilian Evang, Marcos Garcia, Voula Giouli,
Lifeng Han, Joakim Nivre.
For any inquiries contact the Organizing Committee at
mweud2024-organizers(a)uni-duesseldorf.de.
Paris Graduate school of Linguistics offers 12 Masters specialties.
Application for September 2024 is open
https://mobility.smarts-up.fr/
Choose one or our master specialties and apply for a 10 000 euros /year
grant
https://paris-gsl.org/applying <https://paris-gsl.org/>
Deadline: January 19th 2024 (5 pm Paris time)
Please circulate this information
Anne Abeillé
Professor
Head of Paris Graduate School of Linguistics
https://paris-gsl.org/
Université Paris Cité