BIONLP 2024 and Shared Tasks @ ACL 2024
https://aclweb.org/aclwiki/BioNLP_Workshop
*Tentative* Important Dates
(All submission deadlines are 11:59 p.m. UTC-12:00 “anywhere on Earth”)
Paper submission deadline: May 17 (Friday), 2024
Notification of acceptance: June 17 (Monday), 2024
Camera-ready paper due: July 1 (Monday), 2024
Workshop: August 16, 2024, Location: LOTUS SUITE 12
Please watch for the updates!
SUBMISSION INSTRUCTIONS
-----------------------------------------
Two types of submissions are invited: full papers and short papers.
Full papers should not exceed eight (8) pages of text, plus unlimited
references. These are intended to be reports of original research. BioNLP
aims to be the forum for interesting, innovative, and promising work
involving biomedicine and language technology, whether or not yielding high
performance at the moment. This by no means precludes our interest in and
preference for mature results, strong performance, and thorough
evaluation. Both types of research and combinations thereof are
encouraged.
Short papers may consist of up to four (4) pages of content, plus unlimited
references. Appropriate short paper topics include preliminary results,
application notes, descriptions of work in progress, etc.
Electronic Submission
Submissions must be electronic and in PDF format, using the Softconf START
conference management system
Submissions need to be anonymous.
Submission site for the workshop https://softconf.com/acl2024/BioNLP2024
Please follow the ACL formatting guidelines:
https://github.com/acl-org/acl-style-files
Dual submission policy: papers may NOT be submitted to the BioNLP workshop
if they are or will be concurrently submitted to another meeting or
publication.
INVITED TALK
-----------------------------------------
Titipat Achakulvisut. Biomedical and Data (Bio-Data) lab at Mahidol
University
WORKSHOP OVERVIEW AND SCOPE
-----------------------------------------
The BioNLP workshop, associated with the ACL SIGBIOMED special interest
group, is an established primary venue for presenting research in language
processing and language understanding for the biological and medical
domains. The workshop has been running every year since 2002 and continues
getting stronger. Many other emerging biomedical and clinical language
processing workshops can afford to be more specialized because BioNLP truly
encompasses the breadth of the domain and brings together researchers in
bio- and clinical NLP from all over the world.
BioNLP 2024 will be particularly interested in transparency of the
generative approaches and factuality of the generated text. Language
processing that supports DEIA (Diversity, Equity, Inclusion and
Accessibility) is still of utmost importance. The work on detection and
mitigation of bias and misinformation continues to be of interest. Research
in languages other than English, particularly, under-represented languages,
and health disparities are always of interest to BioNLP. Other active areas
of research include, but are not limited to:
Tangible results of biomedical language processing applications;
Entity identification and normalization (linking) for a broad range of
semantic categories;
Extraction of complex relations and events;
Discourse analysis; Anaphora \& coreference resolution;
Text mining \& Literature based discovery;
Summarization;
Text simplification;
Question Answering;
Resources and strategies for system testing and evaluation;
Infrastructures and pre-trained language models for biomedical NLP;
Processing and annotation platforms;
Synthetic data generation \& data augmentation;
Translating NLP research into practice;
Getting reproducible results.
SHARED TASKS
-----------------------------------------
1. Clinical Text generation
Task 1: Radiology Report Generation
An important medical application of natural language generation (NLG) is to
build assistive systems that take X-ray images of a patient and generate a
textual report describing clinical observations in the images. This is a
clinically important task, offering the potential to reduce radiologists’
repetitive work and generally improve clinical communication. This shared
task is using the first large-scale collection of RRG datasets based on
MIMIC-CXR, CheXpert, PadChest and CANDID-PTX. Participants will need to
generate findings and impression from chest x-rays and will be evaluated on
a common leaderboard with recent proposed metrics such as F1-Radgraph and
RadCliQ. This shared task aims to benchmark recent progress using common
data splits and evaluation implementations.
See details at https://stanford-aimi.github.io/RRG24/
Task 2: Discharge Me!
The primary objective of this task is to reduce the time and effort
clinicians spend on writing detailed notes in the electronic health record
(EHR). Clinicians play a crucial role in documenting patient progress in
discharge summaries, but the creation of concise yet comprehensive hospital
course summaries and discharge instructions often demands a significant
amount of time, especially since these sections cannot be readily copied
from prior notes. This can lead to clinician burnout and operational
inefficiencies within hospital workflows. By streamlining the generation of
these sections, we can not only enhance the accuracy and completeness of
clinical documentation but also significantly reduce the time clinicians
spend on administrative tasks, ultimately improving patient care quality.
See details at https://stanford-aimi.github.io/discharge-me/
2. BioLaySumm
This shared task surrounds the abstractive summarization of biomedical
articles, with an emphasis on catering to non-expert audiences through the
generation of summaries that are more readable, containing more background
information and less technical terminology (i.e., a “lay summary”).
This is the 2nd iteration of BioLaySumm, following the success of the 1st
edition of the task at BioNLP 2023 which attracted 56 submissions across 20
different teams. In this edition, we aim to build on last year’s task by
introducing a new test set, updating our evaluation protocol, and
encouraging participants to explore novel approaches that will help to
further advance the state-of-the-art for Lay Summarization.
See details at https://biolaysumm.org/
Organizers
-----------------------------------------
* Dina Demner-Fushman, US National Library of Medicine
* Sophia Ananiadou, National Centre for Text Mining and University of
Manchester, UK
* Makoto Miwa, Toyota Technological Institute, Japan
* Kirk Roberts, UTHealth, Houston, Texas
* Jun-ichi Tsujii, National Institute of Advanced Industrial Science and
Technology, Japan
On behalf of the Association of Cyber Forensics and Threat
Investigators (ACFTI), I am pleased to invite you to the new
Cybersecurity stream lecture/seminar series.
The presentation is a maximum of 1 hour in length, with an audience of
about 60+, made up of undergraduate and postgraduate students plus
cybersecurity students from developing countries. Our goal is to shine
a spotlight on the broad array of new advances in cybersecurity
science and operations currently adopted in the industry. This session
will be conducted online. It will be fantastic to have any hands-on
topics related to cyber forensics.
Your discussion on this topic will be a great addition to our event.
Expressions of interest to present from anyone doing research or
applying cybersecurity techniques to practical or theoretical
applications related to the interactions between cyber forensics and
threat investigations can be sent as a summary of your work (c.200
words) to acfti (at) acfti (dot) org by February 15, 2024
Thank you in advance for your consideration, and we are very much
looking forward to hearing from you.
To get more news about our events, please join our low-traffic
announcement group @ https://groups.google.com/g/acfti
________________________________________________________
Association of Cyber Forensics and Threat Investigators
https://www.acfti.org
Twitter: @acfti
We are very pleased to share our second call for papers for our workshop on Reference, Framing, and Perspective co-located with LREC-COLING 2024.
Quick overview:
* Workshop website: https://cltl.github.io/reference-framing-perspective/
* When: Saturday, May 25th, 20204
* Where: Torino, Italy (co-located with LREC-COLING 2024)
* Deadline for submissions: February 20th, 2024
* Paper types: regular papers (short and long) and extended abstracts
* Paper submission link: https://softconf.com/lrec-coling2024/reference-framing-perspective2024/user…
* Deadline for camera-ready papers: March 29th, 2024
* Shared dataset: https://github.com/cltl/rfp_corpus_collection
When something happens in the world, we have access to an unlimited range of ways (from lexical choices to specific syntactic structures) to refer to the same real-world event. We can chose to express information explicitly or imply it. Variations in reference may convey radically different perspectives. This process of making reference to something by adopting a specific perspective is also known as framing. Although previous work in this area is present (see Ali and Hassan (2022)’s survey for an overview), there is a lack of a unitary framework and only few targeted datasets (Chen et al., 2019) and tools based on Large Language Models exist (Minnema et al., 2022). In this workshop, we propose to adopt Frame Semantics (Fillmore, 1968, 1985, 2006) as a unifying theoretical framework and analysis method to understand the choices made in linguistic references to events. The semantic frames (expressed by predicates and roles) we choose give rise to our understanding, or framing, of an event. We aim to bring together different research communities interested in lexical and syntactic variation, referential grounding, frame semantics, and perspectives. We believe that there is significant overlap within the goals and interests of these communities, but not necessarily the common ground to enable collaborative work.
Referentially Grounded Shared Dataset
One way to study variation in framing is to conduct contrastive analyses of texts reporting on the same real-world event. Such an analysis can help to reveal the extent of variation in framing and possibly give rise to the underlying factors that lead to different choices in framing the same event. We collected such a corpus about the Eurovision Song Festival and make it available as a Shared Dataset for the Workshop. The purpose of this corpus is to enable exploratory analyses, facilitate discussion among participants, and, last but not least, make our workshop a real working workshop.
The corpus is composed of news articles reporting on the Eurovision Song Contest that took place in Rotterdam, the Netherlands (canceled in 2020 and held in 2021). The news articles have been collected using the structured data-to-text approach (Vossen et al., 2018). The corpus contains news articles in multiple languages. We invite participants to submit short and targeted analyses using the data (extended abstracts to be discussed in a hands-on data session). Participants are also free to use the data in regular contributions.
Regular contributions:
We aim to lay the groundwork for such efforts. We invite contributions (regular long papers of 8 pages or short papers of 4 pages) targeting any of the following - non-exhaustive - list of topics:
* Theoretical models of framing and perspective
* Annotation frameworks for framing and perspectives
* Computational models of framing and perspective
* Approaches for creating and analyzing referentially grounded datasets (containing different perspectives, written at different points in time, written in different languages)
* Approaches for and analyses of texts about contested and divisive events triggering different opinions and perspectives
* Analyses of and methods for analyzing (diachronic) lexical variation and framing
* Language resources for reference, frames, and perspectives
* Approaches and tools to compare claims of sources
* Frames as expressions of bias in the representation of social groups
* User interface for the visualization of multiple perspectives
Extended abstracts:
We invite extended abstracts (1,500 words maximum) about small analyses or experiments conducted on our Shared Data. The abstracts will be non-archival and discussed in a dedicated data session.
Invited speakers:
Maria Antoniak
Vered Shwartz
Organizers:
Pia Sommerauer, Tommaso Caselli, Malvina Nissim, Levi Remijnse, Piek Vossen
Apologies for cross-posting.
---------------------------------------------------------------------------
The *9th Workshop on Representation Learning for NLP (RepL4NLP 2024)*,
co-located with ACL 2024 in Bangkok, Thailand, invites papers of a
theoretical or experimental nature describing recent advances in vector
space models of meaning, compositionality, and the application of deep
neural networks and spectral methods to NLP. We welcome submissions on
representations of text, as well as representations that are multi-modal,
cross-lingual, representations of symbolic languages, code, enriched with
external knowledge, or structure-informed (syntax, morphology, etc).
*Topics for the workshop will include, but are not limited to:*
- *Developing new representations*: at any level of granularity
(document to character) using supervised, unsupervised or semi-supervised
techniques for a multitude of tasks such as language modeling, similarity
search, clustering, etc.
- *Efficient learning of representations*: with respect to training and
inference time, model size, amount of training data, etc.
- *Evaluating representations*: with respect to training objectives (for
LLMs: next token prediction, RLHF, span-mask denoising, etc), types of test
data (e.g., text vs code), and architectures (decoder-only,
encoder-decoder, etc), as well as assessing representations for
generalization, compositionality, and robustness (e.g., adversarial), etc.
- *Representation analysis*: methods for visualizing, explaining, and
inspecting specific properties of representations (e.g., through probing),
enhancing their interpretability, investigating their influence on the
model's behavior, assessing the causal impact of interventions within the
representation space on the model's behavior, etc.
- *Relating representation to behavior*: whether, and to what extent, a
model’s representations cause, condition, or boost its behavior (e.g., for
LLMs: the relationship between encoded knowledge and task performance). Is
possessing good representations necessary or sufficient for solving a task?
Vice versa, is model behavior informative of its learned representations?
*Key Dates*
Direct paper submission deadline: May 17, 2024
ARR commitment deadline: June 1, 2024
Notification of acceptance: June 17, 2024
Camera-ready papers due: July 1, 2024
Workshop date: Aug 16, 2024
*Submissions*Papers may be long (maximum 8 pages plus references) or short
(maximum 4 pages plus references). We encourage authors to include a
broader impact and ethical concerns statement, following ARR Ethics Policy
from the main conference. Papers can be submitted directly via OpenReview.
*ACL 2023 fast-track submissions*Papers submitted to the ACL 2024 main
conference that have not been selected can be submitted to the RepL4NLP
2024 fast-track. We will then make a decision based on your reviews
received from ACL 2024. Note that you do not need to submit the reviews
received from ACL 2024.
*Website*
https://sites.google.com/view/repl4nlp2024/
*Organizers*
Chen Zhao, New York University Shanghai
Marius Mosbach, Saarland University
Pepa Atanasova, University of Copenhagen
Seraphina Goldfarb-Tarrent, Cohere
Peter Hase, University of North Carolina at Chapel Hill
Arian Hosseini, University of Montreal
Maha Elbayad, Meta AI
Sandro Pezzelle, University of Amsterdam
Maximilian Mozes, University College London
Dear colleagues
I would like to invite you to register and join us *online *or ** in person
** in Qatar Foundation Minaretein building Auditorium on Feb 19th 2024 as
of 9AM to attend two panels related to Data sciences, AI, LLM, NLP and
Social computing to discuss topics such as Hate Speech Detection, Fake news
Detection and Text analytics with an impressive lineup of International
speakers such as *Dr. Kareem Darwish *(AIXPLAIN), *Dr. Kiran
Garimella* (Rutgers
University), Tuğrulcan Elmas ( University of Edinburgh), Roy Lee Ka Wei
(Singapore University of Technology and Design) , Patrick Juola (Duquesne
University), Jiří Milička (Charles University) and David Kaufer (Carnegie
Mellon University).
I believe this public event will be useful for faculty, students,
researchers, staff etc. Please feel free to forward this invitation to your
colleagues and students. The event is organized within the second HBKU
MIDDLE EAST CONFERENCE 2024 Gender, Technology, and Digital Cultures in the
Middle East
Registration link (Free Registration)
https://app.micetribe.com/public/workspaces/chss/events/1282251051/forms/vi…
<https://eur05.safelinks.protection.outlook.com/?url=https%3A%2F%2Fapp.micet…>
Full Program (Feb 18 and Feb 19)
https://www.hbku.edu.qa/en/mec2024/program
<https://eur05.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.hbku.…>
Best,
Wajdi Zaghouani
----
*Wajdi Zaghouani, Ph.D.*
*Associate Professor in Digital Humanities*
College of Humanities and Social Sciences
P.O. Box 34110 | Education City | Doha, Qatar
tel: +974 4454 5601 | mob: +974 33454992
wzaghouani(a)hbku.edu.qa| Office A141, LAS Building
================[Apologies for any cross-posting]================
**Special issue of the journal Traitement Automatique des Langues (TAL)
Abusive Language Detection : Linguistic Resources, Methods and
Applications **
**Guest Editors**
Farah Benamara (IRIT-Toulouse University, IPAL Singapore), Delphine
Battistelli (MoDyCo, Paris Nanterre University) and Viviana Patti (Turin
University)
**Motivations**
Abusive language - or, in another very common terminology, hate speech -
and the propagation of harmful stereotypes have unfortunately become
commonplace occurrences on various social media platforms, partly due to
users’ freedom and anonymity and the lack of regulation provided by
these platforms. The sheer volume and often implicit nature of such
unwanted content make manual moderation of these user spaces a
formidable task. Various scientific communities interested in its at
least partial automation have taken up the problem over the past ten
years. In particular, Computational Social Science, Natural Language
Processing and Computational Linguistics have proposed numerous works to
create resources, datasets, and models aimed at automating the task of
abusive language detection (henceforth ALD). In fact, we see that ALD
has become a research theme in its own right in the field of Natural
Language Processing with an abundant literature.
Abusive language (umbrella term to refer to the various forms of harmful
language, such as toxic, offensive language, hate speech, and
stereotypes) is topically focused and each specific manifestation of
abusive language targets different vulnerable groups based on
characteristics such as gender (misogyny, sexism), ethnicity, race,
religion (xenophobia, racism, Islamophobia), sexual orientation
(homophobia), and so on. Most automatic ALD approaches cast the problem
into a binary classification task but important considerations should be
taken into account, in particular: (1) the topical focus or the
target-oriented nature of hate speech ; (2) the degree of engagement of
users in abusive content (e.g., denunciation, approbation, reporting,
neutral attitude) ; (3) the question of stereotypes and dominant
ideologies ; (4) the question of linguistic strategies more particularly
linked or born with social networks (e.g., emoticons, hashtags).
Furthermore, most of the work (resources, classifiers) is developed for
English.
**Topics**
Motivated by the interest of the community in the problem of ALD, we
invite papers from Natural Language Processing, Machine Learning and
Computational Social Sciences. We explicitly encourage interdisciplinary
submissions (resources, computational methods, and user applications at
the interface of linguistics/psychology/socio-linguistics/sociology) but
also position papers on the actual state of the art in the field
discussing the limitations of the current approaches and directions for
future work. The topics covered by the special issue include, but are
not limited to:
-- Linguistic resources and evaluation: annotation schemes, corpus
linguistics studies, new datasets, with a particular interest in French
language and/or multilingual resources. In the case of strictly lexical
resources: methods for constituting them and coverage, semantic
categories retained.
-- Formal/Conceptual approaches for ALD as inspired by models in
sociology, socio-linguistics and psychology.
-- Models and Methods: supervised and unsupervised approaches, including
LLMs.
-- Role of contextual phenomena, including discourses, extra-linguistic
contexts (e.g., cultural aspects).
-- Models for cross-lingual and multimodal detection.
-- New approaches beyond binary classification: target-oriented ALD,
degrees of user engagement, etc.
-- Dynamics of online AL in social media, propaganda propagation.
-- Bias detection and removal in resource creation, datasets and methods.
-- Application of ALD tools in education, social media content
moderation, etc.
-- Social, legal, and ethical implications of detecting, monitoring and
moderating AL.
**Important dates**
May 31th, 2024: Submission deadline
July 15th, 2024: Notification of acceptance after first rereading
End of September 2024: Revised version
Mid October 2024: Final decision
End of November 2024: Camera ready
January 2025: Publication of the special issue
**Submission**
Submissions can either be in French or English and should follow the
journal templates: https://tal-65-3.sciencesconf.org/
**About the journal**
Traitement Automatiques des Langues Journal (TAL) is the international
French journal of Natural Language Processing
(https://www.atala.org/revuetal) published by ATALA (French Association
for Natural Language Processing, http://www.atala.org) since 1959 with
the support of CNRS (National Centre for Scientific Research). It is
indexed by ACL Anthology as well as DBLP. It is also supported by the
Institute of Human and Social Sciences of the CNRS.
**Contact**
For any question, please contact tal-65-3(a)sciencesconf.org
**External committee**
-- Cristina Bosco, University of Turin
-- Elena Cabrio, University of Côte d'Azur
-- Tommaso Caselli, Faculty of Arts, Rijksuniveristeit Groningen
-- Valentina Dragos, ONERA
-- Karën Fort, Sorbonne University
-- Claire Hugonnier, University of Grenoble Alpes
-- Irina Illina, University of Lorraine
-- Roy Ka-Wei Lee, Singapore University of Technology and Design
-- Véronique Moriceau, IRIT, University of Toulouse
-– Frédérique Segond, INRIA Paris
-- Mariona Taulé, University of Barcelona
-- Samuel Vernet, Aix-Marseille University
-- Mathieu Valette, Paris Sorbonne Nouvelle University
-- Marcos Zampieri, George Mason University
--
========================
Farah Benamara Zitoune
Professor in Computer Science, Université Paul Sabatier
IRIT-CNRS
118 Route de Narbonne, 31062, Toulouse.
Tel : +33 5 61 55 77 06
http://www.irit.fr/~Farah.Benamara
==================================
Call for papers:
12th International Conference and Workshops on Technology and Chinese
Language Teaching (TCLT12), June 22 to 23, 202, University of California,
Los Angeles.
Conference will be held online.
More information about the conference can be found from the conference
website.:
http://www.tclt.us/tclt12/cfp.php
Dear colleagues,
We are happy to announce that The Third Workshop on NLP Applications to Field Linguistics (Field Matters 2024) will take place at ACL 2024 [https://2024.aclweb.org/] in Bangkok, Thailand on August 16. Please find call for papers bellow:
=== The Third Workshop on NLP Applications to Field Linguistics ===
Field linguistics plays a crucial role in the development of linguistic theory and universal language modeling, as it provides uncontested, the only way to obtain structural data about the rapidly diminishing diversity of natural languages.
The Field matters workshop aims to bring together the urgent needs of field linguists and the vast community of NLP practitioners, developing up-to-date NLP tools for easier, faster, more reliable data collection and annotation.
This year we are adding a special track dedicated to the indigenous languages of Thailand and South-East Asia. We encourage you to submit theses on this topic, although general submissions are also welcomed.
We are particularly interested in the following topics:
- Application of NLP to field linguistics workflow;
- The impact, benefits and harms of NLP-assisted fieldwork;
- Transfer learning for under-resourced language processing;
- The use of fieldwork data to build NLP systems;
- Modeling morphology and syntax of typologically diverse languages in the low-resource setting;
- Speech processing for under-resourced languages;
- Machine-readable field linguistic datasets and computational analysis of field linguistics datasets;
- Using technology to preserve culture via language;
- Improving ways of interaction with Indigenous communities;
- Special track: Indigenous languages of Thailand and South-East Asia.
We accept three types of papers:
- non-archival submissions: abstracts (2-page) or papers (up to 8-page) that can present already published work or work in progress (note that we accept non-archival submissions even after the main deadline);
- short archival submissions: 4-page papers that present new work;
- long archival submissions: 8-page papers that present new work.
The special track submissions can be either long or short and either archival or non-archival.
We offer the following ways of presenting the papers:
- the main section;
- poster sections.
The way a paper will be presented will be determined during the review process.
All submissions should be anonymized. We are subjected to the ACL Anonymity Policy. ACL changed its policy for review and citation, and no anonymity period will be required.
Dual submissions with the main conference are allowed, but authors must declare dual submission by entering the paper’s main conference submission id. The reviews for the submission for the main conference will be automatically forwarded to the workshop and taken into consideration when your paper is evaluated. Authors of dual-submission papers accepted to the main conference should retract them from the workshop by.
Papers posted to preprint servers such as arxiv can be submitted without any restrictions on when they were posted.
The workshop will run its own review process, and papers can be submitted directly to the workshop via OpenReview (https://openreview.net/group?id=aclweb.org/ACL/2024/Workshop/Field_Matters).
The workshop will take place at ACL 2024 (https://2024.aclweb.org/). Both papers and abstracts must follow the ACL 2024 format (https://github.com/acl-org/acl-style-files). Please do not modify these style files.
Workshop webpage: https://field-matters.github.io/2024
Contact email: fieldmattersworkshop(a)gmail.com
Important dates:
- Paper submission deadline: May 17 (Friday), 2024
- Notification of acceptance: June 17 (Monday), 2024
- Camera-ready paper due: July 1 (Monday), 2024
- Workshop dates: August 15–16, 2024
Organizers
- Oleg Serikov (KAUST, HSE University)
- Elena Klyachko (HSE University)
- Francis Tyers (Indiana University)
- Ekaterina Vylomova (University of Melbourne)
- Éric Le Ferrand (Boston College)
- Saliha Muradoğlu (The Australian National University (ANU))
- Ekaterina Voloshina (Independent Researcher)
- Anna Postnikova (Independent Researcher)
---------------
Best regards,
Anna Postnikova
Field Matters organizing committee
Applications are invited for a Postdoctoral Researcher position within the
project “Polyglot Machines: Human-like Learning of Morphologically Rich
Languages”, financed by a NWO-VIDI Talent Grant and coordinated by
Principal Investigator (PI) dr. Arianna Bisazza. This is an
interdisciplinary project at the intersection of Computational
Linguistics/Natural Language Processing (NLP), Computational
Psycholinguistics and Language Acquisition.
Despite the impressive advances made possible by neural networks, current
NLP systems are still far from displaying the learning abilities of humans
in many languages. By contrast, children around the world acquire extremely
diverse languages in comparable time spans and from considerably less
linguistic input than that required by neural models.
This project aims to improve language modeling for low-resource
morphologically rich languages, taking inspiration from child language
acquisition insights. Among other methodologies, an artificial language
learning paradigm will be used to simulate the learning of typologically
diverse languages and evaluate the effect of known child-directed language
properties on the acquisition of morphology and other language aspects.
You will be carrying out your research in the context of the Computational
Linguistics group, which is part of the Centre for Language and Cognition
of the University of Groningen, The Netherlands.
An important part of your work will be conducted together with the PI and
the PhD student that will be hired for the same project. Collaboration is
also possible with other PhD students supervised by the PI, as well as
other members of CLCG.
Main requirement: A PhD degree in any area related to the tasks (such as
Computational Linguistics, Computational Psycholinguistics and Language
Acquisition).
Find more details and apply here by 15 February 2024:
https://www.rug.nl/about-ug/work-with-us/job-opportunities/?details=00347-0…
Starting date: Negotiable. Ideally 1 September 2024. The appointment will
be for a specified period of 1 year, renewable for up to 2 more years (so
up to 3 years in total) following positive evaluation.
For questions about the position: A. Bisazza (do not use email for
applications)
a.bisazza(a)rug.nl
--
Arianna Bisazza
Associate Professor
University of Groningen
http://www.cs.rug.nl/~bisazza
*The 8th International Workshop on Cognitive Aspects of the Lexicon*
https://sites.google.com/view/cogalex-viii-2024
*Co-located with LREC-COLING 2024*
https://lrec-coling-2024.org/
Turin, Italy, 20-25 May 2024
* Important Dates*
Submission deadline: February 23, 2024
Date of notification: March 20, 2024
Camera ready deadline: March 29, 2024
CogALex workshop: May 20, 2024
*Meeting Description*
The way we look at the lexicon has changed dramatically over the last few
decades. While in the past being considered as an appendix to grammar, the
lexicon has now moved to the center stage. Indeed, there is hardly any task
in NLP that can be conducted without it. Also, many new proposals have
emerged during the last few years. Living in a fast-moving world, it is
hard for anyone to stay on top of the wave. Hence the reason for organizing
events like this.
The goal of this workshop is to provide builders and users of lexical
resources (researchers in NLP, psychologists, computational lexicographers)
a forum to share their knowledge and needs concerning the construction,
organization, and use of a lexicon by people (lexical access) and machines
(NLP, IR, data mining).
Like in the past, we invite researchers to address unsolved problems
concerning the lexicon, by considering this time however also Large
Language Models (LLMs). More precisely, we would like to explore their
potential for building and using lexical resources as well as their ability
to shed new light on the cognitive aspects of the lexicon.
We solicit contributions including, but not limited to, the topics listed
below, topics, which can be considered from any of the following points of
view:
- traditional-, computational- or corpus linguistics,
- neuro- or psycholinguistics (tip of the tongue problem, word
associations),
- mathematics (embedding-based approaches, graph theory, small-world
problems), etc.
*Submissions*
Possible submission topics are:
- The potential of Large Language Models for the creation and use of
lexical resources;
- Organization, i.e., structure of the lexicon;
- The meaning of words and how to reveal it;
- Analysis of the conceptual input given by a dictionary user;
- Methods for crafting dictionaries or indexes;
- Creation of new types of dictionaries;
- Dictionary access (navigation and search strategies), interface issues
Short papers can be up to 4 pages in length and long papers up to 8 pages.
Both submission formats can have an unlimited number of pages for
references. To create your document, please follow the guidelines defined
by COLING using their style sheets (
https://lrec-coling-2024.org/authors-kit/).
The submissions must be anonymous and they will be peer-reviewed by our
program committee. The peer review is double blinded.
Accepted papers will also be given an additional page to address the
reviewers’ comments. Notice that at least one of the authors of an accepted
paper must register for the main conference and present the paper.
*Submission Page*
https://softconf.com/lrec-coling2024/cogalex2024/
*Dual Submission Policy*
Papers may not be submitted to the workshop if they are or will be
concurrently submitted to another meeting or publication.
*Invited Speaker*
Prof. Gilles-Maurice de Schryver (Ghent University, Belgium)
https://tshwanedje.com/members/gmds/cv.html
*Workshop organizers*
- Michael Zock (CNRS, LIS, Aix-Marseille University, Marseille, France)
- Emmanuele Chersoni (Department of Chinese and Bilingual Studies, The Hong
Kong Polytechnic University, Hong Kong, China)
- Yu-Yin Hsu (Department of Chinese and Bilingual Studies, The Hong Kong
Polytechnic University, Hong Kong, China)
- Simon de Deyne (University of Melbourne / School of Psychological
Sciences, University of Adelaide)