==============================================================
Call for Participation
3nd Cardiff NLP Workshop, 1-2 July 2024
==============================================================
We are organising the 3nd Cardiff NLP Summer Workshop, an in-person workshop on Natural Language Processing. It will take place from July 1st to July 2nd 2024 in the Abacws building in Cardiff (Wales, UK).
The workshop is especially designed for PhD students and early career researchers, and the registration is free for everyone. Please fill the following expression of interest form by April 8th if interested in joining the workshop: https://docs.google.com/forms/d/e/1FAIpQLSc73QR6I6bqdiFlC-0DIC4TOTjCHdcsXmb…
Workshop Activities:
* Invited speakers from academia and industry
* Tutorials
* Poster session and networking
* Panel NLP research and large language models in academia and industry
Important Dates:
* Application Period: 19 February - 8 April 2024
* Notification of Acceptance: Late April 2024
* Workshop: 1-2 July 2024 in Cardiff
For more details, please check the workshop website: https://www.cardiffnlpworkshop.org/
Cardiff NLP Organisation team
--
Jose Camacho Collados
http://www.josecamachocollados.com<http://www.josecamachocollados.com/>
The deadline is extended to March 14th!
***************
Semantic Methods for Events and Stories, 2nd Edition (SEMMES 2024) – Call for Papers
***************
Website: https://anr-kflow.github.io/semmes/
Workshop co-located with the Extended Semantic Web Conference (ESWC) in Hersonissos, Greece
Submission deadline: March 7th, 2024 => March 14th, 2024
Scope
***************
An important part of human history and knowledge is made of events, which can be aggregated and connected to create stories, be they real or fictional. These events as well as the stories created from them can typically be inherently complex, reflect societal or political stances and be perceived differently across the world population. The Semantic Web offers technologies and methods to represent these events and stories, as well as to interpret the knowledge encoded into graphs and use it for different applications, spanning from narrative understanding and generation to fact-checking.
The aim of the 2nd edition of our workshop on Semantic Methods for Events and Stories (SEMMES) is to offer an opportunity to discuss the challenges related to dealing with events and stories, and how we can use semantic methods to tackle them. We welcome approaches which combine data, methods and technologies coming from the Semantic Web with methods from other fields, including machine learning, narratology or information extraction. This workshop wants to bring together researchers working on complementary topics, in order to foster collaboration and sharing of expertise in the context of events and stories.
Topics
***************
Topics of interest include, but are not limited to:
- Ontologies and data models for representing events, event relations, and narratives;
- Event extraction, co-reference and linking;
- Event Relation extraction and linking (e.g. temporal, causal, modal relationships);
- Methods combining KGs and LLMs targeting event- or narrative-related research;
- Fake events detection and event verification;
- Event-centric question answering;
- Event information visualisation;
- Event-centric knowledge graphs and vocabularies;
- Completion of event-centric knowledge graphs and reasoning;
- Event summarisation;
- Automatic narrative understanding and generation;
- Storytelling Applications/Demos.
Submission Guidelines
***************
We welcome the following types of contributions.
- Long papers (10-15 pages including references)
- Short papers (5-9 pages including references)
We welcome any types of research, resource and application papers, as well as (short only) demonstration submissions.
Submissions must be written in English and formatted using the template for submissions to CEUR Workshop Proceedings (https://www.overleaf.com/latex/templates/template-for-submissions-to-ceur-w…)
All papers and abstracts have to be submitted electronically via EasyChair: https://easychair.org/conferences/?conf=semmes2024.
Each accepted paper needs to be presented by one of the authors, who agrees to register and participate in SEMMES.
Authors may be requested to serve as reviewers for max 2 papers.
Important Dates
***************
- Submission deadline: March 7th, 2024 => March 14th, 2024
- Notifications: April 4th, 2024
- Camera-ready version: April 18th, 2024
- Workshop day: May 26th or 27th, 2024 (half-day, TBA)
All deadlines are 23:59 anywhere on earth (UTC-12).
Proceedings
***************
The complete set of papers will be published with the joint CEUR ESWC Workshop Proceedings (http://CEUR-WS.org), listed by the DBLP.
--
Pasquale Lisena
EURECOM, Campus SophiaTech
450 route des Chappes, 06410 Biot, France
e-mail: pasquale.lisena(a)eurecom.fr
site: http://pasqlisena.github.io/
Dear colleagues,
We have a couple of updates regarding the ongoing GEM shared task
<https://gem-benchmark.com/shared_task>:
*Event*: We are delighted to announce that the GEM shared task is endorsed
by SIGGEN <https://siggen-acl.github.io/index.html>, and will be part of
the Generation Challenges (GenChal) at INLG’24. Participants will have the
possibility to (i) publish a system description in the GenChal proceedings
(available on the ACL anthology, see GenChal’23 Proceedings
<https://aclanthology.org/volumes/2023.inlg-genchal/>), and (ii) present
their results during the GenChal session at the INLG conference in Tokyo in
September 2024.
*Pre-registration*: The deadline for pre-registering your system
submissions is approaching (March 8th 23.59 AoE)! Note that it will be
possible for participants to pre-register after March 8th, but that doing
so does not guarantee a participation in the human evaluation. Pre-registration
link <https://nyustern.az1.qualtrics.com/jfe/form/SV_8qRqfdN3qBy3Bqe>.
*Important dates*
March 8: Deadline for pre-registering systems (ensuring human evaluation in
languages selected by the organisers).
April 5: Deadline for output submission (all subtasks).
April 6: Human evaluation starts.
TBD: System descriptions due.
Late September: GenChal@INLG’24.
best,
simon, on behalf of the GEM Human Evaluation Team
*ADAPT Research Centre / Ionaid Taighde ADAPT*
*School of Computing, Dublin City University, Glasnevin Campus
/ Scoil na Ríomhaireachta,
Campas Ghlas Naíon, Ollscoil Chathair Bhaile Átha Cliath*
* We apologize if you receive multiple copies of this CFP *
For the online version of this Call, visit:
https://nldb2024.di.unito.it/submissions/
===============
*SUBMISSIONS ARE OPEN AT* https://easychair.org/conferences/?conf=nldb2024
===============
NLDB 2024
The 29th International Conference on Natural Language & Information Systems
25-27 June 2024, University of Turin, Italy.
Website: https://nldb2024.di.unito.it/
*Submission deadline: 22 March, 2024*
About NLDB
The 29th International Conference on Natural Language & Information
Systems will be held at the University of Turin, Italy, and will be a
face to face event. Since 1995, the NLDB conference brings together
researchers, industry practitioners, and potential users interested in
various applications of Natural Language in the Database and Information
Systems field. The term "Information Systems" has to be considered in
the broader sense of Information and Communication Systems, including
Big Data, Linked Data and Social Networks.
The field of Natural Language Processing (NLP) has itself recently
experienced several exciting developments. In research, these
developments have been reflected in the emergence of Large Language
Modelsand the importance of aspects such as transparency, bias and
fairness, Large Multimodal Models and the connection of the NLP field
with Computer Vision, chatbots and dialogue-based pipelines.
Regarding applications, NLP systems have evolved to the point that they
now offer real-life, tangible benefits to enterprises. Many of these NLP
systems are now considered a de-facto offering in business intelligence
suites, such as algorithms for recommender systems and opinion
mining/sentiment analysis. Language models developed by the open-source
community have become widespread and commonly used. Businesses are now
readily adopting these technologies, thanks to the efforts of the
open-source community. For example, fine-tuning a language model on a
company’s own dataset is now easy and convenient, using modules created
by thousands of academic researchers and industry experts.
It is against this backdrop of recent innovations in NLP and its
applications in information systems that the 29th edition of the NLDB
conference takes place. We welcome research and industrial
contributions, describing novel, previously unpublished works on NLP and
its applications across a plethora of topics as described in the Call
for Papers.
Call for Papers:
NLDB 2024 invites authors to submit papers on unpublished research that
addresses theoretical aspects, algorithms, applications, architectures
for applied and integrated NLP, resources for applied NLP, and other
aspects of NLP, as well as survey and discussion papers. This year's
edition of NLDB continues with the Industry Track to foster fruitful
interaction between the industry and the research community.
Topics of interest include but are not limited to:
* Large Language Models: training, applications, transfer learning,
interpretability of large language models.
* Multimodal Models: Integration of text with other modalities like
images, video, and audio; multimodal representation learning;
applications of multimodal models.
* AI Safety and ethics: Safe and ethical use of Generative AI and NLP;
avoiding and mitigating biases in NLP models and systems; explainability
and transparency in AI.
* Natural Language Interfaces and Interaction: design and implementation
of Natural Language Interfaces, user studies with human participants on
Conversational User Interfaces, chatbots and LLM-based chatbots and
their interaction with users.
* Social Media and Web Analytics: Opinion mining/sentiment analysis,
irony/sarcasm detection; detection of fake reviews and deceptive
language; detection of harmful information: fake news and hate speech;
sexism and misogyny; detection of mental health disorders;
identification of stereotypes and social biases; robust NLP methods for
sparse, ill-formed texts; recommendation systems.
* Deep Learning and eXplainable Artificial Intelligence (XAI): Deep
learning architectures, word embeddings, transparency, interpretability,
fairness, debiasing, ethics.
* Argumentation Mining and Applications: Automatic detection of
argumentation components and relationships; creation of resource (e.g.
annotated corpora, treebanks and parsers); Integration of NLP techniques
with formal, abstract argumentation structures; Argumentation Mining
from legal texts and scientific articles.
* Question Answering (QA): Natural language interfaces to databases, QA
using web data, multi-lingual QA, non-factoid QA(how/why/opinion
questions, lists), geographical QA, QA corpora and training sets, QA
over linked data (QALD).
* Corpus Analysis: multi-lingual, multi-cultural and multi-modal
corpora; machine translation, text analysis, text classification and
clustering; language identification; plagiarism detection; information
extraction: named entity, extraction of events, terms and semantic
relationships.
* Semantic Web, Open Linked Data, and Ontologies: Ontology learning and
alignment, ontology population, ontology evaluation, querying ontologies
and linked data, semantic tagging and classification, ontology-driven
NLP, ontology-driven systems integration.
* Natural Language in Conceptual Modelling: Analysis of natural language
descriptions, NLP in requirement engineering, terminological ontologies,
consistency checking, metadata creation and harvesting.
* Natural Language and Ubiquitous Computing: Pervasive computing,
embedded, robotic and mobile applications; conversational agents; NLP
techniques for Internet of Things (IoT); NLP techniques for ambient
intelligence
* Big Data and Business Intelligence: Identity detection, semantic data
cleaning, summarisation, reporting, and data to text.
Important Dates:
*Full paper submission: 22 March, 2024 *
Paper notification: 19 April, 2024
Camera-ready deadline: 26 April, 2024
Conference: 25-27 June 2024
Submission Guidelines:
Authors should follow the LNCS format
(https://www.springer.com/gp/computer-science/lncs/conference-proceedings-gu…)
and submit their manuscripts in pdf via Easychair
(https://easychair.org/conferences/?conf=nldb2024)
Papers can be submitted to either the main conference or the industry track.
Submissions can be full papers (up to 15 pages including references and
appendices), short papers (up to 11 pages including references and
appendices) or papers for a poster presentation or system demonstration
(6 pages including references). The program committee may decide to
accept some full papers as short papers or poster papers.
All questions about submissions should be emailed to
federico.torrielli(a)unito.it (Web & Publicity Chair)
General Chairs:
Luigi Di Caro, University of Turin
Farid Meziane, University of Derby
Amon Rapp, University of Turin
Vijayan Sugumaran, Oakland University
Dear Colleagues,
You are invited to participate in a short ~5 minute survey on sound
change. We are an interdisciplinary team of researchers interested in
understanding the current views of prominent scholars on the process of
sound change. To better understand views that may be commonly held but
rarely written down, we have created a short ~5 minute survey aimed at
targeting several long-standing debates in our field. Should you choose
to provide additional explanation in optional text boxes, the survey may
take a few minutes longer.
Your responses will help develop an understanding of how common
different positions are regarding these major debates, how much
disagreement exists, and what factors predict respondents’ positions on
specific issues.
To reach a wide range of scholars, thus covering the diverse opinions
across our field, we are forwarding an invitation to participate in our
research through this server list so that anyone who wishes to can
decide to participate. The survey contains both fixed responses and open
responses that allow you to express your views in more detail if you
wish. Should you decide to participate, as a thank you for your time,
you can go in the draw to win one of 10 $50 Amazon gift vouchers.
You are welcome to request a summary of our findings by emailing
q.atkinson(a)auckland.ac.nz.
To participate, follow this link to the survey:
https://auckland.au1.qualtrics.com/jfe/form/SV_e2jzEXLrG6zk3qu
Please feel free to forward this email on to colleagues who may also be
interested in participating.
Thanks! If you have any questions or issues with the survey, please
contact our staff at q.atkinson(a)auckland.ac.nz.
Best wishes to all,
Quentin Atkinson
Remco Bouckaert
Jordan Douglas
Russell Gray
Mattis List
Mary Walworth
Approved by the University of Auckland Human Participants Ethics
Committee on 10/10/2023 for three years. Reference Number: UAHPEC26714.
*Apologies for crossposting*
TermTrends24: Models and Best Practices for Terminology Representation in
the Semantic Web
Workshop colocated with MDTT 2024 <https://mdtt2024.dei.unipd.it/en/>
Date: 26th June, 2024
Venue: Granada, Spain
More info: https://termtrends.linkeddata.es/
*Submission: 15th March*
*About TermTrends*TermTrends 2024, co-located with MDTT 2024 aims to
provide a discussion forum on the theoretical and methodological approaches
for the representation of terminological data, both at a conceptual and a
linguistic level. In particular, we would like to focus on their connection
to the Linguistic Linked (Open) Data (LLOD) paradigm through the
representation of these data according to Semantic Web formats. By adopting
models or vocabularies proposed for the representation of linguistic data,
we would contribute to the creation of interoperable and reusable
terminological resources.
With this objective, the workshop intends to explore the advantages and
challenges underlying various Terminology-related standardisation
approaches, ranging from the initially proposed standards to represent
terminology within the International Standardisation Organisation (ISO),
such as the TermBase eXchange (TBX) format, to models that represent
linguistic descriptions associated with ontologies in the Semantic Web,
such as SKOS and Ontolex-lemon.
Being multidisciplinary in scope, it focuses on identifying terminological
representation needs, as well as limitations of current models in
addressing such needs, with the aim of also exploring the development of an
extension of the Ontolex-lemon vocabulary and how that may contribute to
overcoming such challenges.
*Call for Papers*The topics of interest for this workshop include, but are
not limited to, the following topics:
- Terminology Representation Standards
- Terminology as Linguistic Linked (Open) Data
- Interoperability of Terminological Resources
- Reusability of Terminological Resources
- Challenges in Terminology Representation
- Analysis of the structure of Terminological Resources
*Submissions*
Papers proposals should follow the CEUR template. Short and long papers
will be accepted. Following CEUR guidelines, short papers should be 5-6
pages long and long papers 8-10 pages long. Authors must submit their
papers through the EasyChair platform following this link.
*Important Dates15 March 2024* - Deadline for paper submission
*20 April 2024* - Deadline for notification for paper submission
*15 May 2024* - Deadline for camera-ready paper submission
*26 June 2024 *- TermTrends Workshop
*Workshop Organisers*
Rute Costa, NOVA FCSH / NOVA CLUNL (Portugal)
Elena Montiel-Ponsoda, Universidad Politécnica de Madrid (Spain)
Sara Carvalho, Univ. de Aveiro / NOVA CLUNL (Portugal)
Patricia Martín-Chozas, Universidad Politécnica de Madrid (Spain)
Federica Vezzani, University of Padova (Italy)
*Patricia Martín Chozas - Postdoctoral Researcher*
* Ontology Engineering Group*
Artificial Intelligence Department
ETSI Informáticos - Universidad Politécnica de Madrid
Phone: (+34) 910673091
Dear Colleagues,
The Lattice Lab in Montrouge (Ecole normale supérieure-PSL & CNRS) is recruiting a Postdoc / Research Engineer in Computational Social Sciences, for 18 months beginning in April 2024 or soon thereafter. See here for details:
https://euraxess.ec.europa.eu/jobs/200537
We are looking for a strong candidate with relevant first publications in the domain, and with a good knowledge of natural language techniques / LLMs (and/or willing to develop new NLP techniques, of course).
The post is related to the ANR Medialex project (https://anr.fr/Projet-ANR-21-CE38-0016), on the mutual influence between the medias (including social medias) and the political sphere (esp. debates at the Parliament). Some command of French is necessary, but it does not need to be your main language.
To apply, please send me an email with a few words about your interest for the job, a detailed CV, one relevant publication and the name of two referees.
All the best,
Thierry
Applications are invited for a 4-year salaried PhD position within the
research project “Polyglot Machines: Human-like Learning of Morphologically
Rich Languages”, financed by a NWO-VIDI Talent Grant and coordinated by
Principal Investigator (PI) dr. Arianna Bisazza. This is an
interdisciplinary project at the intersection of Computational
Linguistics/Natural Language Processing (NLP), Computational
Psycholinguistics and Language Acquisition.
Despite the impressive advances made possible by neural networks, current
NLP systems are still far from displaying the learning abilities of humans
in many languages. This project aims to improve language modeling for
low-resource morphologically rich languages, taking inspiration from child
language acquisition insights.
Among other methodologies, an artificial language learning paradigm will be
used to simulate the learning of typologically diverse languages and
evaluate the effect of known properties of child-directed language on the
acquisition of morphology and other language aspects.
Other possible research directions include: the design of better input
segmentation methods; language acquisition inspired curriculum learning;
and leveraging existing language resources (like dictionaries or
morphological analyzers) to boost the learning process in very low-resource
settings.
This PhD position offers a unique opportunity to acquire valuable research
experience in an international environment: You will be part of the
Computational
Linguistics group <https://www.rug.nl/research/clcg/research/cl/?lang=en> (@
GroNLP <https://twitter.com/GroNlp>), which is part of the Centre for
Language and Cognition of the University of Groningen (CLCG).
Main requirement: A Master’s degree in computational linguistics,
artificial intelligence, computer science, information science, or related
area.
Find more details and apply here by 11 March 2024:
https://www.rug.nl/about-ug/work-with-us/job-opportunities/?details=00347-0…
Starting date: September 2024
For questions about the position: A. Bisazza a.bisazza(a)rug.nl (do not use
email for applications)
--
Arianna Bisazza
Associate Professor
University of Groningen
http://www.cs.rug.nl/~bisazza
We are pleased to invite abstract submissions to the *2nd Workshop on Eye
Movements and the Assessment of Reading Comprehension* scheduled to take
place on June 20–22 in Zürich, Switzerland.
1. Workshop theme:
Effective and widely available reading assessments are fundamental for
education, and instrumental for early diagnosis of reading difficulties,
enabling timely and targeted intervention. In this workshop, we explore how
eye-tracking and machine learning technologies can enhance reading
assessments. Our goal is to bring together researchers from various
relevant fields, including educational science, cognitive psychology and
psycholinguistics, eye-tracking-based reading research, and machine
learning. The workshop will provide a platform for exchanging ideas for the
next generation of reading assessments aided by eye-tracking and machine
learning technologies, as well as inspiring cross-disciplinary research
collaborations.
2. Workshop format:
The formal part of the workshop will span two days, featuring a structured
program with talks, posters, and discussions. An optional third day is
dedicated to more casual exchanges, with the opportunity to engage in open
conversations during a leisurely hike or a picnic.
3. Relevant topics for submission:
- Methodology and design of reading assessments, including large scale
reading assessments (LSAs)
- Reading assessments for different populations (e.g. children, adults,
elderly, populations with cognitive impairments)
- Reading development
- Cognitive processes underlying reading comprehension
- Eye movements as indicators of reading comprehension
- Machine learning approaches to reading assessment
4. Submissions:
- We invite submissions of short abstracts of up to 350 words.
- To submit your abstract, please fill in this abstract submission form:
https://forms.gle/RKRLhJsYKgc3pNQy8
- Submission format is plain text (optionally markdown).
- Submissions will be reviewed by the workshop organizers, primarily
with an eye to relevance for the workshop's theme.
- We expect to accept 25–35 submissions.
5. Important Dates:
- Paper submission deadline: April 1st, 2024
- Acceptance notification: April 19th, 2024
- Workshop date: June 20–22, 2024
6. Keynote speakers
1. Jean-François Rouet (Université de Poitiers)
2. t.b.d.
7. Funding:
The workshop is sponsored by the MultiplEYE COST Action (
https://multipleye.eu), which will provide financial support for covering
travel expenses to a limited number of participants. Authors will be
invited to apply for travel funding upon abstract acceptance. Funding may
be partial and priority will be given to junior researchers.
We look forward to your submissions and active participation.
Best regards,
The workshop organizers:
Lena Jäger (University of Zurich)
Yevgeni Berzak (Technion - Israel Institute of Technology)
Titus von der Malsburg (University of Stuttgart)
Workshop homepage:
https://multipleye.eu/workshop-on-eye-movements-and-assessment-of-reading-c…
Good day,
This is to announce the expansion of the collection of open Large Language Models (LLMs)
for the Portuguese language with the following models:
- the family of *encoders* is enlarged with the new *_Albertina 1.5B_
*https://huggingface.co/PORTULAN/albertina-1b5-portuguese-ptpt-encoder
- the family of *decoders* has now _*Gervásio 7B*_
https://huggingface.co/PORTULAN/gervasio-7b-portuguese-ptpt-decoder
This ecosystem encompasses now over ten LLMs that were specifically developed for
the Portuguese language, covering both its European variant, spoken in Portugal (PTPT),
and its American variant, spoken in Brazil (PTBR), and that can be run
on consumer-grade hardware.
The Albertina family includes encoders with *100M*, *900M* and *1.5B* parameters.
The Gervásio family, in turn, integrates a decoder with *7B* parameters.
All these models are *fully open*, being open source and openly distributed,
for free and with no registration required, under an open license, including
for research and commercial purposes.
They are also *fully documented*, thus including reports also on their evaluation scores,
which indicate they are top performing solutions for fully open models of their class
for Portuguese.
These models, their companion datasets and their documentation, for both PTPT and PTBR,
can all be found at https://huggingface.co/PORTULAN
Regards,
António Branco
University of Lisbon
NLX Natural Language and Speech Group
Faculdade de Ciências, Departamento de Informática