* We apologize if you receive multiple copies of this CFP *
For the online version of this Call, visit:
https://nldb2024.di.unito.it/submissions/
===============
SUBMISSIONS ARE OPEN AT https://easychair.org/conferences/?conf=nldb2024
===============
NLDB 2024
The 29th International Conference on Natural Language & Information Systems
25-27 June 2024, University of Turin, Italy.
Website: https://nldb2024.di.unito.it/
*Submission deadline: *22 March*5 April, 2024 (Extended)*
About NLDB
The 29th International Conference on Natural Language & Information
Systems will be held at the University of Turin, Italy, and will be a
face to face event. Since 1995, the NLDB conference brings together
researchers, industry practitioners, and potential users interested in
various applications of Natural Language in the Database and Information
Systems field. The term "Information Systems" has to be considered in
the broader sense of Information and Communication Systems, including
Big Data, Linked Data and Social Networks.
The field of Natural Language Processing (NLP) has itself recently
experienced several exciting developments. In research, these
developments have been reflected in the emergence of Large Language
Models and the importance of aspects such as transparency, bias and
fairness, Large Multimodal Models and the connection of the NLP field
with Computer Vision, chatbots and dialogue-based pipelines.
Regarding applications, NLP systems have evolved to the point that they
now offer real-life, tangible benefits to enterprises. Many of these NLP
systems are now considered a de-facto offering in business intelligence
suites, such as algorithms for recommender systems and opinion
mining/sentiment analysis. Language models developed by the open-source
community have become widespread and commonly used. Businesses are now
readily adopting these technologies, thanks to the efforts of the
open-source community. For example, fine-tuning a language model on a
company's own dataset is now easy and convenient, using modules created
by thousands of academic researchers and industry experts.
It is against this backdrop of recent innovations in NLP and its
applications in information systems that the 29th edition of the NLDB
conference takes place. We welcome research and industrial
contributions, describing novel, previously unpublished works on NLP and
its applications across a plethora of topics as described in the Call
for Papers.
Call for Papers:
NLDB 2024 invites authors to submit papers on unpublished research that
addresses theoretical aspects, algorithms, applications, architectures
for applied and integrated NLP, resources for applied NLP, and other
aspects of NLP, as well as survey and discussion papers. This year's
edition of NLDB continues with the Industry Track to foster fruitful
interaction between the industry and the research community.
Topics of interest include but are not limited to:
* Large Language Models: training, applications, transfer learning,
interpretability of large language models.
* Multimodal Models: Integration of text with other modalities like
images, video, and audio; multimodal representation learning;
applications of multimodal models.
* AI Safety and ethics: Safe and ethical use of Generative AI and NLP;
avoiding and mitigating biases in NLP models and systems; explainability
and transparency in AI.
* Natural Language Interfaces and Interaction: design and implementation
of Natural Language Interfaces, user studies with human participants on
Conversational User Interfaces, chatbots and LLM-based chatbots and
their interaction with users.
* Social Media and Web Analytics: Opinion mining/sentiment analysis,
irony/sarcasm detection; detection of fake reviews and deceptive
language; detection of harmful information: fake news and hate speech;
sexism and misogyny; detection of mental health disorders;
identification of stereotypes and social biases; robust NLP methods for
sparse, ill-formed texts; recommendation systems.
* Deep Learning and eXplainable Artificial Intelligence (XAI): Deep
learning architectures, word embeddings, transparency, interpretability,
fairness, debiasing, ethics.
* Argumentation Mining and Applications: Automatic detection of
argumentation components and relationships; creation of resource (e.g.
annotated corpora, treebanks and parsers); Integration of NLP techniques
with formal, abstract argumentation structures; Argumentation Mining
from legal texts and scientific articles.
* Question Answering (QA): Natural language interfaces to databases, QA
using web data, multi-lingual QA, non-factoid QA(how/why/opinion
questions, lists), geographical QA, QA corpora and training sets, QA
over linked data (QALD).
* Corpus Analysis: multi-lingual, multi-cultural and multi-modal
corpora; machine translation, text analysis, text classification and
clustering; language identification; plagiarism detection; information
extraction: named entity, extraction of events, terms and semantic
relationships.
* Semantic Web, Open Linked Data, and Ontologies: Ontology learning and
alignment, ontology population, ontology evaluation, querying ontologies
and linked data, semantic tagging and classification, ontology-driven
NLP, ontology-driven systems integration.
* Natural Language in Conceptual Modelling: Analysis of natural language
descriptions, NLP in requirement engineering, terminological ontologies,
consistency checking, metadata creation and harvesting.
* Natural Language and Ubiquitous Computing: Pervasive computing,
embedded, robotic and mobile applications; conversational agents; NLP
techniques for Internet of Things (IoT); NLP techniques for ambient
intelligence
* Big Data and Business Intelligence: Identity detection, semantic data
cleaning, summarisation, reporting, and data to text.
*Student Registration*:
We are committed to fostering the participation of young researchers and
students in the NLDB 2024 conference. To accommodate as many young minds
as possible, we have reduced the student registration fees. We believe
that this will provide an excellent opportunity for students to engage
with the latest research and industrial applications of Natural Language
Processing across information systems.
Important Dates:
*Full paper submission: *22 March*5 April, 2024 (Extended)*
Paper notification: 3 May, 2024
Camera-ready deadline: 10 May, 2024
Conference: 25-27 June 2024
Submission Guidelines:
Authors should follow the LNCS format
(https://www.springer.com/gp/computer-science/lncs/conference-proceedings-gu…)
and submit their manuscripts in PDF via Easychair
(https://easychair.org/conferences/?conf=nldb2024)
Papers can be submitted to either the main conference or the industry track.
Submissions can be full papers (up to 15 pages including references and
appendices), short papers (up to 11 pages including references and
appendices) or papers for a poster presentation or system demonstration
(6 pages including references). The program committee may decide to
accept some full papers as short papers or poster papers.
All questions about submissions should be emailed to
federico.torrielli(a)unito.it (Web & Publicity Chair)
General Chairs:
Luigi Di Caro, University of Turin
Farid Meziane, University of Derby
Amon Rapp, University of Turin
Vijayan Sugumaran, Oakland University
Dear Colleagues,
We are excited to announce that the upcoming FinNLP event will be held in
conjunction with IJCAI-2024 in Jeju, South Korea, from August 3rd to 9th.
This year, we host a Joint Workshop of the 8th Financial Technology and
Natural Language Processing (FinNLP) and the 1st Agent AI for Scenario
Planning (AgentScen).
For our main track, we are inviting submissions for both long and short
papers, as well as demonstrations. The submission deadline is April 26th,
2024.
Additionally, this year, we are featuring three shared tasks:
1. Multiple Question Generation from Presentation Transcripts (MQG),
2. Forward-Looking Document Comment Generation (DCG),
3. LLM Evaluation in the Financial Sector (FinLLM).
For more information and updates, kindly visit our website at
https://sites.google.com/nlg.csie.ntu.edu.tw/finnlp-agentscen/home
Looking forward to your contributions and participation.
Warm regards,
Chung-Chi Chen, Tatsuya Ishigaki, Hiroya Takamura, Akihiko Murai, Ryoko
Nishino, Hen-Hsen Huang, Hsin-Hsi Chen
FinNLP-AgentScen Organizers
-----
We understand that some of you may be subscribed to multiple lists and
could receive this message more than once. We apologize for any
inconvenience this may cause and appreciate your understanding as we strive
to keep our community well-informed.
Call for Doctoral Students in Artificial Intelligence
The Finnish Doctoral Program Network in Artificial Intelligence is looking for 100 new PhD students to work in fundamental AI and machine learning research and in five application areas. Come do a PhD tackling challenging research questions in a network that fosters industry and multidisciplinary collaboration!
* Call website: https://fcai.fi/doctoral-program
* Deadline: April 2, 2024
JOB DETAILS
The positions are based at one of the ten universities<https://fcai.fi/doctoral-program#offer> that are part of the Finnish Doctoral Program Network in Artificial Intelligence. The recruiting university will be the same as that of the primary supervisor. The matching of the candidates with supervisors will be done during the review process and the candidates will have a chance to prioritise the supervising professor they want to work with (see details in FAQ<https://fcai.fi/doctoral-program#faq>).
All positions are fully-funded. PhD student contracts will be made for three years. The terms of employment and the salaries are based on the General Collective Agreement for Universities<https://www.sivista.fi/wp-content/uploads/2023/08/Yo-tes-1.4.2023-31.5.2025…>. The contract includes occupational healthcare.
We are looking for 100 new PhD students in two calls (in spring and fall 2024). The accepted candidates of the spring call are expected to start in August 2024, and the applicants from the fall call in January 2025.
HOW TO APPLY
We are looking for 100 new PhD students to join the Finnish Doctoral Program Network in Artificial Intelligence in two calls: the first one is open March 11–April 2, 2024 and the second will open in fall 2024.
Candidates will apply to all universities and application areas with the same joint application. In the application form, you are able to indicate which specific research areas and supervisors you are interested in. Note: Candidates who apply to supervisors based at the University of Helsinki, will have to submit a parallel application to the university’s own recruitment system. Please note that the application needs to be submitted to both of the recruitment systems to ensure a proper review. See further details<https://www.helsinki.fi/en/research/doctoral-school/doctoral-education-pilot>.
The deadline for applications in the ongoing call is April 2, 2024. Please submit your application in our online recruitment system<https://aalto.wd3.myworkdayjobs.com/aalto/login?redirect=%2Faalto%2Fjob%2FO…> with the required attachments (detailed below).
Required attachments:
1. Motivation letter (1–2 pages). Please specify the research area(s) and preferably the supervisors with whom you want to work.
2. CV
3. List of publications (if relevant; please do not attach full copies of publications)
4. A transcript of master’s/bachelor’s studies and the degree certificate of your latest degree. If you don’t have a Master's degree, a plan of completion must be submitted.
In the application form, you are also asked to provide contact details of two senior academics who can provide references.
All materials should be submitted in English in a PDF format. Note: You can upload max. five files to the recruitment system, each max. 5MB.
RESEARCH AREAS
FUNDAMENTAL AI
Fundamental AI methods are the core of the FCAI research activities and the cornerstone in all application areas. Fundamental AI encompasses probabilistic AI for verifiable and uncertainty-aware model building, simulation-based inference for efficient and interpretable reasoning capabilities, data-efficient deep learning, privacy-preserving and secure AI, interactive AI for collaborative AI tools, autonomous AI, statistics, and decision-making. Widely applicable goals of the fundamental AI are AI-assisted decision-making, design and modeling.
Keywords: Artificial Intelligence, Causal Inference, Collaborative AI and human modeling, Machine Learning, Statistics
AI IN LANGUAGE AND SPEECH TECHNOLOGY
The area covers all aspects of natural language processing (NLP), a field of research dealing with computational analysis and generation of human language. NLP is a broad field which spans from highly technical research on machine learning techniques for written and spoken language data, through the myriad of individual tasks such as machine translation and information retrieval, to digital linguistics. The field is reliant on very large datasets and high performance computing, offering exciting software engineering and algorithmic challenges. Finland has a long tradition of top-notch NLP research, especially in the multilingual setting and, recently, large language model development.
Keywords: Foundation models, Human language technology, Natural Language Processing, NLP, Large language models, Speech recognition, Speech generation, Machine translation, Crosslingual models
AI IN COMMUNICATIONS AND SIGNAL PROCESSING
The area covers a wide range of advanced methods in communications and distributed intelligence technologies, statistical methods in signal processing, and analysis of images, video, speech, audio and array signals.
The methodologies can be applied in various layers of communications systems from applications to the radio connectivity with distributed intelligence that is an integral part of next generation communication and computing systems targeting to solve issues related to ultra densification of infrastructure, devices and people, and to guarantee secure, low latency and reliable use of ICT resources using advanced AI methods.
This research area also includes acquiring, processing, analyzing and understanding digital images, video sequences, views from multiple cameras, multi-dimensional data from a 3D scanner, 3D point clouds from LiDaR sensors, or medical scanning devices, and extraction of high-dimensional data from the real world in order to produce numerical or symbolic information, e.g. in the forms of decisions, using models constructed with the aid of geometry, physics, statistics, and learning theory.
Keywords: Array signal processing, Computer vision, Edge intelligence, Perception, Sensors, Wireless communications
AI IN HEALTH
The health and wellbeing field holds high potential to profit from advances in AI. Applications range from personalized care and precision medicine to preventive care and to process optimization. Increasing availability of large amounts of multi-source data combined with novel AI paradigms give huge opportunities. Challenges are how to extract valid actionable knowledge from all that data, how to develop AI-based solutions that are trustworthy, fit into healthcare processes, and that have an actual impact.
Keywords: Biomedical Image and Signal analysis; Multi-modal Health Data Analysis; Predictive, Preventive, Personalized, Participatory Healthcare, Trustworthy AI, Healthcare Processes
AI IN ENGINEERING
Industries are currently employing AI methods in numerous research and development tasks. Examples include product design, predictive maintenance, and combining physical models with data-based methods. There is a great potential also in replacing laboratory development and experiments with virtual laboratory-type approaches. Research topics include:
* AI methods in industrial research and development, including:
* AI for product design and optimization, combining physic-based and data-driven models.
* AI for improving industrial operations: cyber security, anomaly detection in industrial time series and predictive maintenance.
* Methods supporting AI in industrial deployments, including on-device learning and federated learning on edge devices.
* Virtual laboratories for experimentation and cost-effective product design and validation.
* AI methods for autonomous functions in land, sea, air and space vehicles and machines. These range from pilot assistance, collision avoidance and navigation systems to full-mission autopilots.
Keywords: Autonomous systems, Energy systems, Machine automation, Manufacturing, Materials, Mechanical engineering, Robotics
AI IN SOCIETY AND BUSINESS
The area examines the societal, ethical, and economic dimensions of AI, including trustworthy and societally acceptable AI as well as the consequences of the uses of AI. It brings together AI research with social sciences and humanities to gain in-depth understanding of AI’s role in organizations, society, business, and the economy. It includes uses of AI in education and education about AI. The area fosters interdisciplinarity to reinforce cross-cutting themes such as sustainability, ethics, equity, trust, and social responsibility.
Keywords: AI in business operations, AI in society, AI and Education, AI Ethics
--
*****************************************************************
Jörg Tiedemann
Language Technology https://blogs.helsinki.fi/language-technology/
University of Helsinki
*MWE-UD2024 @LREC-COLING2024 Call for Sponsorships*
We are pleased to announce that the multiword expressions (MWE) and
Universal Dependencies (UD) research communities are joining forces in 2024
to organize a joint workshop MWE-UD2024 <https://multiword.org/mweud2024/>,
to be held as a full-day event collocated with LREC-COLING 2024
<https://lrec-coling-2024.org/about-lrec-coling/>, Torino, Italy, May 25,
2024.
The MWE WS organised by the MWE section <https://multiword.org/> of SIGLEX
<http://www.siglex.org/> (Special Interest Group on the Lexicon of the ACL
<https://www.aclweb.org>), ran into its 19th edition in 2023 and is
organized within major NLP conferences since 2003. The UniDive COST action
(CA21167) <https://unidive.lisn.upsaclay.fr/doku.php?id=start> is an
interdisciplinary scientific network devoted to universality, diversity,
and idiosyncrasy in language technology. Both communities share an interest
in developing guidelines, data-sets, and tools that can be applied to a
wide range of typologically diverse languages, raising fundamental
questions about tokenization, lemmatization, and morphological
decomposition of tokens.
The joint workshop invites submissions of original research on MWE, UD, and
the interplay of both. In particular, MWE-UD2024 calls for the submission
of the following topics:
● *Sensitivity of large language models (LLMs) to MWEs and syntactic
dependencies *
● *Applicability of UD and MWE annotation and discovery for
low-resource and typologically diverse languages and language varieties*
● *Case studies* on the consistency, coverage or universal
applicability of MWE annotation in the UD or PARSEME frameworks, as well as
studies on automatic detection and interpretation of MWEs in corpora
● *MWE and UD processing to enhance end-user applications, *including
Machine Translation (MT), text simplification, language learning and
assessment, social media mining, and abusive language detection
● *Testing developed systems on the latest dataset versions*
MWE-UD2024 features both long and short research papers, archival and
non-archival papers, oral and poster presentations, “Best Paper Awards”,
and keynote speakers of rising-star successful scientists including Dr N.G.
Levshina (Natalia) <https://www.ru.nl/en/people/levshina-n> Assistant
professor - Centre for Language Studies - Department of Language and
Communication, Radboud Universiteit, NL and Dr. Harish Tayyar Madabushi
<https://www.harishtayyarmadabushi.com/> Lecturer in Artificial
Intelligence, University of Bath, UK.
MWE-UD is proud of its *gender balance* feature among STEM conferences,
e.g. this year we have confirmed PC members (reviewers) 26 females and 23
males with a ratio of almost 50% each, from 22 countries across most
continents.
MWE-UD2024 offers a variety of sponsorship opportunities suitable for
organizations of all sizes. Sponsors of the MWE-UD2024 workshop at
LREC-COLING gain *visibility* for their companies and institutes at an
*international
and well-known venue* and contribute to the success of the event.
Sponsorship will support increasing the in-person attendance and
engagement of the WS, increasing the diversity of attendances (e.g.,
regions, levels of seniority, gender), through grants, delegates and
organizers reimbursement (e.g., waiving registration fees) especially for
those with limited funds. Sponsors receive more visibility in the MWE-UD
workshop and the LREC-COLING conference, by having their names and logos in
our website, having posters/desks/flags/leaflets in the venue, etc.
For more information about the Level of Sponsorship available at MWE-UD2024
and their benefits, please reach out to e-mail at
mweud2024-organizers(a)uni-duesseldorf.de including details of the person to
contact, e.g. e-mail address, level of sponsorship desired.
*Organizing Committee*
Archna Bhatia (Institute for Human and Machine Cognition, USA)
Gosse Bouma (Groningen University, NL)
Kilian Evang (Heinrich Heine University Düsseldorf, DE)
Marcos Garcia (University of Santiago de Compostela, Galiza, Spain)
Voula Giouli (Institute for Language & Speech Processing, ATHENA RC,
Greece)
Lifeng Han (University of Manchester, UK)
Joakim Nivre (Uppsala University and Research Institutes of Sweden,
Sweden)
*Standing by | At Large Members (SIGLEX-MWE):*
● A. Seza Doğruöz <https://research.flw.ugent.be/en/as.dogruoz> (Ghent
University) - SIGLEX-MWE nominated officer in 2023-2025
● Alexandre Rademaker <http://arademaker.github.io/> (IBM Research and
FGV EMAp) - SIGLEX-MWE nominated officer in 2023-2025
Best regards,
We look forward to seeing you at MWE-UD2024 in Italia!
The MWE-UD 2024 organizing committee
--
https://www.research.manchester.ac.uk/portal/lifeng.han.html Office: 2.90
Kilburn, Oxford Road, Manchester
Apologies for cross-posting.
----------------------------------------
*The International Conference on Spoken Language Translation*
*21st IWSLT 2024 – **third** Call for Participation*
*August 17-18, 2024 – Bangkok, Thailand*
*http://iwslt.org <http://iwslt.org/>*
The International Conference on Spoken Language Translation (IWSLT) is the
premier annual conference for all aspects of Spoken Language Translation.
Every year, the conference organizes and sponsors open evaluation campaigns
around key challenges in simultaneous and consecutive translation, under
real-time/low latency or offline conditions and under low-resource or
multilingual constraints. System descriptions and results from
participants’ systems and scientific papers related to key algorithmic
advances and best practices are presented.
IWSLT is the venue of the SIGSLTs, the Special Interest Group on Spoken
Language Translation of ACL, ISCA and ELRA. With a track record of 20
years, IWSLT benchmarks and proceedings serve as reference for all
researchers and practitioners working on speech translation and related
fields.
The 21st edition of IWSLT <https://iwslt.org/2024/> will be run as an
*ELRA/ACL* event and co-located with ACL 2024 <https://2024.aclweb.org/> on
August 17-18, 2024. It will be run as a hybrid event.
Important Dates
January 15, 2024: Release of shared task training and dev data
April 01-15, 2024: Evaluation period
April 29, 2024: Paper submission due (all papers)
June 4, 2024: Notification of acceptance
June 24, 2024: Camera-ready paper due
July 22, 2024: Pre-recorded video due
August 17-18, 2024: Conference
Evaluation
The IWSLT 2024 features shared tasks <https://iwslt.org/2024/#shared-tasks>
that address the following focus areas:
- Speech-to-speech track
- Simultaneous track
- Subtitling track
- Offline track
- Dubbing track
- Low-resource track
- Indic track
*Registration
<https://docs.google.com/forms/d/e/1FAIpQLSdGhCusmyPVmBz36EB8ABUFignw7nZCoKx…>
for the evaluation* is now open and training and development data for each
shared task are available through the website. The results of all tasks
will be collected and discussed in an overview paper that will be presented
at the conference. In addition, participants have the opportunity to
present their work through a system paper that will be published in the ACL
Proceedings.
Conference
IWSLT also invites submissions of scientific papers to be published in the
ACL Proceedings and presented either in oral or poster format. The
conference selects high-quality, original contributions on theoretical and
practical issues of spoken language translation research, technologies and
applications. For further information on this initiative, please refer to
the website <https://iwslt.org/2024/#paper-submission>
*Special Session: Recent Highlights in SLT*
IWSLT 2024 will introduce a special session entitled “SLT Recent Highlights
<https://iwslt.org/2024/special-session>”. The session intends to provide
an overview of recent highlights from the field across venues, with a
series of short presentations focused reviews on recent developments and
new/emerging trends. Additionally, a short and lively discussion may follow
the presentations. We believe this initiative will contribute to a better
understanding of the current landscape of SLT research and foster
collaboration and exchange of ideas within the IWSLT community. For more
information on this initiative, please refer to the website
<https://iwslt.org/2024/special-session>
Contact
Please send an email to iwslt-evaluation-campaign(a)googlegroups.com if you
have any questions related to the shared tasks.
Thanks,
Marine, Marcello, Alex, Jan, Sebastian, Elizabeth, Atul
(IWSLT organisers)
*******************************************************
EAMT 2024: The 25th Annual Conference of
The European Association for Machine Translation
24 - 27 June 2024
Sheffield, UK
https://eamt2024.sheffield.ac.uk/
@eamt_2024 (X account)
Keynote speakers:
- Alexandra Birch (University of Edinburgh, UK)
- Valter Mavrič (DG TRAD, European Parliament)
EXTENDED Tutorial proposal deadline: 22 March 2024
Tutorial date: 27 June 2024
More information:
https://eamt2024.sheffield.ac.uk/conference-calls/final-call-for-tutorials
*******************************************************
*****
IMPORTANT: In order to use OpenReview (https://openreview.net/), authors
need to have an account in the system. Please create your account as soon
as possible and use an institutional e-mail if possible. OpenReview may
take some time to approve accounts that do not use institutional domains.
If you have any issues, contact eamt2024(a)gmail.com
*****
*** Overview ***
The European Association for Machine Translation (EAMT) invites proposals
for tutorials to be held in conjunction with the EAMT 2024 conference
taking place in Sheffield, UK, from 24 to 27 June, with tutorials held on
27 June. We seek proposals in all areas of machine translation (see the
call for papers of the main conference for the focus areas of EAMT 2024).
The aim of a tutorial is primarily to help the audience develop an
understanding of particular technical, applied, and business matters
related to research, development, and use of MT and translation technology.
Presentations of particular technological solutions or systems are welcome,
provided that they serve as illustrations of broader scientific
considerations.
We recommend that the tutorial covers work by the presenters as well as by
other researchers. The submission should explain that this breadth is
ensured. Tutorials should not be “self-invited talks”.
*** Submission Details ***
Proposals should not exceed 4 pages of content (plus unlimited pages for
references), should be in PDF format, and should contain the following:
- A title and authors, affiliations, and contact information.
- A brief description of the tutorial content and its relevance to the
machine translation community.
- Short description of the target audience and any expected prerequisite
background the audience should be aware of.
- An outline of the tutorial structure content and how it will be covered
in a three-hour slot (half-day). In exceptional cases, six-hour tutorial
slots (full day) are available. These time limits do not include coffee
breaks, e.g., a three-hour tutorial, in fact, occupies a 3.5-hour slot, and
a six-hour tutorial occupies a 7-hour slot.
- Diversity considerations, e.g. use of multilingual data, indications of
how the described methods scale up to various languages or domains,
participation of both senior and junior instructors, demographic and
geographical diversity of the instructors, plans for how to diversify
audience participation, etc.
- Reading list. Work that you expect the audience to read before the
tutorial can be indicated by an asterisk. Recommended papers should provide
the breadth of authorship and include work by other authors, and work from
other disciplines is welcome if relevant.
- For each tutorial presenter, a one-paragraph statement of their research
interests and areas of expertise for the tutorial topic, as well as
experience in instructing an international audience.
An estimate of the audience size for the tutorial. If the same or a similar
tutorial has been given before, include information on where any previous
version of the tutorial was given and how many attendees the tutorial
attracted.
- A description of special requirements for technical equipment.
Tutorial proposals should be submitted as PDF files to OpenReview:
https://openreview.net/group?id=EAMT.org/2024/Tutorials_Track.
Submissions should be formatted according to the templates specified below.
Anonymisation is not required. Submissions should be no longer than 4 pages
(excluding references).
*** Templates for writing your proposal ***
There templates available in the following formats (check our website --
https://eamt2024.sheffield.ac.uk/conference-calls/final-call-for-tutorials):
- LaTeX
- Cloneable Overleaf template
- Word
- Libre Office/Open Office
- PDF
*** Evaluation Criteria ***
Each tutorial proposal will be evaluated according to its clarity and
preparedness, novelty or timely character of the topic, and instructors’
experience.
** Tutorial Instructor Responsibilities ***
Accepted tutorial presenters will be notified by 22 April 2024. They must
then provide abstracts of their tutorials for inclusion in the conference
registration material by the specific conference deadlines. The description
should be in two formats: (a) an ASCII version that can be included in
email announcements and published on the conference website, and (b) a PDF
version for inclusion in the electronic proceedings (detailed instructions
will be provided). Tutorial speakers must provide tutorial materials by 22
May 2024. The final submitted tutorial materials must minimally include
copies of the course slides and a bibliography for the material covered in
the tutorial.
For each tutorial being held at EAMT 2024, we offer free registration to
the conference for one tutor only.
*** Important Dates ***
- Submission deadline for tutorial proposals (extended): 22 March 2024
- Notification of acceptance (extended): 22 April 2024
- Tutorial slides + abstract + bibliography + any other materials
(extended): 22 May 2024
All deadlines are at 23:59 CEST.
*** Workshop Co-Chairs ***
Mary Nurminen (Tampere University)
Diptesh Kanojia (University of Surrey)
*** Local organising committee ***
Carolina Scarton (University of Sheffield)
Charlotte Prescott (ZOO Digital)
Chris Bayliss (ZOO Digital)
Chris Oakley (ZOO Digital)
Xingyi Song (University of Sheffield)
*** Sponsors ***
Silver: Translated <https://translated.com/welcome>, Unbabel
<https://unbabel.com/>
Bronze: Pangeanic <https://pangeanic.com/>, STAR
<https://www.star-group.net/en/home.html>, Transperfect
<https://globallink.transperfect.com/>
Collaborator: Apertium <https://apertium.org/>
Supporter: Spring Nature <https://www.springernature.com/gp>
Confirmed sponsors: Crosslang <https://crosslang.com> and RWS Language
Weaver <https://www.rws.com>
--
*Carolina Scarton*
Lecturer in Natural Language Processing
Department of Computer Science
University of Sheffield
http://staffwww.dcs.shef.ac.uk/people/C.Scarton/
*******************************************************
EAMT 2024: The 25th Annual Conference of
The European Association for Machine Translation
24 - 27 June 2024
Sheffield, UK
https://eamt2024.sheffield.ac.uk/https://twitter.com/eamt_2024 (X account)
Keynote speakers:
- Alexandra Birch (University of Edinburgh, UK)
- Valter Mavrič (DG TRAD, European Parliament)
EXTENDED Paper submission deadline: 22 March 2024
More information:
https://eamt2024.sheffield.ac.uk/conference-calls/final-call-for-papers
*******************************************************
*****
IMPORTANT: In order to use OpenReview (https://openreview.net/), authors
need to have an account in the system. Please create your account as soon
as possible and use an institutional e-mail if possible. OpenReview may
take some time to approve accounts that do not use institutional domains.
If you have any issues, contact eamt2024(a)gmail.com
*****
The European Association for Machine Translation (EAMT) invites everyone
interested in machine translation (MT) and translation-related tools and
resources ― developers, researchers, users, translation and localization
professionals and managers ― to participate in this conference.
Driven by the state of the art, the research community will demonstrate
their cutting-edge research and results. Professional MTusers will provide
insights into successful MT implementation of MT in business scenarios as
well as implementation scenarios involving large corporations, governments,
or NGOs. Translation scholars and translation practitioners are also
invited to share their first-hand MT experience, which will be addressed
during a special track.
Note that papers that have been archived in arXiv can be accepted for
submission provided that they have not already been published elsewhere.
EAMT 2024 has four tracks, namely Research: Technical, Research:
Translators & Users, Implementations & Case Studies, and Products &
Projects.
*** Research: technical ***
Submissions (up to 10 pages, plus unlimited pages for references and
appendices) are invited for reports of significant research results in any
aspect of MT and related areas. Such reports should include a substantial
evaluation component, or have a strong theoretical and/or methodological
contribution where results and in-depth evaluations may not be appropriate.
Papers are welcome on all topics in the areas of MT and translation-related
technologies, including, but not limited to:
- Deep-learning approaches for MT and MT evaluation
- Advances in classical MT paradigms: statistical, rule-based, and hybrid
approaches
- Comparison of various MT approaches
- Technologies for MT deployment: quality estimation, domain adaptation,
etc.
- Resources and evaluation
- MT in special settings: low resources, massive resources, high volume,
low computing resources
- MT applications: translation/localization aids, speech translation,
multimodal MT, MT for user generated content (blogs, social networks), MT
in computer-aided language learning, etc.
- Linguistic resources for MT: corpora, terminologies, dictionaries, etc.
- MT evaluation techniques, metrics, and evaluation results
- Human factors in MT and user interfaces
- Related multilingual technologies: natural language generation,
information retrieval, text categorization, text summarization, information
extraction, optical character recognition, etc.
Papers should describe original work. They should emphasise completed work
rather than intended work, and should indicate clearly the state of
completion of the reported results. Where appropriate, concrete evaluation
results should be included.
Papers should be anonymized, prepared according to the templates specified
below, and be no longer than 10 pages (plus unlimited pages for references
and appendices). Submit the paper as a PDF to OpenReview:
https://openreview.net/group?id=EAMT.org/2024/Technical_Track. Submissions
that do not conform to the required styles may be rejected without review.
**Track co-chairs
Rachel Bawden (Inria, Paris)
Víctor M Sánchez-Cartagena (University of Alicant)
*** Research: translators & users ***
Submissions (up to 10 pages, plus unlimited pages for references and
appendices) are invited for academic research on all topics related to how
professional translators and other types of MT users interact with, are
affected by, or conceptualise MT. Papers should report significant research
results with a strong theoretical and/or methodological contribution.
Topics for the track include, but are not limited to:
- The impact of MT and post-editing: including studies on processes,
effort, strategies, usability, productivity, pricing, workflows, and
post-editese
- Human factors and psycho-social aspects of MT adoption (ergonomics,
motivation, and social impact on the profession, relationship between user
profiles and MT adoption)
- Emerging areas for MT & post-editing: e.g. audiovisual, game
localisation, literary texts, creative texts, social media, health care
communication, crisis translation
- MT and ethics
- The impact of using translators’ metadata and user activity data for
monitoring their work
- The evaluation and reception of different modalities of translation:
human translation, post-edited, raw MT
- MT and interpreting
- Human evaluations of MT output
- MT for gisting and the impact of MT on users: use cases, expectations,
perceptions, trust, views on acceptability
- MT and usability
- MT and education/language learning
- MT in the translation/interpreting classroom
Papers should describe original work. They should emphasise completed work
rather than intended work, and should indicate clearly the state of
completion of the reported results.
Papers should be anonymized, prepared according to the templates specified
below, and be no longer than 10 pages (plus unlimited pages for references
and appendices). Submit the paper as a PDF to OpenReview:
https://openreview.net/group?id=EAMT.org/2024/Research_Translators_Users_Tr….
Submissions that do not conform to the required styles may be rejected
without review.
** Track co-chairs
Patrick Cadwell (DCU)
Ekaterina Lapshinova-Koltunski (University of Hildesheim)
*** Implementations & case studies ***
Submissions (approximately 4–6 pages) are invited for reports on case
studies and implementation experience with MT in organisations of all
types, including small businesses, large corporations, governments, NGOs,
or language service providers. We also invite translation practitioners to
share their views and observations based on their day-to-day experience
working with MT in a variety of environments.
Topics for the track include, but are not limited to:
- Integrating or optimising MT and computer-assisted translation in
translation production workflows (translation memory/MT thresholds, mixing
online and offline tools, using interactive MT, dealing with MT confidence
scores)
- Managing change when implementing and using MT (e.g. switching between
multiple MT systems, limiting degradations when updating or upgrading an MT
system)
- Implementing open-source MT (e.g. strategies to get support, reports on
taking pilot results into full deployment, examples of advanced
customization sought and obtained thanks to the open-source paradigm,
collaboration within open-source MT projects)
- Evaluating MT in a real-world setting (e.g. error detection strategies
employed, metrics used, productivity or translation quality gains achieved)
- Ethical and confidentiality issues when using MT, especially MT in the
cloud
- Using MT in social networking or real-time communication (e.g. enterprise
support chat, multilingual content for social media)
- MT and usability
- Implementing MT to process multilingual content for assimilation purposes
(e.g. cross-lingual information retrieval, MT for e-discovery or spam
detection, MT for highly dynamic content)
- MT in literary, audiovisual, game localization and creative texts
- Impact of MT and post-editing on translation practices and the
profession: processes, effort, compensation,
- Psycho-social aspects of MT adoption (ergonomics, motivation, and social
impact on the profession)
- Error analysis and post-editing strategies (including automatic
post-editing and automation strategies)
- The use of translators’ metadata and user activity data in MT development
- Freelance translators’ independent use of MT
- MT and interpreting
Papers should highlight real-world use scenarios, solutions, and problems
in addition to describing MT integration processes and project settings.
Where solutions do not seem to exist, suggestions for MT researchers and
developers should be clearly emphasized. For papers on implementations and
case studies produced by academics, we require co-authorship with the
actual organizations working with MT implementations.
Papers (approximately 4–6 pages, with a maximum of 10 pages -- plus
unlimited pages for references) should be formatted according to the
templates specified below and submitted as PDF files to Open Review:
https://openreview.net/group?id=EAMT.org/2024/Implementations_Case_Studies_….
Anonymization is not required in the Implementations & Case Studies track
submissions. Submissions that do not conform to the required styles may be
rejected without review.
** Track co-chairs
Vera Cabarrão (Unbabel)
Konstantinos Chatzitheodorou (Strategic Agenda)
*** Products & Projects ***
Submissions (2 pages, including references) are invited on either of the
subtracks (Products or Projects).
- Products: Tools for MT, computer-aided translation, and other translation
technologies (including commercial products and free/open-source
software). Descriptions should include information about product
availability and licensing, an indication of cost if applicable, basic
functionality, (optionally) a comparison with other products, and a
description of the technologies used. The authors should be ready to
present the tools in the form of demos or posters during the conference.
- Projects: Research projects, funded through grants obtained in
competitive public or private calls related to MT. Descriptions should
contain: project title and acronym, funding agency, project reference,
duration, list of partner institutions or companies in the consortium if
there is one, project objectives, and a summary of partial results
available or final results if the project has ended. The authors should be
ready to present the projects in the form of posters during the conference.
This follows on from the successful ‘project villages’ held at the last
EAMT conferences.
There will be a poster boaster session for this track, in which authors
will have 120 seconds to attract attendees to their posters or demos with a
two-slide presentation.
Submissions should be formatted according to the templates specified
below. Anonymization is not required. Submissions should be no longer than
2 pages (including references), and submitted as PDF files to OpenReview:
https://openreview.net/group?id=EAMT.org/2024/Products_Projects_Track.
Track chairs
Helena Moniz (University of Lisbon (FLUL), INESC-ID)
Mikel Forcada (Prompsit Language Engineering, Elx)
*** Templates for writing your proposal ***
There templates available in the following formats (check our website --
https://eamt2024.sheffield.ac.uk/conference-calls/call-for-papers):
- LaTeX
- Cloneable Overleaf template
- Word
- Libre Office/Open Office
- PDF
*** Important deadlines ***
- Deadline for paper submission (further extended): 22 March 2024
- Notification to authors (further extended): 22 April 2024
- Camera ready deadline (further extended): 6 May 2024
- Author Registration (extended): 15 May 2024
All deadlines are at 23:59 CEST.
*** Local organising committee ***
Carolina Scarton (University of Sheffield)
Charlotte Prescott (ZOO Digital)
Chris Bayliss (ZOO Digital)
Chris Oakley (ZOO Digital)
Joanna Wright (University of Sheffield)
Xingyi Song (University of Sheffield)
*** Sponsors ***
Silver: Translated <https://translated.com/welcome>, Unbabel
<https://unbabel.com/>
Bronze: Pangeanic <https://pangeanic.com/>, STAR
<https://www.star-group.net/en/home.html>, Transperfect
<https://globallink.transperfect.com/>
Collaborator: Apertium <https://apertium.org/>
Supporter: Spring Nature <https://www.springernature.com/gp>
Confirmed sponsors: Crosslang <https://crosslang.com> and RWS Language
Weaver <https://www.rws.com>
--
*Carolina Scarton*
Lecturer in Natural Language Processing
Department of Computer Science
University of Sheffield
http://staffwww.dcs.shef.ac.uk/people/C.Scarton/
We offer a 3-year postdoctoral position in NLP at the University of Oslo, Norway, on the topic "Evaluating large language models - model architectures, training regimes and data selection". The application deadline is April 14, 2024. This position is funded by the DSTrain program (https://www.uio.no/dscience/english/dstrain/).
In the past years, (generative) large language models have become the core foundation models for a wide range of traditional NLP tasks, and they have also seen widespread adoption by the general public. At the same time, little is known about the specific training setups of commercial models, and some design decisions (in terms of model architecture, training regimes, and data selection) are based on traditions rather than empirical or theoretical considerations. Moreover, most current LLMs rely heavily on English training and evaluation data, and their performance on non-English languages remains difficult to assess. Potential candidates are expected to formulate their research project within the broad area of LLM evaluation. Examples of research topics are given below:
- Compare fine-tuning external pre-trained LLMs with training language-specific LLMs from scratch.
- Compare encoder-decoder LLMs with decoder-only LLMs.
- Evaluate generative LLMs on various text generation tasks, such as summarization, simplification, text normalization.
- Assess the multilingual (e.g. machine translation) and cross-lingual capabilities (cross-lingual transfer) of LLMs.
- Investigate how closely related low-resource languages are best accommodated in LLMs.
- Implement benchmarking datasets for LLM evaluation.
Applicants are expected to submit a research project that fits in the proposed research theme (Evaluaing large language models). Prospective applicants are encouraged to discuss their application with the contact person (me) to explore scientific focus and cooperation possibilities.
The application process for the DSTrain call is described here:
https://www.uio.no/dscience/english/dstrain/guide-for-applicants/applicatio…
This is the relevant research theme description:
https://www.uio.no/dscience/english/dstrain/research-areas/informatics/eval…
Please apply here:
https://www.jobbnorge.no/en/available-jobs/job/255679/dstrain-msca-postdoct…
Contact:
Yves Scherrer, LTG, University of Oslo
yves.scherrer(a)ifi.uio.no
[apologies if you receive multiple copies of this call]
Dear colleagues and friends,
*We are pleased to release the 2nd Call for Participation - CLEF 2024
SimpleText Task4: SOTA?*
*Overview:* SOTA? is introduced as Task 4 in the SimpleText track of CLEF
2024. The goal of the SOTA? shared task is to develop systems which given
the full text of an AI paper, are capable of recognizing whether an
incoming AI paper indeed reports model scores on benchmark datasets, and if
so, to extract all pertinent (Task, Dataset, Metric, Score) quadruples
presented within the paper.
More info on the task website:
https://sites.google.com/view/simpletext-sota/home
SOTA? will be divided into two evaluation phases:
- Evaluation Phase 1: Few-shot Testing;
- Evaluation Phase 2: Zero-shot Testing
*To participate in SOTA? i.e. SimpleText Task 4 @ CLEF 2024, please
register your team*:
1. CLEF 2024 official registration page
https://clef2024.imag.fr/index.php?page=Pages/registration.html
2. Codalab competition site:
https://codalab.lisn.upsaclay.fr/competitions/16616
Note, SOTA? is organized as a new task this year under the "SimpleText -
Improving Access to Scientific Texts for Everyone" initiative
https://simpletext-project.com/. Please take a look at the other 3 tasks,
i.e. Task 1, 2, and 3, offered by SimpleText and select one or more of
those task options too if you are interested. Note that there is no
interdependence of the dataset between "Task 4 - SOTA?" and the other three
tasks of SimpleText.
*Dates*
Training and validation datasets available: Feb 1, 2024 March 13, 2024
Test data available/Evaluation starts: April 23, 2024
Evaluation ends: May 3, 2024
Participant paper submissions due: May 31, 2024
Notification to authors: June 24, 2024
Camera ready due: July 8, 2024
CLEF 2024 Workshop, Grenoble, France: 9-12 September 2024
*Task Organizers*
Jennifer D’Souza (TIB Leibniz Information Centre for Science and Technology
- Germany)
Salomon Kabongo (L3S Research Center, Germany)
Hamed Babaei Giglou (TIB Leibniz Information Centre for Science and
Technology - Germany)
Yue Zhang (Berlin Technical University, Germany)
Sören Auer (TIB Leibniz Information Centre for Science and Technology -
Germany)
*We look forward to having you on board!*
*Contact:* sota.task [at] gmail.com
Extended deadline for abstract submission: 24 March 2024
The 8th International Conference 'Discourse Markers in Romance Languages'
https://sites.google.com/view/disrom2024
Lisbon, Portugal, 19-21 June 2024
*Important Dates*
24 March 2024 New deadline for abstract submission !
30 April 2024 Notification of acceptance
19-21 June 2024 Conference dates
*Meeting Description*
The Conference is one of a series of conferences on discourse markers in Romance languages (Madrid, 2010; Buenos Aires, 2011; Campinas, 2012; Heidelberg, 2015; Louvain-la- Neuve, 2017; Bergamo, 2019; Craiova 2022) and aims to build on the previous events, serving as a platform for internationally renowned linguists and young researchers alike to exchange views and ideas and to broaden their research perspectives.
This Conference’s theme will deal specifically :
1. with interactions between DMs and their explicit/implicit context, overcoming the traditional divide between their textual and interpersonal functions;
2. with the subjective adjustment function of DMs.
Researchers on discourse markers in Romance languages are invited to submit contributions on these topics, as well as on related subjects including (but not restricted to):
- definition of the discourse marker category;
- lexicons of discourse markers;
- discourse markers and their relation to other pragmatic categories;
- syntax-prosody-discourse interface;
- sociolinguistic approaches to discourse markers;
- variation of discourse markers across registers, languages and language varieties;
- translation studies;
- L1 and L2 acquisition of discourse markers;
- diachronic studies;
- experimental studies;
- corpus-based and computational studies;
- applied studies (business language, legal discourse, educational settings, etc.).
*Submissions*
The Conference will be on-site. Two presentation modalities will be possible: oral presentation and poster presentation.
Abstracts should not exceed one page (single spacing, 12-point Times New Roman font, not including figures and references, and must be uploaded as pdf). Abstracts can be written in any Romance language or in English.
They should be anonymous.
They will be submitted via EasyChair (https://easychair.org/conferences/?conf=disrom2024).
Authors must select the option oral presentation or poster presentation during the submission process on EasyChair.
*Keynote Speakers (provisional list)*
Denis Paillard (CNRS and Université Paris Diderot)
Isabel Margarida Duarte (Universidade de Porto)
*Workshop organizers (University of Lisbon)*
- Pierre Lejeune
- Marco Favaro
- Fabrizio Macagno
- Amália Mendes
*Scientific Committee*
Joanna Blochowiak (Université de Genève)
Margarita Borreguero Zuloaga (Universidad Complutense de Madrid)
Chloé Braud (University of Copenhague)
Sorina Ciobanu (University of Iasi)
Maria Antónia Diniz Caetano Coutinho (Universidade Nova de Lisboa)
Maria Josep Cuenca (Universitat de València)
Antonio Briz Gómez (Universitat de València)
Conceição Carapinha (Universidade de Coimbra)
Anna-Maria De Cesare (Universität Dresden)
Iria da Cunha (Universidad Nacional de Educación a Distancia)
Gaétane Dostie (Université de Sherbrooke)
Oana Adriana Duta (University of Craiova)
Chiara Fedriani (Università di Genova)
Mar Garachana Camarero (Universitat de Barcelona)
Chiara Ghezzi (Universitá di Bergamo)
Sonia Gómez-Jordana (Universidad Complutense de Madrid)
Pedro Gras (Université d’Anvers)
Martin Hummel (Universität Graz)
Julia Lavid Lopez (Universidad Complutense de Madrid)
Diana Lewis (Université Aix-Marseille)
Araceli López Serena (Universidad de Sevilla)
José Pinto de Lima (Centro de Linguística da Universidade Nova de Lisboa)
Maria Aldina Marques (Universidade do Minho)
Piera Molinelli (Università di Bergamo)
Silvia Murillo Ornat (Universidad de Zaragoza)
Cornelia Plag (Universidade de Coimbra)
Salvador Pons Bordería (Universitat de València)
Cecilia Popescu (University of Craiova)
Laurent Prévot (Université Aix-Marseille)
Augusto Soares da Silva (Universidade Catolica Portuguesa)
Laure Vieu (IRIT – Université de Toulouse III – Paul Sabatier)
Jacqueline Visconti (Università di Genova)
Sandrine Zufferey (Universität zu Bern)