SEARCH SOLUTIONS 2024
Innovations in Search & Information Retrieval
Search Solutions is the premier UK forum for the presentation of the latest innovations in search and information retrieval. We bring together practitioners, researchers, analysts and end users to discuss the latest developments in the information retrieval (IR) community and to share insights between research and practice. SS 2024 will be held at the BCS London office on Wednesday 27th November with tutorials on Tuesday 26th.
SUBMISSION GUIDELINES
We solicit speaker proposals for talks around the following categories:
* Innovative approaches used in IR production systems and products
* Topical issues in IR practice, e.g. trust, bias, and fairness
* Implications of generative AI and large language models
* Interdisciplinary collaborations, bridging areas such as information science, data science, user experience, and artificial intelligence
* IR practice within different professional communities, spanning eCommerce, media, recruitment, library and information science, healthcare information, and beyond
* Case studies showcasing best practices, design principles, evaluation techniques, and practical implementations in the field of information retrieval
We encourage presentations from startups and open-source projects. The presentation format of the event will be a combination of presentations, panels and discussions.
Proposals should be no more than 1 page and include:
* Title
* Abstract
* Main contribution & take-aways for attendees
* A short bio of the presenter and/or a brief organisation outline
Proposals should be emailed to irsg(a)bcs.org.uk cc’ing tgr2uk(a)gmail.com. There will be a separate call for tutorial presentations.
IMPORTANT DATES
* Time zone: Anywhere on Earth (AoE)
* Talk proposal due: 14 June
* Notifications: 28 June
ORGANISERS
* Ingo Frommholz
* Udo Kruschwitz
* Tony Russell-Rose
* Haiming Liu (tutorials chair)
Past Events
For past Search Solutions events and links to slides and videos (and for registration) please see: https://www.bcs.org/membership-and-registrations/member-communities/informa…
CONTACT
For further details, contact irsg(a)bcs.org.uk or see https://www.bcs.org/events-calendar/2024/november/search-solutions-2024-inf…
CALL FOR ARR Commitment
The 11th Workshop on Argument Mining @ ACL 2024
August 15, 2024
https://argmining-org.github.io/2024/
The 11th Workshop on Argument Mining will be held on August 15, 2024, in
Bangkok, Thailand, together with ACL 2024. The Workshop provides a
regular forum for presenting and discussing cutting-edge research in
argument mining (a.k.a argumentation mining) for academic and industry
researchers. By continuing a series of ten successful previous
workshops, this edition will welcome the submission of long, short, and
demo papers. Also, it will feature two shared tasks and a keynote talk.
ArgMining 2024 will accept submissions of ARR-reviewed papers, provided
that the ARR reviews and meta-reviews are available by the ARR
commitment deadline (May 24, AoE).
IMPORTANT DATES
Paper commitment from ARR: May 24, 2024
Notification of acceptance: June 17, 2024
Camera-ready papers due: July 1, 2024
Workshop: August 15, 2024
TOPICS OF INTEREST
- Identification of argument components (e.g., premises and conclusions)
- Structure analysis of arguments within and across documents
- Relation Identification between arguments and counterarguments (e.g.,
support and attack)
- Creation and evaluation of argument annotation schemes, relationships
to linguistic and discourse annotations, (semi-) automatic argument
annotation methods and tools, and creation of argumentation corpora
- Assessment of arguments for various properties (e.g., stance, clarity)
- Automatic generation of arguments and their components
- Consideration of discourse goals in argument generation
- Argument mining and generation from multi-modal/multi-lingual data
- Argument mining in specific genres and domains (e.g., education, law,
scientific writing)
- Analysis of unique styles within genres (e.g., short informal text,
highly structured writing)
- Integration of commonsense and domain knowledge into argumentation models
- Combination of information retrieval methods with argument mining
- Real-world applications, including argument web search, opinion
analysis and summarization, and misinformation detection
- Reflection on the ethical aspects and societal impact of
argument-mining methods
- Reflection on the future of argument mining in light of the fast
advancement of large language models (LLMs)
SUBMISSIONS
The organizing committee welcomes submitting long papers, short papers,
and demo descriptions. Accepted papers will be presented via oral or
poster presentations and included in the ACL proceedings as workshop papers.
- Long paper submissions must describe substantial, original, completed,
and unpublished work. Wherever appropriate, concrete evaluation and
analysis should be included. Long papers must be at most eight pages,
including title, text, figures, and tables. An unlimited number of pages
is allowed for references. Two additional pages are allowed for
appendices, and an extra page is allowed in the final version to address
reviewers’ comments.
- Short paper submissions must describe original and unpublished work.
Please note that a short paper is not a shortened long paper. Instead,
short papers should have a point that can be made in a few pages, such
as a small, focused contribution, a negative result, or an interesting
application nugget. Short papers must be at most four pages, including
title, text, figures, and tables. An unlimited number of pages is
allowed for references. One additional page is allowed for the appendix,
and an extra page is allowed in the final version to address reviewers’
comments.
- Demo descriptions must be at most four pages, including title, text,
examples, figures, tables, and references. A separate one-page document
should be provided to the workshop organizers for demo descriptions,
specifying furniture and equipment needed for the demo.
Multiple Submissions
ArgMining 2024 will not consider any paper under review in a journal or
another conference or workshop at the time of submission, and submitted
papers must not be submitted elsewhere during the review period.
ArgMining 2024 will not accept direct submissions that are actively
under review in ARR, or that overlap significantly (>25%) with such
submissions.
Submission Format
All long, short, and demonstration submissions must follow the
two-column ACL 2024 format. Authors are expected to use the LaTeX or
Microsoft Word style template
(https://github.com/acl-org/acl-style-files). Submissions must conform
to the official ACL style guidelines contained in these templates.
Submissions must be electronic and in PDF format.
Submission Link and Deadline For ARR Submissions
Authors have to fill in the submission form in the OpenReview system
including a link to their ARR submission and upload a PDF of their paper
before May 24, 2024, 11:59 pm UTC-12h (anywhere on earth).
https://openreview.net/group?id=aclweb.org/ACL/2024/Workshop/ArgMining_ARR_…
Double Blind Review
ArgMining 2024 will follow the ACL policies for preserving the integrity
of double-blind review for long and short paper submissions. Papers must
not include authors’ names and affiliations. Furthermore,
self-references or links (such as GitHub) that reveal the author’s
identity, e.g., “We previously showed (Smith, 1991) …” must be avoided.
Instead, use citations such as “Smith previously showed (Smith, 1991) …”
Papers that do not conform to these requirements will be rejected
without review. Papers should not refer, for further detail, to
documents that are not available to the reviewers. For example, do not
omit or redact important citation information to preserve anonymity.
Instead, use the third person or named reference to this work, as
described above (“Smith showed” rather than “we showed”). Papers may be
accompanied by a resource (software and/or data) described in the paper,
but these resources should also be anonymized.
Unlike long and short papers, demo descriptions will not be anonymous.
Demo descriptions should include the authors’ names and affiliations,
and self-references are allowed.
ANONYMITY PERIOD (taken from the ACL call for papers in verbatim for the
most part)
We follow the ACL Policies for Review and Citation. Submissions must be
anonymized, but there is no anonymity period or limitation on posting or
discussing non-anonymous preprints while the work is under peer review.
BEST PAPER AWARD
In order to recognize significant advancements in argument mining
science and technology, ArgMining 2024 will include the Best Paper
award. All papers at the workshop are eligible for the best paper award,
and a selection committee consisting of prominent researchers in the
fields of interest will select the award recipients.
SHARED TASKS
We will be hosting two shared tasks this year:
1. Perspective Argument Retrieval
2. DialAM-2024: The First Shared Task on Dialogical Argument Mining
ArgMining 2024 ORGANIZING COMMITTEE
Yamen Ajjour, Leibniz University Hannover
Roy Bar-Haim, IBM Research
Roxanne El Baff, German Aerospace Center (DLR) and Bauhaus-Universität,
Weimar
Zhexiong Liu, University of Pittsburgh
Gabriella Skitalinskaya, Leibniz University Hannover
Call For Papers - ICICS2024: The 15th International Conference on Information and Communication Systems - http://www.just.edu.jo/icics
August 13 - 15, 2024, Irbid, Jordan
"The Conference Program includes free trips to Jarash and Umm Qais"
Full paper submission: May 20th , 2024..
The 15th International Conference on Information and Communication Systems (ICICS 2024) is a forum for scientists, engineers, and practitioners to present their latest research results, ideas, developments, and applications in all areas of Computer and Information Sciences. The topics that will be covered in ICICS 2024 include, but are not limited to:
-Communication Systems, electronics, and Signal Processing
-Networking, and Internet of Things (IoT)
-Data Science and Big Data
-Natural Language Processing and Applications
-Software & web Engineering, and Information Systems
-Security, Privacy, and Digital Forensics
-Cloud and Fog/Mobile Edge Computing
-AI and Machine Learning
-E-Learning Technologies
Prospective authors are invited to submit full papers following the guideline posted on the conference website http://www.just.edu.jo/icics. Submitted papers will be peer-reviewed (check review process in the conference website), and prospective authors are expected to present their papers at the conference (possible for a virtual). The accepted and registered papers will appear in the conference proceedings.
Submission web page is https://easychair.org/conferences/?conf=icics20240
*** Extended version of the selected papers will be published in prestigious journals
Important Dates
-Full paper submission: May 20th , 2024.
-Notification of Decision: June 23rd , 2024.
-Camera-Ready and Registration: June 30th , 2024
Please send any inquiry to: icics(a)just.edu.jo
Dear all,
We are organizing a free online event "Lancaster Talks on Language: Corpus linguistics". This event offers three short lectures on corpus linguistics by leading experts in the field. You will also learn more about training opportunities in corpus linguistics at Lancaster University.
Free registration: https://www.lancaster.ac.uk/linguistics/events/lancaster-talks-on-language-…
Programme:
Prof. Elena Semino: Corpus linguistics and healthcare
Prof. Vaclav Brezina: New tools and methods in corpus linguistics
Dr Dana Gablasova: Corpus linguistics and data-driven learning
Best,
Vaclav
Professor Vaclav Brezina
Professor in Corpus Linguistics
Department of Linguistics and English Language
ESRC Centre for Corpus Approaches to Social Science
Faculty of Arts and Social Sciences, Lancaster University
Lancaster, LA1 4YD
Office: County South, room C05
T: +44 (0)1524 510828
[cid:image001.jpg@01DA9632.B1FA8F10]@vaclavbrezina
[cid:image002.jpg@01DA9632.B1FA8F10]<http://www.lancaster.ac.uk/arts-and-social-sciences/about-us/people/vaclav-…>
***Apologies for possible cross-posting ****
*
At the Institute of Computer Science (Prof. Dr. Alexander Mehler, TTLab,
https://www.texttechnologylab.org), Department of Computer Science and
Mathematics at Goethe University Frankfurt / Germany, a position for a
*Research Assistant (m/f/d)
(E 13 TV-G-U)*
is available *at the next possible date* as a temporary position until
31.12.2027 within the project "INF: Scientific services and data
management”. The project is part of the Collaborative Research Center
(CRC) 1629 "Negation in Language and Beyond (NegLaB)"
(https://www.uni-frankfurt.de/149292001/Negation_in_Language_and_Beyond),
which is funded by the German Research Foundation (DFG). The salary
group classification is based on the job characteristics determined by
the collective labour agreement in effect for the Goethe University
(TV-G-U).
*Responsibilities:*
The aim of the project is the establishment, maintenance, and
development of a research database, including services to ensure the
reproducibility and long-term sustainability of partly multimodal
research data (i.e., both linguistic and non-linguistic data such as
eye-tracking data), as well as the development or application of deep
learning methods for the annotation of linguistic data. The applicant is
expected to engage in the project and actively participate in courses,
workshops, and events of the CRC. We are looking for a highly qualified
person who (a) has a strong interest in contributing to current research
data infrastructures, and (b) wants to join a research team for the
development of NLP methods and their data-driven application. We offer a
stimulating and international environment in the field of computational
linguistics/text technology, including financial funds for conference
participation and individual career development.
*Requirements:*
* Completed academic degree (Master's or equivalent) in computer
science, computational linguistics or a field related to database
management and NLP.
* Demonstrable experience in setting up databases and utilizing NLP
methods (e.g. automatic annotation, corpus linguistic methods).
* Extensive programming knowledge in Java, Python or similar.
* Knowledge of the use of Docker is an advantage.
* An interest in linguistic issues is desirable but not essential.
Please submit your application with the usual documents (cover letter,
curriculum vitae, copies of certificates) electronically in a single PDF
document to Prof. Dr. Alexander Mehler by *03.05.2024*:
mehler(a)em.uni-frankfurt.de
<mailto:mehler@em.uni-frankfurt.de?subject=INF NegLaB>.
--
------------------------------------------------------------------------
Giuseppe Abrami
Text-Technology Lab
Fakultät für Informatik und Mathematik
Goethe-Universität Frankfurt
Robert-Mayer-Strasse 10
4. Stock (Texttechnologie)
60325 Frankfurt am Main
Postfach: 154
Tel: +49 69-798-28926
Fax: +49 69-798-28931
Mail: abrami(a)em.uni-frankfurt.de
Web: http://www.texttechnologylab.org
Call for Abstracts: Analysis of Linguistic VAriation for BEtter Tools (ALVABET) within the LLcD 2024 Conference (https://llcd2024.sciencesconf.org/)
Workshop
Variation plays a particularly important role in linguistic change, since every change stem from a state of variation; but each state of variation does not necessarily end up with a change: the new variant can disappear, or variation can linger but in different contexts. Access to sufficient amounts of data and their quantification, in order to detect the emergence of new variants as precisely as possible, and the recession or even disappearance of others, is a precious tool for the study of variations, whatever their dimensions (diachronic, diatopic, …) and in whatever field (syntax, morphology, …). The appearance of large corpora has thus renewed the study of variation. NLP has contributed largely to this renewal, providing tools for the enrichment and the exploration of these corpora. In return, linguistic analysis can help explain some of these errors and thus deepen the picture where performance metrics tend to flatten out everything under a single number, or even help improve the performances.
NLP annotation tools, such as syntactic parsers and morphological taggers, reach great performances nowadays when they are applied on similar data to those seen during their development. However, they quickly drop as the target data diverges from those of the training scenario. This raises a number of issues when it comes to using automatically annotated data to perform linguistic studies.
This workshop aims at exploring bilateral contributions between Natural Language Processing and variation analysis in the fields of morphosyntax and syntax, from diachronic and diatopic perspectives but also from genre, domain or form of writing, without any restriction on the languages of interest.
We warmly welcome submissions dealing with the issues and contributions of applying NLP to variation analysis :
• Quantification of variation along its different dimensions (both external and internal ones as well as in interaction with each other);
• Impact of annotation errors on the study of marginal structures (emergent or recessing);
• Syntactic variation when it is induced by semantic changes.
But also submissions dealing with the contributions of variation analysis to NLP:
• Variation mitigation (spelling standardisation...);
• Domain adaptation (domain referring here to any variation dimension);
• Error analysis (in and out of domain) in light of known variation phenomena, amongst which (de-)grammaticalisation;
• The evolution of grammatical categories and its impact on prediction models;
• The place of variation studies in NLP in the large language model era.
These themes are only suggestions, and the workshop will gladly host any submission that deals substantially with the reciprocal contributions between NLP and variation analysis in the mentioned fields.
Full workshop description: https://llcd2024.sciencesconf.org/data/pages/WS12Eng.pdf
Important Dates
• Apr 30, 2024: extended deadline for abstract submission
• May 15, 2024: Notification
• Sep 9-11: Conference
Submissions
Abstracts must clearly state the research questions, approach, method, data and (expected) results. They must be anonymous: not only must they not contain the presenters' names, affiliations or addresses, but they must avoid any other information that might reveal their author(s). They should not exceed 500 words (including examples, but excluding bibliographical references).
Abstracts will be assessed by two members of the Scientific Committee and (one of) the workshop organizers.
The Content-Centered Computing
<https://www.cs.unito.it/do/gruppi.pl/Show?_id=453y> group at the
University of Turin, Italy, offers *two 14-month postdoc positions* in the
context of HARMONIA (Harmony in Hybrid Decision-Making: Knowledge-enhanced
Perspective-taking LLMs for Inclusive Decision-Making), funded by the
European Union under the NextGenerationEU program within the larger project
FAIR (Future Artificial Intelligence) Spoke 2 "Integrative AI"
<https://fair.fbk.eu/>. The project aims at developing methods for the
adoption of knowledge-enhanced Large Language Models (LLMs) in supporting
informed and inclusive political decisions within public decision-making
processes.
The topics of the postdoc fellowships are:
- Computational linguistics methods for knowledge-enhanced
perspective-taking LLMs to support Inclusive Decision-Making
- Perspective-taking LLMs for supporting Inclusive Decision-Making
(full descriptions below)
The team includes members of the Computer Science Department and Economics
and Statistics Department of the University of Turin.
A PhD in Computer Science, Computational Linguistics, or related areas is
highly recommended. Knowledge of Italian is not mandatory.
The deadline for application is *May 13th 2024*.
The gross salary is €25.328 (about €1,860/month net salary). Turin
<https://www.turismotorino.org/en/territory/torino-metropoli/torino> is a
vibrant and liveable city in Northern Italy, close to the beautiful Italian
Alps and with a manageable cost of living
<https://en.unito.it/living-turin/when-you-arrive/cost-living-turin>.
Link to the call
<https://www.turismotorino.org/en/territory/torino-metropoli/torino> (in
Italian). Link to the application platform
<https://pica.cineca.it/unito/assegni-di-ricerca-unito-2024-i-pnrr/>.
Please write to <valerio.basile(a)unito.it> or <viviana.patti(a)unito.it> for
further information on how to apply.
Best regards,
Valerio Basile
--
*Computational linguistics methods for knowledge-enhanced
perspective-taking LLMs to support Inclusive Decision-Making*
The activity will focus on a) design of a semantic model to represent
interactions between urban services and citizens and integrate multi-source
hybrid data; b) data annotation by citizens with different socio-cultural
backgrounds to collect different perspectives on social issues. Data will
be collected and organized in a Knowledge Graph. The activity will be
supported by an interdisciplinary team of experts in KR, behavioral
economics and LLMs (link with the design of knowledge-enhanced LLMs).
*Perspective-taking LLMs for supporting Inclusive Decision-Making*
The activity will focus on a) exploring techniques for integrating
multi-source hybrid citizen data into LLMs (RAG and Knowledge Injection);
b) developing methods for training and evaluating perspective-taking LLMs,
which explicitly encode multiple perspectives, embodying the point of view
of different citizen communities on a topic. Planned activities include:
benchmark creation, error analysis, and evaluation of the efficiency and
reliability of the developed technologies.
*CALAMITA - Challenge the Abilities of LAnguage Models in ITAlian*
*Special event co-located with the Tenth Italian Conference on
Computational Linguistics - CLiC-it 2024 Pisa, 4 - 6 December, 2024 -
https://clic2024.ilc.cnr.it/ <https://clic2024.ilc.cnr.it/> *
*Upcoming deadline: 17th May 2024, challenge pre-proposal submission!
Pre-proposal form: *https://forms.gle/u4rSt9yXHHYquKrB6
*Project Description*
AILC, the Italian Association for Computational Linguistics, is launching a
*collaborative* effort to develop a dynamic and growing benchmark for
evaluating LLMs’ capabilities in Italian.
In the *long term*, we aim to establish a suite of tasks in the form of a
benchmark which can be accessed through a shared platform and a live
leaderboard. This would allow for ongoing evaluation of existing and newly
developed Italian or multilingual LLMs.
In the *short term*, we are looking to start building this benchmark
through a series of challenges collaboratively construed by the research
community. Concretely, this happens through the present call for challenge
contributions. In a similar style to standard Natural Language Processing
shared tasks, *participants are asked to contribute a task and the
corresponding dataset with which a set of LLMs should be challenged*.
Participants are expected to provide an explanation and motivation for a
given task, a dataset that reflects that task together with any information
relevant to the dataset (provenance, annotation, distribution of labels or
phenomena, etc.) and a rationale for putting that together that way.
Evaluation metrics and example prompts should also be provided. Existing
relevant datasets are also very welcome, together with related publications
if available. All of the proposed challenges either with existing datasets
or new datasets, will have to follow the challenge template, which will be
distributed in due time, towards the write-up of a challenge paper.
In this first phase, all prospective participants are asked to submit a
*pre-proposal* by filling in this form https://forms.gle/u4rSt9yXHHYquKrB6.
Please fill in all the fields so we can get an idea of what challenge you’d
like to propose, how the model should be prompted to perform the task,
where you’d get the data and how much, whether it’s already available, etc.
The organizers will examine the submitted pre-proposals and select those
challenges that comply with the template’s requirements, with an eye to
balancing different challenge types. The selected challenges will be
expanded with a full dataset, longer descriptions, etc. according to the
aforementioned template which will be distributed later. The final report
of each accepted challenge must provide the code for the evaluation with an
example that must smoothly run on a pre-selected base LLM (most likely
LLaMa-2) which will be communicated by the organisers in the second phase.
All reports will be published as CEUR Proceedings related to the CALAMITA
event. Subsequently, all challenge organisers who wish to be involved can
participate in a broader follow-up paper, targeting a top venue, which will
describe the whole benchmark, procedures, results, and analyses.
Once this first challenge set is put together, the *CALAMITA organizers*
will run *zero* or *few* shots experiments with a selection of LLMs, and
write a final report. No tuning materials or experiments are expected at
this stage of the project.
*Deadlines (tentative)*
- *17th May 2024: pre-proposal submission*
- 27th May 2024: notification of pre-proposal acceptance
- End of May 2024: distribution of challenge paper template and further
instructions
- 2nd September 2024: data and report submission
- 30th September 2024: benchmark ready with reports for each challenge
(after light review)
- October-November 2024: running selected models on the benchmark with
analyses
- 4th-6th December 2024: CLIC-it Pisa (special event co-located with
CLIC-it 2024)
*Website:* https://clic2024.ilc.cnr.it/calamita (under construction)
*Mail: *calamita.ailc(a)gmail.com
*Organizers*
- Pierpaolo Basile (University of Bari Aldo Moro)
- Danilo Croce (University of Rome, Tor Vergata)
- Malvina Nissim (University of Groningen)
- Viviana Patti (University of Turin)
On behalf of Prof. Mark Sandler.
Lyrics generation project using LLMs.
Notice the closing deadline.
From: Mark Sandler <mark.sandler(a)qmul.ac.uk>
I am happy to announce that the Centre for Digital Music is now formally advertising the new research positions I posted last week. One area is lyrics generation and the other is music signal processing (instrument ID, loop ID, lyric transcription). Both are collaborative with London-based music industry companies, session and stage.
These are available immediately and can be offered as either post-doctoral or graduate research assistants, and can be either full- or part-time. Closing date is May 1 2024.
Details can be found here
https://www.qmul.ac.uk/jobs/vacancies/items/9619.htmlhttps://www.qmul.ac.uk/jobs/vacancies/items/9617.html
help
---- Replied Message ----
From corpora-request(a)list.elra.info Date 04/22/2024 20:00 To corpora(a)list.elra.info Cc Subject Corpora Digest, Vol 789, Issue 1
Send Corpora mailing list submissions to
corpora(a)list.elra.info
To subscribe or unsubscribe via email, send a message with subject or
body 'help' to
corpora-request(a)list.elra.info
You can reach the person managing the list at
corpora-owner(a)list.elra.info
When replying, please edit your Subject line so it is more specific
than "Re: Contents of Corpora digest..."
Today's Topics:
1. WMT 2024: Low-Resource Indic Language Translation. (Santanu Pal)
2. Final CPF: SIGIR eCom'24: May 3rd (Tracy Holloway King)
3. [2nd CFP] Special issue on Abusive Language Detection of the journal Traitement Automatique des Langues (TAL)
(Farah Benamara)
4. [Call for Participation]: GermEval2024 Shared Task GerMS-Detect - Sexism Detection in German Online News Fora @Konvens 2024
(stephanie.gross(a)ofai.at)
----------------------------------------------------------------------
Message: 1
Date: Sun, 21 Apr 2024 13:02:42 +0100
From: Santanu Pal <santanu.pal.ju(a)gmail.com>
Subject: [Corpora-List] WMT 2024: Low-Resource Indic Language
Translation.
To: corpora(a)list.elra.info
Message-ID:
<CALdLWwZZ4EJ6Vk5r9xS1b90vGBgtWpfq_PwGJSF=F+UQ6-ZCUg(a)mail.gmail.com>
Content-Type: multipart/alternative;
boundary="00000000000043488c06169a1b64"
Dear Colleagues,
We are pleased to inform you that we will be hosting the "Shared Task:
Low-Resource Indic Language Translation" again this year as part of WMT
2024. Following the outstanding success and enthusiastic participation
witnessed in the previous year's edition, we are excited to continue this
important initiative. Despite recent advancements in machine translation
(MT), such as multilingual translation and transfer learning techniques,
the scarcity of parallel data remains a significant challenge, particularly
for low-resource languages.
The WMT 2024 Indic Machine Translation Shared Task aims to address this
challenge by focusing on low-resource Indic languages from diverse language
families. Specifically, we are targeting languages such as Assamese, Mizo,
Khasi, Manipuri, Nyishi, Bodo, Mising, and Kokborok.
For inquiries and further information, please contact us at
lrilt.wmt24(a)gmail.com. Additionally, you can find more details and updates
on the task through the following link: Task Link:
https://www2.statmt.org/wmt24/indic-mt-task.html.
We highly encourage participants to register in advance so that we can
provide updates regarding release dates of data and other relevant
information periodically
To register for the event, please fill out the registration form available
here. (
https://docs.google.com/forms/d/e/1FAIpQLSd8LwriqdLLhVNAvUWEcGRJmKuBFQZ9BR_…
)
We look forward to your participation and contributions to advancing
low-resource Indic language translation.
with best regards,
Santanu