Apologies for cross-posting
---------------------------------
The Sixth Workshop on Technologies for Machine Translation of
Low-Resource Languages (LoResMT 2023)
https://www.loresmt.org/
@ EACL 2023 (May 2–6, 2023)
Valamar Lacroma Dubrovnik Hotel, Iva Dulčića 34, 20000, Dubrovnik, Croatia
SUBMISSION
https://softconf.com/eacl2023/LoResMT2023
TIMELINE
Paper due: February 22, 2023, at 23:59 (Anywhere on Earth)
Notification of acceptance: March 20, 2023
Camera-ready papers due: March 27, 2023
Conference dates: May 2-6, 2023
SCOPE
Based on the success of past low-resource machine translation (MT)
workshops at AMTA 2018 (https://amtaweb.org/), MT Summit 2019 (
https://www.mtsummit2019.com), AACL-IJCNLP 2020 (http://aacl2020.org/),
AMTA 2021, COLING 2022, we introduce the Sixth Workshop. The workshop
provides a discussion panel for researchers working on MT systems/methods
for low-resource and under-represented languages in general. We would like
to help review/overview the state of MT for low-resource languages and
define the most important directions. We also solicit papers dedicated to
supplementary NLP tools that are used in any language and especially in
low-resource languages. Overview papers of these NLP tools are very
welcome. It will be beneficial if the evaluations of these tools in
research papers include their impact on the quality of MT output.
TOPICS
We are highly interested in (1) original research papers, (2)
review/opinion papers, and (3) online systems on the topics below; however,
we welcome all novel ideas that cover research on low-resource languages.
- COVID-related corpora, their translations and corresponding NLP/MT systems
- Neural machine translation for low-resource languages
- Work that presents online systems for practical use by native speakers
- Word tokenizers/de-tokenizers for specific languages
- Word/morpheme segmenters for specific languages
- Alignment/Re-ordering tools for specific language pairs
- Use of morphology analyzers and/or morpheme segmenters in MT
- Multilingual/cross-lingual NLP tools for MT
- Corpora creation and curation technologies for low-resource languages
- Review of available parallel corpora for low-resource languages
- Research and review papers of MT methods for low-resource languages
- MT systems/methods (e.g. rule-based, SMT, NMT) for low-resource languages
- Pivot MT for low-resource languages
- Zero-shot MT for low-resource languages
- Fast building of MT systems for low-resource languages
- Re-usability of existing MT systems for low-resource languages
- Machine translation for language preservation
SUBMISSION INFORMATION
We are soliciting two types of submissions: (1) research, review, and
position papers and (2) system demonstration papers. For research, review
and position papers, the length of each paper should be at least four (4)
and not exceed eight (8) pages, plus unlimited pages for references. For
system demonstration papers, the limit is four (4) pages. Submissions
should be formatted according to the official EACL 2023 style templates
(LaTeX, Word, Overleaf). Accepted papers will be published online in the
EACL 2023 proceedings and will be presented at the conference.
Submissions must be anonymized and should be done using the official
conference management system (which will be available in the following
weeks). Scientific papers that have been or will be submitted to other
venues must be declared as such and must be withdrawn from the other venues
if accepted and published at LoResMT. The review will be double-blind.
We would like to encourage authors to cite papers written in ANY language
that are related to the topics, as long as both original bibliographic
items and their corresponding English translations are provided.
ORGANIZING COMMITTEE (LISTED ALPHABETICALLY)
Atul Kr. Ojha, University of Galway & Panlingua Language Processing LLP
Chao-Hong Liu, Potamu Research Ltd
Ekaterina Vylomova, University of Melbourne, Australia
Jade Abbott, Retro Rabbit
Jonathan Washington, Swarthmore College
Nathaniel Oco, National University (Philippines)
Tommi A Pirinen, UiT The Arctic University of Norway, Tromsø
Valentin Malykh, Huawei Noah’s Ark lab and Kazan Federal University
Varvara Logacheva, Skolkovo Institute of Science and Technology
Xiaobing Zhao, Minzu University of China
PROGRAM COMMITTEE (LISTED ALPHABETICALLY)
Alberto Poncelas, Rakuten, Singapore
Alina Karakanta, Fondazione Bruno Kessler
Amirhossein Tebbifakhr, Fondazione Bruno Kessler
Anna Currey, Amazon Web Services
Aswarth Abhilash Dara, Amazon
Arturo Oncevay, University of Edinburgh
Bharathi Raja Chakravarthi, University of Galway
Beatrice Savold, University of Trento
Bogdan Babych, Heidelberg University
Constantine Lignos, Brandeis University, USA
Daan van Esch, Google
Diptesh Kanojia, University of Surrey, UK
Duygu Ataman, University of Zurich
Eleni Metheniti, CLLE-CNRS and IRIT-CNRS
Francis Tyers, Indiana University
Kalika Bali, MSRI Bangalore, India
Koel Dutta Chowdhury, Saarland University (Germany)
Jade Abbott, Retro Rabbit
Jasper Kyle Catapang, University of the Philippines
John P. McCrae, DSI, Univerity of Galway
Kevin Patrick Scannell, Saint Louis University
Liangyou Li, Noah’s Ark Lab, Huawei Technologies
Maria Art Antonette Clariño, University of the Philippines Los Baños
Majid Latifi, University of York, York, UK
Mathias Müller, University of Zurich
Monojit Choudhury, Microsoft Turing
Rajdeep Sarkar, Univerity of Galway
Rico Sennrich, University of Zurich
Sangjee Dondrub, Qinghai Normal University
Santanu Pal, WIPRO AI
Sardana Ivanova, University of Helsinki
Shantipriya Parida, Silo AI
Sunit Bhattacharya, Charles University
Surafel Melaku Lakew, Amazon AI
Tommi A Pirinen, UiT The Arctic University of Norway, Tromsø
Wen Lai, Center for Information and Language Processing, LMU Munich
Valentin Malykh, Huawei Noah’s Ark lab and Kazan Federal University
CONTACT
Please email loresmt(a)googlegroups.com if you have any
questions/comments/suggestions.
Dear colleagues,
The UCSC Natural Language Processing (NLP) master’s degree program
<http://nlp.ucsc.edu/> provides both depth and breadth in core algorithms
and methods for NLP. Taught intensively over 15-18 months, our program
design combines theoretical learning with hands-on practice to ensure our
students have the right skill set to prepare for a professional career in
this fast-growing field. We are currently accepting applications for Fall
2023 admission consideration.
Program Highlights:
-
A 15-18 month program with a Capstone project mentored by NLP faculty
and industry experts.
-
NLP students receive an in-depth, systematic education in NLP, machine
learning, and data science and analytics with faculty who have both
academic and industry experience.
-
Our Industry Advisory Board provides insight and career advice through
mentoring, guest lectures, and attendance at networking and professional
development events.
-
All our courses are exclusive to NLP students and are designed with
input from our Industry Advisory Board ensuring the content is current,
relevant, and grounded in real-world context.
-
Our program is based at state-of-the-art facilities at the UCSC Silicon
Valley Campus, located in Santa Clara, California.
Applying to the NLP MS Program
This program is intended for students with a strong background in computer
science. We are looking for multifaceted individuals with solid skills in
programming, algorithms, machine learning, probability, statistics, and
linguistics. Visit our Admissions page <http://nlp.ucsc.edu/admissions> to
review admission requirements and tips for applying, and to connect with
our support team.
Applications for Fall 2023 admission consideration are now open. Apply by
March 1, 2023 <https://applygrad.ucsc.edu/apply/>.
If you have questions about the program or the application process, please
contact the NLP Support Team.
The NLP Program Team
Natural Language Processing MS Program <http://nlp.ucsc.edu/>
Baskin Engineering
University of California, Santa Cruz
Hi everyone,
We’re looking for a summer intern Computational Linguist to join our team
at Linkedin!
https://www.linkedin.com/jobs/view/3448033554
Please feel free to reach out if you have any enquiries about this
internship, or pass this along to any student you think would enjoy working
in Computational Linguistics and Language Technologies this summer.
Many thanks,
Kind regards,
Lamia
DMR 2023, the Fourth International Workshop on Designing Meaning
Representations, will be co-located with IWCS 2023
<http://iwcs2023.loria.fr/>. It will be held on June 20th, 2023 in Nancy,
France.
While deep learning methods have led to many breakthroughs in practical
natural language applications, most notably in Machine Translation, Machine
Reading, Question Answering, Recognizing Textual Entailment, and so on,
there is still a sense among many NLP researchers that we have a long way
to go before we can develop systems that can actually “understand” human
language and explain the decisions they make. Indeed, “understanding”
natural language entails many different human-like capabilities, and they
include but are not limited to the ability to track entities in a text,
understand the relations between these entities, track events and their
participants described in a text, understand how events unfold in time, and
distinguish events that have actually happened from events that are planned
or intended, are uncertain, or did not happen at all. We believe a critical
step in achieving natural language understanding is to design meaning
representations for text that have the necessary meaning “ingredients” that
help us achieve these capabilities. Such meaning representations can also
potentially be used to evaluate the compositional generalization capacity
of deep learning models.
There has been a growing body of research devoted to the design,
annotation, and parsing of meaning representations in recent years. The
meaning representations that have been used for semantic parsing research
are developed with different linguistic perspectives and practical goals in
mind and have different formal properties. Formal meaning representation
frameworks such as Minimal Recursion Semantics (MRS) and Discourse
Representation Theory (as exemplified in the Parallel Meaning Bank) are
developed with the goal of supporting logical inference in reasoning-based
AI systems and are therefore easily translatable into first-order logic,
requiring proper representation of semantic components such as
quantification, negation, tense, and modality. Other meaning representation
frameworks such as Abstract Meaning Representation (AMR), Tecto-grammatical
Representation (TR) in Prague Dependency Treebanks and the Universal
Conceptual Cognitive Annotation (UCCA), put more emphasis on the
representation of core predicate-argument structure, lexical semantic
information such as semantic roles and word senses, or named entities and
relations. There is also a more recent effort in developing a Uniform
Meaning Representation (UMR) that is based on AMR but extends it to
cross-linguistic settings and enhances it to represent document-level
semantic content. The automatic parsing of natural language text into these
meaning representations and the generation of natural language text from
these meaning representations are also very active areas of research, and a
wide range of technical approaches and learning methods have been applied
to these problems.
This workshop will bring together researchers who are producers and
consumers of meaning representations, and through their interaction develop
a deeper understanding of the key elements of meaning representations that
are the most valuable to the NLP community. The workshop will also provide
an opportunity for meaning representation researchers to critically examine
existing frameworks with the goal of using their findings to inform the
design of next-generation meaning representations. A third goal of the
workshop is to explore opportunities and identify challenges in the design
and use of meaning representations in multilingual settings. A final goal
of the workshop is to understand the relationship between distributed
meaning representations trained on large data sets using network models,
and the symbolic meaning representations that are carefully designed and
annotated by NLP researchers and gain a deeper understanding of areas where
each type of meaning representation is the most effective.
The workshop will solicit papers that address one or more of the following
topics:
— Design and annotation of meaning representations;
— Cross-framework comparison of meaning representations;
— Challenges and techniques in automatic parsing of meaning representations;
— Challenges and techniques in automatically generating text from meaning
representations;
— Meaning representation evaluation metrics;
— Lexical resources, ontologies, and grounding in relation to meaning
representations;
— Real-world applications of meaning representations;
— Issues in applying meaning representations to multilingual settings and
lower-resourced languages;
— The relationship between symbolic meaning representations and distributed
semantic representations;
— Formal properties of meaning representations;
— Any other topics that address the design, processing, and use of meaning
representations.
Important dates:
All deadlines are 11:59PM UTC-12:00 (“anywhere on Earth”).
Paper due: April 3, 2023
Notification of acceptance: May 1, 2023
Camera-ready deadline: June 1, 2023
Workshop date: June 20, 2023
Submission instructions:
Submission site: https://softconf.com/iwcs2023/dmr2023
Submissions should report original and unpublished research on topics of
interest to the workshop. Accepted papers are expected to be presented at
the workshop and will be published in the workshop proceedings on the ACL
Anthology. They should emphasize obtained results rather than intended
work, and should indicate clearly the state of completion of the reported
results. A paper accepted for presentation at the workshop must not be or
have been presented at any other meeting with publicly available
proceedings.
Submission is electronic, using the Softconf START conference management
system. Here is the link <https://softconf.com/iwcs2023/dmr2023> to the DMR
submission site. Submissions must adhere to the two-column format of ACL
venues: please use our specific style-files
<https://iwcs2021.github.io/download/iwcs2021-templates.zip> or the Overleaf
template
<https://www.overleaf.com/latex/templates/instructions-for-iwcs-2021-proceed…>,
taken from ACL 2021. Similar to ACL 2021, initial submissions should be
fully anonymous to ensure double-blind reviewing. Long papers must not
exceed eight (8) pages of content. Short papers and demonstration papers
must not exceed four (4) pages of content. If a paper is accepted, it will
be given an additional page to address reviewers’ comments in the final
version of the paper. References and appendices do not count against these
limits.
Reviewing of papers will be double-blind. Therefore, the paper must not
include the authors' names and affiliations or self-references that reveal
the author's identity--e.g., "We previously showed (Smith, 1991) ..."
should be replaced with citations such as "Smith (1991) previously showed
...". Papers that do not conform to these requirements will be rejected
without review.
Authors of papers that have been or will be submitted to other meetings or
publications must provide this information to the workshop organizers (
dmr2023-chairs(a)googlegroups.com). Authors of accepted papers must notify
the program chairs within 10 days of acceptance if the paper is withdrawn
for any reason.
DMR 2023 does not have an anonymity period. However, we ask you to be
reasonable and not publicly advertise your preprint during (or right
before) review.
More information will be posted to the DMR 2023 website:
http://iwcs2023.loria.fr/dmr-2023-the-fourth-international-workshop-on-desi…
Dear Corpora members,
Is anybody aware of any corpora focussing on the topic of sustainability?
Possibly with different genres of texts, but not necessarily.
Thank you!
Laura Narisano
We are looking for a PhD student to work with Jindřich Libovický on a 4-year
project focusing on a better understanding of multilingual language
representations, improving their language neutrality and language coverage.
The prospective PhD thesis topic is: Study if and how cultural aspects
of meaning (e.g., moral values) are encoded in the representations and
methods to make multilingual more culture-aware. However, the exact topic
is negotiable.
The Institute of Formal and Applied Linguistics (ÚFAL) has an active
community working on cutting-edge natural language processing. There
will be significant interaction with the other researchers at the
institute, especially the groups of Ondřej Bojar, Daniel Zeman, Pavel
Pecina, Zdeněk Žabokrtský, and Jan Hajič. Charles University is the
top-ranking university in Czechia, attracting strong talents locally and
internationally.
If you are interested in applying, please send me:
* A cover letter (clearly indicate which position you are applying for
and include possible starting dates);
* A CV;
* A short research plan addressing past, current, and future interests;
* A brief statement of teaching experience and interest;
* The names and addresses of two references.
The working language is English, and no knowledge of Czech is required.
Candidates with a strong background in natural language processing and
machine learning are preferred, but other backgrounds are also welcome.
Please submit your applications by March 15, 2023, via email to
<my_surname> at ufal dot mff dot cuni dot cz. The expected starting date
is October 1, 2023.
Thanks,
Jindřich Libovický
------
Charles Univeristy, Institute of Formal and Applied Linguistics
Malostranské náměstí 25
118 00 Praha
Czech Republic
Email: <my_surname> at ufal dot mff dot cuni dot cz
Web: https://ufal.mff.cuni.cz/jindrich-libovicky
Event Notification Type: Call for Participation
Website: <a href="https://pan.webis.de/clef23/pan23-web/author-profiling.html">https://pan.webis.de/clef23/pan23-web/author-profiling.htmlt</a>
Software submission deadline: May 29, 2023
Submission Deadline: June 05, 2023
We kindly invite you to participate at the Pan-2023 Task 2 - “Profiling Cryptocurrency Influencers with Few-Shot Learning”.
This task is being held as part of CLEF 2023, and all participating teams will be able to publish their system description paper at the CLEF proceedings.
This shared task focuses on the author profiling of cryptocurrency influencers in social media from a low-resource perspective, that is, with little training data. Moreover, we propose to profile types of influencers also using a low-resource setting.
Specifically, we focus on English Twitter posts for three different sub-tasks:
Low-resource influencer profiling (subtask-1): profile authors according to their degree of influence (null, nano, micro, macro, mega).
Low-resource influencer interest profiling (subtask-2): profile authors according to their main interests or areas of influence (technical information, price update, trading matters, gaming, other).
Low-resource influencer intent profiling (subtask-3): profile authors according to the intent of their messages (subjective opinion, financial information, advertising, announcement).
Important Links
Task Website
Dataset site
Slack workspace
Important Dates
February 20, 2023: Training data ready
May 10, 2023: Early bird software submission phase (optional)
May 29, 2023: Software submission deadline
June 05, 2023: Participant paper submission
September 18-21, 2023: Conference
Task organizers
Francisco Rangel (Symanto)
Mara Chinea-Rios (Symanto) Contact Email: mara.chinea(a)symanto.com
Marc Franco-Salvador (Symanto)
Paolo Rosso (Universidad Politécnica de Valencia)
Please reach out to the organizers at crypto-influencers-pan-organizers(a)googlegroups.com, or join the Slack workspace (https://pan2023profil-0q48349.slack.com) to connect with the other participants and organizers.
Third call for papers
Fourth workshop on Resources for African Indigenous Language (RAIL)
https://bit.ly/rail2023
Note: deadline extension and submission system information
The 4th RAIL (Resources for African Indigenous* Languages) workshop
will be co-located with EACL 2023 in Dubrovnik, Croatia. The Resources
for African Indigenous Languages (RAIL) workshop is an
interdisciplinary platform for researchers working on resources (data
collections, tools, etc.) specifically targeted towards African
indigenous languages. In particular, it aims to create the conditions
for the emergence of a scientific community of practice that focuses on
data, as well as computational linguistic tools specifically designed
for or applied to indigenous languages found in Africa.
Previous workshops showed that the presented problems (and solutions)
are not only applicable to African languages. Many issues are also
relevant to other low-resource languages, such as different scripts and
properties like tone. As such, these languages share similar
challenges. This allows for researchers working on these languages with
such properties (including non-African languages) to learn from each
other, especially on issues pertaining to language resource
development.
The RAIL workshop has several aims. First, it brings together
researchers working on African indigenous languages, forming a
community of practice for people working on indigenous languages.
Second, the workshop aims to reveal currently unknown or unpublished
existing resources (corpora, NLP tools, and applications), resulting in
a better overview of the current state-of-the-art, and also allows for
discussions on novel, desired resources for future research in this
area. Third, it enhances sharing of knowledge on the development of
low-resource languages. Finally, it enables discussions on how to
improve the quality as well as availability of the resources.
The workshop has “Impact of impairments on language resources” as its
theme, but submissions on any topic related to properties of African
indigenous languages (including non-African languages) may be accepted.
Suggested topics include (but are not limited to) the following:
Digital representations of linguistic structures
Descriptions of corpora or other data sets of African indigenous
languages
Building resources for (under resourced) African indigenous languages
Developing and using African indigenous languages in the digital age
Effectiveness of digital technologies for the development of African
indigenous languages
Revealing unknown or unpublished existing resources for African
indigenous languages
Developing desired resources for African indigenous languages
Improving quality, availability and accessibility of African indigenous
language resources
*: The term indigenous languages used in the RAIL workshop is intended
to refer to non-colonial languages (in this case those used in Africa).
In no way is this term used to cause any harm or discomfort to anyone.
Many of these languages were or are still marginalised, and the aim of
the workshop is to bring attention to the creation, curation, and
development of resources for these languages in Africa.
Submission requirements:
We invite papers on original, unpublished work related to the topics of
the workshop. Submissions, presenting completed work, may consist of up
to eight (8) pages of content plus additional pages of references. The
final camera-ready version of accepted long papers are allowed one
additional page of content (so up to 9 pages) so that reviewers’
feedback can be incorporated.
Submissions need to use the EACL stylesheets. These can be found at
https://2023.eacl.org/calls/styles. Submission is electronic in PDF
through the START system which can be found at
https://softconf.com/eacl2023/RAIL2023. Reviewing is double-blind, so
make sure to anonymize your submission (e.g., do not provide author
names, affiliations, project names, etc.) Limit the amount of self
citations (anonymized citations should not be used). Accepted papers
will be published in the ACL workshop proceedings.
Please make sure you also go through the responsible NLP checklist
(https://aclrollingreview.org/responsibleNLPresearch/). Also,
submissions should have a section titled “Limitations” (as described in
the stylesheets). Authors are also encouraged to include an explicit
ethics statement.
Important dates:
Submission deadline 20 February 2023
Date of notification 13 March 2023
Camera ready deadline 27 March 2023
RAIL workshop 5 or 6 May 2023
Organising Committee
Rooweither Mabuya, South African Centre for Digital Language Resources
(SADiLaR), South Africa
Don Mthobela, Cam Foundation
Mmasibidi Setaka, South African Centre for Digital Language Resources
(SADiLaR), South Africa
Menno van Zaanen, South African Centre for Digital Language Resources
(SADiLaR), South Africa
--
Prof Menno van Zaanen menno.vanzaanen(a)nwu.ac.za
Professor in Digital Humanities
South African Centre for Digital Language Resources
https://www.sadilar.org
________________________________
NWU PRIVACY STATEMENT:
http://www.nwu.ac.za/it/gov-man/disclaimer.html
DISCLAIMER: This e-mail message and attachments thereto are intended solely for the recipient(s) and may contain confidential and privileged information. Any unauthorised review, use, disclosure, or distribution is prohibited. If you have received the e-mail by mistake, please contact the sender or reply e-mail and delete the e-mail and its attachments (where appropriate) from your system.
________________________________
Call for Papers
*Combating Health Misinformation for Social Wellbeing*Workshop @ ICWSM 2023,
the 17th International Conference on Web and Social MediaJune 5th – 8th
2023, Limassol, Cyprus
https://truehealth.disco.unimib.it/
------------------------------
*Scope and topics*
In recent years, *people have increasingly referred to the Web and social
media as sources of information about health-related problems and solutions*,
as confirmed by the U.S. Pew Research Center, and other European and
international studies. Although, on the one hand, these platforms favor
easier and more direct access to information sources by users without the
intermediation of experts, on the other hand, it is precisely such
democratization of health information that constitutes a potential danger
for people. As we have seen especially in the last period, linked to the
pandemic, the proliferation of false information, conspiracy theories, and
unreliable remedies risk compromising the health not only of individuals
but that of the community as a whole.
From this perspective, it becomes necessary to *study and propose
technological solutions* to help users come into contact with genuine
information, especially in a critical domain such as health, for social
well-being.
To this end, it is essential to promote *research of an interdisciplinary
nature*, involving computer scientists, physicians, lawyers, and
communication experts who can *address the problem of health
misinformation* from
different points of view by combining their expertise.
The* topics of interest* of the TrueHealth 2023 Workshop at ICWSM include,
but are not limited to:
- Assessing the genuineness of Online Health Information (OHI);
- Consumer Health Search (CHS) and genuine information access;
- Debunking health misinformation;
- Fake news/rumors and healthcare;
- Measures, evaluation methods, and datasets for health misinformation
detection;
- Health misinformation detection;
- Health literacy and information genuineness;
- Fact-checking in Online Health Information (OHI);
- Misinformation and public opinion on health;
- Relationship between access to non-genuine information and danger to
public health;
- Relationship between psychological characteristics and perceptions of
health misinformation;
- Techniques for accessing and retrieving genuine Online Health
Information (OHI).
------------------------------
*Submission Instructions*
Instructions can be found on the official ICWSM website:
https://www.icwsm.org/2023/index.html/call_for_submissions.html
------------------------------
*Important Dates*
- *Workshop Papers Submissions*: March 27, 2023
- *Workshop Paper Acceptance Notification*: April 10, 2023
- *Workshop Final Camera-Ready Paper Due*: May 6, 2023
- *ICWSM-2023 Workshops Day*: June 5, 2023
------------------------------
*Organizers*
- *Gabriella Pasi*
<https://ikr3.disco.unimib.it/people/gabriella-pasi/> (Full
Professor), University of Milano-Bicocca, Milan, Italy
- *Rishabh Upadhyay
<https://en.unimib.it/rishabh-gyanendra-upadhyay>* (Research
Fellow), University of Milano-Bicocca, Milan, Italy
- *Marco Viviani
<https://ikr3.disco.unimib.it/people/marco-viviani/>* (Associate
Professor), University of Milano-Bicocca, Milan, Italy
------------------------------
*Program Committee*
- *Sanda Harabagiu* <https://personal.utdallas.edu/~sanda/>, The
University of Texas at Dallas, USA
- *Liadh Kelly
<https://www.maynoothuniversity.ie/faculty-science-engineering/our-people/li…>*,
Maynooth University, Ireland
- *Dongwon Lee <https://ist.psu.edu/directory/dul13>*, Penn State: The
Pennsylvania State University, USA
- *Yelena Mejova <https://www.isi.it/en/people/yelena-mejova>,* ISI
Foundation, Italy
- *Michael Sirivianos* <https://netsysci.cut.ac.cy/michael.sirivianos/>,
Cyprus University of Technology, Cyprus
- *Angelo Spognardi <https://angelospognardi.site.uniroma1.it/>*,
Sapienza University of Rome, Italy
- *Bei Yu <https://ischool.syr.edu/bei-yu/>*, Syracuse University, USA
- *Arkaitz Zubiaga <http://www.zubiaga.org/>*, Queen Mary University of
London, UK
- Other members are going to be added in the coming days
---
Rishabh Upadhyay,
IKR3 Lab, UNIMIB,
Milan, Italy
The CLINKART and TESTLINK twin evaluation tasks
We invite you to participate in the CLINKART and TESTLINK twin tasks on the
extraction of relations from clinical cases.
Both tasks mainly consist in identifying test results and measurements and
linking them to the textual mentions of the laboratory tests and
measurements from which they were obtained.
CLINKART (https://e3c.fbk.eu/clinkart) is organised in the context of
Evalita 2023 and focuses on Italian. Training data is already available.
TESTLINK (https://e3c.fbk.eu/testlinkiberlef) is organised in the context
of IberLEF 2023 and proposes three different challenges: two tasks on
monolingual datasets in Spanish and Basque and a third one requiring a
multilingual approach. Training data due on February 27.
The datasets of both tasks are based on the European Clinical Case Corpus
(E3C), which consists of clinical cases in five European languages and is
freely available (CC-BY-NC-4.0).
Join us in one or many of the proposed challenges!
The CLINKART and TESTLINK organisation teams