First Call For Papers and Participation: BioNLP 2023 and BioNLP Shared Tasks 2023 (co-located with ACL 2023) - Corpora

10 Jan 2023


      BIONLP 2023 and Shared Tasks @ ACL 2023
https://aclweb.org/aclwiki/BioNLP_Workshop#SHARED_TASKS_2023
*Tentative* Important Dates
(All submission deadlines are 11:59 p.m. UTC-12:00 “anywhere on Earth”)
May 1, 2023: Workshop Paper Due Date
June 15, 2023: Camera-ready papers due
BioNLP 2023 Workshop at ACL, July 13 OR 14, 2023, Toronto, Canada
Please watch for the updates!
SUBMISSION INSTRUCTIONS
-----------------------------------------
Two types of submissions are invited: full papers and short papers.
Full papers should not exceed eight (8) pages of text, plus unlimited
references. These are intended to be reports of original research. BioNLP
aims to be the forum for interesting, innovative, and promising work
involving biomedicine and language technology, whether or not yielding high
performance at the moment. This by no means precludes our interest in and
preference for mature results, strong performance, and thorough
evaluation.  Both types of research and combinations thereof are
encouraged.
Short papers may consist of up to four (4) pages of content, plus unlimited
references. Appropriate short paper topics include preliminary results,
application notes, descriptions of work in progress, etc.
Electronic Submission
Submissions must be electronic and in PDF format, using the Softconf START
conference management system
Submissions need to be anonymous.
*The submission site will be announced shortly.*
Dual submission policy: papers may NOT be submitted to  the BioNLP 2017
workshop if they are or will be concurrently submitted to another meeting
or publication.
WORKSHOP OVERVIEW AND SCOPE
---------------------------------------------------
The BioNLP workshop associated with the ACL SIGBIOMED special interest
group has established itself as the primary venue for presenting
foundational research in language processing for the biological and medical
domains. The workshop is running every year since 2002 and continues
getting stronger. BioNLP welcomes and encourages work on languages other
than English, and inclusion and diversity. BioNLP truly encompasses the
breadth of the domain and brings together researchers in bio- and clinical
NLP from all over the world. The workshop will continue presenting work on
a broad and interesting range of topics in NLP. The interest to biomedical
language has broadened significantly due to the COVID-19 pandemic and
continues to grow: as access to information becomes easier and more people
generate and access health-related text, it becomes clearer that only
language technologies can enable and support adequate use of the biomedical
text.
BioNLP 2023 will be particularly interested in language processing that
supports DEIA (Diversity, Equity, Inclusion and Accessibility). The work on
detection and mitigation of bias and misinformation continues to be of
interest. Research in languages other than English, particularly,
under-represented languages, and health disparities are always of interest
to BioNLP.
Other active areas of research include, but are not limited to:
Tangible results of biomedical language processing applications;
Entity identification and normalization (linking) for a broad range of
semantic categories;
Extraction of complex relations and events;
Discourse analysis;
Anaphora/coreference resolution;
Text mining / Literature based discovery;
Summarization;
Τext simplification;
Question Answering;
Resources and strategies for system testing and evaluation;
Infrastructures and pre-trained language models for biomedical NLP
(Processing and annotation platforms);
Development of synthetic data & data augmentation;
Translating NLP research into practice;
Getting reproducible results.
SHARED TASKS 2023
-------------------------------------
Shared Tasks on Summarization of Clinical Notes and Scientific Articles
The first task focuses on Clinical Text.
Task 1A. Problem List Summarization
Automatically summarizing patients’ main problems from the daily care notes
in the electronic health record can help mitigate information and cognitive
overload for clinicians and provide augmented intelligence via computerized
diagnostic decision support at the bedside. The task of Problem List
Summarization aims to generate a list of diagnoses and problems in a
patient’s daily care plan using input from the provider’s progress notes
during hospitalization.This task aims to promote NLP model development for
downstream applications in diagnostic decision support systems that could
improve efficiency and reduce diagnostic errors in hospitals. This task
will contain 768 hospital daily progress notes and 2783 diagnoses in the
training set, and a new set of 300 daily progress notes will be annotated
by physicians as the test set. The annotation methods and annotation
quality have previously been reported here. The goal of this shared task is
to attract future research efforts in building NLP models for real-world
decision support applications, where a system generating relevant and
accurate diagnoses will assist the healthcare providers’ decision-making
process and improve the quality of care for patients.
Task 1B. Radiology report summarization
Radiology report summarization is a growing area of research. Given the
Findings and/or Background sections of a radiology report, the goal is to
generate a summary (called an Impression section) that highlights the key
observations and conclusions of the radiology study.
The research area of radiology report summarization currently faces an
important limitation: most research is carried out on chest X-rays. To
palliate these limitations, we propose two datasets: A shared summarization
task that includes six different modalities and anatomies, totalling 79,779
samples, based on the MIMIC-III database.
A shared summarization task on chest x-ray radiology reports with images
and a brand new out-of-domain test-set from Stanford.
SEE MORE at: https://vilmedic.app/misc/bionlp23/sharedtask
Task 2. Lay Summarization of Biomedical Research Articles
Biomedical publications contain the latest research on prominent
health-related topics, ranging from common illnesses to global pandemics.
This can often result in their content being of interest to a wide variety
of audiences including researchers, medical professionals, journalists, and
even members of the public. However, the highly technical and specialist
language used within such articles typically makes it difficult for
non-expert audiences to understand their contents.
Abstractive summarization models can be used to generate a concise summary
of an article, capturing its salient point using words and sentences that
aren’t used in the original text. As such, these models have the potential
to help broaden access to highly technical documents when trained to
generate summaries that are more readable, containing more background
information and less technical terminology (i.e., a “lay summary”).
This shared task surrounds the abstractive summarization of biomedical
research articles, with an emphasis on controllability and catering to
non-expert audiences. Through this task, we aim to help foster increased
research interest in controllable summarization that helps broaden access
to technical texts and progress toward more usable abstractive
summarization models in the biomedical domain.
For more information, see:
Main site: https://biolaysumm.org/
CodaLab page - subtask 1: https://codalab.lisn.upsaclay.fr/competitions/9541
CodaLab page - subtask 2: https://codalab.lisn.upsaclay.fr/competitions/9544
*Workshop Organizers*
  Dina Demner-Fushman, US National Library of Medicine
  Kevin Bretonnel Cohen, University of Colorado School of Medicine
  Sophia Ananiadou, National Centre for Text Mining and University of
Manchester, UK
  Jun-ichi Tsujii, National Institute of Advanced Industrial Science and
Technology, Japan