BIONLP 2023 and Shared Tasks @ ACL 2023 https://aclweb.org/aclwiki/BioNLP_Workshop#SHARED_TASKS_2023
*Tentative* Important Dates (All submission deadlines are 11:59 p.m. UTC-12:00 “anywhere on Earth”) May 1, 2023: Workshop Paper Due Date June 15, 2023: Camera-ready papers due BioNLP 2023 Workshop at ACL, July 13 OR 14, 2023, Toronto, Canada
Please watch for the updates!
SUBMISSION INSTRUCTIONS ----------------------------------------- Two types of submissions are invited: full papers and short papers.
Full papers should not exceed eight (8) pages of text, plus unlimited references. These are intended to be reports of original research. BioNLP aims to be the forum for interesting, innovative, and promising work involving biomedicine and language technology, whether or not yielding high performance at the moment. This by no means precludes our interest in and preference for mature results, strong performance, and thorough evaluation. Both types of research and combinations thereof are encouraged.
Short papers may consist of up to four (4) pages of content, plus unlimited references. Appropriate short paper topics include preliminary results, application notes, descriptions of work in progress, etc.
Electronic Submission Submissions must be electronic and in PDF format, using the Softconf START conference management system Submissions need to be anonymous.
*The submission site will be announced shortly.*
Dual submission policy: papers may NOT be submitted to the BioNLP 2017 workshop if they are or will be concurrently submitted to another meeting or publication.
WORKSHOP OVERVIEW AND SCOPE --------------------------------------------------- The BioNLP workshop associated with the ACL SIGBIOMED special interest group has established itself as the primary venue for presenting foundational research in language processing for the biological and medical domains. The workshop is running every year since 2002 and continues getting stronger. BioNLP welcomes and encourages work on languages other than English, and inclusion and diversity. BioNLP truly encompasses the breadth of the domain and brings together researchers in bio- and clinical NLP from all over the world. The workshop will continue presenting work on a broad and interesting range of topics in NLP. The interest to biomedical language has broadened significantly due to the COVID-19 pandemic and continues to grow: as access to information becomes easier and more people generate and access health-related text, it becomes clearer that only language technologies can enable and support adequate use of the biomedical text.
BioNLP 2023 will be particularly interested in language processing that supports DEIA (Diversity, Equity, Inclusion and Accessibility). The work on detection and mitigation of bias and misinformation continues to be of interest. Research in languages other than English, particularly, under-represented languages, and health disparities are always of interest to BioNLP.
Other active areas of research include, but are not limited to:
Tangible results of biomedical language processing applications; Entity identification and normalization (linking) for a broad range of semantic categories; Extraction of complex relations and events; Discourse analysis; Anaphora/coreference resolution; Text mining / Literature based discovery; Summarization; Τext simplification; Question Answering; Resources and strategies for system testing and evaluation; Infrastructures and pre-trained language models for biomedical NLP (Processing and annotation platforms); Development of synthetic data & data augmentation; Translating NLP research into practice; Getting reproducible results.
SHARED TASKS 2023 ------------------------------------- Shared Tasks on Summarization of Clinical Notes and Scientific Articles
The first task focuses on Clinical Text.
Task 1A. Problem List Summarization Automatically summarizing patients’ main problems from the daily care notes in the electronic health record can help mitigate information and cognitive overload for clinicians and provide augmented intelligence via computerized diagnostic decision support at the bedside. The task of Problem List Summarization aims to generate a list of diagnoses and problems in a patient’s daily care plan using input from the provider’s progress notes during hospitalization.This task aims to promote NLP model development for downstream applications in diagnostic decision support systems that could improve efficiency and reduce diagnostic errors in hospitals. This task will contain 768 hospital daily progress notes and 2783 diagnoses in the training set, and a new set of 300 daily progress notes will be annotated by physicians as the test set. The annotation methods and annotation quality have previously been reported here. The goal of this shared task is to attract future research efforts in building NLP models for real-world decision support applications, where a system generating relevant and accurate diagnoses will assist the healthcare providers’ decision-making process and improve the quality of care for patients.
Task 1B. Radiology report summarization Radiology report summarization is a growing area of research. Given the Findings and/or Background sections of a radiology report, the goal is to generate a summary (called an Impression section) that highlights the key observations and conclusions of the radiology study.
The research area of radiology report summarization currently faces an important limitation: most research is carried out on chest X-rays. To palliate these limitations, we propose two datasets: A shared summarization task that includes six different modalities and anatomies, totalling 79,779 samples, based on the MIMIC-III database. A shared summarization task on chest x-ray radiology reports with images and a brand new out-of-domain test-set from Stanford.
SEE MORE at: https://vilmedic.app/misc/bionlp23/sharedtask
Task 2. Lay Summarization of Biomedical Research Articles Biomedical publications contain the latest research on prominent health-related topics, ranging from common illnesses to global pandemics. This can often result in their content being of interest to a wide variety of audiences including researchers, medical professionals, journalists, and even members of the public. However, the highly technical and specialist language used within such articles typically makes it difficult for non-expert audiences to understand their contents.
Abstractive summarization models can be used to generate a concise summary of an article, capturing its salient point using words and sentences that aren’t used in the original text. As such, these models have the potential to help broaden access to highly technical documents when trained to generate summaries that are more readable, containing more background information and less technical terminology (i.e., a “lay summary”).
This shared task surrounds the abstractive summarization of biomedical research articles, with an emphasis on controllability and catering to non-expert audiences. Through this task, we aim to help foster increased research interest in controllable summarization that helps broaden access to technical texts and progress toward more usable abstractive summarization models in the biomedical domain.
For more information, see:
Main site: https://biolaysumm.org/ CodaLab page - subtask 1: https://codalab.lisn.upsaclay.fr/competitions/9541 CodaLab page - subtask 2: https://codalab.lisn.upsaclay.fr/competitions/9544
*Workshop Organizers* Dina Demner-Fushman, US National Library of Medicine Kevin Bretonnel Cohen, University of Colorado School of Medicine Sophia Ananiadou, National Centre for Text Mining and University of Manchester, UK Jun-ichi Tsujii, National Institute of Advanced Industrial Science and Technology, Japan