The Second Workshop on Corpus Generation and Corpus Augmentation for
Machine Translation (CoCo4MT) @MT-SUMMIT XIX
The 19th Machine Translation Summit
Sep 4-8, 2023, Macau SAR, China
https://sites.google.com/view/coco4mt
SCOPE
It is a well-known fact that machine translation systems, especially
those that use deep learning, require massive amounts of data. Several
resources for languages are not available in their human-created format.
Some of the types of resources available are monolingual, multilingual,
translation memories, and lexicons. Those types of resources are
generally created for formal purposes such as parliamentary collections
when parallel and more informal situations when monolingual. The quality
and abundance of resources including corpora used for formal reasons is
generally higher than those used for informal purposes. Additionally,
corpora for low-resource languages, languages with less digital
resources available, tends to be less abundant and of lower quality.
CoCo4MT is a workshop centered around research that focuses on manual
and automatic corpus creation, cleansing, and augmentation techniques
specifically for machine translation. We accept work that covers any
language (including sign language) but we are specifically interested in
those submissions that explicitly report on work with languages with
limited existing resources (low-resource languages). Since techniques
from high-resource languages are generally statistical in nature and
could be used as generic solutions for any language, we welcome
submissions on high-resource languages also.
CoCo4MT aims to encourage research on new and undiscovered techniques.
We hope that the methods presented at this workshop will lead to the
development of high-quality corpora that will in turn lead to
high-performing MT systems and new dataset creation for multiple
corpora. We hope that submissions will provide high-quality corpora that
are available publicly for download and can be used to increase machine
translation performance thus encouraging new dataset creation for
multiple languages that will, in turn, provide a general workshop to
consult for corpora needs in the future. The workshop’s success will be
measured by the following key performance indicators:
- Promotes the ongoing increase in quality of machine translation
systems when measured by standard measurements,
- Provides a meeting place for collaboration from several research areas
to increase the availability of commonly used corpora and new corpora,
- Drives innovation to address the need for higher quality and abundance
of low-resource language data.
Topics of interest include:
- Difficulties with using existing corpora (e.g., political
considerations or domain limitations) and their effects on final MT
systems,
- Strategies for collecting new MT datasets (e.g., via crowdsourcing),
- Data augmentation techniques,
- Data cleansing and denoising techniques,
- Quality control strategies for MT data,
- Exploration of datasets for pretraining or auxiliary tasks for
training MT systems.
SHARED TASK
To encourage research on corpus construction for low-resource machine
translation, we introduce a shared task focused on identifying
high-quality instances that should be translated into a target
low-resource language. Participants are provided access to multi-way
corpora in the high-resource languages of English, Spanish, German,
Korean, and Indonesian, and using these, are required to identify
beneficial instances, that when translated into the low-resource
languages of Cebuano, Gujarati, and Burmese, lead to high-performing MT
systems. More details on data, evaluation and submission can be found on
the website (https://sites.google.com/view/coco4mt/shared-task) or by
emailing coco4mt-shared-task(a)googlegroups.com.
SUBMISSION INFORMATION
CoCo4MT will accept research, review, or position papers. The length of
each paper should be at least four (4) and not exceed ten (10) pages,
plus unlimited pages for references. Submissions should be formatted
according to the official MT Summit 2023 style templates
(https://www.overleaf.com/latex/templates/mt-summit-2023-template/knrrcnxhkq…).
Accepted papers will be published in the MT Summit 2023 proceedings
which are included in the ACL Anthology and will be presented at the
conference either orally or as a poster.
Submissions must be anonymized and should be made to the workshop using
the Softconf conference management system
(https://softconf.com/mtsummit2023/CoCo4MT). Scientific papers that have
been or will be submitted to other venues must be declared as such, and
must be withdrawn from the other venues if accepted and published at
CoCo4MT. The review will be double-blind.
We would like to encourage authors to cite papers written in ANY
language that are related to the topics, as long as both original
bibliographic items and their corresponding English translations are
provided.
Registration will be handled by the main conference. (To be announced)
IMPORTANT DATES
May 18, 2023 - Call for papers released
May 19, 2023 - Shared task release of train, dev and test data
May 25, 2023 - Shared task release of baselines
June 5, 2023 - Second call for papers
June 20, 2023 - Third and final call for papers
July 05, 2023 - Paper submissions due
July 05, 2023 - Shared task deadline to submit results
July 20, 2023 - Notification of acceptance
July 20, 2023 - Shared task system description papers due
July 31, 2023 - Camera-ready due
September 4-5, 2023 - CoCo4MT workshop
CONTACT
CoCo4MT Workshop Organizers:
coco4mt-2023-organizers(a)googlegroups.com
CoCo4MT Shared Task Organizers:
coco4mt-shared-task(a)googlegroups.com
ORGANIZING COMMITTEE (listed alphabetically)
Ananya Ganesh University of Colorado Boulder
Constantine Lignos Brandeis University
John E. Ortega Northeastern University
Jonne Sälevä Brandeis University
Katharina Kann University of Colorado Boulder
Marine Carpuat University of Maryland
Rodolfo Zevallos Universitat Pompeu Fabra
Shabnam Tafreshi University of Maryland
William Chen Carnegie Mellon University
PROGRAM COMMITTEE (listed alphabetically tentative)
Abteen Ebrahimi University of Colorado Boulder
Adelani David Saarland University
Ananya Ganesh University of Colorado Boulder
Alberto Poncelas ADAPT Centre at Dublin City University
Anna Currey Amazon
Amirhossein Tebbifakhr University of Trento
Atul Kr. Ojha National University of Ireland Galway
Ayush Singh Northeastern University
Barrow Haddow University of Edinburgh
Bharathi Raja Chakravarthi National University of Ireland Galway
Beatrice Savoldi University of Trento
Bogdan Babych Heidelberg University
Briakou Eleftheria University of Maryland
Constantine Lignos Brandeis University
Dossou Bonaventure Mila Quebec AI Institute
Duygu Ataman New York University
Eleftheria Briakou University of Maryland
Eleni Metheniti Université Toulosse - Paul Sabatier
Jasper Kyle Catapang University of Birmingham
John E. Ortega Northeastern University
Jonne Sälevä Brandeis University
Kalika Bali Microsoft
Katharina Kann University of Colorado Boulder
Kochiro Watanabe The University of Tokyo
Koel Dutta Chowdhury Saarland University
Liangyou Li Huawei
Manuel Mager University of Stuttgart
Maria Art Antonette Clariño University of the Philippines Los Baños
Marine Carpuat University of Maryland
Mathias Müller University of Zurich
Nathaniel Oco De La Salle University
Niu Xing Amazon
Patrick Simianer Lilt
Rico Sennrich University of Zurich
Rodolfo Zevallos Universitat Pompeu Fabra
Sangjee Dondrub Qinghai Normal University
Santanu Pal Saarland University
Sardana Ivanova University of Helsinki
Shantipriya Parida Silo AI
Shiran Dudy Northeastern University
Surafel Melaku Lakew Amazon
Tommi A Pirinen University of Tromsø
Valentin Malykh Moscow Institute of Physics and Technology
Xing Niu Amazon
Xu Weijia University of Maryland
Dear Corpora list members:
The Language and Voice Lab (LVL; https://lvl.ru.is/) at Reykjavik University, has developed GameQA -- a gamified mobile app platform for building multiple-domain question-answering (QA) datasets. GameQA has been used to build an Icelandic QA dataset, consisting of about 20,700 peer-reviewed questions of which about 12,700 were answered and reviewed.
The technology behind GameQA, as well as the data collection process, is described in the following EACL 2023 paper: https://aclanthology.org/2023.eacl-demo.18/
The source code for GameQA is open and free and LVL has compiled instructions on how to localize it for other languages (https://gameqa.app/).
If you are interested in using GameQA for gathering a QA dataset for a particular language, feel free to contact us at gameqa(a)ru.is<mailto:gameqa@ru.is> -- we would be excited to assist you!
Regards,
Hrafn Loftsson, Ph.D., www.ru.is/~hrafn<http://www.ru.is/~hrafn>
Dósent | Associate Professor
Tölvunarfræðideild | Department of Computer Science
Tæknisvið | School of Technology
Háskólinn í Reykjavík | Reykjavik University
Apologies for cross-postings
The Norwegian School of Economics (NHH), University of Bergen and Western Norway University of Applied Sciences welcome PhD candidates to register for the course The Multilingual Workplace and Multilingual Society offered at NHH in Bergen, Norway on 4-6 October 2023.
For information about registration etc. see https://www.nhh.no/en/calendar/professional-and-intercultural-communication…
Kind regards
Gisle Andersen, NHH
**Deadline extended to June 30th**
Call for Papers
The 1st Workshop on Counter Speech for Online Abuse:
A workshop for creating, investigating and improving tools for producing and evaluating counter speech.
Hate speech and abusive and toxic language are prevalent in online spaces. For example, a 2019 survey shows that in the UK 30-40% of people have experienced online abuse, and platforms like Facebook bring down millions of harmful posts every year, with the help of AI tools. While removal of such content can immediately reduce the quantity of harmful messages, it can bring about accusations of censorship and may not be effective at curbing hate in the long term. An alternative approach is to reply with counter speech, i.e. targeted responses aimed at refuting the hateful language using thoughtful and cogent reasons, and fact-bound arguments. This has been shown to be effective in influencing the behaviour of both the perpetrators of abuse and bystanders that witness the interactions, as well as providing support to victims.
The sheer amount of social media data shared online on a daily basis means that hate mitigation, using counter speech, requires reliable, efficient and scalable tools. Recently, efforts have been made to curate hate countering datasets and automate the production of counter speech. However, this research field is still in its infancy, and many questions remain open regarding the most effective approaches and methods to take, as well as how to evaluate them.
This first multidisciplinary workshop aims to bring together researchers from diverse backgrounds such as computer science and the social sciences, as well as policy makers and other stakeholders to attempt to understand how counter speech is currently used to tackle abuse by individuals, activists and organisations, how Natural Language Processing (NLP) and Generation (NLG) can be applied to produce counter narratives, and the implications of using large language models for this task. It will also address, but not be limited to, the questions of how to evaluate and measure the impacts of counter speech, the importance of expert knowledge from civil society in the development of counter speech datasets and taxonomies, and how to ensure fairness and mitigate the biases present in language models when generating counter speech.
Topics
We invite papers (long and short) on a wide range of topics, including but not limited to:
• Models and methods for generating counter speech;
• Dialogue agents employing counter speech to address hateful inputs, directed towards other people or the AI itself;
• Human and automatic evaluation methods of counter speech tools;
• Multidisciplinary studies including different perspectives on the topic such as from computer science, social science, NGOs and stakeholders;
• Development of datasets and taxonomy for counter speech;
• Potentials and limitations (e.g., fairness, biases) of using large language models for generating counter speech;
• Social impact and empirical studies of counter speech on social media, including investigating the effectiveness and consequences on users of employing counter speech to fight online hate;
• Proposals for future research on counter speech, and/or preliminary results of studies in this field
We accept three types of submissions:
* Regular research papers – long (8 pages) or short (4 pages);
* Non-archival submissions: like research papers, but will not be included in the proceedings;
* Research communications: 2-4 page abstracts summarising relevant research published elsewhere.
Submission link: https://softconf.com/n/cs4oa2023
Location: co-located with SIGdialxINLG, Prague, Czechia
Important dates
All deadlines are Anywhere on Earth (UTC-12)
* Submission deadline: Jun 26, 2023 30 Jun 2023
* Notification of acceptance Jul 17, 2023 19 Jul 2023
* Camera-ready deadline Aug 11, 2023
* Workshop date: September 11 2023
Format and Styling
Submissions should follow ACL Author Guidelines<https://www.aclweb.org/adminwiki/index.php?title=ACL_Author_Guidelines> and policies for submission, review and citation, and be anonymised for double blind reviewing. Please use ACL 2023 style files; LaTeX style files and Microsoft Word templates are available at https://2023.aclweb.org/calls/style_and_formatting/<https://2021.aclweb.org/downloads/acl-ijcnlp2021-templates.zip>.
Organising Committee:
* Yi-Ling Chung, The Alan Turing Institute
* Gavin Abercrombie, Heriot-Watt University
* Helena Bonaldi, Fondazione Bruno Kessler
* Marco Guerini, Fondazione Bruno Kessler
Contact
If you have any questions, please let us know at cs4oa(a)googlegroups.com
Website: https://sites.google.com/view/cs4oa
Twitter: @cs4oa_workshop<https://twitter.com/cs4oa_workshop>
________________________________
Founded in 1821, Heriot-Watt is a leader in ideas and solutions. With campuses and students across the entire globe we span the world, delivering innovation and educational excellence in business, engineering, design and the physical, social and life sciences. This email is generated from the Heriot-Watt University Group, which includes:
1. Heriot-Watt University, a Scottish charity registered under number SC000278
2. Heriot- Watt Services Limited (Oriam), Scotland's national performance centre for sport. Heriot-Watt Services Limited is a private limited company registered is Scotland with registered number SC271030 and registered office at Research & Enterprise Services Heriot-Watt University, Riccarton, Edinburgh, EH14 4AS.
The contents (including any attachments) are confidential. If you are not the intended recipient of this e-mail, any disclosure, copying, distribution or use of its contents is strictly prohibited, and you should please notify the sender immediately and then delete it (including any attachments) from your system.
*** Last Mile for Paper Submission ***
10th European Conference On Service-Oriented And Cloud Computing (ESOCC 2023)
October 24-26, 2023, Golden Bay Beach Hotel, Larnaca, Cyprus
https://cyprusconferences.org/esocc2023/
(Proceedings to be published in Springer LNCS;
Journal Special Issue with Springer Computing)
Submission Deadline: Abstracts by June 25, 2023; Full Papers by July 2, 2023
AIM AND SCOPE
Nowadays, Service-Oriented and Cloud Computing are the primary approaches to build
large-scale distributed systems and deliver software services to end users. Cloud-native
software is pervading the delivery of enterprise applications, as they are composed of
(micro)services that can be independently developed and deployed by exploiting multiple
heterogeneous technologies. Resulting applications are polyglot service compositions that can
then be shipped in serverful or serverless platforms (e.g., using virtualization technologies).
These characteristics make Service-Oriented and Cloud Computing the natural answers for
fulfilling the industry’s need for flexibly scalable and maintainable enterprise applications, to
be delivered through state-of-the-art methodologies, like DevOps. To further support this,
researchers and practitioners need to create methods, tools and techniques to support
cost-effective and secure development as well as use of dependable devices, platforms,
services and service-oriented applications in the Cloud, now also considering the Cloud-IoT
computing continuum to exploit widespread adoption of smart connected things and the
increasing growth of their computing capabilities.
The European Conference on Service-Oriented and Cloud Computing (ESOCC) is the premier
conference on advances in the state-of-the-art and practice of Service-Oriented Computing
and Cloud Computing in Europe. ESOCC aims to facilitate the exchange between researchers
and practitioners in the areas of Service-Oriented Computing and Cloud Computing, as well as
to explore the new trends in those areas and foster future collaborations in Europe and beyond.
TOPICS OF INTEREST
ESOCC 2023 seeks original, high-quality contributions related to all aspects of Service-Oriented
and Cloud computing. Specific topics of interest include, but are not limited to:
• Applications for Service-Oriented and Cloud Computing, e.g., big data, commerce, energy,
finance, health, scientific computing, smart cities
• Blockchains for Service-Oriented and Cloud Computing • Business aspects of Service-Oriented and Cloud Computing, e.g., business models,
brokerage, marketplaces, costs, pricing
• Business processes, e.g., service-based workflow deployment and management
• Cloud interoperability, service and Cloud standards
• Cloud-IoT computing continuum, e.g., edge computing, fog computing, mobility computing,
next generation services/IoT
• Cloud-native architectures and paradigms, e.g., microservices and DevOps
• Cloud service models, e.g., IaaS, PaaS, SaaS, DBaaS, FaaS, etc.
• Deployment, composition, and management of applications in Service-Oriented and Cloud
Computing
• Foundations and formal methods for Service-Oriented and Cloud Computing
• Enablers for Service-Oriented and Cloud Computing, e.g., service discovery, orchestration,
matchmaking, monitoring, and analytics
• Model-Driven Engineering for Service-Oriented and Cloud Computing
• Multi-Cloud, cross-Cloud, and federated Cloud solutions
• Requirements engineering, design, development, and testing of applications in
Service-Oriented and Cloud Computing
• Semantic services and service mining
• Service and Cloud middlewares and platforms
• Software/service adaptation and evolution in Service-Oriented and Cloud Computing
• Storage, computation and network Clouds
• Sustainability and energy issues in Service-Oriented and Cloud Computing
• Quality aspects (e.g., governance, privacy, security, and trust) of Service-Oriented and Cloud
Computing
• Quality of Service (QoS) and Service-Level Agreement (SLA) for Service-Oriented and Cloud
Computing
• Social aspects of Service-Oriented and Cloud Computing, e.g., crowdsourcing services, social
and crowd-based Clouds
• Virtualization for Service-Oriented and Cloud Computing, e.g., serverless, container-based
virtualization, VMs
IMPORTANT DATES
• Submission of abstracts: June 25th, 2023 (AoE)
• Submission of full papers: July 2nd, 2023 (AoE)
• Notification to authors: August 4th, 2023 (AoE)
• Camera-ready versions due: August 21st, 2023 (AoE)
• Author registration due: August 21st, 2023 (AoE)
TYPES OF SUBMISSIONS
ESOCC 2023 invites submissions of the following kinds:
• Regular Research Papers (15 pages including references)
• PhD Symposium (12 pages including references)
• Projects and Industry Reports (Projects and Industry Reports (1 to 6 pages including
references, describing an ongoing EU or national project, or providing industrial perspectives
on innovative applications, technologies, or methods in ESOCC’s scope)
We only accept original papers, not submitted for publication elsewhere. The papers must be
formatted according to the proceedings guidelines of Springer’s Lecture Notes in Computer
Science (LNCS) series (http://www.springer.com/lncs).
They must be submitted to the EasyChair site at: https://easychair.org/conferences/?conf=esocc2023 by selecting the right track.
Accepted papers from all tracks will be published in the main conference proceedings by
Springer in the LNCS series. For publication to happen, at least one author of each accepted
paper is expected to register and present the work at the conference.
The best papers accepted will be invited to submit extended versions for a Journal Special
Issue to be published by Springer Computing.
ORGANISATION
General Chair
• George A. Papadopoulos, University of Cyprus, CY
(george at ucy.ac.cy)
Program Chairs
• Florian Rademacher, University of Applied Sciences and Arts Dortmund, DE
(florian.rademacher at fh-dortmund.de)
• Jacopo Soldani, University of Pisa, IT
(jacopo.soldani at unipi.it)
Steering and Program Committee
https://cyprusconferences.org/esocc2023/committees/
An opportunity to join the Assessment Research Group at the British Council as Emerging Researcher: Data Analyst. Full details and link to application here: https://careers.britishcouncil.org/job/Emerging-Researcher-Data-Analyst/946…
For any enquiries, please contact Dr Karen Dunn at Karen.Dunn(a)britishcouncil.org<mailto:Karen.Dunn@britishcouncil.org>. Application deadline is 2nd July.
The British Council is the United Kingdom's international organisation for cultural relations and educational opportunities. A registered charity: 209131 (England and Wales) SC037733 (Scotland). This message is for the use of the intended recipient(s) only and may contain confidential information. If you have received this message in error, please notify the sender and delete it. The British Council accepts no liability for loss or damage caused by viruses and other malware and you are advised to carry out a virus and malware check on any attachments contained in this message.
(apologies for cross-postings)
Dear Corpora list members,
We are delighted to invite you to an upcoming CLARIN Cafe event "A New Clarin Resource Family for Lexical Semantic Change Research". This event is organised by Barbara McGillivray (King’s College London), Fahad Khan (CNR-ILC), and Paola Marongiu (Université de Neuchâtel), and will be held online on July 5 at 14:00-16:00 CEST. The event is supported by CLARIN through CLARIN Resource Families Project Funding.
During the event we will present our work for the creation of a new CLARIN Resource Family for lexical semantic change research. We are also thrilled to have two distinguished speakers who will share their expertise and insights on the topic of lexical semantic change research: Florentina Armaselu (University of Luxembourg) and Sabine Tittel (University of Heidelberg). A roundtable discussion at the end of the event aims to foster collaboration and knowledge exchange between the speakers and online participants, and ultimately to inform the future sustainability of our proposal for a CLARIN resource that can adapt to various types of research in this area.
Please note that registration is required to receive the details to join this CLARIN Cafe. To register for this event and obtain additional information about the programme, speakers and any relevant updates, please visit the following link: https://www.clarin.eu/event/2023/clarin-cafe-new-clarin-resource-family-lex…
Thank you for your attention, and we hope to see you at the CLARIN Cafe on July 5!
Best regards,
Barbara McGillivray, Fahad Khan and Paola Marongiu
Barbara McGillivray | @BarbaraMcGilli<https://twitter.com/BarbaraMcGilli>
Lecturer in Digital Humanities and Cultural Computation
Group lead of the Computational Humanities Research Group<https://www.kcl.ac.uk/research/computational-humanities-research-group>
Strand Campus, Strand, London, WC2R 2LS, Room 3.28, Department of Digital Humanities, King’s College London
Group lead of the Computational Humanities Research Group at King’s College London<https://www.kcl.ac.uk/research/computational-humanities-research-group>
Turing Fellow<https://www.turing.ac.uk/people/researchers/barbara-mcgillivray>, The Alan Turing Institute
Editor-in-chief of Journal of Open Humanities Data<https://openhumanitiesdata.metajnl.com/>
Our colleagues are currently taking part in a marking and assessment boycott as part of a dispute with employers about pay and working conditions. King’s has threatened punitive pay deductions for taking this action. Please visit this page<https://docs.google.com/document/d/1B20Z4FzehJErfZGnLSngP8gbDSPveiytnlA4nOd…> to see how you can help.
Dear colleagues,
*Apologies for cross-posting
Please consider submitting to DHd2024 (Digital Humanities Conference of the German Speaking Area) in Passau.
The Call for Papers also can be found here: https://dhd2024.dig-hum.de/call-for-papers/
We also warmly welcome reviewers. Please sign up at Conftool (https://www.conftool.net/dhd2024/) and select your preferred topics to review.
If you already had a conftool account for DHd2023, you can use your previous credentials.
Important Dates:
19.07.2023 Submission Deadline
02.11.2023 Notification of acceptance
04.12.2023 Submission of camera ready version
04.12.2023 Start of registration
15.02.2024 End of registration
26.02.2024 - 01.03.2024 Conference in Passau
Call For Papers (English)
(German and English CfP here: https://dhd2024.dig-hum.de/call-for-papers/ )
DHd Annual Conference PASSAU 2024
Conference theme ��DH Quo Vadis��
It has been a decade since the representatives of the DHd �C Association for Digital Humanities in the German Speaking Area first gathered in Passau for the annual conference.
The conference has since been held in various locations in Germany and other German-speaking countries.
As we approach the DHd2024, it is an opportunity to reflect on the progress made within the discipline and develop future perspectives.
Over the past ten years, not only has the DHd community steadily grown, but the fields of research and the applications of digital humanities have also evolved and transformed.
Digital methods and tools have become integral to everyday research and various aspects of life, with digitization being a prominent topic in public discourse, such as artificial intelligence and big data.
Therefore, the Digital Humanities have consistently faced the task of accompanying and providing crucial support in this process, especially in offering interpretation and assistance.
This raises questions such as:
How can the Digital Humanities meet the increasing demands in areas such as sustainability, ethics, data protection, interdisciplinary collaboration, and data-intensive research?
How will the field of DH develop in the next ten years?
What are the current and future challenges facing DH?
Which methods should be further pursued, and which ones should be reconsidered?
For DH researchers, who benefit from the interdisciplinary approach and have access to a wide range of methods, it is a challenge to maintain an overview and adequately assess all approaches.
For instance, digital methods and tools need to be critically examined and their application to diverse subject areas critically questioned.
This process also involves learning from failures and leveraging both positive and negative experiences from the last decade.
The DHd2024 conference will provide a platform to exchange these experiences and discuss potential avenues for development.
The representatives of the field have the responsibility of addressing these challenges and shaping the future of DH through collaborative discussions, to meet the demands of a changing digital society.
Possible topics for submissions:
- Methodological and technical perspectives for the future
- Artificial intelligence and its role in the humanities
- The impact of Digital Humanities in public discourse
- Citizen Science and Digital Humanities
- Reflections on ten years of DH in the German-speaking areas: trends, successes, and failures
- Current status of standardization efforts: standardized vocabularies, ontologies, markup languages, etc.
Besides these focus areas, submissions on any topic related to Digital Humanities are welcome.
Additionally, both positive and negative project results can be presented and discussed.
Submission Requirements
The following may be submitted:
Presentations (submission of at least 1500, maximum 2000 words)
Presentations in the Doctoral Consortium (submission of at least 500, maximum 750 words)
Panels (minimum three, maximum six participants, submission of at least 1200, maximum 1500 words)
Posters (minimum 500, maximum 750 words)
Pre-conference half-day or full-day workshops (minimum submission 1200 words, maximum 1500 words)
To submit, register on ConfTool (https://www.conftool.net/dhd2024/) and upload a dhc file created using the DHConvalidator web service for peer review. Additionally, provide a short summary of approximately 100-150 words in ConfTool. The accepted submissions, which are considered as citable small publications, will be published individually and in the conference publication on the Zenodo Community of the DHd Association. They will also be featured on the conference website and program with short abstracts. The DHConvalidator provides a template for submitting in Word or OpenOffice formats, which includes guidelines for citations, references, and bibliography.
The deadline for submissions is 19th July 2023, at 23:59 CEST.
Please note that, like in previous years, this deadline will not be extended.
The review process will follow an open peer review system, where the names of submitters and reviewers will be disclosed to each other (known as open identities). However, the actual reviews will not be published. Notification of acceptance will be sent by 2nd November 2023. For any submission-related queries, please contact dhd2024(a)uni-passau.de via email.
The main languages of exchange in the DHd community are German and English. Proposals can be submitted in German as well as in English. We encourage all participants to present their contributions to the conference in German, but contributions in English are also welcome.
It is understood that accepted submissions will be presented by the submitters in person and on site.
Each individual may submit only one talk or poster (as the presenting person in ConfTool) and can give only one talk. Co-authorship is limited to a maximum of two additional oral or poster submissions. Furthermore, each individual can be involved in a maximum of one panel or workshop submission.
DHd working groups are allowed one additional submission (workshop, panel, or poster), clearly identified as such. For submissions with multiple contributors, it is recommended to use the CRediT taxonomy (https://credit.niso.org/) to indicate the roles of each contributor in a footnote at the beginning of the submission (e.g., ��Contributor Roles: Firstname1 Name1 (Conceptualization), Firstname2 Name2 (Software), Firstname3 Name3 (Writing �C review & editing)��).An academic paper typically includes a bibliography at the end. The word count does not include the words in the bibliography or captions. All other words are counted.
A good submission adheres to the principles of sound research work, providing a substantial and formally structured description of the research question, materials, methods, and results. Please note that submissions should be concise yet complete academic publications suitable for publication. If you are uncertain about the format of the submission, we recommend referring to successful examples from the previous conference. The Book of Abstracts for DHd2023 offers an overview of accepted presentations, panels, posters, and workshops. Additionally, the review process handout (in German) for the DHd annual conferences is a helpful resource.
Possible formats:
Individual presentations (20�� presentation + 10�� discussion; submission of at least 1500, maximum 2000 words): Unpublished results or developments of significant new methods or digital resources and/or a methodological or theoretical concept should be presented. Presentation submissions should contextualise the research contribution in an appropriate way against the background of the current state of research and clearly demonstrate its significance for the (digital) humanities or a respective sub-field thereof. For projects without interim results, the poster format is recommended.For accepted presentation submissions, a camera-ready version of max. 2,500 words must be submitted by 01.12.2023, taking into account the expert opinions.
Presentations in the Doctoral Consortium (submission of at least 500, maximum 750 words): The promotion of young researchers is a particular concern of the annual conferences of the Dhd. For this reason, presentations can be submitted separately to a Doctoral Consortium to present dissertation projects of selected participants. In addition to the opportunity to present one��s own research topic, feedback from the specialist audience can also be obtained in this way to further develop one��s own doctoral project thematically and methodologically. The focus is on the initial phase of the doctorate. Doctoral candidates with advanced projects are therefore encouraged to submit individual presentations (see above).Expos��s (500-750 words, plus bibliography) for the Doctoral Consortium can be submitted in the usual way via the ConfTool as a contribution of the type (conference track) ��Doctoral Consortium�� until 19 July 2023. The best submissions will be invited to the Doctoral Consortium. For them, the participation fee for the conference will be waived; in addition, the costs for an overnight stay will be covered by the organisers, if possible. Additional accommodation and travel costs will be borne by the participants themselves.In addition, it is planned to award travel grants to lecturers and participants who have little or no financial resources within the framework of their own positions and projects. Participants in the Doctoral Consortium will also be able to apply for these travel grants. A separate call will be published for the travel grants.
Panels (minimum three, maximum six participants including moderator; 60�� panel discussion, 30�� Q&A from audience; minimum submission 1200, maximum 1500 words): Panels provide an opportunity for three to six participants to discuss a topic that goes beyond the scope of a single project, project network or research centre. Panels should not be composed exclusively of participants from the same project. It is encouraged to submit panels related to the conference theme and to give the audience the chance to discuss controversial topics. Care should be taken to ensure that a panel is staffed according to diversity criteria. It is expected that no more than one third of the 90-minute session will be devoted to prepared statements and that the debate within the panel will allow sufficient time for discussion with the audience. Panel organisers will submit a brief description of the topic of at least 1200 words, maximum 1500 words, and confirm the willingness of the listed individuals to participate in the panel. For the acceptance of a panel submission, the stringent presentation of the thematic or methodological coherence of the individual contributions is of decisive importance.
Posters (minimum submission 500 words, maximum 750 words): Posters can be submitted on any topic of the Call for Papers. They may also describe the status of individual projects in a descriptive manner. The posters will each be published together with the submissions in the Zenodo Community of the DHd Association under a CC-BY licence. Posters that are accepted for presentation must therefore be submitted by the presenters as a file via ConfTool by 28.02.2024 at the latest. Further information on the procedure will be communicated to the poster presenters after acceptance.
Pre-conference workshops (half day/4 h to two half days/8 h incl. breaks; submission of at least 1200, maximum 1500 words): In addition to teaching, training or tutorial formats (e.g. on specific topics, technologies, tools, key skills), collaborative forms of work on topics and/or data (e.g. hackathons, bar camps, tool testing) and longer meetings of DHd working groups geared towards a predefined output can also be submitted. Workshops will last half a day (4 hours, including break) or two half days (7-8 hours, including breaks) and will take place on Monday and Tuesday of the conference week. Submissions must include the following information:
Title and a brief description of the topic (minimum 1200, maximum 1500 words), full contact details of all contributors and a paragraph on their research interests.
Details of the format
Information on the target audience, in particular on necessary prior knowledge
The number of possible participants
Information on any technical equipment required
The call for papers specific to the workshop, if one is published.
Workshop leaders are expected to register for the conference. Only presenters who are not otherwise participating in other events of the conference can be exempted from registration.
Best
Thomas Haider
----
Dr. phil. Thomas Nikolaus Haider
Digital Humanities and Multilingual Computational Linguistics
University of Passau
The NLLP Workshop 2023<https://nllpw.org/workshop/> will take place on 7 December 2023 and will be co-located with the EMNLP 2023<https://2023.emnlp.org/> conference in Singapore.
We will be accepting direct submissions, papers submitted to ARR and EMNLP rejected papers. Submission link is here<https://softconf.com/emnlp2023/nllp2023/>
Important dates
* Submission deadline ― 18 September 2023
* Notification ― 6 October 2023
* Submission of EMNLP papers with reviews ― 10 October 2023
* ARR Committment deadline ― 10 October 2023
* ARR and EMNLP notification ― 17 October 2023
* Camera ready due ― 20 October 2023
* Presentation due ― 1 December 2023
* Workshop ― 7 December 2023
All deadlines are 11.59pm UTC -12h
Call for papers
Goal
Following the success of the first four editions of the NLLP workshop (NAACL 2019, KDD 2020, EMNLP 2021, EMNLP 2022), we aim to bring researchers and practitioners from NLP, machine learning and other artificial intelligence disciplines together with legal practitioners and researchers. We welcome submissions describing original work on legal data, as well as data with legal relevance, such as:
* Applications of NLP to legal tasks including, but not limited to:
* Legal Citation Resolution
* Case Outcome Analysis and Prediction
* Models of Legal Reasoning
* E-Discovery
* Lexical and other Data Resources for the Legal Domain
* Bias and Privacy
* Experimental results using and adapting NLP methods for legal data including:
* Classification
* Information Retrieval
* Anomaly Detection
* Clustering
* Knowledge Base Population
* Multimedia Search
* Link Analysis
* Entity Recognition and Disambiguation
* Training and Using Embeddings
* Parsing
* Dialogue and Discourse Analysis
* Text Summarization and Generation
* Relation and Event Extraction
* Anaphora Resolution
* Question Answering
* Query Understanding
* Combining Text with Structured Data
* Tasks:
* Description of new legal tasks for NLP
* Structured overviews of a specific task with the goal of identifying new areas for research
* Position papers presenting new visions, challenges and changes to existing research practices
* Resources:
* Creation of curated and/or annotated data sets that can be publicly released and used by the community to advance the field
* Demos:
* Descriptions of systems which use NLP technologies for legal text;
* Industrial Research:
* Industrial applications
* Papers describing research on proprietary data
* Interdisciplinary position papers:
* Legal or socio-legal analyses relating to the role NLP can play in the legal field
* Critical reflections on the legality and ethics of data collection and processing practices
* Critical reflections about the benefits and challenges of Large Language Models (LLMs) from a legal and regulatory perspective
* Critical reflections on the legality and ethics of data collection and processing practices
Submission
We accept papers reporting original (unpublished) research of two types:
* Long papers (max 8 pages + references)
* Short papers (max 4 pages + references)
Appendices and acknowledgements do not count against the maximum page limit and should be formatted according to the guidelines below.
To submit a paper, please access the submission link<https://softconf.com/emnlp2023/nllp2023>.
Conference proceedings will be published on the ACL Anthology<https://aclanthology.org/>.
Ethics section
The NLLP workshop adheres to the same standards regarding ethics as the EMNLP 2023 conference<https://2023.emnlp.org/calls/main_conference_papers/#ethics-policy>. Authors will be allowed extra space after the 8th page (4th for short papers) for an optional broader impact statement or other discussion of ethics. Note that an ethical considerations section is not required, but papers working with sensitive data or on sensitive tasks that do not discuss these issues will not be accepted.
Non-archival option
The authors have the option of submitting previously unpublished research as non-archival, meaning that only the abstract will be published in the conference proceedings. We expect these submissions to describe the same quality of work as archival submissions. These will be reviewed following the same procedure as archival submissions. This option accommodates publication of the work or a superset at a later date in a conference or journal which does not allow previously archived work and to encourage presentation and feedback on mature, yet unpublished work. Non-archival submissions should adhere to the same formatting and length constraints as archival submissions.
Dual Submission and Pre-print Policy
Papers that have been or will be submitted to workshops, conferences or journals during the review period must indicate so at submission time. Authors of papers accepted for presentation at the NLLP 2023 workshop must notify the organizers by the camera-ready deadline as to whether the paper will be presented or withdrawn.
If the preliminary version of a paper was posted in arXiv, the authors should NOT mention it as their own paper in the submission. Papers that violate the double-blind review requirements will be desk rejected.
Exception: Submissions with the non-archival option are excepted from these requirements.
ACL Rolling Review Submissions
Our workshop also welcomes submissions from ACL Rolling Review (ARR)<https://aclrollingreview.org/>. Authors of any papers that are submitted to ARR and have their meta review ready may submit their papers and reviews for consideration for the workshop until 10 October 2023. This should include submissions to ARR for the 15 August deadline. The decision of publication will be announced by 17 October 2023. The committment should be done via the workshop submission website: https://softconf.com/emnlp2023/nllp2023/ ("ACL Rolling Review Committment" submission type)
EMNLP 2023 Submissions
Authors of any papers that have been reviewed for EMNLP 2023 and were rejected have the opportunity to send their paper and reviews to be considered for publication in the NLLP workshop proceedings. The deadline for submitting papers and reviews is 10 October 2023. The decision of publication will be announced by 17 October 2023. The submission should be done via the workshop submission website: https://softconf.com/emnlp2023/nllp2023 ("EMNLP 2023 Submission with reviews" submission type)
Double-Blind reviewing
The review process is double-blind. Submitted papers must not include author names and affiliations and they must be written in a way so that they do not break the double-blind reviewing process. If the preliminary version of a paper was posted in arXiv, the authors should NOT mention it as their own paper in the submission. Papers that violate the double-blind review requirements will be desk rejected.
Submission Style & Format Guidelines
Paper submissions must use the official ACL style templates, which are available here<https://github.com/acl-org/acl-style-files> (Latex<https://github.com/acl-org/acl-style-files/tree/master/latex> and Word<https://github.com/acl-org/acl-style-files/tree/master/word>). Please follow the paper formatting guidelines general to "*ACL" conferences available here<https://acl-org.github.io/ACLPUB/formatting.html>.
Authors may not modify these style files or use templates designed for other conferences. Submissions that do not conform to the required styles, including paper size, margin width, and font size restrictions, will be rejected without review.
All long, short and theme papers must follow the ACL Author Guidelines<https://www.aclweb.org/adminwiki/index.php?title=ACL_Author_Guidelines>.
Presentation
* Presentation format for each paper and schedule will be announced between acceptance notification and the camera-ready deadline.
* At least one author of each accepted paper must register for the NLLP 2023 workshop by the registration deadline in order for the submission to be published in the proceedings.