We are pleased to announce release 4.4.0 of Coptic Scriptorium <https://copticscriptorium.org/> ! Our data now includes over 1,267,000 tokens of searchable, linguistically analyzed Coptic data from dozens of ancient Coptic works (an increase of almost 100,000 tokens from the previous release). Annotations include POS tagging, lemmatization, morphological analysis, dependency parses, nested NNER, entity linking and more. We are very grateful to all of our collaborators and contributors, without whom this project could not function.
This release corrects a large number of consistency errors identified in our existing data, and also adds some new documents:
* Sections of three works by Shenoute of Artipe <https://data.copticscriptorium.org/search?author=Shenoute> :
* I Have Been Considering <https://data.copticscriptorium.org/texts/shenouteconsidering/>
* So Concerning the Little Place <https://data.copticscriptorium.org/texts/shenouteplace/>
* The Lord Thundered <https://data.copticscriptorium.org/texts/shenoutethundered/>
* New documents added to existing works:
* Acephalous Work 22 <https://data.copticscriptorium.org/texts/acephalous_work_22/>
* Apophthegmata Patrum <https://data.copticscriptorium.org/texts/ap/>
* The remaining books 2-4, as well as the postscript of Pistis Sophia <https://data.copticscriptorium.org/texts/pistissophia/> , which are now added to the previously released book 1 in our online interfaces
* Newly treebanked data with syntactic gold standard annotations for the Life of John the Kalybites <https://data.copticscriptorium.org/texts/lifejohnkalybites/> , part 1
We would like to thank the Marcion Project <https://marcion.sourceforge.net/> for making the underlying digitized text of Pistis Sophia available, and all of the annotators for their hard work. Tamara Siuda, Rebecca Krawiec, Philippe Zaher, and Lance Martin contributed, in addition to Amir and Carrie. As our current DHAG grant <https://copticscriptorium.org/dhag> ends, we would like to give special thanks to Lance, who has been working as our DH specialist on the project since 2019, for doing an amazing job of keeping track of all the data and the various tasks he’s been in charge of over the past three years!
As with all releases, raw machine readable data for all corpora can be found, including morphological and syntactic analysis, as well as named entity recognition and entity linking, on our GitHub repository, in a variety of popular formats:
https://github.com/copticscriptorium/corpora
You can also search for complex linguistic annotations in the data using our ANNIS server – please see our new tutorial here to get started with some query tips and a helpful cheat sheet:
https://copticscriptorium.org/ANNIS_tutorial
We hope this release will be useful and look forward to the next one as always,
The Coptic Scriptorium team
�
�
************************************
EMNLP EvoNLP Workshop: The First Workshop on Ever Evolving NLP
-> The main submission deadline has been extended to October 12.
Workshop: https://sites.google.com/view/evonlp/
Shared Task: https://sites.google.com/view/evonlp/shared-task
Submission deadline (for papers requiring review / non-archival): 12 October, 2022 [EXTENDED!]
Submission deadline (with ARR reviews): 25 October, 2022
Notification of acceptance: 31 October, 2022
Camera-ready paper deadline: 11 November, 2022
Workshop date: 7 December, 2022 (co-located with EMNLP)
************************************
Advances in language modeling have led to remarkable accuracy on several NLP tasks, but most benchmarks used for evaluation are static, ignoring the practical setting under which training data from the past and present must be used for generalizing to future data. Consequently, training paradigms also ignore the time sensitivity of language and essentially treat all text as if it was written at a single point in time. Recent studies have shown that in a dynamic setting, where the test data is drawn from a different time period than the training data, the accuracy of such static models degrades as the gap between the two periods increases.
--------------------------------------------------------------
This workshop focuses on these time-related issues in NLP models and benchmarks. We invite researchers from both academia and industry to redesign experimental settings, benchmark datasets, and modeling by especially focusing on the “time” variable. We will welcome papers / work-in-progress on several topics including (but not limited to):
- Dynamic Benchmarks: Evaluation of Model Degradation in Time
Measuring how NLP models age
Random splits vs time-based splits (past/future)
Latency (days vs years) at which models need to be updated for maintaining task accuracy
Time-sensitivity of different tasks and the type of knowledge which gets stale
Time-sensitivity of different domains (e.g., news vs scientific papers) and how domain shifts interact with time shifts
Sensitivity of different models and architectures to time shifts
- Time-Aware Models
Incorporating time information into NLP models
Techniques for updating / replacing models which degrade with time
Learning strategies for improving temporal degradation
Trade-offs between updating a degraded model vs replacing it altogether
Mitigating catastrophic forgetting of old knowledge as we update models with new knowledge
Improving plasticity of models so that they can be easily updated
Retrieval based models for improving temporal generalization
- Analysis of existing models / datasets
Characterizing whether degradation on a task is due to outdated facts or changes in language use
Effect of model scale on temporal degradation – do large models exhibit less degradation?
Efficiency / accuracy trade-offs when updating models
--------------------------------------------------------------
All accepted papers will be published in the workshop proceedings unless requested otherwise by the authors. Submissions can be made either via OpenReview where they will go through the standard double-blind process, or through ACL Rolling Review with existing reviews. See details below.
---- Submission guidelines ----
We seek submissions of original work or work-in-progress. Submissions can be in the form of long/short papers and should follow the ACL main conference template. Authors can choose to make their paper archival/non-archival. All accepted papers will be presented at the workshop.
Archival track
We will follow double-blind review process and use OpenReview for the submissions. We also will accept ACL rolling review (ARR) submissions with reviews. Since these submissions already come with reviews, the submission deadline is much later than the initial deadline. We will use Open Review for the submissions.
Submission link: https://openreview.net/group?id=EMNLP/2022/Workshop/EvoNLP
For papers needing review click “EMNLP 2022 Workshop EvoNLP submission”
For papers from ARR click “EMNLP 2022 Workshop EvoNLP commitment Submission”
---- Non-archival track ----
Non-archival track seeks recently accepted / published work as well as work-in-progress. It does not need to be anonymized and will not go through the review process. The submission should clearly indicate the original venue and will be accepted if the organizers think the work will benefit from exposure to the audience of this workshop.
Submission: Please email your submission as a single PDF file to evonlp(a)googlegroups.com. Include “EvoNLP Non-Archival Submission” in the title and the author names and affiliation within the body of your email.
---- Shared task (TempoWiC) ----
The workshop features a shared task on meaning shift detection in social media. Official competition is closed, but new submissions can be made to Codalab: https://codalab.lisn.upsaclay.fr/competitions/5360
--
Jose Camacho Collados
http://www.josecamachocollados.com<http://www.josecamachocollados.com/>
Corpus Linguistics, Learner Corpora, and SLA: Employing Technology to Analyze Language Use
Online talk. October 5, 18:00 (Madrid time) / 17:00 (UK time)
Prof Tony McEnery, University of Lancaster
Free registration link<https://umurcia.zoom.us/webinar/register/WN_ey8fEpC3RkC7_vAUKEpAgg>. You’ll receive an email with the ZOOM link and code.
Abstract
In this talk I will explore the relationship between learner corpus and second language acquisition research. I begin by considering the origins of learner corpus research, noting its roots in smaller scale studies of learner language. This development of learner corpus studies will be considered in the broader context of the development of corpus linguistics. I then consider the aspirations that learner corpus researchers have had to engage with second language acquisition research and explore why, to date, the interaction between the two fields has been minimal. By exploring some of the corpus building practices of learner corpus research, and the theoretical goals of second language acquisition studies, I will identify reasons for this lack of interaction and make proposals for how this situation could be fruitfully addressed.
Tony McEnery is distinguished professor of Linguistics and English Langauge in the Department of English Language and Linguistics at Lancaster University and Changjiang Chair at Xi’an Jiaotong University in China. He has published widely on corpus linguistics and is the author of Corpus Linguistics: Method, Theory and Practice (with Andrew Hardie, Cambridge University Press, 2011). His latest book, Fundamental Principles of Corpus Linguistics (with Vaclav Brezina) is due out from Cambridge University Press soon.
More info:
https://www.um.es/languagecorpora/2022/09/20/corpus-linguistics-learner-cor…
Pascual Pérez-Paredes
https://webs.um.es/pascualf
Dear All,
Sorry for the confusion. Please see the updated version below.
Could you please circulate the following announcement about the revised application deadline for the R for Stylistics Workshop?
Thank you.
Sara Bartl
Dear Colleagues,
Due to the high number of sign-ups for the R for Stylistics workshop, we have decided to close applications early. The new deadline is 7 October.
Please note that this is an in-person event only!
R for Stylistics is a two-day (January 19-20, 2023), hands-on workshop for students and scholars of stylistics/literary linguistics who are eager to learn how to use the R programming environment to handle textual data. The workshop will follow an intensive two-day program in which participants will learn the basics of working with R and R Studio, as well as some methods of basic textual analysis with R. The workshop is best suited to advanced students (i.e. MA, PhD) and scholars of stylistics with no prior programming experience. For more information and to register/apply: https://docs.google.com/forms/d/e/1FAIpQLSf1Wrv14EmX2_hsXG_rt0IkxXPvnibl9Qd…
Sara Bartl (she/they)
@SaraBartl<https://twitter.com/SaraBartl>
Doctoral Researcher
English Language and Linguistics
University of Birmingham
Currently reading: Everything Under - Daisy Johnson
The independent research group on Diversity-Aware NLP Intelligent
Systems (DANIS) headed by Dr. Agnieszka Faleńska [1] invites
applications for a *PhD position*. The group is a part of the project
Reflecting Intelligent Systems for Diversity, Demography, and Democracy
(IRIS3D) [2] funded by the Ministry of Science, Research and the Arts of
the State of Baden-Württemberg [3]. It is located at the Institute for
Natural Language Processing (IMS) [4] at the University of Stuttgart,
Germany [5].
*Project*
The immense influence of NLP systems on human lives raises increasing
concerns about the possible harm these tools can cause. Harmful
behaviors of such systems are regarded as symptoms of their bias, i.e.,
the systematic preference or discrimination against certain groups of
users. NLP tools are commonly trained on textual corpora that display
such biases already at the level of their authors. For example,
Wikipedia, which is one of the most commonly used sources of training
data, is created by a predominantly white and male group of editors.
Such a lack of diversity among authors can lessen the impact of data
from minorities and, as a consequence, result in NLP models that reflect
the underlying demographic imbalances. DANIS contributes to the
discourse of fairness in AI by facilitating the design of NLP
intelligent systems that can recognize inputs from underrepresented
groups of users and strengthen their role in the training processes.
*Position*
The successful candidate will work on topics ranging from (i)
recognition of textual phenomena that identify distinctive demographic
groups, (ii) analysis techniques to investigate the influence of
particular training signals on neural NLP models, and (iii)
incorporating the knowledge of biases into a standard NLP methodology.
The position is available for four years, starting in December 2022 or
soon thereafter. The salary is according to the German university pay
scale (TV-L 13 100%, approx. 50k EUR per year before taxes, see [6] for
details). The PhD candidate will benefit from the support structures
offered by the Graduate Academy of the University of Stuttgart (GRADUS)
[7] and IRIS [8]. Moreover, they will have the chance to collaborate
with other IRIS3D projects as well as the interdisciplinary research
group E-DELIB (Powering up e-deliberation: towards AI-supported
moderation) [9] and the Emmy Noether research group MisT (Computational
Models of Misunderstanding) [13].
*Candidate's Profile*
* Master’s degree in computational linguistics, computer science,
computational social science, or related fields
* knowledge of natural language processing (NLP) / computational
linguistics (CL)
* strong programming skills
* advanced knowledge of machine learning methods, experience in deep
learning is a plus
* excellent communication skills and interest in interdisciplinary work
* proficiency in English (knowledge of German is not required)
*How to Apply?*
To apply, please send as a single PDF document:
* a brief motivation letter including your research interests
* a CV including a list of publications (if applicable) and contact
information of one to two references
Applications should be sent to Agnieszka Faleńska (agnieszka.falenska at
ims.uni-stuttgart.de). Applications submitted before October 21, 2022
will receive full consideration. The position will remain open until
filled, so do not hesitate to get in touch when you find this opening
after October 21st. The (online) interviews will take place at the
beginning of November.
The University of Stuttgart aims to achieve equal opportunities for men
and women in university studies and science and to increase the number
of women in academia [12].Therefore, people of all genders are equally
encouraged to apply. Applications of severely disabled candidates with
equivalent qualifications will be given priority.
*About IRIS and IRIS3D*
The IRIS3D project is embedded in the Cyber Valley ecosystem [10]. It
consists of three independent research groups contributing to the
reflection on the societal impacts of intelligent systems. The project
is a part of the Interchange Forum for Reflecting on Intelligent Systems
(IRIS). IRIS provides a platform to stimulate critical reflection on the
foundations, mechanisms, implications, and effects of intelligent
systems in research, teaching, and society.
*About Stuttgart and the University of Stuttgart*
The University of Stuttgart is a technically oriented university in
Germany. It is especially known for engineering and related topics, with
its computer science department being ranked highly, both nationally and
internationally. The Institute for Natural Language Processing, which is
part of the Faculty of Computer Science and Electrical Engineering, is
one of the largest academic research institutes for natural language
processing in Germany, with three full professors, an assistant
professor, three senior lecturers, and a staff of more than thirty
researchers. Its activities range from computational corpus linguistics
to semantic processing, deep learning, machine translation,
psycholinguistics, and phonetics, and hosts several projects funded by
the European Commission (EC), the German Research Foundation (DFG), and
various foundations. The institute manages dedicated BSc and MSc
programs in Computational Linguistics.
The city of Stuttgart [11] is the capital of the state of
Baden-Württemberg in southwest Germany. It is a lively and international
city, known for its strong economy and rich culture. With Germany's
high-speed train system, it is well-connected to many other interesting
places, for instance, Munich and Cologne (~2.5 hours), Paris (~3.5
hours), Berlin (~5.5 hours), Strasbourg (<1.5 hours), and Lake of
Constance (~2.5 hours).
*Links*
[1] www.ims.uni-stuttgart.de/en/institute/team/Falenska
[2] www.iris.uni-stuttgart.de/research/iris3d
[3] mwk.baden-wuerttemberg.de/de/startseite
[4] www.ims.uni-stuttgart.de
[5] www.uni-stuttgart.de
[6] oeffentlicher-dienst.info/c/t/rechner/tv-l/west?id=tv-l-2015
[7]
www.student.uni-stuttgart.de/en/uni-a-to-z/Graduate-Academy-of-the-Universi…
[8] www.iris.uni-stuttgart.de
[9] www.ims.uni-stuttgart.de/en/institute/researchgroups/edelib
[10] cyber-valley.de
[11] www.stuttgart-tourist.de/en
[12]
www.uni-stuttgart.de/en/university/organization/management/staff-positions/…
[13] www.ims.uni-stuttgart.de/en/institute/researchgroups/mist
Dear All,
Could you please circulate the following announcement about the revised application deadline for the R for Stylistics Workshop?
Thank you.
Sara Bartl
Dear Colleagues,
Due to the high number of sign-ups for the R for Stylistics workshop, we have decided to close applications early. The new deadline is 7 October.
R for Stylistics is a two-day (January 19-20, 2023), hands-on workshop for students and scholars of stylistics/literary linguistics who are eager to learn how to use the R programming environment to handle textual data. The workshop will follow an intensive two-day program in which participants will learn the basics of working with R and R Studio, as well as some methods of basic textual analysis with R. The workshop is best suited to advanced students (i.e. MA, PhD) and scholars of stylistics with no prior programming experience. For more information and to register/apply: https://docs.google.com/forms/d/e/1FAIpQLSf1Wrv14EmX2_hsXG_rt0IkxXPvnibl9Qd…
Sara Bartl (she/they)
@SaraBartl<https://twitter.com/SaraBartl>
Doctoral Researcher
English Language and Linguistics
University of Birmingham
Currently reading: Everything Under - Daisy Johnson
We are pleased to announce the following *PhD position* in the area of Natural Language Processing at the Digital Age Research Center (D!ARC) at the University of Klagenfurt to commence at the earliest opportunity.
Level of employment: 75% (30hours/week)
Minimum salary: € 32.116,00 per annum (gross), Classification according to collective agreement: B1
Limited to: 4 years
Application deadline: open until filled
Tasks and responsibilities: contributing to a new research project in the area abusive language detection/hate speech detection
Prerequistites for the appointment:
• A Master’s degree in the field of computational linguistics, linguistics, computer science or alike, graded with excellent success and corresponding knowledge in the field
• Good programming skills (preferably Python)
• Fluent in English (both spoken and written); knowledge in German is a plus
Additional desired qualifications:
• Familiarity with setting up annotation tasks via crowdsourcing
• Experience with supervised learning, particularly state-of-the-art deep learning algorithms
• Profound knowledge of publicly available NLP tools and resources
• Experience with NLP on social media, particularly Twitter
• Experience with web crawling and processing large amounts of textual data
• Publications at scientific conferences and in journals in the field relating to the position
• Social and communication skills, ability to work independently
Our offer:
The employment contract is concluded with a starting salary of € 2.294,00 gross per month (14 times a year; previous experience deemed relevant to the job can be recognised in accordance with the collective agreement). The University of Klagenfurt also offers:
• Personal and professional advanced training courses, management and career coaching
• Numerous attractive additional benefits, see also https://jobs.aau.at/en/the-university-as-employer/
• Diversity- and family-friendly university culture
• The opportunity to live and work in the attractive Alps-Adriatic region with a wide range of leisure activities in the spheres of culture, nature and sports
The application:
If you are interested in this position, please apply in German or English providing the usual documents:
• Letter of application
• Curriculum vitae
• Proof of all completed higher education programmes (certificates, supplements, if applicable)
• Other documentary evidence that may be relevant to this announcement (see prerequisites and desired qualifications)
To apply, please send all your application materials to michael.wiegand(a)aau.at<mailto:michael.wiegand@aau.at>. Please also use this address for obtaining further information on this specific vacancy.
General information about the university as an employer can be found at https://jobs.aau.at/en/the-university-as-employer/.
The University of Klagenfurt aims to increase the proportion of women and therefore specifically invites qualified women to apply for the position. Where the qualification is equivalent, women will be given preferential consideration.
People with disabilities or chronic diseases, who fulfil the requirements, are particularly encouraged to apply.
**** We apologize for the multiple copies of this email. In case you are
already registered to the next webinar, you do not need to register
again. ****
Dear colleague,
We are happy to announce the next webinar in the Language Technology
webinar series organized by the HiTZ research center (Basque Center for
Language Technology, http://hitz.eus). We are organizing one seminar
every month. You can check the videos of previous webinars and the
schedule for upcoming webinars here: http://www.hitz.eus/webinars Next
webinar:
* *Speaker*: Xiang Ren (University of Southern California - USC)
* *Title*: Commonsense Reasoning in the Wild
* *Date*: October 6, 2022, 17:00 CET
* *Summary*: Current NLP systems impress us by achieving
close-to-human performance on benchmarks of answering commonsense
questions or writing interesting stories. However, most of the
progress is evaluated using static, closed-ended datasets created
for individual tasks. To deploy commonsense reasoning services in
the wild, we look to develop and evaluate systems that can generate
answers in an open-ended way, perform robust logical reasoning, and
generalize across diverse task formats, domains, and datasets. In
this talk I will share our effort on introducing new formulations of
commonsense reasoning challenges and novel evaluation protocols,
towards broadening the scope in approaching machine common sense. We
hope that such a shift of evaluation paradigm would encourage more
research on externalizing the model reasoning process and improving
model robustness and cross-task generalization.
* *Upcoming webinars*:
o Incorporating Commonsense Reasoning into NLP Models -- Vered
Shwartz (November 3, 2022)
o Title to be announced -- Maarit Koponen (December 1, 2022)
Check past and upcoming webinars at the following url:
http://www.hitz.eus/webinars If you are interested in participating,
please complete this registration form:
http://www.hitz.eus/webinar_izenematea
If you cannot attend this seminar, but you want to be informed of the
following HiTZ webinars, please complete this registration form instead:
http://www.hitz.eus/webinar_info
Best wishes,
HiTZ Zentroa
Hi
We are looking for two researchers (at post-doc or pre-doc level) to
work with us in Edinburgh on two new Horizon Europe projects. Both
projects start in Autumn 2022 and run for 3 years.
The first project (High Performance Language Technologies -- HPLT) will
create an open platform that trains reproducible high-quality machine
translation systems and language models. The project is gathering over
a million GPU hours and 7 petabytes of data. Research in HPLT focuses on
efficiency and models that apply to a wide range of languages. The
second project (UTTER -- Unified Transcription and Translation for
Extended Reality) aims to use large language models to develop scalable,
adaptable, contextualised, emotion-aware and explainable technologies
for translation, summarisation and transcription. The project will
develop prototypes for a multilingual meeting assistant and for dialogue
translation in customer care. The new researchers will work on problems
related to these goals.
For more information, and how to apply, see https://neural.mt/jobs/
best
Barry
The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336. Is e buidheann carthannais a th’ ann an Oilthigh Dhùn Èideann, clàraichte an Alba, àireamh clàraidh SC005336.
(apologies for cross-posting)
*=== Important Information ===*
Website:
https://2023.eacl.org/
*Submission Deadline*: Thursday, 20 October 2022
*Conference Dates*: Tuesday, 2 May 2023 to 6 May 2023
*Location*: Dubrovnik, Croatia
*Contact*: Alessandro Moschitti (General Chair): amoschitti(a)gmail.com //
Isabelle Augenstein & Andreas Vlachos (Program Chairs):
eacl2023pcs(a)googlegroups.com
*=== Overview ===*The 17th Conference of the European Chapter of the
Association for Computational Linguistics (EACL 2023) invites the
submission of long and short papers on substantial, original, and
unpublished research on empirical methods for Natural Language Processing.
As in recent years, some of the presentations at the conference will be for
papers accepted by the Transactions of the ACL (TACL) and Computational
Linguistics (CL) journals.
EACL 2023 will follow EMNLP 2022 and go with a hybrid format with respect
to ARR. This means that while EACL will accept ARR-reviewed papers, it will
also accept submissions directly to EACL through the START system.
However, in order to keep the review load on the community as a whole
manageable, we ask authors to decide up-front if they want their papers to
be reviewed through ARR or EACL.
*Review Process:*Papers submitted directly to EACL will have the “regular”
review process: paper reviewed by 3 reviewers, authors are invited to write
an author response and revise their paper before the camera ready deadline,
if accepted. ARR papers committed to EACL will be handled by the Senior
Area Chairs. For these papers, the authors may provide an author response
but not revise their paper (with the exception of adding the required
“limitations” section, if it was missing from the ARR submission; see
below).
*Cross-Submission Policy with ARR:*Any ARR-reviewed paper that has received
all of its reviews and meta-reviews available by the ARR-to-conference
commitment deadline (January 8, 2023), can be committed to EACL 2023.
Note: Submissions from ARR cannot be modified except that they can be
associated with an author response.
Note: While EACL will consider any ARR paper that has been fully reviewed
by January 8, 2023, unlike for ACL and NAACL 2022, EACL does not guarantee
a date such that all papers submitted to ARR before the date are guaranteed
to receive necessary reviews in time for the EACL commitment deadline.
Consequently, care must be taken in deciding whether a submission should be
made to ARR or EACL directly if the work has not been submitted anywhere
before the call. Plan accordingly.
Note: The START system deadline for direct submission papers, namely
non-ARR submission papers, is October 20, 2022.
Papers submitted to ARR before October 13, 2022, can be withdrawn and
submitted to EACL 2023.
Note: In order for a paper to be submitted directly to EACL 2023, it must
be inactive in the ARR system. This means that the submission must either
be explicitly withdrawn by the authors, or the ARR reviews are finished and
shared with the authors before October 13, 2022, and the paper was not
re-submitted to ARR.
Note: The authors can withdraw their paper from ARR by October 13, 2022,
regardless of how many reviews it has received.
Papers that are in the ARR system after October 13, 2022, either submitted
after or submitted before and not withdrawn, cannot be directly submitted
to EACL 2023.
Papers submitted to EACL 2023 may not be submitted for review elsewhere
(including ARR) while being under review at EACL 2023.
*=== Important Dates ===*- Abstract registration deadline for START direct
submissions: October 13, 2022
- Anonymity period begins: October 13, 2022 (same as the abstract
registration date)
- Direct paper submission deadline (long & short papers): October 20, 2022
- Author response period: December 6-12, 2022
- Commitment deadline for ARR papers: January 8, 2023
- Notification of acceptance (long & short papers): January 20, 2023
- Camera-ready papers due (long & short papers): February 10, 2023
- Workshops & Tutorials: May 2 and 6 2023
- Main Conference: May 3-5, 2023
All deadlines are 11.59 pm UTC -12h ("anywhere on Earth").
*=== Mandatory abstract submission ===*The paper title, author names,
contact details, and a brief abstract must be submitted electronically
through the EACL 2023 paper submission site by the abstract submission
deadline (October 13, 2022). It will be possible to make minor edits to the
title and abstract until the full paper submission deadline, but you cannot
change authors and subject areas. Submissions with “placeholder” abstracts
will be removed without consideration; Important: if you miss the abstract
submission deadline, you cannot submit the full paper.
*=== Submissions ===*EACL 2023 has the goal of a broad technical program.
Relevant topics for the conference include, but are not limited to, the
following areas (in alphabetical order):
- Anaphora, Discourse and Pragmatics
- Computational Social Science and Social Media
- Dialogue and Interactive Systems
- Document analysis, Text Categorization and Topic Models
- Generation and Summarization
- Green and Sustainable NLP
- Information Retrieval and Search
- Information Extraction
- Interpretability and Model Analysis in NLP
- Language Resources and Evaluation
- Language Grounding and Multi-Modality
- Linguistic Theories, Cognitive Modeling and Psycholinguistics
- Machine Learning in NLP
- Machine Translation
- Multilinguality
- Multidisciplinary and NLP Applications
- Question Answering
- Semantics: lexical
- Semantics: sentence level and other areas
- Sentiment Analysis and Argument Mining
- Phonology, Morphology, and Word Segmentation
- Tagging, Chunking, Syntax, and Parsing
*=== Paper Submission Information ===*
*Long Papers*Long paper submissions must describe substantial, original,
completed and unpublished work. Wherever appropriate, concrete evaluation
and analysis should be included. Review forms will be made available prior
to the deadlines. Long papers may consist of up to 8 pages of content, plus
unlimited pages for references and appendix; final versions of long papers
will be given one additional page of content (up to 9 pages) so that
reviewers’ comments can be taken into account.
*Short Papers*Short paper submissions must describe original and
unpublished work. Please note that a short paper is not a shortened long
paper. Instead short papers should have a point that can be made in a few
pages.
Short papers may consist of up to 4 pages of content, plus unlimited
references and appendix. Upon acceptance, short papers will be given 5
content pages in the proceedings. Authors are encouraged to use this
additional page to address reviewers’ comments in their final versions.
*Presentation Mode*Long and short papers will be presented orally or as
posters as determined by the program committee. The decisions as to which
papers will be presented orally and which as poster presentations will be
based on the nature rather than the quality of the work. While short papers
will be distinguished from long papers in the proceedings, there will be no
distinction in the proceedings between papers presented orally and as
posters.
*Authorship*
The author list for submissions should include all (and only) individuals
who made substantial contributions to the work presented. Each author
listed on a submission to EACL 2023 will be notified of submissions,
revisions and the final decision. No changes to the order or composition of
authorship may be made to submissions to EACL 2023 after the abstract
submission deadline.
*Citation and Comparison*You are expected to cite all refereed publications
relevant to your submission, but you may be excused for not knowing about
all unpublished work (especially work that has been recently posted and/or
is not widely cited). While not citing such unpublished works upon
submission is not sufficient grounds for paper rejection, you are expected
to cite such relevant work in camera ready, if notified about it by
reviewers.
In cases where a preprint has been superseded by a refereed publication,
the refereed publication should be cited instead of the preprint version.
Papers (whether refereed or not) appearing less than 3 months before the
submission deadline are considered contemporaneous to your submission, and
you are therefore not obliged to make detailed comparisons that require
additional experimentation and/or in-depth analysis. However, you are
expected to mention such works in your submission, and list their published
results if they are directly relevant.
For more information, see the ACL Policies for Submission, Review, and
Citation:
https://www.aclweb.org/adminwiki/index.php/ACL_Policies_for_Submission,_Rev…
*Multiple Submission Policy*
EACL 2023 will not consider any paper that is under review in a journal or
another conference at the time of submission, and submitted papers must not
be submitted elsewhere during the EACL 2023 review period. This policy
covers all refereed and archival conferences and workshops (e.g., NeurIPS,
ACL workshops), as well as ARR. In addition, we will not consider any paper
that overlaps significantly in content or results with papers that will be
(or have been) published elsewhere. Authors submitting more than one paper
to EACL 2023 must ensure that their submissions do not overlap
significantly (>25%) with each other in content or results.
EACL 2023 will also accept submissions of ARR-reviewed papers, provided
that the ARR reviews and meta-reviews are available by the
ARR-to-conference submission deadline. However, EACL 2023 will not accept
direct submissions that are actively under review in ARR, or that overlap
significantly (>25%) with such submissions.
*Mandatory Discussion of Limitations*We believe that it is also important
to discuss the limitations of your work, in addition to its strengths.
Following the EMNLP format, EACL 2023 requires all papers to have a clear
discussion of limitations, in a dedicated section titled “Limitations”.
This section will appear at the end of the paper, after the
discussion/conclusions section and before the references, and will not
count towards the page limit. Papers without a limitation section will be
automatically rejected without review.
ARR-reviewed papers that did not include “Limitations” section in their
prior submission, should submit a PDF with such a section together when
they commit their papers to EACL 2023 via Softconf.
While we are open to different types of limitations, just mentioning that a
set of results have been shown for English only probably does not reflect
what we expect. Mentioning that the method works mostly for languages with
limited morphology, like English, is a much better alternative. In
addition, limitations such as low scalability to long text, the requirement
of large GPU resources, or other things that inspire crucial further
investigation are welcome.
*Ethics Policy*Authors are required to honor the ethical code set out in
the ACL Code of Ethics. The consideration of the ethical impact of our
research, use of data, and potential applications of our work has always
been an important consideration, and as artificial intelligence is becoming
more mainstream, these issues are increasingly pertinent. We ask that all
authors read the code, and ensure that their work is conformant to this
code. Where a paper may raise ethical issues, we ask that you include in
the paper an explicit discussion of these issues, which will be taken into
account in the review process. We reserve the right to reject papers on
ethical grounds, where the authors are judged to have operated counter to
the code of ethics, or have inadequately addressed legitimate ethical
concerns with their work.
Authors will be allowed extra space after the 8th page (4th for short
papers) for an optional broader impact statement or other discussion of
ethics. The EACL review form will include a section addressing these issues
and papers flagged for ethical concerns by reviewers or ACs will be further
reviewed by an ethics committee. Note that an ethical considerations
section is not required, but papers working with sensitive data or on
sensitive tasks that do not discuss these issues will not be accepted.
Conversely, the mere inclusion of an ethical considerations section does
not guarantee acceptance. In addition to acceptance or rejection, papers
may receive a conditional acceptance recommendation. Camera-ready versions
of papers designated as conditional accept will be re-reviewed by the
ethics committee to determine whether the concerns have been adequately
addressed. Please read the ethics FAQ for more guidance on some problems to
look out for and key concerns to consider relative to the code of ethics.
*Paper Submission and Templates*Submission is electronic, using the
Softconf START conference management system. Both long and short papers
must follow the EACL 2023 two-column format, using the supplied official
style files. Please do not modify these style files, nor should you use
templates designed for other conferences. Submissions that do not conform
to the required styles, including paper size, margin width, and font size
restrictions, will be rejected without review.
*Verification*To guarantee conformance to publication standards, we will be
using the ACL Pubcheck tool (https://github.com/acl-org/aclpubcheck). The
PDFs of camera-ready papers must be run through this tool prior to their
final submission, and we recommend its use also at submission time.
*Optional Supplementary Materials: Appendices, Software and Data*Each EACL
2023 submission can be accompanied by an appendix, which will appear in the
main paper’s PDF, after the bibliography. A submission may also be
accompanied by one .tgz or .zip archive containing software, and one .tgz
or .zip archive containing data. EACL 2023 encourages the submission of
these supplementary materials to improve the reproducibility of results,
and to enable authors to provide additional information that does not fit
in the paper. For example, anonymised related work (see above),
preprocessing decisions, model parameters, feature templates, lengthy
proofs or derivations, pseudocode, sample system inputs/outputs, and other
details that are necessary for the exact replication of the work described
in the paper can be put into the appendix. However, the paper submissions
need to remain fully self-contained, as these supplementary materials are
completely optional, and reviewers are not even asked to review or download
them. If the pseudo-code or derivations or model specifications are an
important part of the contribution, or if they are important for the
reviewers to assess the technical correctness of the work, they should be a
part of the main paper, and not appear in the appendix. Supplementary
materials need to be fully anonymized to preserve the double-blind
reviewing policy.
*=== Anonymity Period ===*The following rules and guidelines are meant to
protect the integrity of double-blind review and ensure that submissions
are reviewed fairly. The rules make reference to the anonymity period,
which runs from 1 week before the direct full paper submission deadline
(starting October 13, 2022) up to the date when your paper is accepted or
rejected (January 20, 2023). Papers that are withdrawn during this period
will no longer be subject to these rules.
- You may not make a non-anonymized version of your paper available online
to the general community (for example, via a preprint server) during the
anonymity period. Versions of the paper include papers having essentially
the same scientific content but possibly differing in minor details
(including title and structure) and/or in length.
- If you have posted a non-anonymized version of your paper online before
the start of the anonymity period, you may submit an anonymized version to
the conference. The submitted version must not refer to the non-anonymized
version, and you must inform the programme chairs that a non-anonymized
version exists.
- You may not update the non-anonymized version during the anonymity
period, and we ask you not to advertise it on social media or take other
actions that would further compromise double-blind reviewing during the
anonymity period.
- You may make an anonymized version of your paper available (for example,
on OpenReview), even during the anonymity period.
- For arXiv submissions, October 13, 2022 11:59pm UTC-12h (anywhere on
earth) is the latest time the paper can be uploaded.
*=== Instructions For Double-Blind Review ===*As reviewing will be double
blind, papers must not include authors’ names and affiliations.
Furthermore, self-references or links (such as github) that reveal the
author’s identity, e.g., “We previously showed (Smith, 1991) …” must be
avoided. Instead, use citations such as “Smith previously showed (Smith,
1991) …” Papers that do not conform to these requirements will be rejected
without review. Papers should not refer, for further detail, to documents
that are not available to the reviewers. For example, do not omit or redact
important citation information to preserve anonymity. Instead, use third
person or named reference to this work, as described above (“Smith showed”
rather than “we showed”). If important citations are not available to
reviewers (e.g., awaiting publication), these paper/s should be anonymised
and included in the appendix. They can then be referenced from the
submission without compromising anonymity. Papers may be accompanied by a
resource (software and/or data) described in the paper, but these resources
should also be anonymized.
*=== Reproducibility Criteria ===*Reviewers will be asked to assess the
reproducibility of the work as part of their reviews. The following are
the criteria that reviews will take under consideration.
For all reported experimental results:
- A clear description of the mathematical setting, algorithm, and/or model.
- Submission of a zip file containing source code, with specification of
all dependencies, including external libraries, or a link to such resources
(while still anonymized)
- Description of computing infrastructure used
- The average runtime for each model or algorithm (e.g., training,
inference, etc.), or estimated energy cost
- Number of parameters in each model
- Corresponding validation performance for each reported test result
- Explanation of evaluation metrics used, with links to code
For all experiments with hyperparameter search:
- The exact number of training and evaluation runs
- Bounds for each hyperparameter
- Hyperparameter configurations for best-performing models
- Number of hyperparameter search trials
- The method of choosing hyperparameter values (e.g., uniform sampling,
manual tuning, etc.) and the criterion used to select among them (e.g.,
accuracy)
- Summary statistics of the results (e.g., mean, variance, error bars, etc.)
For all datasets used:
- Relevant details such as languages, and number of examples and label
distributions
- Details of train/validation/test splits
- Explanation of any data that were excluded, and all pre-processing steps
- A zip file containing data or link to a downloadable version of the data
- For new data collected, a complete description of the data collection
process, such as instructions to annotators and methods for quality control.
This list is based on Dodge et al, 2019 and Joelle Pineau's reproducibility
checklist.
*=== Presentation Requirement ===*All accepted papers must be presented at
the conference—either on-line or in-person—in order to appear in the
proceedings. Authors of papers accepted for presentation at EACL 2023 must
notify the program chairs by the camera-ready deadline if they wish to
withdraw the paper.
At least one author of each accepted paper must register for EACL 2023 by
the early registration deadline.
More information can be found in the Committee blog. If you have questions
that are not answered there, please email the program co-chairs at
eacl2023pcs(a)gmail.com.