*Call for Papers: GenBench, the first workshop on generalisation
(benchmarking) in NLP*
****New: the Collaborative Benchmarking Task submissions are now open;
visit *
*https://github.com/GenBench/genbench_cbt
<https://github.com/GenBench/genbench_cbt>.With the support of our workshop
sponsor Amazon, we are now offering scholarships for travel expenses.****
Workshop description
The ability to generalise well is often mentioned as one of the primary
desiderata for models of natural language processing.
It is crucial to ensure that models behave robustly, reliably and fairly
when making predictions about data that is different from the data that
they were trained on.
Generalisation is also important when NLP models are considered from a
cognitive perspective, as models of human language.
Yet, there are still many open questions related to what it means for an
NLP model to generalise well and how generalisation should be evaluated.
The first GenBench workshop aims to serve as a cornerstone to catalyse
research on generalisation in the NLP community.
In particular, the workshop aims to:
-
Bring together different expert communities to discuss challenging
questions relating to generalisation in NLP;
-
Crowd-source a collaborative generalisation benchmark hosted on a
platform for democratic state-of-the-art (SOTA) generalisation testing in
NLP.
The first GenBench workshop on generalisation (benchmarking) in NLP will be
co-located with EMNLP 2023.
Submission types
We call for two types of submissions: regular workshop submissions and
collaborative benchmarking task submissions.
The latter will consist of a data/task artefact and a companion paper
motivating and evaluating the submission. In both cases, we accept archival
papers and extended abstracts.
1. Regular workshop submissions
Regular workshop submissions present papers on the topic of generalisation
(see examples listed below) but are not intended to be included on the
GenBench evaluation platform.
Regular workshop papers may be submitted as an archival paper when they
report on completed, original and unpublished research; or as a shorter
extended abstract. More details on this category can be found below.
Topics of interest include, but are not limited to:
-
Opinion or position papers about generalisation and how it should be
evaluated;
-
Analyses of how existing or new models generalise;
-
Empirical studies that propose new paradigms to evaluate generalisation;
-
Meta-analyses that investigate how results from different generalisation
studies compare to one another;
-
Meta-analyses that study how different types of generalisation are
related;
-
Papers that discuss how generalisation of LLMs can be evaluated without
access to training data;
-
Papers that discuss why generalisation is (not) important in the era of
LLMs;
-
Studies on the relationship between generalisation and fairness or
robustness.
If you are unsure whether a specific topic is well-suited for submission,
feel free to reach out to the organisers of the workshop at
genbench(a)googlegroups.com.
2. Collaborative Benchmarking Task submissions
Collaborative benchmarking task submissions consist of a data/task artefact
and a paper describing and motivating the submission and showcasing it on a
select number of models.
We accept submissions that introduce new datasets, resplits of existing
datasets along particular dimensions, or in-context learning tasks, with
the goal of measuring generalisation of NLP models.
We especially encourage submissions that focus on:
-
Generalisation in the context of fairness and inclusivity;
-
Multilingual generalisation;
-
Generalisation in LLMs, where we have no control over the training data.
Each submission should contain information about the data (URIs, format,
preprocessing), model preparation (finetuning loss, ICL prompt templates),
and evaluation metrics. These will be defined either in a configuration
file or in code.
More details about the collaborative benchmarking task submissions and
example submissions can be found on our website: visit genbench.org/cbt for
more information or github.com/GenBench/genbench_cbt to prepare your
submission.
Note that there is a sample data submission deadline (August 1) in addition
to the final submission deadline (September 1).
Participants proposing previously unpublished datasets or splits may choose
to submit an archival paper or an extended abstract.
Generalisation evaluation datasets that have already been published
elsewhere (or will be published at EMNLP 2023) can be submitted to the
platform, as well, but only through an extended abstract, citing the
original publication.
We allow dual submissions with EMNLP. For more information, see below.
If you are in doubt about whether a particular type of dataset is suitable
for submission, please consult the information page on our website, or
reach out to the organisers of the workshop at genbench(a)googlegroups.com.
Archival vs extended abstract
Archival papers are up to 8 pages excluding references and report on
completed, original and unpublished research. They follow the requirements
of regular EMNLP 2023 submissions.
Accepted papers will be published in the workshop proceedings and are
expected to be presented at the workshop.
The papers will undergo double-blind peer review and should thus be
anonymised.
Extended abstracts can be up to 2 pages excluding references and may report
on work in progress or be cross-submissions of work that has already
appeared in another venue. Abstract titles will be posted on the workshop
website but will not be included in the proceedings.
Submission instructions
For both archival papers and extended abstracts, we refer to the EMNLP 2023
website for paper templates. Additional requirements for both regular
workshop papers and collaborative benchmarking task submissions can be
found on our website.
All papers can be submitted through OpenReview:
https://openreview.net/group?id=GenBench.org/2023/Workshop.
Collaborative Benchmarking Task submissions can be made via
https://github.com/GenBench/genbench_cbt.
We also accept regular workshop submissions (papers of category 1) through
the ACL Rolling Review system. Authors that have their ARR reviews ready
may submit their papers and reviews for consideration to the workshop.
Important dates
-
August 1, 2023 – Sample data submission deadline
-
September 1, 2023 – Paper submission deadline
-
September 15, 2023 – ARR submission deadline
-
October 6, 2023 – Notification deadline
-
October 18, 2023 – Camera-ready deadline
-
December 6, 2023 – Workshop
Note: all deadlines are 11:59 PM UTC-12:00
Dual submissions
We allow dual submissions with EMNLP and encourage relevant papers that
were dual-submitted and accepted at EMNLP to redirect to a non-archival
extended abstract submission.
We furthermore welcome submissions of extended abstracts that describe work
already presented at an earlier venue, both in the collaborative
benchmarking task and in the regular submission track.
Preprints
We do not have an anonymity deadline. Preprints are allowed, both before
the submission deadline as well as after.
Scholarships
With the support of our workshop sponsor Amazon, we are offering 6
scholarships each covering up to $500 of travel expenses and/or (virtual)
registration fees. Please check out our website for more information about
the application process.
Contact
Email address: genbench(a)googlegroups.com
Website: genbench.org/workshop
On behalf of the GenBench team,
Dieuwke Hupkes
Khuyagbaatar Batsuren
Koustuv Sinha
Amirhossein Kazemnejad
Christos Christodoulopoulos
Ryan Cotterell
Elia Bruni
Verna Dankers
Hi all,
If you'd like to find out more about Corpus linguistics programmes (MA, Post-graduate Certificate) as well as individual courses for credit offered by Lancaster University - all online learning -
please join us tomorrow for a free webinar:
https://forms.office.com/e/82JE20Wx88
Best,
Vaclav
Professor Vaclav Brezina
Professor in Corpus Linguistics
Department of Linguistics and English Language
ESRC Centre for Corpus Approaches to Social Science
Faculty of Arts and Social Sciences, Lancaster University
Lancaster, LA1 4YD
Office: County South, room C05
T: +44 (0)1524 510828
[cid:5d86929e-b78e-40c0-87c3-97c565294b0d]@vaclavbrezina
[cid:584b3354-e395-47db-8cef-48305a301d03]<http://www.lancaster.ac.uk/arts-and-social-sciences/about-us/people/vaclav-…>
*Call for Papers*: The sixth edition of BlackboxNLP, co-located with EMNLP
2023, in Singapore.
*Important dates*
---------------------
September 1, 2023 – Submission deadline.
October 6, 2023 – Notification of acceptance.
October 18, 2023 – Camera-ready papers due.
December 7, 2023 – Workshop.
Note: All deadlines are 11:59PM UTC-12 (anywhere on Earth).
*Workshop description:*
-----------------
Many recent performance improvements in NLP have come at the cost of
understanding of the systems. How do we assess what representations and
computations models learn? How do we formalize desirable properties of
interpretable models, and measure the extent to which existing models
achieve them? How can we build models that better encode these properties?
What can new or existing tools tell us about these systems’ inductive
biases?
The goal of this workshop is to bring together researchers focused on
interpreting and explaining NLP models by taking inspiration from fields
such as machine learning, psychology, linguistics, and neuroscience. We
hope the workshop will serve as an interdisciplinary meetup that allows for
cross-collaboration.
Topics of interest include, but are not limited to:
* Applying analysis techniques from neuroscience to analyze
high-dimensional vector representations in artificial neural networks;
* Analyzing the network’s response to strategically chosen input in order
to infer the linguistic generalizations that the network has acquired;
* Examining network performance on simplified or formal languages;
* Mechanistic interpretability, reverse engineering approaches to
understanding particular properties of neural models;
* Proposing modifications to neural architectures that increase their
interpretability;
* Testing whether interpretable information can be decoded from
intermediate representations;
* Explaining specific model predictions made by neural networks;
* Generating and evaluating the quality of adversarial examples in NLP;
* Developing open-source tools for analyzing neural networks in NLP;
* Evaluating the analysis results: how do we know that the analysis is
valid?
*Submissions*
-----------------
We call for two types of papers:
1) Archival papers. These are papers reporting on completed, original and
unpublished research, with a maximum length of 8 pages + references. Papers
shorter than this maximum are also welcome. Accepted papers are expected to
be presented at the workshop and will be published in the workshop
proceedings. They should report on obtained results rather than intended
work. These papers will undergo double-blind peer-review, and should thus
be anonymized.
2) Extended abstracts. These may report on work in progress or may be cross
submissions that have already appeared in a non-NLP venue. The extended
abstracts are of maximum 2 pages + references. These submissions are
non-archival in order to allow submission to another venue. The selection
will not be based on a double-blind review and thus submissions of this
type need not be anonymized.
Submissions should follow the official EMNLP 2023 style guidelines.
*The submission site is:*
https://www.softconf.com/emnlp2023/BlackboxNLP
*Organizers*
-----------------
Yonatan Belinkov, Technion
Najoung Kim, Boston University
Sophie Hao, New York University
Arya McCarthy, Johns Hopkins University
Jaap Jumelet, University of Amsterdam
Hosein Mohebbi, Tilburg University
*Contact*
---------------------
Please contact the organizers at blackboxnlp(a)googlegroups.com for any
questions.
Read more:
https://www.aclweb.org/portal/content/blackboxnlp-2023-6th-workshop-analysi…
[Apologies for cross-posting]
The University of Gothenburg, Sweden, is offering four fully-funded PhD positions in computer science and engineering where the candidates can choose the project themselves out of fourteen options.
Two of the projects are related to NLP, one about efficient algorithms for corpus searching, and another about automatic generation of Wikipedia articles. See the ad for more information:
https://web103.reachmee.com/ext/I005/1035/job?site=7&lang=UK&validator=9b89…
The positions are fully funded for 5 years, including 20% teaching or other departemental duties.
Application deadline: 20 August 2023
best regards,
Peter Ljunglöf
------- ------ ----- ---- --- -- - - - - -
peter ljunglöf
peter.ljunglof(a)gu.se
data- och informationsteknik, och språkbanken
göteborgs universitet och chalmers tekniska högskola
-------------- --------- -------- ------- ------ ----- ---- --- -- - - - - -
Dear all
Just wanted to let you know that APJCR Vol. 3, No. 1 is now available to
view online.
http://icr.or.kr/ejournals-apjcr
CK
---
*CK Jung BEng(Hons) Birmingham MSc Warwick EdD Warwick Cert Oxford*
Department of English Language and Literature, Incheon National
University, *South
Korea*
Vice President | The Korea Association of Primary English Education
(KAPEE), *South Korea*
Vice President | The Korea Association of Secondary English Education
(KASEE), *South Korea*
Director | Institute for Corpus Research, Incheon National University, *South
Korea* (http://icr.or.kr)
Editor | Asia Pacific Journal of Corpus Research, ICR, *International* (
http://icr.or.kr/apjcr)
Deputy Editor | Korean Journal of English Language and Linguistics,
KASELL, *South
Korea*
Editorial Board | Corpora, Edinburgh University Press, *UK*
Editorial Board | English Today, Cambridge University Press, *UK*
E: ckjung(a)inu.ac.kr / T: +82 (0)32 835 8129
H(EN): http://ckjung.org
H(KR): http://prof1.inu.ac.kr/user/ckjung
*SIG Writing 2024*
*Call for Conference Submissions and Research School Participation*
The EARLI Special Interest Group *Writing*, *Paris Nanterre University*
<https://www.parisnanterre.fr/> (France), *Sorbonne Nouvelle University *
<http://www.univ-paris3.fr/>(France), and the *University of Turku *
<https://www.utu.fi/fi>(Finland) invite proposals to the *20th biennial SIG
Writing conference* to be held at Paris Nanterre University, Nanterre,
France, from 26-28 June 2024. Prior to the conference, from 24-25 June
2024, the *Research School will be held.*
The conference theme for SIG Writing 2024 is *ways2write*. Writing is a
ubiquitous social and professional practice in most parts of the world. It
takes on diverse, increasingly mixed and heterogeneous forms, manifests
itself over ever wider spans of life and activity, exploits more and more
media and technologies, and is increasingly fluid. The source and its
individual and collective characteristics (age, cognitive abilities,
human/non-human...), the medium, the context and the production objectives,
time and space—all give rise to a diversity of writings and ways of
writing, raising numerous questions in an interdisciplinary field. While
the conference traditionally hosts a diversity of topics and approaches,
proposals within the *ways2write* theme are particularly welcome.
More information and *call for conference papers and Research School*
participation at *SIG WRITING 2024 (google.com)*
<https://sites.google.com/view/sig-writing-2024/conference-home?authuser=0>
*Submission deadline: October 31, 2023*
*Submission guidelines*
The aim of the conference is to promote interaction among researchers who
are interested in understanding the cognitive, social, and developmental
processes involved in writing, who are concerned with designing writing
instruction in various educational settings, or who are engaged with
exploring the functions of writing in different social and institutional
contexts. The scope of the conference is broad, and we hope to draw
together a wide range of researchers and professionals.
*CONFERENCE*: Presentation formats are Paper, Symposium, Poster,
Roundtable, and Demonstration Session.
The abstract should be 250-350 words (including any references). All
proposals must be written in English and submitted using the online
submission system: https://www.earli-eapril.org
PhD students and early career researchers are invited to submit to and
participate in the pre-conference Research School.
*RESEARCH SCHOOL*: Presentation formats are Paper Presentation, Poster or
30' Article Manuscript Discussion.
The abstract should be 350 words (including any references). All proposals
must be written in English and submitted using the online submission
system: https://www.earli-eapril.org
For any questions, please contact us at *sigwriting24.paris(a)gmail.com*
<sigwriting24.paris(a)gmail.com>
You can also follow the updates on Twitter: *@SIGWriting2024 *
<https://twitter.com/SIGWriting2024>
And Facebook: SigWriting Paris
**************************************************************
16th Workshop on Building and Using Comparable Corpora (BUCC)
with Shared Task on Multilingual Terminology Extraction
from Comparable Specialized Corpora
Co-located with RANLP 2023
September 7 or 8, 2023
Workshop website: https://comparable.limsi.fr/bucc2023/
Shared task website: https://comparable.limsi.fr/bucc2023/bucc2023-task.html
RANLP website: http://ranlp.org/ranlp2023/
Workshop proceedings to be published in ACL Anthology
Invited speaker: Sida I. Wang, Meta AI (FAIR)
**************************************************************
MOTIVATION
In the language engineering and the linguistics communities, research in
comparable corpora has been motivated by two main reasons. In language
engineering, on the one hand, it is chiefly motivated by the need to use
comparable corpora as training data for statistical NLP applications
such as statistical and neural machine translation or cross-lingual
retrieval. In linguistics, on the other hand, comparable corpora are of
interest because they enable cross-language discoveries and comparisons.
It is generally accepted in both communities that comparable corpora
consist of documents that are comparable in content and form in various
degrees and dimensions across several languages. Parallel corpora are on
the one end of this spectrum, unrelated corpora on the other.
Comparable corpora have been used in a range of applications, including
Information Retrieval, Machine Translation, Cross-lingual text
classification, etc.? The linguistic definitions and observations
related to comparable corpora can improve methods to mine such corpora
for applications of statistical NLP, for example to extract parallel
corpora from comparable corpora for neural MT. As such, it is of great
interest to bring together builders and users of such corpora.
TOPICS
We solicit contributions on all topics related to comparable (and
parallel) corpora, including but not limited to the following:
Building Comparable Corpora:
* Automatic and semi-automatic methods
* Methods to mine parallel and non-parallel corpora from the web
* Tools and criteria to evaluate the comparability of corpora
* Parallel vs non-parallel corpora, monolingual corpora
* Rare and minority languages, across language families
* Multi-media/multi-modal comparable corpora
Applications of comparable corpora:
* Human translation
* Language learning
* Cross-language information retrieval & document categorization
* Bilingual and multilingual projections
* (Unsupervised) Machine translation
* Writing assistance
* Machine learning techniques using comparable corpora
Mining from Comparable Corpora:
* Cross-language distributional semantics, word embeddings and
pre-trained multilingual transformer models
* Extraction of parallel segments or paraphrases from comparable corpora
* Methods to derive parallel from non-parallel corpora (e.g. to provide
for low-resource languages in neural machine translation)
* Extraction of bilingual and multilingual translations of single words,
multi-word expressions, proper names, named entities, sentences,
paraphrases etc. from comparable corpora
* Induction of morphological, grammatical, and translation rules from
comparable corpora
* Induction of multilingual word classes from comparable corpora
Comparable Corpora in the Humanities:
* Comparing linguistic phenomena across languages in contrastive linguistics
* Analyzing properties of translated language in translation studies
* Studying language change over time in diachronic linguistics
* Assigning texts to authors via authors' corpora in forensic linguistics
* Comparing rhetorical features in discourse analysis
* Studying cultural differences in sociolinguistics
* Analyzing language universals in typological research
IMPORTANT DATES
July 18, 2023: Paper submission deadline
July 31, 2021: Notification of acceptance
August 25, 2021: Camera ready final papers
September 7 or 8, 2023: Workshop date
For updates see the workshop website at
https://comparable.limsi.fr/bucc2023/
PRACTICAL INFORMATION
Workshop registration is via the main conference registration site,
see http://ranlp.org/ranlp2023/index.php/fees-registration/
The workshop proceedings will be published in the ACL Anthology.
SUBMISSION GUIDELINES
Please follow the style sheet and templates (for LaTeX, Overleaf and
MS-Word) provided for the main conference at
http://ranlp.org/ranlp2023/index.php/submissions/
Papers should be submitted as a PDF file using the START conference
manager at https://softconf.com/ranlp23/BUCC/
Submissions must describe original and unpublished work and range
from 4 to 8 pages plus unlimited references.
Reviewing will be double blind, so the papers should not reveal the
authors' identity. Accepted papers will be published in the workshop
proceedings, which will be included in the ACL Anthology.
Double submission policy: Parallel submission to other meetings or
publications is possible but must be immediately (i.e. as soon as known
to the authors) notified to the workshop organizers by e-mail.
For further information and updates see the BUCC 2023 website:
https://comparable.limsi.fr/bucc2023/
BUCC 2023 SHARED TASK
Bilingual Term Alignment in Comparable Specialized Corpora
The BUCC 2023 shared task is on multilingual terminology alignment in
comparable corpora. Many research groups are working on this problem
using a wide variety of approaches. However, as there is no standard way
to measure the performance of the systems, the published results are not
comparable and the pros and cons of the various approaches are not
clear. The shared task aims at solving these problems by organizing a
fair comparison of systems. This is accomplished by providing corpora
and evaluation datasets for a number of language pairs and domains.
Moreover, the importance of dealing with multi-word expressions in
Natural Language Processing applications has been recognized for a long
time. In particular, multi-word expressions pose serious challenges for
machine translation systems because of their syntactic and semantic
properties. Furthermore, multi-word expressions tend to be more
frequent in domain-specific text, hence the need to handle them in tasks
with specialized-domain corpora.
Through the 2023 BUCC shared task, we seek to evaluate methods that
detect pairs of terms that are translations of each other in two
comparable corpora, with an emphasis on multi-word terms in specialized
domains.
For the schedule and further details see the shared task website at
https://comparable.limsi.fr/bucc2023/bucc2023-task.html
WORKSHOP ORGANIZERS
* Reinhard Rapp (University of Mainz and Magdeburg-Stendal University of
Applied Sciences, Germany)
* Pierre Zweigenbaum (Université Paris-Saclay, CNRS, LISN, Orsay, France)
* Serge Sharoff (University of Leeds, United Kingdom)
Contact workshop: reinhardrapp (at) gmx (dot) de
Contact shared task: pz (at) lisn (dot) fr
PROGRAMME COMMITTEE
* Ebrahim Ansari (Institue for Advanced Studies in Basic Sciences, Iran)
* Thierry Etchegoyhen (Vicomtech, Spain)
* Philippe Langlais (Université de Montréal, Canada)
* Yves Lepage (Waseda University, Japan)
* Shervin Malmasi (Amazon, USA)
* Emmanuel Morin (Université de Nantes, France)
* Dragos Stefan Munteanu (RWS, USA)
* Reinhard Rapp (University of Mainz and Magdeburg-Stendal University of
Applied Sciences, Germany)
* Nasredine Semmar (CEA LIST, Paris, France)
* Serge Sharoff (University of Leeds, UK)
* Richard Sproat (OGI School of Science & Technology, USA)
* Tim Van de Cruys (KU Leuven, Belgium)
* Pierre Zweigenbaum (Université Paris-Saclay, CNRS, LISN, Orsay, France)
Apologies for cross posting
International workshop
NLP for translation and interpreting applications (NLP4TIA)
Varna, Bulgaria, 7/8 September 2023
https://nlp4tia.web.uah.es/
Last Call for Papers – Extended Deadline
In the last two decades, we have been able to witness a technological turn in translation and interpreting studies with Natural Language Processing (NLP) and deep learning playing more and more prominent part. There is already a growing number of NLP applications that are used to support the work of translators and interpreters. In addition, the recent advances in (and latest models of) deep learning have powered the further development and success of high performing Neural Machine Translation (NMT) systems.
Translation technology has revolutionised the translation profession and nowadays most professional translators employ tools such as translation memory (TM) systems in their daily work. Latest advances of Neural Machine Translation (NMT) have resulted in NMT not only becoming an integral part of most state-of-the art TM tools but also typical for the translation workflow of many companies, organisations and freelance translators.
Although translation has benefited more from technological advances, interpreting has also experienced a technological turn. However, it has not been until some years ago that soft technology has permeated interpreting practice and research. Computer assisted translation, MT and NLP tools have been adapted to be used by interpreters. In addition, corpus-based studies have also underpinned dialogue interpreting.
The increasing interest in NLP, MT and the automation of processes has brought us to multidisciplinary projects that deal with the development of models for automated oral communication. Machine interpreting has already been developed and is being improved, focusing on speed and accuracy matters. Either domain-specific (commercial, military, humanitarian) or general (Skype Translator), there is still a long way to go to render machine interpreting more human-like.
Many of the above recent developments have to do with the employment of Natural Language Processing tools and resources to support the work of translators and interpreters. This workshop is expected to discuss the growing importance of NLP in different translation and interpreting scenarios.
Workshop topics
The workshop invites submissions reporting original unpublished work on topics including but not limited to:
* NLP and MT for under-resourced languages;
* Translation Memory systems;
* NLP and MT for translation memory systems;
* NLP for CAT and CAI tools;
* Integration of NLP tools in remote interpreting platforms;
* NLP for dialogue interpreting;
* Development of NLP based applications for communication in public service settings (healthcare, education, law, emergency services);
* Corpus-based studies applied to translation and interpreting.;
* Machine translation and machine interpreting;
* Resources for translation and machine translation;
* Resources for interpreting and interpreting technology application;
* Quality estimation of human and machine translation;
* Post-editing strategies and tools;
* Automatic post-editing of MT;
* NLP and MT for subtitling.
* Technology acceptance by interpreters and translations;
* Machine Translation and translation tools for literary texts;
* Evaluation of machine translation and translation and interpreting tools in general;
* The impact of the technological turn in translation and interpreting;
* Cognitive effort and eye-tracking experiments in translation and interpreting;
* Development of models for research and practice of translation and interpreting;
* Multidisciplinary cooperation in NLP applied to translation and interpreting.
Submissions and publication
Submissions must consist of full-text papers and should not exceed 7 pages excluding references, they should be a minimum of 5 pages long. The accepted papers will be published as NLP4TIAworkshop e-proceedings with ISBN, will be assigned a DOI and will be also available at the time of the conference. The papers should be in English and should be submitted via the conference management system START using this link<https://softconf.com/ranlp23/NLP4TIA/>.
Authors of accepted papers will receive guidelines regarding how to produce camera-ready versions of their papers for inclusion in the proceedings.
Each submission will be reviewed by at least two programme committee members. Accepted papers will be presented orally as part of the programme of the workshop.
Submissions should be compliant with the below templates and should be uploaded as pdf files in START (START is configured to accept pdf files only).
The following templates should be used: LaTeX at Overleaf<https://www.overleaf.com/latex/templates/instructions-for-ranlp-2023-procee…>, LaTeX<http://ranlp.org/ranlp2023/Templates/ranlp2023-LaTeX.zip> , MS Office<http://ranlp.org/ranlp2023/Templates/ranlp2023-word.docx>
Important dates
Deadline for paper submission: 10 July 2023
Deadline for paper submission (extended): 23 July 2023
Acceptance notification: 10 August 2023
Final camera-ready version: 25 August 2023
Workshop camera-ready proceedings ready: 31 August 2023
NLP4TIA workshop: 7/8 September 2023
Workshop Chairs
Raquel Lázaro Gutiérrez (Universidad de Alcalá)
Antonio Pareja Lora (Universidad de Alcalá)
Ruslan Mitkov (Lancaster University)
Programme Committee
Cristina Aranda (Big Onion)
Juanjo Arevalillo (Hermes Traducciones)
Silvia Bernardini (University of Bologna)
Gabriel Cabrera Méndez (Dualia Teletraducciones)
Matt Coler (University of Groningen)
Elena Davitti (University of Surrey)
Joanna Drugan (Heriot-Watt University)
Marie Escribe (LanguageWire)
Claudio Fantinuoli (Mainz University/KUDO Inc)
Antonio García Cabot (Universidad de Alcalá)
Adriana Jaime Pérez (Migralingua Voze)
Miguel Ángel Jiménez Crespo (Rutgers University)
Óscar Luis Jiménez Serrano (University of Granada)
Koen Kerremans (Free University Brussel)
Maria Kunilovskaya (Saarland University)
Els Lefever (Ghent University)
Pilar León Arauz (University of Granada)
Johanna Monti (University of Naples L’Orientale)
Elena Montiel Ponsoda (Polytechnic University of Madrid)
Helena Moriz (University of Lisbon)
Elena Murgolo (Orbital 14)
Dora Murgu (Interprefy)
Constantin Orasan (University of Surrey)
María Teresa Ortego Antón (University of Valladolid)
Tharindu Ranasinghe (Aston University)
Celia Rico (Universidad Complutense de Madrid)
Caroline Rossi (University Grenoble les Alpes)
María del Mar Sánchez Ramos (Universidad de Alcalá)
Miriam Seghiri (University of Malaga)
Vilelmini Sosoni (Ionian University)
Rui Manuel Sousa Silva (University of Porto)
Nicoletta Spinolo (University of Bologna)
Venue
The workshop will take place at hotel Cherno More<https://www.chernomorebg.com/en/> in Varna.
Further information and contact details
Registration for NLP4TIA is now open and is done via the RANLP main conference page. To register, please complete the registration form<https://url6.mailanyone.net/scanner?m=1pii0v-000B6E-3x&d=4%7Cmail%2F14%2F16…>.
The conference website (
https://nlp4tia.web.uah.es/) will be updated on a regular basis. For further information, please email raquel.lazaro(a)uah.es<mailto:raquel.lazaro@uah.es>.
> [Apologies for cross-posting]
> ======================================================================
> NEW DATES - CFP - SIMBig 2023
> ======================================================================
>
> SIMBig 2023 - 10th International Conference on Information Management and Big Data
> Where: Instituto Politécnico Nacional, Mexico DF, MEXICO
> When: October 18 - 20, 2023
> Website: https://simbig.org/SIMBig2023/
>
> ======================================================================
>
> OVERVIEW
> ----------------------------------
>
> SIMBig 2023 seeks to present new methods of Artificial Intelligence (AI), Data Science, Machine Learning, Natural Language Processing, Semantic Web, and related fields, for analyzing, managing, and extracting insights and patterns from large volumes of data.
>
>
> KEYNOTE SPEAKERS (to be confirmed)
> ----------------------------------
>
 Mona Diab, Meta AI, USA
 Finale Doshi-Velez, Harvard University, USA
 Huan Liu, Arizona State University, USA
>
> IMPORTANT DATES
> ----------------------------------
>
> July 24, 2023 --> Full papers and short papers due
> August 28, 2023 --> Notification of acceptance
> September 10, 2023 --> Camera-ready versions
> October 18 - 20, 2023 --> Conference held in Mexico DF, Mexico
>
> PUBLICATION
> ----------------------------------
>
> All accepted papers of SIMBig 2023 (tracks including) will be published with Springer CCIS Series <https://www.springer.com/series/7899> (to be confirmed).
>
> Best papers of SIMBig 2023 (tracks including) will be selected to submit an extension to be published in the Springer SN Computer Science Journal. <https://www.springer.com/journal/42979>
 
> TOPICS OF INTEREST
> ----------------------------------
>
> SIMBig 2023 has a broad scope. We invite contributions on theory and practice, including but not limited to the following technical areas:
>
> Artificial Intelligence
> Big/Masive Data
> Data Science
> Machine Learning
> Deep Learning
> Natural Language Processing
> Semantic Web
> Data-driven Software Engineering
> Data-driven software adaptation
> Healthcare Informatics
> Biomedical Informatics
> Data Privacy and Security
> Information Retrieval
> Ontologies and Knowledge Representation
> Social Networks and Social Web
> Information Visualization
> OLAP and Business intelligence
> Crowdsourcing
>
> SPECIAL TRACKS
> ----------------------------------
>
> SIMBig 2023 proposes six special tracks in addition to the main conference:
>
> ANLP <https://simbig.org/SIMBig2023/en/anlp.html> - Applied Natural Language Processing
> DISE <https://simbig.org/SIMBig2023/en/dise.html> - Data-Driven Software Engineering
> EE-AI-HPC <https://simbig.org/SIMBig2023/en/eeaihpc.html> - Efficiency Enhancement for AI and High-Performance Computing
> SNMAM <https://simbig.org/SIMBig2023/en/snmam.html> - Social Network and Media Analysis and Mining
>
> CONTACT
> ----------------------------------
>
> SIMBig 2023 General Chairs
>
> Juan Antonio Lossio-Ventura, National Institutes of Health, USA (juan.lossio(a)nih.gov <mailto:juan.lossio@nih.gov>)
> Hugo Alatrista-Salas, Pontificia Universidad Católica del Perú, Peru (halatrista(a)pucp.pe <mailto:halatrista@pucp.pe>)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Ninth Italian Conference on Computational Linguistics
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
30 November - 2 December, 2023, Venice, Italy
Second Call for Papers (and deadline extension)
https://clic2023.ilc.cnr.it/
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
The Italian Conference on Computational Linguistics, CLiC-it, aims at establishing a reference forum for the Italian community of researchers working in the fields of Computational Linguistics (CL) and Natural Language Processing (NLP). CLiC-it promotes and disseminates high-level, original research on all aspects of automatic language processing, both written and spoken, and targets state-of-the-art theoretical results, experimental methodologies, technologies, as well as application perspectives, which may contribute to the advancement of the CL and NLP fields.
The spirit of the conference is inclusive. In the conviction that the complexity of language phenomena needs cross-disciplinary competences, CLiC-it intends to bring together researchers of related disciplines such as Computational Linguistics, Natural Language Processing, Linguistics, Cognitive Science, Machine Learning, Computer Science, Knowledge Representation, Information Retrieval, and Digital Humanities. CLiC-it is open to contributions on all languages, with a particular emphasis on Italian.
The ninth edition of CLiC-it will be held in Venice, on 30 November - 2 December, 2023. CLiC-it is organised by the Italian Association of Computational Linguistics (AILC -- www.ai-lc.it).
Submission Format
-------------------------
CLiC-it 2023 invites the submission of long and short papers in all aspects of Computational Linguistics and Natural Language Processing.
CLiC-it 2023 aims to have a broad technical program. Relevant topics for the conference include, but are not limited to (in alphabetical order):
- Computational Historical Linguistics
- Computational Social Science and Cultural Analytics
- Dialogue and Interactive Systems
- Discourse and Pragmatics
- Ethics and NLP
- Generation
- Handwritten Text Recognition
- Information Extraction
- Information Retrieval and Text Mining
- Interpretability and Analysis of Models for NLP
- Language Grounding to Vision, Robotics and Beyond
- Large Language Models
- Linguistic Diversity
- Linguistic Theories, Cognitive Modeling, and Psycholinguistics
- Machine Learning for NLP
- Machine Translation
- Multilingualism and Cross-Lingual NLP
- NLP Applications
- NLP for the Humanities
- Phonology, Morphology, and Word Segmentation
- Pragmatics and Creativity
- Question Answering
- Resources and Evaluation
- Semantics: Lexical, Sentence-level Semantics, Textual Inference, and Other Areas
- Sentiment Analysis, Stylistic Analysis, and Argument Mining
- Speech and Multimodality
- Summarization
- Syntax: Tagging, Chunking and Parsing
Paper Types and Formats
---------------------------------
- Long papers must describe substantial, original, completed, and unpublished work. Wherever appropriate, concrete evaluation and analysis should be included. Long papers may consist of up to six (6) pages of content, plus unlimited pages of references and appendices. Final versions of long papers will be given one additional page of content (up to seven (7) pages), so that reviewers’ comments can be taken into account. Please note that, following CEUR specifications, the minimum length of the camera-ready version of a long paper is 25,000 characters, spaces and references included.
- Short paper submissions must describe original and unpublished work. Please note that a short paper is not a shortened long paper. Instead, short papers should have a small, focused contribution - a point that can be made in a few pages with sufficient level of detail. Short papers may consist of up to three (3) pages of content, plus unlimited pages of references and appendices. Final versions of short papers will be given one additional page of content (up to four (4) pages), so that reviewers’ comments can be taken into account. Please note that, following CEUR specifications, the minimum length of the camera-ready version of a short paper is 12,500 characters, spaces and references included.
In particular, CLiC-it 2023 welcomes the following kinds of contributions:
- Computationally-aided linguistic and literary analysis (of either models or data resources)
- NLP engineering experiment
- Reproduction study
- New data resources, particularly for Italian and for low-resource languages
- Approaches for data- and compute efficiency
- Position papers
- Surveys
- Publicly available software and pre-trained models
Both, long and short papers, will be evaluated according to the following criteria:
- soundness of approach
- relevance to computational linguistics
- novelty and clarity of relation with related work
- quality of presentation
- quality of evaluation (only for long papers, if applicable)
- verifiability and ability to replicate (only for long papers, if applicable)
While there is no direct mapping between types of contributions and paper length, some kinds of papers naturally gravitate towards a certain length: e.g. surveys are more likely to be long rather than short papers. One paper can make more than one contribution of different types (e.g., a paper presenting a new resource and experiments using the resource).
Papers can be either in English or Italian, with the abstract both in English and Italian.
Papers must be submitted through the OpenReview platform using the following link: https://openreview.net/group?id=AI-LC.it/CLiC-it/2023/Conference
Both long and short papers must follow the CLiC-it two-column format, using the supplied official style files. We strongly recommend the use of LaTeX style files or Microsoft Word style files according to the following formats:
- LaTex Template: https://clic2023.ilc.cnr.it/wp-content/uploads/2023/04/clic2023.zip
- Word Template: https://clic2023.ilc.cnr.it/wp-content/uploads/2023/04/clic2023.odt
Please do not modify these style files, nor should you use templates designed for other conferences. Submissions that do not conform to the required styles, including paper size, margin width, and font size restrictions, will be rejected without review.
Reviewing will NOT be blind, so there is no need to remove author information from manuscripts.
Research Communication Papers
-------------------------------------------
CLiC-it 2023 favours a parallel submission policy for outstanding papers that have been submitted and accepted elsewhere in 2022 and 2023. If you are the author of a paper accepted at a major international CL conference or journal in 2022 or 2023, you can submit your work at CLiC-it 2023 in the form of a short research communication, within a dedicated session at the conference. Research communications will not be published in the proceedings, but are mostly intended to enforce dissemination of excellence in research within the Italian CL community. The papers submitted must be in scope for the CLiC-it 2023 conference.
The authors of any paper that meets the above criteria are invited to submit a written (maximum) one-page document with the abstract of the original paper, the paper’s title, the paper’s authors, and a pointer to the original journal paper at the conference or journal Web site.
Authors will be invited to present their papers at CLiC-it 2023 after a check that the paper satisfies the above listed criteria. As the papers have already been reviewed and published, they will not be reviewed again. In the case that an exceptionally high number of submissions is received, a selection will be made, also based on the original publication venue. Priority will be given to papers that better fit the conference program, offering a balance across the conference topics.
Awards
----------
In order to award the work of young researchers, the title of “best paper” will be attributed to the best papers in different research areas with a Master/PhD student among the authors and presenting the work at the conference. The awarded authors will be invited to submit an extended version of their papers to the Italian Journal of Computational Linguistics (IJCoL).
To recognise excellence in student research as well as promote awareness of our field and with the endorsement of AILC, we are conferring the “Emanuele Pianta” prize for the best Master Thesis (Laurea Magistrale) in Computational Linguistics submitted at an Italian University. The prize consists of 500 euro plus free membership to AILC for one year and free registration to the upcoming CLiC-it, where the author will have the chance to present her/his thesis. The complete call is available on the conference website at: “Calls > AILC Master Thesis Award”.
Multiple Submission Policy
-----------------------------------
CLiC-it 2023 allows for a multiple submission policy. In case of acceptance of the paper in other venues, the authors must communicate this information to the CLiC-it 2023 Chairs as soon as possible.
Important Dates
----------------------
20/07/2023 31/07/2023 at 10AM (extended): Paper submission deadline: long, short, research communications
29/09/2023: Notification to authors of reviewing outcome
20/10/2023: Camera ready version of accepted papers
30/11 - 2/12/2023: CLiC-it Conference, Venice
People
---------
Conference Chairs:
- Federico Boschetti (Institute for Computational Linguistics "A. Zampolli", CNR / Ca' Foscari University of Venice)
- Gianluca E. Lebani (Ca' Foscari University of Venice)
- Bernardo Magnini (Fondazione Bruno Kessler)
- Nicole Novielli (University of Bari "Aldo Moro")
Local Organizing Committee:
- Franz Fischer (Ca' Foscari University of Venice)
- Francesca Dall’Igna (IUSS, School for Advanced Studies Pavia)
- Luca Molinari (Ca' Foscari University of Venice / University of Warsaw)
- Cristina Procentese (Ca' Foscari University of Venice)
Proceedings Chairs:
- Alice Suozzi (Ca' Foscari University of Venice)
- Andrea Zaninello (Fondazione Bruno Kessler)
Webmasters:
- Eleonora Ghizzota (University of Bari "Aldo Moro")
- Tiziano Labruna (Fondazione Bruno Kessler)
Publicity Chair
- Sofia Brenna (Fondazione Bruno Kessler)
Further information
--------------------------
Conference website: https://clic2023.ilc.cnr.it/
Mail: clicit2023(a)gmail.com
Twitter: @CLiC_it_conf
--
--
Le informazioni contenute nella presente comunicazione sono di natura
privata e come tali sono da considerarsi riservate ed indirizzate
esclusivamente ai destinatari indicati e per le finalità strettamente
legate al relativo contenuto. Se avete ricevuto questo messaggio per
errore, vi preghiamo di eliminarlo e di inviare una comunicazione
all’indirizzo e-mail del mittente.
--
The information transmitted is
intended only for the person or entity to which it is addressed and may
contain confidential and/or privileged material. If you received this in
error, please contact the sender and delete the material.