Dear all,
We are happy to remind that there will be a full-day workshop on
computational approaches to historical language change (LChange’23)
co-located with EMNLP (December 6-10, 2023).
The Second Call for Papers is below. The main updates compared to the
1st CfP are:
1. The submission page is now up and running at
https://openreview.net/group?id=EMNLP/2023/Workshop/LChange
2. To encourage resource sharing at the reviewing phase, model and
dataset papers do not need to be anonymous. The reviewing for the papers
of this type will thus be single-blind.
========================================
4th International Workshop on Computational Approaches to Historical
Language Change 2023 (LChange’23)
========================================
Website: https://www.changeiskey.org/event/2023-emnlp-lchange/
Date: Dec 6, 2023
Location: Singapore and online
Contact email: lchange2023(a)changeiskey.org
LChange'23 is the fourth workshop for computational approaches to
historical language change with the focus on digital text corpora. Come
join us for this exciting adventure!
The workshop builds upon its first iteration in 2019
(https://languagechange.org/events/2019-acl-lcworkshop/), and the
subsequent events (2021, 2022). LChange'19 resulted in a book on
Computational approaches to semantic change
(https://langsci-press.org/catalog/book/303).
This year, LChange will be colocated with EMNLP 2023 in Singapore, as a
hybrid event. The workshop will take place on Wednesday 6 December 2023.
The main topic of the workshop remains the same: all aspects around
computational approaches to historical language change with the focus on
digital text corpora. We hope to make this fourth edition another
resounding success!
==Important Dates==
September 1, 2023: Paper submission
October 6, 2023: Notification of acceptance
October 18, 2023: Camera-ready papers due
December 6, 2023: Workshop date
==Workshop Topics==
This workshop explores state-of-the-art computational methodologies,
theories and digital text resources on exploring the time-varying nature
of human language.
The aim of this workshop is three-fold. First, we want to provide
pioneering researchers who work on computational methods, evaluation,
and large-scale modelling of language change an outlet for disseminating
cutting-edge research on topics concerning language change. We want to
utilize this workshop as a platform for sharing state-of-the-art
research progress in this fundamental domain of natural language research.
Second, in doing so we want to bring together domain experts across
disciplines by connecting researchers in historical linguistics with
those that develop and test computational methods for detecting semantic
change and laws of semantic change; and those that need knowledge (of
the occurrence and shape) of language change, for example, in digital
humanities and computational social sciences where text mining is
applied to diachronic corpora subject to e.g., lexical semantic change.
Third, the detection and modelling of language change using diachronic
text and text mining raise fundamental theoretical and methodological
challenges for future research.
Besides these goals, this workshop will also support discussion on the
evaluation of computational methodologies for uncovering language
change. SemEval2020 Task 1 on unsupervised detection of lexical semantic
change attracted three figure submission numbers and a total of 21
submitted system papers. Since then, three more tasks have been
completed in Italian, Russian, and Spanish.
We invite original research papers from a wide range of topics,
including but not limited to:
- Novel methods for detecting diachronic semantic change and lexical
replacement
- Automatic discovery and quantitative evaluation of laws of language change
- Computational theories and generative models of language change
- Sense-aware (semantic) change analysis
- Diachronic word sense disambiguation
- Novel methods for diachronic analysis of low-resource languages
- Novel methods for diachronic linguistic data visualization
- Novel applications and implications of language change detection
- Quantification of sociocultural influences on language change
- Cross-linguistic, phylogenetic, and developmental approaches to
language change
- Novel datasets for cross-linguistic and diachronic analyses of language
==Keynote Talks==
To be announced. If you have any good suggestions, or anyone you would
like to listen to, please contact us.
==Submissions==
URL for submissions:
https://openreview.net/group?id=EMNLP/2023/Workshop/LChange.
We accept two types of submissions, long and short papers, following the
EMNLP 2023 style (you can also directly use the Overleaf template), and
the ACL submission policy:
* https://2023.emnlp.org/calls/style-and-formatting/
*
https://www.overleaf.com/latex/templates/instructions-for-emnlp-2023-procee…
*
https://www.aclweb.org/adminwiki/index.php?title=ACL_Policies_for_Submissio…
Long and short papers may consist of up to eight (8) and four (4) pages
of content, respectively, plus unlimited references; final versions will
be given one additional page of content so that reviewers' comments can
be taken into account.
LChange’23 also welcomes papers focusing on releasing a dataset or a
model; these papers fall into the short paper category. To encourage
resource sharing at the reviewing phase, model and dataset papers do not
need to be anonymous.
Accepted papers will be presented orally or as posters and included in
the workshop proceedings. Submissions are open to all, and are to be
submitted anonymously. All papers will be refereed through a
double-blind peer review process by at least three reviewers with final
acceptance decisions made by the workshop organizers.
==Contact==
Contact us if you have any questions: lchange2023(a)changeiskey.org
Organizers: Nina Tahmasebi, Syrielle Montariol, Haim Dubossarsky, Andrey
Kutuzov, Simon Hengchen, David Alfter, Francesco Periti, and Pierluigi
Cassotti.
If you have published in the field previously, and are interested in
helping out in the PC to review papers, send us an email.
==Anti-Harassment Policy==
Our workshop highly values the open exchange of ideas, the freedom of
thought and expression, and respectful scientific debate. We support and
uphold the ACL Anti-Harassment policy
(https://www.aclweb.org/adminwiki/index.php?title=Anti-Harassment_Policy),
and any workshop participant should feel free to contact any of the
workshop organizers or acl(a)aclweb.org, in case of any issues.
--
Andrey
Language Technology Group (LTG)
University of Oslo
*Call for Papers: GenBench, the first workshop on generalisation
(benchmarking) in NLP*
****New: the Collaborative Benchmarking Task submissions are now open;
visit *
*https://github.com/GenBench/genbench_cbt
<https://github.com/GenBench/genbench_cbt>.With the support of our workshop
sponsor Amazon, we are now offering scholarships for travel expenses.****
Workshop description
The ability to generalise well is often mentioned as one of the primary
desiderata for models of natural language processing.
It is crucial to ensure that models behave robustly, reliably and fairly
when making predictions about data that is different from the data that
they were trained on.
Generalisation is also important when NLP models are considered from a
cognitive perspective, as models of human language.
Yet, there are still many open questions related to what it means for an
NLP model to generalise well and how generalisation should be evaluated.
The first GenBench workshop aims to serve as a cornerstone to catalyse
research on generalisation in the NLP community.
In particular, the workshop aims to:
-
Bring together different expert communities to discuss challenging
questions relating to generalisation in NLP;
-
Crowd-source a collaborative generalisation benchmark hosted on a
platform for democratic state-of-the-art (SOTA) generalisation testing in
NLP.
The first GenBench workshop on generalisation (benchmarking) in NLP will be
co-located with EMNLP 2023.
Submission types
We call for two types of submissions: regular workshop submissions and
collaborative benchmarking task submissions.
The latter will consist of a data/task artefact and a companion paper
motivating and evaluating the submission. In both cases, we accept archival
papers and extended abstracts.
1. Regular workshop submissions
Regular workshop submissions present papers on the topic of generalisation
(see examples listed below) but are not intended to be included on the
GenBench evaluation platform.
Regular workshop papers may be submitted as an archival paper when they
report on completed, original and unpublished research; or as a shorter
extended abstract. More details on this category can be found below.
Topics of interest include, but are not limited to:
-
Opinion or position papers about generalisation and how it should be
evaluated;
-
Analyses of how existing or new models generalise;
-
Empirical studies that propose new paradigms to evaluate generalisation;
-
Meta-analyses that investigate how results from different generalisation
studies compare to one another;
-
Meta-analyses that study how different types of generalisation are
related;
-
Papers that discuss how generalisation of LLMs can be evaluated without
access to training data;
-
Papers that discuss why generalisation is (not) important in the era of
LLMs;
-
Studies on the relationship between generalisation and fairness or
robustness.
If you are unsure whether a specific topic is well-suited for submission,
feel free to reach out to the organisers of the workshop at
genbench(a)googlegroups.com.
2. Collaborative Benchmarking Task submissions
Collaborative benchmarking task submissions consist of a data/task artefact
and a paper describing and motivating the submission and showcasing it on a
select number of models.
We accept submissions that introduce new datasets, resplits of existing
datasets along particular dimensions, or in-context learning tasks, with
the goal of measuring generalisation of NLP models.
We especially encourage submissions that focus on:
-
Generalisation in the context of fairness and inclusivity;
-
Multilingual generalisation;
-
Generalisation in LLMs, where we have no control over the training data.
Each submission should contain information about the data (URIs, format,
preprocessing), model preparation (finetuning loss, ICL prompt templates),
and evaluation metrics. These will be defined either in a configuration
file or in code.
More details about the collaborative benchmarking task submissions and
example submissions can be found on our website: visit genbench.org/cbt for
more information or github.com/GenBench/genbench_cbt to prepare your
submission.
Note that there is a sample data submission deadline (August 1) in addition
to the final submission deadline (September 1).
Participants proposing previously unpublished datasets or splits may choose
to submit an archival paper or an extended abstract.
Generalisation evaluation datasets that have already been published
elsewhere (or will be published at EMNLP 2023) can be submitted to the
platform, as well, but only through an extended abstract, citing the
original publication.
We allow dual submissions with EMNLP. For more information, see below.
If you are in doubt about whether a particular type of dataset is suitable
for submission, please consult the information page on our website, or
reach out to the organisers of the workshop at genbench(a)googlegroups.com.
Archival vs extended abstract
Archival papers are up to 8 pages excluding references and report on
completed, original and unpublished research. They follow the requirements
of regular EMNLP 2023 submissions.
Accepted papers will be published in the workshop proceedings and are
expected to be presented at the workshop.
The papers will undergo double-blind peer review and should thus be
anonymised.
Extended abstracts can be up to 2 pages excluding references and may report
on work in progress or be cross-submissions of work that has already
appeared in another venue. Abstract titles will be posted on the workshop
website but will not be included in the proceedings.
Submission instructions
For both archival papers and extended abstracts, we refer to the EMNLP 2023
website for paper templates. Additional requirements for both regular
workshop papers and collaborative benchmarking task submissions can be
found on our website.
All papers can be submitted through OpenReview:
https://openreview.net/group?id=GenBench.org/2023/Workshop.
Collaborative Benchmarking Task submissions can be made via
https://github.com/GenBench/genbench_cbt.
We also accept regular workshop submissions (papers of category 1) through
the ACL Rolling Review system. Authors that have their ARR reviews ready
may submit their papers and reviews for consideration to the workshop.
Important dates
-
August 1, 2023 – Sample data submission deadline
-
September 1, 2023 – Paper submission deadline
-
September 15, 2023 – ARR submission deadline
-
October 6, 2023 – Notification deadline
-
October 18, 2023 – Camera-ready deadline
-
December 6, 2023 – Workshop
Note: all deadlines are 11:59 PM UTC-12:00
Dual submissions
We allow dual submissions with EMNLP and encourage relevant papers that
were dual-submitted and accepted at EMNLP to redirect to a non-archival
extended abstract submission.
We furthermore welcome submissions of extended abstracts that describe work
already presented at an earlier venue, both in the collaborative
benchmarking task and in the regular submission track.
Preprints
We do not have an anonymity deadline. Preprints are allowed, both before
the submission deadline as well as after.
Scholarships
With the support of our workshop sponsor Amazon, we are offering 6
scholarships each covering up to $500 of travel expenses and/or (virtual)
registration fees. Please check out our website for more information about
the application process.
Contact
Email address: genbench(a)googlegroups.com
Website: genbench.org/workshop
On behalf of the GenBench team,
Dieuwke Hupkes
Khuyagbaatar Batsuren
Koustuv Sinha
Amirhossein Kazemnejad
Christos Christodoulopoulos
Ryan Cotterell
Elia Bruni
Verna Dankers
Hi all,
If you'd like to find out more about Corpus linguistics programmes (MA, Post-graduate Certificate) as well as individual courses for credit offered by Lancaster University - all online learning -
please join us tomorrow for a free webinar:
https://forms.office.com/e/82JE20Wx88
Best,
Vaclav
Professor Vaclav Brezina
Professor in Corpus Linguistics
Department of Linguistics and English Language
ESRC Centre for Corpus Approaches to Social Science
Faculty of Arts and Social Sciences, Lancaster University
Lancaster, LA1 4YD
Office: County South, room C05
T: +44 (0)1524 510828
[cid:5d86929e-b78e-40c0-87c3-97c565294b0d]@vaclavbrezina
[cid:584b3354-e395-47db-8cef-48305a301d03]<http://www.lancaster.ac.uk/arts-and-social-sciences/about-us/people/vaclav-…>
*Call for Papers*: The sixth edition of BlackboxNLP, co-located with EMNLP
2023, in Singapore.
*Important dates*
---------------------
September 1, 2023 – Submission deadline.
October 6, 2023 – Notification of acceptance.
October 18, 2023 – Camera-ready papers due.
December 7, 2023 – Workshop.
Note: All deadlines are 11:59PM UTC-12 (anywhere on Earth).
*Workshop description:*
-----------------
Many recent performance improvements in NLP have come at the cost of
understanding of the systems. How do we assess what representations and
computations models learn? How do we formalize desirable properties of
interpretable models, and measure the extent to which existing models
achieve them? How can we build models that better encode these properties?
What can new or existing tools tell us about these systems’ inductive
biases?
The goal of this workshop is to bring together researchers focused on
interpreting and explaining NLP models by taking inspiration from fields
such as machine learning, psychology, linguistics, and neuroscience. We
hope the workshop will serve as an interdisciplinary meetup that allows for
cross-collaboration.
Topics of interest include, but are not limited to:
* Applying analysis techniques from neuroscience to analyze
high-dimensional vector representations in artificial neural networks;
* Analyzing the network’s response to strategically chosen input in order
to infer the linguistic generalizations that the network has acquired;
* Examining network performance on simplified or formal languages;
* Mechanistic interpretability, reverse engineering approaches to
understanding particular properties of neural models;
* Proposing modifications to neural architectures that increase their
interpretability;
* Testing whether interpretable information can be decoded from
intermediate representations;
* Explaining specific model predictions made by neural networks;
* Generating and evaluating the quality of adversarial examples in NLP;
* Developing open-source tools for analyzing neural networks in NLP;
* Evaluating the analysis results: how do we know that the analysis is
valid?
*Submissions*
-----------------
We call for two types of papers:
1) Archival papers. These are papers reporting on completed, original and
unpublished research, with a maximum length of 8 pages + references. Papers
shorter than this maximum are also welcome. Accepted papers are expected to
be presented at the workshop and will be published in the workshop
proceedings. They should report on obtained results rather than intended
work. These papers will undergo double-blind peer-review, and should thus
be anonymized.
2) Extended abstracts. These may report on work in progress or may be cross
submissions that have already appeared in a non-NLP venue. The extended
abstracts are of maximum 2 pages + references. These submissions are
non-archival in order to allow submission to another venue. The selection
will not be based on a double-blind review and thus submissions of this
type need not be anonymized.
Submissions should follow the official EMNLP 2023 style guidelines.
*The submission site is:*
https://www.softconf.com/emnlp2023/BlackboxNLP
*Organizers*
-----------------
Yonatan Belinkov, Technion
Najoung Kim, Boston University
Sophie Hao, New York University
Arya McCarthy, Johns Hopkins University
Jaap Jumelet, University of Amsterdam
Hosein Mohebbi, Tilburg University
*Contact*
---------------------
Please contact the organizers at blackboxnlp(a)googlegroups.com for any
questions.
Read more:
https://www.aclweb.org/portal/content/blackboxnlp-2023-6th-workshop-analysi…
[Apologies for cross-posting]
The University of Gothenburg, Sweden, is offering four fully-funded PhD positions in computer science and engineering where the candidates can choose the project themselves out of fourteen options.
Two of the projects are related to NLP, one about efficient algorithms for corpus searching, and another about automatic generation of Wikipedia articles. See the ad for more information:
https://web103.reachmee.com/ext/I005/1035/job?site=7&lang=UK&validator=9b89…
The positions are fully funded for 5 years, including 20% teaching or other departemental duties.
Application deadline: 20 August 2023
best regards,
Peter Ljunglöf
------- ------ ----- ---- --- -- - - - - -
peter ljunglöf
peter.ljunglof(a)gu.se
data- och informationsteknik, och språkbanken
göteborgs universitet och chalmers tekniska högskola
-------------- --------- -------- ------- ------ ----- ---- --- -- - - - - -
Dear all
Just wanted to let you know that APJCR Vol. 3, No. 1 is now available to
view online.
http://icr.or.kr/ejournals-apjcr
CK
---
*CK Jung BEng(Hons) Birmingham MSc Warwick EdD Warwick Cert Oxford*
Department of English Language and Literature, Incheon National
University, *South
Korea*
Vice President | The Korea Association of Primary English Education
(KAPEE), *South Korea*
Vice President | The Korea Association of Secondary English Education
(KASEE), *South Korea*
Director | Institute for Corpus Research, Incheon National University, *South
Korea* (http://icr.or.kr)
Editor | Asia Pacific Journal of Corpus Research, ICR, *International* (
http://icr.or.kr/apjcr)
Deputy Editor | Korean Journal of English Language and Linguistics,
KASELL, *South
Korea*
Editorial Board | Corpora, Edinburgh University Press, *UK*
Editorial Board | English Today, Cambridge University Press, *UK*
E: ckjung(a)inu.ac.kr / T: +82 (0)32 835 8129
H(EN): http://ckjung.org
H(KR): http://prof1.inu.ac.kr/user/ckjung
*SIG Writing 2024*
*Call for Conference Submissions and Research School Participation*
The EARLI Special Interest Group *Writing*, *Paris Nanterre University*
<https://www.parisnanterre.fr/> (France), *Sorbonne Nouvelle University *
<http://www.univ-paris3.fr/>(France), and the *University of Turku *
<https://www.utu.fi/fi>(Finland) invite proposals to the *20th biennial SIG
Writing conference* to be held at Paris Nanterre University, Nanterre,
France, from 26-28 June 2024. Prior to the conference, from 24-25 June
2024, the *Research School will be held.*
The conference theme for SIG Writing 2024 is *ways2write*. Writing is a
ubiquitous social and professional practice in most parts of the world. It
takes on diverse, increasingly mixed and heterogeneous forms, manifests
itself over ever wider spans of life and activity, exploits more and more
media and technologies, and is increasingly fluid. The source and its
individual and collective characteristics (age, cognitive abilities,
human/non-human...), the medium, the context and the production objectives,
time and space—all give rise to a diversity of writings and ways of
writing, raising numerous questions in an interdisciplinary field. While
the conference traditionally hosts a diversity of topics and approaches,
proposals within the *ways2write* theme are particularly welcome.
More information and *call for conference papers and Research School*
participation at *SIG WRITING 2024 (google.com)*
<https://sites.google.com/view/sig-writing-2024/conference-home?authuser=0>
*Submission deadline: October 31, 2023*
*Submission guidelines*
The aim of the conference is to promote interaction among researchers who
are interested in understanding the cognitive, social, and developmental
processes involved in writing, who are concerned with designing writing
instruction in various educational settings, or who are engaged with
exploring the functions of writing in different social and institutional
contexts. The scope of the conference is broad, and we hope to draw
together a wide range of researchers and professionals.
*CONFERENCE*: Presentation formats are Paper, Symposium, Poster,
Roundtable, and Demonstration Session.
The abstract should be 250-350 words (including any references). All
proposals must be written in English and submitted using the online
submission system: https://www.earli-eapril.org
PhD students and early career researchers are invited to submit to and
participate in the pre-conference Research School.
*RESEARCH SCHOOL*: Presentation formats are Paper Presentation, Poster or
30' Article Manuscript Discussion.
The abstract should be 350 words (including any references). All proposals
must be written in English and submitted using the online submission
system: https://www.earli-eapril.org
For any questions, please contact us at *sigwriting24.paris(a)gmail.com*
<sigwriting24.paris(a)gmail.com>
You can also follow the updates on Twitter: *@SIGWriting2024 *
<https://twitter.com/SIGWriting2024>
And Facebook: SigWriting Paris
**************************************************************
16th Workshop on Building and Using Comparable Corpora (BUCC)
with Shared Task on Multilingual Terminology Extraction
from Comparable Specialized Corpora
Co-located with RANLP 2023
September 7 or 8, 2023
Workshop website: https://comparable.limsi.fr/bucc2023/
Shared task website: https://comparable.limsi.fr/bucc2023/bucc2023-task.html
RANLP website: http://ranlp.org/ranlp2023/
Workshop proceedings to be published in ACL Anthology
Invited speaker: Sida I. Wang, Meta AI (FAIR)
**************************************************************
MOTIVATION
In the language engineering and the linguistics communities, research in
comparable corpora has been motivated by two main reasons. In language
engineering, on the one hand, it is chiefly motivated by the need to use
comparable corpora as training data for statistical NLP applications
such as statistical and neural machine translation or cross-lingual
retrieval. In linguistics, on the other hand, comparable corpora are of
interest because they enable cross-language discoveries and comparisons.
It is generally accepted in both communities that comparable corpora
consist of documents that are comparable in content and form in various
degrees and dimensions across several languages. Parallel corpora are on
the one end of this spectrum, unrelated corpora on the other.
Comparable corpora have been used in a range of applications, including
Information Retrieval, Machine Translation, Cross-lingual text
classification, etc.? The linguistic definitions and observations
related to comparable corpora can improve methods to mine such corpora
for applications of statistical NLP, for example to extract parallel
corpora from comparable corpora for neural MT. As such, it is of great
interest to bring together builders and users of such corpora.
TOPICS
We solicit contributions on all topics related to comparable (and
parallel) corpora, including but not limited to the following:
Building Comparable Corpora:
* Automatic and semi-automatic methods
* Methods to mine parallel and non-parallel corpora from the web
* Tools and criteria to evaluate the comparability of corpora
* Parallel vs non-parallel corpora, monolingual corpora
* Rare and minority languages, across language families
* Multi-media/multi-modal comparable corpora
Applications of comparable corpora:
* Human translation
* Language learning
* Cross-language information retrieval & document categorization
* Bilingual and multilingual projections
* (Unsupervised) Machine translation
* Writing assistance
* Machine learning techniques using comparable corpora
Mining from Comparable Corpora:
* Cross-language distributional semantics, word embeddings and
pre-trained multilingual transformer models
* Extraction of parallel segments or paraphrases from comparable corpora
* Methods to derive parallel from non-parallel corpora (e.g. to provide
for low-resource languages in neural machine translation)
* Extraction of bilingual and multilingual translations of single words,
multi-word expressions, proper names, named entities, sentences,
paraphrases etc. from comparable corpora
* Induction of morphological, grammatical, and translation rules from
comparable corpora
* Induction of multilingual word classes from comparable corpora
Comparable Corpora in the Humanities:
* Comparing linguistic phenomena across languages in contrastive linguistics
* Analyzing properties of translated language in translation studies
* Studying language change over time in diachronic linguistics
* Assigning texts to authors via authors' corpora in forensic linguistics
* Comparing rhetorical features in discourse analysis
* Studying cultural differences in sociolinguistics
* Analyzing language universals in typological research
IMPORTANT DATES
July 18, 2023: Paper submission deadline
July 31, 2021: Notification of acceptance
August 25, 2021: Camera ready final papers
September 7 or 8, 2023: Workshop date
For updates see the workshop website at
https://comparable.limsi.fr/bucc2023/
PRACTICAL INFORMATION
Workshop registration is via the main conference registration site,
see http://ranlp.org/ranlp2023/index.php/fees-registration/
The workshop proceedings will be published in the ACL Anthology.
SUBMISSION GUIDELINES
Please follow the style sheet and templates (for LaTeX, Overleaf and
MS-Word) provided for the main conference at
http://ranlp.org/ranlp2023/index.php/submissions/
Papers should be submitted as a PDF file using the START conference
manager at https://softconf.com/ranlp23/BUCC/
Submissions must describe original and unpublished work and range
from 4 to 8 pages plus unlimited references.
Reviewing will be double blind, so the papers should not reveal the
authors' identity. Accepted papers will be published in the workshop
proceedings, which will be included in the ACL Anthology.
Double submission policy: Parallel submission to other meetings or
publications is possible but must be immediately (i.e. as soon as known
to the authors) notified to the workshop organizers by e-mail.
For further information and updates see the BUCC 2023 website:
https://comparable.limsi.fr/bucc2023/
BUCC 2023 SHARED TASK
Bilingual Term Alignment in Comparable Specialized Corpora
The BUCC 2023 shared task is on multilingual terminology alignment in
comparable corpora. Many research groups are working on this problem
using a wide variety of approaches. However, as there is no standard way
to measure the performance of the systems, the published results are not
comparable and the pros and cons of the various approaches are not
clear. The shared task aims at solving these problems by organizing a
fair comparison of systems. This is accomplished by providing corpora
and evaluation datasets for a number of language pairs and domains.
Moreover, the importance of dealing with multi-word expressions in
Natural Language Processing applications has been recognized for a long
time. In particular, multi-word expressions pose serious challenges for
machine translation systems because of their syntactic and semantic
properties. Furthermore, multi-word expressions tend to be more
frequent in domain-specific text, hence the need to handle them in tasks
with specialized-domain corpora.
Through the 2023 BUCC shared task, we seek to evaluate methods that
detect pairs of terms that are translations of each other in two
comparable corpora, with an emphasis on multi-word terms in specialized
domains.
For the schedule and further details see the shared task website at
https://comparable.limsi.fr/bucc2023/bucc2023-task.html
WORKSHOP ORGANIZERS
* Reinhard Rapp (University of Mainz and Magdeburg-Stendal University of
Applied Sciences, Germany)
* Pierre Zweigenbaum (Université Paris-Saclay, CNRS, LISN, Orsay, France)
* Serge Sharoff (University of Leeds, United Kingdom)
Contact workshop: reinhardrapp (at) gmx (dot) de
Contact shared task: pz (at) lisn (dot) fr
PROGRAMME COMMITTEE
* Ebrahim Ansari (Institue for Advanced Studies in Basic Sciences, Iran)
* Thierry Etchegoyhen (Vicomtech, Spain)
* Philippe Langlais (Université de Montréal, Canada)
* Yves Lepage (Waseda University, Japan)
* Shervin Malmasi (Amazon, USA)
* Emmanuel Morin (Université de Nantes, France)
* Dragos Stefan Munteanu (RWS, USA)
* Reinhard Rapp (University of Mainz and Magdeburg-Stendal University of
Applied Sciences, Germany)
* Nasredine Semmar (CEA LIST, Paris, France)
* Serge Sharoff (University of Leeds, UK)
* Richard Sproat (OGI School of Science & Technology, USA)
* Tim Van de Cruys (KU Leuven, Belgium)
* Pierre Zweigenbaum (Université Paris-Saclay, CNRS, LISN, Orsay, France)
Apologies for cross posting
International workshop
NLP for translation and interpreting applications (NLP4TIA)
Varna, Bulgaria, 7/8 September 2023
https://nlp4tia.web.uah.es/
Last Call for Papers – Extended Deadline
In the last two decades, we have been able to witness a technological turn in translation and interpreting studies with Natural Language Processing (NLP) and deep learning playing more and more prominent part. There is already a growing number of NLP applications that are used to support the work of translators and interpreters. In addition, the recent advances in (and latest models of) deep learning have powered the further development and success of high performing Neural Machine Translation (NMT) systems.
Translation technology has revolutionised the translation profession and nowadays most professional translators employ tools such as translation memory (TM) systems in their daily work. Latest advances of Neural Machine Translation (NMT) have resulted in NMT not only becoming an integral part of most state-of-the art TM tools but also typical for the translation workflow of many companies, organisations and freelance translators.
Although translation has benefited more from technological advances, interpreting has also experienced a technological turn. However, it has not been until some years ago that soft technology has permeated interpreting practice and research. Computer assisted translation, MT and NLP tools have been adapted to be used by interpreters. In addition, corpus-based studies have also underpinned dialogue interpreting.
The increasing interest in NLP, MT and the automation of processes has brought us to multidisciplinary projects that deal with the development of models for automated oral communication. Machine interpreting has already been developed and is being improved, focusing on speed and accuracy matters. Either domain-specific (commercial, military, humanitarian) or general (Skype Translator), there is still a long way to go to render machine interpreting more human-like.
Many of the above recent developments have to do with the employment of Natural Language Processing tools and resources to support the work of translators and interpreters. This workshop is expected to discuss the growing importance of NLP in different translation and interpreting scenarios.
Workshop topics
The workshop invites submissions reporting original unpublished work on topics including but not limited to:
* NLP and MT for under-resourced languages;
* Translation Memory systems;
* NLP and MT for translation memory systems;
* NLP for CAT and CAI tools;
* Integration of NLP tools in remote interpreting platforms;
* NLP for dialogue interpreting;
* Development of NLP based applications for communication in public service settings (healthcare, education, law, emergency services);
* Corpus-based studies applied to translation and interpreting.;
* Machine translation and machine interpreting;
* Resources for translation and machine translation;
* Resources for interpreting and interpreting technology application;
* Quality estimation of human and machine translation;
* Post-editing strategies and tools;
* Automatic post-editing of MT;
* NLP and MT for subtitling.
* Technology acceptance by interpreters and translations;
* Machine Translation and translation tools for literary texts;
* Evaluation of machine translation and translation and interpreting tools in general;
* The impact of the technological turn in translation and interpreting;
* Cognitive effort and eye-tracking experiments in translation and interpreting;
* Development of models for research and practice of translation and interpreting;
* Multidisciplinary cooperation in NLP applied to translation and interpreting.
Submissions and publication
Submissions must consist of full-text papers and should not exceed 7 pages excluding references, they should be a minimum of 5 pages long. The accepted papers will be published as NLP4TIAworkshop e-proceedings with ISBN, will be assigned a DOI and will be also available at the time of the conference. The papers should be in English and should be submitted via the conference management system START using this link<https://softconf.com/ranlp23/NLP4TIA/>.
Authors of accepted papers will receive guidelines regarding how to produce camera-ready versions of their papers for inclusion in the proceedings.
Each submission will be reviewed by at least two programme committee members. Accepted papers will be presented orally as part of the programme of the workshop.
Submissions should be compliant with the below templates and should be uploaded as pdf files in START (START is configured to accept pdf files only).
The following templates should be used: LaTeX at Overleaf<https://www.overleaf.com/latex/templates/instructions-for-ranlp-2023-procee…>, LaTeX<http://ranlp.org/ranlp2023/Templates/ranlp2023-LaTeX.zip> , MS Office<http://ranlp.org/ranlp2023/Templates/ranlp2023-word.docx>
Important dates
Deadline for paper submission: 10 July 2023
Deadline for paper submission (extended): 23 July 2023
Acceptance notification: 10 August 2023
Final camera-ready version: 25 August 2023
Workshop camera-ready proceedings ready: 31 August 2023
NLP4TIA workshop: 7/8 September 2023
Workshop Chairs
Raquel Lázaro Gutiérrez (Universidad de Alcalá)
Antonio Pareja Lora (Universidad de Alcalá)
Ruslan Mitkov (Lancaster University)
Programme Committee
Cristina Aranda (Big Onion)
Juanjo Arevalillo (Hermes Traducciones)
Silvia Bernardini (University of Bologna)
Gabriel Cabrera Méndez (Dualia Teletraducciones)
Matt Coler (University of Groningen)
Elena Davitti (University of Surrey)
Joanna Drugan (Heriot-Watt University)
Marie Escribe (LanguageWire)
Claudio Fantinuoli (Mainz University/KUDO Inc)
Antonio García Cabot (Universidad de Alcalá)
Adriana Jaime Pérez (Migralingua Voze)
Miguel Ángel Jiménez Crespo (Rutgers University)
Óscar Luis Jiménez Serrano (University of Granada)
Koen Kerremans (Free University Brussel)
Maria Kunilovskaya (Saarland University)
Els Lefever (Ghent University)
Pilar León Arauz (University of Granada)
Johanna Monti (University of Naples L’Orientale)
Elena Montiel Ponsoda (Polytechnic University of Madrid)
Helena Moriz (University of Lisbon)
Elena Murgolo (Orbital 14)
Dora Murgu (Interprefy)
Constantin Orasan (University of Surrey)
María Teresa Ortego Antón (University of Valladolid)
Tharindu Ranasinghe (Aston University)
Celia Rico (Universidad Complutense de Madrid)
Caroline Rossi (University Grenoble les Alpes)
María del Mar Sánchez Ramos (Universidad de Alcalá)
Miriam Seghiri (University of Malaga)
Vilelmini Sosoni (Ionian University)
Rui Manuel Sousa Silva (University of Porto)
Nicoletta Spinolo (University of Bologna)
Venue
The workshop will take place at hotel Cherno More<https://www.chernomorebg.com/en/> in Varna.
Further information and contact details
Registration for NLP4TIA is now open and is done via the RANLP main conference page. To register, please complete the registration form<https://url6.mailanyone.net/scanner?m=1pii0v-000B6E-3x&d=4%7Cmail%2F14%2F16…>.
The conference website (
https://nlp4tia.web.uah.es/) will be updated on a regular basis. For further information, please email raquel.lazaro(a)uah.es<mailto:raquel.lazaro@uah.es>.
> [Apologies for cross-posting]
> ======================================================================
> NEW DATES - CFP - SIMBig 2023
> ======================================================================
>
> SIMBig 2023 - 10th International Conference on Information Management and Big Data
> Where: Instituto Politécnico Nacional, Mexico DF, MEXICO
> When: October 18 - 20, 2023
> Website: https://simbig.org/SIMBig2023/
>
> ======================================================================
>
> OVERVIEW
> ----------------------------------
>
> SIMBig 2023 seeks to present new methods of Artificial Intelligence (AI), Data Science, Machine Learning, Natural Language Processing, Semantic Web, and related fields, for analyzing, managing, and extracting insights and patterns from large volumes of data.
>
>
> KEYNOTE SPEAKERS (to be confirmed)
> ----------------------------------
>
 Mona Diab, Meta AI, USA
 Finale Doshi-Velez, Harvard University, USA
 Huan Liu, Arizona State University, USA
>
> IMPORTANT DATES
> ----------------------------------
>
> July 24, 2023 --> Full papers and short papers due
> August 28, 2023 --> Notification of acceptance
> September 10, 2023 --> Camera-ready versions
> October 18 - 20, 2023 --> Conference held in Mexico DF, Mexico
>
> PUBLICATION
> ----------------------------------
>
> All accepted papers of SIMBig 2023 (tracks including) will be published with Springer CCIS Series <https://www.springer.com/series/7899> (to be confirmed).
>
> Best papers of SIMBig 2023 (tracks including) will be selected to submit an extension to be published in the Springer SN Computer Science Journal. <https://www.springer.com/journal/42979>
 
> TOPICS OF INTEREST
> ----------------------------------
>
> SIMBig 2023 has a broad scope. We invite contributions on theory and practice, including but not limited to the following technical areas:
>
> Artificial Intelligence
> Big/Masive Data
> Data Science
> Machine Learning
> Deep Learning
> Natural Language Processing
> Semantic Web
> Data-driven Software Engineering
> Data-driven software adaptation
> Healthcare Informatics
> Biomedical Informatics
> Data Privacy and Security
> Information Retrieval
> Ontologies and Knowledge Representation
> Social Networks and Social Web
> Information Visualization
> OLAP and Business intelligence
> Crowdsourcing
>
> SPECIAL TRACKS
> ----------------------------------
>
> SIMBig 2023 proposes six special tracks in addition to the main conference:
>
> ANLP <https://simbig.org/SIMBig2023/en/anlp.html> - Applied Natural Language Processing
> DISE <https://simbig.org/SIMBig2023/en/dise.html> - Data-Driven Software Engineering
> EE-AI-HPC <https://simbig.org/SIMBig2023/en/eeaihpc.html> - Efficiency Enhancement for AI and High-Performance Computing
> SNMAM <https://simbig.org/SIMBig2023/en/snmam.html> - Social Network and Media Analysis and Mining
>
> CONTACT
> ----------------------------------
>
> SIMBig 2023 General Chairs
>
> Juan Antonio Lossio-Ventura, National Institutes of Health, USA (juan.lossio(a)nih.gov <mailto:juan.lossio@nih.gov>)
> Hugo Alatrista-Salas, Pontificia Universidad Católica del Perú, Peru (halatrista(a)pucp.pe <mailto:halatrista@pucp.pe>)