The 19th Workshop on Innovative Use of NLP for Building Educational
Applications (BEA 2024)
Location: Mexico City (co-located with NAACL 2024)
Date: Thursday, June 20 or Friday, June 21, 2024 (TBD)
Website: https://sig-edu.org/bea/current
*Submission Deadline: Sunday, March 10, 2024, 11:59pm UTC-12*
WORKSHOP DESCRIPTION
The BEA Workshop is a leading venue for NLP innovation in the context of
educational applications. It is one of the largest one-day workshops in the
ACL community with over 100 registered attendees in the past several years.
The growing interest in educational applications and a diverse community of
researchers involved resulted in the creation of the Special Interest Group
in Educational Applications (SIGEDU)
<https://www.aclweb.org/adminwiki/index.php?title=2019Q3_Reports:_SIGEDU> in
2017, which currently has over 370 members.
The 19th BEA workshop will have a keynote by Alla Rozovskaya
<https://sites.google.com/site/allamrozovskaya/> (Queens College, CUNY), an
invited paper presentation by a member of one of the educational societies
from the International Alliance to Advance Learning in the Digital Era (
IAALDE <https://alliancelss.com/>), oral presentation sessions, and a large
poster session to maximize the amount of original work presented. This
year, the workshop is also hosting two shared tasks: on Automated
Prediction of Item Difficulty and Item Response Time
<https://sig-edu.org/sharedtask/2024> and on Multilingual Lexical
Simplification <https://sites.google.com/view/mlsp-sharedtask-2024>. We
expect that the workshop will continue to highlight novel technologies and
opportunities for educational NLP in English as well as other languages.
The workshop will solicit long, short and demo papers for either oral or
poster presentation.
We will solicit papers that incorporate NLP methods, including, but not
limited to:
-
use of LLMs and generative AI in educational contexts
-
automated scoring of open-ended textual and spoken responses;
-
automated scoring/evaluation for written student responses (across
multiple genres);
-
game-based instruction and assessment;
-
educational data mining;
-
intelligent tutoring;
-
collaborative learning environments;
-
peer review;
-
grammatical error detection and correction;
-
learner cognition;
-
spoken dialog;
-
multimodal applications;
-
annotation standards and schemas;
-
tools and applications for classroom teachers, learners and/or test
developers; and
-
use of corpora in educational tools.
INVITED TALKS
The workshop will feature a keynote by Alla Rozovskaya
<https://sites.google.com/site/allamrozovskaya/> (Queens College, CUNY) and
an invited talk by a speaker from one of the IAALDE
<https://alliancelss.com/> societies.
IMPORTANT DATES
All deadlines are 11.59 pm UTC-12 (anywhere on earth).
-
Submission Deadline: Sunday, March 10, 2024
-
Notification of Acceptance: Sunday, April 14, 2024
-
Camera-ready Papers Due: Wednesday, April 24, 2024
-
Workshop: Thursday, June 20 or Friday, June 21, 2024 (TBD)
SUBMISSION INFORMATION
We will be using the ACL Submission Guidelines for the BEA Workshop this
year. Authors are invited to submit a long paper of up to eight (8) pages
of content, plus unlimited references; final versions of long papers will
be given one additional page of content (up to 9 pages) so that reviewers’
comments can be taken into account. We also invite short papers of up to
four (4) pages of content, plus unlimited references. Upon acceptance,
short papers will be given five (5) content pages in the proceedings.
Authors are encouraged to use this additional page to address reviewers’
comments in their final versions. We generally follow ACL submission
guidelines <https://aclrollingreview.org/cfp> and will require that all
submitted papers should include a dedicated "Limitations" section, which
does not count toward the page limit.
Papers which describe systems are also invited to give a demo of their
system. If you would like to present a demo in addition to presenting the
paper, please make sure to select either “long paper + demo” or “short
paper + demo” under “Submission Category” in the START submission page.
Previously published papers cannot be accepted. The submissions will be
reviewed by the program committee. As reviewing will be blind, please
ensure that papers are anonymous. Self-references that reveal the author’s
identity, e.g., “We previously showed (Smith, 1991) …”, should be avoided.
Instead, use citations such as “Smith previously showed (Smith, 1991) …”.
We have also included conflict of interest in the submission form. You
should mark all potential reviewers who have been authors on the paper, are
from the same research group or institution, or who have seen versions of
this paper or discussed it with you.
We will be using the START conference system to manage submissions:
*https://softconf.com/naacl2024/BEA2024*
<https://softconf.com/naacl2024/BEA2024/>
DOUBLE SUBMISSION POLICY
We will follow the official ACL double-submission policy
<https://aclrollingreview.org/cfp>. Specifically:
Papers being submitted both to BEA and another conference or workshop must:
● Note on the title page the other conference or workshop to which
they are being submitted.
● State on the title page that if the authors choose to present their
paper at BEA (assuming it was accepted), then the paper will be withdrawn
from other conferences and workshops.
ORGANIZING COMMITTEE
-
Ekaterina Kochmar <https://ekochmar.github.io/about/>, MBZUAI
-
*Marie Bexte
<https://www.fernuni-hagen.de/english/research/clusters/catalpa/about-catalp…>*
, FernUniversität in Hagen
-
Jill Burstein <https://sites.google.com/site/jbursteinets/>, Duolingo
-
Andrea Horbach
<https://www.fernuni-hagen.de/english/research/clusters/catalpa/about-catalp…>
, FernUniversität in Hagen
-
Ronja Laarmann-Quante
<https://www.ltl.uni-due.de/team/ronja-laarmann-quante>, Ruhr University
Bochum
-
Anaïs Tack <https://anaistack.github.io/>, KU Leuven
-
Victoria Yaneva <http://www.victoriayaneva.info/>, National Board of
Medical Examiners
-
Zheng Yuan <https://www.cl.cam.ac.uk/~zy249/>, King’s College London
Workshop contact email address: bea.nlp.workshop(a)gmail.com
PROGRAM COMMITTEE
David Alfter; Tazin Afrin; Erfan Al-Hossami; Giora Alexandron; Desislava
Aleksandrova; Bashar Alhafni; Alejandro Andrade; Nischal Ashok Kumar; Berk
Atil; Shiva Baghel; Rabin Banjade; Michael Gringo Angelo Bayona; Lee
Becker; Lisa Beinborn; Luca Benedetto; Marie Bexte; Serge Bibauw; Ted
Briscoe; Jie Cao; Dumitru-Clementin Cercel; Mei-Hua Chen; Hyundong Cho;
Mark Core; Orphee De Clercq; Kordula De Kuthy; Jasper Degraeuwe; Yo Ehara;
Carrie Demmans Epp; Yang Deng; Chris Develder; Ann Devitt; Barbara Di
Eugenio; Elisa Di Nuovo; Yuning Ding; Rahul Divekar; A. Seza Dogruoz;
George Dueñas; Mariano Felice; Nigel Fernandez; Michael Flor; Jennifer
Frey; Diana Galvan-Sosa; Ashwinkumar Ganesan; Rujun Gao; Sebastian Gombert;
Cyril Goutte; Abigail Gurin Schleifer; Samar Haider; Handoko Handoko; Ching
Nam Hang; Jiangang Hao; Nicolas Hernandez; Heiko Holz; Chieh-Yang Huang;
Chung- Chi Huang; Yi-Ting Huang; Joseph Marvin Imperial; Radu Tudor
Ionescu; Qinjin Jia; Helen Jin; Richard Johansson; Masahiro Kaneko; Elma
Kerz; Fazel Keshtkar; Alfiya Khabibullina; Mamoru Komachi; Roland Kuhn;
Alexander Kwako; Kristopher Kyle; Antonio Laverghetta Jr.; Arun Balajiee
Lekshmi Narayanan; Bruce W. Lee; Seolhwa Lee; Xu Li; John Licato; Yudong
Liu; Zhexiong Liu; Anastassia Loukina; Jakub Macina; Nitin Madnani; Arianna
Masciolini; Sandeep Mathias; Detmar Meurers; Amit Kumar Mishra; Masato
Mita; Farah Nadeem; Ryo Nagata; Sungjin Nam; Diane Napolitano; Arun
Balajiee Lekshmi Narayanan; Tanya Nazaretsky; Kamel Nebhi; Seyed Parsa
Neshaei; Hwee Tou Ng; Gebregziabihier Nigusie; Christina Niklaus; S Jaya
Nirmala; Huy Nguyen; Eda Okur; Kostiantyn Omelianchuk; Amin Omidvar; Ulrike
Pado; Chanjun Park; E. Margaret Perkoff; Jakob Prange; Long Qin; Mengyang
Qiu; Martí Quixal; Manav Rathod; Hanumant Redkar; Robert Reynolds; Frankie
Robertson; Alla Rozovskaya; Nicy Scaria; Alexander Scarlatos; Gyu-Ho Shin;
Mayank Soni; Katherine Stasaski; Helmer Strik; Hakyung Sung; Hanna
Suominen; Abhijit Suresh; Jan Švec; Chee Wei Tan; Zhongwei Teng; Rushil
Thareja; Xiaoyi Tian; Sowmya Vajjala; Piper Vasicek; Giulia Venturi;
Anthony Verardi; Carl Vogel; Elena Volodina; Taro Watanabe; Michael White;
Alistair Willis; Simon Woodhead; Marcos Zampieri; Fabian Zehner; Torsten
Zesch; Jing Zhang; Yiyun Zhou; Jessica Zipf; Bowei Zou
The Language Technology Group at the Department of Informatics, University of Oslo, invites applications for a 3-year PhD research fellowship in Natural Language Processing.
The PhD project will focus on variety-aware machine translation in a broad sense. While there have been impressive improvements in the field of machine translation in recent years, most systems define their source and target languages at a relatively coarse level (e.g. “English”, “Norwegian”) and do not take into account different types of variation that exist within these language denominations: national varieties (e.g. American vs. British English), writing forms (e.g. Bokmål vs. Nynorsk), dialects, historical varieties, learner language, simple/plain language, as well as different genres and styles.
Applicants have considerable freedom to define their research project within the area of variety-aware machine translation. Depending on the chosen variation type, different setups can be explored (variation on the source side, variation on the target side, or translation between two varieties of the same language). The research project should not only produce variety-aware machine translation models, but also contribute to the evaluation of such models. The application should contain a research proposal that specifies the research direction that the applicant wishes to take.
The LTG ( https://www.mn.uio.no/ifi/english/research/groups/ltg/ ) is an international and diverse group with a heavily machine-learning oriented research profile combined with an interest for linguistic analysis and knowledge. The LTG currently participates in the EU-funded HPLT project ( https://hplt-project.org/ ), one of whose goals is to provide large datasets for training language and translation models. The LTG is also an active member of Integreat ( https://www.integreat.no/ ), the Norwegian center of excellence for knowledge-driven machine learning. The group has access to excellent high-performance computing infrastructure.
The application deadline is 29 February 2024. Further details about the position and application requirements can be found here: https://www.jobbnorge.no/en/available-jobs/job/257218/phd-research-fellow-i…
For further information about the position and the application process, please contact Associate Professor Yves Scherrer: yves.scherrer(a)ifi.uio.no <mailto:yves.scherrer@ifi.uio.no>
Hi,
We invite applications for an internship position at CNRS / Laboratoire
d'informatique de Grenoble
to work on syntactic parsing of speech.
The internship may be followed by a 3-year funded PhD position to work on
the same topic.
Details can be found here:
https://mcoavoux.github.io/internship_proposal.html
Best,
Maximin Coavoux
Saarland University is a campus-based university with a strong
international focus and a research-oriented
profile. Numerous research institutes on campus and the systematic
promotion of collaborative projects
make Saarland University an ideal environment for innovation and
technology transfer.
The German Research Center for Artificial Intelligence (DFKI) is the
world's largest independent research
institute in the field of artificial intelligence and is internationally
recognized as a center of excellence for
applied basic research. More information about the DFKI can be found at
http://www.dfki.de/.
The Department of Philosophy at Saarland University seeks to hire a
Professor (W3)
in Ethics of Digitalization
(m/f/x; Reference no.: W2428)
As part of a long-standing collaboration agreement, this professorship
position is linked to the leadership of
a new research department at the DFKI in Saarbrücken in the same area.
At the university, the professorship
will complement the existing expertise of colleagues at the Department
of Philosophy in the field of ethics of
digitalization and establish a strong bridge to the DFKI and the
Department of Computer Science. We expect
co-optation with the Department of Computer Science.
We are looking for a highly motivated and outstanding researcher in the
area of Ethics of Digitalization with
extensive knowledge in the relevant fields of practical philosophy,
computer science, and AI. A
demonstrated ability to attract external funding for research projects
is highly desired as research at the
DFKI is almost exclusively third-party funded. The successful candidate
is expected to actively and
significantly contribute to departmental research and to teach at the
M.Sc./M.A., B.Sc./B.A., and PhD level.
The teaching language is English and German. We expect that the
successful candidate has, or is willing to
acquire within an appropriate period, sufficient proficiency to teach in
both languages. It would also be
deeply appreciated if the future professor would also participate in
teaching at the Department of Computer
Science (e.g., the award-winning “Ethics for Nerds” course). The
professor will also, as one of his/her main
duties, establish and lead a research group of the DFKI with research
themes at the intersection of computer
science and philosophy that extends the university research group
towards interdisciplinary and application-
oriented research in the area of ethics of AI.
What we offer:
Professors (W3) have faculty status at Saarland University, including
the right to supervise Bachelor’s,
Master’s, and PhD students. The successful candidate will focus on
carrying out world-class research, will
lead his/her own research group, and will undertake teaching and
supervision responsibilities. The position
offers excellent working conditions in a lively and international
scientific community on campus and across
the different DFKI sites and research departments. DFKI and Saarland
University are part of the widely known
Saarland Informatics Campus, that also includes the two Max-Planck
Institutes for Informatics as well as
Software Systems, plus the Helmholtz Center for Cyber Security, CISPA.
The Department of Philosophy is
devoted to Analytic Philosophy and well connected to other university
departments, especially to the
Department of Computer Science, and to other philosophy departments
across Germany and Europe.
Together, the Saarland campus provides an interdisciplinary, dynamic,
and stimulating research environment.
Qualifications:
The appointment will be made in accordance with the general provisions
of German public sector
employment law. Applicants will have a PhD or doctorate in an
appropriate area/subject (i.e., related to
Ethics of Digitalization) and will have demonstrated an excellent track
record of independent academic
research (e.g., as a junior or assistant professor, or by having
completed an advanced, post-doctoral research
degree (habilitation) or equivalent academic activity at a university or
research institution) in that
area/subject. They will typically have completed a period of
postdoctoral research and have teaching and
supervision experience at the university level. They must have
demonstrated outstanding research
capabilities and have the potential to successfully lead and fund their
own research group.
Your Application:
Applications should be submitted online at
www.uni-saarland.de/berufungen. No additional paper copy is
required. The application must contain:
• a cover letter and curriculum vitae (including phone number and email
address),
• a full list of publications,
• a full list of third-party funding (stating own share, if multiple
grant holders)
• your proposed research plan (2-5 pages),
• a teaching statement (1 page),
• copies of your degree certificates,
• full-text copies of your 5 most important publications,
• a list of 3 academic references (including email addresses), at least
one of whom must be a person who is
outside the group of your current or former supervisors or colleagues.
• If available: Proof of equivalence of the foreign university degree
from the Central Office for Foreign
Education (ZAB; does not apply to university degrees in Germany). If the
proof has not yet been requested
at the time of application, it must be submitted later upon request.
Applications must be received no later than 14 March 2024.
Please include the job reference number W2428 when you apply. Selected
candidates will be interviewed.
Please contact dietrich.klakow(a)lsv.uni-saarland.de if you have any
questions.
Saarland University regards internationalization as an institution-wide
process spanning all aspects of
university life and it therefore encourages applications that align with
its internationalization strategy.
Members of the university's professorial staff are therefore expected to
engage in activities that promote
and foster further internationalization. Special support will be
provided for projects that continue with or
expand on collaborative interactions within existing international
cooperative networks.
Saarland University is an equal opportunity employer. In accordance with
its affirmative action policy,
Saarland University is actively seeking to increase the proportion of
women in this field. Qualified women
candidates are therefore strongly encouraged to apply. Preferential
consideration will be given to
applications from disabled candidates of equal eligibility. We welcome
applications regardless of nationality,
ethnic and social origin, religion/belief, age, sexual orientation and
identity.
When you submit a job application to Saarland University you will be
transmitting personal data. Please refer
to our privacy notice
(https://www.uni-saarland.de/verwaltung/datenschutz/) for information on
how we
collect and process personal data in accordance with Art. 13 of the
General Data Protection Regulation
(GDPR). By submitting your application, you confirm that you have taken
note of the information in the
Saarland University privacy notice.
The full job advertisement as PDF can be found at:
https://www.uni-saarland.de/fileadmin/upload/verwaltung/stellen/Wissenschaf…
DLnLD: Deep Learning and Linked Data
Workshop colocated with LREC-COLING 2024,
Date: May 21, 2024
Venue: Torino, Italy and online
For up to date info, check: https://dl-n-ld.github.io/ <https://dl-n-ld.github.io/>
Call for Papers
----------------------------------------------------------------------------------------
What does Linguistic Linked Data brings to Deep Learning and vice versa ? Let’s bring together these two complementary approaches in NLP.
----------------------------------------------------------------------------------------
Motivations for the Workshop
Since the appearance of transformers (Vaswani et al., 2017), Deep Learning (DL) and neural approaches have brought a huge contribution to Natural Language Processing (NLP) either with highly specialized models for specific application or via Large Language Models (LLMs) (Devlin et al., 2019; Brown et al., 2020; Touvron et al., 2023) that are efficient few-shot learners for many NLP tasks. Such models usually build on huge web-scale data (raw multilingual corpora and annotated specialized, task related, corpora) that are now widely available on the Web. This approach has clearly shown many successes, but still suffers from several weaknesses, such as the cost/impact of training on raw data, biases, hallucinations, explainability, among others (Nah et al., 2023).
The Linguistic Linked Open Data (LLOD) (Chiarcos et al., 2013) community aims at creating/distributing explicitly structured data (modelled as RDF graphs) and interlinking such data across languages. This collection of datasets, gathered inside the LLOD Cloud (Chiarcos et al., 2020), contains a huge amount of multilingual ontological (e.g. DBpedia (Lehmann et al., 2015)); lexical (e.g., DBnary (Sérasset, 2015), Wordnet (McCrae et al., 2014), Wikidata (Vrandečić and Krötzsch, 2014)); or linguistic (e.g., Universal Dependencies Treebank (Nivre et al., 2020; Chiarcos et al., 2021), DBpedia Abstract Corpus (Brümmer et al., 2016)) information, structured using common metadata (e.g., OntoLex (McCrae et al., 2017), NIF (Hellmann et al., 2013), etc.) and standardised data categories (e.g., lexinfo (Cimiano et al., 2011), OliA (Chiarcos and Sukhareva, 2015)).
Both communities bring striking contributions that seem to be highly complementary. However, if knowledge (ontological) graphs are now routinely used in DL, there is still very few research studying the value of Linguistic/Lexical knowledge in the context of DL. We think that, today, there is a real opportunity to bring both communities together to take the best of both worlds. Indeed, with more and more work on Graph Neural Networks (Wu et al., 2023) and Embeddings on RDF graphs (Ristoski et al., 2019), there is more and more opportunity to apply DL techniques to build, interlink or enhance Linguistic Linked Open Datasets, to borrow data from the LLOD Cloud for enhancing Neural Models on NLP tasks, or to take the best of both worlds for specific NLP use cases.
Submission Topics
This workshop aims at gathering researchers that work on the interaction between DL and LLOD in order to discuss what each approach has to bring to the other. For this, we welcome contributions on original work involving some of the following (non exhaustive) topics:
• Deep Learning for Linguistic Linked Data, among which (but not exclusively):
• Modelling, Resources & Interlinking,
• Relation Extraction
• Corpus annotation
• Ontology localization
• Knowledge/Linguistic Graphs creation or expansion
• Linguistic Linked Data for Deep Learning, among which (but not exclusively):
• Linguistic/Knowledge Graphs as training data
• Fine tuning LLMs using Linguistic Linked (meta)Data
• Graph Neural Networks
• Knowledge/Linguistic Graphs embeddings
• LLOD for model explainability/sourcing
• Neural models for under-resourced languages
• Joint Deep Learning and Linguistic Data applications
• Use cases combining Language Models and Structured Linguistic Data
• LLOD and DL for Digital Humanities
• Question-Answering on graph data
All application domains (Digital Humanities, FinTech, Education, Linguistics, Cybersecurity…) as well as approaches (NLG, NLU, Data Extraction…) are welcome, provided that the work is based on the use of BOTH Deep Learning techniques and Linguistic Linked (meta)Data.
Important Dates
All deadlines are 11:59PM UTC-12:00 (“anywhere on Earth”)
• Final submissions due: 9th March 2024
• Notification of acceptance: 2nd April 2024
• Camera-ready due: 12th April 2024
Authors kit
All papers must follow the LREC-COLING 2024 two-column format, using the supplied official style files. The templates can be downloaded from the Style Files and Formatting page provided on the website. Please do not modify these style files, nor should you use templates designed for other conferences. Submissions that do not conform to the required styles, including paper size, margin width, and font size restrictions, will be rejected without review.
LREC-COLING 2024 Author’s Kit Page: https://lrec-coling-2024.org/authors-kit/ <https://lrec-coling-2024.org/authors-kit/>
Paper submission
Submission is electronic at https://softconf.com/lrec-coling2024/dlnld2024/ <https://softconf.com/lrec-coling2024/dlnld2024/>
Workshop Chairs
• Gilles Sérasset, Université Grenoble Alpes, France
• Hugo Gonçalo Oliveira, University of Coimbra, Portugal
• Giedre Valunaite Oleskeviciene, Mykolas Romeris University, Lithuania
Program Committee
• Mehwish Alam, Télécom Paris, Institut Polytechnique de Paris, France
• Russa Biswas, Hasso Plattner Institute, Potsdam, Germany
• Milana Bolatbek, Al-Farabi Kazakh National University, Kazakhstan
• Michael Cochez, Vrije Universiteit Amsterdam, Netherlands
• Milan Dojchinovski, Czech Technical University in Prague, Czech Republic
• Basil Ell, University of Oslo, Norway
• Robert Fuchs, University of Hamburg, Germany
• Radovan Garabík, L’. Štúr Institute of Linguistics, Slovak Academy of Sciences, Slovakia
• Daniela Gifu, Romanian Academy, Iasi branch & Alexandru Ioan Cuza University of Iasi, Romania
• Katerina Gkirtzou, Athena Research Center, Maroussi, Greece
• Jorge Gracia del Río, University of Zaragoza, Spain
• Dagmar Gromann, University of Vienna, Austria
• Dangis Gudelis, Mykolas Romeris University, Lithuania
• Ilan Kernerman, Lexicala by K Dictionaries, Israel
• Chaya Liebeskind, Jerusalem College of Technology, Israel
• Marco C. Passarotti, Università Cattolica del Sacro Cuore, Milan, Italy
• Heiko Paulheim, University of Mannheim, Germany
• Alexandre Rademaker, IBM Research Brazil and EMAp/FGV, Brazil
• Georg Rehm, DFKI GmbH, Berlin, Germany
• Harald Sack, Karlsruhe Institute of Technology, Karlsruhe, Germany
• Didier Schwab, Université Grenoble Alpes, France
• Ranka Stanković, University of Belgrade, Serbia
• Andon Tchechmedjiev, IMT Mines Alès, France
• Dimitar Trajanov, Ss. Cyril and Methodius University – Skopje, Macedonia
• Ciprian-Octavian Truică, POLITEHNICA Bucharest, Romania
• Nicolas Turenne, Guangdong University of Foreign Studies, China
• Slavko Žitnik, University of Ljubljana, Slovenia
Call for Participation - VarDial Evaluation Campaign 2024
The VarDial 2024 organizers are excited to announce the two shared tasks of the 2024 evaluation campaign:
1. The DIALECT-COPA shared task on dialectal causal commonsense reasoning
The shared task invites the community to propose, develop, and test approaches for adapting models for causal commonsense language understanding to three dialects of South-Slavic languages: the Slovenian Cerkno dialect, the Croatian Chakavian dialect, and the Serbian, Macedonian and Bulgarian Torlak dialect. Training and development data based on the COPA (Choice of plausible alternatives, Roemmele et al. 2011) dataset are available for four related standard languages (Slovenian, Croatian, Serbian, Macedonian) and two out of the three testing dialects (Cerkno, Torlak), the Chakavian dialect serving as a surprise dialect.
Details:
https://sites.google.com/view/vardial-2024/shared-tasks/dialect-copa
Registration:
https://forms.gle/UcLYcPgDFJoiAVip7
Organisers:
Nikola Ljubešić, Jožef Stefan Institute, Ljubljana
Ivan Vulić, University of Cambridge
Goran Glavaš, University of Würzburg
2. DSL-ML - Multi-label classification of similar languages
The DSL-ML task is a multi-label extension of the classic "Discriminating similar languages" task that has been popular with VarDial since the beginnings of the workshop. The motivation behind this new task formulation is that some texts do not present any linguistic markers to unambiguously determine their origin. It therefore makes sense to predict several possible labels for such texts. The 2024 DSL-ML task is based on multi-label conversions of existing datasets from five different macro-languages: English, Spanish, Portuguese, French and BCMS (Bosnian, Croatian, Montenegrin, Serbian).
Details:
https://sites.google.com/view/vardial-2024/shared-tasks/dsl-ml
Registration:
https://forms.gle/UcLYcPgDFJoiAVip7
Organizers:
Adrian Chifu, Aix-Marseille University
Radu Ionescu, University of Bucharest
Aleksandra Miletić, University of Helsinki
Filip Miletić, University of Stuttgart
Yves Scherrer, University of Oslo
Important dates
Shared task announcement and training data release: February 1
Test data release: March 4
Test results submission: March 11
System description paper submission: March 24
Notification of acceptance: April 14
Camera-ready papers due: April 24
VarDial workshop: June 21/22
The Survey of English Usage at University College London will be running the 11th Summer School in English Corpus Linguistics online from 1-3 July 2024.
This Summer School is an accessible and inspiring introductory course in English Corpus Linguistics for students of linguistics and students of the English language.
The course will be taught over three days in the morning (UK time). The course consists of theoretical and practical sessions.
Students are expected to have a basic knowledge of concepts in linguistics, especially grammar.
Places are limited. Be sure to book early to get the early bird rate.
For students in full-time education the course fee includes a free copy of either the ICE-GB Corpus (https://www.ucl.ac.uk/english-usage/projects/ice-gb/) or the DCPSE Corpus (https://www.ucl.ac.uk/english-usage/projects/dcpse/), with the associated exploration software ICECUP.
For more information about the course and how to apply, see:
https://www.ucl.ac.uk/english-usage/summer-school/
Prof. Bas Aarts
Department of English Language and Literature UCL
Grammarianism Blog: http://bit.ly/1d1zKzN
Continuous Professional Development and INSET courses for teachers: https://bit.ly/39qnKIH
Twitter: @UCLEnglishUsage and @EngliciousUCL
Note: I respect your work/life balance. If I send you an email outside of your normal working hours there is no expectation that you will read or respond to the message at that time.
Dear linguists,
We would like to announce that the deadline for the abstract submission of 18th NooJ Call for Paper is now extended to the 14th of February!
The linguistic software- NooJ, is organising its 18th International Conference in Bergamo, italy! This conference is for linguists, scholars, and professionals to engage in thought-provoking discussions on a myriad of topics encompassing Natural Language Processing (NLP), Linguistic Resources, Digital Humanities, and Language in Society.
We are thrilled to invite you to apply for the Call for Papers by the 4th of FEB, which covers the following topics:
📚NLP Societal applications and citizen science:
Typography, Spelling, Syllabification, Phonemic and Prosodic Transcription, Morphology, Lexical Analysis, Local Syntax, Structural Syntax, Transformational Analysis, Paraphrase Generation, Semantic Annotations, Semantic Analysis.
🗣️Linguistic Resources:
Corpus Linguistics, Discourse Analysis, Sentiment analysis, Literature Studies, Second-Language Teaching, Narrative content analysis, Corpus processing for the Social Sciences.
🧠Digital Humanities:
Business Intelligence, Text Mining, Text Generation. Language Teaching Software, Automatic Paraphrasing, Machine Translation, etc.
💻Natural Language Processing Applications:
Computational Socio-Linguistic (migration, geography, tourism, political discourse, cinema, social media, gender studies…)
Important dates!
Abstract Submission: Feb 14 2024
Notification of accept: March 10 2024
Camera ready: March 24 2024
Early bird registrations: From March 11 to March 31st 2024
Deadline for the other registrations: April 15 2024
Selected papers submission: Sept 15 2024
Important links!
NooJ Conference website: https://nooj2024.x-23.org/https://nooj2024.x-23.org/
Submitting the paper via EasyChair: https://easychair.org/conferences/?conf=18njhttps://easychair.org/conferences/?conf=18nj
If you have any questions, feel free to reach out to us.
Best,
The 18th NooJ Organisation Board
_____________
THE 18TH NOOJ INTERNATIONAL CONFERENCE 2024
JUN 4th to 7th, 2024 — Bergamo, Italy
Managed by The Nooj Association
Powered and hosted by X23 Srl
The Natural Language Processing Section at the Department of Computer Science at University of Copenhagen is advertising for two Ph.D Fellowships in Natural Language Processing. The positions are funded by both the Pixel-based Encoding of Language project, which is financed by the Velux Foundations, and the Democratize Trustworthy and Efficient Large Language Model Technology for Europe project funded by the European Union.
The overall goal of the research is to develop and evaluate language models that can process any written language by rendering text as images, which allows the models to learn from the visual similarities between written languages. Realizing this goal includes creating and implementing tokenization-free multilingual language models, the collection and curation of visually diverse language data, the training of small-scale and large-scale models, developing techniques for effective model quantization and compression, creating models that jointly process natural images and rendered text using a single interface, and the evaluation of such models. Your research will contribute to many aspects of this process and require detailed hands-on implementation and evaluation of cutting-edge machine learning models. Further information about the projects are available at https://veluxfoundations.dk/en/about/projects-granted#/0066700000uJIZZAA4 and https://cordis.europa.eu/project/id/101135671.
The successful candidates will join the Language and Multimodal Processing group, which is part of a section with a strong, international, and diverse environment for research within core as well as emerging topics in natural language processing, natural language understanding, computational linguistics and multi-modal language processing. It is housed within the main Science Campus, which is centrally located in Copenhagen. Further information about the group is available here: https://lampgroup.github.io/ and further information about research at the Department is available here: https://di.ku.dk/english/research/.
The application deadline is 29 February 2024, with an expected start date of 1 June 2024, or as soon as possible thereafter. Further information about the position can be found here: https://employment.ku.dk/faculty/?show=160979
Informal enquiries about the positions can be made to Assistant Professor Desmond Elliott, Department of Computer Science, University of Copenhagen, e-mail: de(a)di.ku.dk.