Hi,
We invite applications for an internship position at CNRS / Laboratoire
d'informatique de Grenoble
to work on syntactic parsing of speech.
The internship may be followed by a 3-year funded PhD position to work on
the same topic.
Details can be found here:
https://mcoavoux.github.io/internship_proposal.html
Best,
Maximin Coavoux
Saarland University is a campus-based university with a strong
international focus and a research-oriented
profile. Numerous research institutes on campus and the systematic
promotion of collaborative projects
make Saarland University an ideal environment for innovation and
technology transfer.
The German Research Center for Artificial Intelligence (DFKI) is the
world's largest independent research
institute in the field of artificial intelligence and is internationally
recognized as a center of excellence for
applied basic research. More information about the DFKI can be found at
http://www.dfki.de/.
The Department of Philosophy at Saarland University seeks to hire a
Professor (W3)
in Ethics of Digitalization
(m/f/x; Reference no.: W2428)
As part of a long-standing collaboration agreement, this professorship
position is linked to the leadership of
a new research department at the DFKI in Saarbrücken in the same area.
At the university, the professorship
will complement the existing expertise of colleagues at the Department
of Philosophy in the field of ethics of
digitalization and establish a strong bridge to the DFKI and the
Department of Computer Science. We expect
co-optation with the Department of Computer Science.
We are looking for a highly motivated and outstanding researcher in the
area of Ethics of Digitalization with
extensive knowledge in the relevant fields of practical philosophy,
computer science, and AI. A
demonstrated ability to attract external funding for research projects
is highly desired as research at the
DFKI is almost exclusively third-party funded. The successful candidate
is expected to actively and
significantly contribute to departmental research and to teach at the
M.Sc./M.A., B.Sc./B.A., and PhD level.
The teaching language is English and German. We expect that the
successful candidate has, or is willing to
acquire within an appropriate period, sufficient proficiency to teach in
both languages. It would also be
deeply appreciated if the future professor would also participate in
teaching at the Department of Computer
Science (e.g., the award-winning “Ethics for Nerds” course). The
professor will also, as one of his/her main
duties, establish and lead a research group of the DFKI with research
themes at the intersection of computer
science and philosophy that extends the university research group
towards interdisciplinary and application-
oriented research in the area of ethics of AI.
What we offer:
Professors (W3) have faculty status at Saarland University, including
the right to supervise Bachelor’s,
Master’s, and PhD students. The successful candidate will focus on
carrying out world-class research, will
lead his/her own research group, and will undertake teaching and
supervision responsibilities. The position
offers excellent working conditions in a lively and international
scientific community on campus and across
the different DFKI sites and research departments. DFKI and Saarland
University are part of the widely known
Saarland Informatics Campus, that also includes the two Max-Planck
Institutes for Informatics as well as
Software Systems, plus the Helmholtz Center for Cyber Security, CISPA.
The Department of Philosophy is
devoted to Analytic Philosophy and well connected to other university
departments, especially to the
Department of Computer Science, and to other philosophy departments
across Germany and Europe.
Together, the Saarland campus provides an interdisciplinary, dynamic,
and stimulating research environment.
Qualifications:
The appointment will be made in accordance with the general provisions
of German public sector
employment law. Applicants will have a PhD or doctorate in an
appropriate area/subject (i.e., related to
Ethics of Digitalization) and will have demonstrated an excellent track
record of independent academic
research (e.g., as a junior or assistant professor, or by having
completed an advanced, post-doctoral research
degree (habilitation) or equivalent academic activity at a university or
research institution) in that
area/subject. They will typically have completed a period of
postdoctoral research and have teaching and
supervision experience at the university level. They must have
demonstrated outstanding research
capabilities and have the potential to successfully lead and fund their
own research group.
Your Application:
Applications should be submitted online at
www.uni-saarland.de/berufungen. No additional paper copy is
required. The application must contain:
• a cover letter and curriculum vitae (including phone number and email
address),
• a full list of publications,
• a full list of third-party funding (stating own share, if multiple
grant holders)
• your proposed research plan (2-5 pages),
• a teaching statement (1 page),
• copies of your degree certificates,
• full-text copies of your 5 most important publications,
• a list of 3 academic references (including email addresses), at least
one of whom must be a person who is
outside the group of your current or former supervisors or colleagues.
• If available: Proof of equivalence of the foreign university degree
from the Central Office for Foreign
Education (ZAB; does not apply to university degrees in Germany). If the
proof has not yet been requested
at the time of application, it must be submitted later upon request.
Applications must be received no later than 14 March 2024.
Please include the job reference number W2428 when you apply. Selected
candidates will be interviewed.
Please contact dietrich.klakow(a)lsv.uni-saarland.de if you have any
questions.
Saarland University regards internationalization as an institution-wide
process spanning all aspects of
university life and it therefore encourages applications that align with
its internationalization strategy.
Members of the university's professorial staff are therefore expected to
engage in activities that promote
and foster further internationalization. Special support will be
provided for projects that continue with or
expand on collaborative interactions within existing international
cooperative networks.
Saarland University is an equal opportunity employer. In accordance with
its affirmative action policy,
Saarland University is actively seeking to increase the proportion of
women in this field. Qualified women
candidates are therefore strongly encouraged to apply. Preferential
consideration will be given to
applications from disabled candidates of equal eligibility. We welcome
applications regardless of nationality,
ethnic and social origin, religion/belief, age, sexual orientation and
identity.
When you submit a job application to Saarland University you will be
transmitting personal data. Please refer
to our privacy notice
(https://www.uni-saarland.de/verwaltung/datenschutz/) for information on
how we
collect and process personal data in accordance with Art. 13 of the
General Data Protection Regulation
(GDPR). By submitting your application, you confirm that you have taken
note of the information in the
Saarland University privacy notice.
The full job advertisement as PDF can be found at:
https://www.uni-saarland.de/fileadmin/upload/verwaltung/stellen/Wissenschaf…
DLnLD: Deep Learning and Linked Data
Workshop colocated with LREC-COLING 2024,
Date: May 21, 2024
Venue: Torino, Italy and online
For up to date info, check: https://dl-n-ld.github.io/ <https://dl-n-ld.github.io/>
Call for Papers
----------------------------------------------------------------------------------------
What does Linguistic Linked Data brings to Deep Learning and vice versa ? Let’s bring together these two complementary approaches in NLP.
----------------------------------------------------------------------------------------
Motivations for the Workshop
Since the appearance of transformers (Vaswani et al., 2017), Deep Learning (DL) and neural approaches have brought a huge contribution to Natural Language Processing (NLP) either with highly specialized models for specific application or via Large Language Models (LLMs) (Devlin et al., 2019; Brown et al., 2020; Touvron et al., 2023) that are efficient few-shot learners for many NLP tasks. Such models usually build on huge web-scale data (raw multilingual corpora and annotated specialized, task related, corpora) that are now widely available on the Web. This approach has clearly shown many successes, but still suffers from several weaknesses, such as the cost/impact of training on raw data, biases, hallucinations, explainability, among others (Nah et al., 2023).
The Linguistic Linked Open Data (LLOD) (Chiarcos et al., 2013) community aims at creating/distributing explicitly structured data (modelled as RDF graphs) and interlinking such data across languages. This collection of datasets, gathered inside the LLOD Cloud (Chiarcos et al., 2020), contains a huge amount of multilingual ontological (e.g. DBpedia (Lehmann et al., 2015)); lexical (e.g., DBnary (Sérasset, 2015), Wordnet (McCrae et al., 2014), Wikidata (Vrandečić and Krötzsch, 2014)); or linguistic (e.g., Universal Dependencies Treebank (Nivre et al., 2020; Chiarcos et al., 2021), DBpedia Abstract Corpus (Brümmer et al., 2016)) information, structured using common metadata (e.g., OntoLex (McCrae et al., 2017), NIF (Hellmann et al., 2013), etc.) and standardised data categories (e.g., lexinfo (Cimiano et al., 2011), OliA (Chiarcos and Sukhareva, 2015)).
Both communities bring striking contributions that seem to be highly complementary. However, if knowledge (ontological) graphs are now routinely used in DL, there is still very few research studying the value of Linguistic/Lexical knowledge in the context of DL. We think that, today, there is a real opportunity to bring both communities together to take the best of both worlds. Indeed, with more and more work on Graph Neural Networks (Wu et al., 2023) and Embeddings on RDF graphs (Ristoski et al., 2019), there is more and more opportunity to apply DL techniques to build, interlink or enhance Linguistic Linked Open Datasets, to borrow data from the LLOD Cloud for enhancing Neural Models on NLP tasks, or to take the best of both worlds for specific NLP use cases.
Submission Topics
This workshop aims at gathering researchers that work on the interaction between DL and LLOD in order to discuss what each approach has to bring to the other. For this, we welcome contributions on original work involving some of the following (non exhaustive) topics:
• Deep Learning for Linguistic Linked Data, among which (but not exclusively):
• Modelling, Resources & Interlinking,
• Relation Extraction
• Corpus annotation
• Ontology localization
• Knowledge/Linguistic Graphs creation or expansion
• Linguistic Linked Data for Deep Learning, among which (but not exclusively):
• Linguistic/Knowledge Graphs as training data
• Fine tuning LLMs using Linguistic Linked (meta)Data
• Graph Neural Networks
• Knowledge/Linguistic Graphs embeddings
• LLOD for model explainability/sourcing
• Neural models for under-resourced languages
• Joint Deep Learning and Linguistic Data applications
• Use cases combining Language Models and Structured Linguistic Data
• LLOD and DL for Digital Humanities
• Question-Answering on graph data
All application domains (Digital Humanities, FinTech, Education, Linguistics, Cybersecurity…) as well as approaches (NLG, NLU, Data Extraction…) are welcome, provided that the work is based on the use of BOTH Deep Learning techniques and Linguistic Linked (meta)Data.
Important Dates
All deadlines are 11:59PM UTC-12:00 (“anywhere on Earth”)
• Final submissions due: 9th March 2024
• Notification of acceptance: 2nd April 2024
• Camera-ready due: 12th April 2024
Authors kit
All papers must follow the LREC-COLING 2024 two-column format, using the supplied official style files. The templates can be downloaded from the Style Files and Formatting page provided on the website. Please do not modify these style files, nor should you use templates designed for other conferences. Submissions that do not conform to the required styles, including paper size, margin width, and font size restrictions, will be rejected without review.
LREC-COLING 2024 Author’s Kit Page: https://lrec-coling-2024.org/authors-kit/ <https://lrec-coling-2024.org/authors-kit/>
Paper submission
Submission is electronic at https://softconf.com/lrec-coling2024/dlnld2024/ <https://softconf.com/lrec-coling2024/dlnld2024/>
Workshop Chairs
• Gilles Sérasset, Université Grenoble Alpes, France
• Hugo Gonçalo Oliveira, University of Coimbra, Portugal
• Giedre Valunaite Oleskeviciene, Mykolas Romeris University, Lithuania
Program Committee
• Mehwish Alam, Télécom Paris, Institut Polytechnique de Paris, France
• Russa Biswas, Hasso Plattner Institute, Potsdam, Germany
• Milana Bolatbek, Al-Farabi Kazakh National University, Kazakhstan
• Michael Cochez, Vrije Universiteit Amsterdam, Netherlands
• Milan Dojchinovski, Czech Technical University in Prague, Czech Republic
• Basil Ell, University of Oslo, Norway
• Robert Fuchs, University of Hamburg, Germany
• Radovan Garabík, L’. Štúr Institute of Linguistics, Slovak Academy of Sciences, Slovakia
• Daniela Gifu, Romanian Academy, Iasi branch & Alexandru Ioan Cuza University of Iasi, Romania
• Katerina Gkirtzou, Athena Research Center, Maroussi, Greece
• Jorge Gracia del Río, University of Zaragoza, Spain
• Dagmar Gromann, University of Vienna, Austria
• Dangis Gudelis, Mykolas Romeris University, Lithuania
• Ilan Kernerman, Lexicala by K Dictionaries, Israel
• Chaya Liebeskind, Jerusalem College of Technology, Israel
• Marco C. Passarotti, Università Cattolica del Sacro Cuore, Milan, Italy
• Heiko Paulheim, University of Mannheim, Germany
• Alexandre Rademaker, IBM Research Brazil and EMAp/FGV, Brazil
• Georg Rehm, DFKI GmbH, Berlin, Germany
• Harald Sack, Karlsruhe Institute of Technology, Karlsruhe, Germany
• Didier Schwab, Université Grenoble Alpes, France
• Ranka Stanković, University of Belgrade, Serbia
• Andon Tchechmedjiev, IMT Mines Alès, France
• Dimitar Trajanov, Ss. Cyril and Methodius University – Skopje, Macedonia
• Ciprian-Octavian Truică, POLITEHNICA Bucharest, Romania
• Nicolas Turenne, Guangdong University of Foreign Studies, China
• Slavko Žitnik, University of Ljubljana, Slovenia
Call for Participation - VarDial Evaluation Campaign 2024
The VarDial 2024 organizers are excited to announce the two shared tasks of the 2024 evaluation campaign:
1. The DIALECT-COPA shared task on dialectal causal commonsense reasoning
The shared task invites the community to propose, develop, and test approaches for adapting models for causal commonsense language understanding to three dialects of South-Slavic languages: the Slovenian Cerkno dialect, the Croatian Chakavian dialect, and the Serbian, Macedonian and Bulgarian Torlak dialect. Training and development data based on the COPA (Choice of plausible alternatives, Roemmele et al. 2011) dataset are available for four related standard languages (Slovenian, Croatian, Serbian, Macedonian) and two out of the three testing dialects (Cerkno, Torlak), the Chakavian dialect serving as a surprise dialect.
Details:
https://sites.google.com/view/vardial-2024/shared-tasks/dialect-copa
Registration:
https://forms.gle/UcLYcPgDFJoiAVip7
Organisers:
Nikola Ljubešić, Jožef Stefan Institute, Ljubljana
Ivan Vulić, University of Cambridge
Goran Glavaš, University of Würzburg
2. DSL-ML - Multi-label classification of similar languages
The DSL-ML task is a multi-label extension of the classic "Discriminating similar languages" task that has been popular with VarDial since the beginnings of the workshop. The motivation behind this new task formulation is that some texts do not present any linguistic markers to unambiguously determine their origin. It therefore makes sense to predict several possible labels for such texts. The 2024 DSL-ML task is based on multi-label conversions of existing datasets from five different macro-languages: English, Spanish, Portuguese, French and BCMS (Bosnian, Croatian, Montenegrin, Serbian).
Details:
https://sites.google.com/view/vardial-2024/shared-tasks/dsl-ml
Registration:
https://forms.gle/UcLYcPgDFJoiAVip7
Organizers:
Adrian Chifu, Aix-Marseille University
Radu Ionescu, University of Bucharest
Aleksandra Miletić, University of Helsinki
Filip Miletić, University of Stuttgart
Yves Scherrer, University of Oslo
Important dates
Shared task announcement and training data release: February 1
Test data release: March 4
Test results submission: March 11
System description paper submission: March 24
Notification of acceptance: April 14
Camera-ready papers due: April 24
VarDial workshop: June 21/22
The Survey of English Usage at University College London will be running the 11th Summer School in English Corpus Linguistics online from 1-3 July 2024.
This Summer School is an accessible and inspiring introductory course in English Corpus Linguistics for students of linguistics and students of the English language.
The course will be taught over three days in the morning (UK time). The course consists of theoretical and practical sessions.
Students are expected to have a basic knowledge of concepts in linguistics, especially grammar.
Places are limited. Be sure to book early to get the early bird rate.
For students in full-time education the course fee includes a free copy of either the ICE-GB Corpus (https://www.ucl.ac.uk/english-usage/projects/ice-gb/) or the DCPSE Corpus (https://www.ucl.ac.uk/english-usage/projects/dcpse/), with the associated exploration software ICECUP.
For more information about the course and how to apply, see:
https://www.ucl.ac.uk/english-usage/summer-school/
Prof. Bas Aarts
Department of English Language and Literature UCL
Grammarianism Blog: http://bit.ly/1d1zKzN
Continuous Professional Development and INSET courses for teachers: https://bit.ly/39qnKIH
Twitter: @UCLEnglishUsage and @EngliciousUCL
Note: I respect your work/life balance. If I send you an email outside of your normal working hours there is no expectation that you will read or respond to the message at that time.
Dear linguists,
We would like to announce that the deadline for the abstract submission of 18th NooJ Call for Paper is now extended to the 14th of February!
The linguistic software- NooJ, is organising its 18th International Conference in Bergamo, italy! This conference is for linguists, scholars, and professionals to engage in thought-provoking discussions on a myriad of topics encompassing Natural Language Processing (NLP), Linguistic Resources, Digital Humanities, and Language in Society.
We are thrilled to invite you to apply for the Call for Papers by the 4th of FEB, which covers the following topics:
📚NLP Societal applications and citizen science:
Typography, Spelling, Syllabification, Phonemic and Prosodic Transcription, Morphology, Lexical Analysis, Local Syntax, Structural Syntax, Transformational Analysis, Paraphrase Generation, Semantic Annotations, Semantic Analysis.
🗣️Linguistic Resources:
Corpus Linguistics, Discourse Analysis, Sentiment analysis, Literature Studies, Second-Language Teaching, Narrative content analysis, Corpus processing for the Social Sciences.
🧠Digital Humanities:
Business Intelligence, Text Mining, Text Generation. Language Teaching Software, Automatic Paraphrasing, Machine Translation, etc.
💻Natural Language Processing Applications:
Computational Socio-Linguistic (migration, geography, tourism, political discourse, cinema, social media, gender studies…)
Important dates!
Abstract Submission: Feb 14 2024
Notification of accept: March 10 2024
Camera ready: March 24 2024
Early bird registrations: From March 11 to March 31st 2024
Deadline for the other registrations: April 15 2024
Selected papers submission: Sept 15 2024
Important links!
NooJ Conference website: https://nooj2024.x-23.org/https://nooj2024.x-23.org/
Submitting the paper via EasyChair: https://easychair.org/conferences/?conf=18njhttps://easychair.org/conferences/?conf=18nj
If you have any questions, feel free to reach out to us.
Best,
The 18th NooJ Organisation Board
_____________
THE 18TH NOOJ INTERNATIONAL CONFERENCE 2024
JUN 4th to 7th, 2024 — Bergamo, Italy
Managed by The Nooj Association
Powered and hosted by X23 Srl
The Natural Language Processing Section at the Department of Computer Science at University of Copenhagen is advertising for two Ph.D Fellowships in Natural Language Processing. The positions are funded by both the Pixel-based Encoding of Language project, which is financed by the Velux Foundations, and the Democratize Trustworthy and Efficient Large Language Model Technology for Europe project funded by the European Union.
The overall goal of the research is to develop and evaluate language models that can process any written language by rendering text as images, which allows the models to learn from the visual similarities between written languages. Realizing this goal includes creating and implementing tokenization-free multilingual language models, the collection and curation of visually diverse language data, the training of small-scale and large-scale models, developing techniques for effective model quantization and compression, creating models that jointly process natural images and rendered text using a single interface, and the evaluation of such models. Your research will contribute to many aspects of this process and require detailed hands-on implementation and evaluation of cutting-edge machine learning models. Further information about the projects are available at https://veluxfoundations.dk/en/about/projects-granted#/0066700000uJIZZAA4 and https://cordis.europa.eu/project/id/101135671.
The successful candidates will join the Language and Multimodal Processing group, which is part of a section with a strong, international, and diverse environment for research within core as well as emerging topics in natural language processing, natural language understanding, computational linguistics and multi-modal language processing. It is housed within the main Science Campus, which is centrally located in Copenhagen. Further information about the group is available here: https://lampgroup.github.io/ and further information about research at the Department is available here: https://di.ku.dk/english/research/.
The application deadline is 29 February 2024, with an expected start date of 1 June 2024, or as soon as possible thereafter. Further information about the position can be found here: https://employment.ku.dk/faculty/?show=160979
Informal enquiries about the positions can be made to Assistant Professor Desmond Elliott, Department of Computer Science, University of Copenhagen, e-mail: de(a)di.ku.dk.
DMR 2024 - 2nd Call for Papers
Timeline
When Tues, May 21
Where Torino, Italy
Mode hybrid
Direct Submission Deadline February 19
ARR Commitment Deadline March 25
Notification of Acceptance March 27
Final Version Due April 8
Workshop site: https://dmr2024.github.io/index.html
DMR 2024 will be co-located with LREC-COLING 2024 (the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation), 20-25 May, 2024 at the Lingotto Conference Centre, Torino, Italy. DMR 2024 will be a hybrid event (real-time virtual participation allowed), but in-person participation is encouraged.
DMR 2024 submission website: https://softconf.com/lrec-coling2024/dmr2024/
LREC-COLING 2024 website: https://lrec-coling-2024.org/
Contact us with questions at dmr.workshop.0(a)gmail.com
Overview
DMR 2024 invites the submissions of long and short papers about original works on meaning representations. As the special theme of DMR 2024, we also invite the submissions of original research that have in any way leveraged, expanded, or been inspired by the “Marthaverse of Meaning”-- the 50 years of gold-standard contributions to the field of NLP by 2023 ACL lifetime Achievement Award recipient, Dr. Martha Palmer.
Broader Goals
DMR intends to bring together researchers who are producers and consumers of meaning representations and, through their interaction, gain a deeper understanding of the key elements of meaning representations that are the most valuable to the NLP community. The workshop will provide an opportunity for meaning representation researchers to present new frameworks and to critically examine existing frameworks with the goal of using their findings to inform the design of next-generation meaning representations. One particular goal is to explore opportunities and identify challenges in the design and use of meaning representations in multilingual settings. Another is to understand the relationship between distributed meaning representations trained on large data sets using network models and the symbolic meaning representations that are carefully designed and annotated by NLP researchers, with an aim of gaining a deeper understanding of areas where each type of meaning representation is the most effective.
Special Theme: A Marthaverse of Meaning
In her 2023 ACL Lifetime Achievement Award acceptance speech, Dr. Martha Palmer (University of Colorado, Boulder) sums up her 50 years of research in AI and NLP in six words: “Finding meaning, quite literally, in words.” This year's workshop honors Dr. Palmer's contributions with a special theme on resources, approaches, and applications that draw upon her manifold contributions to the field. These resources include Treebanks (Chinese and Arabic TreeBanks, Hindi and Urdu Treebanks), PropBanks (English, Chinese and Arabic), VerbNet, OntoNotes, Abstract Meaning Representation (AMR), and Uniform Meaning Representation (UMR). These resources share attention to semantic detail combined with scalability and, therefore, an ability to generalize to and support a variety of different NLP applications and tasks. Indeed, the applicability of her research extends beyond the textual to the multimodal, where she has broadly contributed to the cross-modal event understanding.
DMR 2024 seeks to highlight the depth and the breadth of Dr. Palmer's contributions and their influence over the field of natural language processing by inviting the submission of original works that have in any way leveraged, expanded, or been inspired by the ``Marthaverse of Meaning.'' We also seek to recognize Dr. Palmer's long tenure of dedication to outstanding mentorship that has been so powerful for the many students who have gone on to shape the NLP research community and the field at large.
Topics
The workshop solicits papers that address one or more of the following topics:
Treebanks and the syntax-semantics interface;
PropBanks, VerbNets, and semantic role labeling resources;
OntoNotes and word sense disambiguation resources;
Expansion or pairing of semantic resources with LLMs;
Design and annotation of meaning representations;
Cross-framework comparison of meaning representations;
Automatic parsing of meaning representations;
Automatic generation of text from meaning representations;
Strengths and weaknesses of existing meaning representations exposed as a result of using them in natural language applications or natural language understanding systems;
Use of meaning representations in real-world applications;
Issues in applying meaning representations to multilingual settings;
Issues in bringing multimodality into meaning representations;
The relationship between symbolic meaning representations and distributed semantic representations;
The use of LLMs to create meaning representations
Formal properties of meaning representations;
Any other topics that address the design, processing, and use of meaning representations or Dr. Martha Palmer's contributions to NLP.
Submission Details
Submissions should report original and unpublished research on topics of interest to the workshop. Accepted papers are expected to be presented at the workshop and will be published in the workshop proceedings on the ACL Anthology. They should emphasize obtained results rather than intended work and should clearly indicate the state of completion of the reported results. A paper accepted for presentation at the workshop must not be or have been presented at any other meeting with publicly available proceedings.
Submissions and Templates: Submission is electronic, using the Softconf START conference management system at https://softconf.com/lrec-coling2024/dmr2024/. Submissions must adhere to the two-column LREC-COLING format. Long papers must not exceed eight (8) pages of content and short papers must not exceed four (4) pages of content. If a paper is accepted, the authors will be given an additional page to address reviewers’ comments in the final version. References and appendices do not count against these limits.
When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. Moreover, ELRA encourages all LREC-COLING authors to share the described LRs (data, tools, services, etc.) to enable their reuse and replicability of experiments (including evaluation ones).
We also accept commitments from the ACL Rolling Review (ARR). All ARR commitments to DMR must have received all reviews and meta-reviews by March 25, 2024. For more info on ARR in general, see https://aclrollingreview.org.
Author Responsibilities: Reviewing of papers will be double-blind. Therefore, the paper must not include the authors’ names and affiliations or self-references that reveal any author’s identity–e.g., “We previously showed (Smith, 1991) …” should be replaced with citations such as “Smith (1991) previously showed …”. The submissions should also avoid links to non-anonymized repositories: the code should be either submitted as supplementary material in the final version of the paper, or as a link to an anonymized repository (e.g., Anonymous GitHub or Anonym Share). Papers that do not conform to these requirements will be rejected without review.
If the paper is available as a preprint, this must be indicated on the submission form but not in the paper itself. In addition, DMR 2024, in accordance with LREC-COLING 2024, will follow the same policy as ACL conferences establishing an anonymity period during which non-anonymous posting of preprints is not allowed.
Papers that have been or will be under consideration for other venues at the same time must be declared at submission time. If a paper is accepted for publication at DMR 2024, it must be immediately withdrawn from other venues. If a paper under review at DMR 2024 is accepted elsewhere and authors intend to proceed there, the workshop committee must be notified immediately.
Authors of papers that have been or will be submitted to other meetings or publications must provide this information to the workshop organizers dmr.workshop.0(a)gmail.com. Authors of accepted papers must notify the program chairs within 10 days of acceptance if the paper is withdrawn for any reason.
The Research unit ATILF (Computer Processing and Analysis of the French Language) offers a postdoctoral position in natural language processing (NLP).
Topic: Discovery of multiword expressions, their meaning and their linguistic properties in texts using large language models
Location: ATILF, Nancy, France
Starting date: from April 2024
Duration: 12 months (possibility to extend the duration for one more year)
Supervisors: Mathieu Constant (Univ. Lorraine, France) and Agata Savary (Univ. Paris-Saclay, France)
Salary: depends on experience after PhD and salary grids, from 3070 (<2-year experience) to 4465 euros (>7-year-experience) before tax
Application deadline: 22th February 2024
Subject. The term « multiword expression » refers to a combination of multiple lexical items that displays irregular composition possibly on different linguistic levels (morphology, syntax, semantics, …). They include a large variety of phenomena such as idioms (run around in circles), support verb constructions (take a walk), nominal compounds (dry run), complex function units (in spite of). They have been the subject of extensive research work in the NLP community over the last 50 years.
The goal of this post-doc position is to investigate new methods for discovering multiword expressions, their meaning and their linguistic properties in texts, in order to enrich an induced semantic lexicon with new multiword entries, definitions, argumental structure, and other properties. The emergence of Large Language Models (LLM) opens new promising perspectives for multiword expressions, not only regarding their semantic compositionality but also their linguistic characterization. The methods will be primarily experimented on French, but other languages are also possible.
Context. The position is part of the SELEXINI project (https://selexini.lis-lab.fr <https://selexini.lis-lab.fr/>, 2022-2026) funded by the French National Research Agency (ANR). The goal of the SELEXINI project is to develop next-generation lexicon induction methods for natural language processing. The induced lexicons will not only cluster word usages according to their senses, but also contain multiword expressions, argumental structure, generated definitions, etc, combining the power of large pre-trained language models and existing lexical resources to address the lack of interpretability and diversity in current language technology. The hired researcher will be fully integrated in the project team.
Requirements. Applicants should hold a PhD thesis in computer science, in applied mathematics, in natural language processing, or in computational linguistics.
The hired post-doc researcher should have the following skills:
expertise in deep learning for NLP and notably large language models
excellent programming skills
good linguistic skills
good knowledge of French would be a plus
team spirit
Application. The applicants should submit a cover letter, a CV including their publications, a list of references for recommendation, a transcript of Master grades, on the following official web site: https://emploi.cnrs.fr/Offres/CDD/UMR7118-MATCON-001/Default.aspx?lang=EN <https://emploi.cnrs.fr/Offres/CDD/UMR7118-MATCON-001/Default.aspx?lang=EN>. The applications should be submitted not later than 22th of February 2024.
The Survey of English Usage at University College London will be running the 11th Summer School in English Corpus Linguistics online from 1-3 July 2024.
This Summer School is an accessible and inspiring introductory course in English Corpus Linguistics for students of linguistics and students of the English language.
The course will be taught over three days in the morning (UK time). The course consists of theoretical and practical sessions.
Students are expected to have a basic knowledge of concepts in linguistics, especially grammar.
Places are limited. Be sure to book early to get the early bird rate.
For students in full-time education the course fee includes a free copy of either the ICE-GB Corpus (https://www.ucl.ac.uk/english-usage/projects/ice-gb/) or the DCPSE Corpus (https://www.ucl.ac.uk/english-usage/projects/dcpse/), with the associated exploration software ICECUP.
For more information about the course and how to apply, see:
https://www.ucl.ac.uk/english-usage/summer-school/