An opportunity to join the Assessment Research Group at the British Council as Emerging Researcher: Data Analyst. Full details and link to application here: https://careers.britishcouncil.org/job/London-Emerging-Researcher-Data-Anal…
For any enquiries, please contact Dr Karen Dunn at Karen.Dunn(a)britishcouncil.org<mailto:Karen.Dunn@britishcouncil.org>. Deadline for applications is 8th January.
The British Council is the United Kingdom's international organisation for cultural relations and educational opportunities. A registered charity: 209131 (England and Wales) SC037733 (Scotland). This message is for the use of the intended recipient(s) only and may contain confidential information. If you have received this message in error, please notify the sender and delete it. The British Council accepts no liability for loss or damage caused by viruses and other malware and you are advised to carry out a virus and malware check on any attachments contained in this message.
**With apologies for cross-posting**
The Clinical Linguistics@LLiS Group at the University of Bologna, Italy, invites applications for a research position in NLP for Clinical Linguistics.
The research activities will be carried out within the ReMind project - an ecological, cost-effective AI platform for early detection of prodromal stages of cognitive impairment, founded by MUR with the PRIN 2022 grant (CUP J53D23008380006), under the supervision of Prof. Gloria Gagliardi.
We are looking for a proactive team member with a master’s degree or a PhD in Computer Science/Computational Linguistics. The candidate should have a strong background in corpus linguistics, machine learning and Natural Language Processing and good programming skills (i.e., Python).
What we offer:
- Duration: 18 months
- Salary: 22.293,68 EUR/year (gross), around 1650 EUR/month (net)
- Flexible working hours and home office arrangements
For further information please contact Gloria Gagliardi (gloria.gagliardi(a)unibo.it).
Interested? Apply here: https://bandi.unibo.it/ricerca/assegni-ricerca?id_bando=66888
The Research unit ATILF (Computer Processing and Analysis of the French Language) offers a postdoctoral position in natural language processing (NLP).
Topic: Discovery of multiword expressions, their meaning and their linguistic properties in texts using large language models
Location: ATILF, Nancy, France
Starting date: from February 2024
Duration: 12 months (possibility to extend the duration for one more year)
Supervisors: Mathieu Constant (Univ. Lorraine, France) and Agata Savary (Univ. Paris-Saclay, France)
Salary: depends on experience after PhD and salary grids, from 3070 (<2-year experience) to 4465 euros (>7-year-experience) before tax
Application deadline: 14th December 2023 (extended deadline)
Subject. The term « multiword expression » refers to a combination of multiple lexical items that displays irregular composition possibly on different linguistic levels (morphology, syntax, semantics, …). They include a large variety of phenomena such as idioms (run around in circles), support verb constructions (take a walk), nominal compounds (dry run), complex function units (in spite of). They have been the subject of extensive research work in the NLP community over the last 50 years.
The goal of this post-doc position is to investigate new methods for discovering multiword expressions, their meaning and their linguistic properties in texts, in order to enrich an induced semantic lexicon with new multiword entries, definitions, argumental structure, and other properties. The emergence of Large Language Models (LLM) opens new promising perspectives for multiword expressions, not only regarding their semantic compositionality but also their linguistic characterization. The methods will be primarily experimented on French, but other languages are also possible.
Context. The position is part of the SELEXINI project (https://selexini.lis-lab.fr <https://selexini.lis-lab.fr/>, 2022-2026) funded by the French National Research Agency (ANR). The goal of the SELEXINI project is to develop next-generation lexicon induction methods for natural language processing. The induced lexicons will not only cluster word usages according to their senses, but also contain multiword expressions, argumental structure, generated definitions, etc, combining the power of large pre-trained language models and existing lexical resources to address the lack of interpretability and diversity in current language technology. The hired researcher will be fully integrated in the project team.
Requirements. Applicants should hold a PhD thesis in computer science, in applied mathematics, in natural language processing, or in computational linguistics. Applications from PhD students planning their defense by December 31st, 2023 are also welcome.
The hired post-doc researcher should have the following skills:
expertise in deep learning for NLP and notably large language models
excellent programming skills
good linguistic skills
good knowledge of French would be a plus
team spirit
Application. The applicants should submit a cover letter, a CV including their publications, a list of references for recommendation, a transcript of Master grades, on the following official web site: https://emploi.cnrs.fr/Offres/CDD/UMR7118-SABMAR-017/Default.aspx?Lang=EN <https://emploi.cnrs.fr/Offres/CDD/UMR7118-SABMAR-017/Default.aspx?Lang=EN>. The applications should be submitted not later than December 14.
[Apologies for multiple postings]
We are happy to announce that 3 new monolingual lexicons are now
available in our catalogue.
DiaLEX – Egyptian (DiaLEX-EA)
<https://catalog.elra.info/en-us/repository/browse/ELRA-L0206/>
ISLRN: 697-328-151-668-9 <http://www.islrn.org/resources/697-328-151-668-9>
A comprehensive full-form lexicon of Egyptian Arabic general vocabulary
(DiaLEX-EA) including 78 million entries for 31,000 lemmas with all
inflected forms, enclitics, proclitics, case endings, declensions, and
conjugated forms.
Each entry is accompanied by a full and accurate diacriticization
(vocalization) as well as an extensive coverage of variants. The lexicon
is ideally suited to support natural language processing applications
for Egyptian Arabic, especially
morphological analysis and speech technology.
Quantity and size: 75,204,644 lines / 11,217 MB (11.0 GB)
DiaLEX – Emirati (DiaLEX-UA)
<https://catalog.elra.info/en-us/repository/browse/ELRA-L0207/>
ISLRN: 836-793-503-213-8 <http://www.islrn.org/resources/836-793-503-213-8>
A comprehensive full-form lexicon of Emirati Arabic general vocabulary
(DiaLEX-UA) including 28 million entries for 29,000 lemmas with all
inflected forms, enclitics, proclitics, case endings, declensions, and
conjugated forms.
Each entry is accompanied by a full and accurate diacriticization
(vocalization) as well as an extensive coverage of variants. The lexicon
is ideally suited to support natural language processing applications
for Emirati Arabic, especially
morphological analysis and speech technology.
Quantity and size: 24,976,871 lines / 3,841 MB (3.8 GB)
DiaLEX – Saudi Arabian Hijazi (DiaLEX-HA)
<https://catalog.elra.info/en-us/repository/browse/ELRA-L0208/>
ISLRN: 849-157-479-216-3 <http://www.islrn.org/resources/849-157-479-216-3>
A comprehensive full-form lexicon of Hijazi Arabic general vocabulary
(DiaLEX-HA) including 21 million entries for 30,000 lemmas with all
inflected forms, enclitics, proclitics, case endings, declensions, and
conjugated forms.
Each entry is accompanied by a full and accurate diacriticization
(vocalization) as well as an extensive coverage of variants. The lexicon
is ideally suited to support natural language processing applications
for Hijazi Arabic, especially
morphological analysis and speech technology.
Quantity and size: 20,247,655 lines / 2,835 MB (2.8 GB)
For more information on the catalogue or if you would like to enquire
about having your resources distributed by ELRA, please contact us
<mailto:contact@elda.org>.
_________________________________________
Visit the ELRA Catalogue of Language Resources <http://catalog.elra.info>
Visit the Universal Catalogue <http://universal.elra.info>
Archives
<http://www.elra.info/en/catalogues/language-resources-announcements> of
ELRA Language Resources Catalogue Updates
Release of BabelNet 5.3
https://babelnet.org
We are proud to announce the release of a new version of BabelNet
<https://babelnet.org/> and its programmatic *Java and Python API*,
developed jointly by the Sapienza NLP Group <http://nlp.uniroma1.it>
of *Sapienza
University of Rome* under the supervision of prof. Roberto Navigli
<https://www.diag.uniroma1.it/navigli/> and Babelscape
<http://babelscape.com/>, *a deep-tech multilingual NLP company* providing
innovative solutions for natural language understanding.
BabelNet -- winner of the *prominent paper award 2017* from the Artificial
Intelligence Journal and the META prize 2015, and covered in media such as The
Guardian
<https://www.theguardian.com/news/2018/feb/23/oxford-english-dictionary-can-…>
and Time Magazine
<http://wwwusers.di.uniroma1.it/~navigli/img/Redefining_the_modern_dictionar…>
-- is today's *most far-reaching multilingual lexical-semantic knowledge
graph* which, according to need, can be used as an *encyclopedic dictionary*,
or a *semantic network* or a huge *knowledge base/ontology* e.g. to be
integrated into *deep learning solutions*. It has been used by more than *1000
universities and research institutions*, enabling multilinguality in
several fields of AI and NLP, such as multilingual semantic search, Word
Sense Disambiguation and Entity Linking, Semantic Role Labeling, image
tagging and semantically-enhanced multimodality.
BabelNet was created by means of the seamless integration and interlinking
of the largest multilingual Web encyclopedia - i.e., Wikipedia - with the
most popular computational lexicon of English - i.e., WordNet, and other
lexical-semantic resources such as Wikidata, Wiktionary, OmegaWiki, dozens
of wordnets (including Open English WordNet), GeoNames, and ImageNet. The
BabelNet model is centered around *multilingual synsets*, i.e., concepts
and named entities lexicalized in many languages, and connected with large
amounts of semantic relations.
*Version 5.3* ships with the following features:
- *80 new languages* for a grand total of *600 languages*;
- *23 million synsets* covered;
- *Lemma casing updated in 24 languages*;
- *Wikipedia and Wikidata updated* thanks to BabelNet live (November
2023 dump);
- *Open English WordNet* has been updated to version 2023;
- *Images* associated with synsets have been updated;
- *Wiktionary* has been *updated* and *20k new concepts* have been
integrated (November 2023 dump);
- *Significantly improved cross-lingual resource mapping*, ensuring more
accurate and contextually relevant lexicalizations and translations;
- *General data cleanup* (glosses, senses, Named Entity vs. Concept
labels);
- *Wikipedia multilingual labels updated.*
More statistics are available at: babelnet.org/statistics.
Kind regards,
The BabelNet group
--
==============================================
Roberto Navigli* - Professor*
Department of Computer, Control and Management Engineering
Sapienza University of Rome
Via Ariosto, 25
00185 Roma Italy
Phone: +39 06 77274109
Home Page: https://www.diag.uniroma1.it/navigli/
Sapienza NLP Group: http://nlp.uniroma1.it
Co-founder of Babelscape <https://babelscape.com>
==============================================
Apologies for cross posting
*Fourth Workshop on Language Technology for Equality, Diversity, Inclusion
(LT-EDI-2024) at EACL 2024*
*Website link: https://sites.google.com/view/lt-edi-2024/
<https://sites.google.com/view/lt-edi-2024/>*
Equality, Diversity and Inclusion (EDI) is an important agenda across every
field throughout the world. Language as a major part of communication
should be inclusive and treat everyone with equality. Today’s large
internet community uses language technology (LT) and has a direct impact on
people across the globe. EDI is crucial to ensure everyone is valued and
included, so it is necessary to build LT that serves this purpose. Recent
results have shown that big data and deep learning are entrenching existing
biases and that some algorithms are even naturally biased due to problems
such as ‘regression to the mode’. Our focus is on creating LT that will be
more inclusive of gender, racial, sexual orientation, persons with
disability. The workshop will focus on creating speech and language
technology to address EDI not only in English, but also in less resourced
languages.
The broader objective of LT-EDI-2024 will be
- To investigate challenges related to speech and language resource
creation for EDI.
- To promote research in inclusive LT.
- To adopt and adapt appropriate LT models to suit EDI.
- To provide opportunities for researchers from the LT community around
the world to collaborate with other researchers to identify and propose
possible solutions for the challenges of EDI.
Our workshop theme focuses on being more inclusive and providing a platform
for researchers to create LT of a more inclusive nature. We hope that
through these engagements we can develop LT tools to be more inclusive of
everyone, including marginalized people.
*Call for Papers:*
Our main theme in this workshop is equality, diversity, and inclusivity in
LT. We invite researchers and practitioners to submit papers reporting on
these issues and datasets to avoid these issues. We also encourage
qualitative studies related to these issues and how to avoid them. LT-EDI-
2024 welcomes theoretical and practical paper submissions on any languages
that contribute to research in Equality, Diversity and Inclusion. We will
particularly encourage studies that address either practical application or
improving resources.
*Topics of interest include, but are not limited to:*
- Data set development to include EDI
- Gender inclusivity in LT
- LGBTQ+ inclusivity in LT
- Racial inclusivity in LT
- Persons with disability inclusivity in LT
- Speech and language recognition for minority groups
- Unconscious bias and how to avoid them in natural language processing,
machine learning and other LT technologies.
- Tackling rumours and fake news about gender, racial, and LGBTQ+
minorities.
- Tackling discrimination against gender, racial, and LGBTQ+ minorities.
Submissions:
At LTEDI we accept the following submission types:
- Long paper submissions must describe substantial, original, completed
and unpublished work. Wherever appropriate, concrete evaluation and
analysis should be included. Long papers may consist of up to 8 pages of
content, plus unlimited pages for references and appendices. Upon
acceptance, long papers will be given one additional page of content (i.e.
up to 9 pages) in the proceedings so that reviewers’ comments can be taken
into account.
- Short paper submissions must describe original and unpublished work.
Please note that a short paper is not a shortened long paper. Instead,
short papers should have a point that can be made in a few pages. Short
papers may consist of up to 4 pages of content, plus unlimited references
and appendices. Upon acceptance, short papers will be given one additional
page of content (i.e. up to 5 pages) in the proceedings so that reviewers’
comments can be taken into account.
- Poster and demo submissions should be no longer than 4 pages (plus
unlimited number of pages for references and ethics/broader impact
statement).
More information on submission can be found at
https://sites.google.com/view/lt-edi-2024/submission
For electronic submission of all papers, please use:
https://openreview.net/group?id=eacl.org/EACL/2024/Workshop/LTEDI
*Important Dates*
- Workshop paper due: December 12, 2023
- Direct Submission deadline (pre-reviewed ARR & main conference)
January 17, 2024
- Notification of acceptance: January 15, 2024
- Camera-ready papers due: January 25 2024
- Workshop dates: March 21-22, 2024
with regards,
Dr. Bharathi Raja Chakravarthi,
Assistant Professor / Lecturer-above-the-bar
School of Computer Science, University of Galway, Ireland
Insight SFI Research Centre for Data Analytics, Data Science Institute,
University of Galway, Ireland
E-mail: bharathiraja.akr(a)gmail.com , bharathi.raja(a)universityofgalway.ie
<bharathiraja.asokachakravarthi(a)universityofgalway.ie>
Google Scholar: https://scholar.google.com/citations?user=irCl028AAAAJ&hl=en
Website:
https://www.universityofgalway.ie/our-research/people/computer-science/bhar…
<https://www.universityofgalway.ie/our-research/people/computer-science/bhar…>
With great enthusiasm, we announce the first edition of the *Natural
Language Processing (NLP) Workshop for Indigenous Languages of Lusophone
Countries*.
The workshop aims to explore, discuss, and enhance the development of
resources, methods, and applications of NLP for indigenous languages,
especially those spoken or that have influenced languages spoken in
countries where Portuguese is currently the official language. We hope to
contribute to the preservation and promotion of these languages. The
workshop will be held in conjunction with PROPOR 2024.
*Workshop Date: 13 or 14 March 2024, in Santiago de Compostela.*
*Paper Submission: 5 January 2024 AoE.*
*More information: *https://sites.google.com/view/illc-nlp-2024/home
This event aims to expand knowledge and research in NLP for
underrepresented languages. We encourage the participation of everyone who
shares an interest in preserving and enriching the linguistic and cultural
heritage of indigenous languages in a broad sense. This way, we welcome the
submission of works including languages from all Portuguese-speaking
nations, like those of African origin in Angola, Mozambique, and the
Atlantic islands, as well as minority languages in Portugal.
Please help us spread the word about this event by sharing this call with
your contacts and institutions. Your participation and support are crucial
for the success of this workshop.
We are excited to see your contributions and active participation.
Sincerely,
Aline Paes, Aline Villavicencio, Claudio Pinhanez, Edward Gow-Smith, Paulo
Rodrigo Cavalin (Workshop organisers)
-------------------------------------------------------------------------------------------------
*Profa. Dra. Aline Paes (she/her)*
*Associate professor - Computer Science (Artificial Intelligence)*
Institute of Computing / Universidade Federal Fluminense (IC/UFF)
Member of CE-PLN <https://sites.google.com/view/ce-pln/inicio> and BPLN
<https://brasileiraspln.com/>
CNPq PQ-2 and FAPERJ JCNE
__________________________________________________________
url: www.ic.uff.br/~alinepaes
Av Gal Milton Tavares de Souza, S/N, Computing Building, Office 504
São Domingos, Niterói, RJ, Brazil. ZIP 24210-346
-------------------------------------------------------------------------------------------------
****Please do not feel any pressure to respond out of your own regular
working hours. Remember that this is supposed to be an asynchronous tool***
*** First Call for Workshop Papers ***
36th International Conference on Advanced Information Systems Engineering
(CAiSE'24)
June 3-7, 2024, 5* St. Raphael Resort and Marina, Limassol, Cyprus
https://cyprusconferences.org/caise2024/
(*** Submission Deadline: 26th February, 2024 AoE ***)
CAiSE is a well-established, highly visible conference series on Advanced Information Systems
(IS) Engineering. It covers all relevant topics in the area, including methodologies and
approaches for IS engineering, innovative platforms, architectures and technologies, and
engineering of specific kinds of IS. CAiSE conferences also have the tradition of hosting
workshops in related fields. Workshops are intended to focus on particular topics and provide
ample room for discussions of new ideas and developments.
CAiSE'24, the 36th edition of the CAiSE series, will host the following workshops. For more
information including important dates for each workshop, please visit the workshops' web
sites.
CAiSE'24 Workshops
• 3rd International Workshop on Agile Methods for Information Systems Engineering (Agil-ISE)
https://agilise.github.io/2024/index.html
• International Workshop on Blockchain for Information Systems (BC4IS24) and Blockchain for
Trusted Data Sharing (B4TDS)
https://pros.unicam.it/bc4isb4tds/
• 2nd International Workshop on Hybrid Artificial Intelligence and Enterprise Modelling for
Intelligent Information Systems (HybridAIMS)
https://hybridaims.com/
• 2nd Workshop on Knowledge Graphs for Semantics-driven Systems Engineering
https://www.omilab.org/activities/events/caise2024_kg4sdse/
• 16th International Workshop on Enterprise & Organizational Modeling and Simulation
(EOMAS 2024)
https://eomas2024.fel.cvut.cz/
• Digital Transformation with Business Process Mining (DigPro2024)
https://digpro.iiita.ac.in/
IMPORTANT DATES
• Paper Submission Deadline: 26th February, 2024 (AoE)
• Notification of Acceptance: 27th March, 2024
• Camera-ready Deadline: 5th April, 2024
• Author Registration Deadline: 5th April, 2024
Workshop Chairs
• João Paulo A. Almeida, Federal University of Espírito Santo, Brazil
• Claudio di Ciccio, Sapienza University of Rome, Italy
• Christos Kalloniatis, University of the Aegean, Greece
Dear colleagues:
We invite participants to a three-day winter school on large-scale neural NLP research – with special emphasis on language modeling for non-English languages – using massive Web data. The school will provide lectures and space for discussion by, among others, Afra Alishahi (Tilburg University), Desmond Elliot (University of Copenhagen), Aurélie Névéol (LISN, CNRS), and a few more international experts.
The winter school is organized as a collaboration between the Horizon Europe project High-Performance Language Technologies (HPLT) and the Nordic Language Processing Laboratory (NLPL). The event will be held ‘in real life’ on February 4–6, 2024, in Norway. For additional information, please see:
http://wiki.nlpl.eu/Community/training
There is no participant fee for the winter school, and HPLT will provide free bus transfer between the Oslo airport and the conference hotel (about two hours north of Oslo, with skiing facilities just outside the door). Participants will need to cover their own travel to Oslo and accommodation at the hotel (NOK 3745 for two nights in a single room, including all meals and conference facilities).
We kindly invite expressions of interest in participation in the winter school. Please register through the on-line form linked up from the above overview page. We will process requests for participation on a first-come, first-served basis, with an eye toward regional balance. Participation will be confirmed in three batches, one on December 8, another one on December 15, and finally after the closing date for registration, which is Thursday, December 22, 2023.
Welcome to Skeikampen in February 2024!
Andrey Kutuzov & Stephan Oepen (for the organizing team)