Dear Colleagues and Friends,
We welcome submissions to the special issue of the Cambridge University
Press journal
<https://www.cambridge.org/core/journals/natural-language-engineering>Natural
Language Engineering
<https://www.cambridge.org/core/journals/natural-language-engineering> on
Natural Language Processing Application on Low-Resource Languages.
*Cambridge NLE - Special Issue*
*Natural Language Processing Applications for Low-Resource Languages*
*The deadline for submissions is November 30, 2022.*
*Call for Papers*
We welcome papers dealing with one or more of the following topics:
● *Machine Translation for Low-Resource Languages*
● *Caption Generation for Low-Resource Languages*
● *POS Tagging on Low-Resource Languages*
● *Optical character recognition and generation of Low-Resource
Languages.*
● *Chatbot assistant for Low-Resource Languages*
● *Speech/Language recognition*
● *Regional music identification and classification*
● *Textual Entailment of Low-Resource Languages*
● *Face-to-Face Translation of Low-Resource Languages*
● *News/Social media analysis*
● *Text prediction and suggestion*
● *Sentiment analysis*
● *Text classification*
*Important Dates*
*Deadline for submissions*: November 30, 2022
*First-round author notification*: Between March 30, 2023 and April 30, 2023
*Submission of revised versions*: Between June 30, 2023 and July 30, 2023
*Second-round author notification (final)*: Between August 30, 2023 and
September 30, 2023
*Camera-ready versions*: Between Oct 01, 2023 and Oct 30, 2023
*Submissions*
Instructions for preparing your manuscript for the *Journal of Natural
Language Engineering* are available
<https://www.cambridge.org/core/journals/natural-language-engineering/inform…>
here
<https://www.cambridge.org/core/journals/natural-language-engineering/inform…>
.
Please submit your article through the NLE
<https://mc.manuscriptcentral.com/nle>manuscript submission system
<https://mc.manuscriptcentral.com/nle>. When submitting your manuscript,
please select *Natural Language Processing Applications for Low-Resource
Languages *in the field *Special Issue Designation.*
*Guest Editors*
*Dr. Partha Pakray, National Institute of Technology Silchar*
*Prof. Alexander Gelbukh, Instituto Politécnico Nacional, Mexico City,
Mexico*
*Prof. Sivaji Bandyopadhyay, National Institute of Technology Silchar*
Yours Cordially,
Professor Sivaji Bandyopadhyay
Guest Editor
Director, National Institute of Technology(NIT) Silchar (Assam)
Professor, Computer Science and Engineering Department
NIT Silchar (Assam)
Professor, Computer Science and Engineering Department (on lien)
Jadavpur University, Kolkata - 700032
INDIA
*Natural Language Processing acceleration: foundations and applications**
**Two complementary courses on Deep Learning and NLP*
January and February, 2023. San Sebastian, Spain
ONSITE ATTENDANCE ONLY
HANDS-ON, YOU LEARN TO IMPLEMENT THE MODELS
http://ixa.eus/NLPacceleration
Deep Learning neural network models have been successfully applied to
natural language processing, and are now changing radically how we
interact with machines (Siri, Alexa, machine translation or the Google
search engine). Large Language Models are at the core of these
developments, and are being used to crack "languages" in other
disciplines, ranging from programming languages (Copilot) to proteins
(AlphaFold) and gene sequences (GenSLM).
You can take either one or both courses:
Deep Learning for NLP <https://ixa.eus/dl4nlp> (January 9th to 27th, 35
hours, 14 evenings. 11th edition):
This course introduces in detail the machinery that makes Deep Learning
work for NLP, including the latest transformers and large language
models like GPT, BERT and T5. It also covers the use of prompts for
zero-shot and few-shot learning, as well as multimodal text-image models
like DALL-E. The course combines theoretical and practical hands-on
classes. Attendants will be able to understand the internal working of
the models, and implement them nearly from scratch in Tensorflow. The
aim is to allow attendees to acquire the ability to understand, modify
and apply current and future Deep Learning models to NLP and other areas.
Introduction to LT Applications <https://ixa.eus/iltapp> (July 11th to
15th, 20 hours, 5 afternoons. 3rd edition):
This course introduces the main techniques to build applications, and
apply them to core Language Technology applications such us Opinion
Mining, Named Entity Recognition, Fake News Detection or Question
Answering. The course will have a practical focus, learning to use
readily available LT toolkits (Spacy, Flair, etc.). The aim is to allow
attendees to acquire the required autonomy to apply LT technologies to
target applications.
*Registration* has to be done for each course, see respective websites.
Note that you only need to pay the insurance for one of the courses.
Do you have a PhD in Computational Linguistics or related? Would you like to develop text mining methodology, and improve the public health research, policy advice and communication of RIVM? Please apply at Vrije Universiteit Amsterdam (VU).
Location: AMSTERDAM/BILTHOVEN
FTE: 0.8 - 1
Job description
The project you are working on as a postdoctoral researcher, is part of a research project funded by the National Institute for Public Health and the Environment (RIVM). Our goal is to use text mining methodologies to enable systematic insight in suggestions from citizens and professionals on specific RIVM themes. You work partly at the Social AI research group at VU, and partly at the National Coordination Centre for Communicable Disease Control (LCI) of RIVM. We offer you the opportunity to further develop your research skills, with a rich data infrastructure and the relevant expertise and guidance in the group. You get to apply Natural Language Processing (NLP) in practical use cases of RIVM, including end user evaluations.
Every day RIVM receives hundreds of questions, comments and suggestions of citizens and professionals about the various topics that RIVM deals with. These many questions, comments and suggestions have the potential to provide RIVM with relevant societal feedback loops to better adapt its work to societal needs. As a postdoctoral researcher on this project, you set out to develop and validate text mining methodology, both supervised and unsupervised. Your methodologies are applicable to different data sources (including Social Media and internal platforms), themes and user groups (e.g.: citizens and health professionals). The predictions need to be presented in a suitable way to the envisioned end users: researchers, policy makers, medical doctors and communication experts at RIVM. These methods may target attitudes, beliefs, sentiments, needs, suggestions and experiences of professionals and citizens. At the end of the project, a multidisciplinary RIVM team may use them to support and enhance RIVM research, policy advice and communication. We also intend to publish our research results in international peer-reviewed (open access) journals.
Your duties
* doing high-quality application-oriented research and publish this research in international conferences with a focus on NLP
* contributing to text mining education for staff members at RIVM
* developing (re-)usable text mining code and tools
Requirements
* you hold, or will soon hold, a PhD degree in data science, computer science, artificial intelligence, information retrieval, computational linguistics, or a related field
* you have experience with applying Natural Language Processing
* you have a strong research potential, preferably evidenced by publications
* you have an interest in the application domain, i.e., public health and health communication
* you are proactive and a good communicator; creative and engaged
* you have the ability to work independently and in a team
* you can work together well with project members that have different expertise
* you are a fluent speaker / writer of English
What are we offering?
A challenging position in a socially involved organization. On full-time basis the remuneration amounts to a minimum gross monthly salary of €2,960 (scale 10) and a maximum €4,670 (scale 10), depending on your education and experience. The job profile is based on the university job ranking system and is vacant for at least 0.8 FTE.
The appointment will initially be for 1 year. After a satisfactory evaluation of the initial appointment, the contract will be extended for a total duration of 2.5 years
Additionally, Vrije Universiteit Amsterdam offers excellent fringe benefits and various schemes and regulations to promote a good work/life balance, such as:
* a maximum of 41 days of annual leave based on full-time employment
* 8% holiday allowance and 8.3% end-of-year bonus
* solid pension scheme (ABP)
* contribution to commuting expenses
* optional model for designing a personalized benefits package
You will have a (physical) workplace both at RIVM (in Bilthoven) and at Vrije Universiteit in Amsterdam.
Application
Are you interested in this position? Please apply via https://workingat.vu.nl/ad/postdoctoral-research-position-developing-text-m… and upload your curriculum vitae and cover letter until 07/12/2022.
Florian Kunneman
Assistant Professor at the department of Computer Science | Vrije Universiteit Amsterdam
AI Master Thesis coordinator
Working days: Mo, Tu, Th, Fr
Room 11A85 (NU Building) | T: +31.20.59 84452 | M: +31.6.33726175
fkunneman.github.io
Dear all,
It is my great pleasure to announce that we have a special issue on
"Multimodal Processing and Robotics for Dialogue Systems" in the
journal of Advanced Robotics (Taylor & Francis). Please consider
submitting your original paper to this special issue. The deadline is
31 Jan. 2023.
https://think.taylorandfrancis.com/special_issues/advanced-robotics-multimo…https://www.rsj.or.jp/content/files/pub/ar/CFP/CFP_37_21.pdf
I would also appreciate it if you could help distribute the CFP and
encourage your colleagues to make submissions.
Best regards,
Ryuichiro Higashinaka
on behalf of guest editors
/////////////////////////////////////////////////////////////////////////
[Call for Papers]
Advanced Robotics Special Issue on
Multimodal Processing and Robotics for Dialogue Systems
Co-Editors:
Prof. David Traum (University of Southern California, USA)
Prof. Gabriel Skantze (KTH Royal Institute of Technology, Sweden)
Prof. Hiromitsu Nishizaki (University of Yamanashi, Japan)
Prof. Ryuichiro Higashinaka (Nagoya University, Japan)
Dr. Takashi Minato (RIKEN/ATR, Japan)
Prof. Takayuki Nagai (Osaka University, Japan)
Publication in Vol. 37, Issue 21 (Nov 2023)
SUBMISSION DEADLINE: 31 Jan 2023
In recent years, as seen in smart speakers such as Google Home and
Amazon Alexa, there has been remarkable progress in spoken dialogue
systems technology to converse with users with human-like utterances.
In the future, such dialogue systems are expected to support our daily
activities in various ways. However, dialogue in daily activities is
more complex than that with smart speakers; even with current spoken
dialogue technology, it is still difficult to maintain a successful
dialogue in various situations. For example, in customer service
through dialogue, it is necessary for operators to respond
appropriately to the different ways of speaking and requests of
various customers. In such cases, we humans can switch the speaking
manner depending on the type of customer, and we can successfully
perform the dialogue by not only using our voice but also our gaze and
facial expressions.
This type of human-like interaction is far from possible with the
existing spoken dialogue systems. Humanoid robots have the possibility
to realize such an interaction, because they can recognize not only
the user's voice but also facial expressions and gestures using
various sensors, and can express themselves in various ways such as
gestures and facial expressions using their bodies. Their many means
of expressions have the potential to successfully continue dialogue in
a manner different from conventional dialogue systems.
The combination of such robots and dialogue systems can greatly expand
the possibilities of dialogue systems, while at the same time,
providing a variety of new challenges. Various research and
development efforts are currently underway to address these new
challenges, including "dialogue robot competition" at IROS2022.
In this special issue, we invite a wide range of papers on multimodal
dialogue systems and dialogue robots, their applications, and
fundamental research. Prospective contributed papers are invited to
cover, but are not limited to, the following topics on multimodal
dialogue systems and robots:
*Spoken dialogue processing
*Multimodal processing
*Speech recognition
*Text-to-speech
*Emotion recognition
*Motion generation
*Facial expression generation
*System architecture
*Natural language processing
*Knowledge representation
*Benchmarking
*Evaluation method
*Ethics
*Dialogue systems and robots for competition
Submission:
The full-length manuscript (either PDF file or MS word file) should be
sent by 31st Jan 2023 to the office of Advanced Robotics, the Robotics
Society of Japan through the on-line submission system of the journal
(https://www.rsj.or.jp/AR/submission). Sample manuscript templates and
detailed instructions for authors are available at the website of the
journal.
Dear Corpora members,
We would be grateful if you could share this call for applications within
your network.
Applications for the 2023-25 edition of the EMILDAI programme are now open.
The European Master in Law, Data and Artificial Intelligence (EMILDAI
<http://www.emildai.eu/>) is an international interdisciplinary programme
offered by Dublin City University, Avignon Université, Universidad de León
and Università di Pisa. The master is funded by the Erasmus Mundus Joint
Master programme of the European Union.
This year we offer *23 full scholarships* (1400€/month x 24 months), up to
6 full fee waivers and up to 2 partial fee waivers. We are seeking
excellent students from all nationalities with a degree in *computing* or
*law*, and a knowledge of English and either French or Spanish.
Applications will close on 20 January 2023.
All information on how to apply is available on www.emildai.eu.
Best Regards,
--
Sheila Castilho
--
__
Séanadh Ríomhphoist/_
Email Disclaimer__
**
Tá an ríomhphost seo agus
aon chomhad a sheoltar leis faoi rún agus is lena úsáid ag an seolaí agus
sin amháin é. Is féidir tuilleadh a léamh anseo.
<https://www.dcu.ie/iss/seanadh-riomhphoist.shtml>
<https://www4.dcu.ie/iss/seanadh-riomhphoist.shtml>*
_
This e-mail and any
files transmitted with it are confidential and are intended solely for use
by the addressee. Read more here.
<https://www.dcu.ie/iss/email-disclaimer.shtml> _
*_
*******We apologize for the multiple copies of this e-mail******
*Call for papers for the issue 70 of the journal Procesamiento del
Lenguaje Natural*
http://www.sepln.org/en/journalhttp://www.sepln.org/en/journal/author-guidelines
--------------------------------------------**
*Abstract (TL;DR)*:
*Submission platform*: http://myreview.sepln.org/myreview-sepln70/
*Submission deadline*: December 2nd, 2022.
*Notification of acceptance*: January 31st, 2023.
*Camera-ready*: February 6th, 2023.
*Published*: March 2023.
--------------------------------------------*
*
*Introduction*
The aim of the journal Procesamiento del Lenguaje Natural is to provide
a forum for the publication of scientific-technical articles in the
field of Natural Language Processing (NLP), for both the national and
international scientific community. The articles must be unpublished and
cannot be simultaneously submitted for publication in other journals or
conference proceedings. The journal also aims to promote the development
of areas related to NLP, disseminate research carried out, identify
future guidelines for basic research, and present software applications
in this field. Every year the Sociedad Española de Procesamiento del
Lenguaje Natural (SEPLN) (Spanish Society for the Natural Language
Processing) publishes two issues of the journal, including original
articles, presentations of R&D projects, book reviews and summaries of
PhD theses.
The scientific quality of the Journal is supported by the 2021 JCR index
(JCI: 0.21, Q4-Linguistics - ESCI), the SCImago Journal Ranking (SJR:
0.217, Q4-Computer Science Applications, Q2-Linguistics and Language),
the Scopus Index (CiteScore: 1.5, Q4-Computer Science Applications,
Q2-Linguistics and Language) and the index SNIP (Source Normalized
Impact per Paper) with 0.37 points. More information at:
http://www.sepln.org/en/journal/quality.
*Topics*
* Linguistic, mathematical and psycholinguistic models of language
* Machine learning in NLP
* Computational lexicography and terminology
* Corpus linguistics
* Development of linguistic resources and tools
* Grammars and formalisms for morphological and syntactic analysis
* Semantics, pragmatics and discourse
* Word sense disambiguation
* Monolingual and multilingual text generation
* Machine translation
* Knowledge and common sense
* Multimodality
* Speech synthesis and recognition
* Dialogue systems and interactive systems/ Conversational assistants
* Audio indexing and retrieval
* Monolingual and multilingual information extraction and retrieval
* Question answering systems
* Evaluation of NLP systems
* Automatic textual content analysis
* Sentiment analysis, opinion mining and argument mining
* Plagiarism detection
* Negation and speculation processing
* Text mining in blogosphere and social networks
* Text summarization
* Text simplification
* Image retrieval
* NLP in biomedical domain
* NLP-based generation of teaching resources
* NLP for languages with limited resources
* NLP industrial applications
* Low-resource NLP tasks, data augmentation
*Submission Information*
The proposal must be submitted by *December 2nd, 2022* and must meet
certain format and style requirements.
All submissions must be in PDF format and submitted electronically using
the Myreview system available at:
*http://myreview.sepln.org/myreview-sepln70*.
Submitted papers will be subjected to a blind review by at least three
members of the program committee.
*Categories of papers*
* Regular papers with original contributions.
* Summary of PhD thesis.
*Information for Authors*
The proposals can be written in Spanish or English and should be at most
10 A4-size pages of content, plus unlimited pages for references, and 4
pages maximum for summaries of PhD theses.
The papers must include the following sections:
* The title of the communication (in English and Spanish).
* An abstract in English and Spanish (maximum 150 words).
* A list of keywords or related topics (in English and Spanish).
* The documents must not include headers or footers.
As reviewing will be blind, the paper should not include the authors’
names and affiliation. Furthermore, self-references that reveal the
author’s identity should be avoided. The articles should only include
the title, the abstract, the keywords and the proposal.
We recommend using the LaTeX and Word templates that can be downloaded
from the SEPLN web (author guidelines have been updated):
http://www.sepln.org/index.php/en/journal/author-guidelines
*Note on camera ready*
The final version of the paper (camera ready) should be submitted
together with a cover letter explaining how the suggestions of the
reviewers were implemented in the final version. This cover letter will
be considered in order to accept or finally reject the selected paper.
*Preprint policy*
The Journal allows the publication of preprints (non-refereed paper
posted online, such as ArXiv) anytime, but during the review period the
preprint must indicate that the paper it is “under review” in the
Journal Procesamiento del Lenguaje Natural. Likewise, if the paper is
accepted, the preprint must be updated with the DOI, name of the Journal
and the bibliographic information of the paper.
*Important dates*
* Submission deadline: 2 December 2022
* Notification of acceptance: 31 January 2023
* Camera ready: 6 February 2023
* Publication: March 2023
Contact person: Eugenio Martínez Cámara (emcamara(a)decsai.ugr.es)
Editorial Committee of the Procesamiento del Lenguaje Natural
--------------------------------------------------------------------------------------------------------------------
***********Disculpen si reciben varias copias de este mensaje ************
Por favor, si lo considera oportuno, distribuya este llamamiento entre
sus colegas.
*Petición de artículos para la revista Procesamiento del Lenguaje
Natural nº 70.*
*http://www.sepln.org/la-revista
http://www.sepln.org/la-revista/informacion-para-autores*
--------------------------------------------**
*Resumen*:
*Plataforma de envío*: http://myreview.sepln.org/myreview-sepln70/*.
Fecha final para el envío de trabajos*: 2 de diciembre de 2022.*
**Notificación de aceptación*: 31 de enero de 2023.*
Camera Ready*: 06 de febrero de 2023.*
Publicación nº 70*: Marzo 2023.
--------------------------------------------
*Objetivos de la revista*
La revista Procesamiento del Lenguaje Natural es un foro de publicación
de artículos científico-técnicos en el ámbito del Procesamiento del
Lenguaje Natural (PLN), tanto para la comunidad científica nacional como
internacional. Los artículos tienen que ser inéditos y no haber sido
postulados para ser publicados simultáneamente en otras revistas o actas
de congresos. La revista quiere potenciar el desarrollo de las
diferentes áreas relacionadas con el PLN, mejorar la divulgación de las
investigaciones que se llevan a cabo, identificar las futuras
directrices de la investigación básica y mostrar las posibilidades
reales de aplicación en este campo. Anualmente la SEPLN (Sociedad
Española para el Procesamiento del Lenguaje Natural) publica dos números
de la revista, que incluyen artículos originales, presentaciones de
proyectos, reseñas bibliográficas y resúmenes de tesis doctorales.
La calidad científica de la Revista está respaldada por el índice del
JCR 2021 (JCI: 0,21, Q4-Linguistics - ESCI), el índice SCImago Journal
Ranking (SJR: 0,217, Q4-Computer Science Applications, Q2-Linguistics
and Language), el índice de Scopus (CiteScore: 1,5, Q4-Computer Science
Applications, Q2-Linguistics and Language) y el índice SNIP (Source
Normalized Impact per Paper) con 0,37 puntos. Más información en
http://www.sepln.org/la-revista/calidad.
*Áreas temáticas*
* Modelos de lenguaje matemáticos y psicolingüísticos
* Aprendizaje automático en PLN
* Lexicografía y terminología computacional
* Lingüística de corpus
* Desarrollo de recursos y herramientas lingüísticas
* Gramáticas y formalismos para análisis morfológico y sintáctico
* Semántica, pragmática y discurso
* Resolución de ambigüedad léxico-semántica
* Generación de texto monolingüe y multilingüe
* Traducción automática
* Multimodalidad
* Reconocimiento y síntesis de habla
* Sistemas de diálogo/ asistentes conversacionales
* Auto-indexación
* Recuperación y extracción de información monolingüe y multilingüe
* Sistemas de búsqueda de respuestas
* Evaluación de sistemas de PLN
* Análisis automático de contenido textual
* Análisis de sentimiento y minería de opiniones
* Detección de plagio
* Procesamiento de la negación y la especulación
* Minería de texto en la blogosfera y las redes sociales
* Resumen automático de texto
* Simplificación de texto
* Recuperación de imágenes
* Conocimiento y sentido común
* PLN en el ámbito biomédico
* Generación de recursos didácticos basada en PLN
* PLN para lenguas con recursos limitados
* Aplicaciones industriales del PLN
* Tratamiento del Lenguaje Hablado
*Envío de trabajos*
Las propuestas de trabajos (artículos y resúmenes de tesis) podrán ser
enviadas hasta la fecha límite del *2 de diciembre de 2022*.
El envío y la revisión de las propuestas se realizarán exclusivamente en
formato PDF y se gestionarán a través del sistema Myreview:
*http://myreview.sepln.org/myreview-sepln70/*.
La evaluación de los trabajos pasará por un proceso de revisión ciego
realizado como mínimo por tres miembros del consejo asesor de la SEPLN.
*Tipos de trabajos*
* Artículos sobre contribuciones originales.
* Reseñas de tesis doctorales.
*Instrucciones para los Autores*
Los trabajos pueden estar escritos en español o en inglés y su longitud
máxima será de 10 páginas de contenido más un número ilimitado de
páginas de referencias para los artículos científicos, y de un máximo de
4 páginas para los resúmenes de tesis.
Las propuestas deben contener los siguientes apartados:
* El título del artículo (en español e inglés).
* Un resumen en español y un abstract en inglés de un máximo de 150
palabras.
* Un listado de temas relacionados o palabras clave (en español e inglés).
* Los documentos no podrán incluir cabeceras ni pies de página.
Como la fase de revisión de los trabajos es ciega, en los artículos que
se envíen no se debe incluir ninguna referencia a los autores ni
referencias propias que revelen la identidad de los mismos. Todas las
contribuciones deben contener únicamente el título, el resumen, las
palabras claves y la propuesta.
En el caso de los resúmenes de tesis, el anonimato no es necesario.
Los trabajos deben seguir el formato de las revistas de la SEPLN
disponible en la siguiente dirección:
http://www.sepln.org/la-revista/informacion-para-autores
Las guías se han actualizado, por favor, utilicen las que están
disponibles en la página web de la revista.
*Nota sobre la versión final*
La versión final del trabajo (camera ready) debe enviarse con un
documento en el que se explique cómo se han implementado las sugerencias
de los revisores. Dicho documento se tendrá en cuenta para aceptar o
rechazar el trabajo en cuestión.
*Política de prepublicación*
La revista permite publicar una versión no revisada de los artículos en
plataformas de prepublicación (plataformas de artículos no evaluados
como ArXiv). Sin embargo, durante el periodo de revisión se debe indicar
que el artículo está “en revisión” en la revista Procesamiento del
Lenguaje Natural. Si el artículo es aceptado, se debe actualizar la
publicación en la plataforma de prepublicación con el DOI, nombre de la
revista y la información bibliográfica del artículo.
*Fechas importantes*
* Envío de trabajos: 12 de
diciembre de 2022
* Notificación de aceptación o rechazo: 31 de enero de 2023
* Versión final: 6 de
febrero de 2023
* Publicación: Marzo de 2023
Persona de contacto: Eugenio Martínez Cámara (emcamara(a)decsai.ugr.es)
Consejo de redacción de la revista Procesamiento del Lenguaje Natural.
--
Eugenio Martínez Cámara
Profesor Ayudante Doctor | Junior Lecturer
DaSCI, Instituto Andaluz de Inteligencia Artificial | DaSCI, Andalusian Institute in Artificial Intelligence.
Dpto. Ciencias de la Computación e Inteligencia Artificial | Computer Science and Artificial Intelligence department.
Universidad de Granada
The GroNLP group is further expanding and we are looking for enthusiastic
assistant professors in any (or related to) the areas of speech and
language technology, artificial intelligence, data science, or
computational social science. Follow the link for more information!
https://www.rug.nl/about-ug/work-with-us/job-opportunities/?details=00347-0…
Closing date: 10 Jan 2023
Feel free to contact me to get more information about these positions!
Best regards,
Johan Bos
We invite you to participate in SemEval-2023 Task 2: *Multi*lingual *Co*
mplex *N*amed *E*ntity *R*ecognition (MultiCoNER) II.
*Task Website:* https://multiconer.github.io/
This task focuses on the *fine-grained* detection of complex entities, such
as movie, book, music and product titles, in low context settings (short
and uncased text).
The task provides data in 12 language. Here are some examples in different
languages where entities are enclosed inside brackets with their type:
- *English: [wes anderson | Artist]*'s film *[the grand budapest hotel |
VisualWork]* opened the festival .
- *Spanish:* fue superado por el [aon center | Facility] de [los ángeles
| HumanSettlement] .
- *Ukranian:* назва альбому походить з роману « *[кінець дитинства |
WrittenWork]* » англійського письменника* [артура кларка | Artist]* .
- *Swedish: [tom hamilton | Artist]* amerikansk musiker basist i *[aerosmith
| MusicalGRP]* .
- *Portuguese:* também é utilizado para se fazer *[licor | Drink]*
e *[vinhos
| Drink]*.
- *Hindi:* १७९६ में उन्हें *[शाही स्वीडिश विज्ञान अकादमी | Facility]* का
सदस्य चुना गया।
- *French:* l *[amiral de coligny | Politician]* réussit à s y glisser .
- *German:* in *[frühgeborenes | Disease]* führt dies zu *[irds |
Symptom]* .
- *Bangla [লিটল মিক্স | MusicalGrp]* এ যোগদানের আগে তিনি *[পিৎজা হাট |
ORG]* এ ওয়েট্রেস হিসাবে কাজ করেছিলেন।
- *Italian*: è conservato nel [rijksmuseum | Facility] di [amsterdam |
HumanSettlement] .
- *Chinese:* 它的纤维穿过 [锁骨 | AnatomicalStructure] 并沿颈部侧面倾斜向上和内侧.
- *Farsi: *مرکزاین استان شهر [ناگویا |HumanSettlement] است
Additionally, a *multilingual NER track* is also offered for multilingual
systems that can process all languages.
The task focuses on detecting semantically ambiguous and complex entities
in short and low-context settings. Participants are welcome to build
NER systems
for any number of languages. And we encourage to aim for a bigger challenge
of building NER systems for multiple languages.
We have released training data for 12 languages along with a baseline
system to start with. Participants can submit their system for one language
but are encouraged to aim for a bigger challenge and build multi-lingual NER
systems.
*Task Website:* https://multiconer.github.io/
*Mailing List:* multiconer-semeval(a)googlegroups.com
*Slack Workspace:* https://join.slack.com/t/multiconer
/shared_invite/zt-vi3g97cx-MpqTvS07XX22S78nRC2s0Q
*Training Data:* https://multiconer.github.io/dataset
*Baseline System:* https://multiconer.github.io/baseline
*Shared task schedule:*
- Evaluation start: mid-January, 2022
- Evaluation end: by January 31, 2023 (latest date; task organizers may
choose an earlier date)
- System description paper submissions due: February 1, 2023
- Notification to authors: March 1, 2023
*Task organizers*
- Shervin Malmasi (Amazon)
- Besnik Fetahu (Amazon)
- Sudipta Kar (Amazon)
Please reach out to the organizers at multiconer
-semeval-organizers(a)googlegroups.com, or join the Slack workspace to
connect with the other participants and organizers.
Applications are invited for a 1-year research scholarship in English Linguistics at the Department of Foreign Languages and Literatures, University of Verona, within the project “Interconnected Nord-Est Innovation Ecosystem (iNEST)”, financed by NextGenerationEU in the context of the National Recovery and Resilience Plan (NPRR).
The project entails the compilation and analysis of a corpus of English-language texts promoting and describing prominent urban and extra-urban destinations in the Veneto region of Italy as well as the creation of a set of guidelines for tourism promotion. Experience in corpus linguistics, tourism discourse and discourse analysis, among others, are requested from applicants.
The closing date for applications is 30th November.
For more information about the research project, requirements and application process, you can find the complete call at this link:
https://www.univr.it/it/concorsi/assegnisti-di-ricerca/assegni-di-ricerca/0…
Best wishes,
----------------------------------------
Valeria Franceschi
Temporary Assistant Professor - English Language and Translation (L-LIN/12)
Department of Foreign Languages and Literatures
University of Verona
We are very happy to announce the seventeenth release of annotated
treebanks in Universal Dependencies, v2.11, available at
http://universaldependencies.org/.
Universal Dependencies is a project that seeks to develop
cross-linguistically consistent treebank annotation for many languages
with the goal of facilitating multilingual parser development,
cross-lingual learning, and parsing research from a language typology
perspective (de Marneffe et al., 2021; Nivre et al., 2020). The
annotation scheme is based on (universal) Stanford dependencies (de
Marneffe et al., 2006, 2008, 2014), Google universal part-of-speech tags
(Petrov et al., 2012), and the Interset interlingua for morphosyntactic
tagsets (Zeman, 2008). The general philosophy is to provide a universal
inventory of categories and guidelines to facilitate consistent
annotation of similar constructions across languages, while allowing
language-specific extensions when necessary.
The *243* treebanks in v2.11 are annotated according to version 2 of the
UD guidelines and represent the following *138 languages:* Abaza,
Afrikaans, Akkadian, Akuntsu, Albanian, Amharic, Ancient Greek, Ancient
Hebrew, Apurina, Arabic, Armenian, Assyrian, Bambara, Basque, Beja,
Belarusian, Bengali, Bhojpuri, Breton, Bulgarian, Buryat, Cantonese,
Catalan, Cebuano, Chinese, Chukchi, Classical Chinese, Coptic, Croatian,
Czech, Danish, Dutch, English, Erzya, Estonian, Faroese, Finnish,
French, Frisian Dutch, Galician, German, Gheg, Gothic, Greek, Guajajara,
Guarani, Hebrew, Hindi, Hindi English, Hittite, Hungarian, Icelandic,
Indonesian, Irish, Italian, Japanese, Javanese, Kaapor, Kangri,
Karelian, Karo, Kazakh, Khunsari, Kiche, Komi Permyak, Komi Zyrian,
Korean, Kurmanji, Latin, Latvian, Ligurian, Lithuanian, Livvi, Low
Saxon, Madi, Makurap, Malayalam, Maltese, Manx, Marathi, Mbya Guarani,
Moksha, Munduruku, Naija, Nayini, Neapolitan, Nheengatu, North Sami,
Norwegian, Old Church Slavonic, Old East Slavic, Old French, Old
Turkish, Persian, Polish, Pomak, Portuguese, Romanian, Russian,
Sanskrit, Scottish Gaelic, Serbian, Sinhala, Skolt Sami, Slovak,
Slovenian, Soi, South Levantine Arabic, Spanish, Swedish, Swedish Sign
Language, Swiss German, Tagalog, Tamil, Tatar, Teko, Telugu, Thai,
Tupinamba, Turkish, Turkish German, Ukrainian, Umbrian, Upper Sorbian,
Urdu, Uyghur, Vietnamese, Warlpiri, Welsh, Western Armenian, Western
Sierra Puebla Nahuatl, Wolof, Xavante, Xibe, Yakut, Yoruba, Yupik and
Zaar. The 138 languages belong to *29 families:* Afro-Asiatic, Arawakan,
Arawan, Austro-Asiatic, Austronesian, Basque, Chukotko-Kamchatkan, Code
switching, Creole, Dravidian, Eskimo-Aleut, Indo-European, Japanese,
Korean, Macro-Je, Mande, Mayan, Mongolic, Niger-Congo, Northwest
Caucasian, Pama-Nyungan, Sign Language, Sino-Tibetan, Tai-Kadai,
Tungusic, Tupian, Turkic, Uralic and Uto-Aztecan. Depending on the
language, the treebanks range in size from less than 1,000 tokens to
over 3 million tokens. We expect the next release to be available in May
2023.
The size of the following 28 treebanks changed significantly since the
last release:
Abaza ATB : 0 → 652
Akuntsu TuDeT : 1074 → 1324
Apurina UFPA : 776 → 865
Chinese PatentChar : 0 → 2160
Erzya JR : 17412 → 20541
Estonian EWT : 78331 → 90694
French ParisStories : 30004 → 42865
Gheg GPS : 0 → 15990
Icelandic GC : 0 → 99611
Icelandic Modern : 158150 → 80395
Irish Cadhan : 0 → 3804
Italian ParlaMint : 0 → 20460
Low Saxon LSDC : 2547 → 2935
Makurap TuDeT : 146 → 178
Malayalam UFAL : 0 → 202
Nheengatu CompLin : 0 → 2146
Old East Slavic RNC : 35606 → 48647
Old East Slavic Ruthenian : 0 → 3069
Portuguese CINTIL : 0 → 475860
Portuguese PetroGold : 0 → 250605
Sinhala STB : 0 → 880
Tatar NMCTT : 1458 → 2280
Teko TuDeT : 242 → 1375
Umbrian IKUVINA : 602 → 786
Western Sierra Puebla Nahuatl ITML: 0 → 10120
Xavante XDT : 0 → 120
Yakut YKTDT : 496 → 1403
Zaar Autogramm : 0 → 7625
Daniel Zeman, Joakim Nivre, Mitchell Abrams, Elia Ackermann, Noëmi
Aepli, Hamid Aghaei, Željko Agić, Amir Ahmadi, Lars Ahrenberg, Chika
Kennedy Ajede, Salih Furkan Akkurt, Gabrielė Aleksandravičiūtė, Ika
Alfina, Avner Algom, Chiara Alzetta, Erik Andersen, Lene Antonsen, Katya
Aplonova, Angelina Aquino, Carolina Aragon, Glyd Aranes, Maria Jesus
Aranzabe, Bilge Nas Arıcan, Þórunn Arnardóttir, Gashaw Arutie, Jessica
Naraiswari Arwidarasti, Masayuki Asahara, Katla Ásgeirsdóttir, Deniz
Baran Aslan, Cengiz Asmazoğlu, Luma Ateyah, Furkan Atmaca, Mohammed
Attia, Aitziber Atutxa, Liesbeth Augustinus, Elena Badmaeva, Keerthana
Balasubramani, Miguel Ballesteros, Esha Banerjee, Sebastian Bank,
Verginica Barbu Mititelu, Starkaður Barkarson, Rodolfo Basile, Victoria
Basmov, Colin Batchelor, John Bauer, Seyyit Talha Bedir, Juan Belieni,
Kepa Bengoetxea, Yifat Ben Moshe, Gözde Berk, Yevgeni Berzak, Irshad
Ahmad Bhat, Riyaz Ahmad Bhat, Erica Biagetti, Eckhard Bick, Agnė
Bielinskienė, Kristín Bjarnadóttir, Rogier Blokland, Victoria Bobicev,
Loïc Boizou, Emanuel Borges Völker, Carl Börstell, Cristina Bosco, Gosse
Bouma, Sam Bowman, Adriane Boyd, Anouck Braggaar, Kristina Brokaitė,
Aljoscha Burchardt, Marie Candito, Bernard Caron, Gauthier Caron, Lauren
Cassidy, Maria Clara Castro, Tatiana Cavalcanti, Gülşen Cebiroğlu
Eryiğit, Flavio Massimiliano Cecchini, Giuseppe G. A. Celano, Slavomír
Čéplö, Neslihan Cesur, Savas Cetin, Özlem Çetinoğlu, Fabricio Chalub,
Liyanage Chamila, Shweta Chauhan, Ethan Chi, Taishi Chika, Yongseok Cho,
Jinho Choi, Jayeol Chun, Juyeon Chung, Alessandra T. Cignarella, Silvie
Cinková, Aurélie Collomb, Çağrı Çöltekin, Miriam Connor, Daniela
Corbetta, Marine Courtin, Mihaela Cristescu, Philemon Daniel, Elizabeth
Davidson, Leonel Figueiredo de Alencar, Mathieu Dehouck, Martina de
Laurentiis, Marie-Catherine de Marneffe, Valeria de Paiva, Mehmet Oguz
Derin, Elvis de Souza, Arantza Diaz de Ilarraza, Carly Dickerson,
Arawinda Dinakaramani, Elisa Di Nuovo, Bamba Dione, Peter Dirix, Kaja
Dobrovoljc, Timothy Dozat, Kira Droganova, Puneet Dwivedi, Christian
Ebert, Hanne Eckhoff, Sandra Eiche, Marhaba Eli, Ali Elkahky, Binyam
Ephrem, Olga Erina, Tomaž Erjavec, Aline Etienne, Wograine Evelyn,
Sidney Facundes, Richárd Farkas, Federica Favero, Jannatul Ferdaousi,
Marília Fernanda, Hector Fernandez Alcalde, Jennifer Foster, Cláudia
Freitas, Kazunori Fujita, Katarína Gajdošová, Daniel Galbraith, Federica
Gamba, Marcos Garcia, Moa Gärdenfors, Sebastian Garza, Fabrício Ferraz
Gerardi, Kim Gerdes, Filip Ginter, Gustavo Godoy, Iakes Goenaga, Koldo
Gojenola, Memduh Gökırmak, Yoav Goldberg, Xavier Gómez Guinovart, Berta
González Saavedra, Bernadeta Griciūtė, Matias Grioni, Loïc Grobol,
Normunds Grūzītis, Bruno Guillaume, Céline Guillot-Barbance, Tunga
Güngör, Nizar Habash, Hinrik Hafsteinsson, Jan Hajič, Jan Hajič jr.,
Mika Hämäläinen, Linh Hà Mỹ, Na-Rae Han, Muhammad Yudistira Hanifmuti,
Takahiro Harada, Sam Hardwick, Kim Harris, Dag Haug, Johannes Heinecke,
Oliver Hellwig, Felix Hennig, Barbora Hladká, Jaroslava Hlaváčová,
Florinel Hociung, Petter Hohle, Marivel Huerta Mendez, Jena Hwang,
Takumi Ikeda, Anton Karl Ingason, Radu Ion, Elena Irimia, Ọlájídé
Ishola, Artan Islamaj, Kaoru Ito, Siratun Jannat, Tomáš Jelínek, Apoorva
Jha, Katharine Jiang, Anders Johannsen, Hildur Jónsdóttir, Fredrik
Jørgensen, Markus Juutinen, Hüner Kaşıkara, Andre Kaasen, Nadezhda
Kabaeva, Sylvain Kahane, Hiroshi Kanayama, Jenna Kanerva, Neslihan Kara,
Ritván Karahóǧa, Boris Katz, Tolga Kayadelen, Sarveswaran
Kengatharaiyer, Jessica Kenney, Václava Kettnerová, Jesse Kirchner,
Elena Klementieva, Elena Klyachko, Arne Köhn, Abdullatif Köksal, Kamil
Kopacewicz, Timo Korkiakangas, Mehmet Köse, Alexey Koshevoy, Natalia
Kotsyba, Jolanta Kovalevskaitė, Simon Krek, Parameswari Krishnamurthy,
Sandra Kübler, Adrian Kuqi, Oğuzhan Kuyrukçu, Aslı Kuzgun, Sookyoung
Kwak, Veronika Laippala, Lucia Lam, Lorenzo Lambertino, Tatiana Lando,
Septina Dian Larasati, Alexei Lavrentiev, John Lee, Phương Lê Hồng,
Alessandro Lenci, Saran Lertpradit, Herman Leung, Maria Levina, Cheuk
Ying Li, Josie Li, Keying Li, Yixuan Li, Yuan Li, KyungTae Lim, Bruna
Lima Padovani, Krister Lindén, Nikola Ljubešić, Olga Loginova, Stefano
Lusito, Andry Luthfi, Mikko Luukko, Olga Lyashevskaya, Teresa Lynn,
Vivien Macketanz, Menel Mahamdi, Jean Maillard, Ilya Makarchuk, Aibek
Makazhanov, Michael Mandl, Christopher Manning, Ruli Manurung, Büşra
Marşan, Cătălina Mărănduc, David Mareček, Katrin Marheinecke, Stella
Markantonatou, Héctor Martínez Alonso, Lorena Martín Rodríguez, André
Martins, Jan Mašek, Hiroshi Matsuda, Yuji Matsumoto, Alessandro Mazzei,
Ryan McDonald, Sarah McGuinness, Gustavo Mendonça, Tatiana Merzhevich,
Niko Miekka, Karina Mischenkova, Margarita Misirpashayeva, Anna Missilä,
Cătălin Mititelu, Maria Mitrofan, Yusuke Miyao, AmirHossein Mojiri
Foroushani, Judit Molnár, Amirsaeid Moloodi, Simonetta Montemagni, Amir
More, Laura Moreno Romero, Giovanni Moretti, Keiko Sophie Mori, Shinsuke
Mori, Tomohiko Morioka, Shigeki Moro, Bjartur Mortensen, Bohdan
Moskalevskyi, Kadri Muischnek, Robert Munro, Yugo Murawaki, Kaili
Müürisep, Pinkey Nainwani, Mariam Nakhlé, Juan Ignacio Navarro
Horñiacek, Anna Nedoluzhko, Gunta Nešpore-Bērzkalne, Manuela Nevaci,
Lương Nguyễn Thị, Huyền Nguyễn Thị Minh, Yoshihiro Nikaido, Vitaly
Nikolaev, Rattima Nitisaroj, Alireza Nourian, Hanna Nurmi, Stina Ojala,
Atul Kr. Ojha, Hulda Óladóttir, Adédayọ̀ Olúòkun, Mai Omura, Emeka
Onwuegbuzia, Noam Ordan, Petya Osenova, Robert Östling, Lilja Øvrelid,
Şaziye Betül Özateş, Betül Özateş, Merve Özçelik, Arzucan Özgür, Balkız
Öztürk Başaran, Teresa Paccosi, Alessio Palmero Aprosio, Anastasia
Panova, Hyunji Hayley Park, Niko Partanen, Elena Pascual, Marco
Passarotti, Agnieszka Patejuk, Guilherme Paulino-Passos, Giulia
Pedonese, Angelika Peljak-Łapińska, Siyao Peng, Cenel-Augusto Perez,
Natalia Perkova, Guy Perrier, Slav Petrov, Daria Petrova, Andrea
Peverelli, Jason Phelan, Jussi Piitulainen, Rodrigo Pintucci, Tommi A
Pirinen, Emily Pitler, Magdalena Plamada, Barbara Plank, Thierry
Poibeau, Larisa Ponomareva, Martin Popel, Lauma Pretkalniņa, Sophie
Prévost, Prokopis Prokopidis, Adam Przepiórkowski, Robert Pugh, Tiina
Puolakainen, Sampo Pyysalo, Peng Qi, Andriela Rääbis, Alexandre
Rademaker, Mizanur Rahoman, Taraka Rama, Loganathan Ramasamy, Carlos
Ramisch, Fam Rashel, Mohammad Sadegh Rasooli, Vinit Ravishankar, Livy
Real, Petru Rebeja, Siva Reddy, Mathilde Regnault, Georg Rehm, Ivan
Riabov, Michael Rießler, Erika Rimkutė, Larissa Rinaldi, Laura Rituma,
Putri Rizqiyah, Luisa Rocha, Eiríkur Rögnvaldsson, Ivan Roksandic,
Mykhailo Romanenko, Rudolf Rosa, Valentin Roșca, Davide Rovati, Ben
Rozonoyer, Olga Rudina, Jack Rueter, Kristján Rúnarsson, Shoval Sadde,
Pegah Safari, Benoît Sagot, Aleksi Sahala, Shadi Saleh, Alessio
Salomoni, Tanja Samardžić, Stephanie Samson, Manuela Sanguinetti, Ezgi
Sanıyar, Dage Särg, Marta Sartor, Mitsuya Sasaki, Baiba Saulīte, Yanin
Sawanakunanon, Shefali Saxena, Kevin Scannell, Salvatore Scarlata,
Nathan Schneider, Sebastian Schuster, Lane Schwartz, Djamé Seddah,
Wolfgang Seeker, Mojgan Seraji, Syeda Shahzadi, Mo Shen, Atsuko Shimada,
Hiroyuki Shirasu, Yana Shishkina, Muh Shohibussirri, Maria Shvedova,
Janine Siewert, Einar Freyr Sigurðsson, João Ricardo Silva, Aline
Silveira, Natalia Silveira, Maria Simi, Radu Simionescu, Katalin Simkó,
Mária Šimková, Haukur Barri Símonarson, Kiril Simov, Dmitri Sitchinava,
Maria Skachedubova, Aaron Smith, Isabela Soares-Bastos, Barbara
Sonnenhauser, Shafi Sourov, Carolyn Spadine, Rachele Sprugnoli, Vivian
Stamou, Steinþór Steingrímsson, Antonio Stella, Abishek Stephen, Milan
Straka, Emmett Strickland, Jana Strnadová, Alane Suhr, Yogi Lesmana
Sulestio, Umut Sulubacak, Shingo Suzuki, Daniel Swanson, Zsolt Szántó,
Chihiro Taguchi, Dima Taji, Yuta Takahashi, Fabio Tamburini, Mary Ann C.
Tan, Takaaki Tanaka, Dipta Tanaya, Mirko Tavoni, Samson Tella, Isabelle
Tellier, Marinella Testori, Guillaume Thomas, Sara Tonelli, Liisi Torga,
Marsida Toska, Trond Trosterud, Anna Trukhina, Reut Tsarfaty, Utku Türk,
Francis Tyers, Sveinbjörn Þórðarson, Vilhjálmur Þorsteinsson, Sumire
Uematsu, Roman Untilov, Zdeňka Urešová, Larraitz Uria, Hans Uszkoreit,
Andrius Utka, Elena Vagnoni, Sowmya Vajjala, Rob van der Goot, Martine
Vanhove, Daniel van Niekerk, Gertjan van Noord, Viktor Varga, Uliana
Vedenina, Giulia Venturi, Eric Villemonte de la Clergerie, Veronika
Vincze, Natalia Vlasova, Aya Wakasa, Joel C. Wallenberg, Lars Wallin,
Abigail Walsh, Jing Xian Wang, Jonathan North Washington, Maximilan
Wendt, Paul Widmer, Shira Wigderson, Sri Hartati Wijono, Vanessa
Berwanger Wille, Seyi Williams, Mats Wirén, Christian Wittern, Tsegay
Woldemariam, Tak-sum Wong, Alina Wróblewska, Mary Yako, Kayo Yamashita,
Naoki Yamazaki, Chunxiao Yan, Koichi Yasuoka, Marat M. Yavrumyan, Arife
Betül Yenice, Olcay Taner Yıldız, Zhuoran Yu, Arlisa Yuliawati, Zdeněk
Žabokrtský, Shorouq Zahra, Amir Zeldes, He Zhou, Hanzhi Zhu, Anna
Zhuravleva, Rayan Ziane
References
Marie-Catherine de Marneffe, Christopher Manning, Joakim Nivre, Daniel
Zeman. 2021. Universal Dependencies. In Computational Linguistics 47:2,
pp. 255–308.
Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Jan Hajič,
Christopher D. Manning, Sampo Pyysalo, Sebastian Schuster, Francis
Tyers, Daniel Zeman. 2020. Universal Dependencies v2: An Evergrowing
Multilingual Treebank Collection. In Proceedings of LREC.
--------------------------------------------------------------------------------
Marie-Catherine de Marneffe, Bill MacCartney, and Christopher D.
Manning. 2006. Generating typed dependency parses from phrase structure
parses. In Proceedings of LREC.
Marie-Catherine de Marneffe and Christopher D. Manning. 2008. The
Stanford typed dependencies representation. In COLING Workshop on
Cross-framework and Cross-domain Parser Evaluation.
Marie-Catherine de Marneffe, Timothy Dozat, Natalia Silveira, Katri
Haverinen, Filip Ginter, Joakim Nivre, and Christopher Manning. 2014.
Universal Stanford Dependencies: A cross-linguistic typology. In
Proceedings of LREC.
Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Yoav Goldberg,
Jan Hajič, Christopher D. Manning, Ryan McDonald, Slav Petrov, Sampo
Pyysalo, Natalia Silveira, Reut Tsarfaty, Daniel Zeman. 2016. Universal
Dependencies v1: A Multilingual Treebank Collection. In Proceedings of LREC.
Slav Petrov, Dipanjan Das, and Ryan McDonald. 2012. A universal
part-of-speech tagset. In Proceedings of LREC.
Daniel Zeman. 2008. Reusable Tagset Conversion Using Tagset Drivers. In
Proceedings of LREC.