- Corpora - ELRA lists

Two-year postdoctoral position - An engaging discussion robot for mental health preservation
by Christophe Lohr 10 Jul '24

10 Jul '24

Hello, The Interaction Lab-STICC Department is seeking candidates to fill a 2-year postdoctoral position within a project about conversational agents for the preservation of mental health of socially isolated people. The job description is available here: "DiscoBot - An engaging discussion robot for mental health preservation" https://labsticc.fr/mediacenter/uploads/job-profile-discobot_may2024.pdf Regards, Christophe IMT Atlantique <https://www.imt-atlantique.fr> *Christophe LOHR* Enseignant Chercheur 02 29 00 14 05 Technopôle Brest-Iroise CS 83818 29238 Brest Cedex 3 Une école de l'IMT <https://www.imt.fr>

1 0

Second Call for Participation - CoLI-Dravidian@FIRE 2024: Word-level Code-Mixed Language Identification in Dravidian Languages
by Sabur B 09 Jul '24

09 Jul '24

****We apologize for multiple postings of this e-mail**** CALL FOR PARTICIPATION FIRE 2024 Task - CoLI-Dravidian: Word-level Code-Mixed Language Identification in Dravidian Languages Held as a shared task in the 16th meeting of Forum for Information Retrieval Evaluation (FIRE 2024 <http://fire.irsi.org.in/fire/2024/home>) December 12-15, 2024. DAIICT, Gandhinagar, India Website: https://sites.google.com/view/coli-dravidian-2024/datasets?authuser=0 Codalab link: https://codalab.lisn.upsaclay.fr/competitions/19357 Dear All, We are inviting researchers and students to participate in the shared task CoLI-Dravidian: Word-level Code-Mixed Language Identification in Dravidian Languages, which is held as a shared task in the 16th meeting of Forum for Information Retrieval Evaluation (FIRE 2024 <http://fire.irsi.org.in/fire/2024/home>). Language Identification (LI) involves detecting the language(s) used in a given text, which is a preliminary step for many applications such as sentiment analysis, machine translation, information retrieval, and natural language understanding. In multilingual India, especially among the youth, social media often features code-mixed text, blending local languages with English at various levels. However, this poses significant challenges for LI, particularly when languages are mixed within a single word. Dravidian languages, extensively spoken in southern India, are under-resourced despite their rich morphological structure. These languages face technological challenges, especially in script representation on digital platforms, leading users to prefer Roman or hybrid scripts for communication. This prevalent code-mixing offers vast linguistic data for research yet remains understudied. To address word-level LI challenges in code-mixed Dravidian languages, we are conducting a shared task by providing code-mixed datasets for four languages - Kannada, Tamil, Malayalam, and Tulu, to encourage the development of advanced LI models. There will be a real-time leaderboard, and the participants will be allowed to make a maximum of 10 submissions in the training phase and 5 submissions in the testing phase through CodaLab. Each team will have to select the best submission for ranking. To download the data and participate, go to: https://codalab.lisn.upsaclay.fr/competitions/19357. Best regards, The CoLI-Dravidian 2024 Organizing Committee Important dates - 14th June 2024 - open track websites and training data release - 1st July 2024– test data release - 25th July – run submission deadline - 27th July – results declared - 27th August – Working notes due - 10th September - Reviews - 30th October – Camera-ready copies of working notes NOTE: All dates mentioned here are in the AoE (Anywhere on Earth) zone. Organizing Committee - Shashirekha Hosahalli Lakshmaiah, Department of Computer Science, Mangalore University, India. - Ameeta Agrawal, Department of Computer Science, Portland State University, USA. - Fazlourrahman Balouchzahi, CIC, IPN, Mexico. - Asha Hegde, Department of Computer Science, Mangalore University, India. - Sabur Butt, IFE, Tecnologico de Monterrey, Mexico. - Sharal Coelho, Department of Computer Science, Mangalore University, India. - Kavya G, Department of Computer Science, Mangalore University, India. - Harshitha, Department of Computer Science, Mangalore University, India. - Sonith D, Department of Computer Science, Mangalore University, India. *Sabur Butt, Ph.D. *(He/Him) Institute for the Future of Education (IFE) *Tecnológico de Monterrey, Mexico* Address: Av. Eugenio Garza Sada 2501 Sur Tecnológico, 64849 Monterrey, N.L. LinkedIn <https://www.linkedin.com/in/saburb> - GitHub <https://github.com/saburbutt> - Scholar <https://scholar.google.com/citations?user=re7md-0AAAAJ&hl=en> - Website <https://saburbutt.github.io/>

1 0

Genuine loan offers apply now
by Mohammed Aameen 08 Jul '24

08 Jul '24

Do you need finance to start up your own business or expand your business, Do you need funds to pay off your debt? We give out loans to interested individuals and companies who are seeking loans with good faith. Are you seriously in need of an urgent loan? then contact us for more information today. Email: aabidullahaameen62(a)gmail.com WhatsApp: +919233593652

1 0

LOAN OFFER APPLY FOR MORE INFO
by Mohammed Aameen 08 Jul '24

08 Jul '24

I know of a genuine loan company that can grant you a loan without any problem or delay with a very low interest rate of 3% on your loan request. Kindly write to the below contact info. Email: aabidullahaameen62(a)gmail.com WhatsApp: +919233593652

1 0

Jan Svartvik in memoriam
by Carita Paradis 07 Jul '24

07 Jul '24

It is with great sadness that we announce the death of Emeritus Professor Jan Svartvik, Lund University, Sweden. Jan was born on 18 August 1931 in the county of Värmland in Sweden and died in Lund on 18 June 2024. He studied at Uppsala and University College London (UCL) and became Professor of English Language at Lund University in 1970, a chair he held for 25 years till his retirement. Jan is well-known for his early and innovative development of machine-readable corpora in collaboration with the Survey of English Usage at UCL, in particular the world’s first spoken corpus, London–Lund Corpus of spoken British English, launched in the mid-1970s. He wrote many books and articles, both single-authored publications and publications in collaboration with colleagues. Best-known of them all is perhaps the impressive Comprehensive Grammar of the English Language (1985), which he co-authored with Randolph Quirk, Sidney Greenbaum and Geoffrey Leech. This grammar book is still one of the standard reference grammars of English. He was also known for working with Geoffrey Leech on A Communicative Grammar of English (third edition, 2002), English: One Tongue, Many Voices (2006). Jan played a leading role in the foundation of ICAME, the International Computer Archive of Medieval and Modern English, which he co-founded in 1977. ICAME grew into an important international organisation with a focus on the computational analysis of the English language. It has an annual conference which attracts scholars from all over the world. https://icame.info/history-the-beginnings/, Jan’s autobiography is here http://www.ucl.ac.uk/english-usage/about/svartvik.htm Carita Paradis Bas Aarts -- Professor Carita Paradis, PhD, MAE Centre for Languages and Literature Lund University Box 201, SE-221 00 Lund Website:http://www.sol.lu.se/en/person/CaritaParadis [1664519590158]<https://projekt.ht.lu.se/llc2> When you send emails to Lund University, we process your personal data in accordance with existing legislation. To find out more about the processing of your personal data, visit the Lund University website at: https://www.lunduniversity.lu.se/about/contact-us/processing-of-personal-da… P Please consider the environment before printing this e-mail

1 0

CFP - SIMBig 2024 in Peru - Artificial Intelligence (Dates Correction)
by Juan Antonio Lossio-Ventura 03 Jul '24

03 Jul '24

> [Apologies for cross-posting] > Correction of conference date from previous announcement (Nov. instead of Oct.) > ========================================================================== > FIRST CALL FOR PAPERS - SIMBig 2024 > ========================================================================== > > 11th International Conference on Information Management and Big Data - SIMBig 2024 > Where: Universidad Nacional de Moquegua, Ilo, PERU > When: November 20 - 22, 2024 > Website: https://simbig.org/SIMBig2024/ > > ========================================================================== > > OVERVIEW > ---------------------------------- > > SIMBig 2024 seeks to present new methods of Artificial Intelligence (AI), Data Science, Machine Learning, Natural Language Processing, Semantic Web, and related fields, for analyzing, managing, and extracting insights and patterns from large volumes of data. > > > KEYNOTE SPEAKERS > ---------------------------------- > Aaron Courville, Université de Montréal, Canada Huan Liu, Arizona State University, USA > > and more to be announced soon... > > IMPORTANT DATES > ---------------------------------- > > August 10, 2024 --> Full papers and short papers due > September 30, 2024 --> Notification of acceptance > October 28, 2024 --> Camera-ready versions > November 20-22, 2024 --> Conference held in Moquegua, Peru > > PUBLICATION > ---------------------------------- > > All accepted papers of SIMBig 2024 (tracks including) will be published with Springer CCIS Series <https://www.springer.com/series/7899>. > > > TOPICS OF INTEREST > ---------------------------------- > > SIMBig 2024 has a broad scope. We invite contributions on theory and practice, including but not limited to the following technical areas: > > Artificial Intelligence > Big/Masive Data > Data Science > Machine Learning > Deep Learning > Natural Language Processing > Semantic Web > Data-driven Software Engineering > Healthcare Informatics > Biomedical Informatics > Data Privacy and Security > Information Retrieval > Ontologies and Knowledge Representation > Social Networks and Social Web > Information Visualization > > CONTACT > ---------------------------------- > > SIMBig 2024 General Chairs > > Juan Antonio Lossio-Ventura, National Institutes of Health, USA (juan.lossio(a)nih.gov <mailto:juan.lossio@nih.gov>) > Hugo Alatrista-Salas, Léonard de Vinci Pôle Universitaire Research Center, Paris, France (hugo.alatrista_salas(a)devinci.fr <mailto:halatrista@pucp.pe>)

1 0

Call for Papers for the 22nd Annual Workshop of the Australasian Language Technology Association is now open
by Kathy Reid 03 Jul '24

03 Jul '24

Dear Colleagues, We're delighted to announce that the CfP for the 22nd Annual Workshop of the Australasian Language Technology Association - ALTA 2024 - is now open and closes on 20th September (23:59hrs Anywhere on Earth UTC -12) Details are available on our website at https://alta2024.alta.asn.au/calls/papers and a summary follows. --- Important Dates * Submission deadline for short/long papers, presentation abstracts and industry demonstrations: 20 September 2024 (23:59 Anywhere On Earth UTC-12). * Main conference: 3 December and 4 December 2024, ANU, Canberra, ACT, hybrid (in person and online) Overview The 22nd Annual Workshop of the Australasian Language Technology Association (ALTA) will be held in a hybrid format at the Australian National University, Canberra, from 2 December to 4 December 2024. The ALTA 2024 workshop is the key local forum for socialising research results in Natural Language Processing (NLP) and Computational Linguistics (CL). It will feature presentations, posters, and demonstrations from students, industry, and academic researchers. Like previous years, we also encourage submissions and participation from industry and government researchers and developers. Note that ALTA is listed in the CORE 2023 Conference Rankings as Australasian C<https://www.core.edu.au/conference-portal>. Topics ALTA invites the submission of papers and presentations on all aspects of NLP and CL, including, but not limited to: * Commonsense Reasoning. * Computational Social Science and Cultural Analytics. * Dialogue and Interactive Systems. * Discourse and Pragmatics. * Efficient Methods for NLP. * Ethics in NLP. * Information Extraction. * Information Retrieval and Text Mining. * Interpretability, Interactivity and Analysis of Models for NLP. * Language Grounding to Vision, Robotics and Beyond. * Language Modeling and Analysis of Language Models. * Linguistic Theories, Cognitive Modeling and Psycholinguistics. * Machine Learning for NLP. * Machine Translation. * Multilinguality and Linguistic Diversity. * Natural Language Generation. * NLP Applications. * Phonology, Morphology and Word Segmentation. * Question Answering. * Resources and Evaluation. * Semantics: Lexical, Sentence level, Document Level, Textual Inference, etc. * Sentiment Analysis, Stylistic Analysis, and Argument Mining. * Speech and Multimodality. * Summarisation. * Syntax, Parsing and their Applications. We particularly encourage submissions that broaden the scope of our community by considering practical applications of language technology and multidisciplinary research. We also specifically encourage submissions from the industry. Format and instructions for authors Please refer to our CfP webpage for specifics.<https://alta2024.alta.asn.au/calls/papers> We are using OpenReview for submissions, and invite submissions of three different formats: (1) Original Research Papers, (2) Abstract-based Presentations, and (3) Industry Demonstrations. --- You can follow ALTA on social media at the following links: * LinkedIn (page): https://www.linkedin.com/company/australasian-language-technology-associati… * LinkedIn (group):https://www.linkedin.com/groups/1849979/ * Twitter: https://twitter.com/altanlp * Mastodon: https://sigmoid.social/@ALTAnlp * Hashtag is #ALTA2024 With kind regards, on behalf of the ALTA 2024 Team: Dr Gabriela Ferraro, General Chair Professor Tim Baldwin, Program Chair Dr Sergio José Rodríguez Méndez, Program Chair Dr Nicholas Kuo, Program Chair Dr Anton Malko, Publication Chair Dr Dawei Chen, Technology Chair A/Prof Shunichi Ishihara, Finance Chair Charbel El-Khaissi, PhD candidate, Sponsorship Chair Ned Cooper, PhD candidate, Local Chair Kathy Reid, PhD candidate, Publicity Chair

1 0

CFP - SIMBig 2024 in Peru - Artificial Intelligence
by Juan Antonio Lossio-Ventura 01 Jul '24

01 Jul '24

> [Apologies for cross-posting] > ========================================================================== > FIRST CALL FOR PAPERS - SIMBig 2024 > ========================================================================== > > 11th International Conference on Information Management and Big Data - SIMBig 2024 > Where: Universidad Nacional de Moquegua, Ilo, PERU > When: October 20 - 22, 2024 > Website: https://simbig.org/SIMBig2024/ > > ========================================================================== > > OVERVIEW > ---------------------------------- > > SIMBig 2024 seeks to present new methods of Artificial Intelligence (AI), Data Science, Machine Learning, Natural Language Processing, Semantic Web, and related fields, for analyzing, managing, and extracting insights and patterns from large volumes of data. > > > KEYNOTE SPEAKERS > ---------------------------------- > Aaron Courville, Université de Montréal, Canada Huan Liu, Arizona State University, USA > > and more to be announced soon... > > IMPORTANT DATES > ---------------------------------- > > August 10, 2024 --> Full papers and short papers due > September 30, 2024 --> Notification of acceptance > October 28, 2024 --> Camera-ready versions > November 20-22, 2024 --> Conference held in Moquegua, Peru > > PUBLICATION > ---------------------------------- > > All accepted papers of SIMBig 2024 (tracks including) will be published with Springer CCIS Series <https://www.springer.com/series/7899>. > > > TOPICS OF INTEREST > ---------------------------------- > > SIMBig 2024 has a broad scope. We invite contributions on theory and practice, including but not limited to the following technical areas: > > Artificial Intelligence > Big/Masive Data > Data Science > Machine Learning > Deep Learning > Natural Language Processing > Semantic Web > Data-driven Software Engineering > Healthcare Informatics > Biomedical Informatics > Data Privacy and Security > Information Retrieval > Ontologies and Knowledge Representation > Social Networks and Social Web > Information Visualization > > CONTACT > ---------------------------------- > > SIMBig 2024 General Chairs > > Juan Antonio Lossio-Ventura, National Institutes of Health, USA (juan.lossio(a)nih.gov <mailto:juan.lossio@nih.gov>) > Hugo Alatrista-Salas, Léonard de Vinci Pôle Universitaire Research Center, Paris, France (hugo.alatrista_salas(a)devinci.fr <mailto:halatrista@pucp.pe>)

1 0

Call for Participation - CoLI-Dravidian@FIRE 2024: Word-level Code-Mixed Language Identification in Dravidian Languages
by Sabur B 29 Jun '24

29 Jun '24

****We apologize for multiple postings of this e-mail**** CALL FOR PARTICIPATION FIRE 2024 Task - CoLI-Dravidian: Word-level Code-Mixed Language Identification in Dravidian Languages Held as a shared task in the 16th meeting of Forum for Information Retrieval Evaluation (FIRE 2024 <http://fire.irsi.org.in/fire/2024/home>) December 12-15, 2024. DAIICT, Gandhinagar, India Website: https://sites.google.com/view/coli-dravidian-2024/datasets?authuser=0 Codalab link: https://codalab.lisn.upsaclay.fr/competitions/19357 Dear All, We are inviting researchers and students to participate in the shared task CoLI-Dravidian: Word-level Code-Mixed Language Identification in Dravidian Languages, which is held as a shared task in the 16th meeting of Forum for Information Retrieval Evaluation (FIRE 2024 <http://fire.irsi.org.in/fire/2024/home>). Language Identification (LI) involves detecting the language(s) used in a given text, which is a preliminary step for many applications such as sentiment analysis, machine translation, information retrieval, and natural language understanding. In multilingual India, especially among the youth, social media often features code-mixed text, blending local languages with English at various levels. However, this poses significant challenges for LI, particularly when languages are mixed within a single word. Dravidian languages, extensively spoken in southern India, are under-resourced despite their rich morphological structure. These languages face technological challenges, especially in script representation on digital platforms, leading users to prefer Roman or hybrid scripts for communication. This prevalent code-mixing offers vast linguistic data for research yet remains understudied. To address word-level LI challenges in code-mixed Dravidian languages, we are conducting a shared task by providing code-mixed datasets for four languages - Kannada, Tamil, Malayalam, and Tulu, to encourage the development of advanced LI models. There will be a real-time leaderboard, and the participants will be allowed to make a maximum of 10 submissions in the training phase and 5 submissions in the testing phase through CodaLab. Each team will have to select the best submission for ranking. To download the data and participate, go to: https://codalab.lisn.upsaclay.fr/competitions/19357. Best regards, The CoLI-Dravidian 2024 Organizing Committee Important dates - 14th June 2024 - open track websites and training data release - 1st July 2024– test data release - 25th July – run submission deadline - 27th July – results declared - 27th August – Working notes due - 10th September - Reviews - 30th October – Camera-ready copies of working notes NOTE: All dates mentioned here are in the AoE (Anywhere on Earth) zone. Organizing Committee - Shashirekha Hosahalli Lakshmaiah, Department of Computer Science, Mangalore University, India. - Ameeta Agrawal, Department of Computer Science, Portland State University, USA. - Fazlourrahman Balouchzahi, CIC, IPN, Mexico. - Asha Hegde, Department of Computer Science, Mangalore University, India. - Sabur Butt, IFE, Tecnologico de Monterrey, Mexico. - Sharal Coelho, Department of Computer Science, Mangalore University, India. - Kavya G, Department of Computer Science, Mangalore University, India. - Harshitha, Department of Computer Science, Mangalore University, India. - Sonith D, Department of Computer Science, Mangalore University, India. *Sabur Butt, Ph.D. *(He/Him) Institute for the Future of Education (IFE) *Tecnológico de Monterrey, Mexico* Address: Av. Eugenio Garza Sada 2501 Sur Tecnológico, 64849 Monterrey, N.L. LinkedIn <https://www.linkedin.com/in/saburb> - GitHub <https://github.com/saburbutt> - Scholar <https://scholar.google.com/citations?user=re7md-0AAAAJ&hl=en> - Website <https://saburbutt.github.io/>

1 0

AthensNLP Summer School: applications deadline 1st JULY
by Georgios Moschovis 28 Jun '24

28 Jun '24

Call for Participation EXTENDED to Monday 1 July for the Athens Natural Language Processing Summer School! Applications: https://athnlp.github.io/2024/cfp.html Summer School dates: 19-15 September 2024 in Athens.

1 0

2026

2025

2024

2023

2022

Corpora