- Corpora - ELRA lists

Multiple Teaching Track Positions at the University of Maryland (UMD) in College Park, Maryland USA
by jbg＠umiacs.umd.edu 27 Jan '24

Dear Colleagues,
We have eight openings for professional teaching faculty at the University of Maryland, at all levels of seniority. The minimum requirement is an MS degree (although a PhD is a plus), and one of the degrees needs to be in CS or a related field (computational linguistics, information science, etc. all count). All areas are needed, including computational linguistics and data science (and I'd particularly want to see those kinds of applications!). You'd be teaching courses at all levels of the curriculum, from introductory courses to courses around your research specialty, as well as supervising undergraduate research or collaborating with the faculty at the University of Maryland. We're located just outside Washington, DC, an exceedingly international city.
Please consider applying here or forwarding to your colleagues: https://ejobs.umd.edu/postings/116061
The best consideration date is 02/03/2024.
Best,
Jordan

Second Call for Papers: RaPID-5@LREC-COLING 2024
by Dimitrios Kokkinakis 27 Jan '24

Second Call for Papers: The 5th workshop on "Resources and ProcessIng of linguistic, para-linguistic and extra-linguistic Data from people with various forms of cognitive/psychiatric/developmental impairments"
Workshop: co-located with LREC-COLING 2024 | Turin, Italy | May 21st, 2024
RaPID-5 serves as an interdisciplinary platform for researchers to exchange insights, methods, and experiences related to collecting and processing data from individuals with mental, cognitive, neuropsychiatric, or neurodegenerative impairments. The workshop focuses on creating, processing, and applying such data resources from individuals at different stages and severity levels of these impairments. The ultimate goal of RaPID-5 is to facilitate the study of relationships among linguistic, paralinguistic, and extra-linguistic observations, with applications ranging from aiding diagnosis to enhancing monitoring and predicting individuals at higher risk, ultimately promoting multidisciplinary collaboration across clinical, language technology, computational linguistics, and computer science communities.
Submission deadline: Sun., 17th of March, 2024 (anywhere on earth - new date!)
Paper submission: https://softconf.com/lrec-coling2024/rapid2024/
Website and more details: https://spraakbanken.gu.se/en/rapid-2024
Contact: Dimitrios Kokkinakis
Contact email: dimitrios.kokkinakis(a)gu.se
Invited Speakers:
* Dr. Alexandra König, BSc MSc PhD, Institut national de recherche en informatique et en automatique (INRIA); Cobtek (Cognition; Behaviour; Technology) Lab; University Côte d'Azur, France
* Prof. Maria Liakata, EPSRC/UKRI Turing Institute AI fellow, Queen Mary University of London, UK
Organizing committee:
* Kathleen C. Fraser, National Research Council, Canada;
* Dimitrios Kokkinakis, University of Gothenburg, Sweden;
* Kristina Lundholm Fors, Lund University, Sweden;
* Charalambos K. Themistocleous, University of Oslo, Norway;
* Athanasios Tsanas, The University of Edinburgh, UK;
* Fredrik Öhman, University of Gothenburg and Sahlgrenska University Hospital, Sweden

CAiSE'24 Forum: Third Call for Papers and Tool Demonstrations
by Announce 27 Jan '24

*** CAiSE'24 Forum: Third Call for Papers and Tool Demonstrations ***
36th International Conference on Advanced Information Systems Engineering (CAiSE'24)
June 3-7, 2024, 5* St. Raphael Resort and Marina, Limassol, Cyprus
https://cyprusconferences.org/caise2024/
(*** Submission Deadline: 4th March, 2024 AoE ***)
The CAiSE Forum is a space within the CAiSE conference to present and discuss exciting new ideas and tools related to Information Systems Engineering. The Forum intends to serve as an interactive platform, encourage potential authors to present emerging topics and controversial positions, and demonstrate innovative systems, tools, and applications. The Forum sessions at the CAiSE conference will facilitate the interaction, discussion, and exchange of ideas among presenters and participants. Contributions to the CAiSE'24 Forum are welcome to address any of the CAiSE'24 conference topics and, particularly, this year's theme: Information Systems in the Age of Artificial Intelligence.
We invite two types of submissions:
• Visionary papers present innovative research projects, which are still at a relatively early stage and do not necessarily include a full-scale validation. Visionary papers will be presented as posters in the Forum.
• Demo papers describe innovative tools and prototypes that implement the results of research efforts. The tools and prototypes will be presented as demos in the Forum, accompanied by a poster.
Both visionary papers and demo papers must not exceed 8 pages in LNCS format. See authors' guidelines at the Springer site: https://www.springer.com/gp/computer-science/lncs/conference-proceedings-gu… . Papers should be submitted in PDF format through the conference management system available at EasyChair (https://easychair.org/my/conference?conf=caise2024), selecting the Forum option. The submitted papers must be unpublished and must not be under review elsewhere.
PUBLICATION AND PRESENTATIONS Accepted papers will be published by Springer in a CAISE Forum proceedings volume within the Lecture Notes in Business Information Processing (LNBIP) series (https://www.springer.com/series/7911). Authors should consult Springer's authors guidelines and use their LaTeX or Word proceedings templates for the preparation of their papers. Springer encourages authors to include their ORCIDs in their papers. In addition, the corresponding author of each paper, acting on behalf of all of the authors of that paper, must complete and sign a Consent-to-Publish form. The corresponding author signing the copyright form should match the corresponding author marked on the paper. Once the files have been sent to Springer, changes relating to the authorship of the papers cannot be made. It is expected that at least one of the authors attends CAiSE'24, presents the poster/delivers the demo, and interacts with the Forum participants. We also envision a short oral presentation for all papers to attract participants to the posters. 
IMPORTANT DATES
• Paper Submission Deadline: 4th March, 2024 (AoE)
• Notification of Acceptance: 1st April, 2024
• Camera-ready Deadline: 8th April, 2024
• Author Registration Deadline: 8th April, 2024
FORUM CHAIRS
• Shareeful Islam, Anglia Ruskin University, United Kingdom
• Arnon Sturm, Ben-Gurion University of the Negev, Israel
FORUM COMMITTEE
• Steven Alter, University of San Francisco
• Abel Armas Cervantes, The University of Melbourne
• Giuseppe Berio, Université de Bretagne Sud and IRISA UMR 6074
• Drazen Brdjanin, University of Banja Luka
• Corentin Burnay, University of Namur
• Cinzia Cappiello, Politecnico di Milano
• Suphamit Chittayasothorn, King Mongkut's Institute of Technology Ladkrabang
• Maya Daneva, University of Twente
• Sergio de Cesare, University of Westminster
• Johannes De Smedt, KU Leuven
• Marne de Vries, University of Pretoria
• Michael Fellmann, University of Rostock
• Christophe Feltus, Luxembourg Institute of Science and Technology
• Hans-Georg Fill, University of Fribourg
• Janis Grabis, Riga Technical University
• Sergio Guerreiro, INESC-ID / Instituto Superior Técnico
• Martin Henkel, Stockholm University
• Jennifer Horkoff, Chalmers University of Technology
• Shareeful Islam, Anglia Ruskin University
• Janis Kampars, RTU
• Evangelia Kavakli, University of the Aegean
• Marite Kirikova, Riga Technical University
• Janne J. Korhonen, Aalto University
• Elena Kornyshova, CNAM
• Agnes Koschmider, University of Bayreuth
• Chung Lawrence, University of Texas at Dallas
• Henrik Leopold, Kühne Logistics University
• Tong Li, Beijing University of Technology
• Beatriz Marín, Universidad Politecnica de Valencia
• Andrea Marrella, Sapienza University of Rome
• Raimundas Matulevicius, University of Tartu
• Jose Ignacio Panach Navarrete, Universitat de València
• Oscar Pastor, Universidad Politécnica de Valencia
• Francisca Pérez, Universidad San Jorge
• Pierluigi Plebani, Politecnico di Milano
• Manuel Resinas, University of Seville
• Genaina Rodrigues, University of Brasilia
• Ben Roelens, Open Universiteit, Ghent University
• Mattia Salnitri, Politecnico di Milano
• Stefan Strecker, University of Hagen
• Arnon Sturm, Ben-Gurion University of the Negev
• Irene Vanderfeesten, Katholieke Universiteit Leuven
• Yves Wautelet, Katholieke Universiteit Leuven
• Hans Weigand, Tilburg University
• Manuel Wimmer, Johannes Kepler University Linz
• Anna Zamansky, University of Haifa

Call for Papers on Evolutionary, Typological, and Cognitive Dimensions of Word Families
by Johann-Mattis List 26 Jan '24

Dear all,
We will organize a focus stream at the International Congress of Linguists that will take place from 8 to 14 September 2024 in Poznań. Our focus stream concentrates on word families and lexical compositionality, and we invite all kinds of papers that focus on any aspect of word families that has something to do with their evolution, their typology, or their interaction with human cognition.
The deadline has been extended until 1st of February. If you are interested in submitting an abstract, please do so. Through our ERC grant, it may even be possible to provide funding for travel in limited form (on a competitive basis). If this is interesting for you, please get directly in touch with us.
Information on abstract submission can be found on the website of the conference: https://icl2024poznan.pl/
Please indicate our focus stream "Productive Signs" (number 10) if you want to submit for this event.
Sincerely,
Mattis List
--
Prof. Dr. Johann-Mattis List
Chair of Multilingual Computational Linguistics
University of Passau
Dr.-Hans-Kapfinger-Str. 16
04032 Passau
Germany
Chair Website: https://phil.uni-passau.de/multilinguale-computerlinguistik/
Personal Website: https://lingulist.de
Telephone: +49(0)851/509-3480

NLP4HR@EACL 2024: Call for Presentations of EACL Findings Papers
by Naoki Otani 26 Jan '24

We invite authors of accepted EACL Findings papers to present their work at the NLP for Human Resources (NLP4HR) workshop, scheduled for March 22nd in St. Julians, Malta. Application form: https://forms.gle/SX4fdxSnTEGjuxVq6 Deadline: February 1st 2024, 11:59 pm AoE The NLP4HR workshop is centered around various aspects of applying NLP techniques in the HR domain, including but not limited to: - Knowledge acquisition and reasoning in the HR domain - Parsing, extracting, or inferring information from HR documents - Learning representations for HR entities - Search and recommendation systems tailored to HR - Dialogue-based HR assistants - Language generation for HR purposes (e.g., generating a job description) - QA systems for HR-related queries - Fairness and bias in HR applications For more details, visit the workshop website at https://megagon.ai/nlp4hr-2024/ NLP4HR 2024 Organizing Committee - Estevam Hruschka, Megagon Labs, USA - Thom Lake, Indeed, USA - Naoki Otani, Megagon Labs, USA - Tom Mitchell, Carnegie Mellon University, USA Contact: nlp4hr-workshop(a)megagon.ai -- Naoki Otani Megagon Labs - Mountain View, CA, USA naoki(a)megagon.ai

CFP: SIGIR 2024 Industry Track (SIRIP), 14-18 July 2024, Washington D.C.
by Edgar Meij 26 Jan '24

The SIGIR Symposium on IR in Practice (SIRIP) 2024 will be held onsite in Washington D.C., USA, as part of ACM SIGIR 2024. We aim to provide an opportunity for researchers, engineers, practitioners, analysts and consumers to meet and discuss the latest and greatest Information Retrieval (IR) technologies as deployed in companies, big and small, and to be the premier forum for knowledge sharing across the boundary between academia and industry. The annual SIGIR conference is the major international forum for the presentation of new research results, and the demonstration of new systems and techniques, in the broad field of information retrieval (IR). The 47th ACM SIGIR conference will be run as an in-person conference from July 14th to 18th, 2024 in Washington D.C., USA. Important Dates for SIRIP Papers (Time zone: Anywhere on Earth (AoE)) - SIRIP Proposal abstract due: Feb 21, 2024 - SIRIP Proposal due: Feb 28, 2024 - SIRIP Notifications: April 10, 2024 - SIRIP Camera ready: April 24, 2024 - SIRIP Days: TBD, 2024 We solicit position papers, talk proposals, and panel proposals for SIRIP in the following categories: - Open problems and challenges in industry, from industry research to production - Presentations creating a connection with academia to solve interesting problems, including presentations from academics spending time in industry, or vice-versa, covering insights for other practitioners - Novel applications of IR/Recsys/NLP/Multimodal learning systems, and complex user interaction modeling in real-world situations - Innovative approaches used in deployed systems and products. We also encourage presentations from small companies, especially startups or spin-offs from either a university project or a large company. Papers discussing domain specific challenges are also welcome. - Position papers on the current and future state of IR in practice, and the role IR could play in shaping the next generation of information access systems. 
- Building IR systems with an emphasis on trust and safety: Combating misinformation spread; Building privacy preserving retrieval systems; Algorithmic responsibility & fairness. - Role of search & IR in the creator economy (e.g. short video platforms, audio platforms) & marketplaces (e.g. delivery services, hospitality industry, crowdfunding platforms, retail platforms, rentals) - System design case-studies from industry practitioners, identifying best practices and design principles for learning systems - Metrics and measurement techniques used at scale to understand performance of industrial systems. Success in achieving offline/online evaluation consistency - Best practices and successful applications in combining LLM and IR in new or existing products. Submission Presentation proposals should be 2-4 pages (excluding references) and follow the ACM format. Any appendices will be counted towards the page limit. Formatting guidelines are available at this ACM publication site (use the “sigconf” proceedings template). Please include: - Title, abstract, main body of proposal. - All author names and a short bio of the main presenter (~100 words, which will NOT count towards the page limit) - Please do NOT submit a sales pitch We also solicit panel discussion proposals in the above categories. Panel proposals should be 1-2 pages and include: - Panel title, description, proposed moderator (with a short CV), topics of discussion, and profiles of proposed panelists. - We strongly encourage a diverse slate of candidates for panelists and moderators. Proposals should be submitted electronically via Easy Chair: https://easychair.org/conferences?conf=sigir24 Presentation and Publication The presentation format of the Symposium will be decided based on submissions and interest to the wider community, and is likely to be a mix of short and long presentations as well as panels. 
A condition of acceptance is that at least one author commits to registering and attending SIRIP 2024 (in-person) to present the work. The authors of accepted proposals will be invited to submit a camera ready copy to be included in the proceedings. SIRIP Chairs - Edgar Meij (Bloomberg) - Tao Ye (Amazon) For any questions, you may contact the Chairs by emailing sigir24-sirip(a)easychair.org https://sigir-2024.github.io/call_for_SIRIP.html

3rd CfP 5th workshop on Resources for African Indigenous Language (RAIL) @ LREC-COLING
by Menno Van Zaanen 26 Jan '24

The fifth workshop on Resources for African Indigenous Language (RAIL) Colocated with LREC-COLING 2024 https://bit.ly/rail2024 New: deadline and article submission type Conference dates: 20-25 May 2024 Workshop date: 25 May 2024 Venue: Lingotto Conference Centre, Torino (Italy) The fifth RAIL workshop website: https://bit.ly/rail2024 LREC-COLING 2024 website: https://lrec-coling-2024.org/ Submission website: https://softconf.com/lrec-coling2024/rail2024/ The fifth Resources for African Indigenous Languages (RAIL) workshop will be co-located with LREC-COLING 2024 in Lingotto Conference Centre, Torino, Italy on 25 May 2024. The RAIL workshop is an interdisciplinary platform for researchers working on resources (data collections, tools, etc.) specifically targeted towards African indigenous languages. In particular, it aims to create the conditions for the emergence of a scientific community of practice that focuses on data, as well as computational linguistic tools specifically designed for or applied to indigenous languages found in Africa. Many African languages are under-resourced while only a few of them are somewhat better resourced. These languages often share interesting properties such as writing systems, or tone, making them different from most high-resourced languages. From a computational perspective, these languages lack enough corpora to undertake high level development of Human Language Technologies (HLT) and Natural Language Processing (NLP) tools, which in turn impedes the development of African languages in these areas. During previous workshops, it has become clear that the problems and solutions presented are not only applicable to African languages but are also relevant to many other low-resource languages. Because these languages share similar challenges, this workshop provides researchers with opportunities to work collaboratively on issues of language resource development and learn from each other. The RAIL workshop has several aims. 
First, the workshop brings together researchers who work on African indigenous languages, forming a community of practice for people working on indigenous languages. Second, the workshop aims to reveal currently unknown or unpublished existing resources (corpora, NLP tools, and applications), resulting in a better overview of the current state-of-the-art, and also allows for discussions on novel, desired resources for future research in this area. Third, it enhances sharing of knowledge on the development of low-resource languages. Finally, it enables discussions on how to improve the quality as well as availability of the resources. The workshop has “Creating resources for less-resourced languages” as its theme, but submissions on any topic related to properties of African indigenous languages (including non-African languages) may be accepted. Suggested topics include (but are not limited to) the following: * Digital representations of linguistic structures * Descriptions of corpora or other data sets of African indigenous languages * Building resources for (under resourced) African indigenous languages * Developing and using African indigenous languages in the digital age * Effectiveness of digital technologies for the development of African indigenous languages * Revealing unknown or unpublished existing resources for African indigenous languages * Developing desired resources for African indigenous languages * Improving quality, availability and accessibility of African indigenous language resources Submission requirements: We invite papers on original, unpublished work related to the topics of the workshop. Submissions, presenting completed work, may consist of up to eight (8) pages of content for a long submission and up to four (4) pages of content for a short submission plus additional pages of references. 
The final camera-ready versions of accepted long papers are allowed one additional page of content (up to 9 pages) so that reviewers’ feedback can be incorporated. Papers should be formatted according to the LREC-COLING style sheet (https://lrec-coling-2024.org/authors-kit/), which is provided on the LREC-COLING 2024 website (https://lrec-coling-2024.org/). Reviewing is double-blind, so make sure to anonymise your submission (e.g., do not provide author names, affiliations, project names, etc.). Limit the amount of self citations (anonymised citations should not be used). The RAIL workshop follows the LREC-COLING submission requirements. Please submit papers in PDF format to the START account (https://softconf.com/lrec-coling2024/rail2024/). Accepted papers will be published in proceedings linked to the LREC-COLING conference.
Important dates:
Submission deadline: 23 February 2024
Date of notification: 15 March 2024
Camera ready deadline: 29 March 2024
RAIL workshop: 25 May 2024
Organising Committee
Rooweither Mabuya, South African Centre for Digital Language Resources (SADiLaR), South Africa
Muzi Matfunjwa, South African Centre for Digital Language Resources (SADiLaR), South Africa
Mmasibidi Setaka, South African Centre for Digital Language Resources (SADiLaR), South Africa
Menno van Zaanen, South African Centre for Digital Language Resources (SADiLaR), South Africa
--
Prof Menno van Zaanen
menno.vanzaanen(a)nwu.ac.za
Professor in Digital Humanities
South African Centre for Digital Language Resources
https://www.sadilar.org

First CfP: First Workshop on Data Contamination (CONDA) @ ACL 2024
by Eneko Agirre 26 Jan '24

We invite you to participate and submit your work to the First Workshop on Data Contamination (CONDA) co-located with ACL 2024 in Bangkok, Thailand. Data contamination, where evaluation data is inadvertently included in pre-training corpora of large scale models, and language models (LMs) in particular, has become a concern in recent times. The growing scale of both models and data, coupled with massive web crawling, has led to the inclusion of segments from evaluation benchmarks in the pre-training data of LMs. The scale of internet data makes it difficult to prevent this contamination from happening, or even detect when it has happened. Crucially, when evaluation data becomes part of pre-training data, it introduces biases and can artificially inflate the performance of LMs on specific tasks or benchmarks. This poses a challenge for fair and unbiased evaluation of models, as their performance may not accurately reflect their generalization capabilities. Although a growing number of papers and state-of-the-art models mention issues of data contamination, there is no agreed-upon definition or standard methodology to ensure that a model does not report results on contaminated benchmarks. Addressing data contamination is a shared responsibility among researchers, developers, and the broader community. By adopting best practices, increasing transparency, documenting vulnerabilities, and conducting thorough evaluations, we can work towards minimizing the impact of data contamination and ensuring fair and reliable evaluations. 
We welcome paper submissions on all topics related to data contamination, including but not limited to:
* Definitions, taxonomies, and gradings of contamination
* Contamination detection (both manual and automatic)
* Community efforts to discover, report, and organize contamination events
* Documentation frameworks for datasets or models
* Methods to avoid data contamination
* Methods to forget contaminated data
* Scaling laws and contamination
* Memorization and contamination
* Policies to avoid impact of contamination in publication venues and open source communities
* Reproducing and attributing results from previous work to data contamination
* Survey work on data contamination research
* Data contamination in other modalities
*Submission Instructions*
We welcome two types of papers: regular workshop papers and non-archival submissions. Regular workshop papers will be included in the workshop proceedings. All submissions must be in PDF format and made through OpenReview <https://openreview.net/group?id=aclweb.org/ACL/2024/Workshop/CONDA>.
* *Regular workshop papers:* Authors can submit papers up to 8 pages, with unlimited pages for references. Authors may submit up to 100 MB of supplementary materials separately and their code for reproducibility. All submissions undergo a double-blind single-track review. Best Paper Award(s) will be given based on nomination by the reviewers. Accepted papers will be presented as posters with the possibility of oral presentations.
* *Non-archival submissions:* Cross-submissions are welcome. Accepted papers will be presented at the workshop, but will not be included in the workshop proceedings. Papers must be in PDF format and will be reviewed in a double-blind fashion by workshop reviewers. We also welcome extended abstracts (up to 2 pages) of papers that are work in progress, under review or to be submitted to other venues. Papers in this category need to follow the ACL format.
In addition to papers submitted directly to the workshop, which will be reviewed by our Programme Committee, we also accept papers reviewed through ACL Rolling Review and committed to the workshop. Please check the relevant dates for each type of submission.
*Important dates*
Relevant deadlines to consider when submitting your paper are:
* Paper submission deadline: May 17 (Friday), 2024
* ARR pre-reviewed commitment deadline: TBD, 2024
* Notification of acceptance: June 17 (Monday), 2024
* Camera-ready paper due: July 1 (Monday), 2024
* Workshop date: August 16, 2024
*Contact*
* *Website:* https://conda-workshop.github.io/
* *Contact:* conda-workshop(a)googlegroups.com
*Workshop organizers*
Oscar Sainz, University of the Basque Country (UPV/EHU)
Iker García Ferrero, University of the Basque Country (UPV/EHU)
Eneko Agirre, University of the Basque Country (UPV/EHU)
Jon Ander Campos, Cohere
Alon Jacovi, Bar Ilan University
Yanai Elazar, Allen Institute for Artificial Intelligence and University of Washington
Yoav Goldberg, Bar Ilan University and Allen Institute for Artificial Intelligence

Second CfP: 2nd Workshop on Resources and Technologies for Indigenous, Endangered and Lesser-resourced Languages in Eurasia (EURALI)
by Theodorus Fransen 26 Jan '24

Apologies for cross-posting *2nd Workshop on Resources and Technologies for Indigenous, Endangered and Lesser-resourced Languages in Eurasia (EURALI) @ LREC-COLING 2024* Date: 20-25 May, 2024 Venue: Lingotto Conference Centre - Torino (Italia) Main website: https://sites.google.com/view/eurali/ LREC-COLING 2024 website: https://lrec-coling-2024.org/ Submission website: https://softconf.com/lrec-coling2024/eurali2024/ —————————————————————————————————— *Workshop overview and objectives* This workshop will focus on the development of language technology resources and tools for indigenous, endangered and lesser-resourced languages on the Eurasian continent. In a media-centric world where language technology allows people to break cultural and language barriers, it is important that speakers of endangered and indigenous languages can be empowered to use this technology to continue to share their knowledge and culture with the world. With the hope of bridging this gap, the goal of this workshop is to increase visibility and promote research for lesser-resourced and under-represented languages in Europe and Asia. Through collaboration between NLP researchers, language experts and linguists working for the benefit of endangered languages in these communities, we aim to create language technology resources that will help to preserve and revive these languages for future generations. Furthermore, the workshop aims to promote the emergence of new methods that benefit linguists (e.g. automating analysis and validation processes), field linguists (facilitating data collection and analysis processes), and computational linguists (developing new techniques necessary for linguistic analysis, development of supervised or weakly supervised methods for the analysis of poorly written or undocumented languages). 
The main objective of the workshop is to create basic resources and develop tools for Eurasiatic languages, including but not limited to the following topics:
- identifying languages and variants spoken in these regions
- creation of language resources and applications, e.g. sentiment analysis, named entity recognition, and syntactic parsing
- standardization for endangered languages
- automatic identification and classification of lexical variation and language varieties
- adaptation of fundamental NLP tools for these languages, e.g., morphological analysis, taggers and parsers
- reusability of language resources in NLP applications, e.g. machine translation, and POS tagging
- machine translation between closely related languages
- evaluation of language resources and tools when applied to lesser-resourced languages in the same language families
- corpora, resources, and tools for closely related languages
- linguistic and textual similarities among languages in Eurasia
- digitalization of endangered languages
- challenges in the creation of language resources and tools from linguistic perspectives (which includes any perspective formal theory)
*Submissions*
We are seeking submissions in the following categories:
- Full papers: 8 pages+unlimited references
- Short papers (work in progress): 4 pages+unlimited references
- Posters (innovative ideas/proposals, a research idea of students): 4 pages+unlimited references
- Demo (of working online/standalone systems): 2 pages
Papers must describe original, completed or in progress, and unpublished work. Accepted full papers, short papers and posters will be included in the workshop proceedings and will be presented as an oral presentation or poster. Papers should be formatted according to the LREC-COLING style sheet (https://lrec-coling-2024.org/authors-kit/), which is provided on the LREC-COLING 2024 website (https://lrec-coling-2024.org/).
Please submit papers in PDF format to the START account (https://softconf.com/lrec-coling2024/eurali2024/). For further information on this initiative, please refer to https://sites.google.com/view/eurali/.
*Important Dates*
February 23, 2024: Paper submissions due
March 22, 2024: Paper notification of acceptance
May 25, 2024: Workshop
*Workshop Chairs*
Atul Kr. Ojha, University of Galway, Galway (Ireland)
Sina Ahmadi, George Mason University, Fairfax VA (USA)
Chao-Hong Liu, Potamu Research Ltd, Dublin (Ireland)
John P. McCrae, University of Galway, Galway (Ireland)
Theodorus Fransen, Università Cattolica del Sacro Cuore, Milan (Italy)
Silvie Cinková, Charles University, Prague (Czech Republic)
*Programme Committee (to be updated)*
Abigail Walsh, Dublin City University, Dublin (Ireland)
Aiala Rosá, Universidad de la República - Uruguay, Montevideo (Uruguay)
Aryaman Arora, Stanford University, Stanford, California (USA)
A. Seza Doğruöz, Ghent University, Ghent (Belgium)
Alina Karakanta, University of Leiden, Leiden (Netherlands)
Alina Wróblewska, Institute of Computer Science, Jana Kazimierza, Warszawa (Poland)
Akanksha Bansal, Panlingua, Delhi (India)
Atul Kr. Ojha, University of Galway, Galway (Ireland) & Panlingua, (India)
Bharathi Raja Chakravarthi, University of Galway, Galway (Ireland)
Bogdan Babych, Heidelberg University, Heidelberg (Germany)
Çağrı Çöltekin, University of Tübingen, Tübingen (Germany)
Chao-Hong Liu, Potamu Research Ltd, Dublin (Ireland)
Chihiro Taguchi, the University of Notre Dame, Notre Dame (USA)
Daan van Esch, Google, Amsterdam (Netherlands)
Daniel Zeman, Charles University, Prague (Czech Republic)
Deepak Alok, IIT-Delhi, Delhi (India)
Dorothee Beermann, Norwegian University of Science and Technology, Trøndelag (Norway)
Esha Banerjee, J.P. Morgan, Bengaluru (India)
Ekaterina Vylomova, University of Melbourne, Melbourne (Australia)
George Rehm, GmbH, Berlin (Germany)
Hiwa Asadpour, Goethe University, Frankfurt (Germany)
Jamal Abdul Nasir, University of Galway, Galway (Ireland)
Joakim Nivre, Uppsala University, (Sweden)
John P. McCrae, University of Galway, (Ireland)
John E. Ortega, New York University (USA)
Jonathan Washington, Swarthmore College, Swarthmore (USA)
Joseph Mariani, LIMSI-CNRS, Paris (France)
Kaja Dobrovoljc, University of Ljubljana, Ljubljana (Slovenia)
Khalid Choukri, ELDA/ELRA, Paris (France)
Luke D. Gessler, University of Colorado at Boulder (USA)
Maitrey Mehta, University of Utah, Utah (USA)
Marie-Catherine de Marneffe, UCLouvain, Collège Léon Dupriez (Belgium)
Olesea Caftanatov, Vladimir Andrunachievici Institute of Mathematics and Computer Science, Chişinău (Moldova)
Ranka Stanković, University of Belgrade, Belgrade (Serbia)
Rico Sennrich, University of Zurich, Zurich (Switzerland)
Ritesh Kumar, Agra University, Agra (India)
Rute Costa, the Universidade NOVA de Lisboa, Lisbon (Portugal)
Saliha Muradoglu, Australian National University, Canberra (Australia)
Sarah Moeller, University of Florida, Gainesville, FL (USA)
Silvie Cinková, Charles University, Prague (Czech Republic)
Sina Ahmadi, George Mason University, (USA)
Stella Markantonatou, Athena RC, Athens (Greece)
Sourabrata Mukherjee, Charles University, Prague (Czech Republic)
Theodorus Fransen, Università Cattolica del Sacro Cuore, Milan (Italy)
Valentin Malykh, MTS AI / ITMO University
Verginica Barbu Mititelu, Research Institute for Artificial Intelligence, Bucharest (Romania)
Victoria Bobicev, University of Moldova, Chișinău (Moldova)
Voula Giouli, Institute for Language and Speech Processing, Athens (Greece)


First Call for Participation: Shared Task: Cross-Lingual Few-Shot Relation Extraction for Pharmacovigilance (SMM4H - Task 2)
by Lisa Raithel 26 Jan '24

Apologies for cross-posting.

---------------------------------------------------------------------------

**Social Media Mining For Health 2024**
https://healthlanguageprocessing.org/smm4h-2024/

The Social Media Mining for Health (SMM4H) workshop and shared tasks have been running successfully since 2016. They now go into the 9th round, with the workshop being co-located with ACL 2024 in Bangkok.

https://2024.aclweb.org/
Bangkok, Thailand, August 12–17, 2024

**Important Dates for all SMM4H Shared Tasks**

Training data available: January 10, 2024
CodaLab available: January 17, 2024
Evaluation phase: April 17 - 24, 2024
System description paper due: May 17, 2024
Paper acceptance notification: June 17, 2024
Camera-ready papers due: July 1, 2024
Workshop in Bangkok, Thailand: August 15, 2024

**Task 2: Task Description**

Adverse Drug Events (ADEs) are negative medical side effects related to a drug. Mining ADEs from user-generated text has become a popular topic and is an important use case for research, as it could help detect crowd signals from users online. Being able to make use of information across languages by applying multi-lingual methods further supports this endeavor. Our task targets the languages *German, French and Japanese* and is split into two subtasks. Subtask 2a focuses on Named Entity Recognition (NER) of medication, disorder, and function mentions from user-generated texts. Subtask 2b performs joint NER and Relation Extraction (RE) to determine if these disorders are ADEs by finding the correct relations between medications, disorders and functions.
We distinguish two types of relations between medication mentions and disorder/function mentions:

- "caused": the disorder/function was caused by a medication, i.e., the disorder/function is an ADE
- "treatment_for": the disorder/function is the reason for the medication, i.e., the medication is supposed to treat the disorder/function

*Tasks*: Participants can take part in subtask 2a, subtask 2b, or both.

~~ We explicitly encourage the submission of new and creative approaches! ~~

- Subtask 2a) Named entity recognition of the entities "drug", "disorder" and "function" from user-generated texts.
- Subtask 2b) Joint named entity and relation extraction of the entities "drug", "disorder" and "function" and the relations "caused" and "treatment_for".

*Data*: The data originates from social media platforms, e.g., patient fora and X (Twitter). We provide data in German and Japanese, and a few examples in French. The submitted systems will be evaluated on German, French and Japanese data.

Please find more information here: https://healthlanguageprocessing.org/smm4h-2024/
Please use this form to register: https://forms.gle/7w4si27uJrCMiTyL8

Organizers of Subtask 2:

Pierre Zweigenbaum, Université Paris-Saclay, CNRS, LISN, France
Sebastian Möller, Technische Universität Berlin, DFKI GmbH, Germany
Roland Roller, DFKI GmbH, Germany
Philippe Thomas, DFKI GmbH, Germany
Eiji Aramaki, NAIST, Japan
Shoko Wakamiya, NAIST, Japan
Shuntaro Yada, NAIST, Japan
Katherine Yeh, Université Paris-Saclay, CNRS, LISN, France
Lisa Raithel, Technische Universität Berlin, Germany & Université Paris-Saclay, CNRS, LISN, France

--
Lisa Raithel
PhD candidate at TU Berlin, BIFOLD & Université Paris-Saclay, LISN, CNRS
Guest researcher at DFKI GmbH Berlin
(she/her)

