We are delighted to invite you to ICNLSP 2024
<https://www.icnlsp.org/2024welcome/>, the 7th edition of the International
Conference on Natural Language and Speech Processing, which will be held at
the University of Trento from October 19th to 20th, 2024 (*HYBRID*).
*Topics*
- Signal processing, acoustic modeling.
- Speech recognition (Architecture, search methods, lexical modeling,
language modeling, language model adaptation, multimodal systems,
applications in education and learning, zero-resource speech recognition,
etc.).
- Speech Analysis.
- Paralinguistics in Speech and Language (Perception of paralinguistic
phenomena, analysis of speaker states and traits, etc.).
- Spoken Dialog Systems and Conversational Analysis.
- Speech Translation.
- Speech synthesis.
- Speaker verification and identification.
- Language identification.
- Speech coding.
- Speech enhancement.
- Speech intelligibility.
- Speech perception.
- Speech production.
- Brain studies on speech.
- Phonetics, phonology and prosody.
- Speech and hearing disorders.
- Paralinguistics of pathological speech and language.
- Speech technology for disordered speech/hearing.
- Cognition and natural language processing.
- Machine translation.
- Text categorization.
- Summarization.
- Sentiment analysis and opinion mining.
- Computational Social Web.
- Arabic dialects processing.
- Under-resourced languages: tools and corpora.
- Large language models.
- Arabic OCR.
- NLP tools for software requirements and engineering.
- Knowledge fundamentals.
- Knowledge management systems.
- Information extraction.
- Data mining and information retrieval.
- Lexical semantics and knowledge representation.
- Requirements engineering and NLP.
- NLP for Arabic heritage documents.
*Submission*
Papers must be submitted via:
https://cmt3.research.microsoft.com/ICNLSP2024/
Each submitted paper will be reviewed by three program committee members. The
reviewing process is double-blind. Authors can use the *ACL format*: *LaTeX
<https://www.icnlsp.org/ACL%202023%20Proceedings%20Template.zip>* or Word.
Authors have the choice to submit their paper as a long or a short
paper. Long papers consist of up to 8 pages of content + references; short
papers, up to 4 pages of content + references.
*Important dates*
*Submission deadline:* 30 June 2024, 11:59 PM (GMT)
*Notification of acceptance:* 15 September 2024
*Camera-ready paper due:* 25 September 2024
*Conference dates:* 19, 20 October 2024
*Publication*
1- All accepted papers will be published in the *ACL Anthology
<https://aclanthology.org/>*.
2- Selected papers will be published (after extension) in:
  2-a- A *SPECIAL ISSUE* of the Machine Learning and Knowledge Extraction
journal (MAKE) <https://www.mdpi.com/journal/make>, indexed in *Web of Science
<https://mjl.clarivate.com/search-results>*, *Scopus*
<https://www.scopus.com/sources.uri>, etc.
  *Special issue title*: Knowledge Graphs and Large Language Models
<https://www.mdpi.com/journal/make/special_issues/POB4VNE0QP>.
  2-b- Signals and Communication Technology (Springer), indexed in
*Scopus* <https://www.scopus.com/> and *zbMATH* <https://zbmath.org/>.
Dear all,
We are happy to invite you to participate in the Shared Task on Quality Estimation at WMT'24.
The details of the task can be found at: https://www2.statmt.org/wmt24/qe-task.html
New this year:
* We introduce a new language pair (zero-shot): English-Spanish
* Continuing from the previous edition, we will also analyse the robustness of submitted QE systems to a range of phenomena, from hallucinations and biases to localized errors, which can significantly impact real-world applications.
* We also introduce a new task, seeking not only to detect but also to correct errors: Quality-aware Automatic Post-Editing! We invite participants to submit systems capable of automatically generating QE predictions for machine-translated text and the corresponding output corrections.
2024 QE Tasks:
Task 1 -- Sentence-level quality estimation
This task follows the same format as last year but with fresh test sets and a new language pair: English-Spanish. We will test the following language pairs:
* English to German (MQM)
* English to Spanish (MQM)
* English to Hindi (MQM & DA)
* English to Gujarati (DA)
* English to Telugu (DA)
* English to Tamil (DA)
More details: https://www2.statmt.org/wmt24/qe-subtask1.html
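For teams getting started on the sentence-level task, here is a minimal, purely illustrative sketch of a reference-free baseline using the open-source CometKiwi model via the unbabel-comet package. It is not an official baseline and does not reflect the required submission format; the package API and the gated Unbabel/wmt22-cometkiwi-da checkpoint (which requires accepting its license on the Hugging Face Hub) are assumptions on our part.

```python
# Illustrative reference-free sentence-level QE baseline (NOT an official
# baseline for the shared task). Assumes `pip install unbabel-comet` and
# access to the gated Unbabel/wmt22-cometkiwi-da checkpoint.
from comet import download_model, load_from_checkpoint

model_path = download_model("Unbabel/wmt22-cometkiwi-da")
model = load_from_checkpoint(model_path)

# Each item pairs a source sentence with its machine translation; no
# reference translation is needed for quality estimation.
data = [
    {"src": "The cat sat on the mat.", "mt": "El gato se sentó en la alfombra."},
    {"src": "He signed the contract yesterday.", "mt": "Él firmó el contrato ayer."},
]

output = model.predict(data, batch_size=8, gpus=0)  # gpus=1 if a GPU is available
print(output.scores)        # one predicted quality score per segment
print(output.system_score)  # corpus-level average
```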
Task 2 -- Fine-grained error span detection
Sequence labelling task: predict the error spans in each translation and the associated error severity: Major or Minor.
We will test the following language pairs:
* English to German (MQM)
* English to Spanish (MQM)
* English to Hindi (MQM)
More details: https://www2.statmt.org/wmt24/qe-subtask2.html
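One common way to frame this task is token classification. The snippet below is a rough inference-only sketch under explicit assumptions: "your-org/qe-span-tagger" is a hypothetical checkpoint fine-tuned on the task data with an {O, MINOR, MAJOR} label scheme, and the mapping back to character offsets relies on a fast tokenizer. It is not an official baseline or the required submission format.

```python
# Rough sketch: error span detection framed as token classification.
# "your-org/qe-span-tagger" is a hypothetical checkpoint fine-tuned with
# labels {O, MINOR, MAJOR}; this is NOT an official baseline.
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

model_name = "your-org/qe-span-tagger"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForTokenClassification.from_pretrained(model_name)

mt = "Das ist eine Beispielübersetzung mit Fehlern."
enc = tokenizer(mt, return_offsets_mapping=True, return_tensors="pt", truncation=True)
offsets = enc.pop("offset_mapping")[0]  # character offsets per token

with torch.no_grad():
    pred_ids = model(**enc).logits.argmax(dim=-1)[0]

# Report the character span and severity for every token flagged as an error.
for (start, end), pred in zip(offsets.tolist(), pred_ids.tolist()):
    label = model.config.id2label[pred]
    if start != end and label != "O":  # (0, 0) offsets mark special tokens
        print(f"{label}: chars [{start}, {end}) -> {mt[start:end]!r}")
```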
Task 3 -- Quality-aware Automatic Post-editing
We expect submissions of post-edits that correct the detected error spans in the original translation. Although the task focuses on quality-informed APE, we also allow participants to submit APE output without QE predictions in order to understand the impact of their QE system. Submissions without QE predictions will also be considered official.
We will test the following language pairs:
* English to Hindi
* English to Tamil
More details: https://www2.statmt.org/wmt24/qe-subtask3.html
Important dates:
1. Test sets will be released on July 15th.
2. Participants can submit their systems by July 23rd on CodaLab.
3. System paper submissions are due by August 20th (aligned with WMT deadlines).
Note: Like last year, we aligned with the General MT and Metrics shared tasks to facilitate cross-submission on the common language pairs: English-German, English-Spanish, and English-Hindi (MQM).
We look forward to your submissions. Feel free to contact us if you have any questions!
Best wishes,
on behalf of the organisers.
The original post is here: https://www.informatik.tu-darmstadt.de/ukp/ukp_home/jobs_ukp/2021_associate…
Are you passionate about making a difference in the field of mental health through cutting-edge research in AI and Natural Language Processing? Do you have a strong background in computer science, data science, or a related field? If so, we invite you to join our dynamic and interdisciplinary team at the Technical University of Darmstadt!
Position: Full-Time Research Assistant (i.e., doctoral candidate or PhD student)
Duration: 1 October 2024 (or soon thereafter) to 31 December 2027, with the possibility of extension.
Location: Department of Computer Science, Technical University of Darmstadt
Responsibilities:
- Conduct cutting-edge research in NLP with a focus on mental health applications.
- Focus on research topics, such as NLP and knowledge discovery for mental health, large language models for clinical applications, and multimodal clinical data analysis.
- Develop and implement algorithms for analyzing therapist-patient conversations.
- Collaborate with a diverse team of researchers from TU Darmstadt and other partner institutions.
Ecosystem: We are part of DYNAMIC, the newly approved interdisciplinary LOEWE-funded center “Dynamic Network Approach of Mental Health to Stimulate Innovations for Change.” Our mission is to advance the understanding and treatment of mental health disorders using AI, NLP, and multimodal data analysis.
Team: Dr. Shaoxiong Ji (https://www.helsinki.fi/~shaoxion/) will join TU Darmstadt this fall and establish a junior independent research group focusing on foundation models and their applications in areas such as healthcare. He has a wide range of research directions, including NLP for health, multilingual LLMs, and learning methods such as federated learning, multitask learning, and meta-learning. The newly established research group will collaborate closely with the research labs led by Prof. Iryna Gurevych and Prof. Kristian Kersting, as well as with partners under the umbrella of the DYNAMIC project.
Qualifications:
- A Master’s degree in Computer Science, Data Science, AI, NLP, or a related field.
- Strong programming skills in Python or other relevant languages.
- Experience with deep learning frameworks.
- Excellent problem-solving abilities and a passion for research.
- Previous experience in clinical NLP or multimodal data analysis is a plus but not required.
- Strong communication skills and the ability to work effectively in a collaborative environment.
What We Offer:
- An exciting opportunity to contribute to impactful research in mental health.
- A supportive and collaborative research environment.
- Opportunities for professional development and growth within the DYNAMIC project and beyond.
How to Apply: If you are enthusiastic about joining our team and contributing to groundbreaking research, please submit the following documents:
- Detailed CV
- Degree certificates and transcripts for your Bachelor’s and Master’s studies
- Cover letter outlining your motivation and relevant experience
- Contact information for at least two academic or professional references
Please send your application to Shaoxiong Ji <shaoxiong.ji(a)outlook.com> by July 31st, 2024; after that date, the positions will remain open until filled. Applications will be reviewed as soon as they are submitted.
Join us in making a real-world impact on mental health through the power of AI and NLP!
Shared task on Multilingual Grammatical Error Correction (MultiGEC-2025)
We invite you to participate in the shared task on Multilingual Grammatical Error Correction, MultiGEC-2025, covering over 10 languages, including Czech, English, Estonian, German, Icelandic, Italian, Latvian, Slovene, Swedish and Ukrainian.
The results will be presented on March 5 (or 2), 2025, at the NLP4CALL workshop, co-located with the NoDaLiDa conference (https://www.nodalida-bhlt2025.eu/conference) to be held in Tallinn, Estonia, on 2--5 March 2025.
The publication venue for system descriptions will be the proceedings of the NLP4CALL workshop.
Official system evaluation will be carried out on CodaLab.
* TASK DESCRIPTION
In this shared task, your goal is to rewrite learner-written texts to make them grammatically correct, or both grammatically correct and idiomatic; that is, you may either adhere to the "minimal correction" principle or apply fluency edits.
For instance, the text
> My mother became very sad, no food. But my sister better five months later.
can be corrected minimally as
> My mother became very sad, and ate no food. But my sister felt better five months later.
or with fluency edits as
> My mother was very distressed and refused to eat. Luckily, my sister recovered five months later.
For fair evaluation of both approaches to the correction task, we will provide two evaluation metrics, one favoring minimal correction, one suited for fluency-edited output (read more under Evaluation).
We particularly encourage development of multilingual systems that can process all (or several) languages using a single model, but this is not a mandatory requirement to participate in the task.
* DATA
We provide training, development and test data for each of the languages. The training and development dataset splits will be made available through GitHub. Evaluation will be performed on a separate test set.
See website for more detailed information: https://github.com/spraakbanken/multigec-2025/
* EVALUATION
During the shared task, evaluation will be based on cross-lingually applicable automatic metrics, primarily:
- GLEU score (reference-based)
- Scribendi score (reference-free)
For comparability with previous results, we will also provide F0.5 scores.
After the shared task, we also plan on carrying out a human evaluation experiment on a subset of the submitted results.
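As a rough illustration of the reference-based metric, the snippet below computes corpus-level GLEU with NLTK over whitespace-tokenized corrections. This is only a stand-in under our assumptions (NLTK's implementation, naive tokenization); the official CodaLab evaluation may tokenize and aggregate differently.

```python
# Rough illustration of reference-based GLEU scoring with NLTK; the official
# MultiGEC-2025 evaluation may use different tooling and tokenization.
from nltk.translate.gleu_score import corpus_gleu

# One or more reference corrections per segment, plus one system hypothesis.
references = [["My mother became very sad, and ate no food."]]
hypotheses = ["My mother became very sad and ate no food."]

ref_tokens = [[ref.split() for ref in refs] for refs in references]
hyp_tokens = [hyp.split() for hyp in hypotheses]

print(f"GLEU: {corpus_gleu(ref_tokens, hyp_tokens):.4f}")
```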
* TIMELINE (preliminary)
- June 18, 2024 - first call for participation
- September 20, 2024 - second call for participation
- October 20, 2024 - third call for participation. Training and validation data released, CodaLab opens for team registrations
- October 30, 2024 - reminder. Validation server released online
- November 13, 2024 - test data released
- November 20, 2024 - system submission deadline (system output)
- November 29, 2024 - results announced
- December 20, 2024 - paper submission deadline with system descriptions
- January 20, 2025 - paper reviews sent to the authors
- February 7, 2025 - camera-ready deadline
- March 5 (or March 2), 2025 - presentations of the systems at the NLP4CALL workshop
* PUBLICATION
We encourage you to submit a paper with your system description to the NLP4CALL workshop special track. We follow the same requirements for paper submissions as the NLP4CALL workshop, i.e. we use the same template and apply the same page limit. All papers will be reviewed by the organizing committee. Upon paper publication, we encourage you to share models, code, fact sheets, extra data, etc. with the community through GitHub or other repositories.
* ORGANIZERS
- Arianna Masciolini, University of Gothenburg, Sweden
- Andrew Caines, University of Cambridge, UK
- Orphée De Clercq, Ghent University, Belgium
- Murathan Kurfali, Stockholm University, Sweden
- Ricardo Muñoz Sánchez, University of Gothenburg, Sweden
- Elena Volodina, University of Gothenburg, Sweden
- Robert Östling, Stockholm University, Sweden
* DATA PROVIDERS (more languages to come)
- Czech: Alexandr Rosen, Charles University, Prague
- English: Andrew Caines, University of Cambridge
- Estonian:
-- Mark Fishel, University of Tartu, Estonia
-- Kais Allkivi-Metsoja, Tallinn University, Estonia
-- Kristjan Suluste, Eesti Keele Instituut, Estonia
- German:
-- Torsten Zesch, Fernuniversität in Hagen, Germany
-- Andrea Horbach, Fernuniversität in Hagen, Germany
- Icelandic: Isidora Glisič, University of Iceland
- Italian: Jennifer-Carmen Frey, Eurac Research Bolzano, Italy
- Latvian:
-- Roberts Darģis, University of Latvia
-- Ilze Auzina, University of Latvia
- Slovene: Špela Arhar Holdt, University of Ljubljana, Slovenia
- Swedish: Arianna Masciolini, University of Gothenburg, Sweden
- Ukrainian:
-- Oleksiy Syvokon, Microsoft
-- Mariana Romanyshyn, Grammarly
* CONTACT
Please join the MultiGEC-2025 Google group (https://groups.google.com/g/multigec-2025) to ask questions, hold discussions and browse previously answered questions.
Join Veeva Systems, a pioneer in cloud solutions for the life sciences
industry, as a Senior/Principal Data Scientist focusing on NLP.
Your role will primarily involve developing LLM-based agents that are
specialized in searching and extracting detailed information about Key
Opinion Leaders (KOLs) in the healthcare sector.
You will craft an end-to-end human-in-the-loop pipeline to sift through a
large array of unstructured medical documents—ranging from academic
articles to clinical guidelines and meeting notes from therapeutic
committees.
You will also collaborate with over 2000 data curators and a dedicated team
of software developers and DevOps engineers to refine these models and
deploy them into production environments.
*What You'll Do*
- Apply the latest NLP technologies and trends to the platform.
- Develop LLM-based agents capable of performing function calls and utilizing tools such as browsers for enhanced data interaction and retrieval.
- Apply Reinforcement Learning from Human Feedback (RLHF) methods such as Direct Preference Optimization (DPO) and Proximal Policy Optimization (PPO) to train LLMs based on human preferences.
- Design, develop, and implement an end-to-end pipeline for extracting predefined categories of information from large-scale, unstructured data across multi-domain and multilingual settings.
- Create robust semantic search functionality that effectively answers user queries related to various aspects of the data.
- Use and develop named entity recognition, entity linking, slot filling, few-shot learning, active learning, question answering, dense passage retrieval and other statistical techniques and models for information extraction and machine reading.
- Deeply understand and analyze our data model per data source and geo-region, and interpret model decisions.
- Collaborate with data quality teams to define annotation tasks and metrics, and perform qualitative and quantitative evaluation.
- Utilize cloud infrastructure for model development, ensuring seamless collaboration with our team of software developers and DevOps engineers for efficient deployment to production.
*Requirements*
- 4+ years of experience as a data scientist (or 2+ years with a Ph.D. degree).
- Master's or Ph.D. in Computer Science, Artificial Intelligence, Computational Linguistics, or a related field.
- Strong theoretical knowledge of Natural Language Processing, Machine Learning, and Deep Learning techniques.
- Proven experience working with large language models and transformer architectures, such as GPT, BERT, or similar.
- Familiarity with large-scale data processing and analysis, preferably within the medical domain.
- Proficiency in Python and relevant NLP libraries (e.g., NLTK, SpaCy, Hugging Face Transformers).
- Experience with at least one big-data framework (e.g., Ray, Spark) and one deep learning framework (e.g., PyTorch, JAX).
- Experience working with cloud infrastructure (e.g., AWS, GCP, Azure), containerization technologies (e.g., Docker, Kubernetes), and bash scripting.
- Strong collaboration and communication skills, with the ability to work effectively in a cross-functional team.
- Comfortable in start-up environments.
- Social competence and a team-player attitude.
- High energy and ambition.
- An agile mindset.
*Application Links*
You can work remotely from anywhere in the UK, the Netherlands or Spain; you
must be a resident of one of these countries and be legally authorized to
work there without requiring Veeva’s support for visa or relocation. *If you
do not meet this condition but think you are an exceptional candidate, please
explain this in a separate note and we will consider it.*
Spain: https://jobs.lever.co/veeva/2bf92570-a680-40e8-96b0-a8629e3feac7
<https://jobs.lever.co/veeva/61dc60d9-c888-4636-836e-2a75ff9f0567>
UK: https://jobs.lever.co/veeva/f0e989b5-9d14-4f82-baaa-2fc56a76ba16
Netherlands: https://jobs.lever.co/veeva/2bf92570-a680-40e8-96b0-a8629e3feac7
--
Ehsan Khoddam
Data Science Manager - Medical NLP
Link Data Science
Veeva Systems
m +31623213197
ehsan.khoddam(a)veeva.com
[apologies if you received multiple copies of this call]
We are pleased to invite abstract submissions for session 3, "Large
Language Models," at the upcoming "1st Conference of the German AI Service
Centers (KonKIS24)" with a focus on "Advancing Secure AI in Critical
Infrastructures for Health and Energy." Please visit the main event page
https://events.gwdg.de/event/615/ for more details.
We encourage submissions that align with the conference's theme,
particularly in the following areas:
- *Pretraining Techniques for LLMs*: Exploring foundational strategies
and algorithms.
- *Testing and Evaluating LLM Fitness*: Methods for assessing
performance on well-known tasks and benchmarks.
- *Application of LLMs in Scientific Research*: Case studies and
examples of LLMs driving discovery and innovation.
- *Innovative Insights Generation*: Strategies for leveraging LLMs to
generate novel insights and accelerate research outcomes.
- *Challenges and Solutions in LLM Application*: Discussing the
practical challenges and potential solutions in scientific research.
Accepted abstracts will be featured through short presentations during the
session. The conference will take place on September 18-19 in picturesque
Göttingen. For more information, to submit an abstract, book a stand, or
register, please visit the program homepage
https://events.gwdg.de/event/615/program.
Feel free to contact me (jennifer[dot]dsouza[at]tib[dot]eu) directly with
any questions about this session.
Dear all,
We are excited to announce the 7th FEVER workshop and shared task, co-located with EMNLP 2024. The full CFP is available at https://fever.ai/workshop.html; below are some highlights:
New Shared Task: In this year’s workshop we will organise a new fact-checking shared task, AVeriTeC: A Dataset for Real-world Claim Verification with Evidence from the Web. It will consist of claims that are fact-checked using evidence from the web. For each claim, systems must return a label (Supported, Refuted, Not Enough Evidence, Conflicting Evidence/Cherry-picking) and appropriate evidence. The evidence must be retrieved from the document collection provided by the organisers or from the Web (e.g. using a search API). For more information, see our shared task page: https://fever.ai/task.html
The timeline for it is as follows:
* Training/dev data release: April 2024
* Test data release: July 10, 2024
* Shared task deadline: July 20, 2024
* Shared task submission due: August 15, 2024
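For newcomers to the task, here is a toy sketch of how a single claim might be scored against one retrieved evidence passage with an off-the-shelf NLI model and mapped onto the task's label space. It is not the official AVeriTeC baseline: the model choice (facebook/bart-large-mnli), the three-way label mapping, and the single-evidence setup are simplifying assumptions, and the Conflicting Evidence/Cherry-picking label in particular requires aggregating over multiple evidence pieces.

```python
# Toy claim-vs-evidence scoring with an off-the-shelf NLI model. This is NOT
# the official AVeriTeC baseline; the label mapping is a simplification and
# ignores Conflicting Evidence/Cherry-picking, which needs evidence aggregation.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

model_name = "facebook/bart-large-mnli"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)

claim = "The Eiffel Tower is located in Berlin."
evidence = "The Eiffel Tower is a wrought-iron lattice tower in Paris, France."

# NLI convention: the retrieved evidence is the premise, the claim the hypothesis.
inputs = tokenizer(evidence, claim, return_tensors="pt", truncation=True)
with torch.no_grad():
    probs = model(**inputs).logits.softmax(dim=-1)[0]

label_map = {
    "entailment": "Supported",
    "contradiction": "Refuted",
    "neutral": "Not Enough Evidence",
}
nli_label = model.config.id2label[int(probs.argmax())]
print(nli_label, "->", label_map[nli_label.lower()], probs.tolist())
```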
We invite long and short papers on all topics related to fact extraction and verification, including:
* Information Extraction
* Semantic Parsing
* Knowledge Base Population
* Natural Language Inference
* Textual Entailment Recognition
* Argumentation Mining
* Machine Reading and Comprehension
* Claim Validation/Fact checking
* Question Answering
* Information Retrieval and Seeking
* Theorem Proving
* Stance detection
* Adversarial learning
* Computational journalism
* Descriptions of systems for the FEVER <http://fever.ai/2018/task.html>, FEVER 2.0 <http://fever.ai/2019/task.html>, FEVEROUS <https://fever.ai/2021/task.html> and AVeriTeC <https://fever.ai/dataset/averitec.html> shared tasks
Important dates:
* Submission deadline: August 15, 2024 (ARR and non-ARR submission deadline)
* Commitment deadline: September 23, 2024
* Notification: September 27, 2024
* Camera-ready deadline: October 4, 2024
* Workshop: November 15 or 16, 2024
All deadlines are 11.59 pm UTC -12h ("anywhere on Earth").
Feel free to contact us on our Slack channel <https://join.slack.com/t/feverworkshop/shared_invite/zt-4v1hjl8w-Uf4yg~dift…> or via email: fever-organisers(a)googlegroups.com with any questions.
Looking forward to your participation!
--
The FEVER workshop organizers
Hi everyone,
Please find below a request for participation in a very short study for one of my students' bachelor theses.
Best,
Dominik
-------- Forwarded Message --------
Subject: Searching for participants for my quick study
Date: Thu, 13 Jun 2024 09:38:22 +0000
From: Wolkober, Marcel <st163937(a)stud.uni-stuttgart.de>
To: dominik.schlechtweg(a)ims.uni-stuttgart.de <dominik.schlechtweg(a)ims.uni-stuttgart.de>
Hello!
For my bachelor thesis I need participants in my quick online study.
It will take approximately 5 to 10 minutes to complete and is in English. You can use your smartphone, but it's recommended to use a PC browser.
You can access the study here: https://semantic-nlp-captcha.de/study
Everything else will be explained there. If you have trouble on mobile, activate desktop mode.
It would be of great help if you could forward this study to others. Thanks!
Best wishes,
Marcel Wolkober
Apologies for crossposting.
Call for Papers
Information Processing & Management (IPM), Elsevier
- CiteScore: 14.8
- Impact Factor: 8.6
Guest editors:
- Omar Alonso, Applied Science, Amazon, Palo Alto, California, USA.
E-mail: omralon(a)amazon.com
- Stefano Marchesin, Department of Information Engineering, University of
Padua, Padua, Italy. E-mail: stefano.marchesin(a)unipd.it
- Gianmaria Silvello, Department of Information Engineering, University
of Padua, Padua, Italy. E-mail: gianmaria.silvello(a)unipd.it
Special Issue on “Large Language Models and Data Quality for Knowledge
Graphs”
In recent years, Knowledge Graphs (KGs), encompassing millions of
relational facts, have emerged as central assets to support virtual
assistants and search and recommendations on the web. Moreover, KGs are
increasingly used by large companies and organizations to organize and
comprehend their data, with industry-scale KGs fusing data from various
sources for downstream applications. Building KGs involves data management
and artificial intelligence areas, such as data integration, cleaning,
named entity recognition and disambiguation, relation extraction, and
active learning.
However, the methods used to build these KGs rely on automated components
that are far from perfect, resulting in KGs that are highly sparse and that
incorporate several inaccuracies and wrong facts. As a result, evaluating the KG
quality plays a significant role, as it serves multiple purposes – e.g.,
gaining insights into the quality of data, triggering the refinement of the
KG construction process, and providing valuable information to downstream
applications. In this regard, the information in the KG must be correct to
ensure an engaging user experience for entity-oriented services like
virtual assistants. Despite its importance, there is little research on
data quality and evaluation for KGs at scale.
In this context, the rise of Large Language Models (LLMs) opens up
unprecedented opportunities – and challenges – to advance KG construction
and evaluation, providing an intriguing intersection between human and
machine capabilities. On the one hand, integrating LLMs within KG
construction systems could trigger the development of more context-aware
and adaptive AI systems. At the same time, however, LLMs are known to
hallucinate and can thus generate mis/disinformation, which can affect the
quality of the resulting KG. In this sense, reliability and credibility
components are of paramount importance to manage the hallucinations
produced by LLMs and avoid polluting the KG. On the other hand,
investigating how to combine LLMs and quality evaluation has excellent
potential, as shown by promising results from using LLMs to generate
relevance judgments in information retrieval.
Thus, this special issue promotes novel research on human-machine
collaboration for KG construction and evaluation, fostering the
intersection between KGs and LLMs. To this end, we encourage submissions
related to using LLMs within KG construction systems, evaluating KG
quality, and applying quality control systems to empower KG and LLM
interactions on both research- and industrial-oriented scenarios.
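As a concrete, purely illustrative example of the kind of LLM-in-the-loop KG construction with a downstream quality check that this call has in mind, consider the sketch below. The model name, the prompt, and the naive "entities must appear in the source text" guard are our own assumptions, not a method prescribed by the special issue.

```python
# Purely illustrative sketch of LLM-assisted triple extraction with a naive
# quality gate; model name, prompt and checks are assumptions, not a method
# prescribed by this call.
import json
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

PROMPT = (
    "Extract factual (subject, relation, object) triples from the text below. "
    "Answer with a JSON list of 3-element lists and nothing else.\n\nText: {text}"
)

def extract_triples(text: str) -> list:
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder; any instruction-tuned LLM would do
        messages=[{"role": "user", "content": PROMPT.format(text=text)}],
        temperature=0,
    )
    return json.loads(response.choices[0].message.content)

def quality_gate(triples, source_text):
    """Keep only well-formed triples whose subject and object literally occur
    in the source text -- a crude guard against hallucinated entities."""
    kept = []
    for triple in triples:
        if len(triple) == 3 and all(isinstance(x, str) and x for x in triple):
            subj, _, obj = triple
            if subj.lower() in source_text.lower() and obj.lower() in source_text.lower():
                kept.append(tuple(triple))
    return kept

text = "Padua is a city in the Veneto region of northern Italy."
candidates = extract_triples(text)
print(quality_gate(candidates, text))  # e.g. [("Padua", "located in", "Veneto")]
```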
Topics include but are not limited to:
- KG construction systems
- Use of LLMs for KG generation
- Efficient solutions to deploy LLMs on large-scale KGs
- Quality control systems for KG construction
- KG versioning and active learning
- Human-in-the-loop architectures
- Efficient KG quality assessment
- Quality assessment over temporal and dynamic KGs
- Redundancy and completeness issues
- Error detection and correction mechanisms
- Benchmarks and Evaluation
- Domain-specific applications and challenges
- Maintenance of industry-scale KGs
- LLM validation via reliable/credible KG data
Submission guidelines:
Authors are invited to submit original and unpublished papers. All
submissions will be peer-reviewed and judged on originality, significance,
quality, and relevance to the special issue topics of interest. Submitted
papers should not have appeared in or be under consideration for another
journal.
Papers can be submitted up to 1 September 2024. The estimated publication
date for the special issue is 15 January 2025.
Paper submission via the IP&M electronic submission system:
https://www.editorialmanager.com/IPM
To submit your manuscript to the special issue, please choose the article
type:
"VSI: LLMs and Data Quality for KGs".
More info here:
https://www.sciencedirect.com/journal/information-processing-and-management…
Instructions for authors:
https://www.sciencedirect.com/journal/information-processing-and-management…
Important dates:
- Submissions close: 1 September 2024
- Publication date (estimated): 15 January 2025
References:
Weikum G., Dong X.L., Razniewski S., et al. (2021) Machine knowledge:
creation and curation of comprehensive knowledge bases. Found. Trends
Databases, 10, 108–490.
Hogan A., Blomqvist E., Cochez M. et al. (2021) Knowledge graphs. ACM
Comput. Surv., 54, 71:1–71:37.
B. Xue and L. Zou. 2023. Knowledge Graph Quality Management: A
Comprehensive Survey. IEEE Trans. Knowl. Data Eng. 35, 5 (2023), 4969 – 4988
G. Faggioli, L. Dietz, C. L. A. Clarke, G. Demartini, M. Hagen, C. Hauff,
N. Kando, E. Kanoulas, M. Potthast, B. Stein, and H. Wachsmuth. 2023.
Perspectives on Large Language Models for Relevance Judgment. In Proc. of
the 2023 ACM SIGIR International Conference on Theory of Information
Retrieval, ICTIR 2023, Taipei, Taiwan, 23 July 2023. ACM, 39 – 50.
S. MacAvaney and L. Soldaini. 2023. One-Shot Labeling for Automatic
Relevance Estimation. In Proc. of the 46th International ACM SIGIR
Conference on Research and Development in Information Retrieval, SIGIR
2023, Taipei, Taiwan, July 23-27, 2023. ACM, 2230 – 2235.
X. L. Dong. 2023. Generations of Knowledge Graphs: The Crazy Ideas and the
Business Impact. Proc. VLDB Endow. 16, 12 (2023), 4130 – 4137.
S. Pan, L. Luo, Y. Wang, C. Chen, J. Wang, and X. Wu. 2023. Unifying Large
Language Models and Knowledge Graphs: A Roadmap. CoRR abs/2306.08302 (2023).
--
Stefano Marchesin, PhD
Assistant Professor (RTD/a)
Information Management Systems (IMS) Group
Department of Information Engineering
University of Padua
Via Gradenigo 6/a, 35131 Padua, Italy
Home page: http://www.dei.unipd.it/~marches1/
For the full text: https://nllpw.org/workshop/call/
Following the success of the first five editions of the NLLP workshop (NAACL 2019, KDD 2020, EMNLP 2021, EMNLP 2022, EMNLP 2023), we aim to bring researchers and practitioners from NLP, machine learning and other artificial intelligence disciplines together with legal practitioners and researchers. We welcome submissions describing original work on legal data, as well as data with legal relevance, such as:
Applications of NLP to legal tasks including, but not limited to:
Legal Citation Resolution
Case Outcome Analysis and Prediction
Models of Legal Reasoning
E-Discovery
Lexical and other Data Resources for the Legal Domain
Bias and Privacy
Applications of Large Language Models (LLMs) to Legal Data and Tasks
Experimental results using and adapting NLP methods for legal data including, but not limited to:
Classification
Information Retrieval
Anomaly Detection
Clustering
Knowledge Base Population
Multimedia Search
Link Analysis
Entity Recognition and Disambiguation
Training and Using Embeddings
Parsing
Dialogue and Discourse Analysis
Text Summarization and Generation
Relation and Event Extraction
Anaphora Resolution
Question Answering
Query Understanding
Combining Text with Structured Data
Tasks:
Description of new legal tasks for NLP
Structured overviews of a specific task with the goal of identifying new areas for research
Position papers presenting new visions, challenges and changes to existing research practices
Resources:
Creation of curated and/or annotated data sets that can be publicly released and used by the community to advance the field
Demos:
Descriptions of systems which use NLP technologies for legal text
Industrial Research:
Industrial applications
Papers describing research on proprietary data
Interdisciplinary position papers:
Legal or socio-legal analyses relating to the role NLP can play in the legal field
Critical reflections on the legality and ethics of data collection and processing practices
Critical reflections about the benefits and challenges of Large Language Models (LLMs) from a legal and regulatory perspective
Submission
------------------------------------
We accept papers reporting original (unpublished) research of two types:
Long papers (max 8 pages + references)
Short papers (max 4 pages + references)
Appendices and acknowledgements do not count against the maximum page limit and should be formatted according to the guidelines below.
To submit a paper, please access the submission link https://softconf.com/emnlp2024/nllp/
Conference proceedings will be published on the ACL Anthology.
Shared Task
Together with Darrow.ai, we are organizing the LegalLens shared task. More information is provided here: https://www.codabench.org/competitions/3052/
Participants will be invited to describe their system in a paper for the NLLP workshop proceedings. The task organizers will write an overview paper that describes the task, summarizes the different approaches taken, and analyzes their results.
More information on the submission of description papers will follow.
Ethics section
The NLLP workshop adheres to the same standards regarding ethics as the EMNLP 2024 conference. Authors will be allowed extra space after the 8th page (4th for short papers) for an optional broader impact statement or other discussion of ethics. Note that an ethical considerations section is not required, but papers working with sensitive data or on sensitive tasks that do not discuss these issues will not be accepted.
Non-archival option
The authors have the option of submitting previously unpublished research as non-archival, meaning that only the abstract will be published in the conference proceedings. We expect these submissions to describe the same quality of work as archival submissions, and they will be reviewed following the same procedure as archival submissions. This option accommodates publication of the work, or a superset of it, at a later date in a conference or journal that does not allow previously archived work, and it encourages presentation of and feedback on mature yet unpublished work. Non-archival submissions should adhere to the same formatting and length constraints as archival submissions.
Dual Submission and Pre-print Policy
Papers that have been or will be submitted to workshops, conferences or journals during the review period must indicate so at submission time. Authors of papers accepted for presentation at the NLLP 2024 workshop must notify the organizers by the camera-ready deadline as to whether the paper will be presented or withdrawn.
If the preliminary version of a paper was posted in arXiv, the authors should NOT mention it as their own paper in the submission. Papers that violate the double-blind review requirements will be desk rejected.
Exception: Submissions with the non-archival option are excepted from these requirements.
ACL Rolling Review Submissions
Our workshop also welcomes submissions from ACL Rolling Review (ARR). Authors of any papers that have been submitted to ARR and have their meta-review ready may submit their papers and reviews for consideration for the workshop until 27 September 2024. This includes submissions to ARR for the 15 August deadline. Decisions will be announced by 8 October 2024. The commitment should be made via the workshop submission website: https://softconf.com/emnlp2024/nllp/ ("ACL Rolling Review Commitment" submission type)
EMNLP 2024 Submissions
Authors of any papers that were reviewed for EMNLP 2024 and rejected have the opportunity to submit their paper and reviews to be considered for publication in the NLLP workshop proceedings. The deadline for submitting papers and reviews is 27 September 2024. Decisions will be announced by 8 October 2024. The submission should be made via the workshop submission website: https://softconf.com/emnlp2024/nllp/ ("EMNLP 2024 Submission with reviews" submission type)
Double-Blind reviewing
The review process is double-blind. Submitted papers must not include author names and affiliations and they must be written in a way so that they do not break the double-blind reviewing process. If the preliminary version of a paper was posted in arXiv, the authors should NOT mention it as their own paper in the submission. Papers that violate the double-blind review requirements will be desk rejected.
Submission Style & Format Guidelines
Paper submissions must use the official ACL style templates, which are available here (LaTeX and Word). Please follow the general paper formatting guidelines for "*ACL" conferences, available here.
Authors may not modify these style files or use templates designed for other conferences. Submissions that do not conform to the required styles, including paper size, margin width, and font size restrictions, will be rejected without review.
All long, short and theme papers must follow the ACL Author Guidelines.
Important deadlines
Submission deadline ― 3 September 2024
Submission of EMNLP papers with reviews and ARR commitment ― 27 September 2024
Notification for direct submissions, ARR and EMNLP papers ― 8 October 2024
Camera ready due ― 15 October 2024 (tentative)
Workshop ― 15 or 16 November 2024
All deadlines are 11.59 pm UTC-12h ("anywhere on Earth")
Presentation
Presentation format for each paper and schedule will be announced between acceptance notification and the camera-ready deadline.
At least one author of each accepted paper must register for the NLLP 2024 workshop by the registration deadline in order for the submission to be published in the proceedings.