- Corpora - ELRA lists

3-year PhD position in Computational Models of Semantic Memory and its Acquisition (Inria and University of Lille, France)
by Pascal Denis 13 May '25

Hello,

Could you please distribute the following job offer? Thanks.

Best,
Pascal

-------------------------------------------------------------------------------------

3-year PhD position in Computational Models of Semantic Memory and its Acquisition (Inria and University of Lille, France)

We invite applications for a 3-year PhD position at the University of Lille in the context of the recently funded research project "COMANCHE" (Computational Models of Lexical Meaning and Change). The position is funded by Inria, the French national research institute in Computer Science and Applied Mathematics.

COMANCHE proposes to transfer and adapt neural word embedding algorithms to model the acquisition and evolution of word meaning, by comparing them with linguistic theories of language acquisition and language evolution. At the intersection of Natural Language Processing, psycholinguistics and historical linguistics, this project intends to validate or revise some of these theories, while also developing computational models that are less data-hungry and less computationally intensive because they exploit new inductive biases inspired by these disciplines.

The first strand of the project, on which the successful candidate will work, focuses on the development of computational models of semantic memory and its acquisition. Two main research directions will be pursued. On the one hand, we will compare the structural properties of different semantic spaces derived from word embedding algorithms to those found in human semantic memory, as reflected in behavioral data (such as typicality norms) as well as brain imaging data. The latter data will then be used as additional supervision to inject more hierarchical structure into the learned semantic spaces. On the other hand, we intend to experiment with training regimes for word embedding algorithms that are closer to those of humans when they acquire language, controlling the quantity as well as the linguistic complexity of the inputs fed to the learning algorithms through the use of longitudinal and child-directed speech corpora (e.g., CHILDES, Colaje). In both cases, English and French data will be considered.

The successful candidate holds a Master's degree in computational linguistics, computer science or cognitive science and has prior experience with word embedding models. Furthermore, the candidate has strong programming skills and expertise in machine learning approaches, and is eager to work across languages.

The position is affiliated with the MAGNET team at Inria, Lille [1] as well as with the SCALAB group at University of Lille [2], in an effort to strengthen collaborations between these two groups and ultimately foster cross-fertilization between Natural Language Processing and Psycholinguistics.

Applications will be considered until the position is filled. However, you are encouraged to apply early, as we shall start processing the applications as and when they are received. Applications, written in English or French, should include a brief cover letter with research interests and vision, a CV (including your contact address, work experience, publications), and contact information for at least 2 referees. Applications (and questions) should be sent to Angèle Brunellière (angele.brunelliere(a)univ-lille.fr) and Pascal Denis (pascal.denis(a)inria.fr).

The starting date of the position is 1 October 2022 or soon thereafter, for a total of 3 full years.

Best regards,
Angèle Brunellière and Pascal Denis

[1] https://team.inria.fr/magnet/
[2] https://scalab.univ-lille.fr/

--
Pascal
----
For independent, transparent and rigorous evaluation! I support the Inria Evaluation Committee.
----
+++++++++++++++++++++++++++++++++++++++++++++++
Pascal Denis
Equipe MAGNET, INRIA Lille Nord Europe
Bâtiment B, Avenue Heloïse
Parc scientifique de la Haute Borne
59650 Villeneuve d'Ascq
Tel: ++33 3 59 35 87 24
Url: http://researchers.lille.inria.fr/~pdenis/
+++++++++++++++++++++++++++++++++++++++++++++++

Call for papers: Computational Forensic Linguistics: Law, Language and Evidence in the Virtual Worlds
by Rui Sousa Silva 12 May '25

***APOLOGIES FOR CROSS-POSTINGS — PLEASE FEEL FREE TO CIRCULATE*** CALL FOR PAPERS - INTERNATIONAL JOURNAL FOR THE SEMIOTICS OF LAW SPECIAL ISSUE: Computational Forensic Linguistics: Law, Language and Evidence in the Virtual Worlds Volume 39 (2026) Guest Editor – Rui Sousa-Silva (University of Porto – Faculty of Arts and Humanities & Centre for Linguistics of the University of Porto) Forensic linguistics, the branch of linguistics applied to forensic contexts, is inherently multidisciplinary, although it predominantly stands at the intersection of language and the law. Despite its status as a young discipline, forensic linguistics is wide in scope, and has significantly contributed to a fair and just administration of Justice, especially since the late 1990s, across its three different areas: the written language of the law, interaction in legal contexts, and language as evidence (May, Sousa-Silva & Coulthard, ‘Introduction’, The Routledge Handbook of Forensic Linguistics, Routledge, 2021). The discipline is thus profoundly semiotic in nature, and has a significant impact on how law is interpreted and administered, and on how investigative processes are conducted. However, as a young discipline, it still faces methodological and technical challenges. Forensic linguistics is often questioned as a science, for instance owing to the fact that forensic linguists can hardly, if ever, establish the known error rate, which is often demanded to meet the legal requirements (e.g., Daubert criteria) in some jurisdictions. It is also frequently distrusted as a science, and particularly as a forensic science, on the grounds that it is subjective. Furthermore, the perception that anyone – and hence legal professionals, including judges and counsellors – can “understand” the semiotics of language use and analyse language has often led the courts into believing that forensic linguists are dispensable. 
To counter these misconceptions about the field, forensic linguists have constantly researched and devised new methods, including statistical and computational approaches, to reduce the “subjectivity effect”. Computational approaches, specifically, will play a core role in forensic linguistics: they not only allow forensic linguists to analyse large amounts of data quickly and systematically, but also enable the reproducibility of the forensic linguistic analysis, which can be essential in forensic sciences. This will be especially the case in the near future, with progress in the metaverse, as the seamless interaction between users via technology-mediated communication in the virtual worlds will raise even more issues that can only be resolved with the assistance of rigorous and transparent forensic linguistic analyses. Computational forensic linguistics, understood as the use of efficient and effective computational linguistics tools, methods and techniques in forensic contexts, will thus play an increasingly core role in forensic linguistics across the areas of written language of the law, interaction in legal contexts and, especially, language as evidence. Original proposals that explore the relationship between the semiotics of law and one or more computational approaches to forensic linguistics are therefore invited for the special issue “Computational Forensic Linguistics: Law, Language, Evidence and Rigour in the Virtual Worlds”. Submissions may range from (but are not limited to) systems to help the courts interpret and draft just and fair decisions to software and tools to assist law enforcement agencies in the fight against crime, including platforms to support investigators in the collection and analysis of evidence. Manuscripts should establish a clear connection between the semiotics of law, computational approaches and forensic linguistics/language and the law. Submissions should be addressed to: Rui Sousa-Silva (rssilva(a)letras.up.pt).
- Abstracts of 300 words (maximum) by 15 May 2025.
- After selection, final papers (15,000 words maximum, including endnotes and references) should be submitted by 15 November 2025.

Further information: https://link.springer.com/collections/ebcefecdcf

Rui Sousa Silva
Faculdade de Letras, Universidade do Porto
Faculty of Arts and Humanities, University of Porto
www.linguisticaforense.pt | https://s.up.pt/qjur | http://tinyurl.com/37w2ec6x

Latest publication: ‘We Attempted to Deliver Your Package’: Forensic Translation in the Fight Against Cross-Border Cybercrime

CONFIDENTIALITY WARNING: This message and its attachments are confidential and exclusively addressed to the recipients above. Should you not be one of the recipients, I kindly ask you not to make use of its contents and delete the message and its attachments. Please reply to this e-mail to warn me about this incident. Thank you.

[Jobs] Assistant Professorship with Tenure Track at TU Wien (Deadline May 22, 2025)
by Pia Pachinger 12 May '25

Job: Tenure-Track Assistant Professor of Natural Language Processing at the TU Wien and Complexity Science Hub in Vienna, Austria
by Allan Hanbury 12 May '25

The application deadline is approaching!

Tenure-track assistant professor of Natural Language Processing, jointly at the TU Wien Faculty of Informatics and the Complexity Science Hub in Vienna, Austria.

Details and application: https://jobs.tuwien.ac.at/Job/248962
Application Deadline: 22nd May 2025

A great opportunity to do basic research and work on interesting problems and data from government administration and industry. The working language at both the TU Wien Faculty of Informatics and the Complexity Science Hub is English.

Complexity Science Hub - https://csh.ac.at/
Data Science Research Unit, TU Wien - https://informatics.tuwien.ac.at/orgs/e194-04
Living in Vienna - https://informatics.tuwien.ac.at/living-in-vienna/

--
Allan Hanbury
Professor of Data Intelligence
Head of the Data Science Research Unit, Institute of Information Systems Engineering
Faculty Representative for Financial Affairs and Internationalization, Faculty of Informatics
TU Wien (Vienna University of Technology)
Favoritenstrasse 9-11/194-04
1040 Vienna, Austria
+43 1 58801 188310

Post-doc position in computational linguistics at Univ. Lorraine (France)
by Mathieu Constant 11 May '25

The research unit ATILF (Computer Processing and Analysis of the French Language) offers a postdoctoral position in computational linguistics.

Topic: multiword expressions in large language models
Location: ATILF, Nancy, France (Univ. Lorraine and CNRS)
Starting date: September 2025
Duration: 12 months (with the possibility of a one-year extension)
Supervisors: Mathieu Constant (Univ. Lorraine, France) and Patrick Watrin (UC Louvain, Belgium)
Salary: depends on experience and salary grids (from 3000 to 4200 euros before tax)
Application deadline: June 1st, 2025

Subject. The term "multiword expression" (MWE) refers to a combination of multiple lexical items that displays irregular composition, possibly on different linguistic levels (morphology, syntax, semantics, …). MWEs cover a large variety of phenomena such as idioms (run around in circles), support verb constructions (take a walk), nominal compounds (dry run), and complex function units (in spite of). They have been the subject of extensive research in the NLP community over the last 50 years. The goal of this post-doc position is to investigate to what extent large language models encode multiword expressions and their various levels of idiomaticity and fixedness. In particular, the hired post-doc will develop methods to extract linguistic features about multiword expressions in context from large language models. The methods will be tested on French and will be used to provide aids for French L2 learners when they encounter MWE occurrences in authentic texts.

Context. The position is part of the STAR-FLE project (STrategic Adaptations for better Reading and Text Comprehension in FFL, https://www.starfle.fr/en, 2024-2027) funded by the French National Research Agency (ANR). The project aims to propose innovative digital solutions in the area of Natural Language Processing (NLP) that may improve text comprehension for French L2 learners and assist teachers in managing multiple levels of learners. In particular, it will propose context-based aids for understanding lexical issues as well as MWEs found in authentic texts. The hired researcher will be fully integrated into the project team.

Requirements. Applicants should hold a PhD in natural language processing, computational linguistics, computer science, or applied mathematics. The hired post-doc researcher should have the following skills:
* expertise in deep learning for NLP, notably large language models
* excellent programming skills
* good linguistic skills
* good knowledge of French would be a plus
* team spirit

Application. Applicants should submit a cover letter, a CV including their publications, and a list of references for recommendation on the following official web site: https://emploi.cnrs.fr/Offres/CDD/UMR7118-SABMAR-022/Default.aspx?lang=EN. Applications should be submitted no later than June 1st, 2025.

For more information, contact Mathieu Constant (Mathieu.Constant(a)univ-lorraine.fr)

Remote AI Research Internship in India – Apply Now!
by HR Team 11 May '25

Hello Everyone,

🎯 Join us for an exciting 6-month remote internship @ CortexTor Labs (https://cortextor.com/), where you’ll work on innovative projects at the forefront of AI. CortexTor Labs is an agile startup, focused on building adaptive AI-driven systems that address real-world challenges through innovation and rapid iteration.

As an intern, you’ll explore a range of impactful topics, including:
1) Text-to-Image and Video Generation
2) Visual Storytelling
3) Talking Face Generation
4) Prompt-Based Media Generation and Editing
5) Open-Vocabulary Detection and Segmentation
6) Multimodal Logical and Mathematical Reasoning
7) Video Summarization
8) Multimodal RAG-based QA Chatbot
9) Multimodal Knowledge Graph
10) Knowledge Distillation (Teacher-Student Models)

💰 Paid Internship Track (For B.Tech & M.Tech Students – Full Project Involvement)
- The first 3 months are unpaid, focusing on onboarding, training, and initial contributions.
- The next 3 months offer a stipend of ₹12,400/month, aligned with M.Tech internship standards.
- Top performers may receive up to ₹15,000–₹20,000/month.
- After 6 months, high-performing interns may be offered a full-time position.

📚 Unpaid Research Track (For PhD Scholars & Research-Focused B.Tech/M.Tech Students)
- This is a remote, unpaid internship focused entirely on academic research and publication in the AI topics listed above.
- Open to:
  - PhD scholars interested in publishing in advanced AI domains
  - B.Tech and M.Tech students who want to focus solely on research and co-authoring papers, without project or stipend commitments
- Ideal for individuals seeking to build a strong publication record, collaborate with researchers, and deepen their expertise in these topics.

If you're interested in this internship and would like to learn more, please fill out the Google Form, which includes a detailed description of the internship.

Google form: https://forms.gle/YkH3uo4wjUgiRLYh9

Thank you.

Regards,
HR Manager
CortexTor Labs, India
Email: hr(a)cortextor.com
https://cortextor.com/

Call for Papers: FEL XXIX 2025 - Basque Country, Spain - 22-25 October - Deadline extended to 1 June!
by Steven Krauwer 10 May '25

The 29th Annual Conference of the Foundation for Endangered Languages - FEL XXIX 2025 The Foundation for Endangered Languages and the UNESCO Chair on World Language Heritage are organising the 2025 edition of the FEL conference, “The Missing SDG: Endangered Languages and Sustainable Development”. Date: 22-25 October, 2025 Place: Vitoria-Gasteiz, Faculty of Arts at the UPV/EHU (Basque Country, Spain) Call for Papers now OPEN UNTIL 1 JUNE. More information on the website: https://www.ehu.eus/en/web/mho-unesco-katedra/fel-xxix-2025 -- _______________________________________________________________________ Steven Krauwer, CLARIN/FEL/ELSNET/ILS, Utrecht, NL, s.krauwer(a)uu.nl

SIGIR 2025 Call for Participation - 13-17 July 2025, Padua, Italy
by Diego Ceccarelli (BLOOMBERG/ LONDON) 09 May '25

SIGIR 2025 Call for Participation - 13-17 July 2025, Padua, Italy

Registration at: https://sigir2025.dei.unipd.it/registration.html

Registration deadlines:
Early registration: 20 May 2025
Regular registration: 13 June 2025
Late registration: 18 July 2025

The annual SIGIR conference is the major international forum for the presentation of new research results, and the demonstration of new systems and techniques, in the broad field of information retrieval (IR). The 48th ACM SIGIR conference will be run as an in-person conference from July 13th to 17th, 2025 in Padua, Italy, followed by ICTIR 2025 (https://ictir2025.cs.umass.edu/) on July 18th, 2025.

SIGIR 2025 will feature an extremely rich program, consisting of keynote talks, research and industry sessions, posters and demos, a doctoral consortium, workshops and tutorials, and the LiveRAG challenge, not forgetting amazing social events.

SIGIR 2025 features: 239 full papers, 106 short papers, 26 demos, 10 perspectives papers, 71 resource and reproducibility papers, 35 SIRIP/industry papers, 16 TOIS papers, and 10 Low Resource Environments papers, a brand new track launched at SIGIR 2025 (https://sigir2025.dei.unipd.it/low-resource-environments-track.html).

The overall program can be found at: https://sigir2025.dei.unipd.it/overall-program.html.

We strongly encourage you to book your hotel rooms as soon as possible! An Iron Maiden concert is scheduled in Padua for the first day of the conference (Sunday, July 13, 2025). Accommodations are expected to fill up quickly. Please see the SIGIR 2025 website for a list of hotels with conference discount (https://sigir2025.dei.unipd.it/recommended-hotels.html) and recommended hotels without conference discount (https://sigir2025.dei.unipd.it/accomodation.html). Travel information is available at: https://sigir2025.dei.unipd.it/how-to-get-here.html.

Child Care (https://sigir2025.dei.unipd.it/child-care.html)

Children from 0 to 12 years of age are welcome in a specially set-up room inside Padua Congress Center throughout the congress. It is a colorful, warm, safe and welcoming environment in the same building that hosts the conference: a protected place where parents can leave their children while staying in touch with them during the breaks between sessions. The nursery space for the 0-3 age group, equipped with a changing table and other facilities for the little ones, is next to the kids’ area for children over 3 years old and a cinema room for screenings.

The activities are run by a company specialized in education services with expert staff, appointed by Padua Congress Center, and include thematic workshops, from painting to creative activities, as well as open-air playtime in the facility's outdoor spaces. Children are also offered lunch.

The child care service will be run by operators who speak English, German, Spanish, and Italian. Other languages may be possible, but they will require specific agreements and the actual availability of an operator who speaks that language.

The service is available to all SIGIR 2025 attendees who request it. You can request the childcare service on the registration form; if you do, the registration confirmation email will ask for additional information about your children.

Catering is available on demand, is included in the cost of the service, and covers lunch, morning and afternoon snacks, juices and water. Lunches and snacks can be managed directly by the educators. Alternatively, parents can go to the Child Area to take care of lunches and snacks themselves. Parents will be allowed to use the kitchen in the area to heat up milk or food for younger children. Two microwave ovens and a refrigerator are available in the area.

Keynotes (https://sigir2025.dei.unipd.it/keynote-speakers.html)

Digitized Health, by Ophir Frieder, Georgetown University, Washington DC (USA)

The "AI revolution" is lavished with accolades and showered with concerns, including some dire warnings. Regardless, this revolution continues to shape nearly all technology and domains. We focus specifically on medical applications that rely on search or recommendation technology. Relying on these technologies, we alleviate the ever-growing shortage of medical care personnel. Specifically, patient interactions are simplified by conversational agents, medical triage is accomplished by self-administered surrogates, the early onset of mental health conditions is detected through opt-in monitoring agents, and treatment suggestions are generated and evaluated via retrieval and mining applications. These are just some examples where search and related technologies are reshaping medical practice. Currently or soon to be deployed systems are described. "In progress" efforts are likewise highlighted. While some of the described systems rely on recent technology advances, others are simply based on "bread and butter" approaches, reminding us that "new and improved" is not always needed, and at times, is overkill and needlessly costly. We conclude with some observations.

Please meet AI, our dear new colleague. In other words: can scientists and machines truly cooperate? By Iryna Gurevych, Technical University of Darmstadt, Germany

How can AI and LLMs facilitate the work of scientists in different stages of the research process? Can technology even make scientists obsolete? The role of AI and Large Language Models (LLMs) in science as the target application domain has recently been growing rapidly. This includes assessing the impact of scientific work, facilitating the writing and revising of manuscripts, as well as intelligent support for manuscript quality assessment, peer review and scientific discussions. The talk will illustrate such methods and models using several tasks from the scientific domain. We argue that while AI and LLMs can effectively support and augment specific steps of the research process, expert-AI collaboration may be a more promising mode for complex research tasks.

BM25 and All That - A Look Back, by Stephen Robertson, Girton College, Cambridge (UK)

It is 30 years since the weighting-and-ranking function BM25 was published, and more than 55 years since I started work in the field we know as information retrieval. I will be talking about my experiences as an IR researcher over the period from 1968 to the early 2000s, including the development of the probabilistic model which led to BM25, and also some of the work on IR evaluation in the years since the Cranfield experiment. More generally, I will talk about some of the ways in which the field has changed and developed over that time, and about some of the characters who helped to shape the field, including my own interactions with them.

LiveRAG Challenge (https://sigir2025.dei.unipd.it/live-rag-challenge.html)

The goal of the LiveRAG Challenge is to allow research teams across academia and industry to advance their RAG research and compare the performance of their solutions with other teams, on a fixed corpus (derived from the publicly available FineWeb) and a fixed open-source LLM, Falcon3-10B-Instruct. The SIGIR 2025 LiveRAG Challenge is organized by TII (Technology Innovation Institute) with support from AI71, AWS, Pinecone, and Hugging Face. It requires an application process, after which selected teams will be awarded up to 1500 USD in AWS compute credits to train their RAG solution, and up to 750 USD in Pinecone compute credits to use/generate their RAG indices. Selected teams will also be given early access to TII's DataMorgana tool to help them generate synthetic benchmarks for training and testing.
During the Live Challenge Day, the teams will be provided with a stream of unseen questions and will have to return their answers under strict response-time constraints. Finalists will be requested to present their results at the LiveRAG workshop day to be held at the SIGIR 2025 conference, during which winners will be announced and prizes will be awarded.

Workshops (https://sigir2025.dei.unipd.it/attend-workshops.html)

Full Day Workshops
* SIGIR 2025 Workshop on eCommerce (ECOM25) - https://sigir-ecom.github.io/
* LLM4Eval: Large Language Model for Evaluation in IR - https://llm4eval.github.io/SIGIR2025/
* Second SIGIR Workshop on Simulations for Information Access (Sim4IA 2025) - https://sim4ia.org/sigir2025/
* 2nd Workshop on Information Retrieval for Understudied Users (IR4U2) - Bridging User-centered AI with IR: Making Information Retrieval Accessible for All - https://ir4u2workshop.wixsite.com/ir4u2-2
* International Workshop on Algorithmic Bias in Search and Recommendation (BIAS) - https://biasinrecsys.github.io/sigir2025
* 6th Workshop on Patent Text Mining and Semantic Technologies (PatentSemTech2025) - http://ifs.tuwien.ac.at/patentsemtech/
* Workshop on Explainability in Information Retrieval - https://xirworkshop.github.io/
* IR-RAG: Workshop on Information Retrieval's Role in RAG Systems - https://coda.io/@rstless-group/ir-rag-sigir25
* ReNeuIR at SIGIR 2025: The Fourth Workshop on Reaching Efficiency in Neural Information Retrieval - https://reneuir.org/
* Robust-IR @ SIGIR 2025: The First Workshop on Robust Information Retrieval - https://sigir-2025-workshop-on-robust-ir.github.io/

Half Day Workshops
* MANILA25: SIGIR 2025 Workshop on Information Retrieval for Climate Impact - https://sites.google.com/view/ir-for-climate-impact/home
* AgentIR: 2nd Workshop on Agent-based Information Retrieval - https://applied-machine-learning-lab.github.io/2nd-AgentIR-Workshop-SIGIR-2…
* The 1st NIP@IR Workshop on New Interaction Paradigms for Information Retrieval in the Era of Generative AI - https://hellozicky.github.io/nip-ir2025.github.io/
* GENNEXT: The Next Generation of IR and Recommender Systems with Language Agents, Generative Models, and Conversational AI - https://sigirgennext.github.io/GENNEXT-SIGIR-25/
* FinIR: The 2nd Workshop on Financial Information Retrieval in the Era of Generative AI - https://finir2025.github.io/

Tutorials (https://sigir2025.dei.unipd.it/attend-tutorials.html)

Full Day Tutorials
* Information Retrieval in Finance: Industry and Academic Perspectives on Innovation - https://sites.google.com/view/irfin/

Half Day Tutorials
* Conversational Search: From Fundamentals to Frontiers in the LLM Era
* Efficient In-Memory Inverted Indexes: Theory and Practice - https://pisa-engine.github.io/sigir-2025.html
* R2LLMs: Retrieval and Ranking with LLMs - https://ielab.io/tutorials/r2llms.html
* Retrieval-Enhanced Machine Learning: Synthesis and Opportunities - https://retrieval-enhanced-ml.github.io/sigir-2025.html
* Query Understanding in LLM-based Conversational Information Seeking
* Navigating Large Language Models for Recommendation: From Architecture to Learning Paradigms and Deployment - https://generative-rec.github.io/tutorial-sigir25/
* Theory and Toolkits for User Simulation in the Era of Generative AI: User Modeling, Synthetic Data Generation, and System Evaluation - http://usersim.ai/sigir2025-tutorial
* Psychological Aspects in Retrieval and Recommendation - https://github.com/aisocietylab/Psy-IR-RecSys-SIGIR25
* Dynamic and Parametric Retrieval-Augmented Generation - https://sites.google.com/view/sigir2025-tutorial-dprag/home-page
* Fairness in Information Retrieval from an Economic Perspective - https://economic-fairness-ir.github.io/
* Unveiling Knowledge Boundary of Large Language Models for Trustworthy Information Access
* Neural Lexical Search with Learned Sparse Retrieval - https://lsr-tutorial.github.io/
* Long Context vs. RAG: Strategies for Processing Long Documents in LLMs - https://sites.google.com/view/sigir25-lc-vs-rag/

Social Events (https://sigir2025.dei.unipd.it/social-events.html)

* Welcome Reception, Sunday, July 13, 2025. Venue: Piazza Della Frutta, Padua (https://www.padova.com/discover/QG/piazza-della-frutta)
* Student Event, Monday, July 14, 2025. Venue: Roman Arena, Padua (https://giardinidellarena.com/)
* SIGIR Social Dinner, Tuesday, July 15, 2025. Venue: Villa Contarini, Piazzola sul Brenta (https://www.villacontarini.eu/)
* ICTIR Social Dinner, Thursday, July 17, 2025. Venue: Caffè Pedrocchi, Padua (https://www.caffepedrocchi.it/en/)

3 postdocs, 1 PhD student in digital humanities / text analysis in University of Tartu, Estonia
by Peeter Tinits 09 May '25

Dear all, Forwarding a job announcement for 3 postdocs and 1 PhD Student. Please share with interested parties! Thank you! Peeter We are happy to announce 3 postdoctoral research fellow<https://ut.ee/en/job-offer/research-fellow-digital-humanities> and 1 PhD junior researcher<https://ut.ee/en/content/phd-open-calls> positions in Digital Humanities affiliated with the Center for Digital Text Scholarship (DigiTS) at the University of Tartu, and funded by the European Union. DigiTS is aimed at carrying out cutting-edge, interdisciplinary research using modern digital methods applied to textual data in Humanities and Social Sciences. To this end, an international and interdisciplinary team will be formed at the Faculty of Arts and Humanities at the University of Tartu (UT), including postdocs and doctoral students. This new team of experts will enable key actions establishing UT as a center for research and education in Digital Humanities. Each Research Fellow and the junior researcher will mainly conduct research in the field of digital humanities, computational linguistics, computational literary studies, digital history, or related fields (depending on individual competencies), while also having the opportunity to teach and to participate in governance and institutional development. The postdoctoral research fellowships are full-time positions running through the end of the project in 2030. The deadline for applications is 2 June. More details and information about how to apply can be found here: https://ut.ee/en/job-offer/research-fellow-digital-humanities The junior researcher position is a full-time, 4-year position, aimed at the completion of a PhD thesis. The deadline for applications is 15 May for international applicants. More details and information about how to apply can be found here: https://ut.ee/en/content/phd-open-calls (under Faculty of Arts and Humanities > Digital Humanities) Why apply now? 
Currently there are multiple innovative projects being conducted in Estonia on closely related topics. This creates an exciting and promising scene for both formal and informal collaboration, especially since many of these projects are carried out at least partly at the University of Tartu, including the Estonian Center of Excellence in AI (EXAI<https://exai.ee/>), the ERC Consolidator project Rise and Demise of Industrial Modernity (RiDe<https://cordis.europa.eu/project/id/101170823>), the Language Data Research Infrastructure (KeTa), and Neural text analysis models enhanced with external linguistic resources (description<https://www.etis.ee/Portal/Projects/Display/7f84cf46-e183-4248-988c-7d777e2…>); note that RiDe is also offering PhD and postdoc positions. Why study or work in Estonia? Estonia offers Nordic quality of life, a strong academic environment and convenient digitized services—all while maintaining a reasonable cost of living that supports comfortable student life. The University of Tartu, founded in 1632, ranks among the top 1% of the world’s most cited universities and actively fosters sustainability and intersectoral collaboration, having produced numerous successful startups. Estonia itself ranks #1 in startups per capita in Europe. As a member of the EU and NATO, Estonia is internationally minded, and English is widely spoken. Estonia’s digital infrastructure streamlines official procedures, as everything from contracts to taxes can all be handled online in minutes by citizens and residents alike. Tartu is a lively university town known for its cozy atmosphere, vibrant student life, bike and walking friendly spaces, and scenic riverside. It has been named the UNESCO City of Literature, and the European Capital of Culture in 2024. The city is well connected to Europe and the world, but also offers easy access to nature, with nearby vast networks of forest hiking trails, excellent winter sports opportunities, and the charm of four distinct seasons.


AthNLP 2025 - 3rd ATHENS NLP SUMMER SCHOOL
by A. Vlachos 09 May '25


Call for Participation

AthNLP 2025 - 3rd ATHENS NLP SUMMER SCHOOL
============================================

** Application Deadline: May 30, 2025 **

Info for sponsors: see here<https://docs.google.com/presentation/d/1b79pLs0hJn-5FaZfqDY_1Wd6OHslO0i_/ed…>

We invite everyone interested in Natural Language Processing and Machine Learning to attend the 3rd Athens Natural Language Processing Summer School - AthNLP 2025: https://athnlp.github.io/2025/

Important Dates
--------------------------
* Application Deadline: May 30, 2025
* Decision: June 10, 2025
* Registration: June 17, 2025
* Summer School: September 4-10, 2025

Description
------------------
Following on from the success of the 1st and 2nd AthNLP schools in 2019 and 2024 respectively, AthNLP 2025 will take place at the campus of NCSR "Demokritos" in Athens and is organised jointly by NCSR "Demokritos", the Athens University of Economics and Business, RC "Athena", and Heriot-Watt University. AthNLP cooperates closely with the organisers of LxMLS, taking place in Lisbon on July 19-25.

The school will cover a range of NLP topics focusing on machine learning (ML) methods. There will be morning lectures focusing on theoretical aspects, afternoon lab sessions focusing on implementation and experimentation, and evening talks on research topics and perspectives, as well as demos and posters from the participants. The lectures and the evening talks will be given by internationally recognized researchers from academic and industrial research labs.

The topics to be covered include: classification, sequence prediction, linear models, neural networks, encoder-decoder architectures, machine translation, large language models, and multimodality.

Our target audience is:
* Researchers and graduate students in the fields of NLP and Computational Linguistics;
* Computer scientists who have interests in natural language processing and machine learning;
* Industry practitioners who desire a more in-depth understanding of these subjects.

While previous experience with the topics will be helpful, the school assumes no previous knowledge of natural language processing and machine learning. The only background assumed is basic mathematics and Python programming.

Features of AthNLP:
* Attendance at the Social Event, daily lunch, and morning and afternoon coffee breaks are included in the application fee.
* Lecturers are leading researchers in machine learning and natural language processing.
* Students will be able to (optionally) show their current work in poster sessions during coffee breaks.
* On the demo day, students will be able to interact with technical companies and research institutions working in machine learning.

Confirmed Speakers
---------------------------------
* Antonis Anastasopoulos, George Mason Computer Science
* Yulan He, King's College London, UK
* Raquel Fernández, University of Amsterdam
* Ryan McDonald
* Preslav Nakov, MBZUAI
* Vlad Niculae, University of Amsterdam
* Anna Rogers, IT University of Copenhagen

Participation
---------------------
To apply, please fill in this<https://ijerm0co.forms.app/athens-nlp-2025-summer-school> form.

The fees are the following:
* 300 EUR for students
* 400 EUR for university professors or researchers at a public institute
* 500 EUR for everyone else

Any questions should be directed to: athnlp2024(a)athenarc.gr

We are looking forward to your participation!

-- The organizers of AthNLP 2025

