- Corpora - ELRA lists

2nd CFP: 5th Workshop on Scholarly Document Processing (SDP 2025) @ACL 2025
by Tirthankar Ghosal 22 Feb '25

22 Feb '25

Dear colleagues, You are invited to participate in the 5th Workshop on Scholarly Document Processing (SDP 2025) to be held at ACL 2025 in Vienna, Austria. SDP 2025 will consist of a research track and five shared tasks. The call for research papers is described below, and more details can be found on our website, https://sdproc.org/2025/. Papers must follow the *ACL format* and conform to the *ACL 2025 Submission Guidelines*. Paper submission has to be done through OpenReview: https://openreview.net/group?id=aclweb.org/ACL/2025/Workshop/SDProc - Website: https://sdproc.org/2025/ - Submission site: < https://openreview.net/group?id=aclweb.org/ACL/2025/Workshop/SDProc> - X (Twitter): https://twitter.com/sdpworkshop - Shared tasks: https://sdproc.org/2025/sharedtasks.html - *Paper submission deadline: March 1 (Saturday), 2025* Call for Research Papers Scholarly literature is the chief means by which scientists and academics document and communicate their results and is therefore critical to the advancement of knowledge and improvement of human well-being. At the same time, this literature poses challenges to NLP uncommon in other genres, such as specialized language and high background knowledge requirements, long documents and strong structural conventions, multimodal presentation, citation relationships among documents, an emphasis on rational argumentation, and the frequent availability of detailed metadata and experimental data. These challenges necessitate the development of NLP methods and resources optimized for this domain. The Scholarly Document Processing (SDP) workshop provides a venue for discussing these challenges, bringing together stakeholders from different communities including computational linguistics, machine learning, text mining, information retrieval, digital libraries, scientometrics and others, to develop methods, tasks, and resources in support of these goals. This workshop builds on the success of prior workshops: SDP workshops held at EMNLP 2020, NAACL 2021, COLING 2022, and ACL 2024, and the 1st and 2nd SciNLP workshops held at AKBC 2020 and 2021. In addition to having broad appeal within the NLP community, we hope the SDP workshop will attract researchers from other relevant fields including meta-science, scientometrics, data mining, information retrieval, and digital libraries, bringing together these disparate communities within ACL. Topics of Interest We invite submissions from all communities demonstrating usage of and challenges associated with natural language processing, information retrieval, and data mining of scholarly and scientific documents. Relevant topics include (but are not limited to): - Large Language Models (LLMs) for science - Representation learning and language modeling - Information extraction and NER - Document understanding - Summarization and generation - Question-answering - Discourse modeling/argumentation mining - Network analysis - Bibliometrics, scientometrics, and altmetrics - Reproducibility and research integrity, including new challenges posed by generative AI - Peer review tools, principles and technology - Metadata and indexing - Inclusion of datasets and computational resources - Research infrastructures and digital libraries - Increasing the representation in scholarly work of disadvantaged populations - LLM-based interfaces to consume/produce scholarly documents - Impact of scholarly communication on popular discourse Submission Information Authors are invited to submit full and short papers with unpublished, original work. Submissions will be subject to a double-blind peer-review process. Accepted papers will be presented by the authors at the workshop either as a talk or a poster. All accepted papers will be published in the workshop proceedings (proceedings from previous years can be found here: https://aclanthology.org/venues/sdp/), which will be published in the ACL Anthology. The submissions must be in PDF format and anonymized for review. All submissions must be written in English and follow the ACL 2025 formatting requirements: *Long paper submissions:* up to 8 pages of content, plus unlimited references. *Short paper submissions:* up to 4 pages of content, plus unlimited references. *Submission Website:* Paper submission has to be done through openreview: <https://openreview.net/group?id=aclweb.org/ACL/2025/Workshop/SDProc> Final versions of accepted papers will be allowed 1 additional page of content so that reviewer comments can be taken into account. Important Dates (Main Research Track) - First call for workshop papers: December 19, 2024 - Second call for workshop papers: February 21, 2025 - *Paper submission deadline: March 1, 2025* - Pre-reviewed (ARR) submission deadline: March 25, 2025 - Notification of acceptance: April 17, 2025 - Camera-ready paper due: May 16, 2025 - Workshop dates: July 31 – August 1, 2025 Note: Shared task submission deadlines and other important dates to be announced. SDP 2024 Keynote Speakers We are excited to have several keynote speakers at SDP 2025. 1. *Tom Hope* <https://tomhoper.github.io/>, Assistant Professor at Hebrew University of Jerusalem and Research Scientist at Allen Institute for AI. 2. *James A. Evans* <https://sociology.uchicago.edu/directory/James-A-Evans>, Professor and Director of the Knowledge Lab at University of Chicago and External Professor at the Santa Fe Institute. 3. *TBA* SDP 2025 Shared Tasks SDP 2025 will host six exciting shared tasks. More information about all shared tasks is provided on the workshop website: https://sdproc.org/2025/sharedtasks.html *Detecting automatically generated scientific papers (DAGPap 25)* A big problem with the ubiquity of Generative AI is that it has now become very easy to generate fake scientific papers. This can erode public trust in science and attack the foundations of science: are we standing on the shoulders of robots? The Detecting Automatically Generated Papers (DAGPAP) competition aims to encourage the development of robust, reliable AI-generated scientific text detection systems, utilizing a diverse dataset and varied machine learning models in a number of scientific domains. Organizers: Savvas Chamezopoulos, Dan Li, Anita de Waard (Elsevier). *Contextualizing Scientific Figures and Tables (Context 25)* Interpreting scientific claims in the context of empirical findings is a valuable practice, yet extremely time-consuming for researchers. Such interpretation requires identifying key results (often captured in tables and figures) that provide supporting evidence from research papers, and contextualizing these results with associated methodological details (e.g., measures, sample, etc.). During the previous version of this shared task in 2024, we released datasets to support the development of methods for automatic identification of key result figures or tables as well as additional grounding context to make claim interpretation more efficient. However, the released datasets contained tables and images already extracted from the scientific papers to allow participants to bypass PDF pre-processing issues. In Context 2025, given recent advances in multimodal LLMs, we plan to extend the difficulty of this task by requiring participants to identify key results from paper PDFs directly, and add a new sub-task on multi-hop reasoning over scientific evidence. Organizers: Joel Chan, Matthew Akamatsu, Aakanksha Naik *Scientific Visual Question Answering (SciVQA)* Scholarly articles convey valuable information not only through unstructured text but also via (semi-)structured figures such as charts and diagrams. Automatically interpreting the semantics of knowledge encoded in these figures can be beneficial for downstream tasks such as question answering (QA). In the SciVQA challenge, the participants will develop multimodal systems capable of efficiently processing both visual (i.e., addressing attributes such as colour, shape, size, etc.) and non-visual QA pairs based on images of scientific figures and their captions. Organizers: Ekaterina Borisova, Georg Rehm *Scientific Fact-checking of Social Media Posts on Climate Change (ClimateCheck)* The ClimateCheck shared task focuses on fact-checking claims from social media about climate change against peer-reviewed scholarly articles. Participants will retrieve relevant publications from a corpus of 400,000 climate research articles and classify each abstract as supporting, refuting, or not having enough information about the claim. Training data will include human-annotated claim-publication pairs, and the evaluation will combine nDCG@K and Bpref for retrieval and F1 score for classification. The task aims to develop models that link social media claims to scientific evidence, promoting informed and evidence-based discussions on climate change. Organizers: Raia Abu Ahmad, Georg Rehm *Software Mention Detection in Scholarly Publications (SOMD 25)* Software plays an essential role in computational research methods and is considered one of the crucial entities in scholarly documents. However, software mentions are not always cited in academic documents, resulting in various informal mentions of software across a paper. Automatic identification of such software mention contributes to the better understanding, accessibility, and reproducibility of the research work. In addition to the mention of software, to understand the research context, it is necessary to understand the purpose of a software mention and its attributes, making software mention detection a comprehensive task. We are extending our first iteration of the shared task SOMD 2024 <https://nfdi4ds.github.io/nslp2024/docs/somd_shared_task.html> with new challenges. In addition to information extraction techniques, our extended focus would be on Joint Named Entity and Relation Classification techniques. Organizers: Sharmila Upadhyaya, Frank Krueger, Stefan Dietze *SciHal2025: Hallucination Detection for Scientific Content* Generative AI-enhanced academic research assistants are transforming how research is conducted. By allowing users to pose research-related questions in natural language, these systems can generate structured and concise summaries supported by relevant references. However, hallucinations — unsupported claims introduced by large language models — remain a significant obstacle to fully trusting these automatically generated scientific answers. Organizing Committee - Tirthankar Ghosal, Oak Ridge National Laboratory, USA - Philipp Mayr, GESIS – Leibniz Institute for the Social Sciences, Germany - Aakanksha Naik, Allen Institute for AI, USA - Amanpreet Singh, Allen Institute for AI, USA - Anita de Waard, Elsevier, Netherlands - Dayne Freitag, SRI International, USA - Georg Rehm, German Research Center for Artificial Intelligence (DFKI), Germany - Sonja Schimmler, Fraunhofer FOKUS, Germany - Dan Li, Elsevier, Netherlands

1 0

Second CFP: The 7th Workshop on Research in Computational Linguistic Typology and Multilingual NLP (SIGTYP 2025), held jointly with FieldMatters workshop on NLP Application for Field Linguistics
by Kat Vylomova 22 Feb '25

22 Feb '25

To be held at ACL 2025, running jointly with FieldMatters (July 31 or August 1 in Vienna, Austria) Workshop description The aim of the 7th edition of SIGTYP workshop is to act as a platform and a forum for the exchange of information between typology-related research, multilingual NLP, and other research areas that can lead to the development of truly multilingual NLP methods. The workshop is specifically aimed at raising awareness of linguistic typology and its potential in supporting and widening the global reach of multilingual NLP, as well as at introducing computational approaches to linguistic typology. It will foster research and discussion on open problems, not only within the active community working on cross- and multilingual NLP but also inviting input from leading researchers in linguistic typology. Our workshop will serve as a platform to enable fruitful discussions. In 2025, we would additionally focus on the utility of LLMs for typological research. SIGTYP is the first dedicated venue for typology-related research and its integration in multilingual NLP. Appropriate topics include (but are not limited to) the following as they relate to the areas of the workshop: - Integration of typological features in language transfer and joint multilingual learning. In addition to established techniques such as “selective sharing”, are there alternative ways to encoding heterogeneous external knowledge in machine learning algorithms? - Development of unified taxonomy and resources. Building universal databases and models to facilitate understanding and processing of diverse languages. - Automatic inference of typological features. The pros and cons of existing techniques (e.g. heuristics derived from morphosyntactic annotation, propagation from features of other languages, supervised Bayesian and neural models) and discussion on emerging ones. - Typology and interpretability. The use of typological knowledge for interpretation of hidden representations of multilingual neural models, multilingual data generation and selection, and typological annotation of texts. - Improvement and completion of typological databases. Combining linguistic knowledge and automatic data-driven methods towards the joint goal of improving the knowledge on cross-linguistic variation and universals. - Linguistic diversity and universals. Challenges of cross-lingual annotation. Which linguistic phenomena or categories should be considered universal? How should they be annotated? - Using LLMs for typological studies. Can LLMs be utilised to formulate or prove typological hypotheses? Are they capable of making useful cross-linguistic generalisations? - Extra topics also include: generation of constructed languages, universals in diachronic languages changes, information-theoretic approaches to typology, automated approaches to etymology. Important Dates (all deadlines are 23:59 AoE) Direct submission deadline: March 1, 2025 Pre-reviewed (ARR) submission deadline: March 25, 2025 Notification of acceptance: April 17, 2025 Camera-ready paper deadline: May 16, 2025 Workshop dates: July 31st or August 1st 2025 Submissions We invite both extended abstract submissions (non-archival) and general paper submissions (archival). The accepted submissions will be presented at the workshop, providing new insights and ideas. Extended abstracts should describe already published work or work in progress and should not exceed two (2) pages. This way, we will not discourage researchers from preferring main conference proceedings, at the same time ensuring that interesting and thought-provoking research is presented at the workshop. For general (archival) submissions we accept both long and short papers. Short papers should not exceed four (4) pages, long papers should not exceed eight (8) pages papers. Unlimited additional pages are allowed for the references section in all submission types. Submissions should be anonymous, without authors or an acknowledgement section; self-citations should appear in third person. Submissions must follow the ACL 2025 stylesheet https://github.com/acl-org/acl-style-files; both long and short paper submissions must follow the two-column format of ACL proceedings. All submissions must be in PDF format. These should be submitted via OpenReview: https://openreview.net/group?id=aclweb.org/ACL/2025/Workshop/SIGTYP. Organizing Committee Michael Hahn, Priya Rani, Andreas Shcherbakov, Oleg Serikov, Aleksey Sorokin, Ryan Cotterell and Kat Vylomova Anti-harassment policy The workshop follows the ACL anti-harassment policy: https://www.aclweb.org/adminwiki/index.php?title=Anti-Harassment_Policy. Contact For any inquiries regarding the workshop, please send an email to the Organizing Committee at sigtyp(a)gmail.com

1 0

AI for Research Data Management, Workshop AIRDeM @ ESWC 2025
by khuram shehzad 21 Feb '25

21 Feb '25

Dear Researchers, Are you exploring AI-driven solutions for Research Data Management (RDM)? Join us at AIRDeM 2025 <https://ai4rdm.github.io/AIRDeM/>, a workshop at ESWC 2025 <https://2025.eswc-conferences.org/> (June 1-5, Portoroz, Slovenia), to discuss AI applications enhancing RDM and FAIR principles for research data. Topics include but not limited to: - Automated Metadata Generation - FAIR Metrics & AI - Knowledge Graphs for RDM - AI for Data Summarization & Querying - Automated Data Cleaning & Harmonization - AI-Driven Data Interoperability - Explainable AI in RDM - Domain-Specific AI for RDM - Collaborative AI for RDM Challenges - Neuro-Symbolic AI for Data Management We invite full papers, short papers, demos, and position papers, as well as submissions for our collaborative "Bring Your Challenge" session—where real-world RDM problems meet innovative AI solutions! - Submission Deadline: March 20, 2025 - Submit your work here: [Submission page] <https://app.oxfordabstracts.com/stages/77706/submissions/new?behalf=false&f…> Let’s shape the future of RDM together! Best, Hajira & AIRDeM Organizing Team

1 0

1st Call for Papers: Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)
by Manuela Sanguinetti 21 Feb '25

21 Feb '25

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ CLiC-it 2025 - Eleventh Italian Conference on Computational Linguistics ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 24 - 26 September 2025, Cagliari, Italy https://clic2025.unica.it/ Conference Announcement and First Call for Papers Over the years, CLiC-it has evolved into an important forum for the Italian community of researchers in Computational Linguistics (CL) and Natural Language Processing (NLP). CLiC-it aims to promote and disseminate high-quality, original research covering different aspects of automatic language processing, involving both written and spoken language. Furthermore, it seeks to showcase cutting-edge theoretical findings, experimental methodologies, technologies, and application perspectives. The spirit of the conference is inclusive. Recognizing the multifaceted nature of language phenomena and the need for interdisciplinary expertise, CLiC-it aims to bring together researchers from different fields including Computational Linguistics and Natural Language Processing, Linguistics, Cognitive Science, Machine Learning, Computer Science, Knowledge Representation, Information Retrieval, and Digital Humanities. CLiC-it welcomes contributions focusing on all languages, with a particular emphasis on Italian. CLiC-it 2025 will be held in Cagliari, from the 24th to the 26th of September. CLiC-it is organised by the Italian Association of Computational Linguistics (AILC -- http://www.ai-lc.it/). Conference topics ----------------------------- CLiC-it 2025 aims to have a broad technical program. Relevant topics for the conference include, but are not limited to (in alphabetical order): - Computational Historical Linguistics - Computational Social Science and Cultural Analytics - Dialogue and Interactive Systems - Discourse and Pragmatics - Ethics and NLP - Generation - Handwritten Text Recognition - Information Extraction - Information Retrieval and Text Mining - Interpretability and Analysis of Models for NLP - Language Grounding to Vision, Robotics and Beyond - Large Language Models - Linguistic Diversity - Linguistic Theories, Cognitive Modeling, and Psycholinguistics - Machine Learning for NLP - Machine Translation - Multilingualism and Cross-Lingual NLP - NLP Applications - NLP for the Humanities - Phonology, Morphology, and Word Segmentation - Pragmatics and Creativity - Question Answering - Resources and Evaluation - Semantics: Lexical, Sentence-level Semantics, Textual Inference, and Other Areas - Sentiment Analysis, Stylistic Analysis, and Argument Mining - Speech and Multimodality - Summarization - Syntax: Tagging, Chunking and Parsing Research Communications ---------------------------------- CLiC-it 2025 adopts a parallel submission policy for outstanding papers accepted in 2024 and 2025 by major publication venues, namely the major international CL conferences (workshops excluded) or international journals. These contributions can be submitted to CLiC-it 2025 as short research communications. Research communications will not be published in the conference proceedings, they serve primarily to promote the dissemination of high-quality research within the Italian CL community. Submitted research communications must be in the scope of the CLiC-it 2025 conference. The authors of papers that meet the above criteria are invited to submit a written (maximum) one-page abstract of the original paper, including the paper’s title and authors as well as a pointer to the original conference or journal where the paper was published. If needed, research communications will undergo a selection process overseen by the conference chairs. Since these papers have already been reviewed, the selection criteria will primarily consider their original publication venue. Priority will be granted to papers that align most closely with the conference program, ensuring a balanced representation across various conference topics. The research communication papers will be presented at the conference either orally or as a poster according to the number of submissions received. Paper Submission ----------------------- Submitted papers must describe substantial, original, completed, and unpublished work. Wherever appropriate, concrete evaluation and analysis should be included. CLiC-it 2025 allows for a multiple-submission policy. In case of acceptance of the paper in other venues, the authors must communicate this information to the CLiC-it 2025 Chairs as soon as possible. Papers may consist of at least six (6) and no more than eight (8) pages of content, and up to three (3) pages of references. Supplementary material is also allowed, but it should not exceed one (1) page in length. Authors are reminded that all relevant content should be included in the main text of the paper. Upon acceptance, final versions of papers will be given one additional page of content, so that reviewers’ comments can be taken into account. Papers will be evaluated according to the following criteria: - soundness of approach - relevance to computational linguistics - novelty and clarity of relation with related work - quality of presentation - quality of evaluation (if applicable) - verifiability and ability to replicate (if applicable) Papers can be either in English or Italian, with the abstract in English. Accepted papers will be published online and will be presented at the conference either orally or as a poster. Reviewing will NOT be blind, so there is no need to remove author information from manuscripts. The required template for CLiC-it submissions must be compatible with CEUR (https://ceur-ws.org/). You can download the conference-adapted version at the following links: LaTeX template https://clic2025.unica.it/wp-content/uploads/2025/02/CLiC-it-2025-template.… Word template https://clic2025.unica.it/wp-content/uploads/2025/02/CLiC_it_2025_template.… Should you encounter any issues with the compilation (as the CEUR template has historically presented some challenges and is not modifiable without risking exclusion from the proceedings), we provide a read-only Overleaf template: https://www.overleaf.com/read/hzyckyjzwhwb#06b27c This template can be accessed and cloned to help resolve any technical difficulties. Papers and research communications must be submitted through the START platform using the following link: https://softconf.com/p/clic-it2025 For research communications, the appropriate track should be selected. Awards --------- To acknowledge the contribution of young researchers to the field, the title of "best paper" will be awarded to outstanding papers, provided that a Master's or PhD student is the first author and presents the work at the conference. Recipients of this award will be invited to submit an extended version of their papers to the Italian Journal of Computational Linguistics (IJCoL). To recognise excellence in student research as well as promote awareness of our field, AILC is also conferring the “Emanuele Pianta” prize for the best Master Thesis (Laurea Magistrale) in Computational Linguistics submitted at an Italian University. The prize consists of 500 Euros plus free membership to AILC for one year and free registration to the upcoming CLiC-it. Important Dates --------------- - 09/06/2025: Paper submission deadline: regular papers and research communications - 21/07/2025: Notification to authors of reviewing/selection outcome - 04/08/2025: Camera ready version of accepted papers - 24-26/09/2025: CLiC-it 2025 Conference, Cagliari Conference Chairs ------ - Cristina Bosco (University of Torino) - Elisabetta Jezek (University of Pavia) - Marco Polignano (University of Bari) - Manuela Sanguinetti (University of Cagliari) Program Committee: ------ TBA Local Organizing Committee: ------ - Maurizio Atzori (DMI, University of Cagliari) - Andrea Loddo (DMI, University of Cagliari) - Alessandro Pani (DMI, University of Cagliari) - Alessandra Perniciano (DMI, University of Cagliari) - Luca Zedda (DMI, University of Cagliari) Web chairs: ------ - Maurizio Atzori - Andrea Loddo Further information ------------------- Conference website: https://clic2025.unica.it/ Mail: clicit2025cagliari(a)gmail.com

1 0

3rd CFP-GITT-2025 at MT Summit 2025
by bsavoldi＠fbk.eu 20 Feb '25

20 Feb '25

3rd CALL FOR PAPERS Third International Workshop on Gender-Inclusive Translation Technologies (GITT) at MT Summit 2025 23 June 2025, Geneva, Switzerland https://sites.google.com/tilburguniversity.edu/gitt2025 @gitt-workshop.bsky.social Important Dates (Time zone: Anywhere on Earth) Submission deadline: 10 March, 2025 Notification of Acceptance: 4 April, 2025 Camera Ready Copy due: 11 April, 2025 Workshop: 23 June, 2025 **Aim and scope** The Gender-Inclusive Translation Technologies Workshop (GITT) is set out to be the dedicated workshop that focuses on gender-inclusive language in translation and cross-lingual scenarios. The workshop aims to bring together researchers from diverse areas, including industry partners, MT practitioners, and language professionals. GITT aims to encourage multidisciplinary research that develops and interrogates both solutions and challenges for addressing bias and promoting gender inclusivity in MT and translation tools, including LMs applications for the translation task. **Topics** GITT invites technical as well as non-technical submissions, which consist of experimental, theoretical or methodological contributions. We explicitly welcome interdisciplinary submissions and submissions that focus on innovative, non-binary linguistic strategies and/or with sociolinguistically-informed perspectives. The topics of interest include, but are not limited to: - Models or methods for assessing and mitigating gender bias - New resources for inclusive language and gender translation (e.g., datasets, translation memories, dictionaries) - Social, cross-lingual, and ethical implications of gender bias - Qualitative and quantitative analyses on the potential limits of current approaches to gender bias in translation and MT, error taxonomies as well as best practices and guidelines - User-centric case studies on the impact of biased language and/or mitigating approaches which can include translators, post-editors, or monolingual MT users GITT is also open to other non-listed topics aligned with the scope of the workshop and works focusing on non-textual modalities (e.g., audiovisual translation) **Submission** We welcome four types of submissions, two archival and two non-archival. ARCHIVAL - Research papers: of at least 4 up to 10 pages (excluding references) - Extended Abstracts: up to 2 pages (including references) Accepted papers and extended abstracts consisting of novel work will be published online as proceedings in the ACL Anthology. NON-ARCHIVAL - Research Communications: up to 2 pages (including references). We include a parallel submission policy in the form of Research Communications for papers related to the topic of GITT that were accepted in other venues in 2024 and 2025. - Potluck Communications: short abstract up to 500 words (including references). Potluck Communications offer a space for anyone—especially students and early career researchers—to discuss bold new ideas for collaboration, brainstorm about ongoing work, and explore future research directions. The communications will not be included in the proceedings, but will serve to promote the dissemination of research aligned with the scope of the workshop. All submissions should adhere to the MT Summit 2025 guidelines and style templates (PDF, LaTeX, Word) and be uploaded on Easychair (https://easychair.org/my/conference?conf=mtsummit2025) **Workshop organizers** Janiça Hackenbuchner, University of Ghent Luisa Bentivogli, Fondazione Bruno Kessler Joke Daems, University of Ghent Chiara Manna, University of Tilburg Beatrice Savoldi, Fondazione Bruno Kessler Eva Vanmassenhove, University of Tilburg

1 0

Interdisciplinary postdoc position at IT University of Copenhagen
by Anna Rogers 20 Feb '25

20 Feb '25

The Data Science section at the IT University of Copenhagen has an open position for a postdoc, on the topic of *scaling up qualitative interviews with Large Language Models* (LLMs). The project is a collaboration with the Center for Social Data Science <https://sodas.ku.dk/> at the university of Copenhagen. This is a NLP position, but some relevant background in social sciences or psychology is a plus. *Application deadline is March 17 2025*. Applications can be submitted via ITU job portal: Application link: https://candidate.hr-manager.net/ApplicationInit.aspx?cid=119&ProjectId=181… IT University of Copenhagen is the leading Danish university dedicated to various aspects of IT technology, and Copenhagen is one of the happiest and most livable cities in the world. The candidates will be part of NLPNorth research group <https://nlpnorth.github.io/> (with 5 full-time faculty working in various areas of NLP), as well as Pioneer Center for AI <https://www.aicentre.dk/>. The postdoc will be hosted by Assoc. Prof. Anna Rogers <https://annargrs.github.io/>, to whom inquiries about the project can be directed (arog(a)itu.dk). The domain expert on this project is Hjalmar Alexander Bang Carlsen (sociology, hc(a)sodas.ku.dk). -- Anna Rogers Associate Professor IT University of Copenhagen http://annargrs.github.io/

1 0

Linguistic Accessibility of Legal Texts: Challenges, Methods, and Applications
by Špela Arhar Holdt 20 Feb '25

20 Feb '25

Dear colleagues, The scientific journal Slovenščina 2.0 (diamond open-access; indexed in Scopus, ERIH PLUS, COBISS.si, dLib.si, DOAJ, Sherpa Romeo, Linguistic Bibliography, and CNKI) warmly invites you to submit abstracts for a special issue dedicated to linguistic accessibility in legal and administrative language. Contributions may address various aspects of linguistic accessibility, including linguistic characteristics of legal and administrative texts, user studies, specialized language resources, technologies, and tools, as well as related social and ethical issues. Empirical and interdisciplinary studies are particularly appreciated. Abstracts (450–550 words) may be submitted in English and Slovenian and should be sent by *March 31, 2025*, to senja.pollak(a)ijs.si and spela.arharholdt(a)ff.uni-lj.si. More information: https://journals.uni-lj.si/slovenscina2/announcement. We look forward to your contributions! Špela Arhar Holdt

1 0

CfP: Deadline Extension for Challenges Proposals at ISWC 2025 (March 4)
by Genet Asefa Gesese 20 Feb '25

20 Feb '25

CfP: Deadline Extension for Challenges Proposals at ISWC 2025 (March 4) 24th International Semantic Web Conference (ISWC 2025) Nara, Japan November 2-6, 2025 Follow us: Twitter/X: @iswc_conf #iswc_conf ( https://twitter.com/iswc_conf ) LinkedIn: https://www.linkedin.com/company/iswc/ Mastodon Social: https://mastodon.social/@iswc_conf Bsky: https://bsky.app/profile/iswc-conf.bsky.social A great way to advance the state of the art in a given domain is to create competition. We invite you to propose an ISWC 2025 Challenge, in which you define an open competition on a problem of your choice within the Semantic Web domain. Topics: For ISWC 2025, Challenge proposals are invited for any challenge involving Semantic Web tasks, including but not limited to: - Ontology and knowledge graph alignment - Ontology and knowledge graph quality assurance (QA) - Knowledge graph construction and refinement - Graph embeddings and graph neural networks - Query and reasoning scalability - Open information extraction - Neurosymbolic reasoning - Semantic Web and data mining - Semantic Web, machine learning, and neuro-symbolic AI - Link prediction - Question answering - Stream processing and reasoning - Semantic table understanding - Agents (especially using neuro-symbolic methods) and agentic workflows - Retrieval Augmented Generation (RAG) Proposal Stage: ISWC 2025 will run multiple challenges with the aim of evaluating and comparing software solutions for the Semantic Web and Knowledge Graphs in a systematic way. To propose a challenge, you are required to: - Specify a task to be addressed. - Provide an evaluation dataset. - Define evaluation measures to compare the performance of participating systems. Important Date: Challenge proposal submission due March 4th, 2025 Website: https://iswc2025.semanticweb.org/#/calls/challenges Semantic Web Challenge Chairs: Contact: iswc2025-challenge(a)easychair.org Mayank Kejriwal University of Southern California, United States Pablo Mendes Upwork, United States Would you like to know about the host city, Nara? Check out this blog: https://iswc2025.semanticweb.org/#/blogs/host -- *Dr.-Ing. **Genet Asefa Gesese* Head of Machine Learning Department (Abteilungsleitung Maschinelles Lernen) FIZ Karlsruhe – Leibniz Institute for Information Infrastructure, Germany ( *https://www.fiz-karlsruhe.de/en/bereiche/lebenslauf-und-publikationen-dr-ing-genet-asefa-gesese <https://www.fiz-karlsruhe.de/en/bereiche/lebenslauf-und-publikationen-dr-in…>* ) AND Karlsruhe Institute of Technology (KIT) *( https://ise.aifb.kit.edu/21_89.php <https://ise.aifb.kit.edu/21_89.php> )*

1 0

Short-term research position in Computational Social Science
by Ines Rehbein 20 Feb '25

20 Feb '25

The Natural Language Processing and Information Retrieval Group at the University of Mannheim invites applications for a short-term position for ONE RESEARCHER (all genders welcome) IN COMPUTATIONAL POLITICAL SCIENCE / DATA SCIENCE / COMPUTATIONAL LINGUISTICS / STATISTICAL NLP The position is affiliated to the interdisciplinary research project Uncovering ideological frames in political discourses (UNCOVER) headed by Ines Rehbein [1] and Simone Ponzetto [2]. We are looking for a highly motivated candidate to conduct research in the area of NLP for political science. The candidate should have a strong interest and background in one or several of the following areas * computational political science * NLP / data science / text-as-data * computational linguistics / semantic text processing with a strong drive for interdisciplinary research and high proficiency in German. Candidates should have a Masters degree in Computational Linguistics, Natural Language Processing, Computer Science, Data Science, Political Science or related areas. The position will be located at the Data and Web Science Group (DWS) of the University of Mannheim, one of leading centers for Data Science in Germany (more information can be found at http://dws.informatik.uni-mannheim.de). Duration: The position will be available for 6 to 9 months and can be arranged as a full or part-time position. Salary range: according to German public scale TV-L 13. Applications can be made per e-mail (rehbein(a)uni-mannheim.de) and should include a short research statement, CV, copy of university degrees and transcripts. All documents should be e-mailed as a single PDF. All applications sent before March, 10, 2025 will receive full consideration. The position remains open until filled. Please feel free to contact Ines Rehbein (rehbein(a)uni-mannheim.de) for informal enquiries. The University of Mannheim is committed to increase the percentage of female scientists and encourages female applicants to apply. Among candidates of equal aptitude and qualifications, a person with disabilities will be given preference. Please note that threats to confidentiality and unauthorized access by third parties cannot be ruled out when communicating by unencrypted e-mail. The information on the collection of personal data from the data subject pursuant to Article 13 DS-GVO can be found on our homepage: https://www.uni-mannheim.de/universitaet/stellenanzeigen/datenschutz-bei-be…. [1] https://www.uni-mannheim.de/dws/people/researchers/postdoctoral-research-fe… [2] https://www.uni-mannheim.de/dws/people/professors/prof-dr-simone-paolo-ponz… -- Ines Rehbein Data and Web Science Group University of Mannheim, Germany

1 0

10th Annual Law & Corpus Linguistics Conference
by Brett Hashimoto 19 Feb '25

19 Feb '25

Dear Colleagues, The 10th annual Law & Corpus Linguistics Conference will be held on Friday, October 24, 2025 at Brigham Young University's J. Reuben Clark Law School in Provo, Utah. The keynote address will be delivered by Abbe Gluck, the Alfred M. Rankin Professor of Law and Faculty Director of the Solomon Center for Health Law and Policy at Yale Law School. Proposals are invited for individual papers and panels. We are open to submissions on a broad range of topics, including but not limited to: * applications of corpus linguistics to constitutional, statutory, contract, patent, trademark, probate, administrative, and criminal law in any state or nation; * philosophical, normative, and pragmatic commentary on the use of corpus linguistics in the law; * triangulation between corpus linguistics and other empirical methods in legal interpretation; * corpus-based analysis of legal discourse or topics; * best practices in corpus design and corpus linguistic methods in legal settings. The proposal deadline is May 15, 2025. Proposals should include an abstract of no more than 750 words and complete contact information for presenters. Please send proposals to byulawcorpus(a)law.byu.edu<mailto:byulawcorpus@law.byu.edu>. We hope you'll consider submitting a proposal! Sincerely, LCL 2025 Conference Organizing Committee (Tom Lee, Jesse Egbert, Brett Hashimoto, James Heilpern, James Phillips)

1 0

2026

2025

2024

2023

2022

Corpora