May 2024 - Corpora - ELRA lists

[CFP] ACL 2024 Workshop on Knowledge Graphs and LLMs (KaLLM)
by Russa Biswas 29 May '24

Dear colleagues, Apologies for cross-posting. *FINAL CALL FOR PAPERS* We want to invite you to submit the unpublished results of your research on Knowledge Graphs and Large Language Models to: *The 1st Workshop on Knowledge Graphs and Large Language Models (KaLLM)*, to be held on *August 15, 2024*, co-located with *ACL 2024*, Bangkok, Thailand. Call for Participation *Submission Deadline: May 30, 2024 at 23:59, UTC -12h, AoE* *Website*: https://kallmworkshop.github.io/kallm2024/ *Contact email*: kallmworkshop2024(a)googlegroups.com The workshop intends to provide a platform for researchers, practitioners, and industry professionals to explore the synergies between LLMs and KGs. We aim to provide a space for the LLM community and the community of KG researchers to interact and explore how these two communities could collaborate and support one another. *Submission Guidelines:* Papers must be submitted in PDF format using the official ACL template. More details are available on the website. *Scope of the workshop:* KaLLM invites quality research contributions as short or long papers and resource papers. All submissions will undergo a double-blind review process, and accepted submissions will be presented at the workshop. The submissions should focus on the interaction between LLMs and KGs in the context of NLP. 
The workshop will cover a diverse range of topics related to the integration of LLMs and KGs, including but not limited to: - Knowledge-enhanced language generation - KG-based question answering using LLMs - Fact validation and bias mitigation - KG creation and completion using LLMs - Privacy considerations in LLM-KG integration - Interpretability and explainability - Cross-domain applications - KG-based text summarisation with LLMs - Ethical implications of LLM-KG technologies - Multimodality of KGs and LLMs - Multilingual LLMs for KGs and vice-versa We look forward to receiving your submissions and having your valuable contribution to the success of the workshop. If you have any questions or require further information, please do not hesitate to contact us at kallmworkshop2024(a)googlegroups.com or visit https://kallmworkshop.github.io/kallm2024/. Thank you and best regards, Workshop Organisers Russa Biswas, Hasso Plattner Institute, Germany Lucie Aimée Kaffee, Hugging Face Oshin Agarwal, Bloomberg, USA Pasquale Minervini, University of Edinburgh, UK Sameer Singh, University of California, Irvine, USA Gerard de Melo, Hasso Plattner Institute, University of Potsdam, Germany

Last CfP: FEL2024, The XXVIIIth Annual Conference of the Foundation for Endangered Languages, 25-27 September 2024
by Steven Krauwer 28 May '24

LAST CALL FOR PAPERS -- DEADLINE IS 31 MAY! =========================================== The Foundation for Endangered Languages (FEL) and the Forum for Language Initiatives (FLI), in collaboration with Allama Iqbal Open University Islamabad, will hold the 28th Annual Conference - FEL XXVIII in Islamabad, Pakistan, 25 – 27 September 2024 Main theme of the conference: Endangered Languages and Oral Traditions. Conference topics include, but are not limited to: 1. Endangered oral literatures: heritage preservation (music, poetry, mushaira, contests...) 2. Oral cultures and traditional knowledge 3. Documentation and digitalization of oral art and literature 4. Language policy, planning, and oral art 5. Oral art and mother tongue education 6. Mother-tongue education policies: oral art and literature 7. Rediscovering oral traditions and expressions 8. Oral traditions as a vehicle for transmission of culture and language The main focus of the conference will be on the dynamic relationship between language endangerment and the role of oral traditions and expressions in safeguarding them. While it has a universal scope, it specifically aims to highlight interesting and creative oral traditions and expressions of the indigenous communities of Pakistan and encourage scholarship and accounts of community initiatives for preserving and promoting them. Studies highlighting the oral traditions of indigenous communities from anywhere are welcome. Abstracts of 600-800 words in PDF format are invited for submission on EasyChair at this address: https://easychair.org/conferences/?conf=felxxviii2024 by the deadline of 31 May 2024 at 23:59 GMT. Abstracts received after this deadline will not be reviewed. If you have already submitted your abstract and want to use the extra time to make it even better, you can just go to EasyChair and update the earlier version! 
Important Dates ▪ 31 May 2024: Deadline for submission of abstract - NEW DEADLINE ▪ 21 June 2024: Selected applicants informed ▪ 31 July 2024: Deadline for extended version of accepted abstract ▪ 25-27 September 2024: Conference dates ▪ 28 September: Excursion to a local community Conference website: https://fli-online.org/site/conference-of-the-foundation-for-endangered-lan… For more information please contact: felconf2024.islamabad(a)gmail.com -- _______________________________________________________________________ Steven Krauwer, CLARIN/FEL/ELSNET/UiLOTS, Utrecht, NL, s.krauwer(a)uu.nl

Call for Papers: IWCLUL 2024 in Helsinki
by Mika Hämäläinen 28 May '24

The 9th International Workshop on Computational Linguistics for Uralic Languages (IWCLUL 2024) will be organized by ACL SIGUR. The proceedings of the event will be published in the ACL Anthology. The workshop will take place on November 28-29, 2024 in Helsinki, Finland, at Metropolia University of Applied Sciences. https://acl-sigur.github.io/iwclul2024.html Submission deadline: October 20, 2024 Registration/publication fees: 0€! We solicit original and unpublished work related to NLP methods for Uralic languages, including multilingual methods that include at least one Uralic language (e.g. Finnish, Estonian, Hungarian). Appropriate topics include (but are not limited to): - Multilingual approaches in NLP presenting work on at least one Uralic language - Parsers, analysers and processing pipelines of Uralic languages - Lexical databases, electronic dictionaries - Finished end-user applications aimed at Uralic languages, such as spelling or grammar checkers, machine translation or speech processing - Evaluation methods and gold standards, tagged corpora, treebanks - Reports on language-independent or unsupervised methods as applied to Uralic languages - Surveys and review articles on subjects related to computational linguistics for one or more Uralic languages - Any work that aims at combining efforts and reducing duplication of work - How to elicit activity from the language community, agitation campaigns, games with a purpose Short papers can be up to 4 pages in length (5 for camera-ready version). Short papers can report on work in progress or a more targeted contribution such as software or partial results. Long papers can be up to 8 pages in length (9 for camera-ready version). Long papers should report on previously unpublished, completed, original work. Lightning talks are submitted as 750-word abstracts. Lightning talks are suited for discussing ideas or presenting work in progress. The abstracts will be published in lightning proceedings on Zenodo. 
All submission formats can have an unlimited number of pages for references. All submissions must follow the ACL stylesheet. The submissions must be anonymous, and they will be peer-reviewed by our program committee. The peer review is double-blind. Papers must be submitted using the conference submission system by the deadline. At least one of the authors of an accepted paper must attend the event and present their paper. Accepted papers (short and long) will be published in the joint proceedings that will appear in the ACL Anthology. Accepted papers will also be given an additional page to address the reviewers’ comments. The length of a camera-ready submission can then be 5 pages for a short paper and 9 for a long paper, with an unlimited number of pages for references. Important dates: - Paper submission (full and short): October 20, 2024 - Notification of acceptance: November 3, 2024 - Camera ready deadline: November 10, 2024 - Registration deadline: November 10, 2024 - Workshop: November 28-29, 2024

Programming for Corpus Linguistics with Python and Dataframes
by Susan Hunston 28 May '24

Dear list members, I'm delighted to announce a new publication in the Cambridge Elements in Corpus Linguistics series. This publication is FREE to download until 10 June 2024 (see the link at the bottom of this email). Title: Programming for Corpus Linguistics with Python and Dataframes Author: Daniel Keller, Western Kentucky University Summary: This Element offers intermediate or experienced programmers algorithms for Corpus Linguistic (CL) programming in the Python language using dataframes that provide a fast, efficient, intuitive set of methods for working with large, complex datasets such as corpora. This Element demonstrates principles of dataframe programming applied to CL analyses, as well as complete algorithms for creating concordances; producing lists of collocates, keywords, and lexical bundles; and performing key feature analysis. An additional algorithm for creating dataframe corpora is presented including methods for tokenizing, part-of-speech tagging, and lemmatizing using spaCy. This Element provides a set of core skills that can be applied to a range of CL research questions, as well as to original analyses not possible with existing corpus software. The Element can be accessed using this link: https://doi.org/10.1017/9781108904094. Susan Hunston (she/her) Professor of English Language +44 121 414 5675 University of Birmingham Department of English Language and Linguistics www.birmingham.ac.uk

Post-Doc or PhD Position (3 Years) at the Chair of Multilingual Computational Linguistics in Passau
by Mattis List 28 May '24

Dear all, Our Chair of Multilingual Computational Linguistics in Passau offers one post-doc or doctoral position as part of the ERC project "Productive Signs: A computer-assisted investigation of evolutionary, typological, and cognitive aspects of word families". The position is for 3 years. An extension for doctoral candidates may be possible, pending sufficient funding. You can find more information on the position at the following link (for the time being only in German, but the English version will follow soon): https://www.uni-passau.de/fileadmin/dokumente/beschaeftigte/Stellenangebote… Sincerely, Mattis List -- Prof. Dr. Johann-Mattis List Chair of Multilingual Computational Linguistics University of Passau Dr.-Hans-Kapfinger-Str. 16 94032 Passau Germany Chair Website: https://phil.uni-passau.de/multilinguale-computerlinguistik/ Personal Website: https://lingulist.de Telephone: +49(0)851/509-3480

PostDoc/Senior Researcher Position in NLP & Computational Social Science @GESIS
by Stefan Dietze 28 May '24

Dear all, We are offering an exciting PostDoc/Senior Researcher position at the intersection of NLP and computational social science within the department Knowledge Technologies for the Social Sciences (https://gesis.org/en/kts) at GESIS in Cologne, Germany. The position is limited to 4 years, with the option of tenure/permanency. Further information can be found at https://www.hidden-professionals.de/HPv3.Jobs/gesis/stellenangebot/37621/Se…. For any questions, please don't hesitate to get in touch with me. Best regards, Stefan -- Prof. Dr. Stefan Dietze Scientific Director Knowledge Technologies for the Social Sciences GESIS - Leibniz Institute for the Social Sciences Web: https://www.gesis.org/en/kts Chair of Data & Knowledge Engineering Heinrich-Heine-University Düsseldorf Web: https://www.cs.hhu.de/en/research-groups/data-knowledge-engineering Phone: +49 (0)221-47694-421 Web: http://stefandietze.net

Last Call for Participation in the IJCAI 2024 FinLLM Challenge
by Jimin Huang 28 May '24

Dear Colleagues, This is the last call for participation in the FinLLM Challenge of the FinNLP-agentscen workshop at IJCAI 2024! The registration will close at AOE 00:00 AM on May 28, 2024. Don't miss this opportunity to test your skills and contribute to the exciting field of AI in Finance. For the latest updates and to access the train set, please visit our website: FinLLM Challenge: https://sites.google.com/nlg.csie.ntu.edu.tw/finnlp-agentscen/shared-task-f… We also welcome submissions of existing models or checkpoints of LLMs. If you have any questions or encounter any issues, feel free to reach out to our support team via email at ijcaifinllmcontest(a)thefin.ai. Stay tuned for further announcements and good luck! Best regards, FinLLM Organizers Contact email: ijcaifinllmcontest(a)thefin.ai Discord Link: https://discord.gg/pCrWUQct

Postdoc position in NLP/CL at IT University of Copenhagen [Deadline May 31]
by Anna Rogers 27 May '24

The Department of Computer Science at the IT University of Copenhagen is offering a Postdoc position in Natural Language Processing/Computational Linguistics, with a start date of *1 September 2024* or as soon as possible. The *application deadline is 31 May 2024.* Applications for the position can be submitted via ITU job portal <https://candidate.hr-manager.net/ApplicationInit.aspx?cid=119&ProjectId=181…>. *Proposed project title:* Efficiency and Robustness in Language Model Pre-training *Proposed project description.* Recent generative systems based on pre-trained language models are remarkably fluent, but this is achieved by extreme volumes of computation and training data. This means not only high energy costs, but also training on data that is problematic in various ways: copyright, harmful social stereotypes, non-representative sampling, misinformation, junk SEO texts, pornography, and contamination with NLP datasets used for evaluation. This project will create an ambitious resource for research on transfer learning, in which pre-training data is held constant, and evaluation takes into account how much similar data was observed in training, and in what ways it was similar. This resource will encourage the development of more efficient and robust approaches, since it will not be possible to improve benchmark scores by simply training on more data. The ideal candidate will have a strong background in Computational Linguistics/Natural Language Processing and experience developing NLP resources, as well as core skills in programming in Python and machine learning. The position is funded for 1 year, and it is our intention to find additional funding to extend this postdoc to a 2- or 3-year position. Besides research, the postdoc will gain experience with organization of an international workshop and shared task and build up their international network. 
For those interested in pursuing an academic career, the following is also possible (but entirely optional): - gain experience in applying for external funding with professional support (either for the continuation of the postdoc’s own position, e.g. Marie Curie postdoctoral fellowship, or by contributing to the PI’s grant proposals); - supervise Master's students solo, and/or assist in supervising a PhD student; - undertake a formal teacher training program, including teaching guest lectures in the relevant data science courses at the ITU computer science department. The successful candidate will be a member of the national Pioneer Centre for Artificial Intelligence <https://aicentre.dk/>, a 5-university Danish research endeavor, and of the NLPnorth <https://nlpnorth.github.io/> research group at the IT University’s Computer Science Department. Both the centre and research group are highly international and well-funded, working on a broad range of research topics. The project will be supervised by Associate Professor Anna Rogers <https://annargrs.github.io/> (arog(a)itu.dk), to whom inquiries about the project can be directed. -- Best regards, Anna Rogers Associate Professor Computer Science Department, IT University of Copenhagen http://annargrs.github.io/

[jobs] 3 PhD positions in NLP at Sorbonne Université (Paris)
by François Yvon 27 May '24

[apologies for cross-posting] Dear all, We are offering three fully funded PhD positions to study the political biases of large language models in various application settings (simplification and paraphrasing, summarization and question answering, translation): how to define them? How to measure them? How to mitigate them? The starting date for these positions is October 2024. Depending on their interests and preferences, the successful candidates will join either: - the MLIA team of ISIR: https://www.isir.upmc.fr/ - the Computational Linguistic group of STIH: http://stih-sorbonne-universite.fr Both labs are affiliated with Sorbonne Université and located in the center of Paris. Applications should be submitted only via this link, where more information about these positions is available: https://emploi.cnrs.fr/Offres/Doctorant/UMR7222-FRAYVO-002/Default.aspx?lan… If you are interested, please submit your application *before June 15th*, including the following elements: • Detailed CV, • Letter of motivation, • Details of transcripts (especially M1 and M2), • Elements of bibliography or personal achievements related to a research activity (e.g. master project, research internship subject, etc.), • 2 recommendation letters Interviews will be conducted online between June 10th and June 24th, 2024. G. Lejeune (STIH) - B. Piwowarski & F. Yvon (ISIR/MLIA) --- François Yvon ISIR, CNRS & Sorbonne Université +33 (0)1 44 27 62 11 4 Place Jussieu F-75005 Paris

[CFP] Final CFP: ACLing 2024 || Sept 2024 ||Hybrid Conference|| Dubai, UAE
by Khaled Shaalan 27 May '24

Apologies for cross-postings! ** Please forward to anyone who might be interested ** ************************************************************************************ CALL FOR PAPERS Sixth International Conference on AI in Computational Linguistics (ACLing2024) September 21-22, 2024 (Hybrid Conference) Brochure: http://acling.org/wp-content/uploads/2024/05/ACLing24_CFP.pdf Publication: Procedia Computer Science by ELSEVIER (https://www.sciencedirect.com/journal/procedia-computer-science) Website: https://acling.org/ ************************************************************************************ IMPORTANT DATES * Paper submission deadline: 31 May 2024 (Extended and Final) * Notification of Acceptance: 21 June 2024 * Registration: 25 June 2024 * Camera ready version submission: 14 July 2024 * Conference Date: 20 – 21 September 2024 ************************************************************************************ INTRODUCTION Are you interested in artificial intelligence and computational linguistics? The ACLing Conference is a great opportunity to share your latest research with a global audience of experts. 
We're inviting submissions on a wide range of topics, including machine learning, natural language processing, and applications of these technologies. SOME KEY TOPICS OF INTEREST: * Large Language Models and their applications * Information Retrieval and Question Answering * Information Extraction * Linguistic Theories and Resources * Language Modeling * Speech and Multimodality * Machine Learning, Text Categorization, and Text Mining * Machine Translation * Multilinguality and Cross-linguality * NLP Applications * Segmentation, Tagging, and Parsing * Semantics * Sentiment Analysis and Opinion Mining * Web, Social Media and Computational Social Science * Natural Language Generation * Text Categorization and Topic Modeling * Text Mining * Language and Vision * AI applications in Computational Linguistics SUBMISSION INSTRUCTIONS For detailed instructions on how to submit your paper, please visit the conference website: https://acling.org/submission/. All submissions should be made via EasyChair: https://easychair.org/my/conference?conf=acling2024 BENEFITS OF PUBLISHING AT ACLING: • Your work will be seen by a global audience of leading researchers in computational linguistics. • All accepted papers will be published in Procedia Computer Science by ELSEVIER. INDEXING, RANKING, AND IMPACT (web sources): * Abstracting and indexing: https://www.sciencedirect.com/journal/procedia-computer-science/about/insig… * Scopus: https://www.scopus.com/sourceid/19700182801?origin=sbrowse * CiteScore: https://www.scopus.com/sourceid/19700182801?origin=sbrowse * SJR (scimago): https://www.scimagojr.com/journalsearch.php?q=19700182801&tip=sid * ACLing by Google Citation: https://scholar.google.com/citations?user=jkpMuFMAAAAJ&hl=en * ACLing by DBLP: https://dblp.org/db/conf/acling/index.html FURTHER INFORMATION & CONTACT DETAILS * Visit the conference website https://acling.org/ (will be updated on a regular basis). 
* For further information, please contact us at ACLing2024(a)gmail.com Regards, CONFERENCE ORGANIZERS * Prof. Dr. Khaled Shaalan, The British University of Dubai, UAE * Prof. Dr. Samhaa R. El-Beltagy, Newgiza University, Egypt
