Dear colleague,
The 34th Meeting of Computational Linguistics in The Netherlands (CLIN34) will take place soon, on Friday 30 August 2024! We cordially invite you to participate. Online registration<https://clin34.leidenuniv.nl/2024/07/05/registration-open/> ends on Wednesday (21st of August).
Besides a large and diverse programme of posters and oral presentations, we are happy to report that CLIN34 will have two keynote talks by:
* Diana Maynard, Sheffield University
* Dominique Blok and Erik de Graaf, TNO
The programme can also be found at: clin34.leidenuniv.nl/program/<https://clin34.leidenuniv.nl/program/>
We hope to see you in Leiden about two weeks from now!
The CLIN34 organizers
Leiden University
Dear all,
Below is a call for submissions to our annual contest for student
writers. Contributions from undergraduate students of computational
linguistics are most welcome.
Sincerely, Tristan Miller
Babel Advisory Panel
----------------------
This year, Babel: The Language Magazine <https://babelzine.co.uk/> will
be running the tenth edition of our Young Writers' Competition, which
encourages young linguists who are starting out on their study of language.
The competition is open to anyone studying a linguistics-related subject
at the 16–18-year-old or undergraduate level. The winner(s) will have
their article published in Babel's 50th issue (Spring 2025) and receive
a year's subscription to the magazine.
Keep an eye on @Babelzine on X or @babel_zine on Instagram for
inspiration from previous winners on topics ranging from sign language
to spoonerisms, and from language birth to language death.
Competition rules are as follows:
Topic: An original discussion of any linguistic topic, written in an
accessible and interesting style
Length: 2000 to 2500 words
Deadline: Monday, 16 December 2024
Format: Word file
Submission: By e-mail to babelthelanguagemagazine(a)gmail.com with the
subject "Young Writer's Competition"
Please e-mail babelthelanguagemagazine(a)gmail.com if you have any
questions about the competition.
--
Dr. Tristan Miller, Assistant Professor
Department of Computer Science, University of Manitoba
https://clam.logological.org/ | Tel. +1 204 474 6792
University College London (UCL) Department of Computer Science invites applications for a Lecturer/Associate Professor position in Natural Language Processing. Interested applicants can submit their applications until September 5th using this link<https://www.ucl.ac.uk/work-at-ucl/search-ucl-jobs/details?jobId=25979&jobTi…>.
About UCL
UCL’s Department of Computer Science (CS) is a top-ranked Computer Science Department in the UK. In the 2021 Research Excellence Framework (REF) evaluation, UCL Computer Science was ranked second in the UK for research power and first in England. London is a global hub for AI, where UCL plays a central role through close collaborations and joint PhD programmes with for example Meta and Google DeepMind.
About the role
University College London, Department of Computer Science is seeking a Lecturer (equivalent of Assistant Professor in the UK)/Associate Professor to join the Natural Language Processing Group. Successful candidates are expected to contribute to the teaching and research activities at the department. Expected duties and responsibilities include conducting research in the broader field of natural language processing, securing funding and engagement in the management of research projects, and dissemination of research through publications at top conferences/journals, talks and external engagements.
About you
Candidates should have a PhD (or equivalent qualification) or have held a previous postdoctoral position in natural language processing, information retrieval, machine learning, or a strongly related field. Candidates are expected to have a strong publication record in top conferences such as ACL, ICLR, NeurIPS, EMNLP, SIGIR. Experience in applying for research funding is not necessary, but highly desired. We also welcome applications from candidates with research experience from industry.
Please contact Emine Yilmaz (emine.yilmaz(a)ucl.ac.uk<mailto:emine.yilmaz@ucl.ac.uk>) if you need any further information.
We are very happy to release
𝐐𝐚𝐛𝐚𝐬 - 𝐚𝐧 𝐎𝐩𝐞𝐧-𝐒𝐨𝐮𝐫𝐜𝐞 𝐋𝐞𝐱𝐢𝐜𝐨𝐠𝐫𝐚𝐩𝐡𝐢𝐜 𝐃𝐚𝐭𝐚𝐛𝐚𝐬𝐞
Birzeit University’s SinaLab for Computational Linguistics and Artificial Intelligence <https://sina.birzeit.edu/> has officially launched Qabas <https://sina.birzeit.edu/qabas>, an open-source lexicographic database for Arabic, designed specifically for Natural Language Processing (NLP) applications.
Qabas stands out by linking its lexical entries (lemmas) with lemmas from 110 different lexicons and numerous morphologically annotated corpora (around 2 million tokens), creating an extensive lexicographic graph. This project has been under development for over fourteen years.
Lexicons have evolved from being primarily hard-copy resources for human use to having substantial significance in NLP applications. Although Arabic is a highly resourced language in terms of traditional lexicons, not enough attention is given to developing AI-oriented lexicographic databases. Additionally, none of the Arabic lexicons are available open-source, due to copyright restrictions imposed by their owners. As for Qabas, it is an open-source Arabic lexicon designed for NLP applications, and its novelty lies in its synthesis of many lexical resources. Each lexical entry (i.e., lemma) in Qabas is linked with equivalent lemmas in 110 other lexicons, and with 12 morphologically-annotated corpora (about 2M tokens); The philosophy of Qabas is to construct a large lexicographic data graph by linking existing Arabic lexicons and annotated corpora. Qabas stands as the largest Arabic lexicon, encompassing about 58K lemmas (45K nominal lemmas, 12.5K verbal lemmas, and 500 function word lemmas).
Prof. Mustafa Jarrar, the project’s manager and main author, emphasized the importance of making Qabas freely available as an open-source resource, allowing everyone to access and use it for both commercial and non-commercial purposes. Prof. Jarrar hopes that researchers, companies, and software developers will leverage the lexicon’s data to develop innovative content and applications that benefit humanity.
Prof. Talal Shahwan, President of Birzeit University, stated that despite the challenging conditions in Palestine, the university remains committed to excellence and to its mission towards knowledge. He emphasized that this achievement was made possible by the dedication of the university’s faculty and researchers.
Qabas is publicly available online at: https://sina.birzeit.edu/qabas
To download Qabas and find out more, see: https://sina.birzeit.edu/qabas/about
Article: https://www.jarrar.info/publications/JH24.pdf
We’d love your feedback:
Facebook: https://www.facebook.com/watch?v=880418097306662
LinkedIn: https://www.facebook.com/watch?v=880418097306662
Best
--Mustafa
__________________________
Mustafa Jarrar, PhD
Professor of Artificial Intelligence
Chair, PhD Program in Computer Science
Birzeit University, Palestine
Page: http://www.jarrar.info
SinaLab: https://sina.birzeit.edu
CfP: Diversity and Change in Easy German (Workshop at DGfS 2025)
Date: March 5-7, 2025
Location: University of Mainz, Germany
Meeting Email: workshop-easy-german-dgfs2025(a)uni-saarland.de<mailto:workshop-easy-german-dgfs2025@uni-saarland.de>
Website: https://sfb1102.uni-saarland.de/vielfalt-und-wandel-in-leichter-sprache/
Linguistic Field(s): Applied Linguistics, Computational Linguistics, Psycholinguistics
Language Family: Germanic
Call Deadline: August 18, 2024
Shortened Workshop Description:
Easy German, which has been systematically developed since the 2000s to aid individuals with learning difficulties among others, focuses on enhancing text comprehensibility by avoiding linguistic complexity. Despite its intended uniformity, there is a lack of consensus on its precise conceptualization, with various frameworks and guidelines proposing different approaches.
This workshop aims to:
1. Provide a platform for researchers to discuss the production and evolution of Easy German texts.
2. Highlight dynamic changes and variability in Easy German texts compared to Standard German.
3. Examine the cognitive processing of Easy German through psycholinguistic studies involving the target demographic.
4. Critically assess AI-driven systems for Easy German text production, exploring their implications, opportunities, and challenges.
For further information, please visit the workshop website: https://sfb1102.uni-saarland.de/vielfalt-und-wandel-in-leichter-sprache/
Organizers:
Ingo Reich (Saarland University, Germany)
Heike Zinsmeister (University of Hamburg, Germany)
Sarah Jablotschkin (University of Hamburg, Germany)
Lena Wieland (Saarland University, Germany)
Invited Speakers:
Bettina Bock (University of Cologne)
Ted Sanders (Utrecht University)
Call for Papers:
We invite contributions on all aspects of Easy German and easy-to-read variants in other Germanic languages. The workshop will include a small poster session, and submissions for both talks and posters are welcome. Contributions in English are preferred, but submissions in German are also accepted.
* Submission Details:
* Abstract submission deadline: August 18, 2024
* Abstracts should be submitted to workshop-easy-german-dgfs2025(a)uni-saarland.de<mailto:workshop-easy-german-dgfs2025@uni-saarland.de>
* Abstracts should not exceed one page (DIN A4, 2.5 cm margins, 12pt font)
* Examples, graphics, or references may be included on a second page
Important Workshop Information: The workshop is part of the 47th annual meeting of the German Linguistic Society (DGfS 2025) at Johannes Gutenberg University Mainz. Participants must register for the DGfS conference and pay the conference fee. For more information, visit http://dgfs.uni-mainz.de<http://dgfs.uni-mainz.de/>.
Important Dates:
Deadline for abstract submission: August 18, 2024
Notification of acceptance: September 2, 2024
Workshop dates: March 5-7, 2025
--
Lena Wieland
SFB 1102, Project T1 – Information Density and Linguistic Encoding in “Leichte Sprache”
Universität des Saarlandes
Campus A2.2 Raum 3.12
D-66123 Saarbrücken
T: +49 681 302 57543
www.uni-saarland.de/fakultaet-p/nds/team/wieland<https://www.uni-saarland.de/fakultaet-p/nds/team/wieland.html>
===Workshop Description===
The RegNLP 2025 Workshop will take place on January 20th, 2025, in conjunction with the COLING 2025 conference in Abu Dhabi, UAE.
Regulatory documents are foundational to governance, compliance, and legal frameworks across various sectors. However, the sheer complexity, volume, and constantly evolving nature of these documents present significant challenges. To address these, the field of Natural Language Processing (NLP) is increasingly being harnessed to develop tools and methodologies that enable the effective management, analysis, and utilization of regulatory content.
This workshop seeks to bring together researchers and practitioners from NLP, legal informatics, compliance, and related fields to discuss the latest advancements and challenges in regulatory NLP. The focus will be on innovative methods for document parsing, entity recognition, automated compliance checking, and other applications critical to navigating the intricate landscape of regulatory requirements. We will explore how NLP can be adapted to the specialized language and context of regulatory texts and how it can be employed to enhance the accuracy, efficiency, and reliability of regulatory processes.
By fostering collaboration and knowledge exchange, RegNLP 2025 aims to build a community dedicated to advancing the application of NLP in the regulatory domain and to identify promising directions for future research.
===Important Dates===
Paper Submission Deadline: November 5, 2024
Notification of Acceptance: December 3, 2024
Camera-Ready Papers Due: December 10, 2024
Workshop Date: January 20, 2025
===Submission Topics===
We invite submissions of original and high-quality research papers on topics related to the application of NLP in regulatory contexts, including but not limited to:
-Applications of NLP to Regulatory Tasks:
--Compliance monitoring and management
--Risk assessment and regulatory reporting
--Interpretation and classification of regulatory changes
--Summarization of regulatory documents for decision-making
--Creation of domain-specific lexical resources
-Adapting NLP Methods for Regulatory Data:
--Information retrieval and anomaly detection
--Clustering and multimodality analysis
--Entity recognition, linking, and disambiguation
--Syntax: Tagging, chunking, and parsing
--Dialogue and discourse analysis
--Text summarization and relation extraction
--Question answering using regulatory data
-Tasks and Resources:
--New regulatory tasks and datasets for NLP
--Evaluation frameworks for regulatory NLP tasks
-Demos:
--Systems and software solutions utilizing NLP for regulatory text processing
-Industrial Research:
--Case studies of industrial applications in regulatory compliance
--Research involving proprietary regulatory data
-Interdisciplinary Position Papers:
--The role of NLP in the regulatory landscape
--Reflections on the use of Large Language Models (LLMs) in regulatory contexts
--Legal and ethical considerations in regulatory data processing
===More Details===
For more information about the workshop, please visit our website: https://regnlp.github.io/
===Organization===
Workshop Chairs:
Tuba Gokhan - MBZUAI
Kexin Wang - UKP Lab, Technical University of Darmstadt
Iryna Gurevych - UKP Lab, Technical University of Darmstadt & MBZUAI
Ted Briscoe - MBZUAI
===Contact Information===
For inquiries, please contact us via email at: regnlp2025(a)gmail.com
Apologies for the multiple postings.
-----------------------------
*Indian Language Summarization (ILSUM 2024)*
Website: https://ilsum.github.io/
To be organized in conjunction with FIRE 2024 (fire.irsi.org.in)
12th-15th December 2024, Gandhinagar, India
------------------------------
The third shared task on Indian Language Summarization (ILSUM) aims at
extending evaluation benchmark dataset for Indian Language
Summarization. Three Dravidian languages Kannada, Telugu and Tamil are
introduced this year. We also extend the misinformation detection
subtask to a cross-lingual setup.
*Subtask 1*: This task builds upon the task from the first two
editions. In the previous editions we covered three major Indian
languages Hindi, Gujarati and Bengali alongside Indian English, a
widely recognized dialect of the English Language. This year's edition
adds the three Dravidian languages Kannada, Tamil and Telugu and an
expanded dataset for the languages from last year.
Like the previous edition, this will be a classic summarization task,
where we will provide article-summary pairs for each language and the
participants are expected to generate a fixed-length summary.
*Subtask 2*: The task is centred around identifying factual errors in
machine-generated summaries. While LLMs are very good at
summarization, among other NLP tasks, they are often prone to
hallucinations. This means the model generates information that is not
accurate, not based on its training data, or is completely made up but
looks accurate and reliable. Further, such tools can be misused to
generate misleading or outright incorrect information. Identifying
such inaccuracies can be a challenging task.
This year's subtask builds upon a similar task from the previous
edition in a cross-lingual setup. Participants will be provided with
an article in English and its corresponding
machine-generated summary in Hindi and Gujarati. The objective is to
identify the presence of factual incorrectness in the summaries if
any, and classify them in
one of the predefined categories.
*Tentative Timeline*
-------------
15th August - Training Data Released and Registrations open
30th August - Test Data Release
30th September - Run Submission Deadline
10th October - Results Declared
20th October - Working notes due
20th November - Camera Ready Submissions due
12th-15th December - FIRE 2024 at Gandhinagar, India
*Organisers*
----------------
Shrey Satapara, Indian Institute of Technology, Hyderabad, India
Sandip Modha, LDRP-ITR, Gandhinagar, India
Shashirekha HL, Mangalore University, India
Asha Hegde, Mangalore University, India
Parth Mehta, Parmonic, USA
Debasis Ganguly, University of Glasgow, Scotland
*For regular updates subscribe to our mailing list: **ilsum(a)googlegroups.com**
> [Apologies for cross-posting]
>
> ***** Deadline Extension for Paper Submission until August 24th, 2024 ******
>
> ==========================================================================
> EXTENDED CALL FOR PAPERS - SIMBig 2024
> ==========================================================================
>
> 11th International Conference on Information Management and Big Data - SIMBig 2024
> Where: Universidad Nacional de Moquegua, Ilo, PERU
> When: November 20 - 22, 2024
> Website: https://simbig.org/SIMBig2024/
> Free of Cost: no fees associated with Publication
> ==========================================================================
>
> OVERVIEW
> ----------------------------------
>
> SIMBig 2024 seeks to present new methods of Artificial Intelligence (AI), Data Science, Machine Learning, Natural Language Processing, Semantic Web, and related fields, for analyzing, managing, and extracting insights and patterns from large volumes of data.
>
>
> KEYNOTE SPEAKERS
> ----------------------------------
>
 Aaron Courville, Université de Montréal, Canada
 Mona Diab, Carnegie Mellon University, USA
 Anna Korhonen, University of Cambridge, UK
 Huan Liu, Arizona State University, USA
>
> IMPORTANT DATES
> ----------------------------------
>
> August 10, 2024 August 24, 2024 --> Full papers and short papers due
> September 30, 2024 --> Notification of acceptance
> October 28, 2024 --> Camera-ready versions
> November 20-22, 2024 --> Conference held in Moquegua, Peru
>
> PUBLICATION
> ----------------------------------
>
> All accepted papers of SIMBig 2024 (tracks including) will be published with Springer CCIS Series <https://www.springer.com/series/7899>.
>
>

> CONFERENCE FEES
> ----------------------------------
>
> To disseminate new advances in data science, SIMBig 2034 offers a conference registration fee of 30 USD, which includes access to the conference, materials, and publication of the proceedings for the authors. Submit your articles HERE <https://cmt3.research.microsoft.com/User/Login?ReturnUrl=%2FSIMBIG2024>.
>
>
> TOPICS OF INTEREST
> ----------------------------------
>
> SIMBig 2024 has a broad scope. We invite contributions on theory and practice, including but not limited to the following technical areas:
>
> Artificial Intelligence
> Big/Masive Data
> Data Science
> Machine Learning
> Deep Learning
> Natural Language Processing
> Semantic Web
> Data-driven Software Engineering
> Healthcare Informatics
> Biomedical Informatics
> Data Privacy and Security
> Information Retrieval
> Ontologies and Knowledge Representation
> Social Networks and Social Web
> Information Visualization
>
> SPECIAL TRACKS
> ----------------------------------
>
> SIMBig 2024 proposes a special track in addition to the main conference:
>
> DISE <https://simbig.org/SIMBig2024/call-for-paper/track-on-data-driven-software-…> - Data-Driven Software Engineering
>
> CONTACT
> ----------------------------------
>
> SIMBig 2024 General Chairs
>
> Juan Antonio Lossio-Ventura, National Institutes of Health, USA (juan.lossio(a)nih.gov <mailto:juan.lossio@nih.gov>)
> Hugo Alatrista-Salas, Léonard de Vinci Pôle Universitaire Research Center, Paris, France (hugo.alatrista_salas(a)devinci.fr <mailto:halatrista@pucp.pe>)
> [Apologies for cross-posting]
>
> ***** Deadline Extension for Paper Submission until August 24th, 2024 ******
>
> ==========================================================================
> EXTENDED CALL FOR PAPERS - SIMBig 2024
> ==========================================================================
>
> 11th International Conference on Information Management and Big Data - SIMBig 2024
> Where: Universidad Nacional de Moquegua, Ilo, PERU
> When: November 20 - 22, 2024
> Website: https://simbig.org/SIMBig2024/
> Free of Cost: no fees associated with Publication
> ==========================================================================
>
> OVERVIEW
> ----------------------------------
>
> SIMBig 2024 seeks to present new methods of Artificial Intelligence (AI), Data Science, Machine Learning, Natural Language Processing, Semantic Web, and related fields, for analyzing, managing, and extracting insights and patterns from large volumes of data.
>
>
> KEYNOTE SPEAKERS
> ----------------------------------
>
 Aaron Courville, Université de Montréal, Canada
 Mona Diab, Carnegie Mellon University, USA
 Anna Korhonen, University of Cambridge, UK
 Huan Liu, Arizona State University, USA
>
> IMPORTANT DATES
> ----------------------------------
>
> August 10, 2024 August 24, 2024 --> Full papers and short papers due
> September 30, 2024 --> Notification of acceptance
> October 28, 2024 --> Camera-ready versions
> November 20-22, 2024 --> Conference held in Moquegua, Peru
>
> PUBLICATION
> ----------------------------------
>
> All accepted papers of SIMBig 2024 (tracks including) will be published with Springer CCIS Series <https://www.springer.com/series/7899>.
>
>

> CONFERENCE FEES
> ----------------------------------
>
> To disseminate new advances in data science, SIMBig 2034 offers a conference registration fee of 30 USD, which includes access to the conference, materials, and publication of the proceedings for the authors. Submit your articles HERE <https://cmt3.research.microsoft.com/User/Login?ReturnUrl=%2FSIMBIG2024>.
>
>
> TOPICS OF INTEREST
> ----------------------------------
>
> SIMBig 2024 has a broad scope. We invite contributions on theory and practice, including but not limited to the following technical areas:
>
> Artificial Intelligence
> Big/Masive Data
> Data Science
> Machine Learning
> Deep Learning
> Natural Language Processing
> Semantic Web
> Data-driven Software Engineering
> Healthcare Informatics
> Biomedical Informatics
> Data Privacy and Security
> Information Retrieval
> Ontologies and Knowledge Representation
> Social Networks and Social Web
> Information Visualization
>
> SPECIAL TRACKS
> ----------------------------------
>
> SIMBig 2024 proposes a special track in addition to the main conference:
>
> DISE <https://simbig.org/SIMBig2024/call-for-paper/track-on-data-driven-software-…> - Data-Driven Software Engineering
>
> CONTACT
> ----------------------------------
>
> SIMBig 2024 General Chairs
>
> Juan Antonio Lossio-Ventura, National Institutes of Health, USA (juan.lossio(a)nih.gov <mailto:juan.lossio@nih.gov>)
> Hugo Alatrista-Salas, Léonard de Vinci Pôle Universitaire Research Center, Paris, France (hugo.alatrista_salas(a)devinci.fr <mailto:halatrista@pucp.pe>)