Apologies for the multiple postings.
-----------------------------
*Indian Language Summarization (ILSUM 2024)*
Website: https://ilsum.github.io/
To be organized in conjunction with FIRE 2024 (fire.irsi.org.in)
12th-15th December 2024, Gandhinagar, India
------------------------------
The third shared task on Indian Language Summarization (ILSUM) aims at
extending evaluation benchmark dataset for Indian Language
Summarization. Three Dravidian languages Kannada, Telugu and Tamil are
introduced this year. We also extend the misinformation detection
subtask to a cross-lingual setup.
*Subtask 1*: This task builds upon the task from the first two
editions. In the previous editions we covered three major Indian
languages Hindi, Gujarati and Bengali alongside Indian English, a
widely recognized dialect of the English Language. This year's edition
adds the three Dravidian languages Kannada, Tamil and Telugu and an
expanded dataset for the languages from last year.
Like the previous edition, this will be a classic summarization task,
where we will provide article-summary pairs for each language and the
participants are expected to generate a fixed-length summary.
*Subtask 2*: The task is centred around identifying factual errors in
machine-generated summaries. While LLMs are very good at
summarization, among other NLP tasks, they are often prone to
hallucinations. This means the model generates information that is not
accurate, not based on its training data, or is completely made up but
looks accurate and reliable. Further, such tools can be misused to
generate misleading or outright incorrect information. Identifying
such inaccuracies can be a challenging task.
This year's subtask builds upon a similar task from the previous
edition in a cross-lingual setup. Participants will be provided with
an article in English and its corresponding
machine-generated summary in Hindi and Gujarati. The objective is to
identify the presence of factual incorrectness in the summaries if
any, and classify them in
one of the predefined categories.
*Tentative Timeline*
-------------
15th August - Training Data Released and Registrations open
30th August - Test Data Release
30th September - Run Submission Deadline
10th October - Results Declared
20th October - Working notes due
20th November - Camera Ready Submissions due
12th-15th December - FIRE 2024 at Gandhinagar, India
*Organisers*
----------------
Shrey Satapara, Indian Institute of Technology, Hyderabad, India
Sandip Modha, LDRP-ITR, Gandhinagar, India
Shashirekha HL, Mangalore University, India
Asha Hegde, Mangalore University, India
Parth Mehta, Parmonic, USA
Debasis Ganguly, University of Glasgow, Scotland
*For regular updates subscribe to our mailing list: **ilsum(a)googlegroups.com**
> [Apologies for cross-posting]
>
> ***** Deadline Extension for Paper Submission until August 24th, 2024 ******
>
> ==========================================================================
> EXTENDED CALL FOR PAPERS - SIMBig 2024
> ==========================================================================
>
> 11th International Conference on Information Management and Big Data - SIMBig 2024
> Where: Universidad Nacional de Moquegua, Ilo, PERU
> When: November 20 - 22, 2024
> Website: https://simbig.org/SIMBig2024/
> Free of Cost: no fees associated with Publication
> ==========================================================================
>
> OVERVIEW
> ----------------------------------
>
> SIMBig 2024 seeks to present new methods of Artificial Intelligence (AI), Data Science, Machine Learning, Natural Language Processing, Semantic Web, and related fields, for analyzing, managing, and extracting insights and patterns from large volumes of data.
>
>
> KEYNOTE SPEAKERS
> ----------------------------------
>
 Aaron Courville, Université de Montréal, Canada
 Mona Diab, Carnegie Mellon University, USA
 Anna Korhonen, University of Cambridge, UK
 Huan Liu, Arizona State University, USA
>
> IMPORTANT DATES
> ----------------------------------
>
> August 10, 2024 August 24, 2024 --> Full papers and short papers due
> September 30, 2024 --> Notification of acceptance
> October 28, 2024 --> Camera-ready versions
> November 20-22, 2024 --> Conference held in Moquegua, Peru
>
> PUBLICATION
> ----------------------------------
>
> All accepted papers of SIMBig 2024 (tracks including) will be published with Springer CCIS Series <https://www.springer.com/series/7899>.
>
>

> CONFERENCE FEES
> ----------------------------------
>
> To disseminate new advances in data science, SIMBig 2034 offers a conference registration fee of 30 USD, which includes access to the conference, materials, and publication of the proceedings for the authors. Submit your articles HERE <https://cmt3.research.microsoft.com/User/Login?ReturnUrl=%2FSIMBIG2024>.
>
>
> TOPICS OF INTEREST
> ----------------------------------
>
> SIMBig 2024 has a broad scope. We invite contributions on theory and practice, including but not limited to the following technical areas:
>
> Artificial Intelligence
> Big/Masive Data
> Data Science
> Machine Learning
> Deep Learning
> Natural Language Processing
> Semantic Web
> Data-driven Software Engineering
> Healthcare Informatics
> Biomedical Informatics
> Data Privacy and Security
> Information Retrieval
> Ontologies and Knowledge Representation
> Social Networks and Social Web
> Information Visualization
>
> SPECIAL TRACKS
> ----------------------------------
>
> SIMBig 2024 proposes a special track in addition to the main conference:
>
> DISE <https://simbig.org/SIMBig2024/call-for-paper/track-on-data-driven-software-…> - Data-Driven Software Engineering
>
> CONTACT
> ----------------------------------
>
> SIMBig 2024 General Chairs
>
> Juan Antonio Lossio-Ventura, National Institutes of Health, USA (juan.lossio(a)nih.gov <mailto:juan.lossio@nih.gov>)
> Hugo Alatrista-Salas, Léonard de Vinci Pôle Universitaire Research Center, Paris, France (hugo.alatrista_salas(a)devinci.fr <mailto:halatrista@pucp.pe>)
> [Apologies for cross-posting]
>
> ***** Deadline Extension for Paper Submission until August 24th, 2024 ******
>
> ==========================================================================
> EXTENDED CALL FOR PAPERS - SIMBig 2024
> ==========================================================================
>
> 11th International Conference on Information Management and Big Data - SIMBig 2024
> Where: Universidad Nacional de Moquegua, Ilo, PERU
> When: November 20 - 22, 2024
> Website: https://simbig.org/SIMBig2024/
> Free of Cost: no fees associated with Publication
> ==========================================================================
>
> OVERVIEW
> ----------------------------------
>
> SIMBig 2024 seeks to present new methods of Artificial Intelligence (AI), Data Science, Machine Learning, Natural Language Processing, Semantic Web, and related fields, for analyzing, managing, and extracting insights and patterns from large volumes of data.
>
>
> KEYNOTE SPEAKERS
> ----------------------------------
>
 Aaron Courville, Université de Montréal, Canada
 Mona Diab, Carnegie Mellon University, USA
 Anna Korhonen, University of Cambridge, UK
 Huan Liu, Arizona State University, USA
>
> IMPORTANT DATES
> ----------------------------------
>
> August 10, 2024 August 24, 2024 --> Full papers and short papers due
> September 30, 2024 --> Notification of acceptance
> October 28, 2024 --> Camera-ready versions
> November 20-22, 2024 --> Conference held in Moquegua, Peru
>
> PUBLICATION
> ----------------------------------
>
> All accepted papers of SIMBig 2024 (tracks including) will be published with Springer CCIS Series <https://www.springer.com/series/7899>.
>
>

> CONFERENCE FEES
> ----------------------------------
>
> To disseminate new advances in data science, SIMBig 2034 offers a conference registration fee of 30 USD, which includes access to the conference, materials, and publication of the proceedings for the authors. Submit your articles HERE <https://cmt3.research.microsoft.com/User/Login?ReturnUrl=%2FSIMBIG2024>.
>
>
> TOPICS OF INTEREST
> ----------------------------------
>
> SIMBig 2024 has a broad scope. We invite contributions on theory and practice, including but not limited to the following technical areas:
>
> Artificial Intelligence
> Big/Masive Data
> Data Science
> Machine Learning
> Deep Learning
> Natural Language Processing
> Semantic Web
> Data-driven Software Engineering
> Healthcare Informatics
> Biomedical Informatics
> Data Privacy and Security
> Information Retrieval
> Ontologies and Knowledge Representation
> Social Networks and Social Web
> Information Visualization
>
> SPECIAL TRACKS
> ----------------------------------
>
> SIMBig 2024 proposes a special track in addition to the main conference:
>
> DISE <https://simbig.org/SIMBig2024/call-for-paper/track-on-data-driven-software-…> - Data-Driven Software Engineering
>
> CONTACT
> ----------------------------------
>
> SIMBig 2024 General Chairs
>
> Juan Antonio Lossio-Ventura, National Institutes of Health, USA (juan.lossio(a)nih.gov <mailto:juan.lossio@nih.gov>)
> Hugo Alatrista-Salas, Léonard de Vinci Pôle Universitaire Research Center, Paris, France (hugo.alatrista_salas(a)devinci.fr <mailto:halatrista@pucp.pe>)
The 1st Workshop on NLP for Languages Using Arabic Script
(AbjadNLP 2025)
Abu Dhabi, UAE
19-20 January 2024
Submission URL: https://softconf.com/coling2025/AbjadNLP25/
Co-located with COLING 2025 Conference, Abu Dhabi, UAE (19-20 January
2025)
AbjadNLP is dedicated to advancing innovation and gaining deeper
insights into Natural Language Processing (NLP) for languages that use
the Arabic script. Our primary focus is on Abjad and Ajami languages
that utilise the Arabic script or its variations. Traditionally
associated with Semitic languages, Abjad scripts represent consonants in
every syllable. In contrast, Ajami scripts denote the alphabetic use of
the Arabic script in various African contexts, representing non-Arabic
languages. We are interested in research on languages that fall under
the Abjad or Ajami categories that use the Arabic script or any
variations of it.
We invite contributions, discussions, and explorations that delve deep
into the unique linguistic structures, resources, challenges, and
untapped potential presented by Abjad and Ajami languages within the
realm of NLP and language resources. Our goal is to create synergies
among researchers by addressing the diverse phenomena and challenges
inherent in these rich linguistic traditions.
The workshop is proud to highlight our connections with the Masakhane
NLP community and collaborations with institutions worldwide, such as
COMSATS on Urdu, and the long-standing UCREL NLP Group at Lancaster
University, whose work encompasses over 20 languages worldwide,
including Abjad and Ajami languages.
Note: We chose the name Abjad for simplicity, but our focus includes
Abjad and other languages that have adopted the Arabic and Perso-Arabic
scripts, as well as Ajami languages. We acknowledge that Sorani Kurdish,
when written in Arabic script, follows an alphabet style rather than an
Abjad style.
Workshop Description:
We welcome contributions, discussions, and explorations that thoroughly
investigate the distinctive linguistic structures, resources,
challenges, and untapped potential of Abjad and Ajami languages within
the field of NLP and language resources. Our aim is to foster
collaboration among researchers by addressing the varied phenomena and
challenges inherent in these rich linguistic traditions.
Ajami languages, representing a myriad of African languages that have
adopted the Arabic script, span at least 43 distinct languages,
including Hausa, Fulfulde, Mandingo, Swahili, Wolof, Kanuri, and
Tamazight. The combined number of speakers of these languages is
estimated to exceed 200 million within Africa alone. Although Abjad has
been traditionally associated with Semitic languages such as Arabic,
Hebrew and Syriac, it has been adopted for writing by many other
language communities as in Perso-Arabic scripts used in Persian, Urdu,
Pashto, Sorani Kurdish, Azeri Turkish, Sindhi, and Uyghur, with a
collective estimated speaker population exceeding 500 million.
Altogether, these languages represent an approximate global aggregate of
1 billion speakers.
The adoption of the Arabic script across diverse linguistic landscapes
highlights its expansive and varied application, transcending genres
such as governmental correspondences, poetic compositions, religious
texts, and journalistic pursuits. This widespread use underscores the
imperative need to enhance digital infrastructure, tools, and resources
for these under-resourced languages. Advancing such resources is crucial
to nurturing linguistic diversity and resilience in both digital and
print media, ensuring the preservation of linguistic heritage in the
digital age.
Currently, there is an increasing interest in various NLP communities,
both in academia and industry, in writing systems. However, there is a
lack of initiatives focusing on the diverse phenomena and challenges of
the languages using an Abjad script. The AbjadNLP workshop aims to fill
this gap, fostering collaboration and innovation in this vital area of
study.
Motivation
Languages employing an Abjad script signify a pivotal and diverse
fragment of the global linguistic mosaic, traversing numerous countries
and regions and embodying a considerable populace of speakers. The
linguistic wealth and geographical diffusion of languages covered by
AbjadNLP present a prolific environment for exploration and advancement
in NLP. By channeling attention towards these languages, the realm of
NLP is poised to unlock access to an expansive and varied array of
linguistic constructions, subtleties, and cultural contexts, pivotal for
bolstering the versatility and adaptability of NLP models and
applications. The extensive spectrum of these languages not only unfolds
a valuable opportunity to amplify multilingualism and multiculturalism
in NLP research but also forges pathways for addressing the requisites
and challenges intrinsic to a diverse and extensive speaker population.
The broad adoption of Abjad scripts transcends diverse genres, including
governmental correspondences, poetic compositions, religious texts, and
journalistic pursuits. The sustained use of such scripts underscores the
imperative need to enhance digital infrastructure, tools, and resources
that elucidate the varied writing systems inherent to under-resourced
languages. Such advancement is crucial to nurturing linguistic diversity
and resilience in both digital and print media, ensuring that the
linguistic heritage does not diminish in the digital age.
This workshop can contribute to more inclusive and equitable
progressions in NLP, accommodating a broader assortment of languages and
dialects and promoting enhanced comprehension and interconnectivity
amongst varied linguistic communities. The assimilation and
prioritization of these linguistically affluent and diverse languages
are indispensable for the comprehensive progression and the universal
adaptability of NLP technologies. While our workshop primarily targets
languages using an Abjad script, we recognize that many historical
languages such as Aramaic , Sogdian, Parthian and Phoenician employed
such a writing system. As such, we believe that our workshop can enforce
links with researchers working on endangered languages as well.
We are proud to highlight our existing connections with the Masakhane
NLP community (www.masakhane.io) and collaborations with institutions
worldwide, such as COMSAT on Urdu (www.comsats.edu.pk), and the
long-standing UCREL NLP Group at Lancaster University, whose work
encompasses over 20 languages worldwide, including Abjad and Ajami
languages (http://ucrel-web-dev.lancs.ac.uk/ucrelng/).
Team
Our team is uniquely diverse and gender-balanced, comprising individuals
from a wide range of ethnic backgrounds. We represent a spectrum of
languages that use the Arabic script and include researchers from both
Linguistics and NLP, enriching the ever-needed collaboration between
these two fields. With expertise in language technology, Unicode, NLP,
resources, and multilingual text analysis, together, we aim to foster a
dynamic and inclusive environment for research and collaboration in the
field of NLP.
Call for papers
We invite submissions on topics that include, but are not limited to,
the following:
* Enabling core technologies: morphological analysis, disambiguation,
tokenisation, POS tagging, named entity detection, chunking, parsing,
semantic role labelling, sentiment analysis, language modelling, etc.
* Applications: machine translation, speech recognition, speech
synthesis, optical character recognition, pedagogy, assistive
technologies, social media, etc.
* Resources: dictionaries, annotated data, corpus, etc.
In addition, we extend a warm invitation to researchers and stakeholders
across the spectrum to contribute papers focusing on, but not limited
to, the following dimensions:
* Orthography descriptions (Constable 2002; Hosken 2003)
* Advancements in Font Technology, Glyph Rendering, and OCR
* Text Input Methodologies
* Development and Utilisation of Exploitable Dictionaries
* Enhancements in Spell-Checking Support
* Advancements in Speech-to-Text Solutions
* Progressive Natural Language Processing Techniques
* BLARK - Basic Language Resource Kit descriptions for languages using
Abjad or Ajami
* Insights and Experiences Utilising Data Supplied by the Unicode
Hosted Common Locale Data Repository in Abjad or Ajami.
* Morphological and syntactical challenges in Abjad or Ajami
Orthographies.
* Development of open access corpora in Abjad or Ajami.
* Text processing and transliteration challenges and solutions for
languages using Abjad or Ajami.
* Cultural and sociolinguistic considerations in NLP applications for
Abjad or Ajami.
* Languages resources and NLP tools for Abjad or Ajami.
Summary of the Call:
We welcome submissions of papers centred around the Abjad and Ajami
theme, focusing on supporting NLP language resources for non-Arabic
languages utilising Arabic script. We encourage submissions that span a
spectrum from theoretical investigations to practical applications,
aiming to underscore the distinctive challenges, solutions, and insights
that languages using Ajami and Abjad scripts introduce to the field of
NLP.
For the submission format and guidelines, we follow the COLING 2025
standards. Authors are encouraged to thoroughly review and adhere to the
COLING 2025 submission guidelines and author kit, which can be found at:
https://coling2025.org/calls/submission_guidlines/.
If authors are describing an orthography, we request that they include
the points recommended in (Hosken 2003
https://scripts.sil.org/WP-Encoding). For continuity across the workshop
and greater impact across industry applications, authors should consider
terminological (orthography, script, writing system, etc.) differences
presented by Constable (2002)
https://www.sil.org/resources/publications/entry/7853. The model
presented by Constable is the current Unicode model.
Please ensure that all submissions strictly conform to these standards
to streamline the review process and maintain uniformity across all
contributions. Both long papers (up to 8 pages) and short papers (up to
4 pages) are welcome. All submissions will undergo a rigorous
peer-review process, emphasizing originality, relevance, and clarity.
Submissions may be of two types:
* Long papers - up to eight (8) pages maximum, presenting substantial,
original, completed, and unpublished work.
* Short papers - up to four (4) pages, describing a small focused
contribution, negative results, system demonstrations, etc.
Submission URL: https://softconf.com/coling2025/AbjadNLP25/
Submission Guidelines:
https://coling2025.org/calls/submission_guidlines/
Provisional Key Dates:
* 1st Call for Papers Announcement: 16 July 2024
* 2nd Call for Papers Announcement: 16 August 2024
* Paper Submission Deadline: 15 November 2024
* Notification of Paper Acceptance: 6 December 2024
* Camera-ready Paper Deadline: 13 December 2024
* Workshop Date: either on 19 or 20 January 2024
Anti-Harassment Policy:
The workshop supports the COLING anti-harassment policy
https://coling2022.org/policy
Organising Committee:
General Chair:
* Dr. Mo El-Haj, Senior Lecturer at Lancaster University, is a Natural
Language Processing expert with a focus on Arabic and under-resourced
languages. He founded the FNP workshop series in 2018 and has organised
workshops at top NLP conferences including LREC and COLING.
http://elhaj.uk/ [1]
Programme Chairs:
* Mr Hugh Paterson III. Collaborative Scholar in linguistics, writing
systems, metadata, and archives. http://hugh4.us [2]
* Dr Saad Ezzini. Lecturer at Lancaster University, UK. Saad has
experience working on low-resource languages with a focus on machine
translation, QA, IR, and language modelling. http://ezzini.github.io [3]
* Dr Ignatius Ezeani. Senior Research Associate working on
multilingual NLP, Lancaster University, UK
https://www.lancaster.ac.uk/scc/about-us/people/ignatius-ezeani [4]
Review Committee:
* Dr Manum Hayat Khan. Cognitive Linguistics Researcher at the
University of La Rioja, Spain.
https://investigacion.unirioja.es/investigadores/1183/detalle
* Dr Muhammad Sharjeel. Assistant Professor working on Urdu NLP,
COMSATS University Islamabad, Pakistan
https://scholar.google.com/citations?user=xUF3l9gAAAAJ&hl=en [5]
Publication Chair:
* Dr Sina Ahmadi. Postdoctoral researcher at University of Zurich
focusing on leveraging language technology to assist languages with
constrained digital resources, with an emphasis on adapting current
natural language processing approaches and existing resources for
less-resourced languages. https://sinaahmadi.github.io/
Publicity Chairs:
* Ms Cynthia Amol. NLP Researcher focusing on low-resource languages
at Maseno University, Kenya.
https://ke.linkedin.com/in/cynthia-amol-779668119
* Ms Amal Haddad Haddad. PhD Student in translations and terminologies
at the University of Granada, Spain. http://lexicon.ugr.es/haddad
* Dr Jaleh Delfani. Research Fellow in Translation at University of
Surrey https://www.surrey.ac.uk/people/jaleh-delfani [6]
Advisory Committee:
* Prof. Ruslan Mitkov, Professor of Computing and Communications at
Lancaster University, actively working on different research topics from
the areas of Natural Language Processing (NLP), Computational
Linguistics and Translation Technology.https://wp.lancs.ac.uk/mitkov/
[7]
* Prof. Paul Rayson, Director of UCREL research centre at Lancaster
University, specialises in semantic-based NLP across 20 languages,
including Urdu and Arabic. With 25 years of experience, he excels in
noisy language environments like financial disclosures and has organised
various conferences and workshops.
https://www.lancaster.ac.uk/staff/rayson/ [8]
Programme Committee*
* Abdoulaye Diallo. Fula & Wolof. Independent Researcher
* Ahmed Abdelali. Arabic/Multilingual NLP. Qatar Computing Research
Institute (QCRI), Qatar.
* Ahmed AbuRa'ed. Arabic. UBC. Canada.
* Alp Oktem. Tigrinya and Kanuri. Translators without Borders
* Antonio Moreno Sandoval. Low Resourced Languages. UAM. Spain
* Azizud Din. Pashto. University Malaysia Sarawak. Malaysia
* Behnam Sabeti. Persian. Miras Technologies International. Iran
* Chenggang Mi. Uyghur. Xinjiang Technical Institute. China
* Clement Oyeleke. Yoruba. University of Ibadan. Nigeria
* Daniel Whitenack. Kimbundu, Fulfulde, Pular. SIL International. USA
* Derguene Mbaye. Wolof. Baamtu. Senegal
* Djamel Mostefa. Pashto. ELDA, France.
* Doaa Samy. Arabic. Cairo University, Egypt and LLI-UAM. Spain
* Elias W BA. Fula and Wolof. Baamtu. Senegal
* Eric Atwell. Arabic/Multilingual NLP. Leeds University, UK.
* Frederick Apina. Swahili. Parrot.AI. Tanzania
* George Giannakopoulos. Multilingual NLP. SKEL Lab - NCSR Demokritos.
Greece
* Haithem Afli. Arabic/Multilingual NLP. Dublin City University,
Ireland.
* Hazem Hajj. Arabic/Multilingual NLP. American University of Beirut,
Lebanon.
* Houda Bouamor. Arabic/Multilingual NLP. CMU. Qatar
* Ignatius Ezeani. Igbo, African Languages NLP. Lancaster University,
UK.
* Imed Zitouni. Arabic/Multilingual NLP. Microsoft Research, USA.
* Karim Bouzoubaa. Arabic/Multilingual NLP. Mohamed Vth University,
Morocco.
* Mariam Masoud. Sorani Kurdish. National University of Ireland
Galway. Ireland
* Lei Wang. Uyghur. Xinjiang Technical Institute. China
* Marina Litvak. Hebrew and Arabic. Sami Shamoon College of
Engineering, Israel
* Mo El-Haj. Arabic/Multilingual and Low resourced Languages.
Lancaster University, UK
* Muhammad Sharjeel. Urdu. COMSATS University Islamabad, Pakistan.
* Omid Momenzadeh. Persian. Miras Technologies International. Iran
* Paul Rayson. Multilingual and Low resourced Languages. Lancaster
University, UK
* Preni Golazizian. Persian. Miras Technologies International. Iran
* Rao Muhammad Adeel Nawab. Urdu. COMSATS University Islamabad,
Pakistan.
* Reza Fahmi. Persian. Miras Technologies International. Iran
* Samuel Olanrewaju. Yoruba, Yagba and Basa. University of Ibadan.
Nigeria
* Scott Piao. Multilingual and Low resourced Languages. Lancaster
University, UK
* Seyed Arad Ashrafi Asli. Persian. Miras Technologies International.
Iran
* Shervin Malmasi. Sorani Kurdish. Macquarie University. Australia
* Sina Ahmadi. Sorani Kurdish. National University of Ireland Galway.
Ireland
* Sokhar Samb. Wolof. ML & NLP. Senegal
* Tonghai Jiang. Uyghur. Xinjiang Technical Institute. China
* Waziri Shebogholo. Swahili. Parrot.AI. Tanzania
* Wole Akin. IsiXhosa, Yorùbá, Hausa, and Igbo. University of
Johannesburg. South Africa
* Xi Zhou. Uyghur. Xinjiang Technical Institute. China
* Yating Yang. Uyghur. Xinjiang Technical Institute. China
* Zahra Majdabadi. Persian. Miras Technologies International. Iran
_*We are in the process of forming a linguistically diverse program
committee who are experts in languages that use Arabic Script (Abjad and
Ajami), with the majority of the list already confirmed to serve as
reviewers. As soon as we gain access to SoftConf, we will extend
invitations to the remaining committee (if you see your name on the list
and want it removed, please contact any of the organisers). If your name
appears in this list and you want it removed, please contact us as soon
as possible and we'll make sure it's removed. Thanks_
Links:
------
[1] http://elhaj.uk/
[2] http://hugh4.us/
[3] http://ezzini.github.io/
[4] https://www.lancaster.ac.uk/scc/about-us/people/ignatius-ezeani
[5] https://scholar.google.com/citations?user=xUF3l9gAAAAJ&hl=en
[6] https://www.surrey.ac.uk/people/jaleh-delfani
[7] https://wp.lancs.ac.uk/mitkov/
[8] https://www.lancaster.ac.uk/staff/rayson/
Apologies for multiple posts!
Nakba-NLP 2025
International Workshop on Nakba Narratives as Language Resources
Part of the COLING 2025 <https://coling2025.org/> Conference
Abu Dhabi (virtual)
January 19, 2025
https://sina.birzeit.edu/nakba-nlp <https://sina.birzeit.edu/nakba-nlp/>
We invite submissions for Nakba-NLP 2025, a workshop dedicated to exploring and preserving Nakba narratives through the application of artificial intelligence, natural language processing, and corpus linguistics. We seek contributions on the following topics:
◈ Digitization of oral and written narratives
◈ Creation and labeling of language corpora and datasets
◈ Digital archives, metadata, and semantic/content mark-up
◈ Annotation tools and annotation guidelines
◈ Document classification, topic modeling, and information retrieval
◈ Named entity recognition for identifying people, places, organizations, and events
◈ Entity linking and relationship extraction
◈ Event detection and event argument extraction
◈ Knowledge Graphs and Linked Data
◈ Vocabularies, dictionaries, and ontologies
◈ Data visualization
◈ Knowledge representation
◈ Machine translation, summarisation, and paraphrasing
◈ Natural Language Generation
◈ Large Language Models
◈ Sentiment analysis and emotional content extraction
◈ Discourse analysis (e.g., bias, offensive language, and misinformation) related to Nakba narratives
◈ Voice & dialogue-based systems; ASR
◈ Palestinian dialects (written and spoken)
Suggested Datasets: a list of datasets can be found here https://t.ly/00Ul6
Important Dates:
=====================
All deadlines are 11:59 pm UTC-12 (anywhere on Earth).
- Submission Deadline: 25 November 2024
- Notifications of Acceptance: 5 December 2024
- Camera Ready Deadline: 13 December 2024 (cannot be changed)
Organizing Committee:
=====================
- Mustafa Jarrar, Birzeit University, Palestine
- Nizar Habash, New York University, UAE
- Mo El-Haj, Lancaster University, UK
- Zeina Jallad, Harvard Law School, USA
- Camille Mansour, Paris-Sorbonne University, France
- Diana Allan, McGill University, Canada
- Paul Rayson, Lancaster University, UK
Publicity Chairs
=====================
- Amal Haddad, University of Granada, Spain
- Sanad Malaysha, Birzeit University, Palestine
Contact: Nakba-NLP25_coling2025(a)softconf.com <mailto:Nakba-NLP25_coling2025@softconf.com>
Best, Mustafa
Dear Colleagues,
A draft version of my English-Portuguese Dictionary of Verbal Collocations
has seen the light at https://zenodo.org/records/12761211.
It is available in two formats
1. An interactive “complete” version accessible at
https://portuguese-english-collocations.shinyapps.io/PortugueseEnglishVerba…
2. A machine-readable interoperable version of the data available
at:
https://mangalamresearch.shinyapps.io/EnglishPortugueseVerbalCollocations/
The conversion of the dictionary data and its display in the form of a
digital dictionary were made possible thanks Ligeia Lugli, Tilak
Balavijayan and the NEH-funded project *Democratizing Digital Lexicography*
(HAA-290402-23).
Just in case, here is how to cite it
APA format
Tagnin, S. E. O. (2024). English-Portuguese Dictionary of Verbal
Collocations (0.1) [Data set]. Zenodo.
https://doi.org/10.5281/zenodo.12761211
MLA format
Tagnin, S. E. O. English-portuguese Dictionary of Verbal Collocations. 0.1,
Zenodo, 17 de Julho de 2024, doi:10.5281/zenodo.12761211.
Please, feel free to explore it. We welcome criticisms and suggestions. We
know there are still problems with diacritics (hopefully to be solved).
Best wishes
Stella
Stella E. O. Tagnin
Projeto COMET - Corpus Multilíngue para Ensino e Tradução
Universidade de São Paulo - FFLCH - DLM
Av. Prof. Luciano Gualberto, 403
05508-900 -São Paulo - SP
Brazil
comet.fflch.usp.br
55 11 3091 2929
ORCID: 0000-0002-5517-2710
The 9th International Workshop on Computational Linguistics for Uralic Languages (IWCLUL 2024) will be organized by ACL SIGUR. The proceedings of the event will be published in the ACL anthology. The workshop will take place in November 28-29, 2024 in Helsinki, Finland at Metropolia University of Applied Sciences.
https://acl-sigur.github.io/iwclul2024.html
Submission deadline: October 20, 2024
Registration/publication fees: 0€!
We solicit original and unpublished work related to NLP methods for Uralic languages, including multilingual methods that include at least one Uralic language (e.g. Finnish, Estonian, Hungarian etc). Appropriate topics include (but are not limited to):
- Multilingual approaches in NLP presenting work on at least one Uralic language
- Parsers, analysers and processing pipelines of Uralic languages
- Lexical databases, electronic dictionaries
- Finished end-user applications aimed at Uralic languages, such as spelling or grammar checkers, machine translation or speech processing
- Evaluation methods and gold standards, tagged corpora, treebanks
- Reports on language-independent or unsupervised methods as applied to Uralic languages
- Surveys and review articles on subjects related to computational linguistics for one or more Uralic languages
- Any work that aims at combining efforts and reducing duplication of work
- How to elicit activity from the language community, agitation campaigns, games with a purpose
Short papers can be up to 4 pages in length (5 for camera-ready version). Short papers can report on work in progress or a more targeted contribution such as software or partial results.
Long papers can be up to 8 pages in length (9 for camera-ready version). Long papers should report on previously unpublished, completed, original work.
Lightning talks submitted as 750-word abstracts. Lightning talks are suited for discussing ideas or presenting work in progress. The abstracts will be published in a lightning proceedings on Zenodo.
All submission formats can have an unlimited number of pages for references. All submissions must follow the ACL stylesheet.
The submissions must be anonymous, and they will be peer-reviewed by our program committee. The peer review is double blinded. Papers must be submitted using the conference submission system by the deadline. At least one of the authors of an accepted paper must attend the event and present their paper.
Accepted papers (short and long) will be published in the joint proceedings that will appear in the ACL Anthology. Accepted papers will also be given an additional page to address the reviewers’ comments. The length of a camera-ready submission can then be 5 pages for a short paper and 9 for a long paper with an unlimited number of pages for references.
Important dates:
- Paper submission (full and short): October 20, 2024
- Notification of acceptance: November 3, 2024
- Camera ready deadline: November 10, 2024
- Registration deadline: November 10, 2024
- Workshop: November 28-29, 2024
*****************************************
CALL FOR PAPERS
The 40th ACM/SIGAPP Symposium on Applied Computing
ACM SAC 2025
March 31 - April 4, 2025 - Catania, Italy
Knowledge and Natural Language Processing Track
*****************************************
Important Dates
Author deadline for submissions: September 20, 2024
Author notification of acceptance: October 30, 2024
Author camera ready and registration due: November 29, 2024
*****************************************
Aim
Aim of the Knowledge and Natural Language Processing (KNLP) track at ACM SAC is to investigate techniques and application of knowledge engineering and natural language processing, two extremely interdisciplinary and lively research areas at the core of Artificial Intelligence.
In particular, the track welcomes contributions combining and complementing methods and approaches from both areas.
Topics of interest include, but are not limited to:
- Natural Language Processing
NLP tasks for Knowledge Extraction
NLP for Ontology Population and Learning
Sentiment Analysis and Opinion Mining for Knowledge Applications
Interplay between Language and Ontologies
NLP for Explainable Knowledge
Machine Translation techniques for Multi-lingual Knowledge
NLP for the Web
(Large) Language Models and Knowledge
- Knowledge
Knowledge to improve NLP tasks
Knowledge for Information Retrieval
Knowledge-based Sentiment Analysis and Opinion Mining
Combining Knowledge and Deep Learning for NLP
Knowledge for Text Summarization and Generation
Knowledge for Persuasion
Knowledge-based Machine Translation
Knowledge for the Web
Linked Data for NLP
Knowledge-based NL Explainability
RAG and Knowledge injection for Language Models
- Applications
Real-world applications that exploit Knowledge and NLP
Knowledge and NLP Systems for Big Data scenarios
Knowledge and NLP technology for diverse, equitable, and inclusive society
Deployment of Knowledge and NLP Systems in specific domains, such as:
Digital Humanities and Social Sciences
eGovernment and public administration
Life sciences, health and medicine
News and Data Streaming
*****************************************
Paper Submission
Research papers and experience reports related to the above topics are solicited. Submissions must not have been published or be concurrently considered for publication elsewhere. Papers should be submitted in PDF using the ACM-SAC proceedings format using the submission link on the SAC 2025 website (https://www.sigapp.org/sac/sac2025/). Authors' names and affiliations should be entered separately at the submission site and not appear in the submitted papers. Each submission will be reviewed in a DOUBLE-BLIND process according to the ACM-SAC Regulations. Student Research Competition (SRC) submissions are welcome (see SAC 2025 website for details).
Full papers are limited to 8 pages, in camera-ready format, included in the registration fee. Authors have the option to include up to two (2) extra pages (paying an extra charge).
Posters are limited to 2 pages, in camera-ready format, included in the registration fee. Authors have the option to include only one (1) extra page (paying an extra charge).
SRC Abstracts are limited to 3 pages, in camera-ready format, included in the registration fee. No extra pages are allowed.
Paper selection is based on originality, technical contribution, presentation quality, and relevance to the Knowledge and Natural Language Processing Track. Some papers may be accepted as posters.
Paper registration is required, allowing the inclusion of the paper/poster in the conference proceedings. An author or a proxy attending SAC MUST present the paper. This is a requirement for the paper/poster to be included in the ACM digital library. No-show of registered papers and posters will result in excluding them from the ACM digital library.
*****************************************
Track Co-Chairs
Patrizio Bellan, Fondazione Bruno Kessler (FBK)
Marco Bombieri, Università degli Studi di Verona
Mauro Dragoni, Fondazione Bruno Kessler (FBK)
Marco Rospocher, Università degli Studi di Verona
*****************************************
Programme Committee
TBA
*****************************************
General Inquiries
For further information, please visit SAC Knowledge and Natural Language Processing Track (https://knlp-sac.github.io/2025/) and SAC 2024 conference websites (https://www.sigapp.org/sac/sac2025/) or feel free to contact the Track Co-Chairs at knlp(a)fbk.eu .
*****************************************
CALL FOR PAPERS
The 40th ACM/SIGAPP Symposium on Applied Computing
ACM SAC 2025
March 31 - April 4, 2025 - Catania, Italy
Knowledge and Natural Language Processing Track
*****************************************
Important Dates
Author deadline for submissions: September 20, 2024
Author notification of acceptance: October 30, 2024
Author camera ready and registration due: November 29, 2024
*****************************************
Aim
Aim of the Knowledge and Natural Language Processing (KNLP) track at ACM SAC is to investigate techniques and application of knowledge engineering and natural language processing, two extremely interdisciplinary and lively research areas at the core of Artificial Intelligence.
In particular, the track welcomes contributions combining and complementing methods and approaches from both areas.
Topics of interest include, but are not limited to:
- Natural Language Processing
NLP tasks for Knowledge Extraction
NLP for Ontology Population and Learning
Sentiment Analysis and Opinion Mining for Knowledge Applications
Interplay between Language and Ontologies
NLP for Explainable Knowledge
Machine Translation techniques for Multi-lingual Knowledge
NLP for the Web
(Large) Language Models and Knowledge
- Knowledge
Knowledge to improve NLP tasks
Knowledge for Information Retrieval
Knowledge-based Sentiment Analysis and Opinion Mining
Combining Knowledge and Deep Learning for NLP
Knowledge for Text Summarization and Generation
Knowledge for Persuasion
Knowledge-based Machine Translation
Knowledge for the Web
Linked Data for NLP
Knowledge-based NL Explainability
RAG and Knowledge injection for Language Models
- Applications
Real-world applications that exploit Knowledge and NLP
Knowledge and NLP Systems for Big Data scenarios
Knowledge and NLP technology for diverse, equitable, and inclusive society
Deployment of Knowledge and NLP Systems in specific domains, such as:
Digital Humanities and Social Sciences
eGovernment and public administration
Life sciences, health and medicine
News and Data Streaming
*****************************************
Paper Submission
Research papers and experience reports related to the above topics are solicited. Submissions must not have been published or be concurrently considered for publication elsewhere. Papers should be submitted in PDF using the ACM-SAC proceedings format using the submission link on the SAC 2025 website (https://www.sigapp.org/sac/sac2025/). Authors' names and affiliations should be entered separately at the submission site and not appear in the submitted papers. Each submission will be reviewed in a DOUBLE-BLIND process according to the ACM-SAC Regulations. Student Research Competition (SRC) submissions are welcome (see SAC 2025 website for details).
Full papers are limited to 8 pages, in camera-ready format, included in the registration fee. Authors have the option to include up to two (2) extra pages (paying an extra charge).
Posters are limited to 2 pages, in camera-ready format, included in the registration fee. Authors have the option to include only one (1) extra page (paying an extra charge).
SRC Abstracts are limited to 3 pages, in camera-ready format, included in the registration fee. No extra pages are allowed.
Paper selection is based on originality, technical contribution, presentation quality, and relevance to the Knowledge and Natural Language Processing Track. Some papers may be accepted as posters.
Paper registration is required, allowing the inclusion of the paper/poster in the conference proceedings. An author or a proxy attending SAC MUST present the paper. This is a requirement for the paper/poster to be included in the ACM digital library. No-show of registered papers and posters will result in excluding them from the ACM digital library.
*****************************************
Track Co-Chairs
Patrizio Bellan, Fondazione Bruno Kessler (FBK)
Marco Bombieri, Università degli Studi di Verona
Mauro Dragoni, Fondazione Bruno Kessler (FBK)
Marco Rospocher, Università degli Studi di Verona
*****************************************
Programme Committee
TBA
*****************************************
General Inquiries
For further information, please visit SAC Knowledge and Natural Language Processing Track (https://knlp-sac.github.io/2025/) and SAC 2024 conference websites (https://www.sigapp.org/sac/sac2025/) or feel free to contact the Track Co-Chairs at knlp(a)fbk.eu .
Deadline extension: 15 August 2024
TRANSLATING AND THE COMPUTER CONFERENCE 2024 (TC46) - Call for
Presentations and Papers
https://asling.org/tc46/call-for-papers-cfp/
"Navigating the future of language: Innovation, integration,
inspiration"
Embracing innovation in language technologies - From AI to traditional
practices -, TC46 welcomes submissions on a broad spectrum of topics
related to language technologies in the provision of language services.
While there is a special emphasis on the advancements and implications
of AI and Generative AI, we strongly encourage contributions that cover
a wide range of interests and perspectives in the language services
field. Whether you are deeply involved in AI-driven projects or are
focused on traditional or emerging practices independent of AI, your
insights are invaluable.
* Deadline for submitting proposals for full length talks (academic
and user-experience) and short/Poster talks for TC46 is extended to 15
August
* Deadline for submitting proposals for workshops and panels is
extended to 15 August
https://asling.org/tc46/call-for-papers-cfp/