Neural language models have revolutionised natural language processing (NLP) and have provided state-of-the-art results for many tasks. However, their effectiveness is largely dependent on the pre-training resources. Therefore, language models (LMs) often struggle with low-resource languages in both training and evaluation. Recently, there has been a growing trend in developing and adopting LMs for low-resource languages. LoResLM aims to provide a forum for researchers to share and discuss their ongoing work on LMs for low-resource languages.
Topics
LoResLM 2025 invites submissions on a broad range of topics related to the development and evaluation of neural language models for low-resource languages, including but not limited to the following.
* Building language models for low-resource languages. * Adapting/extending existing language models/large language models for low-resource languages. * Corpora creation and curation technologies for training language models/large language models for low-resource languages. * Benchmarks to evaluate language models/large language models in low-resource languages. * Prompting/in-context learning strategies for low-resource languages with large language models. * Review of available corpora to train/fine-tune language models/large language models for low-resource languages. * Multilingual/cross-lingual language models/large language models for low-resource languages. * Applications of language models/large language models for low-resource languages (i.e. machine translation, chatbots, content moderation, etc.
Important Dates
* Paper submission due – 5th November 2024 * Notification of acceptance – 25th November 2024 * Camera-ready due – 13th December 2024 * LoResLM 2025 workshop – 19th / 20th January 2025 co-located with COLING 2025
Submission Guidelines
We follow the COLING 2025 standards for submission format and guidelines. LoResLM 2025 invites the submission of long papers of up to eight pages and short papers of up to four pages. These page limits only apply to the main body of the paper. At the end of the paper (after the conclusions but before the references), papers need to include a mandatory section discussing the limitations of the work and, optionally, a section discussing ethical considerations. Papers can include unlimited pages of references and an unlimited appendix. To prepare your submission, please make sure to use the COLING 2025 style files available here:
* Latex - https://coling2025.org/downloads/coling-2025.zip * Word - https://coling2025.org/downloads/coling-2025.docx * Overleaf - https://www.overleaf.com/latex/templates/instructions-for-coling-2025-procee...
Papers should be submitted through Softconf/START using the following link: https://softconf.com/coling2025/LoResLM25/
Organising Committee
* Hansi Hettiarachchi, Lancaster University, UK * Tharindu Ranasinghe, Lancaster University, UK * Paul Rayson, Lancaster University, UK * Ruslan Mitkov, Lancaster University, UK * Mohamed Gaber, Birmingham City University, UK * Damith Premasiri, Lancaster University, UK * Fiona Anting Tan, National University of Singapore, Singapore * Lasitha Uyangodage, University of Münster, Germany
URL - https://loreslm.github.io/ Twitter - https://x.com/LoResLM2025
Best Regards Tharindu Ranasinghe
Dear Sender,
I am currently out of the office and will not be checking emails regularly. I will return on September 9, and will respond to your message as soon as possible after that date.
Best regards, Charlott Jakob
On 19 Jul 2024, at 13:00, Ranasinghe, Tharindu via Corpora corpora@list.elra.info wrote:
Neural language models have revolutionised natural language processing (NLP) and have provided state-of-the-art results for many tasks. However, their effectiveness is largely dependent on the pre-training resources. Therefore, language models (LMs) often struggle with low-resource languages in both training and evaluation. Recently, there has been a growing trend in developing and adopting LMs for low-resource languages. LoResLM aims to provide a forum for researchers to share and discuss their ongoing work on LMs for low-resource languages.
Topics
LoResLM 2025 invites submissions on a broad range of topics related to the development and evaluation of neural language models for low-resource languages, including but not limited to the following. • Building language models for low-resource languages.
• Adapting/extending existing language models/large language models for low-resource languages.
• Corpora creation and curation technologies for training language models/large language models for low-resource languages.
• Benchmarks to evaluate language models/large language models in low-resource languages.
• Prompting/in-context learning strategies for low-resource languages with large language models.
• Review of available corpora to train/fine-tune language models/large language models for low-resource languages.
• Multilingual/cross-lingual language models/large language models for low-resource languages.
• Applications of language models/large language models for low-resource languages (i.e. machine translation, chatbots, content moderation, etc.
Important Dates
• Paper submission due – 5th November 2024
• Notification of acceptance – 25th November 2024
• Camera-ready due – 13th December 2024
• LoResLM 2025 workshop – 19th / 20th January 2025 co-located with COLING 2025
Submission Guidelines
We follow the COLING 2025 standards for submission format and guidelines. LoResLM 2025 invites the submission of long papers of up to eight pages and short papers of up to four pages. These page limits only apply to the main body of the paper. At the end of the paper (after the conclusions but before the references), papers need to include a mandatory section discussing the limitations of the work and, optionally, a section discussing ethical considerations. Papers can include unlimited pages of references and an unlimited appendix. To prepare your submission, please make sure to use the COLING 2025 style files available here: • Latex - https://coling2025.org/downloads/coling-2025.zip •
• Word - https://coling2025.org/downloads/coling-2025.docx •
• Overleaf - https://www.overleaf.com/latex/templates/instructions-for-coling-2025-procee... •
Papers should be submitted through Softconf/START using the following link: https://softconf.com/coling2025/LoResLM25/
Organising Committee
• Hansi Hettiarachchi, Lancaster University, UK
• Tharindu Ranasinghe, Lancaster University, UK
• Paul Rayson, Lancaster University, UK
• Ruslan Mitkov, Lancaster University, UK
• Mohamed Gaber, Birmingham City University, UK
• Damith Premasiri, Lancaster University, UK
• Fiona Anting Tan, National University of Singapore, Singapore
• Lasitha Uyangodage, University of Münster, Germany
URL - https://loreslm.github.io/ Twitter - https://x.com/LoResLM2025
Best Regards Tharindu Ranasinghe
_______________________________________________ Corpora mailing list -- corpora@list.elra.info https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/ To unsubscribe send an email to corpora-leave@list.elra.info