- Corpora - ELRA lists

CALL FOR NTCIR-18 TASK PARTICIPATION
by CHUNG-CHI CHEN 13 May '24

13 May '24

CALL FOR NTCIR-18 TASK PARTICIPATION NTCIR-18 tasks: http://research.nii.ac.jp/ntcir/ntcir-18/tasks.html NTCIR-18 registration: http://research.nii.ac.jp/ntcir/ntcir-18/howto.html ____________________________________________________________ NTCIR-18 (June 10-13, 2025, Tokyo, Japan) now calls for task participation of anyone interested in research on information access technologies and their evaluation, such as retrieval from a large amount of document collections, question answering and natural language processing. We welcome students, young researchers, professors who supervise students, researchers working for a company, and anyone who is interested in informatics. == What is NTCIR? == Development of Information Access technologies based on techniques of Information Retrieval, Natural Language Processing, and Database Management becomes increasingly more important for many applications, (e.g., providing effective access to Web resources and text archives, and analyzing big data obtained from various kinds of sensors). It is indispensable for developing such technologies to experimentally evaluate them by using test collections constructed under collaborations of many researchers. Over the past 20 years, NTCIR has been formulating the infrastructure for the evaluation, and contributing to development of the Information Access technologies. A total of over 80 “evaluation tasks” have been organized, attracting over 1,000 participant research groups worldwide so far. Furthermore, over 4,600 research groups have signed up to use the NTCIR test collections in their research. Consequently, NTCIR has been a major forum for researchers to intensively discuss the evaluation methodology of emerging information access technologies. == NTCIR-18 Tasks == NTCIR-18 Program Committee has selected 10 tasks. The overview slide of each task can be available at the kick-off page: https://research.nii.ac.jp/ntcir/ntcir-18/kickoffcfp.html For more details, please visit the websites of each task: Task Overview page: http://research.nii.ac.jp/ntcir/ntcir-18/tasks.html ---------------------------------------------------------------------------- NTCIR-18 Tasks Core Tasks AEOLLM: Automatic Evaluation of LLMs https://aeollm.github.io/ FairWeb-2: The Second Fair Web Task sakailab.com/fairweb2/ FinArg-2: Temporal Inference of Financial Arguments https://sites.google.com/nlg.csie.ntu.edu.tw/ntcir-18-finarg-2/finarg-2 Lifelog-6: Personal Lifelog Organisation & Retrieval Task http://lifelogsearch.org/ntcir-lifelog/ RadNLP: Natural Language Processing for Radiology https://sociocom.naist.jp/radnlp-2024 MedNLP-CHAT Medical Natural Language Processing for AI Chat https://sociocom.naist.jp/mednlp-chat Transfer-2: Resource Transfer Based Dense Retrieval https://github.com/ntcirtransfer/transfer2/discussions Pilot Tasks HIDDEN-RAD: Hidden Causality Inclusion in Radiology Report Generation https://sites.google.com/view/ntcir-18-hidden-rad/hidden-rad SUSHI: Searching Unseen Sources for Historical Information https://sites.google.com/view/ntcir-sushi-task/ U4: Unifying, Understanding, and Utilizing Unstructured Data in Financial Reports https://sites.google.com/view/ntcir18-u4/ == How to Participate == 1. Please carefully read “How to Participate to NTCIR-18 Task(s)“: http://research.nii.ac.jp/ntcir/ntcir-18/howto.html 2. Please register at http://ntcir.nii.ac.jp/index.php/ntcir-18-registration-form/ The datasets of each task will be delivered to the team after registration (the date may vary depending on the task). Registration Due (Depend on the task): Nov. 1st, 2024 AEOLLM: Automatic Evaluation of LLMs FairWeb-2: The Second Fair Web Task RadNLP: Natural Language Processing for Radiology MedNLP-CHAT: Medical Natural Language Processing for AI Chat HIDDEN-RAD: Hidden Causality Inclusion in Radiology Report Generation Dec. 15th, 2024 Transfer-2: Resource Transfer Based Dense Retrieval SUSHI Searching Unseen Sources for Historical Information U4: Unifying, Understanding, and Utilizing Unstructured Data in Financial Reports Jan 8th, 2025 FinArg-2: Temporal Inference of Financial Arguments Lifelog-6: Personal Lifelog Organisation & Retrieval Task == Schedule == (*Schedule can be different in different tasks. Please visit webpages of each task for the details.) May 2024 Dataset release* Jun-Dec 2024 Dry run* Sep 2024-Feb 2025 Formal run* Feb 1, 2025 Evaluation results return Feb 1, 2025 Task overview release (draft) Mar 1, 2025 Submission due of participant papers (draft) May 1, 2025 Camera-ready participant paper due Jun 10-13, 2025 NTCIR-18 Conference (At the NTCIR-18 Conference, an online presentation will be available) == Questions? == For information regarding the task specifications etc., please contact the NTCIR Program Chairs: ntc18-pcc(a)nii.ac.jp or task organizers. For information regarding the online registration and previous NTCIR test collections, please contact the NTCIR office: ntc-secretariat(a)nii.ac.jp We are looking forward to your participation! NTCIR-18 Program Co-Chairs: Qingyao Ai (Tsinghua University, China) Chung-Chi Chen (AIST, Japan) Shoko Wakamiya (NAIST, Japan) NTCIR-18 General Co-Chairs: Charles Clarke (University of Waterloo, Canada) Noriko Kando (National Institute of Informatics, Japan) Makoto P. Kato (University of Tsukuba, Japan) Yiqun Liu (Tsinghua University, China)

1 0

Call for Participation: MWE-UD @ LREC-COLING
by Gosse Bouma 12 May '24

12 May '24

Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD 2024): Call for Participation Co-located with LREC-Coling, Turin, Italy or online, May, 25 (full day) Invited Speakers: * Harish Tayyar Madabushi /, /University of Bath/, Every Time We Hire an LLM, the Reasoning Performance of the Linguists Goes Up/ * Natalia Leshina, Radboud University, /Using Universal Dependencies for testing hypotheses about communicative efficiency/ The MWE workshop has been organised by the MWE sectionof SIGLEXat major NLP conferences since 2003. Universal Dependencies is a framework for cross-linguistically consistent treebank annotation that has so far been applied to over 100 languages. Starting with the first UD workshop in 2017, this joint workshop is the 7th edition in the series. The Unidive COST action s an interdisciplinary scientific network devoted to universality, diversity, and idiosyncrasy in language technology. UniDive is the co-organiser of this year’s joint WS. Conference website and program details: https://multiword.org/mweud2024/ Registration: https://lrec-coling-2024.org

1 0

1st CFP MultiCardioNER (CLEF/BioASQ 2024): Clinical Named Entity Recognition adaptation shared task (multilingual & cardiology)
by Martin Krallinger 10 May '24

10 May '24

(Apologies for cross-posting) *CFP MultiCardioNER (CLEF/BioASQ 2024): Clinical Named Entity Recognition adaptation shared task (multilingual & cardiology)* *https://temu.bsc.es/multicardioner/ <https://temu.bsc.es/multicardioner/>* The MultiCardioNER track focuses on the adaptation of clinical NER systems to specific high impact clinical application domains (cardiovascular diseases, the leading cause of death globally) as well as to multiple languages (English, Spanish and Italian), focusing on two clinical entity types: diseases and medications. *Key information:* · *Web*: https://temu.bsc.es/multicardioner · *Data*: https://zenodo.org/records/10948355 · *BioASQ** web*: http://bioasq.org/ · *Registration**:* https://temu.bsc.es/multicardioner/registration/ *Motivation *The extraction of clinical variables from medical content is key to enable efficient healthcare data analytics. Due to the highly specialized medical language, with considerable variation depending on the medical discipline, more specialized automatic semantic annotation resources are needed, not only for English but also other languages. This is particularly true for clinical content related to cardiovascular diseases (CVDs), which represent the leading cause of death globally, responsible for approximately 17.9 million deaths/year. The MultiCardioNER task will focus on the automatic recognition of two key clinical variables or concept types, namely diseases and medications in cardiology clinical case documents with the following two aims: · Adaptation of general clinical concept recognition systems to cardiology case reports to assess and determine how well such systems can be adapted to high impact clinical application domains / specialties (cardiology disease NER - CardioDis subtrack: Spanish). · Promote the comparative assessment and development of clinical entity recognition systems for multiple languages (i.e., medication mention detection) as well as adaptation to specific medical specialties (MultiDrug subtrack: English, Spanish and Italian) To enable the adaptation of general medical NER systems for diseases and medications the MultiCardioNER task will rely on a training collection of 1000 general clinical case reports in Spanish annotated with diseases (Spanish) and medications (English, Spanish and Italian). Moreover, to be able to adapt such general medical NER approaches to cardiology case reports a development set of 250 cardiology cases will be released. The test set will consist of an additional test collection of 250 cardiology case reports. The evaluation of systems for this task will use flat evaluation, mainly micro-averaged Precision, Recall and F-measure (MiF). Sub-tracks: *Subtask 1 (CardioDis): *Spanish adaptation of disease recognition systems to the cardiology domain *Subtask 2 (MultiDrug): *Multilingual (Spanish, English and Italian) adaptation of medication recognition systems to the cardiology domain *Tentative schedule* · MultiCardioNER Train+Dev Set Release April 9th, 2024 · MultiCardioNER Annotation Guidelines Release April 17th, 2024 · MultiCardioNER Gazetteer Release April 17th, 2024 · MultiCardioNER Test Set Texts Release May 2nd, 2024 · Participant Test Predictions Deadline · May 15th, 2024 · Participant Evaluation Result Release May 19th, 2024 · Submission of Participant Papers Deadline May 31st, 2024 · Notification of Acceptance of Participant Papers June 24th, 2024 · Submission of Camera-ready Participant Papers Deadline July 8th, 2024 · BioASQ @ CLEF2024 September 9th-12th, 2024 *Publications & conference* Following previous BioASQ/CLEF efforts, participating teams will be invited to contribute a short systems description paper for the CLEF 2024 proceedings, and to give a short presentation of their approach at the BioASQ workshop at the CLEF 2024 conference (September 09-12, 2024, in Grenoble, France) *The **MultiCardioNER Organizers & collaborators:* - Salvador Lima-López, Barcelona Supercomputing Center (BSC), Spain - Eulàlia Farré-Maduell, Barcelona Supercomputing Center (BSC), Spain - Jan Rodríguez-Miret, Barcelona Supercomputing Center (BSC), Spain - Martin Krallinger, Barcelona Supercomputing Center (BSC), Spain - Anastasios Nentidis, National Center for Scientific Research Demokritos, Greece - Anastasia Krithara, National Center for Scientific Research Demokritos, Greece - Georgios Katsimpras, National Center for Scientific Research Demokritos, Greece - Livia Lilli, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Italy - Jacopo Lenkowicz, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Italy - Jonathan Kossoff, University College London Hospitals NHS Foundation Trust, UK - Giovanna Ceroni, University College London, UK - Anoop Shah, University College London Hospitals NHS Foundation Trust, UK ======================================= Martin Krallinger, Dr. Head of NLP for Biomedical Information Analysis Unit Barcelona Supercomputing Center (BSC-CNS) https://www.linkedin.com/in/martin-krallinger-85495920/ =======================================

1 1

Deadline Extension: 5th International Workshop on Computational Approaches to Historical Language Change 2024 (LChange’24)
by Syrielle Montariol 10 May '24

10 May '24

***Apologies for possible cross-posting *** Last Call for Papers + Deadline Extension: 5th International Workshop on Computational Approaches to Historical Language Change (LChange’24) We're organizing a full-day workshop co-located with the ACL conference on Aug 15, 2024 in Bangkok and online. The deadline is being *extended by one week, *to *May 17th AOE.* Workshop: https://www.changeiskey.org/event/2024-acl-lchange/ Contact email: lchange(a)changeiskey.org Workshop description The LChange workshop targets all aspects of computational modeling of language change, historical as well as synchronic change. It is running in its fifth iteration following successful workshops in 2019 <https://languagechange.org/events/2019-acl-lcworkshop/>, 2021 <https://languagechange.org/events/2021-acl-lcworkshop/>, 2022 <https://languagechange.org/events/2022-acl-lchange/>, and 2023 <https://languagechange.org/events/2023-emnlp-lchange/>, and will be co-located with ACL 2024 in Bangkok (Thailand), as a hybrid event. The workshop will take place on Thursday 15 August 2024. The main topics of the workshop remain the same: all aspects around computational approaches to language change with a focus on digital text corpora. LChange explores state-of-the-art computational methodologies, theories and digital text resources on exploring the time-varying nature of human language. The aim of this workshop is to provide pioneering researchers who work on computational methods, evaluation, and large-scale modeling of language change an outlet for disseminating research on topics concerning language change. Besides these goals, this workshop will also support discussion on evaluating computational methodologies for uncovering language change. We’ll also be offering mentorship to students, to discuss their research topic with a member of the field, regardless of whether they are submitting a paper or not. We'll have two amazing *keynote speakers*, Antske Fokkens <http://wordpress.let.vupr.nl/antske/> (Vrije Universiteit in Amsterdam) and Johann-Mattis List <https://www.eva.mpg.de/linguistic-and-cultural-evolution/staff/mattis-list/> (University of Passau). Important Dates * May 10 *May 17th*, 2024: Paper submission * June 20, 2024: Notification of acceptance * June 30, 2024: Camera-ready papers due * August 15, 2024: Workshop date Submissions We accept two types of submissions, long and short papers, consisting of up to eight (8) and four (4) pages of content, respectively, plus unlimited references; final versions will be given one additional page of content so that reviewers' comments can be taken into account. We also welcome papers focusing on releasing a dataset or a model; these papers fall into the short paper category. We invite original research papers from a wide range of topics, including but not limited to: - Novel methods for detecting diachronic semantic change and lexical replacement - Automatic discovery and quantitative evaluation of laws of language change - Computational theories and generative models of language change - Sense-aware (semantic) change analysis - Diachronic word sense disambiguation - Novel methods for diachronic analysis of low-resource languages - Novel methods for diachronic linguistic data visualization - Novel applications and implications of language change detection - Quantification of sociocultural influences on language change - Cross-linguistic, phylogenetic, and developmental approaches to language change - Novel datasets for cross-linguistic and diachronic analyses of language Accepted papers will be presented orally or as posters and included in the workshop proceedings. Submissions are open to all and are to be submitted anonymously. All papers will be refereed through a double-blind peer review process by at least three reviewers with final acceptance decisions made by the workshop organizers. If you have published in the field previously, and are interested in helping out in the program committee to review papers, please send us an email! Workshop organizers Nina Tahmasebi, University of Gothenburg Syrielle Montariol, École polytechnique fédérale de Lausanne Andrey Kutuzov, University of Oslo Simon Hengchen, iguanodon.ai and Université de Genève David Alfter, University of Gothenburg Francesco Periti, University of Milan Pierluigi Cassotti, University of Gothenburg

1 0

CfP (extended deadline): The EPIA Track of Natural Language, Text Mining and Applications (NLP-TeMA 2024)
by Pablo Gamallo 10 May '24

10 May '24

The EPIA Track of Natural Language, Text Mining and Applications (NLP-TeMA 2024) *Important dates* Paper submission extended deadline *May 20, 2024 * Notification of paper acceptance June 20, 2024 Camera-ready papers deadline July 15, 2024 Conference dates September 3-6, 2024 The EPIA Conference on Artificial Intelligence (AI) is a well-established European conference in the field of AI. The 23st edition, EPIA 2024, will take place in Viana do Castelo from 3th to 6th of September, 2024. As in previous editions, this international conference is hosted with the patronage of the Portuguese Association for Artificial Intelligence (APPIA). The Track of Natural Language, Text Mining and Applications (NLP-TeMA 2024) of EPIA is a forum for researchers working in Human Language Technologies, i.e. Natural Language Processing (NLP), Computational Linguistics (CL), Natural Language Engineering (NLE), Text Mining (TM), Information Retrieval (IR), and related areas. The most natural form of sharing knowledge is indeed through textual documents. Especially on the Web, a huge amount of textual information is openly published every day, on many different topics and written in natural language, thus offering new insights and many opportunities for innovative applications of Human Language Technologies. Following advances in general AI sub-fields such as NLP, Machine Learning (ML) and Deep Learning (DL), text mining is now even more valuable as tool for bridging the gap between language theories and effective use of natural language contents, for harnessing the power of semi-structured and unstructured data, and to enable important applications in real-world heterogeneous environments. Both hidden and new knowledge can be discovered by using NLP and Text Mining methods, at multiple levels and in multiple dimensions, and often with high commercial value. Topics of Interest Natural Language Processing: Language and Cognitive Modeling Tagging, Chunking and Parsing Morphology and Word Segmentation Natural Language Generation Discourse and Pragmatics Sentence-level Semantics and Text Inference Language Resources: Acquisition and Usage. Lexical Knowledge Acquisition Entailment and Paraphrases Entity Recognition and Word Sense Disambiguation Natural Language Understanding Language Modeling Mathematical Properties of Language NLP for Low-Resource Languages Text Mining and Applications: Text Clustering, Classification and Summarization Sentiment Analysis and Argument Mining Computational Social Science Multi-Word Units Machine Learning for NLP and Text Mining Spatio-Temporal and Big Text Mining Cross-Lingual Approaches Algorithms and Data Structures for Text Mining Information Retrieval and Information Extraction Question-Answering and Dialogue Systems Text-Based Prediction and Forecasting Web Content Annotation Health/Biomedical/Legal and other Text Mining Applications The Track Organizing Committee: Joaquim Silva Pablo Gamallo Irene Rodrigues Paulo Quaresma Alípio Jorge Joaquim Ferreira da Silva Departamento de Informática, Nova School of Science and Technology, Quinta da Torre, 2829-516 Caparica, Portugal. Tel: +351 21 294 8536 ext. 10732 Fax; +351 21 294 8541; e-mail: jfs(a)fct.unl.pt

1 0

PhD in NLP, robotics and user-centred design at CSIRO and Queensland University of Technology
by Xiang DAI 10 May '24

10 May '24

CSIRO and Queensland University of Technology are offering a PhD scholarship on the topic of Human-Robot Collaboration. The goal of this project is to investigate how to process logs of raw information coming from the robot to present to the human operator an appropriate and relevant summary. The project will combine natural language processing (and using large language models), robotics and user-centred design. Expressions of Interest close: 20th May 2024 https://www.australiancobotics.org/project/project-2-5-spot-whats-happening… For informal enquiries please contact: Xiang Dai (dai.dai(a)csiro.au) Stephen Wan (stephen.wan(a)csiro.au)

1 0

SUBJECT: Call for applications (all areas of linguistics): fully funded conference, ‘Exploring the Dark Side of Future Language Technologies: Linguistic (In)security, Ethics, and Privacy in the Human-Machine Era’, UCLouvain, Belgium, 2-3 Sept 2024
by Rui Sousa Silva 09 May '24

09 May '24

Dear all, ‘Language in the Human-Machine Era’ (https://lithme.eu/) welcomes everyone interested in the impact of new and emerging language technologies that integrate with human senses. Whether you are a tech developer who wants to learn more about linguistics, or a linguist who wants to know more about tech, we want to hear from you! You can find out more about our themes of interest from our published forecast report (https://doi.org/10.17011/jyx/reports/20210518/1) and our animations (https://lithme.eu/animations). Our 4th annual conference will be held at UCLouvain (Université catholique de Louvain, Louvain-la-Neuve, Belgium) on 2-3 September 2024. This year's theme is: ‘Exploring the Dark Side of Future Language Technologies: Linguistic (In)security, Ethics, and Privacy in the Human-Machine Era’. The call for papers is now open, and we warmly encourage submissions from any eligible researcher or practitioner who is interested in exploring these timely topics. We welcome experienced developers, but no technological expertise is required, only an interest in exploring the possible effects of these near-future advances in language technology. Presentations can address any of the topics that fall within the interests of LITHME. Selection for funded places will be made by the conference scientific committee. The deadline for submitting your abstract is June 8th, 2024. More details of funding eligibility, the conference theme, and a link to the abstract submission form are on our website: https://lithme.eu/conference2024/ Please forward this message to anyone who may be interested, and please repost the social media announcements here: https://twitter.com/LgHumanMachine/status/1787799682338390457 https://bsky.app/profile/lghumanmachine.bsky.social/post/3krvwfjaqpk2f We hope to see you at the conference! All best, Rui Sousa Silva Faculdade de Letras, Universidade do Porto Faculty of Arts and Humanities, University of Porto www.linguisticaforense.pt | https://s.up.pt/qjur | http://tinyurl.com/37w2ec6x Publicação mais recente / Latest publication: ‘We Attempted to Deliver Your Package’: Forensic Translation in the Fight Against Cross-Border Cybercrime AVISO DE CONFIDENCIALIDADE: Esta mensagem e os seus anexos são confidenciais e dirigidos unicamente aos destinatários da mesma. Se não for o destinatário, solicito que não faça qualquer uso do seu conteúdo e proceda à sua eliminação, notificando-me do sucedido. Obrigado. // CONFIDENTIALITY WARNING: This message and its attachments are confidential and exclusively addressed to the recipients above. Should you not be one of the recipients, I kindly ask you not to make use of its contents and delete the message and its attachments. Please reply to this e-mail to warn me about this incident. Thank you.

1 0

2nd CFP: Special Issue on Natural Language for Artificial Intelligence in the Era of LLMs published in IJCoL
by Dominique Brunato 09 May '24

09 May '24

[Apologize for cross-posting] *Special Issue on Natural Language for Artificial Intelligence in the Era of LLMs* IJCoL - Italian Journal of Computational Linguistics, OpenEdition Journal, ISSN 2499-4553, https://www.ai-lc.it/en/journal/ Manuscript Submission Deadline: *1st July 2024* Latest Acceptance Deadline for all papers: *1st October 2024* ======================================================================== ########### *Guest Editors:* ########### • Elisa Bassignana, IT University of Copenhagen (Denmark), https://elisabassignana.github.io/ • Dominique Brunato, Institute for Computational Linguistics “A. Zampolli” (CNR-ILC) (Italy), http://www.italianlp.it/people/dominique-brunato/ • Marco Polignano, University of Bari Aldo Moro (Italy), https://marcopoli.github.io/ • Alan Ramponi, Fondazione Bruno Kessler (Italy), https://alanramponi.github.io/ ########### *Special issue information: *########### The advancement of Large Language Models (LLMs) has revolutionized the field of Natural Language Processing (NLP) and Artificial Intelligence (AI) in recent years. Transformer-based models such as GPT-3 and BERT have demonstrated remarkable capabilities in modeling and generating human-like text. These models have significantly impacted various applications, including machine translation, sentiment analysis, question answering, and more. The era of LLMs has opened up new opportunities and challenges in harnessing natural language for AI systems. The motivation behind this special issue is to provide a platform for researchers and practitioners to explore and discuss the latest advancements, methodologies, and applications of Natural Language for Artificial Intelligence in the Era of LLMs. The aim is to foster collaboration and knowledge sharing among the NLP and AI communities, enabling them to leverage LLMs effectively and ethically for solving real-world problems, as well as for tackling open research questions that contribute to a deeper understanding of the similarities and differences between human and machine learning. This special issue invites original research papers, reviews, and case studies that focus on utilizing LLMs in various natural language processing tasks and applications within the context of Artificial Intelligence. This is a natural extension of the topics covered in the NL4AI 2023 workshop at AIxIA 2023, where the primary focus of discussion among the many papers received was related to the use of LLMs to tackle many typical tasks in natural language understanding and generation. We invite researchers from both academia and industry to submit their cutting-edge research findings, unearthing novel insights and pushing the boundaries of knowledge. We hope to receive submissions not only from people attending NL4AI but also from researchers outside this ring to enrich the discourse with a broader spectrum of perspectives. ########### *Topics*: ########### Relevant topics for the proposed special issue include, but are not limited to: • Robustness and generalization of LLMs • Diversity and inclusion of LLMs • The role of linguistics in the era of LLMs • Benchmarking and evaluation of LLMs • Explainability and interpretability of LLMs through Computational Linguistics and related disciplines • Domain-specific applications of LLMs (e.g., healthcare, education, cultural heritage) • Knowledge representation and reasoning with LLMs • Machine translation and cross-lingual applications with LLMs • Applications to the Italian language and under-studied languages • Dialogue systems and conversational agents using LLms • Ethical and social implications of LLMs • Exploration of multimodality and data augmentation approaches in the era of LLMs ########### *Submission Information*: ########### Contributions will be processed as they are submitted no later than *1st July, 2024*. The latest acceptance deadline for all papers is *1st October, 2024*. Submissions must be prepared according to the submission guidelines: • https://www.ai-lc.it/en/journal/instructions-for-authors/ and must be submitted via the dedicated web page: • https://www.ai-lc.it/ijcolreview/index.php/ijcol/about/submissions • On the platform, select the menu option: “Special Issue: Natural Language for Artificial Intelligence in the Era of LLMs” The special issue will also consider extended versions (at least 30% new content) of papers published at conferences or on preprint platforms (i.e., arXiv.org). The Editors of the Journal will pre-screen all submitted articles. Articles that do not reach the scientific standards of the journal will be desk-rejected. Articles that meet the requirements will be sent to expert reviewers and will then be reviewed for publication. The peer-review procedure will involve two experts (or three, in the rare case of disagreement). IJCoL Editors avoid engaging reviewers who are close to – or have any sort of conflict of interest with – a given author. Referees may request a major or minor revision of the article. The final decision on acceptability is the Editors’ responsibility. For questions and further information, please contact the Guest Editors: Elisa Bassignana (elba(a)itu.dk), Dominique Brunato (dominique.brunato(a)ilc.cnr.it), Marco Polignano (marco.polignano(a)uniba.it), Alan Ramponi (alramponi(a)fbk.eu) More details will be provided at: https://www.ai-lc.it/en/journal/

1 0

Second CFP: LoResMT 2024 at ACL 2024
by Atul K. Ojha 09 May '24

09 May '24

Apologies for cross-posting. --------------------------------------------------------------------------- The Seventh Workshop on Technologies for Machine Translation of Low-Resource Languages (LoResMT 2024) https://www.loresmt.org/ @ ACL 2024 (August 11–16, 2024) Bangkok, Thailand SUBMISSION https://openreview.net/group?id=aclweb.org/ACL/2024/Workshop/LoResMT TIMELINE Paper submission due: May 17 (Friday), 2024, at 23:59 (Anywhere on Earth) Notification of acceptance: June 17 (Monday), 2024 Camera-ready papers due: July 1 (Monday), 2024, at 23:59 (Anywhere on Earth) Workshop dates at ACL: August 15, 2024 SCOPE Based on the success of past low-resource machine translation (MT) workshops at AMTA 2018 (https://amtaweb.org/), MT Summit 2019 ( https://www.mtsummit2019.com), AACL-IJCNLP 2020 (http://aacl2020.org/), AMTA 2021, COLING 2022 and EACL 2023, we introduce the Seventh LoResMT Workshop at ACL 2024. The workshop provides a discussion panel for researchers working on MT systems/methods for low-resource and under-represented languages in general. We would like to help review/overview the state of MT for low-resource languages and define the most important directions. We also solicit papers dedicated to supplementary NLP tools that are used in any language and especially in low-resource languages. Overview papers on these NLP tools are very welcome. It will be beneficial if the evaluations of these tools in research papers include their impact on the quality of MT output. TOPICS We are highly interested in (1) original research papers, (2) review/opinion papers, and (3) online systems on the topics below; however, we welcome all novel ideas that cover research on low-resource languages. - Neural machine translation (NMT) for low-resource languages - Use of LLMs (large language models) for low-resource MT systems - COVID-related corpora, their translations and corresponding NLP/MT systems - Work that presents online systems for practical use by native speakers - Word tokenizers/de-tokenizers for specific languages - Word/morpheme segmenters for specific languages - Alignment/Re-ordering tools for specific language pairs - Use of morphology analyzers and/or morpheme segmenters in MT - Multilingual/cross-lingual NLP tools for MT - Corpora creation and curation technologies for low-resource languages - Review of available parallel corpora for low-resource languages - Research and review papers on MT methods for low-resource languages - MT systems/methods (e.g. rule-based, SMT, NMT) for low-resource languages - Pivot MT for low-resource languages - Zero-shot MT for low-resource languages - Fast building of MT systems for low-resource languages - Re-usability of existing MT systems for low-resource languages - Machine translation for language preservation SUBMISSION INFORMATION We are soliciting two types of submissions: (1) research, review, and position papers and (2) system demonstration papers. For research, review and position papers, the length of each paper should be at least four (4) and not exceed eight (8) pages, plus unlimited pages for references. For system demonstration papers, the limit is four (4) pages. Submissions should be formatted according to the official ACL 2024 style templates. Accepted papers will be published online in the ACL 2024 proceedings and will be presented at the conference. Submissions must be anonymized and should be done using the provided submission system. Scientific papers that have been or will be submitted to other venues must be declared as such and must be withdrawn from the other venues if accepted and published at LoResMT. The review will be double-blind. Authors of an accepted paper should present their paper in person at ACL 2024. Papers should be submitted in PDF to the LoResMT Open Review. We would like to encourage authors to cite papers written in ANY language that are related to the topics, as long as both original bibliographic items and their corresponding English translations are provided. Registration is handled by the main conference (https://2024.aclweb.org/). ORGANIZING COMMITTEE (LISTED ALPHABETICALLY) Atul Kr. Ojha, University of Galway & Panlingua Language Processing LLP Chao-Hong Liu, Potamu Research Ltd Ekaterina Vylomova, University of Melbourne, Australia Jade Abbott, Retro Rabbit Jonathan Washington, Swarthmore College Nathaniel Oco, National University (Philippines) Tommi A Pirinen, UiT The Arctic University of Norway, Tromsø Valentin Malykh, Huawei Noah’s Ark lab and Kazan Federal University Varvara Logacheva, Skolkovo Institute of Science and Technology Xiaobing Zhao, Minzu University of China PROGRAM COMMITTEE (LISTED ALPHABETICALLY) Abigail Walsh, ADAPT Centre, Dublin City University, Ireland Alberto Poncelas, Rakuten, Singapore Alina Karakanta, Leiden University Amirhossein Tebbifakhr, Fondazione Bruno Kessler Anna Currey, Amazon Web Services Aswarth Abhilash Dara, Amazon Arturo Oncevay, University of Edinburgh Atul Kr. Ojha, DSI, University of Galway & Panlingua Language Processing LLP Barry Haddow, University of Edinburgh Bogdan Babych, Heidelberg University Chao-Hong Liu, Potamu Research Ltd Constantine Lignos, Brandeis University, USA Daan van Esch, Google Diptesh Kanojia, University of Surrey, UK Duygu Ataman, University of Zurich Ekaterina Vylomova, University of Melbourne, Australia Eleni Metheniti, CLLE-CNRS and IRIT-CNRS Flammie Pirinen, UiT The Arctic University of Norway, Tromsø Koel Dutta Chowdhury, Saarland University (Germany) Jade Abbott, Retro Rabbit Jasper Kyle Catapang, University of the Philippines Jindřich Libovicky, Charles University John P. McCrae, DSI, University of Galway Liangyou Li, Noah’s Ark Lab, Huawei Technologies Majid Latifi, University of York, York, UK Maria Art Antonette Clariño, University of the Philippines Los Baños Mathias Müller, University of Zurich Nathaniel Oco, De La Salle University (Philippines) Rajdeep Sarkar, Yahoo Rico Sennrich, University of Zurich Saliha Muradoglu, The Australian National University Sangjee Dondrub, Qinghai Normal University Santanu Pal, WIPRO AI Sardana Ivanova, University of Helsinki Shantipriya Parida, Silo AI Sunit Bhattacharya, Charles University Surafel Melaku Lakew, Amazon AI Wen Lai, Center for Information and Language Processing, LMU Munich Valentin Malykh, Huawei Noah’s Ark lab and Kazan Federal University CONTACT Please email loresmt(a)googlegroups.com if you have any questions/comments/suggestions.

1 0

Publication journal Research in Corpus Linguistics 12/1 (2024)
by pradocarlos＠uniovi.es 09 May '24

09 May '24

Dear all, We are very pleased to announce that the first issue of volume 12 (2024) of Research in Corpus Linguistics (RiCL) has just come out. The issue can be found at: https://ricl.aelinco.es/index.php/ricl/issue/view/25 Please find below the table of contents. With best wishes, Paula Rodríguez-Puente & Carlos Prado-Alonso Editors of RiCL ARTICLES: •A corpus-assisted approach to discursive news values analysis. Arash Javadinejad 1–29 •The contribution of aspectual auxiliary verbs to the factual value of verb periphrases in Spanish: An empirical study. Ana Fernández-Montraveta, Glòria Vázquez, Hortènsia Curell 30–58 •Recent trends in corpus design and reporting: A methodological synthesis. Brett Hashimoto, Kyra Nelson 59–88 •Adjective comparison in African varieties of English. Cristina Suárez-Gómez, Cristhian Tomàs-Vidal 89–113 •Constructions and representations of Chinese identity through England’s curatorial imagination: A corpus-assisted analysis. JJ Chan, Mathew Gillings 114–139 •A semantic analysis of bilingual compound verbs in two contact Spanish communities. Osmer Balam, Lidia Pérez Leutza, Ian Michalski, María del Carmen Parafita Couto 140–170 BOOK REVIEWS: •Review of Peters, Pam and Kate Burridge eds. 2023. Exploring the Ecology of World Englishes in the Twenty-first Century: Language, Society and Culture. Edinburgh: Edinburgh University Press. ISBN: 978-1-474-46286-0. DOI: https://doi.org/10.3366/edinburgh/9781474462853.001.0001. Philip Shaw 171–179 •Review of Leńko-Szymańska, Agnieszka and Sandra Götz eds. 2022. Complexity, Accuracy and Fluency in Learner Corpus Research. Amsterdam: John Benjamins. ISBN: 978-9-027-21258-0. DOI: https://doi.org/10.1075/scl.104. Paweł Szudarski 180–188 •Review of Mattiello, Elisa. 2022. Transitional Morphology: Combining Forms in Modern English. Cambridge: Cambridge University Press. ISBN: 978-1-009-16828-1. DOI: https://doi. org/10.1017/9781009168274. Cristina Lara-Clares, Salvador Valera 189–195 •Review of Taavitsainen, Irma, Turo Hiltunen, Jeremy J. Smith and Carla Suhr eds. 2022. Genre in English Medical Writing, 1500–1820: Sociocultural Contexts of Production and Use. Cambridge: Cambridge University Press. ISBN: 978-1-009-10534-7. DOI: https:// doi.org/10.1017/9781009105347. Irene Diego Rodríguez 196–204 •Review of Sánchez Fajardo, José A. 2022. Pejorative Suffixes and Combining Forms in English. Amsterdam: John Benjamins. ISBN: 978-9-027-25822-9. DOI: https://doi.org/10.1075/slcs.222. Anke Lensch 205–211 •Review of Zihan Yin and Elaine Vine eds. 2022. Multifunctionality in English: Corpora, Language and Academic Literacy Pedagogy. London: Routledge. ISBN 978-0-367-72509-9. DOI: https://doi. org/10.4324/9781003155072. Pascual Pérez-Paredes 212–219

1 0

2026

2025

2024

2023

2022