SIGUL

sigul@list.elra.info

2 participants
325 discussions

Second Call for Participation- IWSLT 2024
by Atul K. Ojha 16 Jan '24

16 Jan '24

Apologies for cross-posting. ---------------------------------------- *The International Conference on Spoken Language Translation* *21st IWSLT 2024 – **Second** Call for Participation* *August 15-16, 2024 – Bangkok, Thailand* *http://iwslt.org <http://iwslt.org/>* The International Conference on Spoken Language Translation (IWSLT) is the premier annual conference for all aspects of Spoken Language Translation. Every year, the conference organizes and sponsors open evaluation campaigns around key challenges in simultaneous and consecutive translation, under real-time/low latency or offline conditions and under low-resource or multilingual constraints. System descriptions and results from participants’ systems and scientific papers related to key algorithmic advances and best practices are presented. IWSLT is the venue of the SIGSLTs, the Special Interest Group on Spoken Language Translation of ACL, ISCA and ELRA. With a track record of 20 years, IWSLT benchmarks and proceedings serve as reference for all researchers and practitioners working on speech translation and related fields. The 21st edition of IWSLT <https://iwslt.org/2024/> will be run as an *ELRA/ACL* event and co-located with ACL 2024 <https://2024.aclweb.org/> on August 15-16, 2024. It will be run as a hybrid event. Important Dates January 15, 2024: Release of shared task training and dev data April 01-15, 2024: Evaluation period April 29, 2024: Paper submission due (all papers) June 4, 2024: Notification of acceptance June 24, 2024: Camera-ready paper due July 22, 2024: Pre-recorded video due August 15-16, 2024: Conference Evaluation The IWSLT 2024 features shared tasks <https://iwslt.org/2024/#shared-tasks> that address the following focus areas: - Speech-to-speech track - Simultaneous track - Subtitling track - Offline track - Dubbing track - Low-resource track - Indic track Training, development and test data for each shared task will be prepared and released by the respective organizers (for further information on this initiative, please refer to the website <https://iwslt.org/2024/>). Participants will receive instructions about how to submit their runs. In addition, participants have the opportunity to present their work through a system paper that will be published in the ACL Proceedings. Conference IWSLT also invites submissions of scientific papers to be published in the ACL Proceedings and presented either in oral or poster format. The conference selects high-quality, original contributions on theoretical and practical issues of spoken language translation research, technologies and applications. For further information on this initiative, please refer to the website <https://iwslt.org/2024/#paper-submission> Contact Please send an email to iwslt-evaluation-campaign(a)googlegroups.com if you have any questions related to the shared tasks. Thanks, Marine, Marcello, Alex, Jan, Sebastian, Elizabeth, Atul (IWSLT organisers)

1 0

2nd CfP 5th workshop on Resources for African Indigenous Language (RAIL) @ LREC-COLING
by Menno Van Zaanen 16 Jan '24

16 Jan '24

The fifth workshop on Resources for African Indigenous Language (RAIL) Colocated with LREC-COLING 2024 https://bit.ly/rail2024 Conference dates: 20-25 May 2024 Workshop date: 25 May 2024 Venue: Lingotto Conference Centre, Torino (Italy) The fifth RAIL workshop website: https://bit.ly/rail2024 LREC-COLING 2024 website: https://lrec-coling-2024.org/ Submission website: https://softconf.com/lrec-coling2024/rail2024/ The fifth Resources for African Indigenous Languages (RAIL) workshop will be co-located with LREC-COLING 2024 in Lingotto Conference Centre, Torino, Italy on 25 May 2024. The RAIL workshop is an interdisciplinary platform for researchers working on resources (data collections, tools, etc.) specifically targeted towards African indigenous languages. In particular, it aims to create the conditions for the emergence of a scientific community of practice that focuses on data, as well as computational linguistic tools specifically designed for or applied to indigenous languages found in Africa. Many African languages are under-resourced while only a few of them are somewhat better resourced. These languages often share interesting properties such as writing systems, or tone, making them different from most high-resourced languages. From a computational perspective, these languages lack enough corpora to undertake high level development of Human Language Technologies (HLT) and Natural Language Processing (NLP) tools, which in turn impedes the development of African languages in these areas. During previous workshops, it has become clear that the problems and solutions presented are not only applicable to African languages but are also relevant to many other low-resource languages. Because these languages share similar challenges, this workshop provides researchers with opportunities to work collaboratively on issues of language resource development and learn from each other. The RAIL workshop has several aims. First, the workshop brings together researchers who work on African indigenous languages, forming a community of practice for people working on indigenous languages. Second, the workshop aims to reveal currently unknown or unpublished existing resources (corpora, NLP tools, and applications), resulting in a better overview of the current state-of-the-art, and also allows for discussions on novel, desired resources for future research in this area. Third, it enhances sharing of knowledge on the development of low-resource languages. Finally, it enables discussions on how to improve the quality as well as availability of the resources. The workshop has “Creating resources for less-resourced languages” as its theme, but submissions on any topic related to properties of African indigenous languages (including non-African languages) may be accepted. Suggested topics include (but are not limited to) the following: * Digital representations of linguistic structures * Descriptions of corpora or other data sets of African indigenous languages * Building resources for (under resourced) African indigenous languages * Developing and using African indigenous languages in the digital age * Effectiveness of digital technologies for the development of African indigenous languages * Revealing unknown or unpublished existing resources for African indigenous languages * Developing desired resources for African indigenous languages * Improving quality, availability and accessibility of African indigenous language resources Submission requirements: We invite papers on original, unpublished work related to the topics of the workshop. Submissions, presenting completed work, may consist of up to eight (8) pages of content plus additional pages of references. The final camera-ready version of accepted long papers are allowed one additional page of content (up to 9 pages) so that reviewers’ feedback can be incorporated. Papers should be formatted according to the LREC- COLING style sheet (https://lrec-coling-2024.org/authors-kit/), which is provided on the LREC-COLING 2024 website (https://lrec-coling-2024.org/). Reviewing is double-blind, so make sure to anonymise your submission (e.g., do not provide author names, affiliations, project names, etc.) Limit the amount of self citations (anonymised citations should not be used). The RAIL workshop follows the LREC-COLING submission requirements. Please submit papers in PDF format to the START account (https://softconf.com/lrec-coling2024/rail2024/). Accepted papers will be published in proceedings linked to the LREC-COLING conference. Important dates: Submission deadline: 16 February 2024 Date of notification: 15 March 2024 Camera ready deadline: 29 March 2024 RAIL workshop: 25 May 2024 Organising Committee Rooweither Mabuya, South African Centre for Digital Language Resources (SADiLaR), South Africa Muzi Matfunjwa, South African Centre for Digital Language Resources (SADiLaR), South Africa Mmasibidi Setaka, South African Centre for Digital Language Resources (SADiLaR), South Africa Menno van Zaanen, South African Centre for Digital Language Resources (SADiLaR), South Africa -- Prof Menno van Zaanen menno.vanzaanen(a)nwu.ac.za Professor in Digital Humanities South African Centre for Digital Language Resources https://www.sadilar.org ________________________________ NWU PRIVACY STATEMENT: http://www.nwu.ac.za/it/gov-man/disclaimer.html DISCLAIMER: This e-mail message and attachments thereto are intended solely for the recipient(s) and may contain confidential and privileged information. Any unauthorised review, use, disclosure, or distribution is prohibited. If you have received the e-mail by mistake, please contact the sender or reply e-mail and delete the e-mail and its attachments (where appropriate) from your system. ________________________________

1 0

(no subject)
by Pranaydeep Singh 12 Jan '24

12 Jan '24

Hello everyone, My name is Pranay and I am a PhD candidate in my final year at the Language & Translation Technology Team at Ghent University, Belgium. My area of research is efficient language modelling for low-resourced languages, and I have worked on various digital humanities projects as well, Assisting with technical expertise for ancient languages such as Byzantine Greek and CUNE-IIIFORM. (Google scholar link<https://scholar.google.com/citations?user=8KSmDe4AAAAJ&hl=en>) I would like to request to join SIGUL, since my work is highly related to the research interests of the group. I have also previously published and attended at the SIGUL workshop co-located with LREC’22 in Marseille, and will be attending the SIGUL workshop at LREC-COLING’24 as well. Look forward to hearing from you. Best Regards, Pranaydeep Singh Doctoral Candidate Language & Translation Technology Team, Ghent University

1 0

Second Call for Papers - International Conference ‘New Trends in Translation and Technology’ (NeTTT’2024)
by t.ranasinghe＠aston.ac.uk 08 Jan '24

08 Jan '24

International Conference ‘New Trends in Translation and Technology’ (NeTTT’2024) Varna, Bulgaria, 3-6 July 2024 Second Call for Papers The conference The second edition of the forthcoming International Conference ‘New Trends in Translation and Technology’ (NeTTT’2024) will take place in Varna, Bulgaria, 3-6 July 2024. The objective of the conference is (i) to bridge the gap between academia and industry in the field of translation and interpreting by bringing together academics in linguistics, translation studies, machine translation and natural language processing, developers, practitioners, language service providers and vendors who work on or are interested in different aspects of technology for translation and interpreting, and (ii) to be a distinctive event for discussing the latest developments and practices. NeTTT’2024 invites all professionals who would like to learn about the new trends, present the latest work or/and share their experience in the field, and who would like to establish business and research contacts, collaborations and new ventures. The conference will take the form of presentations (peer-reviewed research and user presentations, keynote speeches), and posters; it will also feature panel discussions. The accepted papers will be published as open-access conference e-proceedings. Conference topics Contributions are invited on any topic related to latest technology and practices in machine translation, translation, subtitling, localisation and interpreting. NeTTT’2024 will feature a Special Theme Track "Future of Translation Technology in the Era of LLMs and Generative AI". The conference topics include but are not limited to: CAT tools - Translation Memory (TM) systems - NLP and MT for translation memory systems - Terminology extraction tools - Localisation tools Machine Translation - Latest developments in Neural Machine Translation - MT for under-resourced languages - MT with low computing resources - Multimodal MT - Integration of MT in TM systems - Resources for MT Technologies for MT deployment - MT evaluation techniques, metrics and evaluation results - Human evaluations of MT output - Evaluating MT in a real-world setting - Quality estimation for MT - Domain adaptation Translation Studies - Corpus-based studies applied to translation - Corpora and resources for translation - Translationese - Cognitive effort and eye-tracking experiments in translation Interpreting studies - Corpus-based studies applied to interpreting - Corpora and resources for interpreting - Interpretese - Resources for interpreting and interpreting technology applications - Cognitive effort and eye-tracking experiments in interpreting Interpreting technology - Machine interpreting - Computer-aided interpreting - NLP for dialogue interpreting - Development of NLP based applications for communication in public service settings (healthcare, education, law, emergency services) Emerging Areas in Translation and Interpreting - MT and translation tools for literary texts and creative texts - MT for social media and real-time conversations - Sign language recognition and translation Subtitling - NLP and MT for subtitling - Latest technology for subtitling User needs - Analysis of translators’ and interpreters’ needs in terms of translation and interpreting technology - User requirements for interpreting and translation tools - Incorporating human knowledge into translation and interpreting technology - What existing translators’ (including subtitlers’) and interpreters’ tools do not offer - User requirements for electronic resources for translators and interpreters - Translation and interpreting workflows in larger organisations and the tools for translation and interpreting employed The business of translation and interpreting - Translation workflow and management - Technology adoption by translators and industry - Setting up translation /interpreting / language provider company Teaching translation and interpreting - Teaching Machine Translation - Teaching translation technology - Teaching interpreting technology - Latest AI developments in the syllabi of translation and interpreting curricula Ethical issues in translation and technology - Bias and fairness in MT - Privacy and security in cloud MT systems - Transparency and explainability of MT systems - Environmental impact on MT systems Special Theme Track - Future of Translation Technology in the Era of LLMs and Generative AI We are excited to share that NeTTT’2024 will have a special theme with the goal of stimulating discussion around Large Language Models, Generative AI and the Future of Translation and Interpreting Technology. While the new generation of Large Language Models such as CHATGPT and LLAMA showcase remarkable advancements in language generation and understanding, we find ourselves in uncharted territory when it comes to their performance on various Translation and Interpreting Technology tasks with regards to fairness, interpretability, ethics and transparency. The theme track invites studies on how LLMs perform on Translation and Interpreting Technology tasks and applications, and what this means for the future of the field. The possible topics of discussion include (but are not limited to) the following: - Changes in the translators and interpreters’ professions in the new AI era especially as a result of the latest developments in LLMSs and Generative AI - Generative AI and translation - Generative AI and interpreting - Augmenting machine translation systems with generative AI - Domain and terminology adaptation with Large Language Models - Literary translation with Large Language Models - Improving Machine Translation Quality with Contextual Prompts in Large Language Models - Prompt engineering for translation - Generative AI for professional translation - Generative AI for professional interpreting We anticipate having a special session on this theme at the conference. Submissions and publication NETTT’2024 invites the following types of submissions: User papers – for industry and practitioners. References to related work are optional. Allowed paper length: between 1 and 4 pages. Academic submissions, in three different categories (have to follow formatting requirements, references to related work are required): • (academic) full papers – describing original completed research. Allowed paper length: maximum 12 pages + unlimited references. • (academic) work-in-progress papers/posters – describing work in progress, late breaking research, papers at a more conceptual stage, and other types of papers that do not fit in the ‘full’ papers category. Allowed paper length: maximum 7 pages + unlimited references. • (academic) demo papers – describing working systems. Allowed paper length: maximum 5 pages + unlimited references. In addition to the papers, the authors will be expected to demonstrate the systems at the workshop. The conference will not consider and evaluate abstracts only. Each submission will be reviewed by three members of the Programme Committee. Submission is organised via Softconf START conference management system at https://softconf.com/n/nettt2024. For submitting the papers, we invite the authors to comply with the Springer format, following the templates: • LaTeX, • Overleaf, • Word. The accepted papers will be published in the conference proceedings and made available online on the conference website. Authors of accepted papers will receive guidelines regarding how to produce camera-ready versions of their papers. The final version of the accepted papers will be published in e-proceedings with assigned ISBN and DOI. All accepted papers will be included in the conference e-proceedings which will be available at the conference website. Schedule Submission deadline: 31 March 2024 Notification: 5 June 2024 Final version due: 20 June 2024 All deadlines are valid for 23.59 Anywhere on Earth. Venue The conference will take place at Conference Hotel Cherno More, Varna, situated only 200 m away from the fine sandy Black Sea beach. Further information and contact details Registration will open on 15 January 2024. The follow-up calls will list keynote speakers and members of the programme committee once confirmed. The conference website is https://nettt-conference.com and will be updated on a regular basis. For further information, please contact us at nettt2024(a)nettt-conference.com

1 0

[2nd call] SIGTYP 2024 Shared Task on Word Embedding Evaluation for Ancient and Historical Languages
by Oksana Dereza 04 Jan '24

04 Jan '24

Dear colleagues, [apologies for cross-posting] We would like to remind you that this year SIGTYP is hosting a Shared Task on Word Embedding Evaluation for Ancient and Historical Language: https://github.com/sigtyp/ST2024/ Test data has been released, and CodaLab competitions are up and running, so we encourage you to register if you still haven't! There is still a week before the deadline. :) *Summary* In recent years, sets of downstream tasks called benchmarks have become a very popular, if not default, method to evaluate general-purpose word and sentence embeddings. Starting with decaNLP (McCann et al., 2018) and SentEval (Conneau & Kiela, 2018), multitask benchmarks for NLU keep appearing and improving every year. However, even the largest multilingual benchmarks, such as XGLUE, XTREME, XTREME-R or XTREME-UP (Hu et al., 2020; Liang et al., 2020; Ruder et al., 2021, 2023), only include modern languages. When it comes to ancient and historical languages, scholars mostly adapt/translate intrinsic evaluation datasets from modern languages or create their own diagnostic tests. We argue that there is a need for a universal evaluation benchmark for embeddings learned from ancient and historical language data and view this shared task as a proving ground for it. The shared task involves solving the following problems for 12+ ancient and historical languages that belong to 4 language families and use 6 different scripts. Participants will be invited to describe their system in a paper for the SIGTYP workshop proceedings. The task organizers will write an overview paper that describes the task and summarizes the different approaches taken, and analyzes their results. *Subtasks* For subtask A, participants are not allowed to use any additional data; however, they can reduce and balance provided training datasets if they see fit. For subtask B, participants are allowed to use any additional data in any language, including pre-trained embeddings and LLMs. A. Constrained 1. POS-tagging 2. Full morphological annotation 3. Lemmatisation B. Unconstrained 1. POS-tagging 2. Detailed morphological annotation 3. Lemmatisation 4. Filling the gaps - Word-level - Character-level *Important links* - *Registration form* <https://docs.google.com/forms/d/e/1FAIpQLSdINgMfzzZGIZ-uBVQhvyndB6yeaaj-wT7…> - Detailed description, incl. submission format: https://github.com/ sigtyp/ST2024 <https://github.com/sigtyp/ST2024> - Constrained subtask on CodaLab: https://codalab.lisn.upsaclay.fr/competitions/16822 - Unconstrained subtask on CodaLab: https://codalab.lisn.upsaclay.fr/competitions/16818 *Important dates* *05 Nov 2023*: Release of training and validation data *02 Jan 2024*: Release of test data - * 09 Jan 2024:* Submission of results for Phase 1 of the Constrained Subtask - * 12 Jan 2024:* Submission of results for Phase 2 of the Constrained Subtask and for the Unconstrained Subtask *13 Jan 2024*: Notification of results *20 Jan 2024*: Submission of shared task papers *27 Jan 2024*: Notification of acceptance to authors *03 Feb 2024*: Camera-ready *15 Mar 2024*: Video recordings due *21/22 Mar 2024*: SIGTYP workshop Kind regards, Oksana and the organisers' team -- [image: https://nuig.insight-centre.org/] <https://www.insight-centre.org/> Oksana Dereza | PhD student on the Cardamom <http://cardamom.insight-centre.org/> project | Unit for Linguistic Data | Insight Centre for Data Analytics | Data Science Institute | University of Galway Oksana Dereza | Iarrthóir PhD ar thionscadal Cardamom <http://cardamom.insight-centre.org/> | An tAonad um Shonraí Teangeolaíocha | Insight, Ionad na hAnailísíochta Sonraí | Institiúid Eolaíochta Sonraí | Ollscoil na Gaillimhe

1 0

CFP: The 3rd Annual Meeting of the ELRA-ISCA Special Interest Group on Under-resourced Languages (SIGUL2024)
by Claudia Soria 19 Dec '23

19 Dec '23

** *CFP: The3rd Annual Meeting of the ELRA-ISCA Special Interest Group on Under-resourced Languages (SIGUL2024)* * * Workshop website (under construction): https://sigul-2024.ilc.cnr.it <https://sigul-2024.ilc.cnr.it/> * When: Monday and Tuesday, May 20th-21st, 2024 * Where: Torino, Italy (co-located with LREC-COLING 2024) * Deadline for submissions: February 26th, 2024 * Paper submission link: https://softconf.com/lrec-coling2024/sigul2024/ <https://softconf.com/lrec-coling2024/sigul2024/> * Deadline for camera-ready papers: April 5th, 2024 The 3rd Annual Meeting of the ELRA <http://www.elra.info/>/ISCA <https://www.isca-speech.org/iscaweb/index.php>Special Interest Group on Under-Resourced Languages <http://www.elra.info/en/sig/sigul/>(SIGUL2024) will provide a forum for the presentation and discussion of cutting-edge research in language processing for under-resourced languages by academic and industry researchers. Following the long-standing series of previous meetings, the SIGUL workshop will also offer a venue where researchers in different disciplines and from varied backgrounds can fruitfully explore new areas of intellectual and practical development while honoring their common interest of sustaining less-resourced languages. We invite contributions (regular long papers of 8 pages or short papers of 4 pages) targeting any of the following - non-exhaustive - list of topics: * Processing any under-resourced languages (covering less-resourced, under-resourced, endangered, minority, and minoritized languages) * Cognitive and linguistic studies of under-resourced languages * Fast resources acquisition: text and speech corpora, parallel texts, dictionaries, grammars, and language models * Zero and few-shot methodologies and self-supervised learning in language and speech technologies * Cross-lingual and multilingual acoustic and lexical modeling * Speech recognition and synthesis for under-resourced languages and dialects * Machine translation and speech-to-speech translation * Spoken dialogue systems * Applications of language technologies for under-resourced languages * Large language models and under-resourced languages * Special topic: o Text and speech resources and technologies for the languages of Italy Special Session on languages of Italy and language technologies Italy is known for its linguistic diversity that reflects its long and varied history. To celebrate it, SIGUL2024 will provide a special session or forum for researchers interested in developing language resources and technologies for the many languages of Italy (regional, minority, or heritage languages, including those of the neighboring countries). Submissions Authors can choose among three paper categories: * Regular long papers – up to eight (8) pages maximum*, presenting substantial, original, completed, and unpublished work. * Short papers – up to four (4) pages*, describing work-in-progress projects in the early stage of development, new resources, negative results, system demonstrations, and early-career/student work. * Position papers – up to eight (8) pages*, for reflective considerations of methodological, best practice, and institutional issues (e.g., ethics, data ownership, speakers’ community involvement, de-colonizing approaches). The above page limits exclude any number of additional pages that may be needed for references. The form of the presentation may be oral or poster, whereas in the proceedings there is no difference between the accepted papers. Submission is NOT anonymous and the official LREC-COLING 2024 format must be adopted. Each paper will be reviewed by three independent reviewers. Invited speakers TBA Important Dates • 26 February 2024: submission due • 18 March 2024: reviews due • 22 March 2024: notifications to authors • 5 April 2024: camera-ready (PDF) due Identify, Describe and Share your LRs! When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research. Moreover, ELRA encourages all LREC-COLING authors to share the described LRs (data, tools, services, etc.) to enable their reuse and replicability of experiments (including evaluation ones). Workshop Organizers Maite Melero, Sakriani Sakti, Claudia Soria Program Committee * Mohammad A. M. Abushariah (The University of Jordan, Jordan) * Manex Agirrezabal (University of Copenhagen – Center for Sprogteknologi | Center for Language Technology, Denmark) * Shyam S. Agrawal (KIIT, Gurugram ,India) * Begoña Altuna (HiTZ Center - Ixa, Euskal Herriko Unibertsitatea | University of the Basque Country, Spain) * Antti Arppe (University of Alberta, Canada) * Martin Benjamin (Kamusi Project International) * Delphine Bernhard (Université de Strasbourg, LiLPa, France) * Steven Bird (Charles Darwin University, Australia) * Claudia Borg (University of Malta) * Matt Coler (University of Groningen, Campus Fryslân, The Netherlands) * Dan Cristea (Romanian Academy, Romania) * Pradip Kumar Das (IIT Guwahati, India) * A. Seza Doğruöz (Universiteit Gent, België | Ghent University, Belgium) * Stefano Ghazzali (Language Technologies Unit Bangor University Prifysgol Bangor | Bangor University, Bangor, Gwynedd) * Itziar Gonzalez-Dios (HiTZ Basque Center for Language Technologies - Ixa, University of the Basque Country UPV/EHU) * Lars Hellan (Norwegian University of Science and Technology, Norway) * Mélanie Jouitteau (IKER, CNRS, France) * Richard Littauer (unaffiliated) * Teresa Lynn (Mohamed bin Zayed University of Artificial Intelligence, United Arab Emirates) * Nina Markl (University of Essex, UK) * Maite Melero (Barcelona Supercomputing Center, Espanya | Spain) * Peter Mihajlik (Budapest University of Technology and Economics, Hungary) * Win Pa Pa (UCS Yangon, Myanmar) * Sandy Ritchie (Google Research) * Sakriani Sakti (JAIST, Japan) * Claudia Soria (CNR-ILC, Italia | Italy) * Daan Van Esch (Google Research) * Menno van Zaanen (South African Centre for Digital Language Resources, South Africa) * Jenifer Vega Rodriguez (GIPSA-lab, Université Grenoble Alpes, France) * Marcely Zanon Boito (NAVER Labs Europe, France) Contact claudia.soria[AT]ilc.cnr.it Please, write “SIGUL2024” in the subject of your e-mail. * -- facebook <https://www.facebook.com/CNRsocialFB> twitter <https://twitter.com/CNRsocial_> instagram <https://www.instagram.com/cnrsocial/> linkedin <https://www.linkedin.com/company/283032> Claudia Soria CNR, ISTITUTO DI LINGUISTICA COMPUTAZIONALE "ANTONIO ZAMPOLLI" claudia.soria(a)ilc.cnr.it Tel. 0503153166 Via Giuseppe Moruzzi, 1, 56124 – Pisa www.ilc.cnr.it *www.cnr.it* <http://www.cnr.it/> Devolvi il 5×1000 al CNR CF 80054330586

1 0

Partnerships in Practice - Special Theme Session at ComputEL-7
by Antti Arppe 18 Dec '23

18 Dec '23

Dear everyone, ----- CALL FOR SUBMISSIONS - PARTNERSHIPS IN PRACTICE AT COMPUTEL-7 In addition to the regular programming, we will be hosting again at the 7th ComputEL workshop a special theme session discussion at the workshop. The theme for this Special Session is “Partnerships in Practice”. The goal of this Special Session is to increase our shared understanding of how best to work together across disciplinary and cultural boundaries to support community goals for language revitalization. We invite presentations that address two broad topics: (1) Lessons Learned from Existing Partnerships and (2) Solicitations for Future Partnerships. 1. Presentations that describe existing partnerships between language communities, documentary linguists and computational linguists. We encourage submissions which address questions such as: * How did the team members meet and come to work together? * What projects have you worked on, and what tools and resources have you created? * How have those tools and resources benefitted community efforts at language maintenance and revitalization? * What are some challenges (logistical, technical, interdisciplinary, intercultural) that you encountered, and how did you address them? * How have you balanced the needs and priorities of different team members through the lifespan of the project? * What lessons have you learned that might benefit other similar collaborations 2. Presentations that describe a project between a language community and a documentary linguist, where they have identified a need for assistance from a computational linguist. We encourage submissions which address questions such as: * How did the project come about and what are its goals? * What tools or resources, if any, has the team been able to produce so far? * What other tools or resources do you wish to create, and what challenges are you facing in creating them? * How do you see that the project might benefit from assistance from a computationally oriented linguist? Submissions with participation from community members are strongly encouraged. SUBMISSIONS TO THE SPECIAL THEME SESSION Please submit anonymous extended abstracts of up to 1500 words, excluding references. Submissions are to be made through SoftConf, via the following link (among the submission options, please choose Extended abstract as the submission type and Special Session Only as the preferrred session): https://softconf.com/eacl2024/Computel-7/ The deadline for submissions is January 15, 2024 (Anywhere on Earth). Alternatively, you may indicate that your full paper or extended abstract submitted to the regular workshop can be considered for inclusion in the Special Session. Notification of acceptance to the Special Session will be sent out by January 29, 2024. All authors of papers in the Special Theme Session will be invited to contribute to a follow-up paper that synthesizes the findings of the Session. IMPORTANT DATES 15 January 2024 Deadline for submission of abstracts 29 January 2024 Notification of acceptance 21 and 22 March 2024 Workshop and Special Theme Session CONTACT AND MORE INFORMATION The organizers of the Special Theme Session can be reached by: computel.workshop(a)gmail.com. For further information, please consult our website: https://computel-workshop.org/special-theme-session-partnerships-in-practic… -- ====================================================================== Antti Arppe - Ph.D (General Linguistics), M.Sc. (Engineering) Associate Professor of Quantitative Linguistics Director, Alberta Language Technology Lab (ALTLab) Project Director, 21st Century Tools for Indigenous Languages (21C) Past President, ACL SIG for Endangered Languages (SIGEL) Department of Linguistics, University of Alberta E-mail: arppe(a)ualberta.ca, antti.arppe(a)iki.fi WWW: www.ualberta.ca/~arppe, altlab.artsrn.ualberta.ca Mānahtu ina rēdûti ihza ummânūti ihannaq - dulum ugulak úmun ingul ----------------------------------------------------------------------

1 0

CfP 5th workshop on Resources for African Indigenous Language (RAIL) @ LREC-COLING
by Menno Van Zaanen 14 Dec '23

14 Dec '23

First call for papers The fifth workshop on Resources for African Indigenous Language (RAIL) Colocated with LREC-COLING 2024 https://bit.ly/rail2024 Conference dates: 20-25 May 2024 Workshop date: 25 May 2024 Venue: Lingotto Conference Centre, Torino (Italy) The fifth RAIL workshop website: https://bit.ly/rail2024 LREC-COLING 2024 website: https://lrec-coling-2024.org/ The fifth Resources for African Indigenous Languages (RAIL) workshop will be co-located with LREC-COLING 2024 in Lingotto Conference Centre, Torino, Italy on 25 May 2024. The RAIL workshop is an interdisciplinary platform for researchers working on resources (data collections, tools, etc.) specifically targeted towards African indigenous languages. In particular, it aims to create the conditions for the emergence of a scientific community of practice that focuses on data, as well as computational linguistic tools specifically designed for or applied to indigenous languages found in Africa. Many African languages are under-resourced while only a few of them are somewhat better resourced. These languages often share interesting properties such as writing systems, or tone, making them different from most high-resourced languages. From a computational perspective, these languages lack enough corpora to undertake high level development of Human Language Technologies (HLT) and Natural Language Processing (NLP) tools, which in turn impedes the development of African languages in these areas. During previous workshops, it has become clear that the problems and solutions presented are not only applicable to African languages but are also relevant to many other low-resource languages. Because these languages share similar challenges, this workshop provides researchers with opportunities to work collaboratively on issues of language resource development and learn from each other. The RAIL workshop has several aims. First, the workshop brings together researchers who work on African indigenous languages, forming a community of practice for people working on indigenous languages. Second, the workshop aims to reveal currently unknown or unpublished existing resources (corpora, NLP tools, and applications), resulting in a better overview of the current state-of-the-art, and also allows for discussions on novel, desired resources for future research in this area. Third, it enhances sharing of knowledge on the development of low-resource languages. Finally, it enables discussions on how to improve the quality as well as availability of the resources. The workshop has “Creating resources for less-resourced languages” as its theme, but submissions on any topic related to properties of African indigenous languages (including non-African languages) may be accepted. Suggested topics include (but are not limited to) the following: * Digital representations of linguistic structures * Descriptions of corpora or other data sets of African indigenous languages * Building resources for (under resourced) African indigenous languages * Developing and using African indigenous languages in the digital age * Effectiveness of digital technologies for the development of African indigenous languages * Revealing unknown or unpublished existing resources for African indigenous languages * Developing desired resources for African indigenous languages * Improving quality, availability and accessibility of African indigenous language resources Submission requirements: We invite papers on original, unpublished work related to the topics of the workshop. Submissions, presenting completed work, may consist of up to eight (8) pages of content plus additional pages of references. The final camera-ready version of accepted long papers are allowed one additional page of content (up to 9 pages) so that reviewers’ feedback can be incorporated. Papers should be formatted according to the LREC- COLING style sheet (https://lrec-coling-2024.org/authors-kit/), which is provided on the LREC-COLING 2024 website (https://lrec-coling-2024.org/). Reviewing is double-blind, so make sure to anonymise your submission (e.g., do not provide author names, affiliations, project names, etc.) Limit the amount of self citations (anonymised citations should not be used). The RAIL workshop follows the LREC-COLING submission requirements. Please submit papers in PDF format to the START account (the submission link will be available soon). Accepted papers will be published in proceedings linked to the LREC-COLING conference. Important dates: Submission deadline: 16 February 2024 Date of notification: 15 March 2024 Camera ready deadline: 29 March 2024 RAIL workshop: 25 May 2024 Organising Committee Rooweither Mabuya, South African Centre for Digital Language Resources (SADiLaR), South Africa Muzi Matfunjwa, South African Centre for Digital Language Resources (SADiLaR), South Africa Mmasibidi Setaka, South African Centre for Digital Language Resources (SADiLaR), South Africa Menno van Zaanen, South African Centre for Digital Language Resources (SADiLaR), South Africa -- Prof Menno van Zaanen menno.vanzaanen(a)nwu.ac.za Professor in Digital Humanities South African Centre for Digital Language Resources https://www.sadilar.org ________________________________ NWU PRIVACY STATEMENT: http://www.nwu.ac.za/it/gov-man/disclaimer.html DISCLAIMER: This e-mail message and attachments thereto are intended solely for the recipient(s) and may contain confidential and privileged information. Any unauthorised review, use, disclosure, or distribution is prohibited. If you have received the e-mail by mistake, please contact the sender or reply e-mail and delete the e-mail and its attachments (where appropriate) from your system. ________________________________

1 0

PROPOR24 Deadline extension + last call for demos.
by Iria de Dios Flores 11 Dec '23

11 Dec '23

The PROPOR 2024 demonstration program committee invites submissions for demonstrations. Following the spirit of previous PROPOR editions, the demonstration track aims at bringing together academia and industry, creating a forum where more than written or spoken descriptions of research are available. Thus, demos should allow attendees to try and test them during their presentation in a dedicated session that will provide a more informal and interactive setting. Products, systems, or tools are examples of acceptable demos. Both early-research prototypes and mature systems may also be considered. *Important dates:* Demos Submission: January 10 2024 Notification of acceptance or rejection: February 21 2024 Camera-ready demo paper: February 28 2024 Conference: March 14 and 15 2024 *Topics:* The areas of interest include all topics related to theoretical and applied issues of written and spoken Portuguese and Galician, such as, but not limited to, the same topics as for the conference paper submission: Natural language processing tasks (e.g. parsing, word sense disambiguation, coreference resolution) Natural language processing applications (e.g. question answering, subtitling, summarization, sentiment analysis) Natural language generation Information extraction and information retrieval Speech technologies (e.g. spoken language generation, speech and speaker recognition, spoken language understanding) Speech applications (e.g. spoken language interfaces, dialogue systems, speech-to-speech translation) Resources, standardization and evaluation (e.g. corpora, ontologies, lexicons, grammars) NLP-oriented linguistic description or theoretical analysis Distributional semantics and language modeling Portuguese language varieties and dialect processing (including the language varieties of Angola, Brazil, Cape Verde, East Timor, Galicia, Guinea-Bissau, Macau, Mozambique, Portugal, São Tomé, and Principe) Multilingual studies, methods, applications, and resources including Portuguese/Galician The systems may be of the following kinds: Natural Language Processing systems or system components Application systems using language technology components Software tools for computational linguistics research Software for demonstration or evaluation Development tools *Submissions:* Submissions should consist of a non-anonymous brief description document of up to three pages of content, including references. Developers must outline the main characteristics of their system/product/tool, provide sufficient details to allow its evaluation, and give information on how they plan to demonstrate it. Developers are encouraged to focus their description on the relevance of the computational processing component of Portuguese or Galician in the proposed system. Submissions should be written in English. At submission time, only PDF format is accepted. For the final versions, authors of accepted papers will be given one extra content page to take the reviews into account. Authors of accepted papers will be requested to send the source files for the production of the proceedings. Submissions must be sent via EasyChair ( https://easychair.org/my/conference?conf=propor2024) — please select the track: PROPOR2024 Demo Paper. All submitted papers must conform to the official ACL style guidelines. ACL provides style files for LaTeX and Microsoft Word that meet these requirements. They can be found at: LaTeX styelesheet: https://github.com/acl-org/acl-style-files/tree/master/latex MS Word stylesheet: https://github.com/acl-org/acl-style-files/tree/master/word Publication: Accepted demo papers are expected to be published by ACL as a volume in ACL Anthology (https://aclanthology.org/) as part of the PROPOR 2024 proceedings. They will be available online. To ensure publication, at least one author of each accepted paper must complete an adequate registration for PROPOR 2024 by the early registration deadline. *Presentation format:* Accepted demos will be presented at a designated demo session with an optional accompanying poster. Developers should make sure they could run their demos properly. Thus, it is the authors’ responsibility to provide the necessary technical conditions (i.e. equipment) for the demo at the conference. Note that the local organizers will not provide any hardware or software. Free high-speed Internet access will be available. There will be a best demo award for the best-presented project. Further details on the date, time, and instructions of the demonstration session(s) will be determined and provided at a later date. *Demo chairs:* Marlo Souza (Universidade Federal da Bahia, Brazil) Iria de-Dios-Flores (Universidade de Santiago de Compostela, Spain) -- *Iria de-Dios-Flores (PhD)* *https://sites.google.com/view/iriadediosflores/ <https://sites.google.com/view/iriadediosflores/>*

1 0

Call for Papers MM4SG@WebConf 2024
by Surendrabikram Thapa 07 Dec '23

07 Dec '23

Hello everyone, We are organizing the 𝐅𝐢𝐫𝐬𝐭 𝐈𝐧𝐭𝐞𝐫𝐧𝐚𝐭𝐢𝐨𝐧𝐚𝐥 𝐖𝐨𝐫𝐤𝐬𝐡𝐨𝐩 𝐨𝐧 𝐌𝐮𝐥𝐭𝐢𝐦𝐨𝐝𝐚𝐥 𝐂𝐨𝐧𝐭𝐞𝐧𝐭 𝐀𝐧𝐚𝐥𝐲𝐬𝐢𝐬 𝐟𝐨𝐫 𝐒𝐨𝐜𝐢𝐚𝐥 𝐆𝐨𝐨𝐝 (𝐌𝐌𝟒𝐒𝐆) co-located with the WebConf 2024 (A* Conference). We invite original contributions on a wide range of topics related to multimodal content analysis for social good with a focus on computational linguistics, including, but not limited to: ⦿ 𝐌𝐌𝟒𝐒𝐆: Hate, Troll, Cyberbullying, Scams and Abuse Detection ⦿ 𝐌𝐌𝟒𝐒𝐆: Fake News, Misinformation, Rumor and Event Detection ⦿ 𝐌𝐌𝟒𝐒𝐆: Multimodal Sentiment Analysis ⦿ 𝐌𝐌𝟒𝐒𝐆: Disaster Response and Crisis Management in the Web ⦿ 𝐌𝐌𝟒𝐒𝐆: Multimodal Healthcare applications using Web data ⦿ 𝐌𝐌𝟒𝐒𝐆: Multimodal content analysis for sustainable development goals (SDGs) ⦿ 𝐌𝐌𝟒𝐒𝐆: New Datasets for Multimodal Content Analysis on the internet ⦿ 𝐌𝐌𝟒𝐒𝐆: Multimodal content generation and analysis ⦿ 𝐌𝐌𝟒𝐒𝐆: Large Language Models for Multimodal Model Content Analysis on the internet ⦿ 𝐌𝐌𝟒𝐒𝐆: Foundation Models for Multimodal Content Analysis on the internet ⦿ 𝐌𝐌𝟒𝐒𝐆: Socially Responsible Multimodal Content Analysis: Fairness, Bias, Accountability, and Transparency Website: https://lnkd.in/eA3Xaa2m Submit Paper: https://lnkd.in/eyy_DCBN If you have any questions, kindly email: surendrabikram [at] vt [dot] com We appreciate your support. Best Regards, Workshop Chairs First International Workshop on Multimodal Content Analysis for Social Good (MM4SG)

1 0

← Newer
1
...
14
15
16
17
18
19
20
...
33
Older →

Jump to page:

2026

2025

2024

2023

2022

2021

2020

2019

2018

2017

SIGUL