- Corpora - ELRA lists

PhD position in Language Technology (3 years)
by Samia Touileb 02 May '24

02 May '24

We are seeking a motivated candidate to join our research team in MediaFutures, at the University of Bergen, Norway. The primary task of this position will be to develop novel techniques for generating news articles. This involves creating resources that adapt lexical, grammatical, and stylistic choices based on various parameters, including user profiles, cognitive accessibility, and journalistic formats. We are also interested in exploring how news content can be versioned and adapted dynamically. This includes tailoring news articles to different user preferences and user segments, ensuring readability, and optimizing content delivery across various platforms. We expect that the candidate will explore how large language models can be used for news generation while maintaining ethical and responsible practices. The position also offers the opportunity to collaborate with industry partners and gather domain-specific datasets from leading Norwegian media houses. This real-world collaboration will enhance the relevance and impact of the produced research. The PhD candidate will work at MediaFutures in Work Package 5 and will cooperate with researchers and partners in the work package, including the Language Technology Group at the University of Oslo, the National Library of Norway, Schibsted, Amedia, and TV 2. In addition to relevant researchers and partners in other work packages. - As an applicant you should have an excellent written and spoken command of English. Proficiency in Norwegian is an important advantage, but *not* a requirement. The deadline is 25th May 2024. For more details about the position and how to apply see: https://www.jobbnorge.no/en/available-jobs/job/262259/phd-position-in-langu… If you have any questions, do not hesitate to contact me. Best, Samia *---* *Samia Touileb* *Associate Professor in Natural Language Processing* *Department of Information Science and Media Studies,* *University of Bergen* *MediaFutures: Research Center for Responsible Media Technology & Innovation*

1 0

2nd CfP: Sixth Workshop on Teaching NLP
by Biester, Laura 02 May '24

02 May '24

Second Call For Papers: Sixth Workshop on Teaching NLP at ACL 2024 The Sixth Workshop on Teaching NLP will be co-located with the 2024 Annual Meeting of the Association for Computational Linguistics in Bangkok, Thailand. The workshop will occur on August 15 (hybrid option available). The one-day workshop will combine a program of traditional keynotes, posters, and oral presentations, with discourse through panel discussion, and focus on building a community for sharing resources. Call for Papers The field of Natural Language Processing (NLP) is growing rapidly, with new state-of-the-art methods emerging every year. This rapid growth challenges educators of NLP courses and degree programs to constantly revise their old material and create fresh NLP courses and degree programs, as well as new best practices and educational materials focused on emerging subareas of NLP. To support those facing these challenges, our one-day workshop will bring together the communities of NLP research and education to facilitate active discussion on questions including (but not limited to): * How can we facilitate meaningful conversations about language among Computer Science students? * How do we include user-centered design in core NLP curricula? * How should NLP educators design curricula that equip students with the ability to advance responsible and ethical NLP? * How can we design assignments that require GPU access or the use of paid APIs? * What are best practices that NLP educators from universities, industry groups, and Massive OpenOnline Courses (MOOCs) can use to share tools and resources for NLP education? This timely sixth edition of the Teaching NLP Workshop builds on prior successful offerings to tackle the most pressing issues in how to design NLP courses and bring together instructors from various backgrounds to discuss, create, and refine instructional design and material. Submission Information We welcome two submission types: teaching materials and papers: Teaching Materials (short papers) We invite short paper submissions of 1-2 pages that describe teaching materials such as curricula, course GitHub repositories, Jupyter notebooks, slides, homework, and assignments. These short papers do not need to be anonymised, but will be peer-reviewed and published in workshop proceedings, as well as presented in posters or demos. The corresponding teaching materials, while not being part of proceedings, should be submitted in addition to the short paper. We will create a Teaching NLP repository/wiki where authors may opt-in to make their materials available for the community after the workshop. Papers We invite papers of up to 8 pages discussing pedagogical aspects of NLP, focusing on (but not limited to) any of the following general topics: * Tools and methodologies (e.g., active learning, flipped classroom) * Scaling curricula to fit large class sizes * Adapting existing curricula to incorporate new NLP advancements * Teaching online NLP courses or adjusting courses to become remote * Challenges of designing the first NLP course or related degree program at a college, university, or on a MOOC platform * Teaching heterogenous groups of students (e.g., with respect to prior experience in computer science and linguistics) * Teaching underrepresented students * Bridging the gap between academic training and industry needs * Incorporating ethics, reproducibility, and responsible practices in NLP courses * Teaching multilingual NLP All submissions will be processed through OpenReview<https://openreview.net/group?id=aclweb.org/ACL/2024/Workshop/TeachNLP>. Important Dates * Paper Submission: May 17, 2024 * Notification of Acceptance: June 17, 2024 * Camera-Ready Deadline: July 1, 2024 * Teaching NLP Workshop: August 15, 2024 Website: https://sites.google.com/view/teachingnlpacl2024/ Contact: teachingnlp.yt(a)gmail.com<mailto:teachingnlp.yt@gmail.com> Best, TeachingNLP 2024 Organizers (Sana Al-azzawi, Laura Biester, György Kovács, Ana Marasović, Leena Mathur, Margot Mieskes, Leonie Weissweiler)

1 0

ESSLLI 2024 - Call for Participation
by Tim Van de Cruys 02 May '24

02 May '24

We invite you to participate in the 35th European Summer School in Logic, Language and Information (ESSLLI), taking place from 29 July - 9 August, 2024 at the University of Leuven, Belgium. https://2024.esslli.eu/ * Overview The European Summer School in Logic, Language and Information (ESSLLI) is a yearly recurring event, organized under the auspices of the Association for Logic, Language and Information (FoLLI), and has been running since 1989. The ESSLLI Summer School provides an interdisciplinary setting in which courses and workshops are offered in logic, linguistics and computer science, also from wider scientific, historical, and philosophical perspectives. ESSLLI attracts around 400 participants from Europe, the Middle East, Asia and Africa, as well as from North America and Latin America. ESSLLI has become the main meeting place for young researchers and students in logic, linguistics and computer science to discuss current research and to share knowledge. The event is unique in its interdisciplinary set-up, with no equivalents in Europe. * Programme The ESSLLI Summer School offers an exciting two-week programme, consisting of the following: - Foundational, introductory and advanced courses in three areas: Language and Computation, Logic and Computation, and Logic and Language - Workshops in logic, linguistics and computer science - Student session - Evening lectures - Social activities The full program can be found on the website. * Registration: Registration for attendees, course lecturers, student session and workshop organizers and speakers is now open. The early-registration deadline is Friday, 31st May. https://2024.esslli.eu/registration/registration.html * Accommodation ESSLLI is offering affordable accommodation to all participants who book before 31st May. We cannot guarantee accommodation for registrations received after this date.

1 0

CLEF-2024 CheckThat! Lab -- Start of the Evaluation Cycle
by Firoj Alam 01 May '24

01 May '24

(apologies for cross-posting) Dear colleague, We have launched the test stage of the CheckThat! 2024 lab at CLEF. The test datasets and the submission websites are ready. Please, visit the CheckThat! website to download the data/notebooks and read the instructions to submit your runs: http://checkthat.gitlab.io/clef2024/ Do notice that each of the 6 tasks runs independently and that you can participate in one or more of them. Do not hesitate to get in touch, would you face any issue. The deadline to submit your runs is on 6 May 2024 (23:59 AOE). Further information: https://checkthat.gitlab.io/ Datasets: https://gitlab.com/checkthat_lab/clef2024-checkthat-lab <https://gitlab.com/checkthat_lab/clef2023-checkthat-lab> Best regards, The CheckThat! organisers

1 0

WASSA 2024 Shared Task on Empathy Emotion and Personality Detection in Interactions within ACL 2024
by Val Barriere 30 Apr '24

30 Apr '24

[Apologies for cross-posting] We invite you to participate in the shared task on Empathy and Personality Detection in interactions, organized as part of WASSA 2024 <https://workshop-wassa.github.io/>at ACL 2024 <https://2024.aclweb.org/>. This task aims to develop models that can predict Empathy, Emotion, and Personality recognition in short text and at the speech-turn in a conversation. Task Description You can participate in five different tracks: - Track 1: Empathy Prediction in Conversations (CONV-dialog), which consists in predicting the perceived empathy at the dialog-level - Track 2: Empathy and Emotion Prediction in Conversations Turns (CONV-turn), which consists in predicting the perceived empathy, emotion polarity, and emotion intensity at the speech-turn-level in a conversation - Track 3: Empathy Prediction (EMP), which consists in predicting both the empathy concern and the personal distress at the essay-level - Track 4: Personality Prediction (PER), which consists in predicting the personality (openness, conscientiousness, extraversion, agreeableness, and emotional stability; OCEAN) of the essay writer, knowing all of their essays, dialogs, and the news article from which they reacted Note: You are free to participate in any or both tracks. For participation, please check: https://codalab.lisn.upsaclay.fr/competitions/18810 Important Dates - April 24, 2024: Codalab competition website goes online, and training/development data released - May 29, 2024: Evaluation phase begins: test data released - June 1, 2024: Evaluation phase ends: Deadline for final submission on Codalab - June 12, 2024: Deadline system description paper (max. 4p) - June 22, 2024: Notification of acceptance - July 1, 2024: Camera-ready papers due Task Organizers - Salvatore Giorgi - National Institute on Drug Abuse [C], USA - Valentin Barriere - Universidad de Chile , Chile - Joao Sedoc - New York University, USA - Shabnam Tafreshi - Evernorth Healthcare, USA Contact wassa24empathy [at] gmail [dot] com Join Google Group wassa24empathy(a)googlegroups.com

1 0

Call for Participation: Holocaust Testimonies as Language Resources - co-located with LREC-COLING 2024
by Francesca Frontini 30 Apr '24

30 Apr '24

*Call for Participation: The Workshop on Holocaust Testimonies as Language Resources* Date: 21 May 2024 (full day) Venue: Lingotto Conference Centre, Turin, Italy Webpage and programme: https://www.clarin.eu/HTRes2024 <https://url6.mailanyone.net/scanner?m=1rX22L-0002ym-4R&d=4%7Cmail%2F90%2F17…> <https://url6.mailanyone.net/scanner?m=1rX22L-0002ym-4R&d=4%7Cmail%2F90%2F17…> *Registration* Registration is obligatory and can be carried out via the LREC-COLING 2024 website: https://lrec-coling-2024.org/. Workshop description Holocaust testimonies serve as a bridge between survivors and history’s darkest chapters, providing a connection to the profound experiences of the past. Testimonies stand as the primary source of information that describe the Holocaust, offering first-hand accounts and personal narratives of those who experienced it. The majority of testimonies are captured in an oral format, as survivors vividly explain and share their personal experiences and observations from that time period. Transforming Holocaust testimonies into a machine-processable digital format can be a difficult task owing to the unstructured nature of the text. The creation of accessible, comprehensive, and well-annotated Holocaust testimony collections is of paramount importance to our society. These collections empower researchers and historians to validate the accuracy of socially and historically significant information, enabling them to share critical insights and trends derived from these data. This workshop will investigate a number of ways in which techniques and tools from natural language processing and corpus linguistics can contribute to the exploration, analysis, dissemination and preservation of Holocaust testimonies. *Programme* Please refer to the website for the details of the programme: https://www.clarin.eu/HTRes2024 Contact Email: holocausttlr(a)gmail.com *Invited speakers* Silvia Calamai (Siena University) Michal Frankl (Masaryk Institute and Archives of the Czech Academy of Sciences) *Organising Committee* Isuri Anuradha, University of Wolverhampton, UK Ingo Frommholz, University of Wolverhampton Francesca Frontini, CNR-ILC, Italy & CLARIN Martin Wynne, Oxford University, UK Ruslan Mitkov, Lancaster University, UK Paul Rayson, Lancaster University, UK Alistair Plum, University of Luxembourg, Luxembourg

1 0

PhD thesis offer : Language and vision-based mobility assistance for visually impaired people
by Christophe Lohr 30 Apr '24

30 Apr '24

*Summary* • Subject: Language & vision-based mobility assistance for visually impaired people • Keywords: Assistive Technologies, Visual Impairment, Computer Vision, Image Captioning • Research Unit: Lab-STICC (UMR CNRS 6285) • Team: RAMBO - Robot interaction, Ambient system, Machine learning, Behaviour, Optimization • Location: IMT Atlantique, Brest • Start: September/October 2024 • Duration: 3 years • Supervision: Panagiotis Papadakis, Christophe Lohr *Full subject description:* https://www.imt-atlantique.fr/sites/default/files/recherche/Offres%20de%20t… *Application * The candidate must hold (or is about to obtain) a Master Degree in Computer Science with theoretical and practical skills in AI algorithms and associated deep-learning tools (e.g. Pytorch), and a solid background in Computer Vision. The candidate should be fluent in English (working and publishing main language), but French speaking is an advantage (meetings with end-users representatives). A detailed application should be addressed to thesis-application-rambo(a)imt-atlantique.fr, including a cover letter, an up-to-date CV, transcripts of grades (last two years), and a list of referees. *Deadline: 17 May 2024 *

1 0

[CFP] Workshop on Multimodal Semantic Representations
by Lucia Donatelli 30 Apr '24

30 Apr '24

SECOND CALL FOR PAPERS *The Second Workshop on Multimodal Semantic Representations (MMSR II)* Co-located with ECAI 2024 (https://www.ecai2024.eu/) 19-24 October, Santiago de Compostela, Spain (workshop on 19 or 20 October) *Workshop website*: https://mmsr-workshop.github.io/ *Description* The demand for more sophisticated natural human-computer and human-robot interactions is rapidly increasing as users become more accustomed to conversation-like interactions with AI and NLP systems. Such interactions require not only the robust recognition and generation of expressions through multiple modalities (language, gesture, vision, action, etc.), but also the encoding of situated meaning. When communications become multimodal, each modality in operation provides an orthogonal angle through which to probe the computational model of the other modalities, including the behaviors and communicative capabilities afforded by each. Multimodal interactions thus require a unified framework and control language through which systems interpret inputs and behaviors and generate informative outputs. This is vital for intelligent and often embodied systems to understand the situation and context that they inhabit, whether in the real world or in a mixed-reality environment shared with humans. Furthermore, multimodal large language models appear to offer the possibility for more dynamic and contextually rich interactions across various modalities, including facial expressions, gestures, actions, and language. We invite discussion on how representations and pipelines can potentially integrate such state-of-the-art language models. We solicit papers on multimodal semantic representation, including but not limited to the following topics: - Semantic frameworks for individual linguistic co-modalities (e.g. gaze, facial expression); - Formal representation of situated conversation and embodiment, including knowledge graphs, designed to represent epistemic state; - Design, annotation, and corpora of multimodal interaction and meaning representation; - Challenges (including cross-lingual and cross-cultural) in multimodal representation and/or processing; - Criteria or frameworks for evaluation of multimodal semantics; - Challenges in aligning co-modalities in formal representation and/or NLP tasks; - Design and implementation of neurosymbolic or fusion models for multimodal processing (with a representational component); - Methods for probing knowledge of multimodal (language and vision) models; - Virtual and situated agents that embody multimodal representations of common ground. *Submission Information* Two types of submissions are solicited: long papers and short papers. Long papers should describe original research and must not exceed 8 pages, excluding references. Short papers (typically system or project descriptions, or ongoing research) must not exceed 4 pages, excluding references. Both types will be published in the workshop proceedings. Accepted papers get an extra page in the camera-ready version. We strongly encourage students to submit to the workshop. *Important Dates* May 15, 2024: Submissions due July 1, 2024: Notification of acceptance decisions August 2, 2024: Camera-ready papers due Papers should be formatted using the ECAI style files, available at: https://www.ecai2024.eu/calls/main-track Papers will be submitted in PDF format via Chairing Tool at the following link: https://chairingtool.com/conferences/2MMSR24/MainTrack Please do not hesitate to reach out with any questions. Best regards, Richard Brutti, Lucia Donatelli, Nikhil Krishnaswamy, Kenneth Lai, & James Pustejovsky MMSR II organizers Web page: https://mmsr-workshop.github.io/

1 0

[Iber AuTexTification|IberLEF2024] Test set released
by Francisco Manuel Rangel Pardo 30 Apr '24

30 Apr '24

Event Notification Type: Test set released. Website: https://sites.google.com/view/iberautextification *TEST SET RELEASED* *IberAuTexTification* *Automated Text Identification on Languages of the Iberian Peninsula* Dear All, The test dataset for the IberAuTexTification 2024 shared task has been released. It can be found on the shared task website ( https://sites.google.com/view/iberautextification/data), in Genaios Github repository (https://github.com/Genaios/IberAuTexTification) and Zenodo ( https://zenodo.org/records/11034382). Please, remember that registration is on a per-team basis, meaning only one member from each team needs to sign up. Once requested the test dataset on one of the previous links, you will receive the download permission and a password to decompress the data within 24 hours. Please, make sure to write your email address correctly, since we will send passwords, as well as future notifications to that address. Please reach out to the organizers or join the Slack workspace to connect with the other participants and organizers. Best regards on behalf of IberAuTexTification shared tasks organizers

1 0

AIRiAL CFP closing today
by Voss, Erik 30 Apr '24

30 Apr '24

** apologies for cross-posting** 2nd Annual Artificial Intelligence Research in Applied Linguistics (AIRiAL) Conference at Teachers College, Columbia University Call for Proposals: *closing today (30 April, 2024)* Theme: AI in Education: Empowering Learners & Preparing Educators Location: Teachers College, Columbia University, New York City Dates: September 27-28, 2024 For more information and submission form: https://tinyurl.com/CFPAIRiAL2024 -- Erik Voss, Ph.D. Assistant Professor, Applied Linguistics & TESOL program Language & Technology Specialization Department of Arts & Humanities Teachers College, Columbia University TC Faculty Profile <https://www.tc.columbia.edu/faculty/ev2449/>, Linkedin Profile <https://www.linkedin.com/in/erik-voss-ph-d-941a3ab9>, Google Scholar <https://scholar.google.com/citations?user=FMnVdjcAAAAJ&hl=en> ALTESOL Language & Technology Research Group <https://sites.google.com/tc.columbia.edu/al-tesol-language-technology/home> Editor-in-Chief of NYS TESOL Journal Associate Editor of Language Assessment Quarterly *Latest Publications* TC Interview: How New Artificial Intelligence Tools Will Keep Changing Education <https://youtu.be/Zh1RB7DLRMI?si=vDIvowSnzrWy480P>(7:28 mins.) Voss, E. et al. (2023). The Use of Assistive Technologies Including Generative AI by Test Takers in Language Assessment: A Debate of Theory and Practice. <https://doi.org/10.1080/15434303.2023.2288256> LAQ Journal Voss, E. (2024) Duolingo Webinar: Current Applications of Artificial Intelligence in Language Assessment <https://youtu.be/b-mjLmvXLBU?si=nmph76-lizkfzi1J> (1 hour)

1 0

2026

2025

2024

2023

2022

Corpora