- Corpora - ELRA lists

2nd CfP: GermEval2024 Shared Task GerMS-Detect - Sexism Detection and Annotator Disagreement Prediction in German Online News Fora @Konvens 2024
by stephanie.gross＠ofai.at 15 May '24

15 May '24

GermEval2024 Shared Task: GerMS-Detect -- Sexism Detection and Annotator Disagreement Prediction in German Online News Fora ===================================================================================== 2nd CALL FOR PARTICIPATION We would like to invite you to the GermEval Shared Task GerMS-Detect on Sexism Detection and Annotator Disagreement Prediction in German Online News Fora collocated with Konvens 2024 (https://konvens-2024.univie.ac.at/). Competition Website: https://ofai.github.io/GermEval2024-GerMS/ Important Dates ------------------ Development phase: May 1 - June 5, 2024 (ongoing) Competition phase: June 7 - June 25, 2024 Paper submission due: July 1, 2024 Camera ready due: July 20, 2024 Shared Task @KONVENS: 10 September, 2024 Task description ------------------ This shared task is not just about the detection of sexism/misogyny in comments posted in (mostly) German language to the comment section of an Austrian online newspaper: many of the texts to be classified contain ambiguous language, very subtle ways to express misogyny or sexism or lack important context. For these reasons, there can be quite some disagreement between annotators on the appropriate label. In many cases, there is no single correct label. For this reason the shared task is not just about correctly predicting a single label chosen from all the labels assigned by human annotators, but about models which can predict the level of disagreement, the range of labels assigned by annotators or the distribution of labels to expect for a specific group of annotators. For details see the Competition Website (https://ofai.github.io/GermEval2024-GerMS/). Organizers ------------ The task is organized by the Austrian Research Institute for Artificial Intelligence (OFAI). Organizing team ------------------ Brigitte Krenn (brigitte.krenn (AT) ofai.at) Johann Petrak (johann.petrak (AT) ofai.at) Stephanie Gross (stephanie.gross (AT) ofai.at)

1 0

Seeking alpha/beta testers for text manipulation software
by Greg Lessard 15 May '24

15 May '24

I have written several little text manipulation tools that I would like anyone interested to try out and give me comments and suggestions on. They are pure JavaScript (no libraries) and work using arrays so they can handle texts up to about the size of a novel. They can be used from their website or alternatively saved and used offline on any device from a phone to a laptop. The tools include: vlviewtext.html: a tool for viewing a text file in either text or concordance mode with fast switching between the two views. vlmakelist.html: a tool for creating a wordlist or frequency list from a text file, the former as a csv file, the latter as html or csv vltaglist.html: a tool for creating or editing tagged wordlists with up to three levels of tags The tools may be found at: https://vincilingua.ca/Tools/index.html Each comes with a basic online manual and sample texts in English and French may be found on the site. Please send any comments or suggestions to me directly at lessardg(a)protonmail.com. With thanks in advance, Greg Lessard

1 0

Workshop on COuntering Disinformation with AI @ECAI: CFP (Deadline extended)
by Rajesh Sharma 15 May '24

15 May '24

Dear All, We invite paper submissions to the Workshop on COuntering Disinformation with AI (CODAI), which will take place on 20 October at ECAI 2024. *Website:* https://codai2024.github.io/ *Important dates* Submission deadline: 24th May 2024 Accept/Reject Communications: 1st July 2024 Camera-ready papers due: 22nd July 2024 Workshop date: 20 October 2024 All deadlines are 11:59 pm UTC-12 (“anywhere on earth”). *Overview* Social media platforms which have been designed primarily to allow users to create and share content with others, have become integral parts of modern communication, enabling people to connect with each other as well as for broadcasting information to a wider audience. On one side these platforms provide an opportunity to facilitate discussions in an open and free environment. On the flip side, new societal issues have started emerging on these platforms. Among all the issues, the topic of misinformation has been prevalent on these platforms. The term misinformation is an umbrella term which encompasses various entities such as fake news, hoaxes, rumors to name a few. While misinformation refers to non-intentional spread of non-authentic information, the term disinformation points to spreading of a piece of inauthentic information with certain malign intentions. *Topics* Areas of interest to include, but are not limited to, the following: - Information diffusion models for understanding and thwarting the spread of low-quality information; - Characterization and detection of coordinated inauthentic behavior; - Novel techniques for detecting malicious accounts (e.g., bots, cyborgs and trolls); - Information diffusion models for understanding and thwarting the spread of low-quality information; - Understanding and detection of disinformation; - Study, inference and detection of narratives in disinformation campaigns; - Impact/Harm of misinformation on society. - Case-studies on the spread and impact of fake news in controversial topics such as politics, health, climate change, economics, migration. - Social and psychological studies, or data analytics related to misinformation spreaders. - Metrics, tools and methods for measuring the impact of fake news and of coordinated inauthentic behaviors; - Datasets for evaluation. *Submission Link:* https://chairingtool.com/ *Submission Types* *Original submissions:* The submissions will be reviewed through a double-blind process and must remain anonymous. They can be either short papers (2-4 pages) or long papers (6-8 pages), with additional pages allowed for references. . *Non-archival option:* In addition to regular paper submissions, authors have the option of submitting previous research or abstract as non-archival. Accepted submissions will be presented at the workshop as oral presentations. *Format and styling* Submissions should be formatted according to the ECAI formatting instructions and not exceed 7 pages (plus 1 extra page for references). All submissions should use the ECAI 2024 template and formatting requirements specified by ECAI. Please send any questions about the workshop to codaihelp(a)gmail.com *Organisers* Rajesh Sharma, University of Tartu, Estonia Anselmo Peñas, Universidad Nacional de Educación a Distancia (UNED), Spain

1 0

[CFP] ACM TORS: Special Issue on Recommender Systems for Good
by antonela.tommasel＠isistan.unicen.edu.ar 14 May '24

14 May '24

CALL FOR PAPERS: ACM Transactions on Recommender Systems Special Issue on Recommender Systems for Good Submission deadline: 1 September 2024 Guest Editors: - Marko Tkalčič, University of Primorska, Slovenia - Noemi Mauro, University of Turin, Italy - Alan Said, University of Gothenburg, Sweden - Nava Tintarev, University of Maastricht, Netherlands - Antonela Tommasel, ISISTAN, CONICET-UNCPBA, Argentina Recommender systems are among the most widely used applications of machine learning. Since they are so widely used, it is important that we, as practitioners and researchers, think about the impact these systems may have on users, society, and other stakeholders. In practice, the focus is often on systems and values of improving key performance indicators (KPIs), such as increased sales or customer retention. Recommendation technology is currently underutilized to serve societal goals that go beyond the business objectives of individual corporations. However, other values, bound more to societal good, could be considered in the development and goals of a recommender system. In fact, recommender systems have already been explored to stimulate healthier eating behavior and for improved health and well-being in general, to help low-income families make school choices, to suggest successful learning paths for students, to entice climate-protecting energy-saving behavior, to support fair micro-lending, or improve the information diets of news readers. Research in these areas is however limited in numbers, compared to the many papers that are published every year that propose new models for improved movie recommendations. Moreover, concerning the methodology and evaluation perspective in this area, it is essential to find a clear methodology and criteria for evaluating the effectiveness and "goodness" of the proposed algorithms. This includes acknowledging that different values may be conflicting, as well as resolving how and when (and by whom) certain values should be prioritized over others. Research on "Recommender Systems for Good" may benefit from an interdisciplinary approach, drawing on insights from fields such as computer science, ethics, sociology, psychology, law, and economics. Collaborations with stakeholders from diverse backgrounds can enrich the research and ensure that recommendations are grounded in real-world needs and values. This special issue aims to present state-of-the-art research works where recommender systems have a positive societal impact and help us address urgent societal challenges. It will thereby serve as a call to action for more research in these areas. Ultimately, through this special issue, we hope to establish a vision of "Recommender Systems for Good', following the spirit of the "AI for Good" initiative (https://aiforgood.itu.int) to achieve the United Nations Sustainable Development Goals (2015) and the more recent UNESCO recommendation on the Ethics of Artificial Intelligence (2024) (https://www.unesco.org/en/artificial-intelligence/recommendation-ethics). Topics: We aim to collect the latest research on recommender systems for societal good. The topics of the special issues include (but are not limited to): - Recommender systems for safety, security, and privacy (e.g., reducing poverty and inequality) - Recommender systems that protect the environment and ecosystems (e.g., lower energy consumption, water and energy management) - Recommender systems that give control of data back to the users (e.g., transparency of data, models, and outputs) - Recommender systems for the interconnected society (e.g., increase of solidarity, online conversational health, multi-stakeholder recommenders) - Accountability in recommender systems, including addressing emerging regulations, such as the DSA (Digital Service Act) - Recommender systems for the public good (e.g., mental and physical health, welfare, digital literacy, stakeholder engagement, e-learning) - Introspective studies on the current state of RSs concerning societal good - Fairness-preserving and fairness-enhancing recommender systems, unbiased recommendations (e.g. to preserve gender equality) - Responsible recommendation (e.g., in social media and traditional news, avoiding filter bubbles and echo chambers) - Sustainability and Cultural recommendations (e.g., art, cultural heritage) - Recommendations to support disadvantaged groups (e.g., elderly, minorities) - Recommender systems for personal development and well-being (e.g., behavioral change, fitness, self-actualization, personal growth) Important Dates: - Submission deadline: September 1, 2024 - First-round review decisions: December 1, 2024 - Deadline for revision submissions: February 1, 2025 - Notification of final decisions: April 1, 2025 Submissions that are received before the first deadline will be directly sent out for review; papers will be immediately published online after acceptance. Submission Information: The special issue welcomes technical research papers, survey papers, and opinion/reflective papers. Each paper should address one or more of the abovementioned topics or be in other scopes of Recommender Systems for Good. The special issue will also consider peer-reviewed journal versions (at least 30% new content) of top papers from related recommender system conferences such as RecSys, SIGIR, KDD, CIKM, IUI, UMAP, CHI, WSDM, ACL, etc. The new content must be in terms of intellectual contributions, technical experiments, and findings. Submissions must be prepared according to the TORS submission guidelines (https://dl.acm.org/journal/tors/author-guidelines) and must be submitted via Manuscript Central (https://mc.manuscriptcentral.com/tors). For questions and further information, please contact the guest editors at rs4good [at] acm [dot] org.

1 0

Deadline Extension: Language Technologies and Digital Humanities Conference (JT-DH 2024)
by Špela Arhar Holdt 14 May '24

14 May '24

We have **extended the submission deadline** for the Language Technologies and Digital Humanities Conference (JT-DH 2024), which will take place on September 19 and 20, 2024, in Ljubljana, Slovenia. More about the venue, topics, templates, and programme is available here: https://www.sdjt.si/wp/jtdh-2024-en. Important dates - May 31, 2024: **Extended deadline for abstract/paper submission** - July 5, 2024: Notification of acceptance - August 23, 2024: Final abstract/paper submission - August 23, 2024: Registration deadline - September 18, 2024: Pre-conference events and workshops - September 19 & 20, 2024: JTDH 2024 Conference.

1 0

PhD position in Language Technology (3 years)
by Samia Touileb 14 May '24

14 May '24

-- Reminder -- We are seeking a motivated candidate to join our research team in MediaFutures, at the University of Bergen, Norway. The primary task of this position will be to develop novel techniques for generating news articles. This involves creating resources that adapt lexical, grammatical, and stylistic choices based on various parameters, including user profiles, cognitive accessibility, and journalistic formats. We are also interested in exploring how news content can be versioned and adapted dynamically. This includes tailoring news articles to different user preferences and user segments, ensuring readability, and optimizing content delivery across various platforms. We expect that the candidate will explore how large language models can be used for news generation while maintaining ethical and responsible practices. The position also offers the opportunity to collaborate with industry partners and gather domain-specific datasets from leading Norwegian media houses. This real-world collaboration will enhance the relevance and impact of the produced research. The PhD candidate will work at MediaFutures in Work Package 5 and will cooperate with researchers and partners in the work package, including the Language Technology Group at the University of Oslo, the National Library of Norway, Schibsted, Amedia, and TV 2. In addition to relevant researchers and partners in other work packages. As an applicant you should have an excellent written and spoken command of English. Proficiency in Norwegian is an important advantage, but *not* a requirement. The deadline is 25th May 2024. For more details about the position and how to apply see: https://www.jobbnorge.no/en/available-jobs/job/262259/phd-position-in-langu… If you have any questions, do not hesitate to contact me. Best, Samia *---* *Samia Touileb* *Associate Professor in Natural Language Processing* *Department of Information Science and Media Studies,* *University of Bergen* *MediaFutures: Research Center for Responsible Media Technology & Innovation*

1 0

[CfP] Call for Posters and Demos || SEMANTiCS 2024 EU || Sep 17 - 19, 2024 || Amsterdam, Netherlands
by SEMANTiCS 14 May '24

14 May '24

==== SEMANTiCS - 20th International Conference on Semantic Systems Amsterdam, Netherlands Call for Posters and Demos September 17 - 19, 2024 https://2024-eu.semantics.cc/page/cfp_posters_demos ==== The Posters & Demos Track provides a platform for researchers to present their latest findings, ongoing projects, and cutting-edge work in progress. This track is open to a range of submissions on innovative applications, the latest results, unpublished ideas, prototypes of semantic technologies, and their use in various domains. It also welcomes contributions related to applications, use cases, and datasets that may attract developers and potential research or business partners. The Posters & Demos Track offers an informal setting that promotes engagement and dialogue between presenters and attendees. These discussions can provide valuable feedback for presenters' future work, while also allowing participants to gain insight into emerging research trends and network with other researchers. =Important Dates= * Paper Submission Deadline: June 25, * Notification of Acceptance: July 29, 2024 * Camera-Ready Paper Deadline: August 6, 2024 All deadlines are set for 11:59 pm, Anywhere On Earth time (UTC-12) Submission via Easychair on https://easychair.org/my/conference?conf=sem24 Proceedings of SEMANTiCS 2024 EU will be made available open access by CEUR-WS.org. =Topics of Interest= We welcome contributions in the context of semantic-based research and systems, which address – but are not limited to – the topics of the Research Track ttps://2024-eu.semantics.cc/page/cfp_rev_rep. Additionally, we encourage submissions of visionary ideas, position statements, negative results, and unconventional ideas. Demos should showcase innovative implementations and technologies both, from academia and industry. We warmly welcome contributions from industry professionals, provided that they concentrate on introducing innovative solutions to particular challenges, rather than serving as promotional material or descriptions of commercial products. =Author Guidelines and Submission= Poster and demo submissions should consist of a paper that describes the work, its contribution to the field or innovative aspects. * Poster and demo submissions are at most 5 pages long, including references. * No double-blind submissions required. * Submissions must be either in PDF or HTML. * Submissions must be formatted in the style of CEUR-ART (https://ceur-ws.org/HOWTOSUBMIT.html). An Overleaf page for LaTeX users is available. * For demos, we ask authors to include links enabling the reviewers to test the application or review the component. The absence of a pointer affects the overall rating of the contribution. * Submissions must be original and must not have been submitted for publication elsewhere. * At least one author of each accepted paper must register for the conference and present the paper. We look forward to receiving your contributions! =Poster and Demo Chairs= * Francesco Osborne & Anelia Kurteva

1 0

Shared Task on Machine Translation Gender Bias Evaluation with Multilingual Holistic Bias
by Agnieszka Falenska 14 May '24

14 May '24

*Apologies for crossposting* We are proud to announce that the Multilingual Holistic Bias task is now open in Dynabench <https://dynabench.org/>. The main objectives of this task are: To investigate the quality of MT systems on the particular case of gender preservation for tens of languages To examine and understand special gender challenges in translating in different language families. To investigate the performance of gender translation of low-resource, morphologically rich languages To open to the community the first challenge of this kind While the task is intended to be open without a particular deadline, we encourage you to submit models by April 15th and participate in the shared task from the 5th Workshop on Gender Bias in Natural Language Processing <https://genderbiasnlp.talp.cat/gebnlp-2024/shared-task-on-machine-translati…>. We are looking forward to having your participation! Shared Task organizers

1 2

Deadline Extension: Sixth Workshop on Teaching NLP
by Biester, Laura 13 May '24

13 May '24

Second Call For Papers: Sixth Workshop on Teaching NLP at ACL 2024 The Sixth Workshop on Teaching NLP will be co-located with the 2024 Annual Meeting of the Association for Computational Linguistics in Bangkok, Thailand. The workshop will occur on August 15 (hybrid option available). The one-day workshop will combine a program of traditional keynotes, posters, and oral presentations, with discourse through panel discussion, and focus on building a community for sharing resources. The submission deadline is being extended to May 19th, 2024 AOE. Call for Papers The field of Natural Language Processing (NLP) is growing rapidly, with new state-of-the-art methods emerging every year. This rapid growth challenges educators of NLP courses and degree programs to constantly revise their old material and create fresh NLP courses and degree programs, as well as new best practices and educational materials focused on emerging subareas of NLP. To support those facing these challenges, our one-day workshop will bring together the communities of NLP research and education to facilitate active discussion on questions including (but not limited to): * How can we facilitate meaningful conversations about language among Computer Science students? * How do we include user-centered design in core NLP curricula? * How should NLP educators design curricula that equip students with the ability to advance responsible and ethical NLP? * How can we design assignments that require GPU access or the use of paid APIs? * What are best practices that NLP educators from universities, industry groups, and Massive OpenOnline Courses (MOOCs) can use to share tools and resources for NLP education? This timely sixth edition of the Teaching NLP Workshop builds on prior successful offerings to tackle the most pressing issues in how to design NLP courses and bring together instructors from various backgrounds to discuss, create, and refine instructional design and material. Submission Information We welcome two submission types: teaching materials and papers: Teaching Materials (short papers) We invite short paper submissions of 1-2 pages that describe teaching materials such as curricula, course GitHub repositories, Jupyter notebooks, slides, homework, and assignments. These short papers do not need to be anonymised, but will be peer-reviewed and published in workshop proceedings, as well as presented in posters or demos. The corresponding teaching materials, while not being part of proceedings, should be submitted in addition to the short paper. We will create a Teaching NLP repository/wiki where authors may opt-in to make their materials available for the community after the workshop. Papers We invite papers of up to 8 pages discussing pedagogical aspects of NLP, focusing on (but not limited to) any of the following general topics: * Tools and methodologies (e.g., active learning, flipped classroom) * Scaling curricula to fit large class sizes * Adapting existing curricula to incorporate new NLP advancements * Teaching online NLP courses or adjusting courses to become remote * Challenges of designing the first NLP course or related degree program at a college, university, or on a MOOC platform * Teaching heterogenous groups of students (e.g., with respect to prior experience in computer science and linguistics) * Teaching underrepresented students * Bridging the gap between academic training and industry needs * Incorporating ethics, reproducibility, and responsible practices in NLP courses * Teaching multilingual NLP All submissions will be processed through OpenReview<https://openreview.net/group?id=aclweb.org/ACL/2024/Workshop/TeachNLP>. Important Dates * Paper Submission: May 19, 2024 AOE [Extended] * Notification of Acceptance: June 17, 2024 * Camera-Ready Deadline: July 1, 2024 * Teaching NLP Workshop: August 15, 2024 Website: https://sites.google.com/view/teachingnlpacl2024/ Contact: teachingnlp.yt(a)gmail.com<mailto:teachingnlp.yt@gmail.com> Best, TeachingNLP 2024 Organizers (Sana Al-azzawi, Laura Biester, György Kovács, Ana Marasović, Leena Mathur, Margot Mieskes, Leonie Weissweiler)

1 0

DEADLINE EXTENSION: The First Workshop on Data Contamination (CONDA) @ ACL 2024
by Eneko Agirre 13 May '24

13 May '24

*New paper submission and ARR commitment deadlines (see below)* We invite you to participate and submit your work to the First Workshop on Data Contamination (CONDA) co-located with ACL 2024 in Bangkok, Thailand. Data contamination, where evaluation data is inadvertently included in pre-training corpora of large scale models, and language models (LMs) in particular, has become a concern in recent times. The growing scale of both models and data, coupled with massive web crawling, has led to the inclusion of segments from evaluation benchmarks in the pre-training data of LMs. The scale of internet data makes it difficult to prevent this contamination from happening, or even detect when it has happened. Crucially, when evaluation data becomes part of pre-training data, it introduces biases and can artificially inflate the performance of LMs on specific tasks or benchmarks. This poses a challenge for fair and unbiased evaluation of models, as their performance may not accurately reflect their generalization capabilities. Although a growing number of papers and state-of-the-art models mention issues of data contamination, there is no agreed-upon definition or standard methodology to ensure that a model does not report results on contaminated benchmarks. Addressing data contamination is a shared responsibility among researchers, developers, and the broader community. By adopting best practices, increasing transparency, documenting vulnerabilities, and conducting thorough evaluations, we can work towards minimizing the impact of data contamination and ensuring fair and reliable evaluations. We welcome paper submissions on all topics related to data contamination, including but not limited to: * Definitions, taxonomies, and gradings of contamination * Contamination detection (both manual and automatic) * Community efforts to discover, report, and organize contamination events * Documentation frameworks for datasets or models * Methods to avoid data contamination * Methods to forget contaminated data * Scaling laws and contamination * Memorization and contamination * Policies to avoid impact of contamination in publication venues and open source communities * Reproducing and attributing results from previous work to data contamination * Survey work on data contamination research * Data contamination in other modalities */ /* */Submission Instructions/* We welcome two types of papers: regular workshop papers and non-archival submissions. Regular workshop papers will be included in the workshop proceedings. All submissions must be in PDF format and made through OpenReview. * * *Regular workshop papers:*Authors can submit papers up to 8 pages, with unlimited pages for references. Authors may submit up to 100 MB of supplementary materials separately and their code for reproducibility. All submissions undergo a double-blind single-track review. Best Paper Award(s) will be given based on nomination by the reviewers. Accepted papers will be presented as posters with the possibility of oral presentations. * * *Non-archival submissions:*Cross-submissions are welcome. Accepted papers will be presented at the workshop but not included in the workshop proceedings. Papers must be in PDF format and will be reviewed in a double-blind fashion by workshop reviewers. We also welcome extended abstracts (up to 2 pages) of papers that are work in progress, under review or to be submitted to other venues. Papers in this category need to follow the ACL format. * In addition to papers submitted directly to the workshop, which will be reviewed by our Programme Committee. We also accept papers reviewed through ACL Rolling Review and committed to the workshop. Please, check the relevant dates for each type of submission. */ /* */Important dates/* Relevant deadlines to consider when submitting your paper are: * *Paper submission deadline: May 31 (Friday), 2024* * *ARR pre-reviewed commitment deadline: June 14 (Friday), 2024* * Notification of acceptance: June 17 (Monday), 2024 * Camera-ready paper due: July 1 (Monday), 2024 * Workshop date: August 16, 2024 */ /* */Sponsors/* * AWS AI and Amazon Bedrock * HuggingFace * Google */ /* */Contact/* * Website:https://conda-workshop.github.io/ <https://conda-workshop.github.io/> * Email:conda-workshop@googlegroups.com<mailto:conda-workshop@googlegroups.com> */ /* */Organizers/* Oscar Sainz, University of the Basque Country (UPV/EHU) Iker García Ferrero, University of the Basque Country (UPV/EHU) Eneko Agirre, University of the Basque Country (UPV/EHU) Jon Ander Campos, Cohere Alon Jacovi, Bar Ilan University Yanai Elazar, Allen Institute for Artificial Intelligence and University of Washington Yoav Goldberg, Bar Ilan University and Allen Institute for Artificial Intelligence -- Eneko Agirre HiTZ Hizkuntza Teknologiako Zentroa - Ixa Taldea Centro Vasco de Tecnología de la Lengua - Grupo Ixa Basque Center for Language Technology - Ixa NLP Group University of the Basque Country (UPV/EHU) hitz.ehu.eus/eneko <https://hitz.ehu.eus/eneko>

1 0

2026

2025

2024

2023

2022

Corpora