KGLLM 2024: Special session on Knowledge Graphs and Large Language Models
October 19, 2024
Trento, Italy
Link: https://www.icnlsp.org/2024welcome/#special_session
The Special session on Knowledge Graphs and Large Language Models will be
held within the 7th International Conference on Natural Language and Speech
Processing (ICNLSP 2024 <https://www.icnlsp.org/2024welcome/>) on October
19, 2024.
** DESCRIPTION **
“In recent years, the fields of Knowledge Graphs (KGs) and Large Language
Models (LLMs) have witnessed remarkable advancements, revolutionizing the
landscape of artificial intelligence and natural language processing. KGs,
structured representations of knowledge, and LLMs, powerful language models
trained on vast amounts of text data, have individually demonstrated their
prowess in various applications.
However, the integration and synergy between KGs and LLMs have emerged as a
new frontier, offering unprecedented opportunities for enhancing knowledge
representation, understanding, and generation. This integration not only
enriches the semantic understanding of textual data but also empowers AI
systems with the ability to reason, infer, and generate contextually
relevant responses.
** TOPICS **
This special session aims to delve into the theoretical foundations,
historical perspectives, and practical applications of the fusion between
Knowledge Graphs and Large Language Models. We invite contributions that
explore the following areas:
1- Theoretical Frameworks: Papers elucidating the theoretical underpinnings of integrating KGs and LLMs, including methodologies, algorithms, and models for knowledge-enhanced language understanding and generation.
2- Historical Perspectives: Insights into the evolution of KGs and LLMs, tracing their development trajectories, seminal works, and transformative milestones leading to their integration.
3- Design and Implementation: Research articles focusing on the design principles, architectures, and techniques for effectively combining KGs and LLMs to facilitate tasks such as information retrieval, question answering, knowledge inference, and natural language understanding.
4- Explanatory Capabilities: Explorations into how the fusion of KGs and LLMs enables the development of explainable AI systems, providing transparent and interpretable insights into model decisions and outputs.
5- Human-Centered Intelligent Systems: Studies examining the design and deployment of interactive AI systems that leverage KGs and LLMs to facilitate seamless human-computer interaction, catering not only to experts but also to a broader lay audience.
We encourage submissions that contribute to advancing our understanding of
the synergistic relationship between Knowledge Graphs and Large Language
Models, fostering interdisciplinary collaborations across computer science,
artificial intelligence, linguistics, cognitive science, and beyond. By
shedding light on this burgeoning area of research, this special session
aims to propel the field forward and inspire future innovations in
AI-driven knowledge representation and natural language processing.”
** SESSION ORGANIZERS **
Gérard Chollet, CNRS-SAMOVAR Institut Polytechnique de Paris, France.
Hugues Sansen, Institut Polytechnique de Paris, France.
** IMPORTANT DEADLINES **
Submission deadline: 30 June 2024 11:59 PM (GMT)
Notification of acceptance: 15 September 2024
Camera-ready paper due: 25 September 2024
** PUBLICATION **
The accepted papers will be included in the ICNLSP conference proceedings, which will be published in the ACL Anthology. The extended versions will be
published in a special issue of the Machine Learning and Knowledge
Extraction Journal (MAKE), indexed in Web of Science, Scopus, etc.
** CONTACT **
icnlsp(at)gmail(dot)com
Title: Structural Biases for Compositional Semantic Prediction
# Scientific context
Compositionality is a foundational hypothesis in formal semantics and
states that the semantic interpretation of an utterance is a function of
its parts and how they are combined (i.e. their syntactic structure). In
NLP, the current dominant paradigm is to design end-to-end models with no
intermediate linguistically interpretable representations, which is often
motivated by the fact that pretrained language models implicitly encode
latent syntactic representations. However, recent studies suggest that
the syntactic information learned by language models is insufficient and
that, in their current form, they are unable to exploit the syntactic
information provided in their input when they need to generate a structured
output.
Strikingly, most systems that obtained decent results on compositional
generalization benchmarks either (i) include some data augmentation methods
that increase the exposure of the model to diverse syntactic structures at
training time, or (ii) resort to a natural language parser and hand-crafted
rules to derive the semantic representation from the syntactic tree. These
two approaches are effective, but they still have limitations that need to be addressed. Firstly, data augmentation sidesteps the issue rather than solving it, is tied to a particular dataset or task, and requires additional computation, both for generating new data and for re-training or fine-tuning models. Secondly, approach (ii) abandons the seq2seq framework for a conceptually more complex one, and often uses architectures that are tied
to specific data or tasks. In contrast, we believe that with proper
built-in inductive biases, a seq2seq model might provide a simple, yet
effective solution to the structural compositionality issue.
# PhD Proposal
The goal of this PhD will be to explore inductive biases related to linguistic structures, in an attempt to build small NLP models with compositional skills, i.e. models with built-in knowledge making them able to infer generalization rules from only a few data points. Research directions
will be defined together with the successful applicant (who is encouraged
to bring their own ideas!) and may include:
- Learning invariant language representations. A risk of learning from
little data or rare phenomena is that a model may rely on spurious
correlations and be unable to generalize outside a specific context.
Developing representations that are invariant to noise has been proposed as
a way of improving generalization (Peyrard et al., 2022). We propose to
formalize invariants related to syntactic and semantic structures and
explore ways to integrate them during the training phase.
- Syntactically constrained decoders. Unlike parsers, seq2seq models are unable to generate structures unseen at training time. We propose to explore the use of structural constraints to guide decoding in seq2seq models.
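As a purely illustrative sketch (not part of the proposal itself) of what structural constraints during decoding can look like, the toy example below masks, at each decoding step, every token that would break a simple well-formedness constraint (balanced brackets in a toy semantic representation); the vocabulary and scoring function are placeholders standing in for a trained seq2seq model.

    import math
    import random

    VOCAB = ["(", ")", "pred", "arg", "<eos>"]

    def allowed_tokens(prefix):
        """Tokens that keep the partial output structurally well-formed."""
        depth = prefix.count("(") - prefix.count(")")
        allowed = {"(", "pred", "arg"}
        if depth > 0:
            allowed.add(")")      # a bracket may be closed only if one is open
        if depth == 0 and prefix:
            allowed.add("<eos>")  # generation may stop only when brackets are balanced
        return allowed

    def constrained_greedy_decode(score_fn, max_len=20):
        """Greedy decoding in which disallowed tokens receive a score of -inf."""
        prefix = []
        for _ in range(max_len):
            allowed = allowed_tokens(prefix)
            scores = {tok: (score_fn(prefix, tok) if tok in allowed else -math.inf)
                      for tok in VOCAB}
            best = max(scores, key=scores.get)
            if best == "<eos>":
                break
            prefix.append(best)
        return prefix

    # Stand-in for the next-token scores of a trained seq2seq model.
    random.seed(0)
    print(constrained_greedy_decode(lambda prefix, tok: random.random()))

The same masking idea carries over to real decoders by adding -inf to the logits of disallowed tokens before the argmax or sampling step.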
# Important information:
- Starting date: between September and December 2024 (duration 3 years)
- Place of work: Laboratoire d’Informatique de Grenoble, CNRS, Grenoble,
France
- Funding: ANR project “COMPO: Inductive Biases for Compositionality-capable Deep Learning Models of Natural Language” (2024-2028)
- Partners: Université Paris Cité, Université Aix-Marseille, Université
Grenoble Alpes
- The PhD will be supervised by Éric Gaussier and Maximin Coavoux, in close collaboration with other partners from the COMPO consortium; the PhD candidate will be part of two LIG teams: GETALP and APTIKAL.
- Salary: ~2300€ gross/month
- Profile: Master’s degree in NLP or computer science, with experience in NLP and machine learning
To apply, please send a CV, a cover letter and your most recent academic transcripts to eric.gaussier(a)univ-grenoble-alpes.fr and
maximin.coavoux(a)univ-grenoble-alpes.fr
References:
SLOG: A Structural Generalization Benchmark for Semantic Parsing
Bingzhi Li, Lucia Donatelli, Alexander Koller, Tal Linzen, Yuekun Yao,
Najoung Kim
<https://aclanthology.org/2023.emnlp-main.194/>
Structural generalization is hard for sequence-to-sequence models
Yuekun Yao, Alexander Koller
<https://aclanthology.org/2022.emnlp-main.337/>
Compositional Generalization Requires Compositional Parsers
Pia Weißenhorn, Yuekun Yao, Lucia Donatelli, Alexander Koller
<https://arxiv.org/abs/2202.11937>
Invariant Language Modeling
Maxime Peyrard, Sarvjeet Ghotra, Martin Josifoski, Vidhan Agarwal, Barun
Patra, Dean Carignan, Emre Kiciman, Saurabh Tiwary, Robert West
<https://aclanthology.org/2022.emnlp-main.387/>
We are delighted to invite you to ICNLSP 2024
<https://www.icnlsp.org/2024welcome/>, the 7th edition of the International
Conference on Natural Language and Speech Processing, which will be held at the University of Trento from October 19 to 20, 2024 (*HYBRID*).
*Topics*
- Signal processing, acoustic modeling.
- Speech recognition (Architecture, search methods, lexical modeling,
language modeling, language model adaptation, multimodal systems,
applications in education and learning, zero-resource speech recognition,
etc.).
- Speech Analysis.
- Paralinguistics in Speech and Language (Perception of paralinguistic
phenomena, analysis of speaker states and traits, etc.).
- Spoken Dialog Systems and Conversational Analysis.
- Speech Translation.
- Speech synthesis.
- Speaker verification and identification.
- Language identification.
- Speech coding.
- Speech enhancement.
- Speech intelligibility.
- Speech Perception.
- Speech Production.
- Brain studies on speech.
- Phonetics, phonology and prosody.
- Speech and hearing disorders.
- Paralinguistics of pathological speech and language.
- Speech technology for disordered speech/hearing.
- Cognition and natural language processing.
- Machine translation.
- Text categorization.
- Summarization.
- Sentiment analysis and opinion mining.
- Computational Social Web.
- Arabic dialects processing.
- Under-resourced languages: tools and corpora.
- Large language models.
- Arabic OCR.
- NLP tools for software requirements and engineering.
- Knowledge fundamentals.
- Knowledge management systems.
- Information extraction.
- Data mining and information retrieval.
- Lexical semantics and knowledge representation.
- Requirements engineering and NLP.
- NLP for Arabic heritage documents.
*Submission*
Papers must be submitted via the link:
https://cmt3.research.microsoft.com/ICNLSP2024/
Each submitted paper will be reviewed by three program committee members. The reviewing process is double-blind. Authors can use the *ACL format*: *LaTeX <https://www.icnlsp.org/ACL%202023%20Proceedings%20Template.zip>* or Word. Authors may submit their papers as long or short papers. Long papers consist of up to 8 pages of content + references; short papers, up to 4 pages of content + references.
*Important dates*
*Submission deadline:* 30 June 2024, 11:59 PM (GMT)
*Notification of acceptance:* 15 September 2024
*Camera-ready paper due:* 25 September 2024
*Conference dates:* 19, 20 October 2024
*Publication*
1- All accepted papers will be published in the ACL Anthology <https://aclanthology.org/>.
2- Selected papers will be published (after extension) in:
2-a- A special issue of the Machine Learning and Knowledge Extraction Journal (MAKE) <https://www.mdpi.com/journal/make>, indexed in Web of Science <https://mjl.clarivate.com/search-results>, Scopus <https://www.scopus.com/sources.uri>, etc. Special issue title: "Knowledge Graphs and Large Language Models" <https://www.mdpi.com/journal/make/special_issues/POB4VNE0QP>.
2-b- Signals and Communication Technology (Springer), indexed in Scopus <https://www.scopus.com/> and zbMATH <https://zbmath.org/>.
Dear all,
we are happy to invite you to participate in the Shared Task on Quality Estimation at WMT'24.
The details of the task can be found at: https://www2.statmt.org/wmt24/qe-task.html
New this year:
* We introduce a new language pair (zero-shot): English-Spanish
* Continuing from the previous edition, we will also analyse the robustness of submitted QE systems to a set of phenomena ranging from hallucinations and biases to localized errors, all of which can significantly impact real-world applications.
* We also introduce a new task, seeking not only to detect but also to correct errors: Quality-aware Automatic Post-Editing! We invite participants to submit systems capable of automatically generating QE predictions for machine-translated text and the corresponding output corrections.
2024 QE Tasks:
Task 1 -- Sentence-level quality estimation
This task follows the same format as last year but with fresh test sets and a new language pair: English-Spanish. We will test the following language pairs:
* English to German (MQM)
* English to Spanish (MQM)
* English to Hindi (MQM & DA)
* English to Gujarati (DA)
* English to Telugu (DA)
* English to Tamil (DA)
More details: https://www2.statmt.org/wmt24/qe-subtask1.html
Task 2 -- Fine-grained error span detection
Sequence labelling task: predict the error spans in each translation and the associated error severity (Major or Minor); an illustrative span representation is sketched below the language pairs.
We will test the following language pairs:
* English to German (MQM)
* English to Spanish (MQM)
* English to Hindi (MQM)
More details: https://www2.statmt.org/wmt24/qe-subtask2.html
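Purely as an illustration (the binding submission format is the one described on the task page), one convenient in-memory representation of a Task 2 prediction is a list of character-offset spans, each carrying a severity label:

    from dataclasses import dataclass
    from typing import List

    @dataclass
    class ErrorSpan:
        start: int     # character offset into the MT output (inclusive)
        end: int       # character offset (exclusive)
        severity: str  # "major" or "minor"

    @dataclass
    class SegmentPrediction:
        segment_id: int
        spans: List[ErrorSpan]

    # Hypothetical example: one major error spanning characters 12-19.
    example = SegmentPrediction(segment_id=0,
                                spans=[ErrorSpan(start=12, end=19, severity="major")])
    print(example)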
Task 3 -- Quality-aware Automatic Post-editing
We expect submissions of post-edits correcting detected error spans in the original translation. Although the task is focused on quality-informed APE, we also allow participants to submit APE output without QE predictions to understand the impact of their QE system. Submissions without QE predictions will also be considered official.
We will test the following language pairs:
* English to Hindi
* English to Tamil
More details: https://www2.statmt.org/wmt24/qe-subtask3.html
Important dates:
1. Test sets will be released on July 15th.
2. Participants can submit their systems by July 23rd on CodaLab.
3. System paper submissions are due by 20th August [aligned with WMT deadlines].
Note: Like last year, we aligned with the General MT and Metrics shared tasks to facilitate cross-submission on the common language pairs: English-German, English-Spanish, and English-Hindi (MQM).
We look forward to your submissions and feel free to contact us if you have any more questions!
Best wishes,
on behalf of the organisers.
The original post is here: https://www.informatik.tu-darmstadt.de/ukp/ukp_home/jobs_ukp/2021_associate…
Are you passionate about making a difference in the field of mental health through cutting-edge research in AI and Natural Language Processing? Do you have a strong background in computer science, data science, or a related field? If so, we invite you to join our dynamic and interdisciplinary team at the Technical University of Darmstadt!
Position: Full-Time Research Assistant (i.e., doctoral candidate or PhD student)
Duration: from 1 October 2024 (or soon afterward) until 31 December 2027, with the possibility of extension.
Location: Department of Computer Science, Technical University of Darmstadt
Responsibilities:
- Conduct cutting-edge research in NLP with a focus on mental health applications.
- Focus on research topics, such as NLP and knowledge discovery for mental health, large language models for clinical applications, and multimodal clinical data analysis.
- Develop and implement algorithms for analyzing therapist-patient conversations.
- Collaborate with a diverse team of researchers from TU Darmstadt and other partner institutions.
Ecosystem: We are part of DYNAMIC, the newly approved interdisciplinary LOEWE-funded center “Dynamic Network Approach of Mental Health to Stimulate Innovations for Change.” Our mission is to advance the understanding and treatment of mental health disorders using AI, NLP, and multimodal data analysis.
Team: Dr. Shaoxiong Ji (https://www.helsinki.fi/~shaoxion/) will join TU Darmstadt this fall and establish a junior independent research group focusing on foundation models and their applications, such as healthcare. He has a wide range of research directions, including NLP for health, multilingual LLMs, and learning methods such as federated learning, multitask learning, and meta-learning. The newly established research group will closely collaborate with the research labs led by Prof. Iryna Gurevych and Prof. Kristian Kersting, and partners under the umbrella of the DYNAMIC project.
Qualifications:
- A Master’s degree in Computer Science, Data Science, AI, NLP, or a related field.
- Strong programming skills in Python or other relevant languages.
- Experience with deep learning frameworks
- Excellent problem-solving abilities and a passion for research.
- Previous experience in clinical NLP or multimodal data analysis is a plus but not required.
- Strong communication skills and the ability to work effectively in a collaborative environment.
What We Offer:
- An exciting opportunity to contribute to impactful research in mental health.
- A supportive and collaborative research environment.
- Opportunities for professional development and growth within the DYNAMIC project and beyond.
How to Apply: If you are enthusiastic about joining our team and contributing to groundbreaking research, please submit the following documents:
- Detailed CV
- Master’s degree certificate and Bachelor’s and Master’s study transcripts
- Cover letter outlining your motivation and relevant experience
- Contact information for at least two academic or professional references
Please send your application to Shaoxiong Ji <shaoxiong.ji(a)outlook.com> by July 31st, 2024. After that, the position will remain open until filled; we will consider applications as soon as they are submitted.
Join us in making a real-world impact on mental health through the power of AI and NLP!
Shared task on Multilingual Grammatical Error Correction (MultiGEC-2025)
We invite you to participate in the shared task on Multilingual Grammatical Error Correction, MultiGEC-2025, covering over 10 languages, including Czech, English, Estonian, German, Icelandic, Italian, Latvian, Slovene, Swedish and Ukrainian.
The results will be presented on March 5 (or 2), 2025, at the NLP4CALL workshop, co-located with the NoDaLiDa conference (https://www.nodalida-bhlt2025.eu/conference), to be held in Tallinn, Estonia, on 2--5 March 2025.
The publication venue for system descriptions will be the proceedings of the NLP4CALL workshop.
Official system evaluation will be carried out on CodaLab.
* TASK DESCRIPTION
In this shared task, your goal is to rewrite learner-written texts so that they are either grammatically correct (adhering to the "minimal correction" principle) or both grammatically correct and idiomatic (applying fluency edits).
For instance, the text
> My mother became very sad, no food. But my sister better five months later.
can be corrected minimally as
> My mother became very sad, and ate no food. But my sister felt better five months later.
or with fluency edits as
> My mother was very distressed and refused to eat. Luckily, my sister recovered five months later.
For fair evaluation of both approaches to the correction task, we will provide two evaluation metrics, one favoring minimal correction, one suited for fluency-edited output (read more under Evaluation).
We particularly encourage development of multilingual systems that can process all (or several) languages using a single model, but this is not a mandatory requirement to participate in the task.
* DATA
We provide training, development and test data for each of the languages. The training and development dataset splits will be made available through GitHub. Evaluation will be performed on a separate test set.
See website for more detailed information: https://github.com/spraakbanken/multigec-2025/
* EVALUATION
During the shared task, evaluation will be based on cross-lingually applicable automatic metrics, primarily:
- GLEU score (reference-based)
- Scribendi score (reference-free)
For comparability with previous results, we will also provide F0.5 scores.
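As a reminder of what an edit-level F0.5 measures (this sketch is not the official scorer, which will run on CodaLab): given true positive, false positive and false negative edit counts, beta = 0.5 weights precision more heavily than recall, reflecting the usual GEC assumption that proposing a wrong correction is worse than missing one.

    def f_beta(tp, fp, fn, beta=0.5):
        """F-beta over edit counts; beta=0.5 emphasises precision."""
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        if precision + recall == 0:
            return 0.0
        b2 = beta ** 2
        return (1 + b2) * precision * recall / (b2 * precision + recall)

    # Hypothetical counts: precision 0.8, recall ~0.67 -> F0.5 ~0.77
    print(round(f_beta(tp=8, fp=2, fn=4), 3))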
After the shared task, we also plan on carrying out a human evaluation experiment on a subset of the submitted results.
* TIMELINE (preliminary)
- June 18, 2024 - first call for participation
- September 20, 2024 - second call for participation
- October 20, 2024 - third call for participation. Training and validation data released, CodaLab opens for team registrations
- October 30, 2024 - reminder. Validation server released online
- November 13, 2024 - test data released
- November 20, 2024 - system submission deadline (system output)
- November 29, 2024 - results announced
- December 20, 2024 - paper submission deadline with system descriptions
- January 20, 2025 - paper reviews sent to the authors
- February 7, 2025 - camera-ready deadline
- March 5 (or March 2), 2025 - presentations of the systems at the NLP4CALL workshop
* PUBLICATION
We encourage you to submit a paper with your system description to the NLP4CALL workshop special track. We follow the same requirements for paper submissions as the NLP4CALL workshop, i.e. we use the same template and apply the same page limit. All papers will be reviewed by the organizing committee. Upon paper publication, we encourage you to share models, code, fact sheets, extra data, etc. with the community through GitHub or other repositories.
* ORGANIZERS
- Arianna Masciolini, University of Gothenburg, Sweden
- Andrew Caines, University of Cambridge, UK
- Orphée De Clercq, Ghent University, Belgium
- Murathan Kurfali, Stockholm University, Sweden
- Ricardo Muñoz Sánchez, University of Gothenburg, Sweden
- Elena Volodina, University of Gothenburg, Sweden
- Robert Östling, Stockholm University, Sweden
* DATA PROVIDERS (more languages to come)
- Czech: Alexandr Rosen, Charles University, Prague
- English: Andrew Caines, University of Cambridge
- Estonian:
-- Mark Fishel, University of Tartu, Estonia
-- Kais Allkivi-Metsoja, Tallinn University, Estonia
-- Kristjan Suluste, Eesti Keele Instituut, Estonia
- German:
-- Torsten Zesch, Fernuniversität in Hagen, Germany
-- Andrea Horbach, Fernuniversität in Hagen, Germany
- Icelandic: Isidora Glisič, University of Iceland
- Italian: Jennifer-Carmen Frey, Eurac Research Bolzano, Italy
- Latvian:
-- Roberts Darģis, University of Latvia
-- Ilze Auzina, University of Latvia
- Slovene: Špela Arhar Holdt, University of Ljubljana, Slovenia
- Swedish: Arianna Masciolini, University of Gothenburg, Sweden
- Ukrainian:
-- Oleksiy Syvokon, Microsoft
-- Mariana Romanyshyn, Grammarly
* CONTACT
Please join the MultiGEC-2025 Google group (https://groups.google.com/g/multigec-2025) in order to ask questions, hold discussions and browse for already answered questions.
Join Veeva Systems, a pioneer in cloud solutions for the life sciences
industry, as a Senior/Principal Data Scientist focusing on NLP.
Your role will primarily involve developing LLM-based agents that are
specialized in searching and extracting detailed information about Key
Opinion Leaders (KOLs) in the healthcare sector.
You will craft an end-to-end human-in-the-loop pipeline to sift through a
large array of unstructured medical documents—ranging from academic
articles to clinical guidelines and meeting notes from therapeutic
committees.
You will also collaborate with over 2,000 data curators and a dedicated team of software developers and DevOps engineers to refine these models and
deploy them into production environments.
*What You'll Do*
- Bring the latest technologies and trends in NLP to your platform
- Develop LLM-based agents capable of performing function calls and utilizing tools such as browsers for enhanced data interaction and retrieval
- Apply Reinforcement Learning from Human Feedback (RLHF) methods such as Direct Preference Optimization (DPO) and Proximal Policy Optimization (PPO) to train LLMs based on human preferences
- Design, develop, and implement an end-to-end pipeline for extracting predefined categories of information from large-scale, unstructured data across multi-domain and multilingual settings
- Create robust semantic search functionality that effectively answers user queries related to various aspects of the data (see the sketch after this list)
- Use and develop named entity recognition, entity linking, slot filling, few-shot learning, active learning, question answering, dense passage retrieval and other statistical techniques and models for information extraction and machine reading
- Deeply understand and analyze our data model per data source and geo-region, and interpret model decisions
- Collaborate with data quality teams to define annotation tasks and metrics, and perform qualitative and quantitative evaluation
- Utilize cloud infrastructure for model development, ensuring seamless collaboration with our team of software developers and DevOps engineers for efficient deployment to production
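The sketch below illustrates the kind of dense semantic search mentioned in the list above; it assumes the open-source sentence-transformers library and a generic embedding model, and is not a description of Veeva's actual pipeline. The example documents and query are invented.

    from sentence_transformers import SentenceTransformer, util

    # Generic embedding model, used here only for illustration.
    model = SentenceTransformer("all-MiniLM-L6-v2")

    documents = [
        "Dr. Smith chaired the oncology therapeutic committee meeting in March.",
        "The clinical guideline recommends a revised dosing schedule.",
        "Prof. Jones published a phase III trial on immunotherapy.",
    ]
    doc_embeddings = model.encode(documents, convert_to_tensor=True)

    query = "Who leads the oncology committee?"
    query_embedding = model.encode(query, convert_to_tensor=True)

    # Cosine-similarity search over the document embeddings.
    hits = util.semantic_search(query_embedding, doc_embeddings, top_k=2)[0]
    for hit in hits:
        print(documents[hit["corpus_id"]], float(hit["score"]))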
*Requirements*
-
4+ years of experience as a data scientist (or 2+ years with a Ph.D.
degree)
-
Master's or Ph.D. in Computer Science, Artificial Intelligence,
Computational Linguistics, or a related field.
-
Strong theoretical knowledge of Natural Language Processing, Machine
Learning, and Deep Learning techniques.
-
Proven experience working with large language models and transformer
architectures, such as GPT, BERT, or similar.
-
Familiarity with large-scale data processing and analysis, preferably
within the medical domain.
-
Proficiency in Python and relevant NLP libraries (e.g., NLTK, SpaCy,
Hugging Face Transformers).
-
Experience in at least one framework for BigData (e.g. Ray, Spark) and
one framework for Deep Learning (e.g. PyTorch, JAX)
- Experience working with cloud infrastructure (e.g., AWS, GCP, Azure)
and containerization technologies (e.g., Docker, Kubernetes) and
experience with bashing script
- Strong collaboration and communication skills, with the ability to
work effectively in a cross-functional team
- Used to start-up environments
- Social competence and a team player
- High energy and ambitious
- Agile mindset
*Application Links*
You can work remotely anywhere in the UK, the Netherlands or Spain, but you must be a resident of one of these countries and be legally authorized to work there without requiring Veeva’s support for a visa or relocation. *If you do not meet this condition but think you are an exceptional candidate, please say so in a separate note and we will consider it.*
Spain: https://jobs.lever.co/veeva/2bf92570-a680-40e8-96b0-a8629e3feac7 <https://jobs.lever.co/veeva/61dc60d9-c888-4636-836e-2a75ff9f0567>
UK: https://jobs.lever.co/veeva/f0e989b5-9d14-4f82-baaa-2fc56a76ba16
Netherlands: https://jobs.lever.co/veeva/2bf92570-a680-40e8-96b0-a8629e3feac7
--
Ehsan Khoddam
Data Science Manager - Medical NLP
Link Data Science
Veeva Systems
m +31623213197
ehsan.khoddam(a)veeva.com
[apologies if you received multiple copies of this call]
We are pleased to invite abstract submissions for session 3, "Large
Language Models," at the upcoming "1st Conference of the German AI Service
Centers (KonKIS24)" with a focus on "Advancing Secure AI in Critical
Infrastructures for Health and Energy." Please visit the main event page
https://events.gwdg.de/event/615/ for more details.
We encourage submissions that align with the conference's theme,
particularly in the following areas:
- *Pretraining Techniques for LLMs*: Exploring foundational strategies
and algorithms.
- *Testing and Evaluating LLM Fitness*: Methods for assessing
performance on well-known tasks and benchmarks.
- *Application of LLMs in Scientific Research*: Case studies and
examples of LLMs driving discovery and innovation.
- *Innovative Insights Generation*: Strategies for leveraging LLMs to
generate novel insights and accelerate research outcomes.
- *Challenges and Solutions in LLM Application*: Discussing the
practical challenges and potential solutions in scientific research.
Accepted abstracts will be featured through short presentations during the
session. The conference will take place on September 18-19 in picturesque
Göttingen. For more information, to submit an abstract, book a stand, or
register, please visit the program homepage
https://events.gwdg.de/event/615/program.
Feel free to contact me (jennifer[dot]dsouza[at]tib[dot]eu) directly with
any questions about this session.
Dear all,
We are excited to announce the 7th FEVER workshop and shared task, co-located with EMNLP 2024. The full CFP is here: https://fever.ai/workshop.html; below are some highlights:
New Shared Task: In this year’s workshop we will organise a new fact-checking shared task, AVeriTeC: A Dataset for Real-world Claim Verification with Evidence from the Web. It will consist of claims that are fact-checked using evidence from the web. For each claim, systems must return a label (Supported, Refuted, Not Enough Evidence, Conflicting Evidence/Cherry-picking) and appropriate evidence. The evidence must be retrieved from the document collection provided by the organisers or from the Web (e.g. using a search API). For more information, see our shared task page<https://fever.ai/task.html>.
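For illustration only (the authoritative submission format is given on the shared task page), a single prediction pairs a verdict label with the evidence the system retrieved for the claim; the field names below are hypothetical, not the official schema.

    # Hypothetical prediction object for one claim.
    prediction = {
        "claim_id": 0,
        "label": "Refuted",  # one of: Supported, Refuted, Not Enough Evidence,
                             #         Conflicting Evidence/Cherry-picking
        "evidence": [
            {"url": "https://example.org/article",
             "text": "Retrieved snippet supporting the verdict."},
        ],
    }
    print(prediction)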
The timeline for it is as follows:
* Training/dev data release: April 2024
* Test data release: July 10, 2024
* Shared task deadline: July 20, 2024
* Shared task submission due: August 15, 2024
We invite long and short papers on all topics related to fact extraction and verification, including:
* Information Extraction
* Semantic Parsing
* Knowledge Base Population
* Natural Language Inference
* Textual Entailment Recognition
* Argumentation Mining
* Machine Reading and Comprehension
* Claim Validation/Fact checking
* Question Answering
* Information Retrieval and Seeking
* Theorem Proving
* Stance detection
* Adversarial learning
* Computational journalism
* Descriptions of systems for the FEVER<http://fever.ai/2018/task.html>, FEVER 2.0<http://fever.ai/2019/task.html>, FEVEROUS<https://fever.ai/2021/task.html> and AVeriTeC<https://fever.ai/dataset/averitec.html> Shared Tasks
Important dates:
* Submission deadline: August 15, 2024 (ARR and non-ARR submission deadline)
* Commitment deadline: September 23, 2024
* Notification: September 27, 2024
* Camera-ready deadline: October 4, 2024
* Workshop: November 15 or 16, 2024
All deadlines are 11.59 pm UTC -12h ("anywhere on Earth").
Feel free to contact us on our slack channel<https://join.slack.com/t/feverworkshop/shared_invite/zt-4v1hjl8w-Uf4yg~dift…> or via email: fever-organisers(a)googlegroups.com with any questions.
Looking forward to your participation!
--
The FEVER workshop organizers
Hi everyone,
Please find below a request for participation in a very short study for one of my students' bachelor's theses.
Best,
Dominik
-------- Forwarded Message --------
Subject: Searching participants for my quick study
Date: Thu, 13 Jun 2024 09:38:22 +0000
From: Wolkober, Marcel <st163937(a)stud.uni-stuttgart.de>
To: dominik.schlechtweg(a)ims.uni-stuttgart.de <dominik.schlechtweg(a)ims.uni-stuttgart.de>
Hello!
For my bachelor thesis I need participants in my quick online study.
It will take approximately 5 to 10 minutes to complete and is in English. You can use your smartphone, but it's recommended to use a PC browser.
You can access the study here: https://semantic-nlp-captcha.de/study
Everything else will be explained there. If you have trouble on mobile, activate desktop mode.
It would be a great help if you could forward this study to others. Thanks!
Best wishes,
Marcel Wolkober