Dear all,
We are happy to release six corpora (1.3 million tokens) with full morphological annotations for the Palestinian, Lebanese, Yemeni, Iraqi, Libyan, and Sudanese dialects. All are annotated using the LDC's SAMA tag sets.
Search: https://portal.sina.birzeit.edu/curras
Download: https://portal.sina.birzeit.edu/curras/about-en.html
This video demonstrates how to search the corpora in Arabic/English.
https://twitter.com/mjarrar/status/1604078695068598273
#arabic_language_day We are very happy to release 6 Arabic dialect corpora (1.3 million tokens, morphologically annotated): Curras (Palestinian), Baladi (Lebanese), Lisani (Yemeni, Iraqi, Libyan, Sudanese) by @UN, @BirzeitU and @AUB_Lebanon. https://t.co/ZP3hqVSRWc

Best
--Mustafa
__________________________
Mustafa Jarrar, PhD
Professor of Artificial Intelligence
Chair, PhD Program in Computer Science
Birzeit University, Palestine
WhatsApp: +972599662258 | mjarrar(a)birzeit.edu
http://www.jarrar.info
Dear all --
In celebration of Arabic Language Day (Dec 18), we are happy to announce
the first release of Maknuune, the Open Source Palestinian Arabic Lexicon.
www.palestine-lexicon.org
Maknuune has over 36K entries covering 17K lemmas and 3.7K roots. All entries
include diacritized Arabic orthography, phonological transcription, and
English glosses. Some entries are enriched with additional information such
as broken plurals and templatic feminine forms, associated phrases and
collocations, Standard Arabic glosses, and examples or notes on grammar,
usage, or the location where the entry was collected.
We are honored to have received comments of endorsement from Profs. Noam
Chomsky, Hamid Dabashi, Abdelkader Fassi Fehri, Clive Holes, Ilan Pappe,
and Dr. Walid Saif.
https://sites.google.com/nyu.edu/palestine-lexicon/endorsements
--
Nizar Habash
Professor of Computer Science
New York University Abu Dhabi
A fully funded 4-year PhD position at the intersection of NLP and Topology
is offered at Queen Mary University of London (QMUL), School of Electronic
Engineering and Computer Science. It is part of the collaboration scheme
between QMUL and the China Scholarship Council (CSC), and is therefore
available to Chinese candidates only.
The CSC scheme provides a full tuition fee waiver and a living stipend for 4
years, and requires (among other things) an English language test (IELTS)
taken within the last 2 years. You can read more about the scheme's
requirements here:
<https://www.qmul.ac.uk/scholarships/items/china-scholarship-council-scholar…>
I am looking for brilliant candidates who hold (or are about to hold) an MSc
in Computer Science with a strong NLP research background. Prospective
students can learn more about the project here
<http://eecs.qmul.ac.uk/phd/phd-studentships/csc-phd-studentships-in-electro…>,
under the section: *Understanding neural representations via their
algebraic-topological structures*. The PhD student will work in an
interdisciplinary environment and will be at the forefront of NLP research.
If you are interested, please get in touch with me at
h.dubossarsky(a)qmul.ac.uk.
Best,
Haim
We kindly invite you to participate in LongEval 2023, a shared task on longitudinal evaluation of NLP models at CLEF 2023.
CALL FOR CONTRIBUTION
LongEval @ CLEF 2023
Longitudinal Evaluation of Model Performance
https://clef-longeval.github.io
Lab description:
LongEval aims to identify the types of models that offer better temporal persistence for NLP tasks on data that evolves over time, across both shorter and longer periods. LongEval is built on a common framework for its Information Retrieval and Text Classification tasks: for each system, we evaluate its performance when operating on test data acquired at the same time t as the training data, when operating on data acquired shortly after time t (sub-task 1), and when operating on data acquired at a later time t' (long after time t, sub-task 2). For each sub-task of each task, two evaluation measures are proposed: an absolute quality measure, and the relative drop compared to the system's result on the initial time-t test set.
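The relative-drop measure described above can be sketched as follows (function and variable names are my own; the lab defines the official scoring procedure):

```python
# Sketch of the two evaluation measures described above: an absolute quality
# score per test set, and the relative drop versus the initial time-t result.
def relative_drop(score_t: float, score_later: float) -> float:
    """Relative performance drop between the time-t test set and a later one."""
    return (score_t - score_later) / score_t

# A system scoring 0.80 at time t and 0.72 on a later test set
# shows a 10% relative drop.
drop = relative_drop(0.80, 0.72)
```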
LongEval 2023 proposes two tasks:
• Task 1: Information Retrieval. For this task, the data is a sequence of Web document collections and queries, each containing a few million documents and hundreds of queries, provided by Qwant. Relevance assessments are to be computed using a Click Model acquired from real users of the Qwant search engine. As the initial corpus contains only French documents, an automatic translation into English will be provided.
• Task 2: Text Classification. For this task, the training data is the TM-Senti sentiment analysis dataset extended with a development set and three human-annotated novel test sets for submission evaluation. TM-Senti is a general large-scale Twitter sentiment dataset in the English language, spanning over a 9-year period from 2013 to 2021. Tweets are labeled for sentiment as either “positive” or “negative”. The annotation is performed using distant supervision based on a manually curated list of emojis and emoticons.
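The distant-supervision scheme described for TM-Senti can be sketched as below; the emoji and emoticon entries are illustrative stand-ins, not the actual manually curated TM-Senti lexicon:

```python
# Minimal sketch of distant supervision: label tweets "positive" or
# "negative" from hits against a curated emoji/emoticon lexicon.
# These lexicon entries are illustrative, not the real TM-Senti list.
POSITIVE = {"😊", "😍", ":)", ":-)"}
NEGATIVE = {"😢", "😠", ":(", ":-("}

def distant_label(tweet):
    """Return 'positive', 'negative', or None if there is no clear signal."""
    pos = any(mark in tweet for mark in POSITIVE)
    neg = any(mark in tweet for mark in NEGATIVE)
    if pos and not neg:
        return "positive"
    if neg and not pos:
        return "negative"
    return None  # no hit, or conflicting hits: discard from the training set
```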
You can register for the task at: https://clef2023-labs-registration.dei.unipd.it/
Lab Organizers:
Rabab Alkhalifa, Iman Bilal, Hsuvas Borkakoty, Jose Camacho-Collados, Romain Deveaud, Alaa El-Ebshihy, Luis Espinosa-Anke, Gabriela Gonzalez-Saez, Petra Galuscakova, Lorraine Goeuriot, Elena Kochkina, Maria Liakata, Daniel Loureiro, Harish Tayyar Madabushi, Philippe Mulhem, Florina Piroi, Martin Popel, Christophe Servan, and Arkaitz Zubiaga.
Important dates:
Release of training data: 03/01/2023
Release of test data: 30/04/2023
Runs submissions date: 30/06/2023
LongEval Workshop: during CLEF 2023, Thessaloniki, 18-21 September 2023.
Workshop on Language-Based AI Agent Interaction with Children
https://aichildinteraction.github.io/
February 21st, 2023, in Los Angeles, USA & Virtual (Hybrid Format)
Paper Submission Deadline: January 13th, 2023 (extended)
Easychair: https://easychair.org/conferences/?conf=aiaic23
Contact: https://groups.google.com/g/ai-child-interactions or
aichildinteraction(a)gmail.com
===================================================
In this workshop, we aim to bring together researchers looking into
multimodal interactions between children and artificial agents to
discuss research problems that center around interactivity and go beyond
just processing child speech. We are interested in discussing approaches
to collecting and annotating datasets involving child speech, intent
classification in child speech, designing dialogue flow with artificial
agents that primarily interact with children, as well as repair
strategies, active listening behavior, and other aspects of dialogue
modeling. Moreover, multiparty conversations involving several children,
children and their adult caregivers, or several artificial agents are of
particular interest to this workshop.
Acknowledging the early-stage nature of research in this area, the
workshop will invite short position papers as contributions. In addition
to selected talks that will be invited based on the submitted papers, we
will host roundtable discussions allowing attendees to discuss ideas,
share challenges they have faced, and highlight ideas for future research.
## Topics of Interest
The workshop welcomes contributions across a wide range of topics
including, but not limited to:
Natural Language Understanding of Child Speech
Dialogue Modeling of Child-Agent, Child-Robot, Child-Child, and
Child-Adult Speech
Conversational Flow and Repair in Dialogue with Children
Multiparty-Interaction Involving Children
Multimodal Processing of Child Interactions
Automatic Speech Recognition of Child Speech
Evaluating Child Interactions with Artificial Agents/Robots
Challenges in Designing Interactions for Children
Datasets of Child-Child, Child-Adult, or Child-Agent/Robot Interaction
Ethics and Responsible AI for Child-Agent/Robot Interaction
Related Topics
## Important Dates
- Paper submission deadline: January 13th, 2023 (extended)
- Author Notification deadline: February 1st, 2023
- Workshop: February 21st, 2023 (morning session)
## Submission Guidelines
We invite short position papers of 3-4 pages (plus additional pages for
references and appendices, with no page limit), including work in
progress containing preliminary results, technical reports, case
studies, surveys, and state-of-the-art research in language-based AI
agent interactions with children. Recently submitted or published papers
are welcome to be submitted to this workshop if they are highly relevant
to the topic of the workshop. Please select the appropriate track during
the EasyChair submission to mark the submission accordingly.
Papers will be reviewed for their relevance, novelty, and scientific and
technical soundness.
Submissions do not need to be anonymized for review. All manuscripts
must be written in English and submitted electronically in PDF format
via EasyChair: https://easychair.org/conferences/?conf=aiaic23
Accepted papers will be published on the workshop website. However,
papers are still considered non-archival and can be submitted to other
conferences.
Authors of accepted papers are expected to present their paper during
the workshop in the form of a short talk, which can either be given in
person in Los Angeles, USA, or virtually via Zoom.
Authors should use the official IWSDS template:
Latex Style and Template:
https://drive.google.com/open?id=1mnzjvTlIVEsdPb2IZXbzxU8WRJj3mLiJ
Overleaf: https://www.overleaf.com/read/djcrwzgrdjvj
Word Template:
https://drive.google.com/open?id=1WmO9iLvJtO0cH1E0VSC1bPsC0vRDpzbd
## Contact
If you have questions, please get in touch via our public Google Group
https://groups.google.com/g/ai-child-interactions or by sending an
e-mail to aichildinteraction(a)gmail.com
Dear All,
CNRS offers one permanent research position (research fellow) in computer science for under-resourced languages.
https://gestionoffres.dsi.cnrs.fr/fo/offres/detail-en.php?&offre_id=18
Details about the selection process and the kinds of positions offered by CNRS are presented here:
https://www.cnrs.fr/en/competitive-entrance-examinations-researchers-womenm…. Note that these are permanent (non-tenured) positions, after a probationary period of one year.
Lattice is a lab in Paris (https://lattice.cnrs.fr/, funded by CNRS, Ecole normale supérieure-PSL and Université Sorbonne nouvelle) with a strong team both in linguistics and in natural language processing. Lattice would be happy to host the above position (candidates are free to mention in their application the lab that best suits their research projects among CNRS labs in linguistics with a strong NLP component).
Candidates with a strong CV who would like to apply at Lattice for this position can contact Sophie Prévost or Thierry Poibeau (both firstname.lastname(a)ens.psl.eu) with a CV and a research proposal. A record of international publications (including major computer science conferences) is mandatory.
Best regards,
Thierry Poibeau
Dear colleagues,
We are pleased to share information about The 3rd Workshop on Financial
Technology on the Web (FinWeb) with you. FinWeb-2023 will be held from April
30 to May 4, 2023, in conjunction with The Web Conference 2023. We invite the
submission of papers on original research in this area. A USD 500 prize will
be awarded to the Best Paper Award winner in the main track. In addition to
the main track, there is a shared task co-located with FinWeb-2023: the
Exaggerated Numeral Detection (ExNum) shared task.
*Submission Deadline: Feb. 06, 2023*
The proceedings of the workshops will be published jointly with the
conference proceedings.
Please refer to the site of FinWeb-2023 for more details:
https://sites.google.com/nlg.csie.ntu.edu.tw/finweb-2023/home
Sincerely,
Chung-Chi Chen, Hen-Hsen Huang, Hiroya Takamura, Hsin-Hsi Chen
FinWeb-2023 Organizers
---
陳重吉 (Chung-Chi Chen), Ph.D.
Researcher
Artificial Intelligence Research Center, National Institute of Advanced
Industrial Science and Technology, Japan
E-mail: c.c.chen(a)acm.org <cjchen(a)nlg.csie.ntu.edu.tw>
Website: http://cjchen.nlpfin.com
The Centre for English Corpus Linguistics (CECL) is organizing the fifth edition of its Learner Corpus Research Summer School, which will take place at the University of Louvain, Belgium, from 3 to 7 July 2023.
The aim of the Summer School is to introduce learner corpus research through a series of lectures, workshops and hands-on sessions. It is intended both for researchers (junior and seasoned) who have recently embarked on a learner corpus project and for those who simply want to know more about this exciting field of research. No prior knowledge of learner corpus research is expected.
Teaching staff: Sylvie De Cock, Gaëtanelle Gilquin, Sylviane Granger, Marie-Aude Lefer, Magali Paquot and Jennifer Thewissen.
The following topics will be covered:
Learner corpus design/compilation
Learner corpus annotation (including error annotation)
Learner corpus analysis (including data coding, Contrastive Interlanguage Analysis, Integrated Contrastive Model, introduction to statistics for learner corpus research)
Interpreting learner corpus data
Applications of learner corpus research
Hands-on sessions will help participants familiarize themselves with a range of software tools for linguistic analysis, and with native and learner corpora, in particular the International Corpus of Learner English and the Louvain International Database of Spoken English Interlanguage.
Participants will have the opportunity to give a brief presentation of their project at the Learner Corpus Colloquium which will take place on the first day.
The Summer School will also feature one-to-one sessions with members of the CECL team to answer questions related to the design of individual projects.
Registration will open on 23 January 2023. Please note that the number of participants is limited to 30.
Information about the Summer School can be obtained from the following website: https://uclouvain.be/en/research-institutes/ilc/cecl/learner-corpus-researc…
Dear colleagues,
*Research in Corpus Linguistics* (RiCL, ISSN 2243-4712), the official
journal of the *Spanish Association for Corpus Linguistics* (AELINCO), is
seeking contributions to be published in the first issue of 2023 – June
(11/1).
Authors interested in publishing their work in this issue should submit
their contributions by February 15.
RiCL invites previously unpublished submissions in four main forms:
1. Papers reporting on research based on or derived from corpora.
2. Research papers reporting on corpus construction, annotation, and the
development and application of corpus tools, software, etc.
3. Book reviews in the field of Corpus Linguistics.
4. Review articles in the field of Corpus Linguistics.
Further information about RiCL’s editorial policies can be found here
<https://urldefense.com/v3/__https://ricl.aelinco.es/index.php/ricl/editoria…>.
Specific areas of interest include corpus design, compilation, and
typology; discourse, literary analysis and corpora; corpus-based
grammatical studies; corpus-based lexicology and lexicography; corpora,
contrastive studies and translation; corpus and linguistic variation;
corpus-based computational linguistics; corpora, language acquisition and
teaching; and special uses of corpus linguistics.
Please note that submissions must adhere to the submission guidelines
<https://urldefense.com/v3/__https://ricl.aelinco.es/index.php/ricl/about/su…>
of RiCL.
Further information at: https://ricl.aelinco.es
All the best,
Paula Rodríguez-Puente and Carlos Prado-Alonso
*Editors-in-chief*
--
Paula Rodríguez Puente
paula.r.puente(a)gmail.com
http://www.usc-vlcg.es/PRP.htm
In this newsletter:
LDC 2023 membership discounts now available
Approaching deadline for Spring 2023 data scholarship applications
30th Anniversary Highlight: AMR
________________________________
New publications:
CAMIO Transcription Languages<https://catalog.ldc.upenn.edu/LDC2022T07>
Global TIMIT Thai<https://catalog.ldc.upenn.edu/LDC2022S13>
Third DIHARD Challenge Evaluation<https://catalog.ldc.upenn.edu/LDC2022S14>
LDC 2023 membership discounts now available
Now through March 1, 2023, current 2022 members receive a 10% discount for renewing their membership, and new or returning organizations receive a 5% discount. Membership remains the most economical way to access current and past LDC releases. Consult Join LDC<https://www.ldc.upenn.edu/members/join-ldc> for details on membership options and benefits.
Approaching deadline for Spring 2023 data scholarship applications
Attention students: don't miss out on the chance to receive no-cost access to LDC data for your research. Applications for Spring 2023 data scholarships are due January 15, 2023. For more information on requirements and program rules, see LDC Data Scholarships<https://www.ldc.upenn.edu/language-resources/data/data-scholarships>.
30th Anniversary Highlight: AMR
Abstract Meaning Representation (AMR) annotation was developed by LDC, SDL/Language Weaver, Inc., the University of Colorado's Computational Language and Educational Research group, and the Information Sciences Institute at the University of Southern California. It is a semantic representation language that captures "who is doing what to whom" in a sentence. Each sentence is paired with a rooted, directed graph that represents its whole-sentence meaning. AMR utilizes PropBank frames, non-core semantic roles, within-sentence coreference, named entity annotation, modality, negation, questions, quantities, and so on to represent the semantic structure of a sentence largely independent of its syntax.
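For illustration, a standard example from the AMR literature: the sentence "The boy wants to go" is written in PENMAN notation as a graph in which the reentrant variable b captures that the boy is both the wanter and the goer (frame sense numbers here are illustrative):

```
(w / want-01
   :ARG0 (b / boy)
   :ARG1 (g / go-01
            :ARG0 b))
```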
LDC's Catalog contains three cumulative English AMR publications: Release 1.0 (LDC2014T12<https://catalog.ldc.upenn.edu/LDC2014T12>), Release 2.0 (LDC2017T10<https://catalog.ldc.upenn.edu/LDC2017T10>), and Release 3.0 (LDC2020T02<https://catalog.ldc.upenn.edu/LDC2020T02>). The combined result in AMR 3.0 is a semantic treebank of 59,255 English natural language sentences from broadcast conversations, newswire, weblogs, web discussion forums, fiction, and web text, and includes multi-sentence annotations.
LDC has also published Chinese Abstract Meaning Representation 1.0 (LDC2019T07<https://catalog.ldc.upenn.edu/LDC2019T07>) and 2.0 (LDC2021T13<https://catalog.ldc.upenn.edu/LDC2021T13>), developed by Brandeis University and Nanjing Normal University. These corpora contain AMR annotations for approximately 20,000 sentences from Chinese Treebank 8.0 (LDC2013T21<https://catalog.ldc.upenn.edu/LDC2013T21>). Chinese AMR follows the basic principles developed for English, making adaptations where necessary to accommodate Chinese phenomena.
Abstract Meaning Representation 2.0 - Four Translations (LDC2020T07<https://catalog.ldc.upenn.edu/LDC2020T07>), developed by the University of Edinburgh, School of Informatics, consists of Spanish, German, Italian, and Chinese Mandarin translations of a subset of sentences from AMR 2.0.
Visit LDC's Catalog <https://catalog.ldc.upenn.edu/> for more details about these publications.
________________________________
New publications:
CAMIO Transcription Languages<https://catalog.ldc.upenn.edu/LDC2022T07> was developed by LDC and contains nearly 70,000 images of machine printed text with corresponding annotations and transcripts in 13 languages: Arabic, Chinese, English, Farsi, Hindi, Japanese, Kannada, Korean, Russian, Tamil, Thai, Urdu, and Vietnamese. This corpus is a subset of data created for a broader effort to support the development and evaluation of optical character recognition and related technologies for 35 languages across 24 unique script types.
Most images were annotated for text localization, resulting in over 2.3M line-level bounding boxes; 1250 images per language were also annotated with orthographic transcriptions of each line plus specification of reading order, yielding over 2.4M tokens of transcribed text. The resulting annotations are represented in an XML output format defined for this corpus. Data for each language is partitioned into test, train, or validation sets.
2022 members can access this corpus through their LDC accounts. Non-members may license this data for a fee.
*
Global TIMIT Thai<https://catalog.ldc.upenn.edu/LDC2022S13> consists of 12 hours of read speech and time-aligned transcripts in Standard Thai from 50 speakers (33 female, 17 male) reading 120 sentences selected from the Thai National Corpus<https://www.arts.chula.ac.th/ling/tnc/>, the Thai Junior Encyclopedia<https://www.au.edu/royal-activities/the-thai-encyclopedia-for-youth-project…>, and Thai Wikipedia, for a total of 6000 utterances. Data was collected in 2016. Speakers were recruited in the Bangkok metropolitan area; they were native Thais, fluent in Standard Thai, and literate.
This data set was developed as part of LDC's Global TIMIT project which aims to create a series of corpora in a variety of languages with a similar set of key features as in the original TIMIT Acoustic-Phonetic Continuous Speech Corpus (LDC93S1)<https://catalog.ldc.upenn.edu/LDC93S1> which was designed for acoustic-phonetic studies and for the development and evaluation of automatic speech recognition systems.
2022 members can access this corpus through their LDC accounts. Non-members may license this data for a fee.
*
Third DIHARD Challenge Evaluation<https://catalog.ldc.upenn.edu/LDC2022S14> was developed by LDC and contains 33 hours of English and Chinese speech data along with corresponding annotations used in support of the Third DIHARD Challenge<https://dihardchallenge.github.io/dihard3>.
The DIHARD third development and evaluation sets were drawn from diverse sources including monologues, map task dialogues, broadcast interviews, sociolinguistic interviews, meeting speech, speech in restaurants, clinical recordings, and amateur web videos. Annotations include diarization and segmentation.
2022 members can access this corpus through their LDC accounts. Non-members may license this data for a fee.
To unsubscribe from this newsletter, log in to your LDC account<https://catalog.ldc.upenn.edu/login> and uncheck the box next to "Receive Newsletter" under Account Options; or contact LDC for assistance.
Membership Coordinator
Linguistic Data Consortium<ldc.upenn.edu>
University of Pennsylvania
T: +1-215-573-1275
E: ldc(a)ldc.upenn.edu<mailto:ldc@ldc.upenn.edu>
M: 3600 Market St. Suite 810
Philadelphia, PA 19104