Call for papers
Workshop Discourse studies and linguistic data science: Addressing challenges in interoperability, multilinguality and linguistic data processing - DiSLiDaS 2023
University of Vienna, Vienna, Austria
12-13 September 2023 (TBA)
Website: http://dislidas.mozajka.co/
The fourth biennial conference on Language, Data and Knowledge (LDK 2023) (http://2023.ldk-conf.org/) and COST Action CA18209 NexusLinguarum (https://nexuslinguarum.eu/) are glad to announce the second workshop Discourse studies and linguistic data science: Addressing challenges in interoperability, multilinguality and linguistic data processing - DiSLiDaS 2023.
*Conference aims and topics*
The workshop aims to follow up on the topics discussed during DiSLiDaS 2022 (https://dislidas.mozajka.co/?page_id=211) and to gather current research advances in discourse analysis and representation, in the context of multilinguality, from a linguistic and computational perspective. We invite submissions addressing challenges such as interoperability, linguistic linked open data (LLOD), and language processing and analysis.
The workshop topics include, but are not limited to, the following:
● Discourse and dialogue annotation: Parsing and representation across languages and frameworks
● Discourse markers and discourse relations (RST, PDTB, SDRT): Identification, prediction and extraction
● Attitudes discovery and interpretation in Discourse: Appraisal and sentiment
● Effects of multimodality on discourse interpretation: Intonation, gesture and text
● Interoperability for Multilingual language data: Challenges of rich and distributed data
● Discourse data and machine learning: Methods and tools
Discourse comprises a wide variety of linguistic phenomena, such as discourse markers, discourse relations, and speaker attitude, which have been studied extensively by different communities of practice in linguistics and computation. This work has yielded several theoretical frameworks (for instance, RST, SDRT and PDTB for discourse relations, and appraisal theory for sentiment analysis) and technological approaches, such as transformer models, embeddings and the like. Nonetheless, open issues remain concerning interoperability, multilinguality, and language processing: in particular, the existence of different annotation schemas, disambiguation, the lack of training data for machine learning, the scarcity of effective methods for detecting and interpreting language phenomena, diverse vocabularies, insufficient multilingual parallel corpora of dialogue and non-dialogue texts, and the early stage of research on multimodality.
Discourse research is also one of the central research areas of natural language processing (NLP). NLP research focuses on the formalisation, identification and discovery of semantic phenomena, dialogue exchange structure, and text coherence. Its technological approaches include transformer models, word embeddings, linguistic linked open data, the construction of aligned multilingual corpora, vocabularies of language phenomena and the like. Computational discourse research explores the evidence that language consists not only of placing words in the right order but also of detecting and interpreting meaning and deeper textual relations and organising ideas into a logical flow. Linguistic approaches study the phenomena underlying the coherence and cohesion of discourse and the lexical, phrasal, syntactic, semantic and pragmatic means of expressing discourse relations; they also represent the roles of these means and build language resources for them.
Despite all the advances, many problems related to interoperability, multilinguality, and language processing remain unresolved. With the growth of the Semantic Web and Linguistic Linked Data, interoperability is key to reading, interpreting and adopting language resources. The existence of different annotation schemas for encoding discourse relations is a problem for data exchange and reuse and for theoretical consistency. The treatment of multilinguality is also complicated by the shortage of multilingual parallel corpora of dialogue and non-dialogue texts, which would allow systematic contrastive studies. As for language processing, the lack of training data for machine learning, the scarcity of effective methods for detecting and interpreting language phenomena, the coexistence of diverse vocabularies, and the minimal attention paid to the contribution of tone of voice, intonation and gestures to the meaning and informative value of discourse elements still make discourse processing very challenging.
The workshop intends to be a discussion forum for researchers interested in addressing the aforementioned challenges and advancing the state of the art in discourse studies and linguistic data science.
*Programme*
The Scientific Programme will include one invited talk and oral presentations.
Invited Speaker
Johan Bos, University of Groningen
*Submissions*
Submissions can be in the form of:
• long papers: 9–12 pages;
• short papers: 4–6 pages.
All submission lengths are given including references. Accepted submissions will be published by ACL in an open-access conference proceedings volume, free of charge for authors. The ACL templates should therefore be used for all conference submissions. As the reviewing process is single-blind, submissions should not be anonymised.
The workshop will be hybrid (face-to-face and remote). Note that at least one author of each accepted paper must register to present the paper at the workshop (either remotely or on-site). There is no registration fee for participating in DiSLiDaS 2023.
Papers must be submitted electronically via EasyChair:
https://easychair.org/conferences/?conf=dislidas2023
*Important dates*
Time Zone: Anywhere on Earth
Papers due: May 19, 2023
Paper acceptance notifications: June 16, 2023
Camera-ready papers due: June 30, 2023
*Programme Committee*
Elena-Simona Apostol, University Politehnica of Bucharest, Romania
Harry Bunt, Tilburg University, Netherlands
Maria Josep Cuenca, Universitat de València
Debopam Das, Humboldt University of Berlin, Germany
Jorge Garcia, University of Zaragoza, Spain
Mikel Iruskieta, University of the Basque Country, Spain
António Leal, University of Porto, Portugal
Chaya Liebeskind, Jerusalem College of Technology, Israel
Amália Mendes, University of Lisbon, Portugal
Maciej Ogrodniczuk, Polish Academy of Sciences, Poland
Giedre Valunaite Oleskevicienė, Mykolas Romeris University, Lithuania
Georg Rehm, DFKI GmbH, Germany
Ted Sanders, Utrecht University, Netherlands
Merel Scholman, University of Saarland, Germany
Dimitar Trajanov, Ss. Cyril and Methodius University, North Macedonia
Radoslava Trnavac, University of Belgrade, Serbia
Ciprian-Octavian Truica, University Politehnica of Bucharest, Romania
Amir Zeldes, Georgetown University, USA
*Organising Committee*
Purificação Silvano, University of Porto, Portugal
Mariana Damova, Mozaika, Ltd., Bulgaria
Christian Chiarcos, Goethe-Universität, Germany
Anna Bączkowska, University of Gdansk, Poland
*Contact*
organizers(a)dislidas.mozajka.co
[Apologies for cross-posting]
22nd EDITION OF THE SEPLN AWARD FOR THE BEST DOCTORAL THESIS IN NATURAL LANGUAGE PROCESSING
[Submission deadline: May 2nd, 2023]
The Spanish Society for Natural Language Processing (Sociedad Española para el Procesamiento del Lenguaje Natural, SEPLN) announces the 22nd edition of the SEPLN Award for the Best Doctoral Thesis in Natural Language Processing, which will be governed by the following rules:
The purpose of this award is the promotion and dissemination of research in the field of natural language processing.
The winning thesis will be awarded a compact portable computer (tablet). The award will be presented at the 39th International Congress of the Spanish Society for Natural Language Processing (SEPLN 2023), after a brief presentation of the award-winning work by the author.
To be eligible, the author of the doctoral thesis must be a member of the SEPLN at the time of submitting the work. No contestant may participate as an author of more than one work.
Doctoral theses defended during the year 2022, written in a language of the Spanish State or in English, may be submitted.
In addition to the complete thesis, it is essential to send:
A brief 4-page summary of the thesis, clearly describing the topic and the relevance of the research, the objectives, methods, results achieved and contributions.
A brief description of the scientific career of the author of the thesis, detailing participation in scientific activities such as the organization of competitive tasks and congresses, the creation of open-access resources such as datasets and language models, and participation in projects, contracts, and/or patents.
The jury will award the prize on the basis of the quality of the presentation, the technical and methodological correctness, the relevance and originality of the work, the generation, evaluation and publication of resources, and the research trajectory during the pre-doctoral period.
Works must be submitted in PDF format through the website of the Society's journal (http://journal.sepln.org) before May 2nd, 2023.
The final decision will be communicated during the 39th International Congress of the Spanish Society for Natural Language Processing (SEPLN 2023).
Submission instructions (http://www.sepln.org/sites/default/files/noticia/documentos_relacionados/20…)
For more information: aitziber.atucha(a)ehu.eus
Dear colleagues,
we have a new PhD vacancy in the field of speech and text anonymization in the medical domain in Berlin, Germany.
The position is part of the “Medinym” project at the Quality and Usability Lab of Technische Universität Berlin (Berlin Institute of Technology).
We are looking for a candidate at Researcher or Junior Researcher level and offer a 2-year contract with the option of extension and the prospect of a PhD.
Application deadline: Feb 28
More details and how to apply:
TU Berlin: https://www.jobs.tu-berlin.de/en/job-postings/161912
Please circulate this to anyone who may be interested. Many thanks!
If you have any questions, please contact me; I'm happy to help.
Best regards from Berlin,
Tim
--
Dr.-Ing. Tim Polzehl
Associate Senior Researcher
Technische Universität Berlin
Quality and Usability Lab
Ernst-Reuter-Platz 7
D-10587 Berlin, Germany
Email: tim.polzehl(a)qu.tu-berlin.de
Web: www.qu.tu-berlin.de
Dear all,
We are excited to announce an open full-time position for a researcher (PhD possible, salary grade E 13 TV-L Berliner Hochschulen) in the field of speech signal analysis and assessment of speech quality in different mobile and fixed networks. The ideal candidate will have a passion for analyzing speech signals in listening-only and conversational situations, and will be responsible for developing signal-based and parametric models for the estimation of speech quality.
One of the main focuses of the research will be the evaluation of new speech codecs in different network scenarios. Additionally, the models will be validated based on subjective listening and conversation tests. For this purpose, methods of crowdsourcing can be applied, where real users will carry out data collection and/or evaluation via an online platform. It will be scientifically interesting to compare crowdsourced data to those obtained under laboratory conditions.
The position is located in Berlin, Germany at the Quality and Usability Lab of Technische Universität Berlin.
If you are interested in joining our team and have a background in speech signal analysis or quality assessment, please see the full job description here: https://tubcloud.tu-berlin.de/s/spSGFYipWsPsDBq
We look forward to hearing from you!
Best regards,
Stefan Hillmann
--
Dr.-Ing. Stefan Hillmann
(er/sein, he/his)
Wissenschaftlicher Mitarbeiter
Senior Researcher
Technische Universität Berlin
Fakultät IV, Elektrotechnik und Informatik
Quality and Usability Lab
EECS, Electrical Engineering and Computer Science
Quality and Usability Lab
Straße des 17. Juni 135, 10623 Berlin
GERMANY
stefan.hillmann(a)tu-berlin.de
https://tu.berlin/index.php?id=29495
ORCID: https://orcid.org/0000-0002-0795-9834
https://www.tu.berlin/qu
Dear CORPORA list members,
I am searching for publications on ethics and NLP *in languages other than
English*. I already have a fairly comprehensive list of French-language
papers, and would like to ensure that I cover other languages, as well.
But, being an American, of course I am monolingual, and don't know how to
search for relevant non-English publications. Any help would be
much appreciated.
Best wishes for a happy Thursday (and I was kidding--I do speak French),
Kevin Cohen
--
Kevin Bretonnel Cohen, PhD
Director, Biomedical Text Mining Group
Computational Bioscience Program, U. Colorado School of Medicine
D'Alembert Chair in Natural Language Processing for the Biomedical Domain
(Emeritus),
LIMSI, CNRS, Université Paris-Saclay
303-916-2417
http://compbio.ucdenver.edu/Hunter_lab/Cohen
Edge Hill Corpus Research Group
Next meeting: Thursday 2 March 2023, 2-4 pm (UK time)
Topic: Manual Annotation for Discourse-Oriented Corpus Studies
Registration (free): https://store.edgehill.ac.uk/conferences-and-events/conferences/events/edge…
Presentations:
-- Katia Adimora (Edge Hill University): Annotating Mexican immigration discourses
-- Dan Malone (Edge Hill University): A lone wolf from the ISIS pack: Hunting discourses through manual annotation
Abstracts: https://sites.edgehill.ac.uk/crg/next
Deadline extension: 05.03.2023
Language Technologies and Digital Humanities: Resources and Applications (LTaDH-RA)
CLaDA-BG 2023 Conference
https://clada-bg.eu/en/dissemination/events/international-clada-bg-conferen…
Sofia, Bulgaria
10-12 May 2023
CLaDA-BG is the Bulgarian national research infrastructure for resources and technologies for linguistic, cultural and historical heritage, integrated within CLARIN EU and DARIAH EU. Its mission is to provide access to the necessary resources and technologies that would support the research in Social Sciences and Humanities (SS&H). Modeling and linking of various types of knowledge and its contexts is crucial for the successful research in the interdisciplinary field of resources and technologies related to language, culture and history.
This is the second edition of the CLaDA-BG conference. It aims to bring together NLP developers, linguists, digital humanists, scholars and all parties interested in knowledge modeling and linking data for research.
Topics of Interest
The topics include, but are not limited to, the following ones:
Problems in SS&H – research methods, technological support
Language technologies for sentiment analysis, semantic technologies, trust-worthiness of knowledge graphs, ethical challenges in digital SS&H
Knowledge Modeling and Elicitation for digital SS&H
Specific Language Resources and Technologies for historical texts, parliamentary records, speech and multimodal corpora, social media data
The role of digital libraries, archives and museums in digital SS&H research
Language Interface to Knowledge Graphs in SS&H
Knowledge-modeled and linked applications in SS&H
Best practices and new trends in Knowledge Modeling and Linking for language, culture and history
Invited Speakers
Alessandro Lenci, Università di Pisa, Italy
Erhard Hinrichs, Leibniz Institut für Deutsche Sprache Mannheim and Tübingen University, Germany
Milena Dobreva, Sofia University St Kliment Ohridski, Bulgaria
TBA
Important Dates
Submission deadline (extended): 05.03.2023
Notification of acceptance: 3.04.2023
Final Submission: 3.05.2023
Conference: 10-12.05.2023
Submissions
We welcome oral presentations or posters (optionally with demo). There are two modes of submissions: Full papers (6 to 12 pages) or extended abstracts (3-5 pages, references excluded) in PDF format, in accordance with the Springer Computer Science Proceedings (https://www.springer.com/gp/computer-science/lncs/conference-proceedings-gu…).
Please submit your full paper or extended abstract in PDF to this EasyChair link: https://easychair.org/my/conference?conf=ltdhra2023
To contact the organizers, please use the following email address: ltadh-ra(a)bultreebank.org
The CLaDA-BG Organizer
[Apologies for cross-posting]
A fully-funded position as PhD Research Fellow in Natural Language Processing is available in the Language Technology Group (LTG) in the Machine Learning Section at the Department of Informatics, University of Oslo (UiO), Norway. The 3-year position is affiliated with a new research project dubbed Peace Science Infrastructure (PSI), focusing on event extraction in the domain of armed conflicts.
For more information, please see the full announcement here:
https://www.jobbnorge.no/en/available-jobs/job/239414/phd-research-fellow-i…
The closing date is February 28th, 2023.
Please do not hesitate to contact me for any further information.
Best regards,
-erik
--
Erik Velldal
Language Technology Group
Section for Machine Learning
Department of Informatics, University of Oslo
2nd Call for Participation: DISRPT 2023 Shared Task on Discourse Relation Parsing and Treebanking
In conjunction with CODI 2023, ACL 2023 - 14 July 2023
News: the training and dev datasets have been released on our GitHub: https://github.com/disrpt/sharedtask2023. Now you can start training your systems! Surprise datasets will be released along with the test sets in about two months! Get excited!
News: please join us on the Google group disrpt2023_participants(a)googlegroups.com and on the dedicated Discord channel https://discord.gg/JDdjhXaK
This year, we are organizing DISRPT 2023 as a shared task on discourse processing across formalisms, for a variety of languages and genres. It is the third iteration of a cross-formalism shared task on discourse analysis, with three subtasks:
* Task 1: discourse segmentation
* Task 2: connective identification
* Task 3: relation classification
We will provide training, development and test datasets from all available languages in RST, SDRT, PDTB and Discourse Dependencies using a uniform format. Because different corpora, languages, and frameworks use different guidelines, the shared task will promote the design of flexible methods for dealing with various guidelines, and will help to push forward the discussion of converging standards for discourse units, discourse relations and discourse markers. For datasets which have treebanks, we will evaluate segmentation in two different scenarios: with and without gold syntax. An automatically parsed version is provided for all corpora without a gold parse.
Shared Task Data and Formats
Data for the shared task is released via GitHub together with format documentation and tools: https://github.com/disrpt/sharedtask2023
See here for more information about the previous shared tasks:
* 2019: https://sites.google.com/view/disrpt2019/shared-task
* 2021: https://sites.google.com/georgetown.edu/disrpt2021/
Tentative Schedule:
* 25 January 2023 - Sample data released
* 21 February 2023 - Train / dev data release
* 15 April 2023 - Test data release
* 8 May 2023 - Submission of system and paper
* 22 May 2023 - Notification of acceptance
* 1 June 2023 - Camera-ready paper due (This date has been modified since the 1st call)
* 14 July 2023 - CODI Workshop at ACL
Information:
Contact the organizers: disrpt_chairs(a)googlegroups.com
Official website: https://sites.google.com/georgetown.edu/disrpt2021
Google group for participants, please join us on: disrpt2023_participants(a)googlegroups.com
Discord group for participants, please join us on: https://discord.gg/JDdjhXaK
Organization:
* Amir Zeldes (Georgetown University, Washington, DC, USA)
* Janet Liu (Georgetown University, Washington, DC, USA)
* Philippe Muller (IRIT, University of Toulouse, Toulouse, France)
* Chloé Braud (IRIT, CNRS, Toulouse, France)
* Laura Rivière (IRIT, University of Toulouse, Toulouse, France)
* Attapol Te Rutherford (Faculty of Arts, Chulalongkorn University, Bangkok, Thailand)
[Apologies for cross-posting]
I am looking for a PhD student interested in studying explainable
semantic change detection.
Current computational approaches to modeling synchronic and diachronic
semantic change achieve considerable success as measured by scores in
shared tasks and the number of research papers. But they are mostly
non-transparent and obscure for historical linguists and lexicographers.
One of the reasons is that these methods lack explanatory power. This
PhD project is supposed to address this issue. The overall aim is to
transform numerical change predictions into human-readable explanations
linked to the rich linguistic tradition of semantic shift categorization.
We will define particular paths towards this aim jointly, in discussion
with the PhD student.
The position is fully-funded and linked to the Language Technology Group
(LTG) in the Machine Learning Section at the Department of Informatics,
University of Oslo (UiO), Norway. The fellowship period is 3 years,
starting no later than August 2023.
A fourth year may be considered with a workload of 25 % that may consist
of teaching, supervision duties, and/or research assistance.
For more information, please see the full announcement here:
https://www.jobbnorge.no/en/available-jobs/job/239381/phd-research-fellow-i…
The application deadline is February 28th, 2023.
Please feel free to contact me for any further information.
--
Andrey
Associate professor
Language Technology Group (LTG)
University of Oslo