Dear readers of corpora list,
I am looking for two PhD students / PostDocs excited to work on computational semantics and discourse processing (both applications with a computer science focus and with a corpus linguistics focus are welcome) to join my young research group at the University of Augsburg (near Munich) in Germany. The positions are paid full-time positions and not tied to any specific project. More information on the research environment can be found here: https://hlt-augsburg.github.io/
If you are interested, please contact me directly. Professors & teachers of CL: I would be highly grateful if you would forward this job ad to students who are potentially interested! Many thanks!
Here’s the official job ad:
https://www.uni-augsburg.de/en/jobs-und-karriere/stellenangebote/2024/09/10…
--
Mit freundlichen Grüßen / Best Regards
Prof. Dr. Annemarie Friedrich
Natural Language Understanding with Applications to DH
Fakultät für Angewandte Informatik, Universität Augsburg
https://annefried.github.io
Last Call for Main Conference Papers (COLING 2025)
Important Dates
All deadlines are 11:59 PM UTC-12:00 (“anywhere on Earth”).
Deadline for direct submissions September 16, 2024
Commitment deadline for ARR papers October 20, 2024
Author rebuttal phase (for direct submissions) October 30 - November 1, 2024
Notification of acceptance for COLING 2025 November 29, 2024
Tutorials and Workshops January 19-20, 2025
Main Conference January 21-24, 2025
Website: https://coling2025.org/calls/main_conference_papers/
---------- CFP:
The 31st International Conference on Computational Linguistics (COLING 2025) will take place in Abu Dhabi, UAE, January 19-24 2025. COLING 2025 invites the submission of long and short papers featuring substantial, original, and unpublished research in all aspects of Computational Linguistics and Natural Language Processing.
Relevant topics include, but are not limited to, the following areas:
Dialogue and Interactive Systems
Discourse and Pragmatics
Document Classification and Topic Modeling
Ethics, Bias, and Fairness
Information Extraction
Information Retrieval and Text Mining
Interpretability and Analysis of Models for NLP
Language Modeling
Language Resources and Evaluation
Linguistic Insights Derived using Computational Techniques
Linguistic Theories, Cognitive Modeling and Psycholinguistics
Low-Resource and Efficient Methods for NLP
Machine Learning for Computational Linguistics and NLP
Machine Translation and Translation Aids
Multilingualism and Language Diversity
Multimodal and Grounded Language Acquisition
NLP and LLM Applications (such as Education, Healthcare, Finance, Legal NLP, Computational Social Science, etc.)
Natural Language Generation
Offensive Speech Detection and Analysis
Phonology, Morphology and Word Segmentation
Question Answering
Lexical Semantics
Sentence-level Semantics (Textual Inference, Paraphrasing, etc)
Sentiment Analysis, Stylistic Analysis, Opinion and Argument Mining
Speech Recognition and Synthesis, and Spoken Language Understanding
Summarization and Simplification
Syntactic analysis (Tagging, Chunking, Parsing)
Vision and Robotics
Papers targeting any of these topics from the perspective of the Sustainability Goals of the UN are especially welcome.
Submission Details
COLING 2025 invites the submission of long papers of up to eight pages and short papers of up to four pages. These page limits only apply to the main body of the paper. At the end of the paper (after the conclusions but before the references) papers need to include a mandatory section discussing the limitations of the work and, optionally, a section discussing ethical considerations. Papers can include unlimited pages of references and an unlimited appendix. Authors should follow the general instructions for COLING 2025 proceedings, which are an adaptation of the general instructions for *ACL proceedings.
To prepare your submission, please make sure to use the COLING 2025 style files available here:
LaTeX
Word
Overleaf
Papers deviating from the provided style files will be rejected without review.
COLING 2025 adopts the ACL Ethics Policy.
There are two routes for paper submission:
Direct submission
Papers should be submitted through Softconf/START using the following link: https://softconf.com/coling2025/papers/
Each paper will receive a minimum of three reviews. Authors will have the opportunity to provide a short rebuttal to clarify any misunderstandings. The review process will be double-blind. Reviewers will not see authors, authors will not see reviewers. Reviews and submissions will not be made publicly visible.
ACL Rolling Review (ARR) Papers
Papers which have already been reviewed through the ACL Rolling Review (ARR) system can be committed to COLING 2025. These papers will not be re-reviewed. Senior Area Chairs and Program Chairs will make acceptance decisions based on the ARR reviews and meta-reviews.
Optional Supplementary Materials: Appendices, Software and Data
Each COLING 2025 submission can be accompanied by a single .tgz or .zip archive containing supplementary materials, such as program code and datasets. COLING 2025 encourages the submission of such supplementary materials to improve the reproducibility of results. For the main track, the supplementary materials need to be fully anonymized to preserve the double-blind reviewing policy.
Additional information, such as preprocessing decisions, model parameters or proofs should be put into the appendix of the main PDF submission. Note that submissions need to remain fully self-contained. In particular, any details that are important for reviewers to assess the technical correctness of the work should be included in the main body of the paper.
Anonymity Period
COLING 2025 will follow the ACL Anonymity Policy. As a result, no anonymity period will be required, although authors are still cautioned against extensive advertising. The submissions themselves must still be fully anonymized.
Multiple Submission Policy
Papers which are submitted to COLING 2025 cannot be under review for other conferences or journals at the same time. The commitment process is treated as being under review for a conference. Authors can either commit their paper through ARR or directly submit it to the conference. Papers reviewed and committed to the conference through ARR cannot be submitted directly to the conference. In addition, we will not consider any paper that overlaps significantly in content or results with papers that will be (or have been) published elsewhere. Submissions that violate these requirements will be desk rejected.
General chairs,
Owen Rambow, Stony Brook University
Leo Wanner, ICREA, Pompeu Fabra University
Program co-chairs
Marianna Apidianaki, University of Pennsylvania
Hend Al-Khalifa, King Saud University
Barbara Di Eugenio, University of Illinois Chicago
Steven Schockaert, Cardiff University
For questions about submissions: coling2025-programchairs(a)googlegroups.com
PhD Studentships at the University of Exeter
We’re currently advertising 9 PhD studentships across the Exeter Biomedical Research Centre, including for projects involving AI/NLP for healthcare:
Using natural language processing to understand and enhance therapeutic mechanisms in digital psychological therapy <https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.exete…>
Unlocking the Power of UK Hospital Data: Leveraging Machine Learning and Natural Language Processing for High-Fidelity Clinical Research <https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.exete…>
For more details:
Website – link here <https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.exete…>
LinkedIn – link here <https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.linke…>
‘X’ – link here <https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fx.com%2FE…>
Fees and funding
For eligible students, the studentship will cover Home tuition fees plus an annual tax-free stipend of at least £19,237 (in alignment with standard Research Council UK rate) for 3 years full-time, in addition to a Research Training and Support Grant (RTSG). *Students who pay international tuition fees are eligible to apply, but should note that the award will only provide payment for part of the international tuition fee and no stipend. International applicants need to be aware that you will have to cover the cost of your student visa, healthcare surcharge and other costs of moving to the UK to do a PhD. The conditions for eligibility of home fees status are complex and you will need to seek advice if you have moved to or from the UK (or Republic of Ireland) within the past 3 years or have applied for settled status under the EU Settlement Scheme.
Timeline
The closing date for applications is midnight on Tuesday 17th September 2024. Interview panels are anticipated to be held virtually week commencing the 30th September 2024. Expected start dates are between 1st November 2024 and 6th January 2025.
Contact details
If you would like to discuss the studentships further, please contact the primary supervisor as stated in the advert. If you have any queries surrounding the application process or the BRC more generally, please contact Dr Sophie Gould (NIHR Exeter BRC Training and Events Manager) at S.L.Gould(a)exeter.ac.uk <mailto:S.L.Gould@exeter.ac.uk>.
Entry requirements
Applicants for this studentship must have obtained, or be about to obtain, a First or Upper Second Class UK Honours degree, or the equivalent qualifications gained outside the UK, in an appropriate subject area.
If English is not your first language you will need to meet the required level as per our guidance at https://www.exeter.ac.uk/pg-research/apply/english/
Our Equality, Diversity, and Inclusion Commitment
The NIHR Exeter Biomedical Research Centre (BRC) and Clinical Research Facility (CRF) strongly adhere to Equality, Diversity and Inclusivity (EDI) principles. They share a fundamental objective to empower better health outcomes for all patients and the public by translating scientific breakthroughs into potential new treatments, diagnostics and medical technologies.
We are committed to ensuring that the consideration of EDI is second nature to all members of our experimental medicine and translational research community, fostering a fully inclusive environment where everyone feels supported, valued, and is provided the opportunity to reach their full potential.
Our strategy purposefully shares overarching EDI visions with those of the NIHR, UoE and NHS Trust partners to allow for collaborative working to reach our mutual goals. Whilst all applicants will be judged on merit alone, we particularly welcome applications from groups currently underserved within our working community.
Summary
Application deadline: 17th September 2024
Value: For eligible students, the studentship will cover Home tuition fees plus an annual tax-free stipend of at least £19,237 (in alignment with standard Research Council UK rate) for 3 years full-time, in addition to a Research Training and Support Grant (RTS
Duration of award: per year
Contact: PGR Admissions pgrapplicants(a)exeter.ac.uk <mailto:pgrapplicants@exeter.ac.uk>
----------------------------------------------------
Aline Villavicencio <https://sites.google.com/view/alinev> (she/her)
Professor in Natural Language Processing
Director of the Institute for Data Science and Artificial Intelligence <https://www.exeter.ac.uk/research/institutes/idsai/>
University of Exeter (UK)
Dear Corpora-list,
We are advertising a post-doctoral position in ML/XAI : 18 month at IMT
Mines Alès (south of France), or IMT Business School, Evry (near Paris)
Last call for candidates, closing application date 20/09/2024.
Subject: Evaluation of the impact of XAI techniques on Human-Machine
collaboration
Context: ENFIELD project, Horizon-funded European AI Network of
Excellence on adaptive, sustainable, human-centered and trustworthy AI.
Objectives :
Evaluate the impact of XAI methods on Human-Machine collaboration
through the study of :
Performance of the human operator in performing a task, in different
contexts: alone, with the help of a predictive model for which decisions
will be explained/not explained, with the help of an XAI technique,
Types of human-machine collaboration (e.g. delegation, substitution,
mediation), Potential biases induced by XAI techniques.
A focus will be made on specific contexts of study (e.g., image
classification or NLP tasks, XAI techniques based on local
interpretability using attribution methods).
You will contribute to:
Defining the study contexts (e.g. games, image classification) and test
protocols to be considered.
Selecting and implementing predictive models and XAI techniques.
Set up the tools needed to carry out the experiments covered by the
study protocols, e.g. development of simple games, decision interfaces.
Implement the above-mentioned protocols on cohorts of human operators.
Evaluate and promote the results obtained.
Deadline for applications: 20/09/2024
Desired start date: 01/11/2024
Application and additional info:
https://institutminestelecom.recruitee.com/o/post-doctorant-post-doctorante…
Contacts :
Sébastien Harispe, Associate Professor
sebastien.harispe(a)mines-ales.fr
Nicolas Soulié, Associate Professor
nicolas.soulie(a)imt-bs.eu
Best regards,
Andon Tchechmedjiev
--
Andon Tchechmedjiev, PhD. Associate Professor of Artificial Intelligence
and Computer Engineering at EuroMov Digital Health in Motion, IMT Mines
Alès. Taxonomy and Semantics of Movement (SemTaxM) co-lead, Learning and
Complexity group member. Research expertise: Deep Learning, Knowledge
Engineering, Computational Linguistics and Semantics, Biomedical
Informatics, Neuroengineering and Human Movement Processing
Second call for abstracts, UniDive 3rd general meeting, HUN-REN Hungarian
Research Centre for Linguistics, Hungary, Budapest, 29-30. January 2025
*UniDive <https://unidive.lisn.upsaclay.fr/>*is a COST action, i.e. a
scientific network, dedicated to universality, diversity and idiosyncrasy
in language technology. It is structured around 4 Working Groups:
- WG1: Corpus annotation
- WG2: Lexicon-corpus interface
- WG3: Multilingual and cross-lingual language technology
- WG4: Quantifying and promoting diversity
The *third general meeting
<https://unidive.lisn.upsaclay.fr/doku.php?id=meetings:general_meetings:2rd_…>*
of
the action will take place on January 29-30 and will be preceded by a WG2
meeting on 28 January 2025 at the HUN-REN Hungarian Research Centre for
Linguistics in Budapest. We invite UniDive WG members to submit abstract
proposals related to the scientific program of the WGs.
The main venues will be in *Benczúr Hotel <https://www.hotelbenczur.hu/>*,
Budapest, but some sessions will take place at the *Hungarian Research
Centre for Linguistics <https://nytud.hu/en%7CHUN-REN>*, Budapest, Hungary.
To know more about the posters, see the *call
<https://unidive.lisn.upsaclay.fr/doku.php?id=meetings:general_meetings:3rd_…>*
.
Proposals may describe diverse types of contributions, according to 3
different tracks:
- Planned work
- Work in progress
- Complete work, also previously published
A proposal should be anonymous, written in English and submitted in pdf only.
It should include (on the title page) the list of the relevant WGs. It
should not exceed 2 pages, including figures and tables (bibliographic
references may go beyond the 2-page limit). If linguistic examples from
languages other than English are included, those should be glossed and
translated into English, and an extra half page is allowed for this
purpose.
For the sake of uniformity and easing the reviewers’ effort, we encourage
authors to use the *Overleaf Latex template
<https://www.overleaf.com/read/yqbpxcbjmjjw>*. Other formats (not
necessarily Latex-based) can also be used, provided that they conform to
the following specifications: A4 paper, 11pt font, 1in margins. The
submission link will be announced soon.
*The submission link
is https://openreview.net/group?id=UniDive/2025/General_Meeting
<https://openreview.net/group?id=UniDive/2025/General_Meeting> *
The reviewing process is double-blind. The selection of proposals will be
done by UniDive Program Committee according to the following criteria:
- relevance to UniDive and the work program of its Working Groups (see
pp. 18-20 of the Memorandum of Understanding),
- clarity
- diversity of the languages covered by the workshop program
The selected proposals will be presented at the 3rd UniDive general meeting
as posters and/or oral presentations.
At least one author per selected proposal will be reimbursed for their
travel and stay.
Important dates
- 26 July 2024: Call for abstracts
- *30 September 2024: Submission deadline*
- 21 October 2024: Notification of acceptance
- 26 October 2024: Communication of the names of the presenters
- 09 November 2024: Final versions of abstracts
- 28 January 2025: WG2 meeting
- 29-30 January 2025: UniDive 3rd general meeting
The time zone for all deadlines is anywhere on Earth (UTC-12). Due to the
tight schedule, there's no further submission deadline extension.
Best regards
Program Chairs
- Olha Kanishcheva, SET University (Ukraine) and Friedrich Schiller
University Jena (Germany)
- Veronika Lipp, HUN-REN Hungarian Research Centre for Linguistics
(Hungary)
- Ranka Stanković, University of Belgrade (Serbia)
Dear Colleagues,
We're delighted to announce that the CfP for the 22nd Annual Workshop of the Australasian Language Technology Association - ALTA 2024 - is now open and closes on 20th September (23:59hrs Anywhere on Earth UTC -12)
Details are available on our website at https://alta2024.alta.asn.au/calls/papers and a summary follows.
---
Important Dates
* Submission deadline for short/long papers, presentation abstracts and industry demonstrations:
20 September 2024 (23:59 Anywhere On Earth UTC-12).
* Main conference: 3 December and 4 December 2024, ANU, Canberra, ACT, hybrid (in person and online)
Overview
The 22nd Annual Workshop of the Australasian Language Technology Association (ALTA) will be held in a hybrid format at the Australian National University, Canberra, from 2 December to 4 December 2024 and also online.
The ALTA 2024 workshop is the key local forum for socialising research results in Natural Language Processing (NLP) and Computational Linguistics (CL). It will feature presentations, posters, and demonstrations from students, industry, and academic researchers. Like previous years, we also encourage submissions and participation from industry and government researchers and developers. Note that ALTA is listed in the CORE 2023 Conference Rankings as Australasian C<https://www.core.edu.au/conference-portal>.
Topics
ALTA invites the submission of papers and presentations on all aspects of NLP and CL, including, but not limited to:
* Commonsense Reasoning.
* Computational Social Science and Cultural Analytics.
* Dialogue and Interactive Systems.
* Discourse and Pragmatics.
* Efficient Methods for NLP.
* Ethics in NLP.
* Information Extraction.
* Information Retrieval and Text Mining.
* Interpretability, Interactivity and Analysis of Models for NLP.
* Language Grounding to Vision, Robotics and Beyond.
* Language Modeling and Analysis of Language Models.
* Linguistic Theories, Cognitive Modeling and Psycholinguistics.
* Machine Learning for NLP.
* Machine Translation.
* Multilinguality and Linguistic Diversity.
* Natural Language Generation.
* NLP Applications.
* Phonology, Morphology and Word Segmentation.
* Question Answering.
* Resources and Evaluation.
* Semantics: Lexical, Sentence level, Document Level, Textual Inference, etc.
* Sentiment Analysis, Stylistic Analysis, and Argument Mining.
* Speech and Multimodality.
* Summarisation.
* Syntax, Parsing and their Applications.
We particularly encourage submissions that broaden the scope of our community by considering practical applications of language technology and multidisciplinary research. We also specifically encourage submissions from the industry.
Format and instructions for authors
Please refer to our CfP webpage for specifics.<https://alta2024.alta.asn.au/calls/papers>
We are using OpenReview for submissions, and invite submissions of three different formats: (1) Original Research Papers, (2) Abstract-based Presentations, and (3) Industry Demonstrations.
---
You can follow ALTA on social media at the following links:
*
LinkedIn (page): https://www.linkedin.com/company/australasian-language-technology-associati…
*
LinkedIn (group):https://www.linkedin.com/groups/1849979/
*
Twitter: https://twitter.com/altanlp
*
Mastodon: https://sigmoid.social/@ALTAnlp
*
Hashtag is #ALTA2024
With kind regards, on behalf of the ALTA 2024 Team:
Dr Gabriela Ferraro, General Chair
Professor Tim Baldwin, Program Chair
Dr Sergio José Rodríguez Méndez, Program Chair
Dr Nicholas Kuo, Program Chair
Dr Anton Malko, Publication Chair
Dr Dawei Chen, Technology Chair
A/Prof Shunichi Ishihara, Finance Chair
Charbel El-Khaissi, PhD candidate, Sponsorship Chair
Ned Cooper, PhD candidate, Local Chair
Kathy Reid, PhD candidate, Publicity Chair
Dear corpus and computational linguists,
We invite applications for the Adam Kilgarriff Prize [image: 🏆] for
outstanding works in corpus linguistics, computational linguistics, and
lexicography.
Apply by 30th September 2024. The Prize will be awarded at the eLex
Conference 2025.
For more information, visit https://kilgarriff.co.uk/prize/
<https://kilgarriff.co.uk/prize/?fbclid=IwZXh0bgNhZW0CMTAAAR0cqURwXdon0Ksd1L…>
On behalf of the Board of Trustees,
Michal Cukr
[Apologies for cross posting]
I am currently recruiting two four-year PhD researchers in Computational
Linguistics for the NSF funded project "ReDDDoT Phase 2: Inclusive
American language technologies" under my supervision.
The two areas of research are:
1) Data governance issues in voice collection collaborations between
non-Indigenous researchers and Indigenous communities.
2) Technical and social issues in collecting data for English--Spanish
translanguaging, codeswitching and code mixing.
The positions are available in joint collaboration between Indiana
University and the Mozilla Foundation. Researchers will be based in the
Department of Linguistics at Indiana University in Bloomington, Indiana
and work in collaboration with Mozilla Common Voice.
Appointments are 50% FTE (20 hours/week) and include:
- a stipend of $23k/year
- health insurance
Application deadline:
- International: 1st December, 2024
- Domestic: 2nd January, 2025
General requirements:
- https://linguistics.indiana.edu/graduate/how-to-apply.html
For further details please feel free to contact me on ftyers(a)iu.edu.
Francis M. Tyers
Associate Professor
Linguistics
Indiana University, Bloomington
--------Apologies for cross posting-----
Dear Colleagues,
We are excited to announce *Financial Causality Detection (FinCausal 2025)
shared task* to be organized in the upcoming the *6th Financial Narrative
Processing (FNP)* <https://www.lllf.uam.es/wordpress/fincausal-25/>held in
conjunction with COLING-2025 in Abu Dhabi, UAE, on January 19-20, 2025.
For the 6th FNP shared task, we will continue working in two languages (
*English*, *Spanish*) and we will introduce a new FinCausal Shared Task:
Financial Causality Detection (FinCausal 2025) on hybrid question answering.
The objective is to detect causal effects in financial disclosures in
English and Spanish. The dataset is a combination of extractive and
generative QA. Questions will be formulated abstractedly, while answers
will be extractive. In certain segments, questions will either focus on
causes or effects, and the answers will be directly extracted from the
text. The evaluation metric for responses will encompass both exact
matching and semantic similarity.
The shared tasks attracted more than 150 participants within the last 3 FNP
editions, for this year we are expanding the dataset by making the tasks
more challenging.
→ *2 new datasets* will be developed, one in English and one in Spanish
Access the CodaLab by clicking here
<https://codalab.lisn.upsaclay.fr/competitions/19936>
*Key Dates*
First CFP: 15 July 2024
Second CFP: 15 August 2024
(FinCausal) Practice set release: *2 September 2024*
(FinCausal) Training set release: *15 September 2024*
(FinCausal) Blind test set release: *30 October 2024*
(FinCausal) Systems submission: *7 November 2024*
(FinCausal) Release of results: *12 November 2024*
(FinCausal and general) Paper Submission Deadline: *25 November 2024*
Notifications of Acceptance: *5 December 2024*
Camera-ready Paper Deadline: *13 December 2024*
As in previous editions, FNP 2025 will be on-site and in-person (there will
be NO option for it to be virtual or hybrid).
More info: https://www.lllf.uam.es/wordpress/fincausal-25/fnp-2025/
Best Regards,
Paloma Martínez
Full professor
Human Language and Accessibility Technologies Group (hulat.inf.uc3m.es
<http://labda.inf.uc3m.es>)
Computer Science Department
Universidad Carlos III de Madrid
@Grupo_HULAT
We're happy to announce the release of *MessIRve*, a new *large-scale IR
dataset in Spanish!*
MessIRve* contains around *730k queries from 20 Spanish-speaking
countries* *and
the United States*, with relevant documents sourced from Wikipedia.
MessIRve's queries reflect diverse Spanish-speaking regions, unlike other
datasets that are translated from English or do not consider dialectal
variations. The large size of the dataset allows it to cover a wide variety
of topics, unlike smaller datasets.
The dataset is available in *HuggingFace*! 🤗
- Queries and relevance judgments: spanish-ir/messirve
<https://huggingface.co/datasets/spanish-ir/messirve>
- The collection of documents: spanish-ir/eswiki_20240401_corpus
<https://huggingface.co/datasets/spanish-ir/eswiki_20240401_corpus>
- Queries and qrels in TREC format: spanish-ir/messirve-trec
<https://huggingface.co/datasets/spanish-ir/messirve-trec>
For more details, check out our *arXiv paper*: MessIRve: A Large-Scale
Spanish Information Retrieval Dataset <http://arxiv.org/abs/2409.05994>
We hope MessIRve serves to spur more work in IR for the Spanish language
and facilitate the development of efficient information access tools for
Spanish speakers.
* MessIRve means *works** for **me* in Spanish ("me sirve"). The reference
to Lionel Messi, player of the most popular sport in Spanish-speaking
countries, football, stresses the importance of using topics that are
relevant to Spanish speakers.