*SIGHAN 2024 Shared Task for Chinese Dimensional Aspect-Based Sentiment
Analysis (dimABSA)*
The 10th SIGHAN Workshop on Chinese Language Processing (SIGHAN-10)
Location: Bangkok, Thailand (co-located with ACL 2024)
Date: Friday, August 16, 2024
Shared Task: Dimensional Aspect-Based Sentiment Analysis (dimABSA)
Task Website: https://dimabsa2024.github.io/
Registration and Submission: https://www.codabench.org/competitions/2137/
*Task Summary*
*Aspect-Based Sentiment Analysis (ABSA)* is a critical NLP research topic
that aims to identify the aspects of a given sentence and analyze the
sentiments associated with each aspect. Compared to representing affective
states as several discrete classes (i.e., polarity), the dimensional
approach that represents affective states as continuous numerical values
(called intensity) in multiple dimensions, such as valence-arousal (VA)
space, provides more fine-grained emotional information. Therefore, we
organize a *Chinese dimensional ABSA shared task (dimABSA) in the SIGHAN
2024 workshop*, providing fine-grained sentiment intensity prediction for
each extracted aspect of a restaurant review. We have three subtasks:
1) *Intensity Prediction*, 2) *Triplet Extraction*, and 3) *Quadruple
Extraction*.
Participants will be free to choose the subtasks they wish to participate
in.
More information is available online at https://dimabsa2024.github.io/
*Important Dates*
Note that all deadlines are 23:59:59 AoE (UTC-12).
- Release of training data: 1st March, 2024
- Release of test data: 20th May, 2024
- Testing results submission due: 24th May, 2024
- System description paper due: 7th June, 2024
- Notification of Acceptance: 17th June, 2024
- Camera-ready deadline: 1st July, 2024
- SIGHAN 2024 Workshop: August 16, 2024
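For participants converting these deadlines to their own time zone, here is a minimal sketch in Python. It is not part of the task materials, and the helper name `aoe_deadline_to_utc` is an illustrative assumption. It treats AoE as the fixed offset UTC-12, as stated in the note above, and converts the 24th May results deadline to UTC.

```python
from datetime import datetime, timedelta, timezone

# Anywhere on Earth (AoE) is treated as the fixed offset UTC-12.
AOE = timezone(timedelta(hours=-12), name="AoE")


def aoe_deadline_to_utc(year: int, month: int, day: int) -> datetime:
    """Return the UTC instant of 23:59:59 AoE on the given date.

    Hypothetical helper for illustration only.
    """
    local = datetime(year, month, day, 23, 59, 59, tzinfo=AOE)
    return local.astimezone(timezone.utc)


# Testing results submission due: 24th May, 2024 (23:59:59 AoE)
results_due_utc = aoe_deadline_to_utc(2024, 5, 24)
print(results_due_utc.isoformat())  # 2024-05-25T11:59:59+00:00
```

Calling `astimezone()` with a local time zone on the returned value gives the deadline in local time.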
*Organizers*
- Lung-Hao Lee, National Yang Ming Chiao Tung University
- Liang-Chih Yu, Yuan Ze University
- Suge Wang, Shanxi University
- Jian Liao, Shanxi University
***Update: call for papers extended to March 21st***
We are excited to announce the *7th Laughter and Other Non-Verbal
Vocalisations Workshop* (bit.ly/LaughterWorkshop2024) on July 16-17 at
Queen’s University Belfast. The workshop will be a pre-conference event,
part of the 2024 Conference of the International Society for Research on
Emotion (www.isre2024.org).
Non-verbal vocalisations in human-human and human-machine interactions
play important roles in displaying social and affective behaviours and
in managing the flow of interaction. Laughter, sighs, clicks, filled
pauses, and short utterances such as feedback responses are among the
non-verbal vocalisations that are being increasingly studied in
various research fields. However, much is still unknown about the
phonetic or visual characteristics of non-verbal vocalisations
(production/encoding), their relations to the social actions they are
part of, their perceived meanings (perception/decoding), and their
ordering in interaction. Furthermore, with the increased interest in
more natural human-machine interaction, current times also invite
exploring how these phenomena can be integrated in speech applications.
*Research themes* include, but are not restricted to, these aspects of
laughter and other non-verbal vocalisations:
• Articulation, acoustics, and perception
• Interaction and pragmatics
• Affective and evaluative meanings
• Social perception and organisation
• Disfluency
• Technology applications
Researchers are invited to submit *extended abstracts* (2 pages of
content and a maximum of one additional page for references and figures)
describing original work, including work in progress. The deadline for
submission is March 21st, 2024. More information about the submission
process can be found on our website (bit.ly/LaughterWorkshop2024).
There will be two *keynote presentations* on the topics treated by the
workshop, delivered by Prof. Carolyn McGettigan (University College
London, UK) and Prof. Margaret Zellers (Kiel University, Germany).
The workshop will be supported by the International Speech Communication
Association (ISCA), offering one travel grant for early career scientists.
Looking forward to receiving your contributions and welcoming you at the
workshop in July!
Best wishes,
The LW2024 Organising Committee
Bogdan Ludusan, Bielefeld University, Germany
Marina Cantarutti, University of York, United Kingdom
For further information or questions regarding the workshop you can
contact the organisers at the following email address:
LaughterWorkshop2024(a)gmail.com
The Department of Swedish, Multilingualism, Language Technology at the
University of Gothenburg is inviting applications for the position of
Professor of Language Technology. The new professor's main duties will
be to lead and develop research, education, and outreach in the field
of language technology at the department, in particular within its
Språkbanken Text group.
A detailed description of the position and the application requirements
can be found in the University of Gothenburg's job application portal, at
the link below. This detailed description is only available in Swedish,
as proficiency in Swedish or another Scandinavian language is required
for the position.
Applications must be submitted no later than 8 April 2024.
https://web103.reachmee.com/ext/I005/1035/job?site=6&lang=SE&validator=3038…
--
GERLOF BOUMA
Universitetslektor
GÖTEBORGS UNIVERSITET
Institutionen för svenska, flerspråkighet och språkteknologi
Språkbanken Text
https://spraakbanken.gu.se/om/personal/gerlof
Dear colleagues,
The Fifth Workshop on Insights from Negative Results in NLP Co-located with
NAACL, June 16-21 2024
First Call for Participation
Insights Website: <https://insights-workshop.github.io/>
Contact email: insights-workshop-organizers(a)googlegroups.com
* Overview
Publication of negative results is difficult in most fields, but in NLP the
problem is exacerbated by the near-universal focus on improvements in
benchmarks. This situation implicitly discourages hypothesis-driven
research, and it turns creation and fine-tuning of NLP models into art
rather than science. Furthermore, it increases the time, effort, and carbon
emissions spent on developing and tuning models, as the researchers have no
opportunity to learn what has already been tried and failed.
This workshop invites both practical and theoretical unexpected or negative
results that have important implications for future research, highlight
methodological issues with existing approaches, and/or point out pervasive
misunderstandings or bad practices. In particular, the most successful NLP
models currently rely on Transformer-based large language models (LLMs). To
complement all the success stories, it would be insightful to see where and
possibly why they fail. Any NLP tasks are welcome: sequence labeling,
question answering, inference, dialogue, machine translation - you name it.
A successful negative results paper would contribute one of the following:
** broadly applicable recommendations for training/fine-tuning/prompting,
especially if X that didn’t work is something that many practitioners would
think reasonable to try, and if the demonstration of X’s failure is
accompanied by some explanation/hypothesis;
** ablation studies of components in previously proposed models, showing
that their contributions are different from what was initially reported;
** datasets or probing tasks showing that previous approaches do not
generalize to other domains or language phenomena;
** trivial baselines that work suspiciously well for a given task/dataset;
** cross-lingual studies showing that a technique X is only successful for
a certain language or language family;
** experiments on (in)stability of the previously published results due to
hardware, random initializations, preprocessing pipeline components, etc;
** theoretical arguments and/or proofs for why X should not be expected to
work;
** demonstration of issues with data processing/collection/annotation
pipelines, especially if they are widely used;
** demonstration of issues with evaluation metrics (e.g. accuracy, F1 or
BLEU), which prevent their usage for fair comparison of methods;
** demonstration of issues with under-reporting of training details of
pre-trained models, including test data contamination and invalid
comparisons.
In 2024, we will invite the authors of accepted negative results papers to
nominate the specific work reporting the original positive results. The
goal is to organize joint discussion sessions, so that the community can
learn the most from the specific insightful failure.
* Important Dates
** Submission due: March 10, 2024
** Submission due for papers reviewed through ACL Rolling Review: April 7,
2024
** Notification of acceptance: April 14, 2024
** Camera-ready papers due: April 24, 2024
** Workshop: TBA, between June 21-22, 2024
* Submission
Submission is electronic, using the Softconf START conference management
system.
Submission link: <https://softconf.com/naacl2024/insights2024/>
The workshop will accept short papers (up to 4 pages, excluding
references), as well as 1-2 page non-archival abstract submissions for
papers published elsewhere (e.g. in one of the main conferences or in
non-NLP venues). The goal of this event is to stimulate a meaningful
community-wide discussion of the deep issues in NLP methodology, and the
authors of both types of submissions will be welcome to take part in our
get-togethers.
The workshop will run its own review process, and papers can be submitted
directly to the workshop by March 10, 2024. It is also possible to submit a
paper accompanied by reviews from the ACL Rolling Review system by April
7, 2024. The submission deadline for ARR papers follows the ACL RR
calendar. Both research papers and abstracts must follow the ACL two-column
format. Official style sheets:
https://github.com/acl-org/acl-style-files
Please do not modify these style files, nor should you use templates
designed for other conferences. Submissions that do not conform to the
required styles, including paper size, margin width, and font size
restrictions, will be rejected without review. Please follow the formatting
guidelines outlined here: https://acl-org.github.io/ACLPUB/formatting.html
* Multiple Submission Policy
The workshop cannot accept work for publication or presentation that will
be (or has been) published elsewhere, or that has been or will be
submitted to other meetings or publications whose review periods overlap
with that of Insights. Any questions regarding submissions can be sent to
insights-workshop-organizers(a)googlegroups.com.
If the paper has been rejected from another venue, the authors will have
the option to provide the original reviews and the author response. The new
reviewers will not have access to this information, but the organizers will
be able to take into account the fact that the paper has already been
revised and improved.
* Anonymity Period
The workshop will follow the new ACL policy:
https://www.aclweb.org/adminwiki/index.php/ACL_Anonymity_Policy
* Presentation
All accepted papers must be presented at the workshop to appear in the
proceedings. Authors of accepted papers must notify the program chairs by
the camera-ready deadline if they wish to withdraw the paper. At least one
author of each accepted paper must register for the workshop.
Previous presentations of the work (e.g. preprints on arXiv.org) should be
noted in a footnote in the camera-ready version (but not in the anonymized
version of the paper).
The workshop will take place during NAACL 2024 (June 16-21 2024). It will
be hybrid, allowing for both in-person and virtual presentations.
* Organization Committee
** Shabnam Tafreshi, inQbator AI at eviCore Healthcare
** Arjun Reddy Akula, Google Research
** João Sedoc, New York University
** Anna Rogers, IT University of Copenhagen
** Aleksandr Drozd, RIKEN
** Anna Rumshisky, University of Massachusetts Lowell / Amazon Alexa
* Contact info
Any questions regarding the workshop can be sent to
insights-workshop-organizers(a)googlegroups.com.
Please continue reading about: Authorship, Citation and Comparison, Ethics
Policy, Reproducibility, and Presentation in the call for papers page on our
website: https://insights-workshop.github.io/2024/cfp/
Regards,
Insights 2024 Organizers
--
*Shabnam Tafreshi, PhD*
*Machine Learning Senior Advisor - NLP Researcher*
*Computational Linguistics, NLP*
*inQbator AI at eviCore Healthcare*
*"All the problems of the world could be settled easily, if people only
willing to think."*
*-Thomas J. Watson*
International Conference ‘New Trends in Translation and Technology’ (NeTTT’2024)
Varna, Bulgaria, 3-6 July 2024
https://nettt-conference.com/
Submission deadline: 31st March 2024
# The conference
The second edition of the forthcoming International Conference ‘New Trends in Translation and Technology’ (NeTTT’2024) will take place in Varna, Bulgaria, 3-6 July 2024.
Continuing the tradition of the first edition of the NeTTT conference and the HiT-IT event series, the objective of the conference is (i) to bridge the gap between academia and industry in the field of translation and interpreting by bringing together academics in linguistics, translation studies, machine translation and natural language processing, developers, practitioners, language service providers and vendors who work on or are interested in different aspects of technology for translation and interpreting, and (ii) to be a distinctive event for discussing the latest developments and practices. NeTTT’2024 invites all professionals who would like to learn about the new trends, present their latest work and/or share their experience in the field, and who would like to establish business and research contacts, collaborations and new ventures.
The conference will take the form of presentations (peer-reviewed research and user presentations, keynote speeches), and posters; it will also feature panel discussions. The accepted papers will be published as open-access conference e-proceedings.
# Conference topics
Contributions are invited on any topic related to the latest technology and practices in machine translation, translation, subtitling, localisation and interpreting. NeTTT’2024 will feature a Special Theme Track "Future of Translation Technology in the Era of LLMs and Generative AI".
The conference topics include but are not limited to:
## CAT tools
- Translation Memory (TM) systems
- NLP and MT for translation memory systems
- Terminology extraction tools
- Localisation tools
## Machine Translation
- Latest developments in Neural Machine Translation
- MT for under-resourced languages
- MT with low computing resources
- Multimodal MT
- Integration of MT in TM systems
- Resources for MT
## Technologies for MT deployment
- MT evaluation techniques, metrics and evaluation results
- Human evaluations of MT output
- Evaluating MT in a real-world setting
- Quality estimation for MT
- Domain adaptation
## Translation Studies
- Corpus-based studies applied to translation
- Corpora and resources for translation
- Translationese
- Cognitive effort and eye-tracking experiments in translation
## Interpreting studies
- Corpus-based studies applied to interpreting
- Corpora and resources for interpreting
- Interpretese
- Resources for interpreting and interpreting technology applications
- Cognitive effort and eye-tracking experiments in interpreting
## Interpreting technology
- Machine interpreting
- Computer-aided interpreting
- NLP for dialogue interpreting
- Development of NLP based applications for communication in public service settings (healthcare, education, law, emergency services)
## Emerging Areas in Translation and Interpreting
- MT and translation tools for literary texts and creative texts
- MT for social media and real-time conversations
- Sign language recognition and translation
## Subtitling
- NLP and MT for subtitling
- Latest technology for subtitling
## User needs
- Analysis of translators’ and interpreters’ needs in terms of translation and interpreting technology
- User requirements for interpreting and translation tools
- Incorporating human knowledge into translation and interpreting technology
- What existing translators’ (including subtitlers’) and interpreters’ tools do not offer
- User requirements for electronic resources for translators and interpreters
- Translation and interpreting workflows in larger organisations and the tools for translation and interpreting employed
## The business of translation and interpreting
- Translation workflow and management
- Technology adoption by translators and industry
- Setting up a translation / interpreting / language provider company
## Teaching translation and interpreting
- Teaching Machine Translation
- Teaching translation technology
- Teaching interpreting technology
- Latest AI developments in the syllabi of translation and interpreting curricula
## Ethical issues in translation and technology
- Bias and fairness in MT
- Privacy and security in cloud MT systems
- Transparency and explainability of MT systems
- Environmental impact of MT systems
# Special Theme Track - Future of Translation Technology in the Era of LLMs and Generative AI
We are excited to share that NeTTT’2024 will have a special theme with the goal of stimulating discussion around Large Language Models, Generative AI and the Future of Translation and Interpreting Technology. While the new generation of Large Language Models such as ChatGPT and LLaMA showcase remarkable advancements in language generation and understanding, we find ourselves in uncharted territory when it comes to their performance on various Translation and Interpreting Technology tasks with regard to fairness, interpretability, ethics and transparency.
The theme track invites studies on how LLMs perform on Translation and Interpreting Technology tasks and applications, and what this means for the future of the field. The possible topics of discussion include (but are not limited to) the following:
- Changes in the translators’ and interpreters’ professions in the new AI era, especially as a result of the latest developments in LLMs and Generative AI
- Generative AI and translation
- Generative AI and interpreting
- Augmenting machine translation systems with generative AI
- Domain and terminology adaptation with Large Language Models
- Literary translation with Large Language Models
- Improving Machine Translation Quality with Contextual Prompts in Large Language Models
- Prompt engineering for translation
- Generative AI for professional translation
- Generative AI for professional interpreting
# Keynote speakers
We are delighted to announce the NeTTT’2024 keynote speakers:
- Helena Moniz (University of Lisbon and Unbabel), President of the European Association of Machine Translation
- Carla Parra Escartín (RWS Language Weaver)
# Tutorial (3 July 2024)
- Tharindu Ranasinghe (Aston University), Quality Estimation for Machine Translation
# Programme Committee
The Programme Committee of NeTTT’2024 is listed at https://nettt-conference.com/26844-2/.
# Conference Chairs
- Ruslan Mitkov (Lancaster University)
- Gloria Corpas Pastor (University of Malaga)
# Programme Chairs
- Constantin Orasan (University of Surrey)
- Tharindu Ranasinghe (Aston University)
# Sponsorship Chair
- Vilelmini Sosoni (Ionian University)
# Publication Chair
- Maria Kunilovskaya (University of Saarland)
# Organising Committee
- The Organising Committee of NeTTT’2024 is listed at https://nettt-conference.com/organisers/
# Submissions and publication
NeTTT’2024 invites the following types of submissions:
User papers – for industry and practitioners. References to related work are optional. Allowed paper length: between 1 and 4 pages.
Academic submissions, in three different categories (these must follow the formatting requirements; references to related work are required):
• (academic) full papers – describing original completed research. Allowed paper length: maximum 12 pages + unlimited references.
• (academic) work-in-progress papers/posters – describing work in progress, late breaking research, papers at a more conceptual stage, and other types of papers that do not fit in the ‘full’ papers category. Allowed paper length: maximum 7 pages + unlimited references.
• (academic) demo papers – describing working systems. Allowed paper length: maximum 5 pages + unlimited references. In addition to the papers, the authors will be expected to demonstrate the systems at the workshop.
The conference will not consider and evaluate abstracts only.
Each submission will be reviewed by three members of the Programme Committee. Submission is organised via the Softconf START conference management system at https://softconf.com/n/nettt2024.
For submitting the papers, we invite the authors to comply with the Springer format, following the templates:
• LaTeX: https://resource-cms.springernature.com/springer-cms/rest/v1/content/192386…,
• Overleaf: https://www.overleaf.com/latex/templates/springer-lecture-notes-in-computer…,
• Word: https://resource-cms.springernature.com/springer-cms/rest/v1/content/192387….
The accepted papers will be published in the conference e-proceedings, with an assigned ISBN and DOI, and made available online on the conference website. Authors of accepted papers will receive guidelines on how to produce camera-ready versions of their papers.
# Schedule
- Submission deadline: 31 March 2024
- Notification: 5 June 2024
- Final version due: 20 June 2024
All deadlines are 23:59 Anywhere on Earth (AoE).
# Registration
Conference registration is open at https://nettt-conference.com/fees-registration/
The promotional early registration fee has been extended to 17 March 2024.
# Venue
The conference will take place at https://www.chernomorebg.com/en/conference-centre.html, Varna, situated only 200 m away from the fine sandy Black Sea beach.
# Further information and contact details
The conference website is https://nettt-conference.com and will be updated on a regular basis. For further information, please contact us at nettt2024(a)nettt-conference.com
---
Prof Constantin Orăsan
Professor of Language and Translation Technologies
https://www.surrey.ac.uk/centre-translation-studies | https://www.surrey.ac.uk/school-literature-languages
Personal page: https://dinel.org.uk
Office: 06LC03, Phone: +44 (0) 1483 68 4115
Library and Learning Centre, University of Surrey, Guildford, Surrey, GU2 7XH, UK
We are delighted to invite you to ICNLSP 2024, the 7th edition of the International Conference on Natural Language and Speech Processing, which will be held at the University of Trento from October 19th to 20th, 2024.
The conference will be hybrid (it can be attended in-person and online).
ICNLSP 2024 offers attendees (researchers, academics, students, and industry practitioners) the opportunity to share their ideas, connect with each other, and keep up to date with ongoing research in the field.
Authors are invited to present work relevant to the conference topics, which include but are not limited to:
- Signal processing, acoustic modeling
- Speech recognition (architecture, search methods, lexical modeling, language modeling, language model adaptation, multimodal systems, applications in education and learning, zero-resource speech recognition, etc.)
- Speech analysis
- Paralinguistics in speech and language (perception of paralinguistic phenomena, analysis of speaker states and traits, etc.)
- Spoken dialog systems and conversational analysis
- Speech translation
- Speech synthesis
- Speaker verification and identification
- Language identification
- Speech coding
- Speech enhancement
- Speech intelligibility
- Speech perception
- Speech production
- Brain studies on speech
- Phonetics, phonology and prosody
- Speech and hearing disorders
- Paralinguistics of pathological speech and language
- Speech technology for disordered speech/hearing
- Cognition and natural language processing
- Machine translation
- Text categorization
- Summarization
- Sentiment analysis and opinion mining
- Computational Social Web
- Arabic dialects processing
- Under-resourced languages: tools and corpora
- Large language models
- Arabic OCR
- NLP tools for software requirements and engineering
- Knowledge fundamentals
- Knowledge management systems
- Information extraction
- Data mining and information retrieval
- Lexical semantics and knowledge representation
- Requirements engineering and NLP
- NLP for Arabic heritage documents
**PUBLICATION**
1- All accepted papers will be published in the ACL Anthology.
2- Selected papers will be published in Signals and Communication Technology (Springer) (https://www.springer.com/series/4748), indexed by Scopus and zbMATH.
**Keynote speakers**
TBA
**CONTACT**
icnlsp(at)gmail(dot)com
https://www.icnlsp.org/2024welcome/
BioLaySumm 2024
The 2nd Shared Task on the Lay Summarization of Biomedical Research
Articles @BioNLP Workshop, ACL 2024
Biomedical publications contain the latest research on prominent
health-related topics, ranging from common illnesses to global pandemics.
This can often result in their content being of interest to a wide variety
of audiences including researchers, medical professionals, journalists, and
even members of the public. However, the highly technical and specialist
language used within such articles typically makes it difficult for
non-expert audiences to understand their contents. The BioLaySumm shared
task focuses on the abstractive summarization of biomedical articles, with
an emphasis on catering to non-expert audiences through the generation of
summaries that are more readable, containing more background information
and less technical terminology (i.e., a “lay summary”).
This is the 2nd iteration of BioLaySumm, following the success of the 1st
edition of the task at BioNLP 2023 which attracted 56 submissions across 20
different teams. In this edition, which is again to be hosted by the BioNLP
workshop <https://aclweb.org/aclwiki/BioNLP_Workshop> at ACL 2024, we aim
to build on last year’s task by introducing a new test set, updating our
evaluation protocol, and encouraging participants to explore novel
approaches that will help to further advance the state-of-the-art for
Lay Summarization.
Accordingly, we will not only be offering a prize of £100 to the team with
the top-ranking submission, but we will also offer a second prize of £50 to
the team that proposes the most innovative approach (as decided upon by the
task organizers).
For more information, see:
- Main site: https://biolaysumm.org/
- CodaBench site: https://www.codabench.org/competitions/1920/
Important dates:
- First call for participation: 22nd January, 2024
- Release of task data: 22nd January, 2024
- System submission deadline: May 6th, 2024
- System papers due date: May 20th, 2024
- Notification of acceptance: June 17th, 2024
- Camera-ready system papers due: July 1st, 2024
- BioNLP Workshop Date: August 16th, 2024
Organizers:
- Tomas Goldsack, University of Sheffield, UK
- Matthew Shardlow, Manchester Metropolitan University, UK
- Carolina Scarton, University of Sheffield, UK
- Chenghua Lin, University of Manchester, UK
*Apologies for Cross Posting*
Calling all NLP, Digital Humanities and media analysis enthusiasts!
Participate in the "Framing the Israel War on Gaza" (FIGNEWS) shared task
and play a pivotal role in shaping media narrative research. Engage
in creating guidelines, annotating a diverse multilingual corpus, and
pushing the boundaries of NLP!
*Task Website and Registration:*
https://sites.google.com/view/fignews
*🎯 Task Highlights:*
*Guidelines Creation*: Craft comprehensive annotation guidelines and set a
benchmark in NLP research.
*Annotation*: Dive into annotating news articles in Arabic, Hebrew, Hindi,
French, and English, uncovering biases and enhancing our understanding of
media narratives. The teams will be asked to annotate a minimum of 2,000
posts. Teams will ideally have 4 members with a minimum of 2 members to
participate.
*Two Sub-Tasks*: The shared task aims to serve as a collaborative platform
where participants propose guidelines and diverse methods for annotating
and analyzing the dataset.
There will be two subtasks of focus:
*Sub-Task 1: Bias Annotation*
*Sub-Task 2: Propaganda Annotation*
*🏆 Tracks & Awards:*
*Quantity Track:* Be the team with the most annotated data batches and win
the quantity track award.
*Quality Track*: Excel in the accuracy and consistency of your annotations
to dominate the quality track.
*Guidelines Track*: Develop innovative and effective guidelines to be
recognized in the guidelines track.
*📅 Deadlines:*
*Registration closes: March 31, 2024*
*Submission deadline (Annotation and Guidelines): April 30, 2024*
*Paper submissions: May 10, 2024*
🌍 Be part of this significant event, co-located with the prestigious
ArabicNLP 2024 conference and ACL 2024 in Thailand. Enhance your skills,
contribute to vital research, and network with global experts in NLP and
media studies.
Join us to make an impact and advance the field of NLP! #NLP #MediaAnalysis
#ArabicNLP2024
Call for Submissions — SIGIR 2024 Workshop on Information Retrieval for Climate Impact
Climate change is a far-reaching, global phenomenon that will impact many aspects of our society. The evidence base for observed climate impacts is expanding, and the wider climate literature is growing exponentially. How can effective access be provided to the growing body of peer-reviewed literature on climate change impact?
Purpose
The emphasis will be on discussion, not a mini-conference but a dynamic sharing of ideas. The workshop will be organized along four areas of interest: (i) Information needs in climate impact; (ii) Search and analysis of formal literature for climate impact; (iii) Search and analysis of informal publications for climate impact; and (iv) Resources to support IR for climate impact. During the workshop, we will work towards creating actionable technical research agendas for each of them.
Call for contributions
To help shape a research agenda for information retrieval for climate impact, we welcome technical contributions and position papers as extended abstracts (2-4 pages) on a wide range of topics related to information retrieval for climate impact, including but not limited to very large-scale systematic reviews, climate language models, geolocated literature with climate information, and evidence synthesis.
Important dates
- April 25, 2024: Extended abstracts due
- May 23, 2024: Notifications
- July 18, 2024: Workshop at SIGIR 2024
- December 1, 2024: Submission of the Information Retrieval for Climate Impact Agenda for publication in SIGIR Forum
How to submit
Extended abstracts submitted to the workshop should be in English, in PDF, and formatted using the standard ACM sigconf format (using \documentclass[sigconf, natbib=true, anonymous=false]{acmart}). The review process is single-blind. The workshop uses EasyChair to handle submissions: https://easychair.org/conferences/?conf=manila24. No official proceedings will be published.
Organization
Bart van den Hurk (IPCC), Maarten de Rijke (U. Amsterdam), Flora Salim (UNSW, Sydney)
Workshop site
https://sites.google.com/view/ir-for-climate-impact/home
Contact
manila24(a)easychair.org
--
Maarten de Rijke
Distinguished University Professor AI & IR
University of Amsterdam
http://staff.fnwi.uva.nl/m.derijke
Event Notification Type: Call for Participation
Website: <https://sites.google.com/view/autextification>
https://sites.google.com/view/iberautextification
We kindly invite you to participate in the IberLEF 2024 shared task -
Iber AuTexTification
Automated Text Identification on Languages of the Iberian Peninsula
This shared task will take place as part of IberLEF 2024
<https://sites.google.com/view/iberlef-2024/tasks>, the 6th Workshop on
Iberian Languages Evaluation Forum at the SEPLN 2024 Conference, which will
be held in Valladolid, Spain on the 26th of September, 2024.
This is the second edition of the AuTexTification shared task, first held at
IberLEF 2023 (Sarvazyan et al., 2023). We extend our previous task in three
dimensions: more models, more domains and more languages from the Iberian
Peninsula (in a multilingual fashion), aiming to build more generalizable
detectors and attributors. In this task, participants must develop models
that exploit clues about linguistic form and meaning to identify
automatically generated texts from a wide variety of models, domains, and
languages. We plan to include LLMs such as GPT-3.5, GPT-4, LLaMA, Coral,
Command, Falcon, and MPT, among others, add new domains such as essays and
dialogues, and cover the most prominent languages of the Iberian Peninsula:
Spanish, Catalan, Basque, Galician, Portuguese, and English (in Gibraltar). We
propose two different subtasks:
- Subtask 1 (Human or Generated): Participants will be provided a text,
  and they will have to determine whether the text has been automatically
  generated or not. We encourage participants to develop models that
  generalize to new LLMs, writing styles, and domains.
- Subtask 2 (Model Attribution): Participants will be provided an
  automatically generated text, and they will have to determine what LLM
  generated it.
The novelty of this edition is to detect, in a multilingual (languages of
the Iberian Peninsula such as Spanish, English, Catalan, Galician, Basque,
and Portuguese), multi-domain (news, reviews, essays, dialogues, Wikipedia,
wikiHow, tweets, emails, etc.), and multi-model (GPT, LLaMA, Mistral,
Cohere, Anthropic, MPT, Falcon, etc.) setup, whether a text has been
automatically generated and, if so, to identify the model that
generated it. The datasets of this edition are built using TextMachina
<https://github.com/Genaios/TextMachina>, a Python framework that aids the
creation of high-quality, unbiased datasets to build robust models for
MGT-related tasks such as detection, attribution, boundary, and mix-case
detection.
To foster engagement and reward dedication, we will award the best
participant in each subtask with 500€ sponsored by Genaios
<https://genaios.ai/>.
Important Links
- Task Website <https://sites.google.com/view/iberautextification>
- GitHub Repository <https://github.com/Genaios/IberAuTexTification>
- Slack workspace
  <https://join.slack.com/t/iberautextification/shared_invite/zt-2c28ezgwy-lHH…>
- Google Groups <https://groups.google.com/g/iberautextification>
- Registration <https://sites.google.com/view/iberautextification/data>
Important Dates
- March 22, 2024: Release of training data
- April 21, 2024: Release of test data
- May 10, 2024: Participant system results submission
- May 17, 2024: Results notification
- June 3, 2024: Paper submission
- June 16, 2024: Paper peer-reviewed
- July 4, 2024: Camera-ready paper version
Task organizers
- José Ángel González (Genaios <https://genaios.ai/>) Contact Email:
  jose.gonzalez(a)genaios.ai
- Areg Sarvazyan (Genaios <https://genaios.ai/>) Contact Email:
  areg.sarvazyan(a)genaios.ai
- Marc Franco-Salvador (Genaios <https://genaios.ai/>)
- Francisco Rangel (Genaios <https://genaios.ai/>)
- Paolo Rosso (Universitat Politècnica de València <https://www.upv.es/>)
Please reach out to the organizers <organizers.autextification(a)gmail.com>
or join the Slack
<https://join.slack.com/t/iberautextification/shared_invite/zt-2c28ezgwy-lHH…>
workspace to connect with the other participants and organizers.