Learning with Small Data
September 11-12, 2023
CLASP, University of Gothenburg
Come visit us in Gothenburg!
Great topic, free registration, proceedings published in the ACL anthology,
amazing keynote speakers:
Aurélie Herbelot -- University of Trento
Tal Linzen -- NYU & Google
Danielle Matthews -- University of Sheffield
https://sites.google.com/view/learning-with-small-data/home
There is now an acute need for intensive research on the possibility of
effective learning with small data. Our 2023 conference, LSD, is devoted to
work on this problem, with application to computational linguistics.
Learning with Small Data will bring together researchers from various areas
to discuss the sustainability of current state of the art methods in
computational linguistics which rely on very large models, such as GPT2-3,
BERT, and XLNet. The conference encourages contributions from machine
learning, computational linguistics, theoretical linguistics, philosophy,
cognitive science, and psycholinguistics, as well as from artificial
intelligence ethics and social policy. We hope to see innovative technical
proposals, and we will cultivate a wide spectrum of views within a lively
dialog on the issues that the conference addresses.
The workshop is organized by CLASP, University of Gothenburg.
https://gu-clasp.github.io/
## Important Dates:
Submission deadline: 2023 May 5, anywhere on Earth
Notification of acceptance: 2023 June 12, anywhere on Earth
Camera ready: 2023 August 14, anywhere on Earth
Conference: 2023 September 11-12, not anywhere on Earth, but in Gothenburg
Additional details in our website:
https://sites.google.com/view/learning-with-small-data/home
Training set released!
SECOND CALL FOR PARTICIPATION
IberLEF 2023 Task - PoliticEs: Political ideology detection in Spanish texts
Held as part of the evaluation forum IberLEF 2023
<https://sites.google.com/view/iberlef-2023> in the XXXIX edition of the
International Conference of the Spanish Society for Natural Language
Processing (SEPLN 2023 <http://sepln2023.sepln.org/en/home/>)
September 26, 2023. Jaén, Andalusia, Spain
Codalab link: https://codalab.lisn.upsaclay.fr/competitions/10173
Dear All,
We are inviting researchers and students to participate in the
shared-task PoliticEs
2023: Political ideology detection in Spanish texts, held as part of IberLEF
2023, the shared evaluation campaign for Natural Language Processing
systems in Spanish and other Iberian languages, collocated with SEPLN 2023
Conference.
The goal of this task is to extract political ideology information from
Spanish texts. For this, an automatic document classification task on
clusters of texts is proposed. It consists of extracting the self-assigned
gender and profession as demographic traits, and the political ideology as
a psychographic trait from a set of texts written in Spanish from several
authors that share those traits. Political ideology is considered as a
binary and as a multiclass problem. The PoliticES 2023 shared task is based
on a previous task named PoliticES 2022 presented at IberLEF2022
(García-Díaz et. al. 2022b) where the dataset was an extension of the
PoliCorpus 2020 dataset (García-Díaz et al., 2022a). The novelty of this
year is that participants will work with clusters of texts written by
different users, but with the same traits, instead of profiling users to
prevent legal and ethical issues.
The participants will be provided development, development_test, training
and test datasets in Spanish from an extension of the PoliCorpus 2020
(García-Díaz et al., 2022) and the corpus used for the PoliticES 2022
shared task (García-Díaz et. al. 2022b). The dataset was collected between
2020 and 2022 from the Twitter accounts of politicians, political
journalists and celebrities in Spain using the UMUCorpusClassifier
(García-Díaz et al., 2020). We automatically created clusters of texts
mixing some of these extracted tweets in order to prevent ethical and
privacy issues about author profiling in Twitter. Each cluster is composed
of 80 tweets written by different users that share all the traits under
evaluation. We labeled each cluster with the self-assigned gender (male,
female), profession (politician, celebrity, journalist) and political
spectrum on two axes: binary (left, right) and multiclass (left,
moderate_left, moderate_right, right). Moreover, the Twitter mentions of
the politicians were anonymised by replacing them with the token @user. In
addition, other Twitter accounts mentions were also encoded as @user. Other
entities, such as political party references, are also replaced with the
@political_party token. Consequently, the text traits cannot be guessed
trivially by reading the user's name and searching information on them on
the Internet. The dataset is composed of approximately 2800 different
clusters.
Finally, in order to facilitate participation in the competition, a
notebook with two baselines will be provided. The first one will be based
on BoW and the second one will be based on Transformers. To download the
data, the notebook and participate, go to
https://codalab.lisn.upsaclay.fr/competitions/10173.
Yesterday, we released the training dataset that can be found in the
"Files" subsection of the "Participate" tab. It is worth mentioning that
this dataset includes all the instances that were also released during the
Practice stage; so, it is not needed to combine both datasets.
Finally, remember that the CodaLab competition is open to submit your
results with the development dataset provided. This dataset is also
available in the same section as the training dataset.
Best regards,
The PoliticES 2023 organizing committee
References
-
García-Díaz, J. A., Almela, Á., Alcaraz-Mármol, G., & Valencia-García,
R. (2020). UMUCorpusClassifier: Compilation and evaluation of linguistic
corpus for Natural Language Processing tasks. Procesamiento del Lenguaje
Natural, 65, 139-142.
-
García-Díaz, J. A., Colomo-Palacios, R., & Valencia-García, R. (2022a).
Psychographic traits identification based on political ideology: An author
analysis study on Spanish politicians’ tweets posted in 2020. Future
Generation Computer Systems, 130(1), 59-74.
-
García-Díaz, J. A., Jiménez Zafra, S. M., Martín Valdivia, M. T.,
García-Sánchez, F., Ureña López, L. A., & Valencia García, R. (2022b).
Overview of PoliticEs 2022: Spanish Author Profiling for Political
Ideology. Procesamiento del Lenguaje Natural, 69, 265-272.
Important dates
-
Release of development corpora: Feb 13, 2023
-
Release of training corpora: Mar 13, 2023
-
Release of test corpora and start of evaluation campaign: Apr 17, 2023
-
End of evaluation campaign (deadline for runs submission): May 3, 2023
-
Publication of official results: May 5, 2023
-
Paper submission: May 29, 2023
-
Review notification: Jun 17, 2023
-
Camera ready submission: Jun 27, 2023
-
IberLEF Workshop (SEPLN 2023): Sep 26, 2023 (Jaén, Andalusia, Spain)
-
Publication of proceedings: Sep ??, 2023
Organizing committee
-
José Antonio García-Díaz (UMUTeam, Universidad de Murcia)
-
Salud María Jiménez-Zafra (SINAI, Universidad de Jaén)
-
María-Teresa Martín Valdivia (SINAI, Universidad de Jaén)
-
Francisco García-Sánchez (UMUTeam, Universidad de Murcia)
-
L. Alfonso Ureña-López (SINAI, Universidad de Jaén)
-
Rafael Valencia-García (UMUTeam, Universidad de Murcia)
[image: Universidad de Jaén] <http://www.uja.es/> *Salud María Jiménez
Zafra*
sjzafra(a)ujaen.es
Universidad de Jaén
Grupo de Investigación SINAI <http://sinai.ujaen.es/> | Departamento de
Informática
EPS Jaén, Edificio A3, Despacho 219
Campus Las Lagunillas s/n 23071 - Jaén | +34 953212992
[image: Universidad de Jaén] <http://www.uja.es/>
[With apologies for cross-posting]
Second call for submissions for the Workshop on Individual Differences in
Pragmatics and Discourse (IndiPRAG).
Important dates:
Submission deadline: 1st May 2023
Notification date: 5th June 2023
Workshop dates: 18th September (all day) and 19th September (morning) 2023
Workshop venue: Saarland University, Saarbrücken, Germany (the workshop is
collocated with XPRAG in Paris, 20th-23rd September)
**
Call for submissions:
Experimental research in pragmatics and discourse processing has
consistently found that not all comprehenders behave the same: while some
seem to draw rich pragmatic inferences, others respond in a way that is
more consistent with a literal interpretation (Fairchild & Papafragou,
2021; Mayn & Demberg, 2022). Similarly for discourse inference,
experiments have found differences with respect to the sensitivity to
discourse cues and the readiness for discourse predictions between
participants (Scholman, Demberg & Sanders, 2020; Tskhovrebova, Zufferey &
Gygax, 2022).
This workshop aims to bring together researchers interested in exploring
individual differences at the level of pragmatics and discourse, as well
as methods for relating those differences to cognitive properties, and
approaches for modelling the mechanism driving the individual differences
effects.
IndiPRAG Workshop invites submissions of abstracts addressing the
following questions:
- To what extent do pragmatic processing and discourse inferences differ
between individuals?
- How consistent are interpretation biases across different types of
pragmatic implicatures?
- What individual difference measures are particularly suitable for
measuring IDs related to pragmatic processing?
- How can we computationally model individual differences in discourse and
pragmatics?
- What statistical methods are best suited to identifying latent groups of
participants and relating ID measures to task performance?
**
Formatting guidelines:
The abstracts must not exceed 1000 words for the text (excl. captions),
10000 characters for references, 2 figures. Abstracts should be submitted
in PDF format, with 2.54 cm margins on all sides and 12 point font size,
single-spaced. Please indicate up to three appropriate keywords for your
abstract, which will be used for session planning.
Abstracts must be written in English and should include a title but no
information revealing the author(s).
We welcome submissions for work that is being considered by other
conferences, workshops, or journals.
Submissions should be handed in via easychair:
https://easychair.org/conferences/?conf=indiprag2023
**
We will have invited talks by:
Kirsten Abbot-Smith, University of Kent
Morten Christiansen, Cornell University
Craig Hedge, Aston University
Petra Hendriks, University of Groningen
Antje Meyer & Florian Hintz, MPI for Psycholinguistics Nijmegen
**
IndiPRAG is being organised by: Vera Demberg, Jia Loy, Alexandra Mayn,
Dongqi Pu, Margarita Ryzhova, Merel Scholman, Sebastian Schuster
You can contact us at: indiprag(a)lst.uni-saarland.de
The Department of English (http://www.LN.edu.hk/eng/) of Lingnan University (Hong Kong) now invites applications for a tenure-track position at the rank of Assistant Professor to contribute to research and teaching in Linguistics (Contemporary Language Studies). Candidates who can contribute to both the Linguistics and Literature strands of the programme, for example by specializing in literary linguistics, stylistics, corpus linguistics and/or digital humanities, will be given priority.
Applicants should have:
1.
2. A Ph.D. degree in linguistics, applied linguistics, or a related field;
3. A good publication record in peer-reviewed journals with a good international standing;
4. A track record of developing and teaching courses on literary linguistics, stylistics, corpus linguistics, and/or digital humanities;
5. Experience in academic administration and curriculum development.
Applicants are required to provide information about their research records and evidence of high-quality teaching in relevant subjects.
The appointee is expected to deliver and develop courses at both the undergraduate and postgraduate levels, supervise final-year projects, produce quality research outputs, and participate in competitive grant exercises. He or she is also expected to engage in knowledge transfer activities, and assist the Department with administration.
Details and how to apply: https://lingnan.csod.com/ux/ats/careersite/4/home/requisition/1859?c=lingnan
[Lingnan University Hong Kong] <https://www.ln.edu.hk>
[Transformation For Life] <https://www.ln.edu.hk/transformation-for-life>
Dear Colleagues,
I'm looking for a PhD student with an interest in multilingual natural
language processing (NLP) and transfer learning to join my group NALA at
the Institute of Computer Science of Johannes Gutenberg University Mainz,
Germany. Details can be found in this posting:
https://karriere.uni-mainz.de/files/2023/03/00323-08-wiss-nk.pdf
If you have questions, please feel free to email me directly, and please
forward this information to anyone who might be interested!
Thanks,
Katharina Kann
--
Dr. Katharina Kann
Assistant Professor of Computer Science
University of Colorado Boulder
Personal page: https://kelina.github.io
Group page: https://nala-cub.github.io
**apologies for cross-postings**
===== CFP deadline extension IWCS 2023 =====
Paper submissions:
15 March 2023 --> 22 March 2023
https://softconf.com/iwcs2023/papers
============================================
15th International Conference on Computational Semantics (IWCS)
Université de Lorraine, Nancy, France
20-23th June 2023
http://iwcs2023.loria.fr/
IWCS is the biennial meeting of SIGSEM [1], the ACL special interest
group on semantics [2]; this year's edition is organized in person by the
Loria [3] and IDMC [4] of the Université de Lorraine.
[1] http://sigsem.org/
[2] http://aclweb.org/
[3] https://www.loria.fr/fr/
[4] http://idmc.univ-lorraine.fr/
The aim of the IWCS conference is to bring together researchers
interested in any aspects of the computation, annotation, extraction,
representation and neuralisation of meaning in natural language,
whether this is from a lexical or structural semantic perspective.
IWCS embraces both symbolic and machine learning approaches to
computational semantics, and everything in between. The conference
and workshops will take place 20-23 June 2023.
=== TOPICS OF INTEREST ===
We invite paper submissions in all areas of computational semantics, in
other words all computational aspects of meaning of natural language within
written, spoken, signed, or multi-modal communication.
Presentations will be oral and posters.
Submissions are invited on these closely related areas, including the
following:
* design of meaning representations
* syntax-semantics interface
* representing and resolving semantic ambiguity
* shallow and deep semantic processing and reasoning
* hybrid symbolic and statistical approaches to semantics
* distributional semantics
* alternative approaches to compositional semantics
* inference methods for computational semantics
* recognising textual entailment
* learning by reading
* methodologies and practices for semantic annotation
* machine learning of semantic structures
* probabilistic computational semantics
* neural semantic parsing
* computational aspects of lexical semantics
* semantics and ontologies
* semantic web and natural language processing
* semantic aspects of language generation
* generating from meaning representations
* semantic relations in discourse and dialogue
* semantics and pragmatics of dialogue acts
* multimodal and grounded approaches to computing meaning
* semantics-pragmatics interface
* applications of computational semantics
=== SUBMISSION INFORMATION ===
Two types of submission are solicited: long papers and short papers. Both
types should be submitted not later than 3 March (anywhere on earth).
Long papers should describe original research and must not exceed 8 pages
(not counting acknowledgements and references).
Short papers (typically system or project descriptions, or ongoing research)
must not exceed 4 pages (not counting acknowledgements and references).
Both types will be published in the conference proceedings and in the ACL
Anthology. Accepted papers get an extra page in the camera-ready version.
Style-files:
IWCS papers should be formatted following the common two-column structure
as used by ACL. Please use our specific style-files or the Overleaf template, taken
from ACL 2021. Similar to ACL 2021, initial submissions should be fully anonymous
to ensure double-blind reviewing.
Submitting:
Papers should be submitted in PDF format via Softconf:
https://softconf.com/iwcs2023/papers
Please make sure that you select the right track when submitting your paper.
Contact the organisers if you have problems using Softconf.
No anonymity period
IWCS 2023 does not have an anonymity period. However, we ask you to be
reasonable and not publicly advertise your preprint during (or right before) review.
=== IMPORTANT DATES ===
15 March 2023 ->
22 March 2023 (anywhere on earth) Paper submissions
17 April 2023 Decisions sent to authors
15 May 2023 Camera-ready papers due
20-23 June 2023 IWCS conference
=== CONTACT ===
For questions, contact: iwcs2023-contact(a)univ-lorraine.fr
Maxime Amblard, Ellen Breithloltz (the IWCS 2023 organizers)
**apologies for cross-postings**
===== CFP deadline extension IWCS 2023 =====
Paper submissions:
15 March --> 22 March 2023
https://softconf.com/iwcs2023/papers
============================================
15th International Conference on Computational Semantics (IWCS)
Universit�� de Lorraine, Nancy, France
20-23th June 2023
http://iwcs2023.loria.fr/
IWCS is the biennial meeting of SIGSEM [1], the ACL special interest
group on semantics [2]; this year's edition is organized in person by the
Loria [3] and IDMC [4] of the Universit�� de Lorraine.
[1] http://sigsem.org/
[2] http://aclweb.org/
[3] https://www.loria.fr/fr/
[4] http://idmc.univ-lorraine.fr/
The aim of the IWCS conference is to bring together researchers
interested in any aspects of the computation, annotation, extraction,
representation and neuralisation of meaning in natural language,
whether this is from a lexical or structural semantic perspective.
IWCS embraces both symbolic and machine learning approaches to
computational semantics, and everything in between. The conference
and workshops will take place 20-23 June 2023.
=== TOPICS OF INTEREST ===
We invite paper submissions in all areas of computational semantics, in
other words all computational aspects of meaning of natural language within
written, spoken, signed, or multi-modal communication.
Presentations will be oral and posters.
Submissions are invited on these closely related areas, including the
following:
* design of meaning representations
* syntax-semantics interface
* representing and resolving semantic ambiguity
* shallow and deep semantic processing and reasoning
* hybrid symbolic and statistical approaches to semantics
* distributional semantics
* alternative approaches to compositional semantics
* inference methods for computational semantics
* recognising textual entailment
* learning by reading
* methodologies and practices for semantic annotation
* machine learning of semantic structures
* probabilistic computational semantics
* neural semantic parsing
* computational aspects of lexical semantics
* semantics and ontologies
* semantic web and natural language processing
* semantic aspects of language generation
* generating from meaning representations
* semantic relations in discourse and dialogue
* semantics and pragmatics of dialogue acts
* multimodal and grounded approaches to computing meaning
* semantics-pragmatics interface
* applications of computational semantics
=== SUBMISSION INFORMATION ===
Two types of submission are solicited: long papers and short papers. Both
types should be submitted not later than 3 March (anywhere on earth).
Long papers should describe original research and must not exceed 8 pages
(not counting acknowledgements and references).
Short papers (typically system or project descriptions, or ongoing research)
must not exceed 4 pages (not counting acknowledgements and references).
Both types will be published in the conference proceedings and in the ACL
Anthology. Accepted papers get an extra page in the camera-ready version.
Style-files:
IWCS papers should be formatted following the common two-column structure
as used by ACL. Please use our specific style-files or the Overleaf template, taken
from ACL 2021. Similar to ACL 2021, initial submissions should be fully anonymous
to ensure double-blind reviewing.
Submitting:
Papers should be submitted in PDF format via Softconf:
https://softconf.com/iwcs2023/papers
Please make sure that you select the right track when submitting your paper.
Contact the organisers if you have problems using Softconf.
No anonymity period
IWCS 2023 does not have an anonymity period. However, we ask you to be
reasonable and not publicly advertise your preprint during (or right before) review.
=== IMPORTANT DATES ===
15 March --> 22 March 2023 (anywhere on earth) Paper submissions
17 April 2023 Decisions sent to authors
15 May 2023 Camera-ready papers due
20-23 June 2023 IWCS conference
=== CONTACT ===
For questions, contact: iwcs2023-contact(a)univ-lorraine.fr
Maxime Amblard, Ellen Breithloltz (the IWCS 2023 organizers)
________________________________
Von: Nagengast, Milena
Gesendet: Freitag, 24. Februar 2023 14:09
An: corpora(a)list.elra.info
Betreff: Unsubscribe
Dear All,
Please unsubscribe me from this list.
Thank you, and all the best,
M. Nagengast
First International Workshop on Gender-Inclusive Translation Technologies (GITT) at EAMT 2023
15 June 2023, Tampere, Finland
https://sites.google.com/tilburguniversity.edu/gitt2023
@GITT2023
Important Dates (Time zone: Anywhere on Earth)
Paper Submission deadline: 14 April, 2023
Notification of Acceptance: 5 May, 2023
Camera Ready Copy due: 12 May, 2023
Workshop: 15 June, 2023
Aim and scope
The Gender-Inclusive Translation Technologies Workshop (GITT) is set out to be the first workshop that focuses on gender-inclusive language in translation and cross-lingual scenarios. The workshop aims to bring together researchers from diverse areas, including industry partners, MT practitioners, and language professionals. GITT aims to encourage multidisciplinary research that develops and interrogates both solutions and challenges for addressing bias and promoting gender inclusivity in MT and translation tools.
Topics
GITT invites technical as well as non-technical submissions, which consist of experimental, theoretical or methodological contributions. We explicitly welcome interdisciplinary submissions and submissions that focus on innovative, non-binary linguistic strategies and/or with sociolinguistically-informed perspectives. The topics of interest include, but are not limited to:
- Models or methods for assessing and mitigating gender bias
- New resources for inclusive language and gender translation (e.g., datasets, translation memories, dictionaries)
- Social, cross-lingual, and ethical implications of gender bias
- Qualitative and quantitative analyses on the potential limits of current approaches to gender bias in translation and MT, error taxonomies as well as best practices and guidelines
- User-centric case studies on the impact of biased language and/or mitigating approaches which can include translators, post-editors, or monolingual MT users
GITT is also open to other non-listed topics aligned with the scope of the workshop and works focusing on non-textual modalities (e.g., audiovisual translation)
Submission
We welcome three types of submissions:
- Research papers: of at least 4 up to 10 pages (including references)
- Extended Abstracts: up to 2 pages (including references)
Accepted papers and extended abstracts consisting of novel work will be published online as proceedings in the ACL Anthology.
- Research Communications: up to 2 pages (including reference)
We include a parallel submission policy for papers accepted in other venues in 2022. Research communications will not be included in the proceedings, but will serve to promote the dissemination of research aligned with the scope of the workshop.
Submissions should adhere to the EAMT 2023 guidelines and style templates (PDF, LaTeX, Word) and be uploaded on EasyChair: https://easychair.org/my/conference?conf=gitt2023
Activities
During the workshop, there will be a guided discussion starting from examples of gender bias in MT output collected via the DeBiasByUs website.
Attendees are invited to contribute their own examples beforehand via DeBiasByUs.
More information about the project and the activity can be found on the WS website.
Workshop organizers
Eva Vanmassenhove, University of Tilburg
Beatrice Savoldi, Fondazione Bruno Kessler
Luisa Bentivogli, Fondazione Bruno Kessler
Joke Daems, University of Ghent
Janiça Hackenbuchner, Cologne University of Applied Sciences