The 9th International Workshop on Computational Linguistics for Uralic Languages (IWCLUL 2024) will be organized by ACL SIGUR. The proceedings of the event will be published in the ACL anthology. The workshop will take place in November 28-29, 2024 in Helsinki, Finland at Metropolia University of Applied Sciences.
https://acl-sigur.github.io/iwclul2024.html
Submission deadline: October 20, 2024
Registration/publication fees: 0€!
We solicit original and unpublished work related to NLP methods for Uralic languages, including multilingual methods that include at least one Uralic language (e.g. Finnish, Estonian, Hungarian etc). Appropriate topics include (but are not limited to):
- Multilingual approaches in NLP presenting work on at least one Uralic language
- Parsers, analysers and processing pipelines of Uralic languages
- Lexical databases, electronic dictionaries
- Finished end-user applications aimed at Uralic languages, such as spelling or grammar checkers, machine translation or speech processing
- Evaluation methods and gold standards, tagged corpora, treebanks
- Reports on language-independent or unsupervised methods as applied to Uralic languages
- Surveys and review articles on subjects related to computational linguistics for one or more Uralic languages
- Any work that aims at combining efforts and reducing duplication of work
- How to elicit activity from the language community, agitation campaigns, games with a purpose
Short papers can be up to 4 pages in length (5 for camera-ready version). Short papers can report on work in progress or a more targeted contribution such as software or partial results.
Long papers can be up to 8 pages in length (9 for camera-ready version). Long papers should report on previously unpublished, completed, original work.
Lightning talks submitted as 750-word abstracts. Lightning talks are suited for discussing ideas or presenting work in progress. The abstracts will be published in a lightning proceedings on Zenodo.
All submission formats can have an unlimited number of pages for references. All submissions must follow the ACL stylesheet.
The submissions must be anonymous, and they will be peer-reviewed by our program committee. The peer review is double blinded. Papers must be submitted using the conference submission system by the deadline. At least one of the authors of an accepted paper must attend the event and present their paper.
Accepted papers (short and long) will be published in the joint proceedings that will appear in the ACL Anthology. Accepted papers will also be given an additional page to address the reviewers’ comments. The length of a camera-ready submission can then be 5 pages for a short paper and 9 for a long paper with an unlimited number of pages for references.
Important dates:
- Paper submission (full and short): October 20, 2024
- Notification of acceptance: November 3, 2024
- Camera ready deadline: November 10, 2024
- Registration deadline: November 10, 2024
- Workshop: November 28-29, 2024
Dear list members,
I'm delighted to announce a new publication in the Cambridge Elements in Corpus Linguistics series. This publication is FREE to download until 10 June 2024 (see the link at the bottom of this email).
Title: Programming for Corpus Linguistics with Python and Dataframes
Author: Daniel Keller, Western Kentucky University
Summary: This Element offers intermediate or experienced programmers algorithms for Corpus Linguistic (CL) programming in the Python language using dataframes that provide a fast, efficient, intuitive set of methods for working with large, complex datasets such as corpora. This Element demonstrates principles of dataframe programming applied to CL analyses, as well as complete algorithms for creating concordances; producing lists of collocates, keywords, and lexical bundles; and performing key feature analysis. An additional algorithm for creating dataframe corpora is presented including methods for tokenizing, part-of-speech tagging, and lemmatizing using spaCy. This Element provides a set of core skills that can be applied to a range of CL research questions, as well as to original analyses not possible with existing corpus software.
The Element can be accessed using this link: https://doi.org/10.1017/9781108904094.
Susan Hunston (she/her)
Professor of English Language
+44 121 414 5675
University of Birmingham
Department of English Language and Linguistics
www.birmingham.ac.uk
Dear all,
Our Chair of Multilingual Computational Linguistics in Passau offers one
post-doc or doctoral positions as part of the ERC project "Productive
Signs: A computer-assisted investigation of evolutionary, typological,
and cognitive aspects of word families". The position is 3 years. An
extension for doctoral candidates may be possible, pending sufficient
funding.
You find more information on the position on the following link (for the
time being only in German, but the English version will also follow soon):
https://www.uni-passau.de/fileadmin/dokumente/beschaeftigte/Stellenangebote…
Sincerely,
Mattis List
--
Prof. Dr. Johann-Mattis List
Chair of Multilingual Computational Linguistics
University of Passau
Dr.-Hans-Kapfinger-Str. 16
04032 Passau
Germany
Chair Website: https://phil.uni-passau.de/multilinguale-computerlinguistik/
Personal Website: https://lingulist.de
Telephone: +49(0)851/509-3480
Dear all,
We are offering an exciting PostDoc/Senior Researcher position at the
intersection of NLP and computational social science within the
department Knowledge Technologies for the Social Sciences
(https://gesis.org/en/kts) at GESIS in Cologne, Germany.
The position is limited to 4 years, with option for tenure/permanency.
Further information can be found at
https://www.hidden-professionals.de/HPv3.Jobs/gesis/stellenangebot/37621/Se….
For any questions, please don't hesitate to get in touch with me.
Best regards,
Stefan
--
Prof. Dr. Stefan Dietze
Scientific Director Knowledge Technologies for the Social Sciences
GESIS - Leibniz Institute for the Social Sciences
Web: https://www.gesis.org/en/kts
Chair of Data & Knowledge Engineering
Heinrich-Heine-University Düsseldorf
Web: https://www.cs.hhu.de/en/research-groups/data-knowledge-engineering
Phone: +49 (0)221-47694-421
Web: http://stefandietze.net
Dear Colleagues,
This is the last call for participation in the FinLLM Challenge of the FinNLP-agentscen workshop at IJCAI 2024! The registration will close at AOE 00:00 AM on May 28, 2024.
Don't miss this opportunity to test your skills and contribute to the exciting field of AI in Finance. For the latest updates and to access the train set, please visit our website:
FinLLM Challenge: https://sites.google.com/nlg.csie.ntu.edu.tw/finnlp-agentscen/shared-task-f…
We also welcome submissions of existing models or checkpoints of LLMs.
If you have any questions or encounter any issues, feel free to reach out to our support team via email at ijcaifinllmcontest(a)thefin.ai.
Stay tuned for further announcements and good luck!
Best regards,
FinLLM Organizers
Contact email: ijcaifinllmcontest(a)thefin.ai
Discord Link: https://discord.gg/pCrWUQct
The Department of Computer Science at the IT University of Copenhagen is
offering a Postdoc position in Natural Language Processing/Computational
Linguistics*,* with a start date of *1 September 2024* or as soon as
possible. The *application deadline is 31* *May** 2024.* Applications for
the position can be submitted via ITU job portal
<https://candidate.hr-manager.net/ApplicationInit.aspx?cid=119&ProjectId=181…>
.
*Proposed project title: *Efficiency and Robustness in Language Model
Pre-training
*Proposed project description.* Recent generative systems based on
pre-trained language models are remarkably fluent, but this is achieved by
extreme volumes of computation and training data. This means not only high
energy costs, but also training on data that is problematic in various
ways: copyright, harmful social stereotypes, non-representative sampling,
misinformation, junk SEO texts, pornography, and contamination with NLP
datasets used for evaluation.
This project will create an ambitious resource for research on transfer
learning, in which pre-training data is held constant, and evaluation takes
into account how much similar data was observed in training, and in what
ways it was similar. This resource will encourage the development of more
efficient and robust approaches, since it will not be possible to improve
benchmark scores by simply training on more data.
The ideal candidate will have a strong background in Computational
Linguistics/Natural Language Processing and experience developing NLP
resources, as well as core skills in programming in Python and machine
learning.
The position is funded for 1 year, and it is our intention to find
additional funding to extend this postdoc to a 2- or 3-year position.
Besides research, the postdoc will gain experience with organization of an
international workshop and shared task and build up their international
network. For those interested in pursuing an academic career, the following
is also possible (but entirely optional):
- gain experience in applying for external funding with professional
support (either for the continuation of the postdoc’s own position, e.g.
Marie Curie postdoctoral fellowship, or by contributing to PI’s grant
proposals);
- supervise Master students solo, and/or assist in supervising a PhD
student;
- undertake a formal teacher training program, including teaching guest
lectures in the relevant data science courses at the ITU computer science
department.
The successful candidate will be a member of the national Pioneer Centre
for Artificial Intelligence <https://aicentre.dk/>, a 5-university Danish
research endeavor, and of the NLPnorth <https://nlpnorth.github.io/>research
group at the IT University’s Computer Science Department. Both the centre
and research group are highly international and well-funded, working on a
broad range of research topics.
The project will be supervised by Associate Professor Anna Rogers
<https://annargrs.github.io/> (arog(a)itu.dk), to whom inquiries about the
project can be directed.
--
Best regards,
Anna Rogers
Associate Professor
Computer Science Department, IT University of Copenhagen
http://annargrs.github.io/
[apologies for cross-posting]
Dear all,
We are offering three fully funded PhD positions to study the political
biases of large language models in various application settings
(simplification and paraphrasing, summarization and question answering,
translation): how to define them? How to measure them? How to mitigate
them? The starting date for these positions is october 2024.
Depending on their interests and preferences, the successful candidates
will either join :
- the MLIA team of ISIR: https://www.isir.upmc.fr/
- the Computational Linguistic group of STIH:
http://stih-sorbonne-universite.fr
Both labs are affiliated with Sorbonne Université and located in the
center of Paris.
Applications should be submitted only via this link, where more
information about these positions is available:
https://emploi.cnrs.fr/Offres/Doctorant/UMR7222-FRAYVO-002/Default.aspx?lan…
If you are interested, please submit your application *before june
15th*, including the following elements:
• Detailed CV,
• Letter of motivation,
• Details of transcripts (especially M1 and M2),
• Elements of bibliography or personal achievements related to a
research activity (e.g. master project, research internship subject, etc.),
• 2 recommendation letters
Interviews will be conducted online between june 10th and june 24th, 2024.
G. Lejeune (STIH) - B. Piwowarski & F. Yvon (ISIR/MLIA)
---
François Yvon
ISIR, CNRS & Sorbonne Université
+33 (0)1 44 27 62 11
4 Place jussieu
F-75005 Paris
Apologies for cross-postings!
** Please forward to anyone who might be interested **
************************************************************************************
CALL FOR PAPERS
Sixth International Conference on AI in Computational Linguistics
(ACLing2024)
September 21-22, 2024 (Hybrid Conference)
Brochure: http://acling.org/wp-content/uploads/2024/05/ACLing24_CFP.pdf
Publication: Procedia Computer Science by ELSEVIER (
https://www.sciencedirect.com/journal/procedia-computer-science)
Website: https://acling.org/
************************************************************************************
IMPORTANT DATES
* Paper submission deadline: 31 May 2024 (Extended and Final)
* Notification of Acceptance: 21 June 2024
* Registration: 25 June 2024
* Camera ready version submission: 14 July 2024
* Conference Date: 20 – 21 September 2024
************************************************************************************
INTRODUCTION
Are you interested in artificial intelligence and computational
linguistics? The ACLing Conference is a great opportunity to share your
latest research with a global audience of experts. We're inviting
submissions on a wide range of topics, including machine learning, natural
language processing, and applications of these technologies.
SOME KEY TOPICS OF INTEREST:
* Large Language Models and their applications
* Information Retrieval and Question Answering
* Information Extraction
* Linguistic Theories and Resources
* Language Modeling
* Speech and Multimodality
* Machine Learning, Text Categorization, and Text Mining
* Machine Translation
* Multilinguality and Cross-linguality
* NLP Applications
* Segmentation, Tagging, and Parsing
* Semantics
* Sentiment Analysis and Opinion Mining
* Web, Social Media and Computational Social Science
* Natural Language Generation
* Text Categorization and Topic Modeling
* Text Mining
* Language and Vision
* AI applications in Computational linguistics
SUBMISSION INSTRUCTIONS
For detailed instructions on how to submit your paper, please visit the
conference website: https://acling.org/submission/. All the submissions
should be submitted via EasyChair:
https://easychair.org/my/conference?conf=acling2024
BENEFITS OF PUBLISHING AT ACLING:
• Your work will be seen by a global audience of leading researchers in
computational linguistics.
• All accepted papers will be published in the Procedia Computer Science by
ELSEVIER.
INDEXING, RANKING, AND IMPACT (web sources):
* Abstracting and indexing:
https://www.sciencedirect.com/journal/procedia-computer-science/about/insig…
* Scopus: https://www.scopus.com/sourceid/19700182801?origin=sbrowse
* CiteScore: https://www.scopus.com/sourceid/19700182801?origin=sbrowse
* SJR (scimago):
https://www.scimagojr.com/journalsearch.php?q=19700182801&tip=sid
* ACLingy by Google Citation:
https://scholar.google.com/citations?user=jkpMuFMAAAAJ&hl=en
* ACLing by DBLP: https://dblp.org/db/conf/acling/index.html
FURTHER INFORMATION & CONTACT DETAILS
* Vist the conference website link https://acling.org/ (will be updated on
a regular basis).
* For further information, please contact us at ACLing2024(a)gmail.com
Regards,
CONFERENCE ORGANIZERS
* Prof. Dr. Khaled Shaalan, The British University of Dubai, UAE
* Prof. Dr. Samhaa R. El-Beltagy, Newgiza University, Egypt
================[Apologies for any cross-posting]================
Dear Colleagues,
Due to several requests, the deadline of the Special issue of the
journal Traitement Automatique des Langues (TAL) "Abusive Language
Detection : Linguistic Resources, Methods and Applications" *has been
extended to June 15th*.
**Guest Editors**
Farah Benamara (IRIT-Toulouse University, IPAL Singapore), Delphine
Battistelli (MoDyCo, Paris Nanterre University) and Viviana Patti (Turin
University)
**Motivations**
Abusive language - or, in another very common terminology, hate speech -
and the propagation of harmful stereotypes have unfortunately become
commonplace occurrences on various social media platforms, partly due to
users’ freedom and anonymity and the lack of regulation provided by
these platforms. The sheer volume and often implicit nature of such
unwanted content make manual moderation of these user spaces a
formidable task. Various scientific communities interested in its at
least partial automation have taken up the problem over the past ten
years. In particular, Computational Social Science, Natural Language
Processing and Computational Linguistics have proposed numerous works to
create resources, datasets, and models aimed at automating the task of
abusive language detection (henceforth ALD). In fact, we see that ALD
has become a research theme in its own right in the field of Natural
Language Processing with an abundant literature.
Abusive language (umbrella term to refer to the various forms of harmful
language, such as toxic, offensive language, hate speech, and
stereotypes) is topically focused and each specific manifestation of
abusive language targets different vulnerable groups based on
characteristics such as gender (misogyny, sexism), ethnicity, race,
religion (xenophobia, racism, Islamophobia), sexual orientation
(homophobia), and so on. Most automatic ALD approaches cast the problem
into a binary classification task but important considerations should be
taken into account, in particular: (1) the topical focus or the
target-oriented nature of hate speech ; (2) the degree of engagement of
users in abusive content (e.g., denunciation, approbation, reporting,
neutral attitude) ; (3) the question of stereotypes and dominant
ideologies ; (4) the question of linguistic strategies more particularly
linked or born with social networks (e.g., emoticons, hashtags).
Furthermore, most of the work (resources, classifiers) is developed for
English.
**Topics**
Motivated by the interest of the community in the problem of ALD, we
invite papers from Natural Language Processing, Machine Learning and
Computational Social Sciences. We explicitly encourage interdisciplinary
submissions (resources, computational methods, and user applications at
the interface of linguistics/psychology/socio-linguistics/sociology) but
also position papers on the actual state of the art in the field
discussing the limitations of the current approaches and directions for
future work. The topics covered by the special issue include, but are
not limited to:
-- Linguistic resources and evaluation: annotation schemes, corpus
linguistics studies, new datasets, with a particular interest in French
language and/or multilingual resources. In the case of strictly lexical
resources: methods for constituting them and coverage, semantic
categories retained.
-- Formal/Conceptual approaches for ALD as inspired by models in
sociology, socio-linguistics and psychology.
-- Models and Methods: supervised and unsupervised approaches, including
LLMs.
-- Role of contextual phenomena, including discourses, extra-linguistic
contexts (e.g., cultural aspects).
-- Models for cross-lingual and multimodal detection.
-- New approaches beyond binary classification: target-oriented ALD,
degrees of user engagement, etc.
-- Dynamics of online AL in social media, propaganda propagation.
-- Bias detection and removal in resource creation, datasets and methods.
-- Application of ALD tools in education, social media content
moderation, etc.
-- Social, legal, and ethical implications of detecting, monitoring and
moderating AL.
**Important dates**
June 15th, 2024: Submission deadline
July 15th, 2024: Notification of acceptance after first rereading
End of September 2024: Revised version
Mid October 2024: Final decision
End of November 2024: Camera ready
January 2025: Publication of the special issue
**Submission**
Submissions can either be in French or English and should follow the
journal templates: https://tal-65-3.sciencesconf.org/
**About the journal**
Traitement Automatiques des Langues Journal (TAL) is the international
French journal of Natural Language Processing
(https://www.atala.org/revuetal) published by ATALA (French Association
for Natural Language Processing, http://www.atala.org) since 1959 with
the support of CNRS (National Centre for Scientific Research). It is
indexed by ACL Anthology as well as DBLP. It is also supported by the
Institute of Human and Social Sciences of the CNRS.
**Contact**
For any question, please contact tal-65-3(a)sciencesconf.org
**External committee**
-- Cristina Bosco, University of Turin
-- Elena Cabrio, University of Côte d'Azur
-- Tommaso Caselli, Faculty of Arts, Rijksuniveristeit Groningen
-- Valentina Dragos, ONERA
-- Karën Fort, Sorbonne University
-- Claire Hugonnier, University of Grenoble Alpes
-- Irina Illina, University of Lorraine
-- Roy Ka-Wei Lee, Singapore University of Technology and Design
-- Véronique Moriceau, IRIT, University of Toulouse
-– Frédérique Segond, INRIA Paris
-- Mariona Taulé, University of Barcelona
-- Samuel Vernet, Aix-Marseille University
-- Mathieu Valette, Paris Sorbonne Nouvelle University
-- Marcos Zampieri, George Mason University
--
========================
Farah Benamara Zitoune
Professor in Computer Science, Université Paul Sabatier
IRIT-CNRS
118 Route de Narbonne, 31062, Toulouse.
Tel : +33 5 61 55 77 06
http://www.irit.fr/~Farah.Benamara
==================================
--
========================
Farah Benamara Zitoune
Professor in Computer Science, Université Paul Sabatier
IRIT-CNRS
118 Route de Narbonne, 31062, Toulouse.
Tel : +33 5 61 55 77 06
http://www.irit.fr/~Farah.Benamara
==================================