Dear All,
I am pleased to forward the open call for SRIA Contribution Projects
launched by the ELE2 Consortium.
Full information, including the call documentation and related annexes,
is available on the open call website
<https://european-language-equality.eu/open-call/>.
Please note that only research organisations, NGOs, incorporated
associations and companies from EU Member States are eligible to apply.
Proposals can be submitted until 29 November 2022.
Best regards,
Claudia
-------- Forwarded Message --------
Subject: ELE2 consortium launched an open call for SRIA Contribution
Projects
Date: Tue, 11 Oct 2022 13:23:04 +0200 (CEST)
From: Jana Hamrlova <hamrlova(a)ufal.mff.cuni.cz>
To: langeq-2020 <langeq-2020(a)adaptcentre.ie>
CC: open-call <open-call(a)european-language-equality.eu>
Dear all,
I would like to remind you that the ELE2 consortium has launched an open
call for SRIA Contribution Projects.
For more details, see the text below. Full information, including the call
documentation and related annexes, is available on the open call website
<https://european-language-equality.eu/open-call/>.
With best regards,
Jana Hamrlova
ELE Open Call for SRIA Contribution Projects Management Team
open-call(a)european-language-equality.eu
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
The European Language Equality initiative
<https://european-language-equality.eu/> has launched an open call for SRIA
Contribution Projects <https://european-language-equality.eu/open-call/>.
TOPICS
The SRIA contribution projects are meant to provide meaningful, so far
missing, convincing and compelling input for the Strategic Agenda and
Roadmap for achieving full digital language equality in Europe by 2030.
The projects should provide defined use cases and feasibility studies for
concrete application scenarios in one of the following topics:
1. Data sets for more robust speech technology
2. Study of language coverage for text mining and natural language
understanding in key European industrial sectors
3. Legal Assessment (Desk Research)
4. General NLP/LT/AI Landscaping (Desk Research)
5. General NLP/LT Domains (Desk Research)
6. Analysis of AI and LT in European news media
7. Computing facilities for LT (Desk Research)
8. Demonstrably Greener Models of MT
9. Survey of the use of LT in the hospital sector
10. Basic LAnguage Resource Kit (BLARK) (re)definition (Desk Research)
TIMELINE
Publication of the call: *29 September 2022*
Submission deadline: *29 November 2022 (23:59 CET)*
Evaluation and selection of the submitted proposals: December 2022
Contract signing and projects start: December 2022 – January 2023
Project duration: 2-3 months
ELIGIBLE APPLICANTS
* research organisations (including but not limited to higher education
organisations and independent research organisations), NGOs,
incorporated associations and companies
* legally established in EU member states
* one organisation per project only (mono-beneficiary projects)
FUNDING
* €185,000 allocated for the call
* maximum amount of eligible costs per single project: €25,000, funding
rate: 90%
* financial support will be provided in the form of a lump sum after
completing all project activities
HOW TO APPLY
Apply via the open call submission platform
<https://opencall.european-language-equality.eu/> and follow the call
documentation and related annexes on the open call website
<https://european-language-equality.eu/open-call/>.
CONTACT
open call management team: open-call(a)european-language-equality.eu
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Dear All,
Together with Sunayana Sitaram, we are organizing a special session (Dialogue Systems for Multilingual and Under-resourced Language Speakers<https://sites.google.com/view/iwsds2023/special-sessions>) at IWSDS’2023 (hosted by University of Southern California/Institute for Creative Technologies).
We look forward to academic and industrial contributions (deadline for papers: October 28th).
Feel free to email us if you have any questions.
Kind regards,
A.S. Doğruöz & Sunayana Sitaram
Here is the call:
"Current dialogue systems target mostly monolingual and high resource languages and their speakers. However, millions of speakers around the world (e.g., India, Africa, Europe as well as indigenous and immigrant communities in the US) are multilingual and it is normal for these speakers and communities to switch within or across languages in daily lives (Doğruöz & Sitaram, 2022; Doğruöz et al., 2021; Sitaram et al., 2019). In addition, most languages of the world are still under-resourced. Therefore, there is a need for dialogue systems to be more inclusive and target both the multilingual and under-resourced languages and their speakers. The aim of this special session is to bring together researchers from the SDS community and encourage research and discussion around the unique challenges (e.g., data collection, model building, sociolinguistic aspects and system evaluation) for multilingual and under-resourced languages."
References:
Doğruöz, A. S., & Sitaram, S. (2022). Language technologies for low resource languages: sociolinguistic and multilingual insights <https://biblio.ugent.be/publication/8756694>. In M. Melero, S. Sakti, & C. Soria (Eds.), Proceedings of the 1st Annual Meeting of the ELRA/ISCA Special Interest Group on Under-Resourced Languages (pp. 92–97). Marseille, France: European Language Resources Association (ELRA).
Doğruöz, A. S., Sitaram, S., Bullock, B. E., & Toribio, A. J. (2021). A survey of code-switching: linguistic and social perspectives for language technologies <https://biblio.ugent.be/publication/8712328>. In C. Zong, F. Xia, W. Li, & R. Navigli (Eds.), 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Vol. 1 (ACL-IJCNLP 2021) (pp. 1654–1666). https://doi.org/10.18653/v1/2021.acl-long.131
Sitaram et al. (2019). A Survey of Code-switched Speech and Language Processing. <https://arxiv.org/abs/1904.00784>
*Apologies for cross-posting*
*Special Issue on Language Technology for Safer Online Social Media
Platforms in Low-resource Eurasian Languages *
Link:
https://dl.acm.org/pb-assets/static_journal_pages/tallip/pdf/TALLIP-SI-Lang…
* Aims, Scope and Objective of Special Issue: *
Our everyday lives have become more reliant on online platforms. Social
media (Facebook, Twitter, Instagram), discussion websites (Reddit),
messaging services (WhatsApp, Snapchat), blogs, forums, and online chats
have all been used to spread ideas and data. Without a doubt, social media
platforms like Facebook, Twitter, and Instagram benefit society by enabling
individuals to express themselves and seek support from others in the
online community. However, these platforms also have an unmistakable dark
side: cyberbullying, cyberstalking, cyberterrorism, e-bile, fake news,
flaming, hate speech, impersonation, pornography, glorification of
dangerous behavior (e.g., eating disorders), and trolling. In recent
years, various news sites have reported numerous incidents of suicide,
grief, and fear. Moreover, although individuals from many linguistic
backgrounds are exposed to online social media, English remains at the
forefront of ongoing advances in language technology research. Recently,
several studies have been conducted on highly resourced languages such as
Arabic, German, Hindi, and Italian. However, more research on making social
media platforms safer in low-resource Eurasian languages is still needed.
This special issue aims to gather original research articles that add to
the body of knowledge about the use of intelligent natural language systems
to build a safer social media environment in low-resource Eurasian
languages.
* Topics *
Among the special issue's topics of interest are the following:
• Early detection of radicalization in low-resource Eurasian languages
• Mechanisms for recognizing and preventing cyber predators in
low-resource Eurasian languages
• Identifying and resolving hate speech (abusive language, cyberbullying,
etc.) in low-resource Eurasian languages
• Simulated propagation and transmission of potentially harmful information
via social media in low-resource Eurasian languages
• Data collection and annotation methodologies for safer social media in
low-resource Eurasian languages
• Content moderation strategies in low-resource Eurasian languages
• Cybersecurity and social media in low-resource Eurasian languages
• Fake news detection in low-resource Eurasian languages
* Important Dates *
• Submissions deadline: 10 February 2023
With regards,
Dr. Bharathi Raja Chakravarthi,
Assistant Professor / Lecturer-above-the-bar
School of Computer Science, University of Galway
Insight SFI Research Centre for Data Analytics, Data Science Institute,
University of Galway
E-mail: bharathiraja.akr(a)gmail.com ,
bharathiraja.asokachakravarthi(a)universityofgalway.ie
Google Scholar: https://scholar.google.com/citations?user=irCl028AAAAJ&hl=en
Hi
This is the last call for applications to the ALPS 2023 winter school.
We have extended the deadline to Sept 30th, 2022.
Our list of invited speakers has also been updated, and it is awesome!
See more at http://alps.imag.fr/
Laurent
Dear all,
I'm a big fan of lig-aikuma, which allowed us to collect key data in remote
regions in Bolivia, and recommended it to a team member who is heading back
there in a week. But only today we realized it doesn't work with the
Android version on the phone that's being taken to the field.
I wonder if any of you know of something that works like lig-aikuma,
allowing us to provide the app with:
- a list of texts to be shown on the screen, & collect the audio while
the informant reads the text;
- a list of sound files that can be listened to, & collect the audio
while the informant repeats what they heard (or discusses it)
I read about ODK & jotforms mobile, which may be programmable to have those
functionalities, but I'm hoping we don't need to develop this in a rush. So
if you've used them to do one or both of the above, we'd be grateful if we
could take a peek at your code.
Your help will be greatly appreciated!
-Alex
---------------------------------------------------------------
Alex (Alejandrina) Cristia
Researcher, CNRS
Laboratoire de Sciences Cognitives et Psycholinguistique
29, rue d'Ulm, 75005, Paris, FRANCE
My site: www.acristia.org
---------------------------------------------------------------
If you donate, ask me about effective charities
<https://effectivealtruism.us8.list-manage.com/track/click?u=52b028e7f799cca…>.
/ Si vous faites des dons, demandez moi sur le don efficace
<https://www.altruismeefficacefrance.org/guide-don-efficace-1/>.
Dear colleagues,
Can you please forward this opportunity broadly in your network? We are
particularly inviting applications by folks from under-represented
backgrounds, who thrive in our team!
Thank you in advance,
Alex
*Short summary: *We are looking for someone with experience with deep
learning, ideally using scikit-learn & pytorch, to join our technical team.
We specialize in long-form audio-recordings, and your job will be to
design, fine-tune, and evaluate neural networks on such data. French is NOT
required - our team works in English!
For more details see
https://emploi.cnrs.fr/Offres/CDD/UMR8554-ALECRI1-001/Default.aspx?lang=EN
FIRST CALL FOR PARTICIPATION
Advanced Language Processing School (ALPS)
January 16-20, 2023
Virtual Event
We are opening the registration for the third Advanced Language Processing School (ALPS), co-organized by University Grenoble Alpes and Naver Labs Europe.
*Target Audience*
This is a winter school covering advanced topics in NLP, and we are primarily targeting doctoral students and advanced (research) masters. A few slots will also be reserved for academics and persons working in research-heavy positions in industry.
*Characteristics*
Advanced lectures by first class researchers. A (virtual) atmosphere that fosters connections and interaction. A poster session for attendees to present their work, gather feedback and brainstorm future work ideas.
*Speakers*
The current list of speakers is: Kyunghyun Cho (New York University, USA); Yejin Choi (University of Washington and Allen Institute for AI, USA); Dirk Hovy (Bocconi University, Italy); Colin Raffel (University of North Carolina at Chapel Hill, Hugging Face, USA); Lucia Specia (Imperial College, UK); François Yvon (LISN/CNRS, France).
*Application*
To apply to this winter school, please follow the instructions at <http://alps.imag.fr/index.php/application/>. The deadline for applying is Sept 16th, and we will notify acceptance on October 3rd.
*Contact*
Website: http://alps.imag.fr/
E-mail: alps(a)univ-grenoble-alpes.fr
Due to several requests, we have extended the submission deadline to 4
September 2022.
Final call for papers
Third workshop on Resources for African Indigenous Languages (RAIL)
https://bit.ly/rail2022
The South African Centre for Digital Language Resources (SADiLaR) is
organising the 3rd RAIL workshop in the field of Resources for African
Indigenous Languages. This workshop aims to bring together researchers
who are interested in showcasing their research and thereby boosting
the field of African indigenous languages. The workshop provides an
overview of the current state of the art and emphasizes the availability
of African indigenous language resources, including both data and tools.
Additionally, it will allow for information sharing among researchers
interested in African indigenous languages and also start discussions
on improving the quality and availability of the resources. Many
African indigenous languages currently have no or very limited
resources available and, additionally, they are often structurally
quite different from more well-resourced languages, requiring the
development and use of specialized techniques. By bringing together
researchers from different fields (e.g., (computational) linguistics,
sociolinguistics, language technology) to discuss the development of
language resources for African indigenous languages, we hope to boost
research in this field.
The RAIL workshop is an interdisciplinary platform for researchers
working on resources (data collections, tools, etc.) specifically
targeted towards African indigenous languages. It aims to create the
conditions for the emergence of a scientific community of practice that
focuses on data, as well as tools, specifically designed for or applied
to indigenous languages found in Africa.
Suggested topics include the following:
* Digital representations of linguistic structures
* Descriptions of corpora or other data sets of African indigenous
languages
* Building resources for (under resourced) African indigenous languages
* Developing and using African indigenous languages in the digital age
* Effectiveness of digital technologies for the development of African
indigenous languages
* Revealing unknown or unpublished existing resources for African
indigenous languages
* Developing desired resources for African indigenous languages
* Improving quality, availability and accessibility of African
indigenous language resources
The 3rd RAIL workshop 2022 will be co-located with the 10th Southern
African Microlinguistics Workshop
(https://sites.google.com/nwulettere.co.za/samwop-10/home). This will
be an in-person event located in Potchefstroom, South Africa.
Registration will be free.
RAIL 2022 submission requirements:
* RAIL asks for full papers from 4 pages to 8 pages (plus more pages
for references if needed), which must strictly follow the Journal of
the Digital Humanities Association of Southern Africa style guide
(https://upjournals.up.ac.za/index.php/dhasa/libraryFiles/downloadPublic/30).
* Accepted submissions will be published in JDHASA, the Journal of the
Digital Humanities Association of Southern Africa
(https://upjournals.up.ac.za/index.php/dhasa/).
* Papers will be double blind peer-reviewed and must be submitted
through EasyChair (https://easychair.org/my/conference?conf=rail2022).
Important dates
Submission deadline: 4 September 2022
Date of notification: 30 September 2022
Camera ready copy deadline: 23 October 2022
RAIL: 30 November 2022, North-West University - Potchefstroom
SAMWOP: 1 – 3 December 2022, North-West University - Potchefstroom
Organising Committee
Jessica Mabaso
Rooweither Mabuya
Muzi Matfunjwa
Mmasibidi Setaka
Menno van Zaanen
South African Centre for Digital Language Resources (SADiLaR), South
Africa
--
Prof Menno van Zaanen menno.vanzaanen(a)nwu.ac.za
Professor in Digital Humanities
South African Centre for Digital Language Resources
https://www.sadilar.org
________________________________
NWU CORONA VIRUS:
http://www.nwu.ac.za/coronavirus/
NWU PRIVACY STATEMENT:
http://www.nwu.ac.za/it/gov-man/disclaimer.html
DISCLAIMER: This e-mail message and attachments thereto are intended solely for the recipient(s) and may contain confidential and privileged information. Any unauthorised review, use, disclosure, or distribution is prohibited. If you have received the e-mail by mistake, please contact the sender or reply e-mail and delete the e-mail and its attachments (where appropriate) from your system.
________________________________
Final call for papers
Third workshop on Resources for African Indigenous Languages (RAIL)
https://bit.ly/rail2022
Important dates
Submission deadline: 28 August 2022
Date of notification: 30 September 2022
Camera ready copy deadline: 23 October 2022
RAIL: 30 November 2022, North-West University - Potchefstroom
SAMWOP: 1 – 3 December 2022, North-West University - Potchefstroom
Hi
Some colleagues organizing this hackathon asked if I could help to broadcast it. Hence this message!
Best
Laurent Besacier
SLT-CODE Hackathon Announcement
Have you ever asked yourself how your smartphone recognizes what you say and who you are?
Have you ever thought about how machines recognize different languages?
If so, join us for a two-day speech and language technology hackathon. We will answer these questions and build fantastic systems with the guidance of top language and speech scientists in a collaborative environment.
The two-day speech and language technology hackathon will take place during the IEEE Spoken Language Technology (SLT) Workshop in Doha, Qatar, on January 7th and 8th, 2023. This year's Hackathon will be inspiring, momentous, and fun. The goal is to build a diverse community of people who want to explore and envision how machines understand the world's spoken languages.
During the Hackathon, you will be exposed (but not limited) to speech and language toolkits like ESPnet, SpeechBrain, K2/Kaldi, Hugging Face, TorchAudio, or commercial APIs like Amazon Lex, etc., and you will get hands-on experience with this technology.
At the end of the Hackathon, every team will share their findings with the rest of the participants. Selected projects will have the opportunity to be presented at the SLT workshop.
The Hackathon will be at the Qatar Computing Research Institute (QCRI) in Doha, Qatar (GMT+3). In-person participation is preferred; however, remote participation is possible by joining a team with at least one person being local.
More information on how to apply and important dates is available on our website: https://slt2022.org/hackathon.php
Interested? Apply here: https://forms.gle/a2droYbD4qset8ii9
The deadline for registration is September 30th, 2022.
If you have immediate questions, don't hesitate to contact our hackathon chairs directly at hackathon.slt2022(a)gmail.com.
Third call for papers
Third workshop on Resources for African Indigenous Languages (RAIL)
https://bit.ly/rail2022
Important dates
Submission deadline: 28 August 2022
Date of notification: 30 September 2022
Camera ready copy deadline: 23 October 2022
RAIL: 30 November 2022, North-West University - Potchefstroom
SAMWOP: 1 – 3 December 2022, North-West University - Potchefstroom
FYI
> Begin forwarded message:
>
> From: Menno Van Zaanen <Menno.VanZaanen(a)nwu.ac.za>
> Subject: [Corpora-List] Job opening: Computational linguist
> Date: 19 July 2022 08:43:38 CEST
> To: "corpora(a)list.elra.info" <corpora(a)list.elra.info>
>
> Computational Linguist
>
> Purpose of the position:
> As a Computational Linguist at the South African Centre for Digital
> Language Resources (SADiLaR) you will have the opportunity to initiate
> and lead Human Language Technology and Digital Humanities projects
> stemming from your own research interests. You will work closely with a
> team of researchers as part of SADiLaR’s extended network, both on your
> own and commissioned projects. Dissemination of project results at
> national and international conferences will be encouraged and
> supported.
>
> This position is crucial for research and development in Human Language
> Technology and Digital Humanities, fields that form the essence of
> SADiLaR, which is a national Research Infrastructure supported by the
> Department of Science and Innovation.
>
> Minimum Requirements
> * PhD in one of the following fields: Computational Linguistics,
> Natural Language Processing, General Linguistics, Human Language
> Technology, Digital Humanities, Computer Science, Information
> Technology, Artificial Intelligence or related fields with a focus on
> computational aspects of linguistics.
> * Applicable experience in the use of Python (recommended). Other
> programming languages used within the computational linguistics domain
> can also be considered.
> * Experience as a supervisor/co-supervisor of students or playing a
> mentorship/supervising role for individuals.
> * Evidence of peer-reviewed academic publications.
> * Advanced computer literacy.
>
> Other competency requirements
> * Ability to work independently or as part of a team.
> * Ability to effectively liaise and communicate with public, students,
> colleagues, and other stakeholders at various levels and from diverse
> backgrounds.
> * Demonstration of language proficiency in order to function optimally
> in the various multilingual environments of SADiLaR.
>
> Recommendations:
> * Experience with writing research reports.
> * Ability to lead research projects.
> * Evidence of acquiring research funding.
> * Experience with using and/or developing computational tools.
> * Experience related to research within the domain of Language
> Technology or Digital Humanities.
> * Experience in the presentation of research-based results at national
> and international conferences.
> * Experience related to teaching within the domain of Language
> Technology or Digital Humanities.
> * Strong interest in the advancement of under-resourced South African
> languages.
>
> Responsibilities:
> * Research in the area of Human Language Technology and Digital
> Humanities.
> * Teaching in the area of Human Language Technology and Digital
> Humanities.
> * Initiating and leading Human Language Technology and Digital
> Humanities projects.
> * Mentorship of researchers in the field of Computational Linguistics
> and Digital Humanities.
>
> ENQUIRIES:
> Prof Menno van Zaanen, menno.vanzaanen(a)nwu.ac.za
>
> CLOSING DATE:
> 29 July 2022
>
> COMMENCEMENT OF DUTIES:
> As soon as possible
>
> TO APPLY:
> https://bit.ly/3yQqcnd
> --
> Prof Menno van Zaanen menno.vanzaanen(a)nwu.ac.za
> Professor in Digital Humanities
> South African Centre for Digital Language Resources
> https://www.sadilar.org
> ________________________________
> NWU CORONA VIRUS:
> http://www.nwu.ac.za/coronavirus/
>
> NWU PRIVACY STATEMENT:
> http://www.nwu.ac.za/it/gov-man/disclaimer.html
>
> DISCLAIMER: This e-mail message and attachments thereto are intended solely for the recipient(s) and may contain confidential and privileged information. Any unauthorised review, use, disclosure, or distribution is prohibited. If you have received the e-mail by mistake, please contact the sender or reply e-mail and delete the e-mail and its attachments (where appropriate) from your system.
> ________________________________
> _______________________________________________
> Corpora mailing list -- corpora(a)list.elra.info
> https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/
> To unsubscribe send an email to corpora-leave(a)list.elra.info
FYI.
> Begin forwarded message:
>
> From: Menno Van Zaanen <Menno.VanZaanen(a)nwu.ac.za>
> Subject: [Corpora-List] 2nd CfP Third workshop on Resources for African Indigenous Language (RAIL)
> Date: 19 July 2022 09:07:30 CEST
> To: "corpora(a)list.elra.info" <corpora(a)list.elra.info>
>
>
> Second call for papers
>
> Third workshop on Resources for African Indigenous Languages (RAIL)
> https://bit.ly/rail2022
>
>
> The South African Centre for Digital Language Resources (SADiLaR) is
> organising the 3rd RAIL workshop in the field of Resources for African
> Indigenous Languages. This workshop aims to bring together researchers
> who are interested in showcasing their research and thereby boosting
> the field of African indigenous languages. This provides an overview of
> the current state of the art and emphasizes the availability of African
> indigenous language resources, including both data and tools.
> Additionally, it will allow for information sharing among researchers
> interested in African indigenous languages and also start discussions
> on improving the quality and availability of the resources. Many
> African indigenous languages currently have no or very limited
> resources available and, additionally, they are often structurally
> quite different from more well-resourced languages, requiring the
> development and use of specialized techniques. By bringing together
> researchers from different fields (e.g., (computational) linguistics,
> sociolinguistics, language technology) to discuss the development of
> language resources for African indigenous languages, we hope to boost
> research in this field.
>
> The RAIL workshop is an interdisciplinary platform for researchers
> working on resources (data collections, tools, etc.) specifically
> targeted towards African indigenous languages. It aims to create the
> conditions for the emergence of a scientific community of practice that
> focuses on data, as well as tools, specifically designed for or applied
> to indigenous languages found in Africa.
>
> Suggested topics include the following:
> * Digital representations of linguistic structures
> * Descriptions of corpora or other data sets of African indigenous
> languages
> * Building resources for (under-resourced) African indigenous languages
> * Developing and using African indigenous languages in the digital age
> * Effectiveness of digital technologies for the development of African
> indigenous languages
> * Revealing unknown or unpublished existing resources for African
> indigenous languages
> * Developing desired resources for African indigenous languages
> * Improving quality, availability and accessibility of African
> indigenous language resources
>
>
> The 3rd RAIL workshop 2022 will be co-located with the 10th Southern
> African Microlinguistics Workshop (
> https://sites.google.com/nwulettere.co.za/samwop-10/home). This will be
> an in-person event located in Potchefstroom, South Africa. Registration
> will be free.
>
> RAIL 2022 submission requirements:
> * RAIL asks for full papers from 4 pages to 8 pages (plus more pages
> for references if needed), which must strictly follow the Journal of
> the Digital Humanities Association of Southern Africa style guide (
> https://upjournals.up.ac.za/index.php/dhasa/libraryFiles/downloadPublic/30
> ).
> * Accepted submissions will be published in JDHASA, the Journal of the
> Digital Humanities Association of Southern Africa (
> https://upjournals.up.ac.za/index.php/dhasa/).
> * Papers will be double blind peer-reviewed and must be submitted
> through EasyChair (https://easychair.org/my/conference?conf=rail2022).
>
> Important dates
> Submission deadline: 28 August 2022
> Date of notification: 30 September 2022
> Camera ready copy deadline: 23 October 2022
> RAIL: 30 November 2022, North-West University - Potchefstroom
> SAMWOP: 1 – 3 December 2022, North-West University - Potchefstroom
>
>
> Organising Committee
> Jessica Mabaso
> Rooweither Mabuya
> Muzi Matfunjwa
> Mmasibidi Setaka
> Menno van Zaanen
>
> South African Centre for Digital Language Resources (SADiLaR), South
> Africa
>
> --
> Prof Menno van Zaanen menno.vanzaanen(a)nwu.ac.za
> Professor in Digital Humanities
> South African Centre for Digital Language Resources
> https://www.sadilar.org
Dear All,
A two-day speech and language technology hackathon will
take place during the IEEE Spoken Language Technology (SLT) Workshop in
Doha, Qatar, on January 7th and 8th, 2023. This year's Hackathon will be
inspiring, momentous, and fun. The goal is to build a diverse community
of people who want to explore and envision how machines understand the
world's spoken languages.
More details can be found here: https://slt2022.org/hackathon.php
Sincerely yours,
Sakriani Sakti
PhD Position: Naver Labs Europe (France) and FBK Trento (Italy), starting Nov 2022
Have you recently completed, or do you expect to complete very soon, an MSc or equivalent degree in computer science, artificial intelligence, computational linguistics, engineering, or a related area? Are you interested in carrying out research on Speech-to-Speech Translation during the next few years? Are you excited to spend a part of your life in two pleasant alpine cities in France (Grenoble) and Italy (Trento)?
WE ARE LOOKING FOR YOU!!!
The Machine Translation (MT) group at Fondazione Bruno Kessler (Trento, Italy), in conjunction with Naver Labs Europe (Grenoble, France), is pleased to announce the availability of the following fully-funded Ph.D. position at the Doctorate Program in Industrial Innovation of the University of Trento and Fondazione Bruno Kessler.
PhD topic: Unified Foundation models for Speech-to-Speech Translation
Application deadline: August 23rd.
More details here: http://tinyurl.com/PhD-FBK-NLE
=====
Laurent Besacier
Dear SIGUL list members,
we are happy to inform you that the SIGUL2022 Workshop Proceedings are
available for download:
http://www.lrec-conf.org/proceedings/lrec2022/workshops/SIGUL/2022.sigul-1.…
The individual papers can be found as well on the workshop program page,
where we are also making available the slides and posters that were used
during the presentations: https://sigul-2022.ilc.cnr.it/programme/
SIGUL2022 was held on 24 and 25 June in Marseille,
co-located with LREC2022. It featured 27 papers addressing a vast array
of topics and covering 76 different languages from Africa, the Americas,
Asia, and Europe.
We are very thankful to all the authors, participants, invited speakers,
chairs, panelists, local organisers and program committee members for
contributing to a very successful event.
All the best,
Claudia, Maite, Sakti (SIGUL2022 Co-chairs)
--
Claudia Soria
Researcher
Istituto di Linguistica Computazionale "A. Zampolli"
Consiglio Nazionale delle Ricerche
Via Moruzzi 1
56124 Pisa
Italy
Management Committee member
COST Action CA19102 'Language In The Human-Machine Era' (LITHME)
www.lithme.eu
Tel. +39 050 3153166
Skype clausor
Dear colleagues,
My team and I are thinking of approaching a Bolivian community we have
collaborated with in the past about potentially building SLT tools and/or a
dataset with them. One of our research projects requires the creation of a
TTS system, so we think it would be important to couch this research goal
within a collaborative research project that takes into account the
community's own goals and needs.
This is the first time I have done anything like this, and I'm sorry if my
question is very naïve: Do you have materials you'd recommend for us to
read, such as:
- information often provided to aboriginal communities about this kind
of effort
- information about how other communities have set up a payment scheme
- information about variable terms in licensing; e.g., if the community
does not want commercial reuse, is that OK with the LDC? Are there any other
restrictions communities often ask for? Any other rights, such as royalties
in case of commercialization, or free access to the software?
Please reply to me alone. I'll compile all replies and share back the full
list of resources with the mailing list.
Thank you in advance,
Alex
---------------------------------------------------------------
Alex (Alejandrina) Cristia
Researcher, CNRS
Laboratoire de Sciences Cognitives et Psycholinguistique
29, rue d'Ulm, 75005, Paris, FRANCE
My site: www.acristia.org
---------------------------------------------------------------
If you donate, ask me about effective charities
<https://effectivealtruism.us8.list-manage.com/track/click?u=52b028e7f799cca…>.
/ Si vous faites des dons, demandez moi sur le don efficace
<https://www.altruismeefficacefrance.org/guide-don-efficace-1/>.
Dear colleagues,
A fascinating opportunity for those working on languages that are
inflectional! Read below & contact Ben Ambridge, in cc, for any questions.
-Alex
---------- Forwarded message ---------
From: Ben Ambridge <ben.ambridge(a)manchester.ac.uk>
Date: Thu, Apr 28, 2022 at 10:15 PM
Subject: Fwd: Crosslinguistic morphology experiments - call for collaborators
To: Alex CRISTIA <alecristia(a)gmail.com>
Hi Alex - I know you’ve worked on quite a few hard-to-reach languages -
would you be interested in this, or able to point me in the direction of
others who might be?
Thanks
Ben
===
Dear colleagues, we are seeking potential collaborators for a grant
application for a large crosslinguistic project investigating children’s
acquisition of inflectional morphology. We aim to include 100
typologically-diverse languages. Due to the size of the envisaged project,
it would not be feasible to apply for funding for full-time research
assistants to test children (or to fund a portion of each collaborator’s
salary). Our intention for the grant application is that each collaborator
will be able to claim up to €10,000 for expenses (e.g., travel, laptops,
participant payments, part-time/casual researchers), with the data
collected by a researcher who is already primarily sponsored/employed
(e.g., as PhD student, postdoc or research assistant) at your institution.
We will provide computerized elicitation tasks; your role (with the help of
full-time research and support staff employed at our end) would be to
translate the task into your language and inflectional system and to
supervise data collection (with children aged 3-6, and adults). At the
moment, our goal is simply to put together a list of *potential*
collaborators+languages for the grant application (NB: we can include only
languages with verb and/or noun person/case/number inflectional
morphology). To be included on this provisional list, please email
Ben.Ambridge(a)Manchester.ac.uk with your name, institution and language(s).
****Apologies for cross-postings****
Call for Papers
SIGUL 2022 Workshop <https://sigul-2022.ilc.cnr.it/>
a post-Conference Workshop of LREC 2022
Marseille (FR), 24-25 June 2022
*EXTENDED paper submission deadline: 19 April 2022*
The 1st Annual Meeting of the ELRA/ISCA Special Interest Group on Under-Resourced Languages (SIGUL 2022) will provide a forum for the presentation and discussion of cutting-edge research in text and speech processing for under-resourced languages by academic and industry researchers. SIGUL 2022 will carry on the tradition of the CCURL-SLTU (Collaboration and Computing for Under-Resourced Languages – Spoken Language Technologies for Under-resourced languages) Workshop Series, which has been organised since 2008 and, as LREC Workshops, since 2014. As usual, this Workshop spans the research interest areas of less-resourced, under-resourced, endangered, minority and minoritized languages. Since this year LREC includes a track dedicated specifically to endangered and less-resourced languages, the workshop aims to be a venue for networking and discussion as much as for scientific debate.
Over the last years, research in NLP for less-resourced languages has gained momentum. The multiplication of research interest makes it even more necessary for the community that revolves around less-resourced languages to find opportunities for aggregation and discussion. Following the long-standing series of previous meetings, the SIGUL venue will provide a forum for the presentation of cutting-edge research in NLP, MT and Speech Technologies for under-resourced languages to both academic and industry researchers, and will also offer a venue where researchers in different disciplines and from varied backgrounds can fruitfully explore new areas of intellectual and practical development while honouring their common interest of sustaining less-resourced languages.
Topics include but are not limited to:
- General research on under-resourced languages.
- Transfer-learning techniques for under-resourced languages (use of multilingual, pretrained models, unsupervised, semi-supervised, zero-shot, few-shot training,...) in NLP, MT and Speech technologies.
- We also invite position papers on methodological, ethical, or institutional issues.
Instructions for submission can be found here <https://sigul-2022.ilc.cnr.it/submission/>
Important Dates
- Paper submission deadline: *19* April 2022
- Notification of acceptance: 3 May 2022
- Camera-ready paper: 23 May 2022
- Workshop date: 24-25 June 2022
Organizing Committee
Maite Melero - Barcelona Supercomputing Centre, Spain
Sakriani Sakti - NAIST, Japan
Claudia Soria - CNR-ILC, Italy
To contact the organisers, please mail sigul2022(a)ilc.cnr.it (Subject: [SIGUL2022]).
>
>
> Research internship position at NAVER LABS Europe (Grenoble, France) on Energy-Based Models for Controlled Text Generation
>
> Start date: June 2022
> Duration: 5-6 months
>
> DESCRIPTION
> Large language models can now be used to generate highly fluent texts. However, the synthesized utterances can be deficient in other important respects: they may lack semantic consistency or faithfulness to the facts, or contain toxic or socially biased content.
>
> Our team has developed several effective solutions on that front [1,2,3,4] exploiting the expressive power of Energy-Based Models in defining constraints over generative models. However, certain challenges remain: (1) How can we quickly adapt to changing control conditions without the need for model retraining? (2) Can we exploit these techniques to improve on hard-to-quantify features, such as safety, unbiasedness, textual coherence, or matching the human intention? (3) Can we improve training speed/robustness, for example, by leveraging techniques from RL?
>
> We are looking for a motivated intern to help us develop techniques and algorithms addressing these challenges. Experiments will be conducted on selected text generation tasks using state-of-the-art pre-trained language models.
>
> The successful candidate should be enrolled in a graduate program, at the Master or (preferably) PhD level.
>
> The intern will work in a team consisting of Hady Elsahar, Marc Dymetman, Germán Kruszewski, and Jos Rozen.
>
> Publication of this internship's results in major conferences/journals will be strongly encouraged.
>
> REQUIRED SKILLS
> - Strong programming skills
> - Relevant experience with training Deep Learning models for NLP
> - Strong mathematical skills
> - Ability to communicate research
>
> OPTIONAL SKILLS
> - Knowledge of MCMC sampling techniques and/or Reinforcement Learning
> - Publications at peer-reviewed AI conferences
>
> REFERENCES
> [1] Khalifa et al., A Distributional Approach to Controlled Text Generation, in ICLR 2021
> [2] Eikema et al., Sampling from Energy-Based Models with Quality/Efficiency Trade-offs, in CtrlGen at NeurIPS 2021
> [3] Korbak et al., Energy-Based Models for Code Generation under Compilability Constraints, in NLP4Prog at ACL 2021
> [4] Korbak et al., Controlling Conditional Language Models with Distributional Policy Gradients, in CtrlGen at NeurIPS 2021
>
> APPLICATION INSTRUCTIONS
> Please note that applicants must be registered students at a university or other academic institution and that this establishment will need to sign an 'Internship Convention' with NAVER LABS Europe before the student is accepted.
>
> You can apply for this position online at <https://europe.naverlabs.com/job/energy-based-models-for-controlled-text-ge…>. Don't forget to upload your CV and cover letter before you submit. Incomplete applications will not be accepted.
>
> ABOUT NAVER LABS
> NAVER is the #1 Internet portal in Korea with activities that span a wide range of businesses including search, commerce, content, financial and cloud platforms.
>
> NAVER LABS, co-located in Korea and France, is the organization dedicated to preparing NAVER’s future. NAVER LABS Europe is located in a spectacular setting in Grenoble, in the heart of the French Alps. Scientists at NAVER LABS Europe are empowered to pursue long-term research problems that, if successful, can have significant impact and transform NAVER. We take our ideas as far as research can to create the best technology of its kind. Active participation in the academic community and collaborations with world-class public research groups are, among others, important tools to achieve these goals. Teamwork, focus and persistence are important values for us.
>
> NAVER LABS Europe is an equal opportunity employer.
>
> For more information and to apply, see <https://europe.naverlabs.com/job/energy-based-models-for-controlled-text-ge…>
****Apologies for cross-postings****
***Please help disseminate****
1st Call for Papers
SIGUL 2022 Workshop <https://sigul-2022.ilc.cnr.it/>
a post-Conference Workshop of LREC 2022
Marseille (FR), 24-25 June 2022
*paper submission deadline: 11 April 2022*
The 1st Annual Meeting of the ELRA/ISCA Special Interest Group on
Under-Resourced Languages (SIGUL 2022) will provide a forum for the
presentation and discussion of cutting-edge research in text and speech
processing for under-resourced languages by academic and industry
researchers. SIGUL 2022 will carry on the tradition of the CCURL-SLTU
(Collaboration and Computing for Under-Resourced Languages – Spoken
Language Technologies for Under-resourced languages) Workshop Series,
which has been organised since 2008 and, as LREC Workshops, since 2014.
As usual, this Workshop spans the research interest areas of
less-resourced, under-resourced, endangered, minority and minoritized
languages. Since this year LREC includes a track dedicated specifically
to endangered and less-resourced languages, the workshop aims to be a
venue for networking and discussion as much as for scientific debate.
Over the last years, research in NLP for less-resourced languages has
gained momentum. The multiplication of research interest makes it even
more necessary for the community that revolves around less-resourced
languages to find opportunities for aggregation and discussion.
Following the long-standing series of previous meetings, the SIGUL venue
will provide a forum for the presentation of cutting-edge research in
NLP, MT and Speech Technologies for under-resourced languages to both
academic and industry researchers, and will also offer a venue where
researchers in different disciplines and from varied backgrounds can
fruitfully explore new areas of intellectual and practical development
while honouring their common interest of sustaining less-resourced
languages.
Topics include but are not limited to:
* General research on under-resourced languages.
* Transfer-learning techniques for under-resourced languages (use of
  multilingual, pretrained models, unsupervised, semi-supervised,
  zero-shot, few-shot training,...) in NLP, MT and Speech technologies.
* We also invite position papers on methodological, ethical, or
  institutional issues.
Instructions for submission can be found here
<https://sigul-2022.ilc.cnr.it/submission/>
Important Dates
- Paper submission deadline: 11 April 2022
- Notification of acceptance: 3 May 2022
- Camera-ready paper: 23 May 2022
- Workshop date: 24-25 June 2022
Organizing Committee
* Maite Melero - Barcelona Supercomputing Centre, Spain
* Sakriani Sakti - NAIST, Japan
* Claudia Soria - CNR-ILC, Italy
To contact the organisers, please mail sigul2022(a)ilc.cnr.it
(Subject: [SIGUL2022]).
To kick off the International Decade of Indigenous Languages 2022-32, Linguapax will present the 2021 Linguapax Review special issue on Language Technologies and Language Diversity.
The event will take place online on 9 March 2022, at 6 pm CET, via Zoom.
During the presentation, authors of the 2021 Linguapax Review will participate in a live debate. We are proud to be joined by:
Andras Kornai <https://www.linkedin.com/in/ACoAAAAB31oBkjnx7uquXtV7tM7-w2lGSsKxbdw>, advisor at the Hungarian Academy of Sciences and author of "Digital Language Death"
Daniel Pimienta <https://www.linkedin.com/in/ACoAAABDoo8BjE6560HaeAmfgcrAOHIqHATliMg>, mathematician, head of the Observatory of Linguistic and Cultural Diversity on the Internet
Tunde Adegbola <https://www.linkedin.com/in/ACoAAABR2W0B-E35vkfUZzAJdeTL_awNeQq0oDM>, scientist, musician, engineer, linguist and culture activist, founder of African Languages Technology Initiative (Alt-i)
Roland Kuhn <https://nrc.canada.ca/en/corporate/contact-us/nrc-directory-science-profess…>, PRO at National Research Council Canada and leader of the Indigenous Languages Technology project
Eddie Avila <https://www.linkedin.com/in/ACoAAABGd_8BCQSXFxlgDkk4uf8jWhru9V_AbUA>, director of Rising Voices, an initiative to support peer networks of indigenous language digital activists in Latin America
Subhashish P. Panigrahi <https://www.linkedin.com/in/ACoAAANsOmkB2pSPSzrrS0zGo0pKhFCGXY8I80s>, National Geographic Explorer and documentary filmmaker, founder of OpenSpeaks, a project for documenting indigenous and endangered languages
The debate will be moderated by Maite Melero <https://www.linkedin.com/in/ACoAAAK3E2UBEPSieDYF2HzmjZpjuAL1A-7E5tQ>, coordinator of the special issue, and will address questions such as:
Why should we - all of us - care about linguistic diversity?
Are new technologies a threat or an opportunity for endangered languages?
What are the keys for effective language digital activism?
We will open this interesting debate to the audience.
Participation is free but registration is required at https://lnkd.in/eZMf-QcV