Hello,
Can anyone point me to corpora of language learner speech or written text
that are labeled by CEFR proficiency level? Any languages are useful!
Thanks in advance
LUCIA
Lucia Donatelli, Ph.D.
Department of Language Science and Technology
Saarland University
http://luciadonatelli.georgetown.domains
Dear colleagues,
The Computational Cognition Lab at Open University of Cyprus, and
the Socially-Competent Robotic and Agent Technologies group at CYENS Center of Excellence
are looking to recruit post-docs and research associates for ongoing and upcoming projects on topics related to:
- cognitive computing,
- personal assistants,
- explainable and trustworthy AI,
- machine learning / learning theory,
- preference elicitation,
- neural-symbolic integration,
- conversational AI,
- natural language understanding / generation,
- formal argumentation,
- knowledge-based systems.
Relevant announcements:
1. https://www.ouc.ac.cy/images/files/hr/2022/WeNet_RISE_Researchers_Developer…
2. https://www.cyens.org.cy/en-gb/vacancies/job-listings/research-associates/r…
Interested candidates should apply directly following the procedures in the two links above, by December 5th, 2022.
Regards,
Loizos
The CorpusCALL SIG (Special Interest Group) proudly announces this interesting webinar:
English version:
Don't miss this really interesting webinar with Tanara Zingano Kuhn & Rina Zviel Girshin, 8 December 11am UK time, organised by the CorpusCALL SIG!
Title: Crowdsourcing corpus filtering for pedagogical purpose project: A fruitful partnership between computer science and linguistics
Bionotes and abstract: https://link.infini.fr/abstractbionotes
Register here to get the Zoom link: https://link.infini.fr/sigwebinar
Version française :
Le CorpusCALL SIG vous invite au webinaire du 8 décembre, 12h, avec Tanara Zingano Kuhn & Rina Zviel Girshin.
Titre: Crowdsourcing corpus filtering for pedagogical purpose project: A fruitful partnership between computer science and linguistics
Informations concernant les présentatrices et l'abstract de leur présentation : https://link.infini.fr/abstractbionotes
Inscription gratuite permettant d'obtenir le lien Zoom : https://link.infini.fr/sigwebinar
--
Eva Schaeffer-Lacroix
Maîtresse de conférences HDR
https://orcid.org/0000-0002-6260-9095http://didaktik.hautetfort.com
Tél. : 06 64 68 21 92
The UC Santa Cruz Natural Language Processing (NLP) master's degree program
provides both depth and breadth in core algorithms and methods for NLP.
Taught intensively over 15-18 months, our program design combines
theoretical learning with hands-on practice to ensure our students have the
right skill set to prepare for a professional career in this fast-growing
field. We are accepting applications for Fall 2023 admission consideration,
and will be hosting a series of information sessions about the NLP MS
program over the next few months. To review our information session
schedule, visit https://nlp.ucsc.edu/admissions.
Join us at our next virtual information session on December 1st at 6 PM PST
to learn more about studying NLP at UCSC, and to meet with our current
students and faculty. Please complete the following registration form to
attend the session on December 1st:
https://ucsc.zoom.us/meeting/register/tJIkdeCpqj8qGdWpJAKwUc-CtJTVq1Qp0RBt
Applications for Fall 2023 admission consideration are now open. Apply by
March 1st, 2023: https://applygrad.ucsc.edu/apply/
If you have questions about the program or our upcoming information
session, please contact the NLP Support Team at nlp(a)ucsc.edu.
All the best,
The UCSC NLP Support Team
Natural Language Processing Program <https://nlp.ucsc.edu/>
Baskin Engineering
University of California, Santa Cruz
Dear all,
The University of Arizona seeks multiple tenure-track hires in their School
of Information Science. The successful candidates will have a record of
research in machine learning, natural language processing, and/or
computational social science focusing on misinformation in social media and
social networks to begin in Fall, 2023.
They are especially interested in candidates who are well-versed in big
data computational methodologies, those with interest in academic
leadership roles (e.g., program supervision, student advising), and/or
those who bring a record of working on interdisciplinary/transdisciplinary
funded grant teams.
This position will include teaching responsibilities at both undergraduate
and graduate levels and across online and face to face formats. The
successful applicant will have ideally developed a strong track record of
excellence in teaching and academic citizenry.
I have only just moved here in Sept but I have found the University of
Arizona to be a lively and interdisciplinary community, and Tucson is a
fabulous city.
More details, and how to apply,
https://arizona.csod.com/ux/ats/careersite/4/home/requisition/12223?c=arizo…
Yours,
Heather
--
Dr Heather Froehlich
w // http://hfroehli.ch
t // @heatherfro
Hello,
Corpus CELI (Certificati di Lingua Italiana) and other Italian learner
corpora at this link:
https://www.unistrapg.it/cqpwebnew/
Best,
Stefania
---------- Forwarded message ----------
From: Lucia Donatelli <donatelli(a)coli.uni-saarland.de>
To: corpora(a)list.elra.info
Cc:
Bcc:
Date: Fri, 11 Nov 2022 12:00:01 +0100
Subject: [Corpora-List] CEFR language learner corpora?
Hello,
Can anyone point me to corpora of language learner speech or written text
that are labeled by CEFR proficiency level? Any languages are useful!
Thanks in advance
LUCIA
Lucia Donatelli, Ph.D.
Department of Language Science and Technology
Saarland University
http://luciadonatelli.georgetown.domains
---
*Prof. Stefania Spina*
Full Professor of Linguistics
https://www.researchgate.net/profile/Stefania_Spina2
*La storia non è poi*
*la devastante ruspa che si dice.*
*Lascia sottopassaggi, cripte, buche*
*e nascondigli. C'è chi sopravvive.*
Dear colleagues,
**Apologies for cross-posting**
Please find below information on the third call for papers for the *XIV
International Conference on Corpus, *which will be held in Oviedo 10-12 May
2023. Please, note that proposals can be submitted until the 31st of
December 2022.
*WEB*
https://cilc2023.wordpress.com/
<https://urldefense.com/v3/__https://cilc2023.wordpress.com/__;!!D9dNQwwGXtA…>
*SUBMISSION OF PROPOSALS*
https://old.linguistlist.org/confservices/14CILC.OVIEDO
<https://urldefense.com/v3/__https://old.linguistlist.org/confservices/14CIL…>
*IMPORTANT DATES*
Submission of proposals: 15 October – 31 December 2022.
Notification of acceptance: 15 February 2023.
Early bird registration: 15 February – 15 March 2023.
Late registration: 16 March – 16 April 2023.
*PUBLICATION*
We are pleased to announce hat, in order to promote research on corpus
linguistics, a selection of the papers presented at the conference will
be published in two relevant linguistic journals:
1. *International Journal of English Studies <https://revistas.um.es/ijes>*
(Q1 in SJR)
2. *Research in Corpus Linguistics <https://ricl.aelinco.es/index.php/ricl>*
(Q1 in Dialnet)
*REGISTRATION*
*Early bird registration (from 15 February to 15 March 2023)*
* Registration fee: 120€
* Postgraduate students (certification required): 80€
* Undergraduate students (certification required): 40€
*Late registration* (from 16 March to 16 April)
* Regular registration: 160€
* Postgraduate students (certification required): 120€
* Undergraduate students (certification required): 60€
*CONFERENCE DINNER*
The conference dinner will take place on Thursday 11 May at Tierra Astur
<https://tierra-astur.com/>, a typical Asturian cider house.
The menu price is 50€ per person.
It includes starters, main course, drinks, dessert, coffee and liqueurs.
There will be alternative options for diners with special dietary
requirements.
Further information may be found on the conference website.
Best regards,
The Organising Committee
--
Paula Rodríguez Puente
paula.r.puente(a)gmail.com
http://www.usc-vlcg.es/PRP.htm
The Ubiquitous Knowledge Processing (UKP) Lab and the research group
Trustworthy Human Language Technologies (TrustHLT) at the Department of
Computer Science of the Technical University of Darmstadt, Germany have
four job openings for
**Research scientists (m/f/d)**
in the recently acquired projects funded by the National Research
Center for Applied Cybersecurity ATHENE in the research area of
Security and Privacy in Artificial Intelligence (SenPAI).
These positions will focus on research questions related to privacy-
preserving natural language processing. As part of our work in digital
mental health, the project "Privacy-Aware Domain-Adaptive Medical NLP"
will explore and evaluate different approaches to privacy methods for
dataset creation and deep learning in the medical domain. This project
covers a variety of tasks, from semi-automatic de-identification to
synthetic training data generation and downstream tasks. The latter
require research in large language models for clinical applications in
close collaboration with experts from the psychiatric domain. The
project "Protecting Privacy and Sensitive Information in Texts" touches
the heart of privacy preservation in NLP and will explore a wide range
of domain-agnostic techniques. A solid background in machine learning
for natural language processing is essential, prior knowledge of formal
privacy methods such as differential privacy, or experience in clinical
NLP is a plus. The salary is based on the collective agreement
applicable to the TU Darmstadt (TV-TU Darmstadt). The starting date is
as soon as possible.
## Candidates
The ideal candidate holds a PhD (for a Post-Doc position) or Master
degree (for a PhD position) in computer science, computational
linguistics, machine learning, or a related discipline, has a strong
interest in privacy in natural language processing, excellent
analytical and programming skills, is a team player, and is fluent in
English.
## Diversity
TU Darmstadt is strongly committed to diversity and particularly
welcomes applications from members of underrepresented groups.
Applications from female candidates are highly encouraged.
## Team
The Computer Science Department of the Technical University of
Darmstadt is regularly ranked among the best in Germany. The Ubiquitous
Knowledge Processing Lab (UKP Lab) in Darmstadt led by Prof. Iryna
Gurevych offers an excellent research environment. The UKP Lab is
committed to cutting-edge research, publishing in top-tier venues,
cooperative work style, and close interaction of all team members. See
ukp-lab.de for more details.
TrustHLT is an independent research group led by Dr. Ivan Habernal,
appointed at the Department of Computer Science of the Technical
University of Darmstadt. The group conducts research in the field of
natural language processing with a focus on privacy-preserving
technologies and legal argumentation, see www.trusthlt.org for more
details.
## Application
Please provide us with your CV including an outline of previous work or
research experience, names of two referees, and a letter of motivation
outlining your research interests. Submit your application via the
following form by December 18, 2022:
https://careers.ukp.informatik.tu-darmstadt.de/ukprecruitment (please
enter "ATHENE" as the name of the application position)
Applications arriving after the deadline will still be considered if
the position is not filled yet.
The Natural Language research group in IDEAI <https://ideai.upc.edu/en>
Research center (Universitat Politecnica de Catalunya, Barcelona) is
looking for a engineer to participate in a Spanish-government funded
project to push forward the digital transformation of organizations.
The project includes to main research lines:
a) Advancing business process automation from natural language
descriptions or human-machine interactions
b) Detecting new process automation opportunities from human-human
interactions (mail threads, slack conversations, etc).
The successful candidate will be in charge of the development of tools
to support the linguist team in the curation of data used to fine-tune
Large Language Models (GPT, BLOOM, ...), as well as in the training,
evaluation, and deployment of tuned models.
Find more details about the offer at
https://www.cs.upc.edu/~padro/jobs/engineer-SE.pdf
<https://www.cs.upc.edu/~padro/jobs/engineer-SE.pdf>
Lluís Padró
Universitat Politècnica de Catalunya
The Natural Language research group in IDEAI <https://ideai.upc.edu/en>
Research center (Universitat Politecnica de Catalunya, Barcelona) is
looking for a Postdoc to participate in a Spanish-government funded
project to push forward the digital transformation of organizations.
The project includes to main research lines:
a) Advancing business process automation from natural language
descriptions or human-machine interactions
b) Detecting new process automation opportunities from human-human
interactions (mail threads, slack conversations, etc).
Find more details about the offer at
https://www.cs.upc.edu/~padro/jobs/postdoc-IA.pdf
Lluís Padró
Universitat Politècnica de Catalunya