Hello,
Can anyone point me to corpora of language learner speech or written text that are labeled by CEFR proficiency level? Any languages are useful!
Thanks in advance
LUCIA
Lucia Donatelli, Ph.D. Department of Language Science and Technology Saarland University http://luciadonatelli.georgetown.domains
----- Mail original ----- De: "Lucia Donatelli via Corpora" corpora@list.elra.info À: corpora@list.elra.info Envoyé: Vendredi 11 Novembre 2022 12:00:01 Objet: [Corpora-List] CEFR language learner corpora?
Hello,
Can anyone point me to corpora of language learner speech or written text that are labeled by CEFR proficiency level? Any languages are useful!
Thanks in advance
LUCIA
Lucia Donatelli, Ph.D. Department of Language Science and Technology Saarland University http://luciadonatelli.george town.domains
_______________________________________________ Corpora mailing list -- corpora@list.elra.info https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/ To unsubscribe send an email to corpora-leave@list.elra.info
Hi International Corpus of Learner Finnish, more info available here: https://metashare.csc.fi/repository/browse/international-corpus-of-learner-f...
Best, Jarmo
Jarmo Harri Jantunen Professor Department of Language and Communication Studies P.O. Box 35 40014 University of Jyväskylä Finland
Jantunen, J. H., Ainiala, T., Jokela, S., & Tarvainen, J. (2022). Mapping Digital Discourses of the Capital Region of Finland : Combining Onomastics, CADS, and GIS. Names, 70(1), 20-39. https://doi.org/10.5195/names.2022.2289 Jantunen, J. H. & Kytölä, S. (2022). Online discourses of ‘homosexuality’ and religion. The discussion relating to Islam in Finland. Journal of Language and Sexualityhttps://www.academia.edu/71161967/Online_discourses_of_homosexuality_and_religion_The_discussion_relating_to_Islam_in_Finland. Jantunen, J. H., & Juvonen, T. (2021). Lesbonormatiivisuuksien ristipaineessa : määrällistä ja laadullista analyysiä Suomi24-verkkokeskusteluista. SQS : Suomen Queer-tutkimuksen Seuran lehti, 15(1-2), 17-36. https://doi.org/10.23980/sqs.112512
Lähettäjä: Eva Schaeffer-Lacroix via Corpora corpora@list.elra.info Päivämäärä: tiistaina, 22. marraskuuta 2022 klo 16.26 Vastaanottaja: Lucia Donatelli donatelli@coli.uni-saarland.de Kopio: corpora@list.elra.info corpora@list.elra.info Aihe: [Corpora-List] Re: CEFR language learner corpora? https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fmerlin-pla...
----- Mail original ----- De: "Lucia Donatelli via Corpora" corpora@list.elra.info À: corpora@list.elra.info Envoyé: Vendredi 11 Novembre 2022 12:00:01 Objet: [Corpora-List] CEFR language learner corpora?
Hello,
Can anyone point me to corpora of language learner speech or written text that are labeled by CEFR proficiency level? Any languages are useful!
Thanks in advance
LUCIA
Lucia Donatelli, Ph.D. Department of Language Science and Technology Saarland University https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fluciadonate... town.domains
_______________________________________________ Corpora mailing list -- corpora@list.elra.info https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Flist.elra.... To unsubscribe send an email to corpora-leave@list.elra.info
--
Eva Schaeffer-Lacroix Maîtresse de conférences HDR https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Forcid.org%...
https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fdidaktik.ha... Tél. : 06 64 68 21 92 _______________________________________________ Corpora mailing list -- corpora@list.elra.info https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Flist.elra.... To unsubscribe send an email to corpora-leave@list.elra.info
Hi:
CAES: Corpus de aprendices de español (Spanish learners)
Best regards,
Mario
On 11 Nov 2022, at 12:00, Lucia Donatelli via Corpora corpora@list.elra.info wrote:
Hello,
Can anyone point me to corpora of language learner speech or written text that are labeled by CEFR proficiency level? Any languages are useful!
Thanks in advance
LUCIA
Lucia Donatelli, Ph.D. Department of Language Science and Technology Saarland University http://luciadonatelli.georgetown.domains _______________________________________________ Corpora mailing list -- corpora@list.elra.info https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/ To unsubscribe send an email to corpora-leave@list.elra.info
-- Mario Barcala CEO at NLPgo http://www.nlpgo.com
MERLIN corpus German, Czech and Italian as target languages https://aclanthology.org/L14-1488/
https://merlin-platform.eu/index.php
Best, Elisa
Il giorno 22 nov 2022, alle ore 16:15, Mario Barcala via Corpora corpora@list.elra.info ha scritto:
Hi:
CAES: Corpus de aprendices de español (Spanish learners)
https://www.google.com/url?q=https://galvan.usc.es/caes&source=gmail-ima...
Best regards,
Mario
On 11 Nov 2022, at 12:00, Lucia Donatelli via Corpora corpora@list.elra.info wrote:
Hello,
Can anyone point me to corpora of language learner speech or written text that are labeled by CEFR proficiency level? Any languages are useful!
Thanks in advance
LUCIA
Lucia Donatelli, Ph.D. Department of Language Science and Technology Saarland University https://www.google.com/url?q=http://luciadonatelli.georgetown.domains&so... _______________________________________________ Corpora mailing list -- corpora@list.elra.info https://www.google.com/url?q=https://list.elra.info/mailman3/postorius/lists... To unsubscribe send an email to corpora-leave@list.elra.info
-- Mario Barcala CEO at NLPgo https://www.google.com/url?q=http://www.nlpgo.com&source=gmail-imap&...
Corpora mailing list -- corpora@list.elra.info https://www.google.com/url?q=https://list.elra.info/mailman3/postorius/lists... To unsubscribe send an email to corpora-leave@list.elra.info
Hi there,
Funded by the Spanish Ministry of Science and Innovation, we are compiling the FineDesc Corpus, composed of the successful written production at B1, B2 and C1 in the high-stakes examination CertAcles, as rated by two independent trained raters. The students' L1 is Spanish or any co-official language in Spain.
Further info at http://web.ujaen.es/investiga/finedesc/index.php
Best,
Belén
El mar, 22 nov 2022 a las 16:20, Elisa Di Nuovo via Corpora (< corpora@list.elra.info>) escribió:
MERLIN corpus German, Czech and Italian as target languages https://aclanthology.org/L14-1488/
https://merlin-platform.eu/index.php
Best, Elisa
Il giorno 22 nov 2022, alle ore 16:15, Mario Barcala via Corpora < corpora@list.elra.info> ha scritto:
Hi:
CAES: Corpus de aprendices de español (Spanish learners)
https://www.google.com/url?q=https://galvan.usc.es/caes&source=gmail-ima...
Best regards,
Mario
On 11 Nov 2022, at 12:00, Lucia Donatelli via Corpora < corpora@list.elra.info> wrote:
Hello,
Can anyone point me to corpora of language learner speech or written text that are labeled by CEFR proficiency level? Any languages are useful!
Thanks in advance
LUCIA
Lucia Donatelli, Ph.D.
Department of Language Science and Technology
Saarland University
https://www.google.com/url?q=http://luciadonatelli.georgetown.domains&so...
Corpora mailing list -- corpora@list.elra.info
https://www.google.com/url?q=https://list.elra.info/mailman3/postorius/lists...
To unsubscribe send an email to corpora-leave@list.elra.info
-- Mario Barcala CEO at NLPgo
https://www.google.com/url?q=http://www.nlpgo.com&source=gmail-imap&...
Corpora mailing list -- corpora@list.elra.info
https://www.google.com/url?q=https://list.elra.info/mailman3/postorius/lists... To unsubscribe send an email to corpora-leave@list.elra.info
Corpora mailing list -- corpora@list.elra.info https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/ To unsubscribe send an email to corpora-leave@list.elra.info
Hi there,
*CroLTeC* (CROatian Learner TExt Corpus) contains texts collected from learners of Croatian as a second and foreign language (from beginners – A1 to advanced learners – C1 and higher). http://teitok.clul.ul.pt/croltec/
On Tue, 22 Nov 2022 at 17:21, María Belén Díez Bedmar via Corpora < corpora@list.elra.info> wrote:
Hi there,
Funded by the Spanish Ministry of Science and Innovation, we are compiling the FineDesc Corpus, composed of the successful written production at B1, B2 and C1 in the high-stakes examination CertAcles, as rated by two independent trained raters. The students' L1 is Spanish or any co-official language in Spain.
Further info at http://web.ujaen.es/investiga/finedesc/index.php
Best,
Belén
El mar, 22 nov 2022 a las 16:20, Elisa Di Nuovo via Corpora (< corpora@list.elra.info>) escribió:
MERLIN corpus German, Czech and Italian as target languages https://aclanthology.org/L14-1488/
https://merlin-platform.eu/index.php
Best, Elisa
Il giorno 22 nov 2022, alle ore 16:15, Mario Barcala via Corpora < corpora@list.elra.info> ha scritto:
Hi:
CAES: Corpus de aprendices de español (Spanish learners)
https://www.google.com/url?q=https://galvan.usc.es/caes&source=gmail-ima...
Best regards,
Mario
On 11 Nov 2022, at 12:00, Lucia Donatelli via Corpora < corpora@list.elra.info> wrote:
Hello,
Can anyone point me to corpora of language learner speech or written text that are labeled by CEFR proficiency level? Any languages are useful!
Thanks in advance
LUCIA
Lucia Donatelli, Ph.D.
Department of Language Science and Technology
Saarland University
https://www.google.com/url?q=http://luciadonatelli.georgetown.domains&so...
Corpora mailing list -- corpora@list.elra.info
https://www.google.com/url?q=https://list.elra.info/mailman3/postorius/lists...
To unsubscribe send an email to corpora-leave@list.elra.info
-- Mario Barcala CEO at NLPgo
https://www.google.com/url?q=http://www.nlpgo.com&source=gmail-imap&...
Corpora mailing list -- corpora@list.elra.info
https://www.google.com/url?q=https://list.elra.info/mailman3/postorius/lists... To unsubscribe send an email to corpora-leave@list.elra.info
Corpora mailing list -- corpora@list.elra.info https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/ To unsubscribe send an email to corpora-leave@list.elra.info
Corpora mailing list -- corpora@list.elra.info https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/ To unsubscribe send an email to corpora-leave@list.elra.info
Hi,
There is also COPLE2, a learner corpus of Portuguese, here:
teitok.clul.ul.pt/learnercorpus
Best,
Amalia
________________________________
De: Gaurish Thakkar via Corpora corpora@list.elra.info Enviado: 22 de novembro de 2022 18:15 Para: María Belén Díez Bedmar Cc: Lucia Donatelli; Eva Schaeffer-Lacroix via Corpora Assunto: [Corpora-List] Re: CEFR language learner corpora?
Hi there,
CroLTeC (CROatian Learner TExt Corpus) contains texts collected from learners of Croatian as a second and foreign language (from beginners – A1 to advanced learners – C1 and higher).
http://teitok.clul.ul.pt/croltec/
On Tue, 22 Nov 2022 at 17:21, María Belén Díez Bedmar via Corpora <corpora@list.elra.infomailto:corpora@list.elra.info> wrote: Hi there,
Funded by the Spanish Ministry of Science and Innovation, we are compiling the FineDesc Corpus, composed of the successful written production at B1, B2 and C1 in the high-stakes examination CertAcles, as rated by two independent trained raters. The students' L1 is Spanish or any co-official language in Spain.
Further info at http://web.ujaen.es/investiga/finedesc/index.php
Best,
Belén [https://ci3.googleusercontent.com/mail-sig/AIorK4yIwdt3AfxusA7daiBoBey6COyQa...]
El mar, 22 nov 2022 a las 16:20, Elisa Di Nuovo via Corpora (<corpora@list.elra.infomailto:corpora@list.elra.info>) escribió: MERLIN corpus German, Czech and Italian as target languages https://aclanthology.org/L14-1488/
https://merlin-platform.eu/index.php
Best, Elisa
Il giorno 22 nov 2022, alle ore 16:15, Mario Barcala via Corpora <corpora@list.elra.infomailto:corpora@list.elra.info> ha scritto:
Hi:
CAES: Corpus de aprendices de español (Spanish learners)
https://www.google.com/url?q=https://galvan.usc.es/caes&source=gmail-ima...
Best regards,
Mario
On 11 Nov 2022, at 12:00, Lucia Donatelli via Corpora <corpora@list.elra.infomailto:corpora@list.elra.info> wrote:
Hello,
Can anyone point me to corpora of language learner speech or written text that are labeled by CEFR proficiency level? Any languages are useful!
Thanks in advance
LUCIA
Lucia Donatelli, Ph.D. Department of Language Science and Technology Saarland University https://www.google.com/url?q=http://luciadonatelli.georgetown.domains&so... _______________________________________________ Corpora mailing list -- corpora@list.elra.infomailto:corpora@list.elra.info https://www.google.com/url?q=https://list.elra.info/mailman3/postorius/lists... To unsubscribe send an email to corpora-leave@list.elra.infomailto:corpora-leave@list.elra.info
-- Mario Barcala CEO at NLPgo https://www.google.com/url?q=http://www.nlpgo.com&source=gmail-imap&...
_______________________________________________ Corpora mailing list -- corpora@list.elra.infomailto:corpora@list.elra.info https://www.google.com/url?q=https://list.elra.info/mailman3/postorius/lists... To unsubscribe send an email to corpora-leave@list.elra.infomailto:corpora-leave@list.elra.info _______________________________________________ Corpora mailing list -- corpora@list.elra.infomailto:corpora@list.elra.info https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/ To unsubscribe send an email to corpora-leave@list.elra.infomailto:corpora-leave@list.elra.info _______________________________________________ Corpora mailing list -- corpora@list.elra.infomailto:corpora@list.elra.info https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/ To unsubscribe send an email to corpora-leave@list.elra.infomailto:corpora-leave@list.elra.info
-- Regards: Gaurish P Thakkar
Hi Lucia,
We also made available the data for one of our studies. It’s a (small) corpus with short answers to questions in English and labeled with CEFR levels.
https://github.com/anaistack/cefr-asag-corpus [cefr-asag-corpus.png] anaistack/cefr-asag-corpus: A corpus of short answers written by learners of English and graded with CEFR levelshttps://github.com/anaistack/cefr-asag-corpus github.comhttps://github.com/anaistack/cefr-asag-corpus
All the best, Anaïs
On 11 Nov 2022, at 12:00, Lucia Donatelli via Corpora corpora@list.elra.info wrote:
Hello,
Can anyone point me to corpora of language learner speech or written text that are labeled by CEFR proficiency level? Any languages are useful!
Thanks in advance
LUCIA
Lucia Donatelli, Ph.D. Department of Language Science and Technology Saarland University http://luciadonatelli.georgetown.domainshttp://luciadonatelli.georgetown.domains/ _______________________________________________ Corpora mailing list -- corpora@list.elra.info https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/ To unsubscribe send an email to corpora-leave@list.elra.info