Data and Documentation
Open Data Policy
FAQ
EN
DE
FR
Suchbegriff
Advanced search
Publication
Back to overview
Using Multilingual Resources to Evaluate CEFRLex for Learner Applications
Type of publication
Peer-reviewed
Publikationsform
Proceedings (peer-reviewed)
Author
Graën Johannes, Alfter David, Schneider Gerold,
Project
From parallel corpora to multilingual exercises - Making use of large text collections and crowdsourcing techniques for innovative autonomous language learning applications
Show all
Proceedings (peer-reviewed)
Page(s)
346 - 355
Title of proceedings
Proceedings of The 12th Language Resources and Evaluation Conference (LREC)
Place
Marseille
Open Access
URL
https://www.aclweb.org/anthology/2020.lrec-1.43
Type of Open Access
Publisher (Gold Open Access)
Abstract
The Common European Framework of Reference for Languages (CEFR) defines six levels of learner proficiency, and links them to particular communicative abilities. The CEFRLex project aims at compiling lexical resources that link single words and multi-word expressions to particular CEFR levels. The resources are thought to reflect second language learner needs as they are compiled from CEFR-graded textbooks and other learner-directed texts. In this work, we investigate the applicability of CEFRLex resources for building language learning applications. Our main concerns were that vocabulary in language learning materials might be sparse, i.e. that not all vocabulary items that belong to a particular level would also occur in materials for that level, and, on the other hand, that vocabulary items might be used on lower-level materials if required by the topic (e.g. with a simpler paraphrasing or translation). Our results indicate that the English CEFRLex resource is in accordance with external resources that we jointly employ as gold standard. Together with other values obtained from monolingual and parallel corpora, we can indicate which entries need to be adjusted to obtain values that are even more in line with this gold standard. We expect that this finding also holds for the other languages
-