Submitted manuscript
Final published version
Licence: CC BY: Creative Commons Attribution 4.0 International License
Research output: Contribution to Journal/Magazine › Journal article › peer-review
Research output: Contribution to Journal/Magazine › Journal article › peer-review
}
TY - JOUR
T1 - The more proficient the learners, the less sophisticated their L2 vocabulary?
T2 - The curious effect of the reference corpus on mean-frequency measures of lexical sophistication.
AU - Bottini, Raffaella
AU - Le Foll, Elen
PY - 2025/2/3
Y1 - 2025/2/3
N2 - Mean-frequency scores of lexical sophistication are used to evaluate written and spoken language production. They are calculated using word frequencies extracted from a reference corpus. Using mixed-effects regression models, we analyse the strength of the relationship between L2 proficiency and mean-frequency scores in spoken and written texts using reference corpora representing different modes and registers. We control for task and topic effects. We observe that mean-frequency measures of lexical sophistication are considerably more influenced by the mode and register of the reference corpus used to calculate these scores than by language users’ proficiency level. Advanced language users produce more frequent vocabulary, typical of the target register, in both spoken monologues and written essays. These results provide evidence in favour of a conceptual and terminological shift from lexical sophistication to register appropriateness (as suggested by Durrant & Brenchley, 2019) to refer to the construct captured by mean-frequency scores of vocabulary use.
AB - Mean-frequency scores of lexical sophistication are used to evaluate written and spoken language production. They are calculated using word frequencies extracted from a reference corpus. Using mixed-effects regression models, we analyse the strength of the relationship between L2 proficiency and mean-frequency scores in spoken and written texts using reference corpora representing different modes and registers. We control for task and topic effects. We observe that mean-frequency measures of lexical sophistication are considerably more influenced by the mode and register of the reference corpus used to calculate these scores than by language users’ proficiency level. Advanced language users produce more frequent vocabulary, typical of the target register, in both spoken monologues and written essays. These results provide evidence in favour of a conceptual and terminological shift from lexical sophistication to register appropriateness (as suggested by Durrant & Brenchley, 2019) to refer to the construct captured by mean-frequency scores of vocabulary use.
KW - lexical complexity
KW - L2 proficiency
KW - reference corpora
KW - register appropriateness
KW - language testing
U2 - 10.1075/ijlcr.23029.bot
DO - 10.1075/ijlcr.23029.bot
M3 - Journal article
VL - 11
SP - 47
EP - 78
JO - International Journal of Learner Corpus Research
JF - International Journal of Learner Corpus Research
SN - 2215-1486
IS - 1
ER -