Abstract
As large-scale learner corpora become increasingly available, it is vital that natural language processing (NLP) technology is developed to provide rich linguistic annotations necessary for second language (L2) research. We present a system for automatically analyzing subcategorization frames (SCFs) for learner English. SCFs link lexis with morphosyntax, shedding light on the interplay between lexical and structural information in learner language. Meanwhile, SCFs are crucial to the study of a wide range of phenomena including individual verbs, verb classes and varying syntactic structures. To illustrate the usefulness of our system for learner corpus research and second language acquisition (SLA), we investigate how L2 learners diversify their use of SCFs in text and how this diversity changes with L2 proficiency.
Original language | English |
---|---|
Pages (from-to) | 187-218 |
Number of pages | 32 |
Journal | International Journal of Corpus Linguistics |
Volume | 26 |
Issue number | 2 |
Early online date | 8 Dec 2020 |
DOIs | |
Publication status | Published - Jul 2021 |
Keywords
- natural language processing
- SCF identification
- second language acquisition
- subcategorization
- verb-argument construction