Abstract
This paper describes a corpus methodology developed in the course of investigations of three small subcorpora of texts: fiction by Conrad, nineteenth-century accounts of exploration, and adventure fiction. Stemming from observations that both high and lower frequency words in text are of interest in relation to meaning and structure, a 'stratified approach' integrates different levels of frequency by grouping words in a text into a series of frequency bands, identified from a reference corpus, then examining the most recurrent words within each band. This approach shows up, in particular, the ways in which topic and theme are lexicalized, and connotative lexis is associated with lower frequencies. In addition to exemplifying the approach and demonstrating its applications, the paper argues that any set of words, identified as significant through frequency, still involves selection, and that information which is derived quantitatively should not be interpreted as if wholly objective.
Original language | English |
---|---|
Pages (from-to) | 1-33 |
Number of pages | 33 |
Journal | Journal of Literary Semantics |
Volume | 36 |
DOIs | |
Publication status | Published - 1 Apr 2007 |