Second International Conference on Spoken Language Processing (ICSLP'92)
Banff, Alberta, Canada
This paper presents lexical statistics on the pattern of occurrence of words embedded in other words. We report the results of an analysis of 25000 words, varying in length from two to six syllables, extracted from a phonetically-coded English dictionary (The Longman Dictionary of Contemporary English). Each syllable, and each string of syllables within each word was checked against the dictionary. Two analyses are presented: the first used a complete list of polysyllables, with look-up on the entire dictionary; the second used a sublist of content words, counting only embedded words which were themselves content words. The results have important implications for models of human speech recognition. The efficiency of these models depends, in different ways, on the number and location of words within words.
Bibliographic reference. McQueen, James M. / Cutler, Anne (1992): "Words within words: lexical statistics and lexical access", In ICSLP-1992, 221-224.