Second European Conference on Speech Communication and Technology

Genova, Italy
September 24-26, 1991


Up from Trigrams! - The Struggle for Improved Language Models

Frederick Jelinek

Continuous Speech Recognition Group, IBM T.J. Watson Research Center, Yorktown Heights, NY, USA

The first experimental results in "large" vocabulary speech recognition were obtained in 1976 [Bahl 78]. They involved continuous speech reading of the so-called Laser Patent Text. The test set was limited to sentences composed entirely of words from a vocabulary of the 1000 most frequent words found in the training text.


Bibliographic reference. Jelinek, Frederick (1991): "Up from trigrams! - the struggle for improved language models", in EUROSPEECH-1991, 1037-1040.