EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology

Aalborg, Denmark
September 3-7, 2001


Classification of Video Genre using Audio

Matthew Roach, John S. Mason

University of Wales Swansea, UK

In this paper we propose an approach to high-level classification of video into one of five genres: sport, cartoon, news, commercial and music. An important issue for automatic high-level classification systems is the amount of time needed to classify a video, so we investigate classification performance as a function of test sequence length. In addition, we report performance for different orders and combinations of static and dynamic mel-frequency cepstral coefficients (MFCCs). We find that static and delta MFCCs perform well for this classification task. For the five-class problem, a test sequence of approximately 25 seconds gives approximately 80% correct identification.
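
To illustrate what combining static and dynamic coefficients can look like in practice, the following is a minimal sketch of delta features computed from a matrix of static cepstral coefficients. The abstract does not state how the deltas were computed; the sketch assumes the commonly used linear-regression formula over a window of neighbouring frames. The function names `delta` and `static_plus_delta` and the `width` parameter are hypothetical and chosen only for illustration.

```python
import numpy as np


def delta(features, width=2):
    """Regression-based delta coefficients (an assumed, common formulation).

    features: array of shape (num_frames, num_coeffs) holding static coefficients.
    width:    number of frames on each side used in the regression window.
    Returns an array of the same shape as `features`.
    """
    features = np.asarray(features, dtype=float)
    num_frames = features.shape[0]
    # Repeat the first and last frames so every frame has `width` neighbours.
    padded = np.pad(features, ((width, width), (0, 0)), mode="edge")
    # Normalising term of the regression formula: 2 * sum(k^2).
    denom = 2.0 * sum(k * k for k in range(1, width + 1))
    out = np.zeros_like(features)
    for k in range(1, width + 1):
        ahead = padded[width + k : width + k + num_frames]
        behind = padded[width - k : width - k + num_frames]
        out += k * (ahead - behind)
    return out / denom


def static_plus_delta(static):
    """Stack static coefficients and their deltas into one vector per frame."""
    return np.hstack([static, delta(static)])
```

Given a `(num_frames, num_coeffs)` matrix of static MFCCs from any front end, `static_plus_delta` returns a `(num_frames, 2 * num_coeffs)` matrix; varying `num_coeffs` corresponds to varying the coefficient order.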


Bibliographic reference.  Roach, Matthew / Mason, John S. (2001): "Classification of video genre using audio", In EUROSPEECH-2001, 2693-2696.