Skip to main content

NEWS AND RESOURCES FOR MEMBERS OF THE IEEE SIGNAL PROCESSING SOCIETY

Reusing Speech Techniques for Video Semantic Indexing

Many techniques developed in speech research have been successfully employed in other fields, such as automatic video semantic indexing. In this application, a user submits a textual input query for an desired object or a scene to a search system, which returns video shots that include the object or scene. Recently, a new method using Gaussian-mixture model (GMM) supervectors and support vector machines (SVMs) was proven to be very effective. In this method, speech technology such as speaker verification and adaptation techniques plays very important roles. In the column article, entitled “Reusing Speech Techniques for Video Semantic Indexing”, in the March issue of IEEE Signal Processing Magzine, the interested readers can learn more about video semantic indexing and the approaches using speech techniques.