Paper
10 January 2003 Context-enhanced video understanding
Author Affiliations +
Proceedings Volume 5021, Storage and Retrieval for Media Databases 2003; (2003) https://doi.org/10.1117/12.479745
Event: Electronic Imaging 2003, 2003, Santa Clara, CA, United States
Abstract
Many recent efforts have been made to automatically index multimedia content with the aim of bridging the semantic gap between syntax and semantics. In this paper, we propose a novel framework to automatically index video using context for video understanding. First we discuss the notion of context and how it relates to video understanding. Then we present the framework we are constructing, which is modeled as an expert system that uses a rule-based engine, domain knowledge, visual detectors (for objects and scenes), and different data sources available with the video (metadata, text from automatic speech recognition, etc.). We also describe our approach to align text from speech recognition and video segments, and present experiments using a simple implementation of our framework. Our experiments show that context can be used to improve the performance of visual detectors.
© (2003) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Alejandro Jaimes, Milind Ramesh Naphade, Harriet Nock, John R. Smith, and Belle L. Tseng "Context-enhanced video understanding", Proc. SPIE 5021, Storage and Retrieval for Media Databases 2003, (10 January 2003); https://doi.org/10.1117/12.479745
Lens.org Logo
CITATIONS
Cited by 4 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Visualization

Sensors

Speech recognition

Video processing

Data modeling

Information visualization

RELATED CONTENT

Attention based CNN-LSTM network for video caption
Proceedings of SPIE (November 10 2022)
Content-based analysis of news video
Proceedings of SPIE (September 25 2001)
System for parsing MPEG videos
Proceedings of SPIE (December 20 1999)

Back to Top