Proceedings Article | 4 May 2012
KEYWORDS: Video, 3D displays, Stars, Sensors, Feature extraction, 3D acquisition, Statistical analysis, Fused deposition modeling, Detection and tracking algorithms, Image processing
3D video content is captured and created mainly in high resolution targeting big cinema or home TV screens. For 3D
mobile devices, equipped with small-size auto-stereoscopic displays, such content has to be properly repurposed,
preferably in real-time. The repurposing requires not only spatial resizing but also properly maintaining the output stereo
disparity, as it should deliver realistic, pleasant and harmless 3D perception.
In this paper, we propose an approach to adapt the disparity range of the source video to the comfort disparity zone of
the target display. To achieve this, we adapt the scale and the aspect ratio of the source video. We aim at maximizing the
disparity range of the retargeted content within the comfort zone, and minimizing the letterboxing of the cropped
content.
The proposed algorithm consists of five stages. First, we analyse the display profile, which characterises what 3D
content can be comfortably observed in the target display. Then, we perform fast disparity analysis of the input
stereoscopic content. Instead of returning the dense disparity map, it returns an estimate of the disparity statistics (min,
max, meanand variance) per frame. Additionally, we detect scene cuts, where sharp transitions in disparities occur.
Based on the estimated input, and desired output disparity ranges, we derive the optimal cropping parameters and scale
of the cropping window, which would yield the targeted disparity range and minimize the area of cropped and
letterboxed content. Once the rescaling and cropping parameters are known, we perform resampling procedure using
spline-based and perceptually optimized resampling (anti-aliasing) kernels, which have also a very efficient
computational structure. Perceptual optimization is achieved through adjusting the cut-off frequency of the anti-aliasing
filter with the throughput of the target display.