VideoCompression

From PublicWiki
Revision as of 00:37, 14 October 2006 by AnnaCavender (talk | contribs)

Jump to: navigation, search

Video Compression

[1] George Ghinea, Johnson P Thomas. QoS Impact on User Perception and Understanding of Multimedia Video Clips. Proceedings of ACM Multimedia 1998: 49-54 (University of Reading, U.K.)

Comments: Users' definition of QoS as based on their ability to understand multimedia videos may be different from the technical definition. Lower framerate may lead to more information gathered because it was seen longer.

[2] Wilson S. Geisler and Jeffrey S. Perry. A real-time foveated multiresolution system for low-bandwidth video communication. SPIE Proceedings Vol. 3299, 1998 (University of Texas, Austin)

Comments: Better resolution where people are likely to look. Plus, some research on how to intelligently segment images to mesh with human visual system.

[3] Laura J. Muir and Iain E. G. Richardson. Perception of Sign Language and Its Application to Visual Communications for Deaf People. Journal of Deaf Studies and Deaf Educationn 2005. Volume 10, Number 4 390-401 (The Robert Gordon University, Aberdeen, United Kingdom)

Comments: Participants were shown three videos where zoom, background, and range of sign was varied. Regardless of these factors, participants viewing patterns were the same, focused around the lower face of the signer. Visual excursions were noticed when the hands came close to the face (close enough to "draw" the eyes away from the face briefly, but fingerspelling was not a factor. Conclusion is that ROI would be an appropriate and useful technique.

[4] Andrew P. Bradley and Fred W.M. Stentiford. Visual attention for region of interest coding in JPEG 2000. Journal of Visual Communication and Image Representation 14(3):232-250. (The University of Queensland, Australia)

Comments: Images, not video. Compicated clustering to find relevant ROIs. Difference in regions is just bits/pixel. User studies indicated that users did *not* prefer the ROI images over the uniformly distorted ones! Perhaps the region was not chosen properly, or not all users prefer the same region...

[5] Richard P Schumeyer, Edwin A. Heredia, Kenneth E. Barner. Region of Interest Priority Coding for Sign Language Videoconferencing. IEEE First Workshop on Multimedia Signal Processing, pp. 531--536, (Princeton), 1997. (Applied Science and Engineering Laboratories and Thomson Consumer Electronics)

Comments: ROI of sign language video results in better compression (old paper: using H.261). Results not empirically tested.

[6] Nariman Habili, Cheng-Chew Lim, Alireza Moini. Segmentation of the face and hands in sign language video sequences using color and motion cues. IEEE Trans. Circuits Syst. Video Techn. 14(8): 1086-1097 (2004)

Comments: One of many skin detection algorithms using properties of skin found in the chrominance, motion, and connected components. Just happens to have 'sign language' in the title.

[7] George Sperling, Michael Landy, Yoav Cohen, and M. Pavel. Intelligible Encoding of ASL image Sequences at Extremely Low Information Rates. Papers from the second workshop Vol. 13 on Human and Machine Vision II, pp. 256--312, 1986