HEAL DSpace

An embedded saliency map estimator scheme: Application to video encoding

Αποθετήριο DSpace/Manakin

Εμφάνιση απλής εγγραφής

dc.contributor.author Tsapatsoulis, N en
dc.contributor.author Rapantzikos, K en
dc.contributor.author Pattichis, C en
dc.date.accessioned 2014-03-01T02:44:26Z
dc.date.available 2014-03-01T02:44:26Z
dc.date.issued 2007 en
dc.identifier.issn 0129-0657 en
dc.identifier.uri https://dspace.lib.ntua.gr/xmlui/handle/123456789/31826
dc.subject Embedded implementation en
dc.subject ROI-based video encoding en
dc.subject Visual attention model en
dc.subject.classification Computer Science, Artificial Intelligence en
dc.subject.other Computation theory en
dc.subject.other Image compression en
dc.subject.other Image quality en
dc.subject.other Video signal processing en
dc.subject.other Wavelet analysis en
dc.subject.other Wavelet decomposition en
dc.subject.other Embedded implementation en
dc.subject.other ROI-based video encoding en
dc.subject.other Visual attention model en
dc.subject.other Image coding en
dc.subject.other article en
dc.subject.other artificial neural network en
dc.subject.other attention en
dc.subject.other automated pattern recognition en
dc.subject.other human en
dc.subject.other methodology en
dc.subject.other pattern recognition en
dc.subject.other photostimulation en
dc.subject.other physiology en
dc.subject.other psychological model en
dc.subject.other reaction time en
dc.subject.other videorecording en
dc.subject.other Attention en
dc.subject.other Humans en
dc.subject.other Models, Psychological en
dc.subject.other Neural Networks (Computer) en
dc.subject.other Pattern Recognition, Automated en
dc.subject.other Pattern Recognition, Visual en
dc.subject.other Photic Stimulation en
dc.subject.other Reaction Time en
dc.subject.other Video Recording en
dc.title An embedded saliency map estimator scheme: Application to video encoding en
heal.type conferenceItem en
heal.identifier.primary 10.1142/S0129065707001147 en
heal.identifier.secondary http://dx.doi.org/10.1142/S0129065707001147 en
heal.language English en
heal.publicationDate 2007 en
heal.abstract In this paper we propose a novel saliency-based computational model for visual attention. This model processes both top-down (goal directed) and bottom-up information. Processing in the top-down channel creates the so called skin conspicuity map and emulates the visual search for human faces performed by humans. This is clearly a goal directed task but is generic enough to be context independent. Processing in the bottom-up information channel follows the principles set by Itti et al. but it deviates from them by computing the orientation, intensity and color conspicuity maps within a unified multi-resolution framework based on wavelet subband analysis. In particular, we apply a wavelet based approach for efficient computation of the topographic feature maps. Given that wavelets and multiresolution theory are naturally connected the usage of wavelet decomposition for mimicking the center surround process in humans is an obvious choice. However, our implementation goes further. We utilize the wavelet decomposition for inline computation of the features (such as orientation angles) that are used to create the topographic feature maps. The bottom-up topographic feature maps and the top-down skin conspicuity map are then combined through a sigmoid function to produce the final saliency map. A prototype of the proposed model was realized through the TMDSDMK642-0E DSP platform as an embedded system allowing real-time operation. For evaluation purposes, in terms of perceived visual quality and video compression improvement, a ROI-based video compression setup was followed. Extended experiments concerning both MPEG-I as well as low bit-rate MPEG-4 video encoding were conducted showing significant improvement in video compression efficiency without perceived deterioration in visual quality. © World Scientific Publishing Company. en
heal.publisher WORLD SCIENTIFIC PUBL CO PTE LTD en
heal.journalName International Journal of Neural Systems en
dc.identifier.doi 10.1142/S0129065707001147 en
dc.identifier.isi ISI:000249024600008 en
dc.identifier.volume 17 en
dc.identifier.issue 4 en
dc.identifier.spage 289 en
dc.identifier.epage 304 en


Αρχεία σε αυτό το τεκμήριο

Αρχεία Μέγεθος Μορφότυπο Προβολή

Δεν υπάρχουν αρχεία που σχετίζονται με αυτό το τεκμήριο.

Αυτό το τεκμήριο εμφανίζεται στην ακόλουθη συλλογή(ές)

Εμφάνιση απλής εγγραφής