dc.contributor.author |
Lakka, C |
en |
dc.contributor.author |
Nikolopoulos, S |
en |
dc.contributor.author |
Varytimidis, C |
en |
dc.contributor.author |
Kompatsiaris, I |
en |
dc.date.accessioned |
2014-03-01T01:34:51Z |
|
dc.date.available |
2014-03-01T01:34:51Z |
|
dc.date.issued |
2011 |
en |
dc.identifier.issn |
0923-5965 |
en |
dc.identifier.uri |
https://dspace.lib.ntua.gr/xmlui/handle/123456789/20900 |
|
dc.subject |
Bayesian networks modeling |
en |
dc.subject |
Compound documents analysis |
en |
dc.subject |
Cross media analysis |
en |
dc.subject |
Knowledge fusion |
en |
dc.subject |
Video shot classification |
en |
dc.subject.classification |
Engineering, Electrical & Electronic |
en |
dc.subject.other |
Application contexts |
en |
dc.subject.other |
Bayesian |
en |
dc.subject.other |
Car manufacturing |
en |
dc.subject.other |
Certain hypothesis |
en |
dc.subject.other |
Compound document |
en |
dc.subject.other |
Conceptual spaces |
en |
dc.subject.other |
Cross-media |
en |
dc.subject.other |
Discriminative models |
en |
dc.subject.other |
Domain knowledge |
en |
dc.subject.other |
Existing method |
en |
dc.subject.other |
Explicit knowledge |
en |
dc.subject.other |
Heterogeneous media |
en |
dc.subject.other |
Knowledge fusion |
en |
dc.subject.other |
Media types |
en |
dc.subject.other |
Modeling approach |
en |
dc.subject.other |
Network modeling |
en |
dc.subject.other |
Performance improvements |
en |
dc.subject.other |
Semantic analysis |
en |
dc.subject.other |
Textual information |
en |
dc.subject.other |
Video shot classification |
en |
dc.subject.other |
Video shots |
en |
dc.subject.other |
Automobile manufacture |
en |
dc.subject.other |
Competition |
en |
dc.subject.other |
Distributed parameter networks |
en |
dc.subject.other |
Inference engines |
en |
dc.subject.other |
Information retrieval systems |
en |
dc.subject.other |
Intelligent networks |
en |
dc.subject.other |
Knowledge based systems |
en |
dc.subject.other |
Semantics |
en |
dc.subject.other |
Speech recognition |
en |
dc.subject.other |
Support vector machines |
en |
dc.subject.other |
Video signal processing |
en |
dc.subject.other |
Bayesian networks |
en |
dc.title |
A Bayesian network modeling approach for cross media analysis |
en |
heal.type |
journalArticle |
en |
heal.identifier.primary |
10.1016/j.image.2011.01.004 |
en |
heal.identifier.secondary |
http://dx.doi.org/10.1016/j.image.2011.01.004 |
en |
heal.language |
English |
en |
heal.publicationDate |
2011 |
en |
heal.abstract |
Existing methods for the semantic analysis of multimedia, although effective for single-medium scenarios, are inherently flawed in cases where knowledge is spread over different media types. In this work we implement a cross media analysis scheme that takes advantage of both visual and textual information for detecting high-level concepts. The novel aspect of this scheme is the definition and use of a conceptual space where information originating from heterogeneous media types can be meaningfully combined to facilitate analysis decisions. More specifically, our contribution is a modeling approach for Bayesian Networks that defines this conceptual space and allows evidence originating from the domain knowledge, the application context and different content modalities to support or disprove a certain hypothesis. Using this scheme we have performed experiments on a set of 162 compound documents taken from the domain of the car manufacturing industry and 118 581 video shots taken from the TRECVID2010 competition. The obtained results show that the proposed modeling approach exploits the complementary effect of evidence extracted across different media and delivers performance improvements compared to the single-medium cases. Moreover, by comparing the performance of the proposed approach with an approach using Support Vector Machines (SVM), we have verified that in a cross media setting generative models are better suited than discriminative ones, mainly due to their ability to smoothly incorporate explicit knowledge and learn from a few examples. (C) 2011 Elsevier B.V. All rights reserved. |
en |
heal.publisher |
ELSEVIER SCIENCE BV |
en |
heal.journalName |
Signal Processing: Image Communication |
en |
dc.identifier.doi |
10.1016/j.image.2011.01.004 |
en |
dc.identifier.isi |
ISI:000290825000005 |
en |
dc.identifier.volume |
26 |
en |
dc.identifier.issue |
3 |
en |
dc.identifier.spage |
175 |
en |
dc.identifier.epage |
193 |
en |