dc.contributor.author |
Spanakis, G |
en |
dc.contributor.author |
Siolas, G |
en |
dc.contributor.author |
Stafylopatis, A |
en |
dc.date.accessioned |
2014-03-01T02:52:38Z |
|
dc.date.available |
2014-03-01T02:52:38Z |
|
dc.date.issued |
2010 |
en |
dc.identifier.issn |
18761100 |
en |
dc.identifier.uri |
https://dspace.lib.ntua.gr/xmlui/handle/123456789/35962 |
|
dc.subject.other |
Clustering process |
en |
dc.subject.other |
Document Representation |
en |
dc.subject.other |
Hier-archical clustering |
en |
dc.subject.other |
Novel methods |
en |
dc.subject.other |
Wikipedia |
en |
dc.subject.other |
Clustering algorithms |
en |
dc.subject.other |
Information science |
en |
dc.title |
Conceptual hierarchical clustering of documents using Wikipedia knowledge |
en |
heal.type |
conferenceItem |
en |
heal.identifier.primary |
10.1007/978-90-481-9794-1_25 |
en |
heal.identifier.secondary |
http://dx.doi.org/10.1007/978-90-481-9794-1_25 |
en |
heal.publicationDate |
2010 |
en |
heal.abstract |
In this paper, we propose a novel method for conceptual hierarchical clustering of documents using knowledge extracted from Wikipedia. A robust and compact document representation is built in real-time using the Wikipedia API. The clustering process is hierarchical and creates cluster labels which are descriptive and important for the examined corpus. Experiments show that the proposed technique greatly improves over the baseline approach. © 2011 Springer Science+Business Media B.V. |
en |
heal.journalName |
Lecture Notes in Electrical Engineering |
en |
dc.identifier.doi |
10.1007/978-90-481-9794-1_25 |
en |
dc.identifier.volume |
62 LNEE |
en |
dc.identifier.spage |
121 |
en |
dc.identifier.epage |
126 |
en |