dc.contributor.author |
Terrovitis, M |
en |
dc.contributor.author |
Bouros, P |
en |
dc.contributor.author |
Vassiliadis, P |
en |
dc.contributor.author |
Sellis, T |
en |
dc.contributor.author |
Mamoulis, N |
en |
dc.date.accessioned |
2014-03-01T02:47:20Z |
|
dc.date.available |
2014-03-01T02:47:20Z |
|
dc.date.issued |
2011 |
en |
dc.identifier.uri |
https://dspace.lib.ntua.gr/xmlui/handle/123456789/33089 |
|
dc.subject |
Containment queries |
en |
dc.subject |
Inverted files |
en |
dc.subject |
Ordered inverted files |
en |
dc.subject |
Set-values |
en |
dc.subject.other |
Containment query |
en |
dc.subject.other |
Indexing scheme |
en |
dc.subject.other |
Inverted files |
en |
dc.subject.other |
Query processing algorithms |
en |
dc.subject.other |
Range query |
en |
dc.subject.other |
Set containment |
en |
dc.subject.other |
Set-valued attributes |
en |
dc.subject.other |
Set-values |
en |
dc.subject.other |
State-of-the-art methods |
en |
dc.subject.other |
Synthetic data |
en |
dc.subject.other |
Database systems |
en |
dc.subject.other |
Technology |
en |
dc.subject.other |
Information retrieval systems |
en |
dc.title |
Efficient answering of set containment queries for skewed item distributions |
en |
heal.type |
conferenceItem |
en |
heal.identifier.primary |
10.1145/1951365.1951394 |
en |
heal.identifier.secondary |
http://dx.doi.org/10.1145/1951365.1951394 |
en |
heal.publicationDate |
2011 |
en |
heal.abstract |
In this paper we address the problem of efficiently evaluating containment (i.e., subset, equality, and superset) queries over set-valued data. We propose a novel indexing scheme, the Ordered Inverted File (OIF) which, differently from the state-of-the-art, indexes set-valued attributes in an ordered fashion. We introduce query processing algorithms that practically treat containment queries as range queries over the ordered postings lists of OIF and exploit this ordering to quickly prune unnecessary page accesses. OIF is simple to implement and our experiments on both real and synthetic data show that it greatly outperforms the current state-of-the-art methods for all three classes of containment queries. |
en |
heal.journalName |
ACM International Conference Proceeding Series |
en |
dc.identifier.doi |
10.1145/1951365.1951394 |
en |
dc.identifier.spage |
225 |
en |
dc.identifier.epage |
236 |
en |