TY - GEN
T1 - Multi-Space Tree with Incremental Construction for GPU-Accelerated Range Queries
AU - Donnelly, Brian
AU - Gowanlock, Michael
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Performing range queries is prohibitively expensive as the dimensionality of the data increases. Indexing data structures reduce the time complexity of these searches by eliminating superfluous distance calculations. The state-of-the-art utilizes the GPU due to its high distance calculation throughput as compared to multi-core CPUs. Previous state-of-the-art indexes fall into two categories: metric- and coordinate-based indexes, both of which partition the space using different approaches. The indexes partition the space to generate a set of candidate points for a given query which are later refined by distance calculations. Popular metric-based indexes partition the data based on distances to reference points, where the placement of the reference points determines the partitioning of the data space but the effectiveness depends on the distribution of the data. In high-dimensions, coordinate-based indexes typically partition the data based on a subset of the coordinate dimensions. Regardless of the index type there is a tradeoff between index search overhead and the number of distance calculations, where increasing the number of partitions will increase the search overhead but will decrease the number of distance calculations computed. In this paper, we propose Multi-Space Tree with Incremental Construction (MISTIC), a blended approach which uses both metric-based and coordinate-based partitioning strategies coupled with incremental index construction. We evaluate MISTIC on 5 real-world datasets and compare performance to both a state-of-the-art metric-based index, COSS, and a state-of-the-art coordinate-based index, GDS-JOIN. We find that MISTIC outperforms the state-of-the-art methods with an average speedup of 2.53× over COSS and 2.73× over GDS-JOIN.
AB - Performing range queries is prohibitively expensive as the dimensionality of the data increases. Indexing data structures reduce the time complexity of these searches by eliminating superfluous distance calculations. The state-of-the-art utilizes the GPU due to its high distance calculation throughput as compared to multi-core CPUs. Previous state-of-the-art indexes fall into two categories: metric- and coordinate-based indexes, both of which partition the space using different approaches. The indexes partition the space to generate a set of candidate points for a given query which are later refined by distance calculations. Popular metric-based indexes partition the data based on distances to reference points, where the placement of the reference points determines the partitioning of the data space but the effectiveness depends on the distribution of the data. In high-dimensions, coordinate-based indexes typically partition the data based on a subset of the coordinate dimensions. Regardless of the index type there is a tradeoff between index search overhead and the number of distance calculations, where increasing the number of partitions will increase the search overhead but will decrease the number of distance calculations computed. In this paper, we propose Multi-Space Tree with Incremental Construction (MISTIC), a blended approach which uses both metric-based and coordinate-based partitioning strategies coupled with incremental index construction. We evaluate MISTIC on 5 real-world datasets and compare performance to both a state-of-the-art metric-based index, COSS, and a state-of-the-art coordinate-based index, GDS-JOIN. We find that MISTIC outperforms the state-of-the-art methods with an average speedup of 2.53× over COSS and 2.73× over GDS-JOIN.
KW - gpu
KW - metric-based
KW - range query
UR - http://www.scopus.com/inward/record.url?scp=105000015329&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=105000015329&partnerID=8YFLogxK
U2 - 10.1109/HIPC62374.2024.00024
DO - 10.1109/HIPC62374.2024.00024
M3 - Conference contribution
AN - SCOPUS:105000015329
T3 - Proceedings - 2024 IEEE 31st International Conference on High Performance Computing, Data, and Analytics, HiPC 2024
SP - 132
EP - 142
BT - Proceedings - 2024 IEEE 31st International Conference on High Performance Computing, Data, and Analytics, HiPC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 31st Annual IEEE International Conference on High Performance Computing, Data, and Analytics, HiPC 2024
Y2 - 18 December 2024 through 21 December 2024
ER -