Heterogeneous CPU-GPU Epsilon Grid Joins: Static and Dynamic Work Partitioning Strategies

Benoit Gallet, Michael Gowanlock

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

Given two datasets (or tables) A and B and a search distance ϵ, the distance similarity join, denoted as A⋉ ϵB, finds the pairs of points (pa, pb), where pa∈ A and pb∈ B, and such that the distance between pa and pb is ≤ ϵ. If A= B, then the similarity join is equivalent to a similarity self-join, denoted as A⋈ ϵA. We propose in this paper Heterogeneous Epsilon Grid Joins (HEGJoin), a heterogeneous CPU-GPU distance similarity join algorithm. Efficiently partitioning the work between the CPU and the GPU is a challenge. Indeed, the work partitioning strategy needs to consider the different characteristics and computational throughput of the processors (CPU and GPU), as well as the data-dependent nature of the similarity join that accounts in the overall execution time (e.g., the number of queries, their distribution, the dimensionality, etc.). In addition to HEGJoin, we design in this paper a dynamic and two static work partitioning strategies. We also propose a performance model for each static partitioning strategy to perform the distribution of the work between the processors. We evaluate the performance of all three partitioning methods by considering the execution time and the load imbalance between the CPU and GPU as performance metrics. HEGJoin achieves a speedup of up to 5.46 × (3.97 ×) over the GPU-only (CPU-only) algorithms on our first test platform and up to 1.97 × (12.07 ×) on our second test platform over the GPU-only (CPU-only) algorithms.

Original languageEnglish (US)
Pages (from-to)39-62
Number of pages24
JournalData Science and Engineering
Volume6
Issue number1
DOIs
StatePublished - Mar 2021
Externally publishedYes

Keywords

  • HEGJoin
  • Heterogeneous CPU-GPU computing
  • Range query
  • Similarity join
  • Super-EGO
  • Work partitioning

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence
  • Information Systems
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Heterogeneous CPU-GPU Epsilon Grid Joins: Static and Dynamic Work Partitioning Strategies'. Together they form a unique fingerprint.

Cite this