TY - GEN
T1 - Evaluating Accelerators for a High-Throughput Hash-Based Security Protocol
AU - Lee, Kaitlyn
AU - Donnelly, Brian
AU - Sery, Tomer
AU - Ilan, Dan
AU - Cambou, Bertrand
AU - Gowanlock, Michael
N1 - Publisher Copyright:
© 2023 ACM.
PY - 2023/8/7
Y1 - 2023/8/7
N2 - Security threats are rising due to widely available computational power and near-future quantum computers. New cryptographic protocols have been developed to address these challenges, but very few protocols take advantage of parallel computing. In this paper, we propose optimizations to the cryptography protocol Response-Based Cryptography (RBC). Since the protocol is general-purpose, it can be incorporated into post-quantum cryptography systems to authenticate users in resource-constrained environments, like Internet of Thing (IoT) devices. The optimizations proposed in this paper allow for clients to be authenticated faster. Additionally, this paper makes a cross-platform comparison of the performance of the optimized RBC protocol on the Graphics Processing Unit (GPU), the Central Processing Unit (CPU), and the Associative Processing Unit (APU). We find that the GPU and APU yield similar performance but the APU can be much more energy efficient. Furthermore, we evaluate the multi-GPU scalability of the algorithm and achieve a minimum speedup of 2.66 × on 3 × A100 GPUs.
AB - Security threats are rising due to widely available computational power and near-future quantum computers. New cryptographic protocols have been developed to address these challenges, but very few protocols take advantage of parallel computing. In this paper, we propose optimizations to the cryptography protocol Response-Based Cryptography (RBC). Since the protocol is general-purpose, it can be incorporated into post-quantum cryptography systems to authenticate users in resource-constrained environments, like Internet of Thing (IoT) devices. The optimizations proposed in this paper allow for clients to be authenticated faster. Additionally, this paper makes a cross-platform comparison of the performance of the optimized RBC protocol on the Graphics Processing Unit (GPU), the Central Processing Unit (CPU), and the Associative Processing Unit (APU). We find that the GPU and APU yield similar performance but the APU can be much more energy efficient. Furthermore, we evaluate the multi-GPU scalability of the algorithm and achieve a minimum speedup of 2.66 × on 3 × A100 GPUs.
UR - http://www.scopus.com/inward/record.url?scp=85175029278&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85175029278&partnerID=8YFLogxK
U2 - 10.1145/3605731.3605745
DO - 10.1145/3605731.3605745
M3 - Conference contribution
AN - SCOPUS:85175029278
T3 - ACM International Conference Proceeding Series
SP - 40
EP - 49
BT - 52nd International Conference on Parallel Processing, ICPP 2023 - Workshops Proceedings
PB - Association for Computing Machinery
T2 - 52nd International Conference on Parallel Processing, ICPP 2023 - Workshops Proceedings
Y2 - 7 August 2023 through 10 August 2023
ER -