Performance–energy trade-offs of deep learning convolution algorithms on ARM processors
CC BY
Saved in:
Main Authors: | , , |
---|---|
Format: | Book |
Language: | English |
Published: |
Springer
2023
|
Subjects: | |
Online Access: | https://link.springer.com/article/10.1007/s11227-023-05050-4 https://dlib.phenikaa-uni.edu.vn/handle/PNK/8247 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
oai:localhost:PNK-8247 |
---|---|
record_format |
dspace |
spelling |
oai:localhost:PNK-82472023-04-25T01:26:40Z Performance–energy trade-offs of deep learning convolution algorithms on ARM processors Manuel F., Dolz Sergio, Barrachina Héctor, Martínez ARM-based processor architectures NVIDIA Denver2 CC BY In this work, we assess the performance and energy efficiency of high-performance codes for the convolution operator, based on the direct, explicit/implicit lowering and Winograd algorithms used for deep learning (DL) inference on a series of ARM-based processor architectures. Specifically, we evaluate the NVIDIA Denver2 and Carmel processors, as well as the ARM Cortex-A57 and Cortex-A78AE CPUs as part of a recent set of NVIDIA Jetson platforms. The performance–energy evaluation is carried out using the ResNet-50 v1.5 convolutional neural network (CNN) on varying configurations of convolution algorithms, number of threads/cores, and operating frequencies on the tested processor cores. The results demonstrate that the best throughput is obtained on all platforms with the Winograd convolution operator running on all the cores at their highest frequency. However, if the goal is to reduce the energy footprint, there is no rule of thumb for the optimal configuration. 2023-04-25T01:26:40Z 2023-04-25T01:26:40Z 2023 Book https://link.springer.com/article/10.1007/s11227-023-05050-4 https://dlib.phenikaa-uni.edu.vn/handle/PNK/8247 en application/pdf Springer |
institution |
Digital Phenikaa |
collection |
Digital Phenikaa |
language |
English |
topic |
ARM-based processor architectures NVIDIA Denver2 |
spellingShingle |
ARM-based processor architectures NVIDIA Denver2 Manuel F., Dolz Sergio, Barrachina Héctor, Martínez Performance–energy trade-offs of deep learning convolution algorithms on ARM processors |
description |
CC BY |
format |
Book |
author |
Manuel F., Dolz Sergio, Barrachina Héctor, Martínez |
author_facet |
Manuel F., Dolz Sergio, Barrachina Héctor, Martínez |
author_sort |
Manuel F., Dolz |
title |
Performance–energy trade-offs of deep learning convolution algorithms on ARM processors |
title_short |
Performance–energy trade-offs of deep learning convolution algorithms on ARM processors |
title_full |
Performance–energy trade-offs of deep learning convolution algorithms on ARM processors |
title_fullStr |
Performance–energy trade-offs of deep learning convolution algorithms on ARM processors |
title_full_unstemmed |
Performance–energy trade-offs of deep learning convolution algorithms on ARM processors |
title_sort |
performance–energy trade-offs of deep learning convolution algorithms on arm processors |
publisher |
Springer |
publishDate |
2023 |
url |
https://link.springer.com/article/10.1007/s11227-023-05050-4 https://dlib.phenikaa-uni.edu.vn/handle/PNK/8247 |
_version_ |
1764177434900430848 |
score |
8.891695 |