liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Activation Function Integration for Accelerating Multi-Layer Graph Convolutional Neural Networks
Linköping University, Department of Electrical Engineering, Computer Engineering. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0003-1823-4211
Univ Texas Dallas, TX USA.
Linköping University, Department of Electrical Engineering, Computer Engineering. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0003-3470-3911
Linköping University, Department of Electrical Engineering, Computer Engineering. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0002-5153-5481
2024 (English)In: 17TH IEEE DALLAS CIRCUITS AND SYSTEMS CONFERENCE, DCAS 2024, IEEE , 2024Conference paper, Published paper (Refereed)
Abstract [en]

Graph Neural Networks (GNNs) are renowned for their ability to process data with irregular structures, making them invaluable in various fields. However, their high computational requirements pose a challenge for efficient inference. In this context, Graph Convolutional Networks (GCNs), a prominent type of GNN, derive their effectiveness from their architecture, which involves layers of graph convolutions and subsequent activation functions. These elements are pivotal in determining the overall runtime of GCNs. Addressing the need for enhanced efficiency, this paper introduces an innovative approach that integrates specialized Activation Functions within a graph Accelerator for Multi-layer GCNs. Our strategy mitigates the need for external memory transfers by positioning the activation function accelerator immediately after the GCN accelerator so that the output remains in the local FPGA memory. This method also incorporates sparse matrix compression and multi-threading to reduce unnecessary data storage and computations, thereby optimizing memory bandwidth and computational efficacy on the Zynq Ultrascale device. A noteworthy aspect of our approach is the adaptation of the Softmax Activation Function to be in a hardware-friendly format, utilizing base replacement and low-precision calculations. This hardware/software co-designed multilayer GCN architecture utilizes PyTorch functions executed on a multi-core ARM CPU, in conjunction with the accelerator which is described in a High-level synthesis (HLS) and is implemented on an AMD/Xilinx Zynq Ultrascale+ FPGA. Our results demonstrate significant performance enhancements, with speedups reaching up to 78.5x for Hardmax and 17.4x for Softermax, compared to traditional software implementations. Importantly, the Softermax variant achieves these speedups with minimal impact on accuracy, while other activation functions maintain reference-level accuracy.

Place, publisher, year, edition, pages
IEEE , 2024.
Series
Proceedings of the IEEE Dallas Circuits and Systems Workshop, ISSN 2994-578X, E-ISSN 2766-5186
Keywords [en]
Graph neural network; hardware accelerators; hardware/software co-design; FPGA HLS; streaming PYNQ overlay
National Category
Computer Engineering
Identifiers
URN: urn:nbn:se:liu:diva-206953DOI: 10.1109/DCAS61159.2024.10539892ISI: 001241292100025ISBN: 9798350349535 (print)ISBN: 9798350349542 (electronic)OAI: oai:DiVA.org:liu-206953DiVA, id: diva2:1892561
Conference
17th Dallas Circuits and Systems Conference (DCAS), Dallas, TX, apr 19-21, 2024
Note

Funding Agencies|Wallenberg AI autonomous systems and software (WASP) program - Knut and Alice Wallenberg Foundation

Available from: 2024-08-27 Created: 2024-08-27 Last updated: 2024-08-27

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Search in DiVA

By author/editor
Grailootanha, MahdiehGustafsson, OscarNunez-Yanez, Jose Luis
By organisation
Computer EngineeringFaculty of Science & Engineering
Computer Engineering

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 275 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf