Ajou University repository

A Cycle-Accurate Simulation Platform for NPU-DRAM Integrated Systems
  • Park, Hyemin ;
  • Kim, Boyeal ;
  • Lee, Suhong ;
  • Lee, Hyuk Jae ;
  • Lee, Hyokeun
Citations

SCOPUS

0

Citation Export

Publication Year
2025-01-01
Journal
Digest of Technical Papers - IEEE International Conference on Consumer Electronics
Publisher
Institute of Electrical and Electronics Engineers Inc.
Citation
Digest of Technical Papers - IEEE International Conference on Consumer Electronics
Keyword
cycle-accurate simulationDRAMNPU
Mesh Keyword
Consumer electronic devicesCycle-accurate simulationIntegrated systemsNeural processing unitNeural-networksNeural-processingOff-chip memoryProcessing unitsRealtime processingSimulation platform
All Science Classification Codes (ASJC)
Industrial and Manufacturing EngineeringElectrical and Electronic Engineering
Abstract
The ever-increasing accuracy of artificial neural networks facilitates various applications in consumer electronic devices. Furthermore, neural processing units (NPUs) enable the real-time processing of neural networks by leveraging domain-specific hardware structures along with considerable on-chip buffers. Unfortunately, the data movement between NPU and off-chip memory (e.g., DRAM) can no longer be ignored; hence, it is necessary to accurately take the performance effect of off-chip memory into consideration, particularly at the architectural-level simulation. In this paper, we propose a configurable, cycle-accurate NPU simulation infrastructure that considers not only the latency effect as in the analytical modeling but also the memory bandwidth utilization. Our simulator reveals that the accurate simulation of off-chip memory captures higher latency compared to analytical modeling. Specifically, it demonstrates the total execution time increases of 19.2%, 2.8%, and 16.2% in ResNet-50, YOLOv3, and BERT, respectively.
ISSN
2159-1423
Language
eng
URI
https://aurora.ajou.ac.kr/handle/2018.oak/38585
https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105006579003&origin=inward
DOI
https://doi.org/10.1109/icce63647.2025.10929895
Type
Conference Paper
Funding
This work is supported by Institute of Information & communications Technology Planning & Evaluation (IITP) grant funded by the Korea government(MSIT)(2021-0-00106, AI accelerator-optimized neural network automatic generation technology and open service platform development). Hyokeun Lee is the corresponding author.
Show full item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Lee, Hyokeun  Image
Lee, Hyokeun 이효근
Department of Electrical and Computer Engineering
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.