• Chỉ mục bởi
  • Năm xuất bản
LIÊN KẾT WEBSITE

A proposal for enhancing training speed in deep learning models based on memory activity survey

Kiet University of Electro-Communications (UEC), 1–5–1 Chofugaoka, Chofu-shi, Tokyo, 182–8585, Japan|
Cong-Kha (56186480400) | Xuan-Tu (16308138900); Pham | Khai-Duy (57220854883); Tran | Trong-Thuc (56024762000); Nguyen | Binh (57200626731); Hoang | Dang Tuan (57219503230); Kieu-Do-Nguyen The Information Technology Institute (VNU-ITI), 144 Xuan Thuy road, Cau Giay dist., Hanoi, Viet Nam|

IEICE Electronics Express Số 15, năm 2021 (Tập 18, trang 1-6)

ISSN: 13492543

ISSN: 13492543

DOI: 10.1587/elex.18.20210252

Tài liệu thuộc danh mục:

Article

English

Từ khóa: Cache memory; Field programmable gate arrays (FPGA); Surveys; Cache configurations; Double data rate memory; Learning models; New approaches; Profiling tools; Software and hardwares; Training database; Training process; Deep learning
Tóm tắt tiếng anh
Deep Learning (DL) training process involves intensive computations that require a large number of memory accesses. There are many surveys on memory behaviors with the DL training. They use well-known profiling tools or improving the existing tools to monitor the training processes. This paper presents a new approach to profile using a co-operate solution from software and hardware. The idea is to use Field-Programmable-Gate-Array memory as the main memory for the DL training processes on a computer. Then, the memory behaviors from both software and hardware point-of-views can be monitored and evaluated. The most common DL models are selected for the tests, including ResNet, VGG, AlexNet, and GoogLeNet. The CIFAR-10 dataset is chosen for the training database. The experimental results show that the ratio between read and write transactions is roughly about 3 to 1. The requested allocations are varied from 2-Byte to 64-MB, with the most requested sizes are approximately 16-KB to 64-KB. Based on the statistic, a suggestion was made to improve the training speed using an L4 cache for the Double-Data-Rate (DDR) memory. It can be demonstrated that our recommended L4 cache configuration can improve the DDR performance by about 15% to 18%. Copyright � 2021 The Institute of Electronics, Information and Communication Engineers.

Xem chi tiết