Abstract
Deep convolutional neural networks have become the mainstream solution for many artificial intelligence applications. However, they are still rarely deployed on mobile or edge devices due to the cost of a substantial amount of data movement among limited resources. The emerging processing-inmemory neuromorphic architecture offers a promising direction to accelerate the inference process. The key issue becomes how to effectively allocate the processing of inference between computing and storage resources on an edge device.This paper presents Mobile-I, a resource allocation scheme to accelerate the Inference process on Mobile or edge devices. Mobile-I targets at the emerging 3D neuromorphic architecture to reduce the processing latency among computing resources and fully utilize the limited on-chip storage resources. We formulate the target problem as a resource allocation problem and use a software-based solution to offer the cross-platform deployment across multiple mobile or edge devices. We conduct a set of experiments using realistic workloads that are generated from Intel Movidius neural compute stick. Experimental results show that Mobile-I can effectively reduce the processing latency and improve the utilization of computing resources with negligible overhead in comparison with representative schemes.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 2019 Design, Automation and Test in Europe Conference and Exhibition, DATE 2019 |
| Publisher | IEEE |
| Pages | 806-811 |
| Number of pages | 6 |
| ISBN (Electronic) | 9783981926323, 9783981926330 |
| ISBN (Print) | 9781728103310 |
| DOIs | |
| Publication status | Published - 25 Mar 2019 |
| Event | 22nd Design, Automation and Test in Europe Conference and Exhibition, DATE 2019 - Florence, Italy Duration: 25 Mar 2019 → 29 Mar 2019 https://ieeexplore.ieee.org/xpl/conhome/8704855/proceeding (Conference Proceedings) |
Publication series
| Name | Proceedings of the Design, Automation and Test in Europe Conference and Exhibition, DATE |
|---|---|
| ISSN (Print) | 1530-1591 |
| ISSN (Electronic) | 1558-1101 |
Conference
| Conference | 22nd Design, Automation and Test in Europe Conference and Exhibition, DATE 2019 |
|---|---|
| Country/Territory | Italy |
| City | Florence |
| Period | 25/03/19 → 29/03/19 |
| Internet address |
|
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 9 Industry, Innovation, and Infrastructure
User-Defined Keywords
- Edge computing
- memory management
- scheduling
- neuromorphic architecture
- inference
Fingerprint
Dive into the research topics of 'Towards Cross-Platform Inference on Edge Devices with Emerging Neuromorphic Architecture'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver