Abstract
As heterogeneous computing systems continue to evolve, emerging workloads increasingly span multiple types of accelerators, resulting in frequent inter-accelerator data transfers. However, traditional CPU-managed memory systems often struggle to coordinate these transfers efficiently, leading to high latency, poor memory bandwidth utilization, and scalability bottlenecks. We propose CeDMA, a centralized and programmable Direct Memory Access (DMA) control architecture that enables high-performance, CPU-decoupled memory coordination across diverse accelerators. CeDMA combines a unified hardware-software co-design: a modular DMA engine with integrated address translation and dual-level arbitration logic on the hardware side, and a lightweight instruction-driven memory management model with adaptive scheduling on the software side. CeDMA enables fine-grained control over memory transfers, minimizes off-chip bandwidth consumption, and exploits memory-level parallelism through dynamic resource partitioning. Cycle-accurate simulation results across a diverse workload suite—including GEMM, Conv2D, and graph traversal kernels—demonstrate up to 75% reduction in external memory access, 60% improvement in performance, and 45% reduction in access latency. Furthermore, CeDMA maintains high throughput and predictable latency at scale, supporting up to 32 concurrent accelerators. These results position CeDMA as a scalable, general-purpose memory management substrate for future heterogeneous SoC architectures.
| Original language | English |
|---|---|
| Title of host publication | Advanced Parallel Processing Technologies |
| Subtitle of host publication | 16th International Symposium, APPT 2025, Athens, Greece, July 13-16, 2025, Proceedings |
| Editors | Chao Li, Xuehai Qian, Dimitris Gizopoulos, Boris Grot |
| Place of Publication | Singapore |
| Publisher | Springer |
| Pages | 129-144 |
| Number of pages | 16 |
| ISBN (Electronic) | 9789819510214 |
| ISBN (Print) | 9789819510207 |
| DOIs | |
| Publication status | Published - 3 Nov 2025 |
| Event | 16th International Symposium on Advanced Parallel Processing Technologies - Athenaeum Intercontinental hotel, Athens, Greece Duration: 13 Jul 2025 → 16 Jul 2025 https://link.springer.com/book/10.1007/978-981-95-1021-4 (Conference proceeding) https://www.appt-conference.com/ (Conference website) https://www.appt-conference.com/program (Conference program) |
Publication series
| Name | Lecture Notes in Computer Science |
|---|---|
| Volume | 16062 |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
| Name | APPT: International Symposium on Advanced Parallel Processing Technologies |
|---|
Conference
| Conference | 16th International Symposium on Advanced Parallel Processing Technologies |
|---|---|
| Abbreviated title | APPT 2025 |
| Country/Territory | Greece |
| City | Athens |
| Period | 13/07/25 → 16/07/25 |
| Internet address |
|