Abstract
High-performance parallel code generation is a complex and fascinating area in computer science that focuses on producing code that executes as quickly and efficiently as possible. In our paper, we designed a new architecture for parallel code generation agent with 4 inter-connected components of LLM---Memory, Planning, Tools and Action. It also incooperated with two techniques: data augmentation, prompting and retrieval-augmented editing to improve the performance of the parallel codes. Data augmentation is implemented by extracting and processing PIE dataset, and also synthesis dataset generated by LLM models with ParEval benchmark. Finally planning-oriented prompting, code verification and retrieval augmented editing are used to promote the actual performance of the LLM generated code. The evaluation results confirm that a rough speedup of 6.06X and 5.13X are achieved using Qwen2.5-Coder-7B-Instruct, Qwen2.5-Coder-14B-Instruct LLM models.
Original language | English |
---|---|
Title of host publication | Proceedings of the 1st FastCode Programming Challenge, FCPC 2025 |
Place of Publication | New York |
Publisher | Association for Computing Machinery (ACM) |
Pages | 1-7 |
Number of pages | 7 |
ISBN (Electronic) | 9798400714467 |
ISBN (Print) | 9798400714467 |
DOIs | |
Publication status | Published - 2 May 2025 |
Event | Fastcode Programming Challenge 2025 - The Westin Las Vegas Hotel & Spa, Las Vegas, United States Duration: 1 Mar 2025 → 5 Mar 2025 https://dl.acm.org/doi/proceedings/10.1145/3711708 (Conference proceeding) https://ppopp25.sigplan.org/ (Conference website) |
Publication series
Name | Proceedings of the FastCode Programming Challenge |
---|---|
Publisher | Association for Computing Machinery |
Conference
Conference | Fastcode Programming Challenge 2025 |
---|---|
Abbreviated title | PPoPP 2025 |
Country/Territory | United States |
City | Las Vegas |
Period | 1/03/25 → 5/03/25 |
Internet address |
|
User-Defined Keywords
- LLM
- code generation
- parallelization