ParaCoder: Parallel Code Generation with Large Language Model

Xiaowen Huang, Xu Zhang, Lvfang Tao, Renjie Mao, Nan Zhou, Wenxi Zhu, Minwen Deng, Jintao Meng*, Yanjie Wei, Amelie Chi Zhou, Bingqiang Wang, Shengzhong Feng

*Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

High-performance parallel code generation is a complex and fascinating area in computer science that focuses on producing code that executes as quickly and efficiently as possible. In our paper, we designed a new architecture for parallel code generation agent with 4 inter-connected components of LLM---Memory, Planning, Tools and Action. It also incooperated with two techniques: data augmentation, prompting and retrieval-augmented editing to improve the performance of the parallel codes. Data augmentation is implemented by extracting and processing PIE dataset, and also synthesis dataset generated by LLM models with ParEval benchmark. Finally planning-oriented prompting, code verification and retrieval augmented editing are used to promote the actual performance of the LLM generated code. The evaluation results confirm that a rough speedup of 6.06X and 5.13X are achieved using Qwen2.5-Coder-7B-Instruct, Qwen2.5-Coder-14B-Instruct LLM models.
Original languageEnglish
Title of host publicationProceedings of the 1st FastCode Programming Challenge, FCPC 2025
Place of PublicationNew York
PublisherAssociation for Computing Machinery (ACM)
Pages1-7
Number of pages7
ISBN (Electronic)9798400714467
ISBN (Print)9798400714467
DOIs
Publication statusPublished - 2 May 2025
EventFastcode Programming Challenge 2025 - The Westin Las Vegas Hotel & Spa, Las Vegas, United States
Duration: 1 Mar 20255 Mar 2025
https://dl.acm.org/doi/proceedings/10.1145/3711708 (Conference proceeding)
https://ppopp25.sigplan.org/ (Conference website)

Publication series

NameProceedings of the FastCode Programming Challenge
PublisherAssociation for Computing Machinery

Conference

ConferenceFastcode Programming Challenge 2025
Abbreviated titlePPoPP 2025
Country/TerritoryUnited States
CityLas Vegas
Period1/03/255/03/25
Internet address

User-Defined Keywords

  • LLM
  • code generation
  • parallelization

Fingerprint

Dive into the research topics of 'ParaCoder: Parallel Code Generation with Large Language Model'. Together they form a unique fingerprint.

Cite this