SparkSQL+: Next-generation Query Planning over Spark

Binyang Dai, Qichen Wang, Ke Yi

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

We will demonstrate SparkSQL+, a SQL processing engine built on top of Spark. Unlike the vanilla SparkSQL that uses classical query plans, SparkSQL+ adopts some of the recently developed new query plans, including generalized hypertree decompositions (GHD), worst-case optimal join (WCOJ) algorithms, and conjunctive queries with comparisons (CQC). SparkSQL+ also provides a platform for users to explore different query plans for a given query through a web-based interface, and compare their performance with classical query plans on the same Spark core.
Original languageEnglish
Title of host publicationSIGMOD/PODS '23: Companion of the 2023 International Conference on Management of Data
PublisherAssociation for Computing Machinery (ACM)
Pages115–118
Number of pages4
ISBN (Electronic)9781450395076
ISBN (Print)9781450395076
DOIs
Publication statusPublished - 5 Jun 2023
EventACM SIGMOD International Conference on Management of Data, SIGMOD/PODS 2023 - Seattle, United States
Duration: 18 Jun 202323 Jun 2023
https://2023.sigmod.org/
https://dl.acm.org/doi/proceedings/10.1145/3555041

Publication series

NameProceedings of the ACM SIGMOD International Conference on Management of Data
ISSN (Print)0730-8078

Conference

ConferenceACM SIGMOD International Conference on Management of Data, SIGMOD/PODS 2023
Country/TerritoryUnited States
CitySeattle
Period18/06/2323/06/23
Internet address

Scopus Subject Areas

  • Software
  • Information Systems

User-Defined Keywords

  • acyclic joins
  • compilation
  • conjunctive query
  • visualization

Fingerprint

Dive into the research topics of 'SparkSQL+: Next-generation Query Planning over Spark'. Together they form a unique fingerprint.

Cite this