Abstract
To evaluate the performance of database applications and database management systems (DBMSs), we usually execute workloads of queries on generated databases of different sizes and then benchmark various measures such as respond time and throughput. This paper introduces MyBenchmark, a parallel data generation tool that takes a set of queries as input and generates database instances. Users of MyBenchmark can control the characteristics of the generated data as well as the characteristics of the resulting workload. Applications of MyBenchmark include DBMS testing, database application testing, and application-driven benchmarking. In this paper, we present the architecture and the implementation algorithms of MyBenchmark. Experimental results show that MyBenchmark is able to generate workload-aware databases for a variety of workloads including query workloads extracted from TPC-C, TPC-E, TPC-H, and TPC-W benchmarks.
Original language | English |
---|---|
Pages (from-to) | 895-913 |
Number of pages | 19 |
Journal | VLDB Journal |
Volume | 23 |
Issue number | 6 |
DOIs | |
Publication status | Published - 15 Nov 2014 |
Scopus Subject Areas
- Information Systems
- Hardware and Architecture
User-Defined Keywords
- Benchmarking
- Data Generation
- Performance
- Query Processing