Sudokube

Sudokube is a data cube system that supports fast aggregation queries on high-dimensional data. Like traditional data cubes, Sudokube supports OLAP operations such as roll-up, drill-down, slicing and dicing, but even on high-dimensional data that cannot be supported at interactive speeds using previous technology. For high-dimensional data, the full materialization involving all possible projections is not possible due to storage and compute limitations. When only some projections can be materialized, current approaches evaluate queries from the smallest materialized projection that contains the query, which in practice can be slow for large volumes of data. Sudokube, on the other hand, tries to approximate query results from all available projections, incrementally improving the results as more projections are processed in an online fashion.

Technical details can be found in our VLDB'22 paper, High-dimensional Data Cubes

Requirements

This project has the following dependencies:

sbt
JDK (version 8)
gcc
cmake

Instructions to build the shared library libCBackend

Set the environment variable JAVA_HOME to the home directory of the JDK installation. The folder ${JAVA_HOME}/include must contain the header file jni.h
Run sbt nativeCompile from the root directory of the project

Instructions to run

Edit the file .jvmopts in the root directory of the project to set the maximum Java heap size to the desirable amount.
Run sbt test from the root directory of the project to run all the tests
Run sbt "runMain <classname>" to run some class containing the main method, for example, example.Demotxt

Generate data and build data cube

In order to reproduce the experiment with fixed queries (Fig 12) exactly, we have fixed the seed of the random generator that is used in deciding what cuboids are materialized to zero. This can be disabled by editing the files src/main/scala/frontend/generators/NYC.scala and src/main/scala/frontend/generators/SSB.scala before generating the data cube.

New York Parking Violations Dataset
- Run dataloading-scripts/nyc.sh
Star Schema Benchmark
- Follow instructions to build ssb-dbgen in the same folder containing the sudokube repository. In our scripts, we use ../ssb-dbgen from the root directory of our project to access the generator.
- Run dataloading-scripts/ssb.sh
Warmup Dataset
- Run sbt "runMain frontend.generators.Warmup"

Run Experiments from our VLDB 2022 paper

The complete reproducibility package can be found under experiments/vldb2022_sudokube_reproducibility.zip.

Run Experiments comparing Graphical Model Solvers and Moment Solvers

See file IPF Experimenter

Run Experiments from our TODS 2025 paper

The complete reproducibility package can be found under experiments/tods2025_sudokube_reproducibility.zip.

Name		Name	Last commit message	Last commit date
Latest commit History 482 Commits
Docker		Docker
dataloading-scripts		dataloading-scripts
example-data		example-data
experiments		experiments
project		project
scripts		scripts
src		src
tabledata		tabledata
.gitignore		.gitignore
.jvmopts		.jvmopts
LICENSE		LICENSE
README.md		README.md
build.sbt		build.sbt
hosts		hosts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Sudokube

Requirements

Instructions to build the shared library libCBackend

Instructions to run

Generate data and build data cube

Run Experiments from our VLDB 2022 paper

Run Experiments comparing Graphical Model Solvers and Moment Solvers

Run Experiments from our TODS 2025 paper

About

Uh oh!

Releases

Packages

Contributors 9

Uh oh!

Languages

License

epfldata/sudokube

Folders and files

Latest commit

History

Repository files navigation

Sudokube

Requirements

Instructions to build the shared library libCBackend

Instructions to run

Generate data and build data cube

Run Experiments from our VLDB 2022 paper

Run Experiments comparing Graphical Model Solvers and Moment Solvers

Run Experiments from our TODS 2025 paper

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 9

Uh oh!

Languages

Packages