ALP: Adaptive Lossless Floating-Point Compression

Authors: Azim Afroozeh, Leonardo Kuffó, Peter Boncz
Conference: ACM SIGMOD 2024

What is this repo?

This repository contains the source code and benchmarks for the paper ALP: Adaptive Lossless Floating-Point Compression, published at ACM SIGMOD 2024.

ALP is a state-of-the-art lossless compression algorithm designed for IEEE 754 floating-point data. It encodes data by exploiting two common patterns found in real-world floating-point values:

Decimal Floating-Point Numbers:
A large portion of floats/doubles in real-world datasets are decimals. ALP maps these values into integers by multiplying the number by a power of 10 and then compressing the result using a FastLanes variant of Frame-of-Reference encoding¹, which is SIMD-friendly.
Example: the number 10.12 becomes 1012 and is then fed to the FastLanes encoder.
High-Precision Floating-Point Numbers:
The remaining values are typically high-precision floats/doubles. ALP targets compression opportunities in only the left part of these values, which it compresses using FastLanes dictionary encoding. The right part is left uncompressed, as it is required to preserve high precision and is often highly random and incompressible.

📊 How does ALP perform?

These results highlight ALP’s superior performance across all three key metrics of a compression algorithm:
Decoding Speed, Compression Ratio, and Compression Speed—outperforming other schemes in every category.

🧪 How to Reproduce Results

Just run the following script:

./publication/script/master_script.sh

For more information on reproducing our benchmarks, refer to our guide here,
or read the official ACM reproducibility report:
https://dl.acm.org/doi/10.1145/3687998.3717057

🏅 ACM Artifacts & Awards

We are happy to share that we participated in the SIGMOD Availability & Reproducibility Initiative, and our paper earned all three badges:

🎉 We're also proud to share that ALP won the SIGMOD Best Artifact Award!

⏱️ Want to Benchmark Your Dataset?

Check out our guide: How to Benchmark Your Dataset
It explains how to run ALP on your own data.

🗂️ Repository Structure

src/: Core implementation of ALP and ALP_RD
benchmarks/: Benchmarking tools and datasets
include/: Header files for integration
scripts/: Utility scripts for data processing
test/: Unit tests
publication/: Publications and supplementary materials

📚 Publications

Conference Paper:
ALP: Adaptive Lossless Floating-Point Compression, ACM SIGMOD 2024
https://dl.acm.org/doi/10.1145/3626717
Reproducibility Report:
Reproducibility Report for ACM SIGMOD 2024 Paper: 'ALP: Adaptive Lossless Floating-Point Compression'
https://dl.acm.org/doi/10.1145/3687998.3717057

📄 License

This project is licensed under the MIT License. See the LICENSE file for details.

📬 Contact

If you have questions, want to contribute, or just want to stay up to date with ALP and related projects, join our community on Discord:

🧩 Used By

ALP has been integrated into the following systems:

Learn more about FastLanes here: https://github.com/cwida/fastlanes ↩

Name	Name	Last commit message	Last commit date
Latest commit azimafroozeh Announce SIGMOD 2024 Best Artifact Award in README (#39 ) Apr 24, 2025 db9379d · Apr 24, 2025 History 198 Commits
.github/workflows	.github/workflows	enable verbose	Dec 4, 2024
assets	assets	Announce SIGMOD 2024 Best Artifact Award in README (#39 )	Apr 24, 2025
benchmarks	benchmarks	add speed benchmarks as well including decompression speed and compre…	Dec 4, 2024
data	data	check if mvn and java is installed,	Dec 6, 2024
include	include	ops	Dec 4, 2024
publication	publication	Announce SIGMOD 2024 Best Artifact Award in README (#39 )	Apr 24, 2025
scripts	scripts	add issue_24 dataset	Dec 1, 2024
src	src	Setting '-march=native' for x86 processors without AVX-512DQ.	Jan 16, 2025
test	test	add a test for float alp and implement the explicitly SIMDized part o…	Dec 4, 2024
toolchain	toolchain	rename this to i4i_4xlarge	Dec 1, 2024
.clang-format	.clang-format	init v_0_1_4	Jun 11, 2024
.clang-tidy	.clang-tidy	enable clang_tidy	Sep 15, 2024
.gitignore	.gitignore	ELF with master script	Dec 6, 2024
BENCHMARKING.md	BENCHMARKING.md	add a guide to how to benchmark your own data	Dec 1, 2024
CMakeLists.txt	CMakeLists.txt	use newer version of googletest	Dec 4, 2024
LICENSE	LICENSE	init v_0_1_4	Jun 11, 2024
PRIMITIVES.md	PRIMITIVES.md	init v_0_1_4	Jun 11, 2024
README.md	README.md	Announce SIGMOD 2024 Best Artifact Award in README (#39 )	Apr 24, 2025
alp_results.png	alp_results.png	Announce SIGMOD 2024 Best Artifact Award in README (#39 )	Apr 24, 2025
availability_reproducibility_initiative_report.md	availability_reproducibility_initiative_report.md	ELF with master script	Dec 6, 2024
how_to_benchmark_your_dataset.md	how_to_benchmark_your_dataset.md	add speed benchmarks as well including decompression speed and compre…	Dec 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ALP: Adaptive Lossless Floating-Point Compression

What is this repo?

📊 How does ALP perform?

🧪 How to Reproduce Results

🏅 ACM Artifacts & Awards

⏱️ Want to Benchmark Your Dataset?

🗂️ Repository Structure

📚 Publications

📄 License

📬 Contact

🧩 Used By

About

Releases

Packages

Contributors 3

Languages

License

cwida/ALP

Folders and files

Latest commit

History

Repository files navigation

ALP: Adaptive Lossless Floating-Point Compression

What is this repo?

📊 How does ALP perform?

🧪 How to Reproduce Results

🏅 ACM Artifacts & Awards

⏱️ Want to Benchmark Your Dataset?

🗂️ Repository Structure

📚 Publications

📄 License

📬 Contact

🧩 Used By

Footnotes

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages