GitHub - metrico/kompactor: Parquet + Metadata Compactor for InfluxDB 3 Core

Kompactor is a DuckDB powered Parquet + Metadata data compactor for InfluxDB3 Core or "FDAP" stack

Warning

⚠️ Experimental, Untested & Unstable - keep backups!

Overview

Kompactor extends the lifetime and performance of InfluxDB3 Core by

Reading InfluxDB 3 snapshot metadata from JSON files
Compacting and time sorting multiple parquet files daily/hourly
Updating metadata to reflect the new compacted file structure
Maintaining correct min/max time ranges and statistics for readers

Prerequisites

InfluxDB 3 Core w/ File storage
Bun runtime environment
DuckDB node API package

Usage

Basic usage pattern:

bun run kompactor.ts <data-dir> --hosts <host1,host2,...> [options]

Arguments

Arguments:
    data-dir     Root data directory (e.g., /data)

Options:
    --hosts      Comma-separated list of host folders to process (e.g., my_host,other_host)
    --dry-run    Run without making any changes
    --verbose    Enable detailed logging
    --help       Show this help message

Example:
    bun run kompactor.ts /data --hosts my_host --dry-run --verbose
    bun run kompactor.ts /data --hosts my_host,other_host --verbose

Examples

# Dry run with verbose output
bun kompactor.ts ./data --hosts my_host --dry-run --verbose

# Actual compaction
bun kompactor.ts ./data --hosts my_host ./compacted

Features

Merges multiple parquet files while maintaining time-based sorting
Preserves metadata structure and relationships
Calculates and updates aggregate statistics
Supports dry-run mode for validation
Detailed logging in verbose mode
Uses DuckDB for efficient parquet file operations
Automatic cleanup of DuckDB resources

Input Format

The tool expects snapshot metadata files with the format:

{
  "writer_id": "host_name",
  "parquet_size_bytes": 398790,
  "row_count": 6854,
  "min_time": 1737928861362000000,
  "max_time": 1737930192543000000,
  "databases": [
    [
      0,
      {
        "tables": [
          [
            3,
            [
              {
                "id": 14,
                "path": "host/dbs/db-0/table-3/2025-01-26/22-00/file.parquet",
                "size_bytes": 10377,
                "row_count": 50,
                "chunk_time": 1737928800000000000,
                "min_time": 1737928874762000000,
                "max_time": 1737929170992000000
              }
            ]
          ]
        ]
      }
    ]
  ]
}

Development

The project is written in TypeScript and uses:

Bun runtime for modern JavaScript/TypeScript execution
DuckDB for parquet file operations
Node.js fs/promises API for file system operations

Motivation and background information from this blog post ¹ ↩
Bun™, DuckDB™, InfluxDB™ and any other trademarks, service marks, trade names, and product names referenced in this documentation are the property of their respective owners. The use of any trademark, trade name, or product name is for descriptive purposes only and does not imply any affiliation with or endorsement by the trademark owner. All product names, logos, brands, trademarks, and registered trademarks mentioned herein are the property of their respective owners. They are used in this documentation for identification purposes only. Use of these names, logos, trademarks, and brands does not imply endorsement, sponsorship, or affiliation. This project is independent and not affiliated with, endorsed by, or sponsored by any of the companies whose products or technologies are mentioned in this documentation. ² ↩

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
LICENSE		LICENSE
README.md		README.md
jsconfig.json		jsconfig.json
kompactor.ts		kompactor.ts
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

Prerequisites

Usage

Arguments

Examples

Features

Input Format

Development

About

Releases 1

Packages

Languages

License

metrico/kompactor

Folders and files

Latest commit

History

Repository files navigation

Overview

Prerequisites

Usage

Arguments

Examples

Features

Input Format

Development

Footnotes

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages