Skip to content

bjoernmeier/kartothek

 
 

Repository files navigation

========= kartothek

Datasets are a collection of files with the same schema that reside in a storage. kartothek offers a metadata definition to handle these datasets efficiently. In addition, the kartothek.io module provides building blocks to create and modify these datasets. Handling of I/O, tracking of dataset partitions and selecting subsets of data are handled transparently.

What is a (real) Kartothek?

A Kartothek (or more modern: Zettelkasten/Katalogkasten) is a tool to organize (high-level) information extracted from a source of information.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 99.8%
  • Shell 0.2%