v0.30.0: Major update to column API

@benmccann benmccann released this 03 Nov 16:22

With this release we wanted to strike a balance between making the implementation and maintenance of the library more approachable, reducing memory usage, and providing powerful operations.

We provide non-double numeric column types so that data can be held in memory more compactly. All computations return a DoubleColumn. The alternative would be to support every pairwise combination of numeric types, which would quickly make the library too difficult to contribute to and maintain. Users can convert a DoubleColumn to a less memory-intensive column type such as IntColumn to reduce in-memory overhead for large tables. For small tables, users may be comfortable keeping all columns as DoubleColumn.
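The trade-off described above can be illustrated with a small, self-contained sketch. It does not use the library's API: `IntCol`, `DoubleCol`, `divide`, and `asIntCol` are hypothetical names invented for this example, and rounding to the nearest integer on conversion is an assumption of the sketch. It only shows the shape of the design: compact storage types, a single double-valued result type for arithmetic, and an explicit conversion back to a narrower type when the user wants one.

```java
import java.util.Arrays;

/** Hypothetical miniature columns for illustration only; NOT the library's API. */
public class ColumnSketch {

    /** Compact storage: 4 bytes per value. */
    static final class IntCol {
        final int[] data;

        IntCol(int... data) {
            this.data = data;
        }

        // Arithmetic always widens to double, so only one result type
        // has to be implemented and maintained.
        // Assumes both columns have the same length.
        DoubleCol divide(IntCol other) {
            double[] out = new double[data.length];
            for (int i = 0; i < data.length; i++) {
                out[i] = (double) data[i] / other.data[i];
            }
            return new DoubleCol(out);
        }
    }

    /** Result type of every computation: 8 bytes per value. */
    static final class DoubleCol {
        final double[] data;

        DoubleCol(double... data) {
            this.data = data;
        }

        // Explicit, user-requested narrowing. Rounding to the nearest
        // integer is an assumption of this sketch.
        IntCol asIntCol() {
            int[] out = new int[data.length];
            for (int i = 0; i < data.length; i++) {
                out[i] = (int) Math.round(data[i]);
            }
            return new IntCol(out);
        }
    }

    public static void main(String[] args) {
        IntCol numerator = new IntCol(1, 3, 10);
        IntCol denominator = new IntCol(2, 2, 5);

        DoubleCol ratio = numerator.divide(denominator); // computed as doubles
        IntCol compact = ratio.asIntCol();               // narrowed on request

        System.out.println(Arrays.toString(ratio.data));
        System.out.println(Arrays.toString(compact.data));
    }
}
```

With a single result type, each operation is written once per input type rather than once per pair of input types, which is the maintenance saving the paragraph above refers to.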

Other improvements include:

  • Improvements to Smile machine learning integration
  • Additional CSV parsing options
  • Bug fixes

Thanks to @hallvard, @Ebalaitung, @jln-ho, and @gregorco for contributing to this release.