v0.30.0: Major update to column API
With this release we wanted to strike a balance between making the implementation and maintenance of the library more approachable, reducing memory usage, and providing powerful operations.
We provide non-double numeric column types so that data can be held in memory more compactly. All computations return `DoubleColumn`. The alternative would be to support every pairwise combination of numeric types, which would quickly make the library too difficult to contribute to and maintain. Users can convert a `DoubleColumn` to a less memory-intensive column type such as `IntColumn` to reduce in-memory overhead for large tables. For small tables, users may be comfortable keeping all columns as `DoubleColumn`.
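The trade-off described above can be sketched in plain Java. This is an illustration only, not the library's API: primitive arrays stand in for the column types, and `ColumnWidthSketch`, `multiply`, and `toInts` are hypothetical names introduced for this example. It assumes a computation always yields doubles and that narrowing back to an integer type is an explicit step chosen by the user.

```java
public class ColumnWidthSketch {

    // Hypothetical stand-in for a computation: whatever the input
    // storage type, the result is always held as doubles.
    static double[] multiply(int[] values, double factor) {
        double[] out = new double[values.length];
        for (int i = 0; i < values.length; i++) {
            out[i] = values[i] * factor;
        }
        return out;
    }

    // Hypothetical explicit narrowing step. The cast truncates toward
    // zero, so fractional parts are lost.
    static int[] toInts(double[] values) {
        int[] out = new int[values.length];
        for (int i = 0; i < values.length; i++) {
            out[i] = (int) values[i];
        }
        return out;
    }

    public static void main(String[] args) {
        int[] counts = {1, 2, 3};                 // compact storage
        double[] doubled = multiply(counts, 2.0); // computation widens to double
        int[] narrowed = toInts(doubled);         // user opts back into compact storage

        // An int takes 4 bytes per value and a double takes 8.
        System.out.println("bytes per int value: " + Integer.BYTES);
        System.out.println("bytes per double value: " + Double.BYTES);
        System.out.println("values after narrowing: " + narrowed.length);
    }
}
```

Keeping a single result type means each operation needs only one implementation per input type rather than one per pair of types. The cost is that narrowing is the caller's responsibility and can lose information.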
Other improvements include:
- Improvements to Smile machine learning integration
- Additional CSV parsing options
- Bug fixes
Thanks to @hallvard, @Ebalaitung, @jln-ho, and @gregorco for contributing to this release.