Skip to content

Releases: googleapis/python-bigquery-dataframes

v1.23.0

23 Oct 21:07
e8d689e
Compare
Choose a tag to compare

1.23.0 (2024-10-23)

Features

  • Add bigframes.bigquery.create_vector_index to assist in creating vector index on ARRAY<FLOAT64> columns (#1024) (863d694)
  • Add gemini-1.5-pro-002 and gemini-1.5-flash-002 to known Gemini model list. (#1105) (7094c85)
  • Add support for pandas series & data frames as inputs for ml models. (#1088) (30c8883)
  • Cleanup temp resources with session deletion (#1068) (1d5373d)
  • Show possible correct key(s) in .__getitem__ KeyError message (#1097) (32fab96)
  • Support uploading local geo data (#1036) (51cdd33)

Bug Fixes

  • Escape ids more consistently in ml module (#1074) (103e998)
  • Model.fit metric not collected issue. (#1085) (06cec00)
  • Remove index requirement from some dataframe APIs (#1073) (2d16f6d)
  • Update session metrics in read_gbq_query (#1084) (dced460)

Performance Improvements

  • Speed up tree transforms during sql compile (#1071) (d73fe9d)
  • Utilize ORDER BY LIMIT over ROW_NUMBER where possible (#1077) (7003d1a)

Documentation

  • Add ml tutorial for Evaluate the model (#1038) (a120bae)
  • Show best practice of closing the session to cleanup resources in sample notebooks (#1095) (62a88e8)
  • Update docstrings of Session and related files (#1087) (bf93e80)

v1.22.0

09 Oct 22:52
8821dd4
Compare
Choose a tag to compare

1.22.0 (2024-10-09)

Features

  • Support regional endpoints for more bigquery locations (#1061) (45b672a)
  • Update LLM generators to warn user about model name instead of raising error. (#1048) (650d80d)

Bug Fixes

  • Access MATERIALIZED_VIEW with read_gbq (#1070) (601e984)
  • Correct zero row count in DataFrame from table view (#1062) (b536070)
  • Fix generic error message when entering an incorrect column name (#1031) (5ac217d)
  • Make explode respect the index labels (#1064) (99ca0df)
  • Make invalid location warning case-insensitive (#1044) (b6cd55a)
  • Remove palm2 test case from llm load test (#1063) (575a10a)
  • Show warning for unknown location set through .ctor (#1052) (02c2da7)

Performance Improvements

Documentation

  • Add docstring return type section to BigQueryOptions class (#964) (307385f)

v1.21.0

02 Oct 04:25
56fec28
Compare
Choose a tag to compare

1.21.0 (2024-10-02)

Features

  • Add deprecation warning to PaLM2TextGenerator model (#1035) (1183b0f)
  • Add DeprecationWarning for PaLM2TextEmbeddingGenerator (#1018) (4af5bbb)
  • Add ml.model_selection.cross_validate support (#1020) (1a38063)
  • Allow access of struct fields with dot operators on Series (#1019) (ef76f13)

Bug Fixes

  • Ensure no double execution for to_pandas (#1032) (4992cc2)
  • Remove pre-caching of remote function results (#1028) (0359bc8)

Documentation

v1.20.0

26 Sep 07:33
7aaef6f
Compare
Choose a tag to compare

1.20.0 (2024-09-25)

Features

  • Add bigframes.bigquery.approx_top_count (#1010) (3263bd7)
  • Add bigframes.ml.compose.SQLScalarColumnTransformer to create custom SQL-based transformations (#955) (1930b4e)
  • Allow multiple columns input for llm models (#998) (2fe5e48)

Bug Fixes

Documentation

  • Limit pypi notebook to 7 days and add more info about differences with partial ordering mode (#1013) (3c54399)
  • Move and edit existing linear-regression tutorial snippet (#991) (4cb62fd)

v1.19.0

24 Sep 02:27
a95493d
Compare
Choose a tag to compare

1.19.0 (2024-09-24)

Features

  • Add ml.model_selection.KFold class (#1001) (952cab9)
  • Support bool and bytes types in describe(include='all') (#994) (cc48f58)
  • Support ingress settings in remote_function (#1011) (8e9919b)

Bug Fixes

Performance Improvements

  • Join op discards child ordering in unordered mode (#923) (1b5b0ee)

Dependencies

v1.18.0

18 Sep 20:21
d2fb49f
Compare
Choose a tag to compare

1.18.0 (2024-09-18)

Features

  • Add "include" param to describe for string types (#973) (deac6d2)
  • Add subset parameter to DataFrame.dropna to select which columns to consider (#981) (f7c03dc)

Bug Fixes

  • DataFrameGroupby.agg now works with unnamed tuples (#985) (0f047b4)
  • Fix a bug that raises exception when re-indexing columns with their original order (#988) (596b03b)
  • Make the Series.apply outcome assignable to the original dataframe in partial ordering mode (#874) (c94ead9)

Dependencies

  • Limit ibis-framework version to 9.2.0 (#989) (06c1b33)
  • Update to ibis-framework 9.x and newer sqlglot (#827) (89ea44f)

v1.17.0

11 Sep 18:34
d42d674
Compare
Choose a tag to compare

1.17.0 (2024-09-11)

Features

  • Add __version__ alias to bigframes.pandas (#967) (9ce10b4)
  • Add Gemini 1.5 stable models support (#945) (c1cde19)
  • Allow setting table labels in to_gbq (#941) (cccc6ca)
  • Define list accessor for bigframes Series (#946) (8e8279d)
  • Enable read_csv() to process other files (#940) (3b35860)
  • Include the bigframes package version alongside the feedback link in error messages (#936) (7b59b6d)

Bug Fixes

  • Astype Decimal to Int64 conversion. (#957) (27764a6)
  • Make read_gbq_function work for multi-param functions (#947) (c750be6)
  • Support read_gbq_function for axis=1 application (#950) (86e54b1)

Documentation

  • Add docstring returns section to Options (#937) (a2640a2)
  • Update title of pypi notebook example to reflect use of the PyPI public dataset (#952) (cd62e60)

v1.16.0

04 Sep 20:53
6fdb6b1
Compare
Choose a tag to compare

1.16.0 (2024-09-04)

Features

  • Add DataFrame.struct.explode to add struct subfields to a DataFrame (#916) (ad2f75e)
  • Implement bigframes.bigquery.json_extract_array (#910) (575a29e)
  • Recover struct column from exploded Series (#904) (7dd304c)

Bug Fixes

  • Fix issue with iterating on >10gb dataframes (#949) (2b0f0fa)
  • Improve Series.replace for dict input (#907) (4208044)
  • NullIndex in ML model.predict error (#917) (612271d)
  • Struct field non-nullable type issue. (#914) (149d5ff)
  • Unordered mode errors in ml train_test_split (#925) (85d7c21)

Performance Improvements

Dependencies

  • Re-introduce support for numpy 1.24.x (#931) (3d71913)
  • Update minimum support to Pandas 1.5.3 and Pyarrow 10.0.1 (#903) (7ed3962)

Documentation

  • Add Claude3 ML and RemoteFunc notebooks (#930) (cfd16c1)
  • Create sample notebook to manipulate struct and array data (#883) (3031903)
  • Update struct examples. (#953) (d632cd0)
  • Use unstack() from BigQuery DataFrames instead of pandas in the PyPI sample notebook (#890) (d1883cc)

v1.15.0

20 Aug 18:44
e43e0e5
Compare
Choose a tag to compare

1.15.0 (2024-08-20)

Features

  • Add llm.TextEmbeddingGenerator to support new embedding models (#905) (6bc6a41)
  • Add ml.llm.Claude3TextGenerator model (#901) (7050038)

Documentation

  • Add columns for "requires ordering/index" to supported APIs summary (#892) (d2fc51a)
  • Remove duplicate description for kms_key_name (#898) (1053d56)
  • Update embedding model notebooks (#906) (d9b8ef5)

v1.14.0

14 Aug 02:21
ae07274
Compare
Choose a tag to compare

1.14.0 (2024-08-14)

Features

  • Implement bigframes.bigquery.json_extract (#868) (3dbf84b)
  • Implement Series.str.__getitem__ (#897) (e027b7e)

Bug Fixes

  • Fix caching from generating row numbers in partial ordering mode (#872) (52b7786)

Performance Improvements

  • Generate SQL with fewer CTEs (#877) (eb60804)
  • Speed up compilation by reducing redundant type normalization (#896) (e0b11bc)

Documentation