
feat: support batch inference #3099

Closed

wants to merge 25 commits

Conversation

qu8n
Contributor

@qu8n qu8n commented Oct 14, 2022

What does this PR address?

BentoML supports serving models online over HTTP and gRPC, but has limited support for batch and streaming inference. Integrating with Apache Spark helps bridge that gap. This PR allows users to create Spark UDFs from BentoML-packaged ML models in different types of environments.

Fixes #890
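Conceptually, the integration wraps a Bento's predict call in a Spark pandas UDF, so each partition of a DataFrame is scored as a stream of batches. Below is a minimal, Spark-free sketch of that shape; `make_batch_scorer` and the stand-in model are illustrative names, not this PR's actual API.

```python
from typing import Callable, Iterator

import pandas as pd


def make_batch_scorer(
    predict: Callable[[pd.Series], pd.Series],
) -> Callable[[Iterator[pd.Series]], Iterator[pd.Series]]:
    """Wrap a predict call in the iterator-of-batches shape used by
    Spark's Series-to-Series pandas UDFs: each partition arrives as a
    stream of pd.Series chunks, and each chunk is scored as one batch."""
    def score(batches: Iterator[pd.Series]) -> Iterator[pd.Series]:
        for batch in batches:
            yield predict(batch)
    return score


# Stand-in "model": double every value. In the real integration this
# would call the Bento runner's predict method instead.
scorer = make_batch_scorer(lambda s: s * 2.0)

# Simulate two partitions' worth of input batches.
results = list(scorer(iter([pd.Series([1.0, 2.0]), pd.Series([3.0])])))
```

On a real cluster, a function with this signature would be registered via `pyspark.sql.functions.pandas_udf` and applied to a DataFrame column, letting Spark handle partitioning and scheduling.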

Before submitting:

@codecov

codecov bot commented Oct 14, 2022

Codecov Report

Merging #3099 (940d676) into main (2b57ccf) will decrease coverage by 0.55%.
The diff coverage is 11.76%.

Impacted file tree graph

@@            Coverage Diff             @@
##             main    #3099      +/-   ##
==========================================
- Coverage   33.45%   32.90%   -0.55%     
==========================================
  Files         106      132      +26     
  Lines        9769    10652     +883     
  Branches     1685     1771      +86     
==========================================
+ Hits         3268     3505     +237     
- Misses       6270     6914     +644     
- Partials      231      233       +2     
Impacted Files Coverage Δ
src/bentoml/_internal/service/inference_api.py 54.32% <ø> (ø)
src/bentoml/_internal/service/loader.py 38.84% <0.00%> (-0.29%) ⬇️
src/bentoml/_internal/spark.py 0.00% <0.00%> (ø)
src/bentoml/_internal/io_descriptors/pandas.py 38.25% <25.92%> (-4.70%) ⬇️
src/bentoml/_internal/io_descriptors/base.py 85.18% <100.00%> (-3.46%) ⬇️
src/bentoml/_internal/io_descriptors/multipart.py 56.32% <0.00%> (-2.02%) ⬇️
src/bentoml/_internal/utils/lazy_loader.py 78.37% <0.00%> (-1.04%) ⬇️
src/bentoml/_internal/io_descriptors/file.py 49.54% <0.00%> (-0.46%) ⬇️
... and 46 more

@qu8n qu8n mentioned this pull request Oct 15, 2022
@aarnphm
Contributor

aarnphm commented Oct 15, 2022

ah next time you can also do a git rebase to update branch history 😄

Contributor

@aarnphm aarnphm left a comment


Quick comments before I forget about them :)

Outdated, resolved review thread on src/bentoml/_internal/io_descriptors/base.py
@@ -0,0 +1,193 @@
from __future__ import annotations
Contributor


Should this be under src/bentoml/_internal/distributed/spark.py? @sauyon

Contributor


Maybe batch, but it doesn't really matter right now.

Three outdated, resolved review threads on src/bentoml/_internal/spark.py and one on src/bentoml/_internal/io_descriptors/base.py
Comment on lines 10 to 22
@svc.api(
    input=PandasDataFrame(),
    output=PandasSeries(dtype="float"),
)
async def classify1(input_series: pd.DataFrame) -> pd.Series:
    return await iris_clf_runner.predict.async_run(input_series)

@svc.api(
    input=PandasSeries(),
    output=PandasSeries(),
)
async def classify2(input_series: pd.Series) -> pd.Series:
    return await iris_clf_runner.predict.async_run(input_series)
Contributor


nit: let's name the endpoints something more helpful.

@qu8n qu8n mentioned this pull request Dec 7, 2022
@bojiang bojiang self-requested a review January 13, 2023 04:21
@aarnphm aarnphm self-requested a review January 13, 2023 08:52
@aarnphm
Contributor

aarnphm commented Jan 13, 2023

Development is now tracked at #3425

@aarnphm aarnphm closed this Jan 13, 2023
@Talador12

Awesome work on this one 🎉

Development

Successfully merging this pull request may close these issues.

feat: Support distributed batch inferencing job on Apache Spark cluster
4 participants