Replies: 3 comments
-
shouldn't it just be

```python
@svc.api(route='/execute/main1',
         input=bentoml.io.JSON(),
         output=bentoml.io.JSON())
async def main1():
    await sub1()
    return {'message': 'success1'}
```
?
-
You need a synchronization primitive to communicate between the two coroutines main1 and main2. But this looks more like an architectural design problem: both main1 and main2 are async endpoints that can receive multiple requests simultaneously. If several sub1 calls are running at the same time, which one should main2 wait for?
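For the single-request case, one such synchronization primitive is `asyncio.Event`: main1 fires sub1 as a background task, sub1 sets the event when it finishes, and main2 waits on it. This is a minimal sketch (the function bodies, the `done` parameter, and `demo` are illustrative, not BentoML APIs), and it inherits the caveat above: with a shared event it is only safe when at most one sub1 is in flight.

```python
import asyncio

async def sub1(done: asyncio.Event):
    await asyncio.sleep(0.05)        # stand-in for the real background work
    done.set()                       # signal completion to anyone waiting

async def main1(done: asyncio.Event):
    asyncio.create_task(sub1(done))  # fire sub1 without awaiting it
    return {"message": "success1"}

async def main2(done: asyncio.Event):
    await done.wait()                # block until sub1 signals completion
    return {"message": "success2"}

async def demo():
    done = asyncio.Event()
    r1 = await main1(done)           # returns immediately
    r2 = await main2(done)           # returns only after sub1 has finished
    return r1, r2

print(asyncio.run(demo()))  # → ({'message': 'success1'}, {'message': 'success2'})
```

For multiple concurrent requests you would need a per-request primitive (for example, an event keyed by request id) rather than one shared event.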
-
Starlette supports BackgroundTask:

```python
import asyncio

import bentoml
import numpy as np
import torch
from fastapi import FastAPI
from starlette.background import BackgroundTask
from starlette.responses import JSONResponse
from torch import Tensor

runner = bentoml.pytorch.get("iris_clf:latest").to_runner()
svc = bentoml.Service(name="sample-dummy-bento", runners=[runner])

# fastapi app mounted on the bentoml Service
fastapi_app = FastAPI()
svc.mount_asgi_app(fastapi_app)

async def sub1(input_array: list[list[float]]):
    print("task1 start~~~~")
    res = request.post()  # placeholder; highly recommend to use an httpx async client instead of requests
    print(f"sub1: {res}")

async def sub2(input_array: list[list[float]]):
    print("task2 start~~~~")
    res = request.post()  # placeholder; highly recommend to use an httpx async client instead of requests
    print(f"sub2: {res}")

@fastapi_app.post("/predict-fastapi")
def predict_fastapi(input_array: list[list[float]]):
    task = BackgroundTask(sub1, input_array=input_array)
    return JSONResponse({"message": "success"}, background=task)

@fastapi_app.post("/predict2-fastapi")
def predict2_fastapi(input_array: list[list[float]]):
    task = BackgroundTask(sub2, input_array=input_array)
    return JSONResponse({"message": "success"}, background=task)
```

This issue was resolved in the BentoML Slack KR channel.
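The key property of BackgroundTask is ordering: Starlette sends the response body first and only then awaits the task. A minimal pure-asyncio sketch of that ordering (`log`, `background_work`, and `send_response` are illustrative names mimicking the behavior, not Starlette APIs):

```python
import asyncio

log = []

async def background_work():
    # Runs only after the response body has been written.
    log.append("background task ran")

async def send_response(body, background=None):
    # Mimics what Starlette's Response.__call__ does with a BackgroundTask:
    # send the response first, then await the background task.
    log.append(f"response sent: {body}")
    if background is not None:
        await background()

asyncio.run(send_response({"message": "success"}, background=background_work))
print(log)  # → ["response sent: {'message': 'success'}", 'background task ran']
```

This is why the client sees `{"message": "success"}` immediately while sub1/sub2 keep running.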
-
I've developed multiple APIs using BentoML, and the general structure of my script looks like the following:
There are two endpoints, main1 and main2, which receive requests almost simultaneously. I'm aiming for an asynchronous processing flow where the success message is returned before the sub functions have fully executed. However, I'm facing an issue: main1 returns its success message immediately as intended, but main2 returns its success message without waiting for sub1 to complete first.
Current Situation:
Desired Situation:
Here is a simplified version of my code:
If you have any references or advice that could help me achieve the desired behavior, I would greatly appreciate it! Thank you!
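The difference between `await sub1()` and `asyncio.create_task(sub1())` is what decides whether the success message goes out before sub1 finishes. A minimal sketch of the two variants (`main1_blocking` and `main1_background` are illustrative names, not from the question's code):

```python
import asyncio
import time

async def sub1():
    await asyncio.sleep(0.2)  # stand-in for the slow sub work

async def main1_blocking():
    await sub1()              # response is held until sub1 finishes
    return {"message": "success1"}

async def main1_background():
    task = asyncio.create_task(sub1())  # sub1 keeps running after we return
    return {"message": "success1"}, task

async def demo():
    t0 = time.monotonic()
    await main1_blocking()
    blocking_s = time.monotonic() - t0   # ~0.2 s: waited for sub1

    t0 = time.monotonic()
    _, task = await main1_background()
    background_s = time.monotonic() - t0  # near-zero: returned immediately
    await task  # let sub1 finish before the event loop closes
    return blocking_s, background_s

blocking_s, background_s = asyncio.run(demo())
print(blocking_s >= 0.2, background_s < 0.1)  # → True True
```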