
[GH Issue Summarization] Create a model server #11

Closed
texasmichelle opened this issue Feb 21, 2018 · 8 comments
@texasmichelle
Member

texasmichelle commented Feb 21, 2018

Create a model server using TFServing.

Component of #14.

@jlewi
Contributor

jlewi commented Mar 6, 2018

/assign @ankushagarwal

Ankush, can you describe the problems you ran into converting the model into one that can be served with TF Serving?

Would it be easier to serve the model using Seldon?

@ankushagarwal

ankushagarwal commented Mar 6, 2018

The model used for issue summarization is very different from the examples that we've been using. For our image models, the model prediction looks something like this: output = model(input)

But for the issue summarization model, it looks something like this:

def summarize(input_seq):
    output = '<START>'
    # Encode the whole input sequence once.
    intermediate_result = encoder_model(input_seq)
    # Then decode one character at a time until the stop token is produced.
    while True:
        intermediate_result, next_char = decoder_model(intermediate_result, output)
        if next_char == '<STOP>':
            return output
        output += next_char

The first issue I had was exporting Keras models as TensorFlow models that can be used by TF Serving; this is mostly done.
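For reference, a minimal sketch of what that export step can look like, assuming TF 1.x-era Keras where tf.saved_model.simple_save is available; the function name, signature keys, and export path below are placeholders rather than the exact code used here:

import tensorflow as tf
from keras import backend as K

def export_for_tf_serving(model, export_dir):
    # Write the Keras model's input/output tensors as a SavedModel signature
    # that TF Serving can load. The encoder and decoder would each be
    # exported separately (or under separate signatures).
    tf.saved_model.simple_save(
        K.get_session(),
        export_dir,
        inputs={"input": model.input},
        outputs={"output": model.output},
    )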

The second challenge I have is understanding how TF Serving works with:

  1. multiple models (encoder_model and decoder_model)
  2. models with multiple inputs and multiple outputs (decoder_model)

Would it be easier to serve the model using Seldon?
I am not familiar enough with Seldon...

@ankushagarwal

I am having issues importing the model exported from Keras into TF Serving.

I get this error when I send a Prediction request to the TFServing server

AbortionError: AbortionError(code=StatusCode.INVALID_ARGUMENT, details="Expected multiples argument to be a vector of length 3 but got length 2
[[Node: Encoder-Last-GRU_1/Tile = Tile[T=DT_FLOAT, Tmultiples=DT_INT32, _device="/job:localhost/replica:0/task:0/device:CPU:0"](Encoder-Last-GRU_1/ExpandDims, Encoder-Last-GRU_1/Tile/multiples)]]")

Could not find a workaround for this. Will give Seldon or Tornado a shot at serving this Keras model.

We can probably illustrate serving a model with TF Serving in another example that trains a TensorFlow model directly.
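As a rough illustration of the Tornado fallback mentioned above (not the actual server added later), a minimal sketch; the /predict route, the request field, and the load_models helper are hypothetical:

import json
import tornado.ioloop
import tornado.web

class PredictHandler(tornado.web.RequestHandler):
    def initialize(self, summarize):
        self.summarize = summarize

    def post(self):
        body = json.loads(self.request.body)
        issue_text = body["issue_body"]      # hypothetical request field
        self.write({"summary": self.summarize(issue_text)})

if __name__ == "__main__":
    summarize = load_models()  # hypothetical: loads encoder/decoder and returns the decode loop
    app = tornado.web.Application([(r"/predict", PredictHandler, {"summarize": summarize})])
    app.listen(8888)
    tornado.ioloop.IOLoop.current().start()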

jlewi added a commit that referenced this issue Mar 8, 2018
Create a simple tornado server to serve the model

TODO: Create a docker image for the server and deploy on kubeflow

Related to #11
@jlewi
Contributor

jlewi commented Mar 8, 2018

@cliveseldon @gsunner Do you think we should try to use Seldon here?

Could we use the existing Seldon model server rather than creating our own Tornado stub?

Should we deploy the model using Seldon Core rather than deploying it directly with K8s resources?

@ukclivecox
Contributor

@jlewi For a sklearn model, seldon-core would seem to be a good choice.

@ankushagarwal You specify a seq-to-seq model, but does the external business app send the whole sequence of characters in a single request to get a sequence back? If so, that should fit fine into the seldon-core prediction payload using NDArray. Your prediction component would need to split the request and then do as you specify in the pseudo-code above.

Suggest you look at https://github.com/kubeflow/example-seldon which contains a sklearn model in the example code.

@jlewi Not sure I follow your last two questions. It would seem preferable to use the most appropriate existing serving solution (TF Serving or seldon-core) rather than building a new one.
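To make the wrapping step above concrete, a rough sketch of a seldon-core Python wrapper class, assuming the convention of a class exposing predict(self, X, feature_names); the class name, the load_seq2seq_models helper, and the decode loop are assumptions based on the pseudo-code earlier in this thread:

class IssueSummarization(object):
    def __init__(self):
        # Hypothetical helper that loads the exported Keras encoder and decoder models.
        self.encoder_model, self.decoder_model = load_seq2seq_models()

    def predict(self, X, feature_names):
        # X arrives as the "ndarray" payload, e.g. a tokenized issue body.
        intermediate_result = self.encoder_model(X)
        output = '<START>'
        while True:
            intermediate_result, next_char = self.decoder_model(intermediate_result, output)
            if next_char == '<STOP>':
                return [output]
            output += next_char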

@ankushagarwal

Hi @cliveseldon, I have followed the instructions at https://github.com/SeldonIO/seldon-core/blob/master/docs/wrappers/python.md and wrapped my model into a docker image. I am able to run the image locally, and it serves a REST API on port 5000.

My question is: what is the API for sending a prediction request to the server? I could not find docs on that.

@ukclivecox
Contributor

Hi @ankushagarwal

  • See here for the definition. You can send a Tensor, NDArray, custom string, or binary. NDArray would seem to make sense for your case.
  • For example, see some of the notebooks, e.g. in the kubeflow-seldon example, where you send something like:

payload = {"data": {"ndarray": ["the", "cat", "sat", "on", "the", "mat"]}}
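A hypothetical way to send that payload to the locally running wrapper image, assuming it exposes POST /predict on port 5000 and accepts the payload as a JSON string in a form field named json; check the python wrapper docs linked above for the exact contract:

import json
import requests

payload = {"data": {"ndarray": ["the", "cat", "sat", "on", "the", "mat"]}}
# Assumed endpoint and request encoding; verify against the seldon-core wrapper docs.
response = requests.post("http://localhost:5000/predict", data={"json": json.dumps(payload)})
print(response.json())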

k8s-ci-robot pushed a commit that referenced this issue Mar 9, 2018
… the model (#36)

* Create a end-to-end kubeflow example using seq2seq model (4/n)

* Move from a custom tornado server to a seldon-core model

Related to #11

* Update to use gcr.io registry for serving image
k8s-ci-robot pushed a commit that referenced this issue Mar 15, 2018
Update the issue summarization end to end tutorial
to deploy the seldon core model to the k8s cluster

Update the sample request and response

Related to #11
@ankushagarwal

Closing since we have a seldon model server.
