tensorflow_serving_spikes

There seem to be arbitrary latency spikes when using TensorFlow Serving. To replicate the issue, I created a dummy TensorFlow Decision Forests gradient boosted trees model and a TensorFlow Decision Forests random forest model that predict penguin types. I used a Docker image from the ml6 team that supports TensorFlow Decision Forests ops in TensorFlow Serving. I then send requests to the fake_model_id predict URL and track prediction times and spikes in prediction times.

The average prediction time is around 6 ms, and I define a spike as any prediction time greater than 20 ms. I also read the result of each request before moving on to the next one, to avoid queueing.
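The measurement described above can be sketched roughly as follows. This is not the repo's tfdf_serving.py. The function names `time_requests` and `summarize` are hypothetical, and `send_request` stands for any callable that sends one predict request and blocks until the response body has been read.

```python
import time
from statistics import mean

SPIKE_THRESHOLD_MS = 20.0  # spike threshold taken from the text above


def time_requests(send_request, n_requests):
    """Call send_request() one at a time and return each latency in ms.

    send_request is a hypothetical callable; it must block until the full
    response has been read so that requests never queue behind each other.
    """
    latencies_ms = []
    for _ in range(n_requests):
        start = time.perf_counter()
        send_request()
        latencies_ms.append((time.perf_counter() - start) * 1000.0)
    return latencies_ms


def summarize(latencies_ms, threshold_ms=SPIKE_THRESHOLD_MS):
    """Return (mean latency in ms, number of latencies above the threshold)."""
    spike_count = sum(1 for t in latencies_ms if t > threshold_ms)
    return mean(latencies_ms), spike_count
```

With the `requests` library, `send_request` could be something like `lambda: requests.post(url, json=payload).json()`, where `url` and `payload` are placeholders for the predict URL and the penguin features.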

The docker image can be found here: https://hub.docker.com/r/ml6team/tf-serving-tfdf

To run (defaults to tensorflow decision forest gbt model)

1. Clone the repo: `git clone https://github.com/shayansadeghieh/tensorflow_serving_spikes.git`
2. Enter the root directory: `cd tensorflow_serving_spikes`
3. From the tensorflow_serving_spikes root directory, run `bash serving.sh fake_model_id_gbt` in your terminal to start up the Docker container.
4. From the tensorflow_serving_spikes root directory, run `python tfdf_serving.py` in your terminal. It will output the number of latency spikes that occur and provide a distribution plot.

NOTE: to use the rf model, run bash serving.sh fake_model_id_rf in step 3 and change the predict URL in the tfdf_serving.py script to use fake_model_id_rf.
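For reference, TensorFlow Serving's REST API exposes prediction at `/v1/models/<model_name>:predict`. Below is a minimal sketch of building that URL for either model. It assumes the container publishes the REST endpoint on localhost port 8501, which is TensorFlow Serving's usual REST port; the port actually mapped by serving.sh may differ. The helper name `predict_url` is hypothetical and does not come from the repo.

```python
def predict_url(model_name, host="localhost", port=8501):
    """Build a TensorFlow Serving REST predict URL.

    host and port are assumptions; adjust them to match serving.sh.
    """
    return f"http://{host}:{port}/v1/models/{model_name}:predict"


GBT_URL = predict_url("fake_model_id_gbt")
RF_URL = predict_url("fake_model_id_rf")
```

Switching models then only means changing the model name passed to the helper.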

When I run for 10000 requests with the gbt model I get:

The mean pred time was 6.1302518830269275 ms
Out of 10000 requests there were 38 spikes

The distribution plot looks like this:

But if you zoom past the 20ms point, you'll see the spikes

When I run for 10000 requests with the rf model I get:

The mean pred time was 5.550512391635227 ms
Out of 10000 requests there were 32 spikes
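
To put the two runs side by side, the spike counts above can be turned into rates. This is only arithmetic on the reported numbers; the helper name `spike_rate_percent` is hypothetical.

```python
def spike_rate_percent(spike_count, total_requests):
    """Share of requests that exceeded the spike threshold, in percent."""
    return 100.0 * spike_count / total_requests


# (spikes, total requests) as reported above for each model
reported = {"gbt": (38, 10000), "rf": (32, 10000)}

for model, (spikes, total) in reported.items():
    print(f"{model}: {spike_rate_percent(spikes, total):.2f}% of requests spiked")
```

Both rates are well under 1% of requests, so the spikes sit in the far tail of the distribution and barely move the mean.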
