# Async Batcher with Keras model and FastAPI

This example shows how to use the `async_batcher` library with FastAPI to process data in batches with a Keras (TensorFlow) model.

In this example, we serve a TensorFlow model using FastAPI and create two endpoints (a sketch of the app follows this list):

- `/predict` makes predictions by calling the TensorFlow model directly, one request at a time
- `/optimized_predict` makes predictions through the `async_batcher` library, which groups concurrent requests into a single batched model call
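
The sketch below shows the overall shape of such an app. It is a minimal, hedged illustration rather than the repository's actual `main.py`: the model path and the `features` request schema are invented for the example, and the `AsyncBatcher` subclassing pattern (`process_batch` to handle a batch, `process` to enqueue one item) follows my reading of the library's README, so check the library docs for exact signatures and tuning parameters.

```python
import numpy as np
from fastapi import FastAPI
from pydantic import BaseModel
from tensorflow import keras

from async_batcher.batcher import AsyncBatcher

app = FastAPI()
model = keras.models.load_model("model.keras")  # hypothetical model path


class PredictRequest(BaseModel):
    features: list[float]  # hypothetical input schema


class ModelBatcher(AsyncBatcher):
    async def process_batch(self, batch):
        # One forward pass over the whole batch instead of one per request
        return model.predict(np.array(batch)).tolist()


# max_batch_size per the library README; other tuning knobs (e.g. max queue
# wait time) are documented by the library itself
batcher = ModelBatcher(max_batch_size=64)


@app.post("/predict")
async def predict(request: PredictRequest):
    # Direct, unbatched call: one predict() per request
    return {"prediction": model.predict(np.array([request.features]))[0].tolist()}


@app.post("/optimized_predict")
async def optimized_predict(request: PredictRequest):
    # Enqueue the item; async_batcher groups concurrent requests into one batch
    return {"prediction": await batcher.process(item=request.features)}
```

The gain comes from amortization: under concurrent load, a single `model.predict` over a batch of N inputs is substantially cheaper than N separate single-input calls.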

## How to use

```bash
# Install the packages
pip install -r requirements.txt

# Run the FastAPI server with Uvicorn
PYTHONPATH=$PYTHONPATH:$(git rev-parse --show-toplevel) uvicorn main:app --reload
```
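
The `PYTHONPATH` prefix puts the repository root on the import path so that `main.py` can import `async_batcher` directly from the source tree. Once the server is up (by default at http://127.0.0.1:8000), you can smoke-test both endpoints. The snippet below is a hedged example: it uses `httpx`, which is not necessarily in `requirements.txt`, and assumes the hypothetical `features` schema from the sketch above.

```python
import httpx

payload = {"features": [0.1, 0.2, 0.3, 0.4]}  # hypothetical input
for endpoint in ("/predict", "/optimized_predict"):
    response = httpx.post(f"http://127.0.0.1:8000{endpoint}", json=payload)
    print(endpoint, response.json())
```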

## Load testing

To compare the performance of the `/predict` and `/optimized_predict` endpoints, we can use the Locust load-testing tool to simulate multiple users making concurrent requests to the server.

Run one of the provided Locust files to start the load test:

```bash
locust -f predict_load_test.py
```

Then, open the web interface at http://localhost:8089 and start the load test.
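
For reference, a Locust file for this kind of test typically follows the standard `HttpUser` pattern. This is a hedged sketch, not the actual contents of `predict_load_test.py`; the payload shape is the hypothetical one used above:

```python
from locust import HttpUser, between, task


class PredictUser(HttpUser):
    # Wait 0.1-1 s between tasks to simulate user think time
    wait_time = between(0.1, 1)

    @task
    def predict(self):
        # Point this at /optimized_predict in the other load test file
        self.client.post("/predict", json={"features": [0.1, 0.2, 0.3, 0.4]})
```

Under concurrent users, the `/optimized_predict` variant should show higher throughput, since the batcher collapses many in-flight requests into fewer model calls.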