Commit
[docs] Adds document for MAX_NETTY_BUFFER_SIZE env var (deepjavalibra…
frankfliu authored Jul 15, 2024
1 parent 4a77c76 commit 22551ed
1 change: 1 addition & 0 deletions serving/docs/configurations.md
@@ -72,6 +72,7 @@ DJLServing build on top of Deep Java Library (DJL). Here is a list of settings f
| DJL_ENTRY_POINT | env var | The entrypoint python file or module, default: model.py |
| MODEL_LOADING_TIMEOUT | env var | Python worker load model timeout: default: 240 seconds |
| PREDICT_TIMEOUT | env var | Python predict call timeout, default: 120 seconds |
| MAX_NETTY_BUFFER_SIZE | env var/system prop | Max response size in bytes, default 20 * 1024 * 1024 (20M) |
| DJL_VENV_DIR | env var/system prop | The venv directory, default: $DJL_CACHE_DIR/venv |
| ai.djl.python.disable_alternative | system prop | Disable alternative engine |
| TENSOR_PARALLEL_DEGREE | env var | Set tensor parallel degree.<br>For mpi mode, the default is number of accelerators.<br>Use "max" for non-mpi mode to use all GPUs for tensor parallel. |
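The newly documented `MAX_NETTY_BUFFER_SIZE` setting can be supplied as an environment variable before launching the server. A minimal sketch (the 64 MB value is an arbitrary example, and the server launch command is elided since it depends on your install):

```shell
# Raise the maximum response size from the default 20 MB to 64 MB.
# The value is in bytes, per the configurations.md table above.
export MAX_NETTY_BUFFER_SIZE=$((64 * 1024 * 1024))
echo "MAX_NETTY_BUFFER_SIZE=$MAX_NETTY_BUFFER_SIZE"
# ...then start DJLServing in this environment so it picks up the limit.
```

Since the table lists it as "env var/system prop", it can alternatively be passed to the JVM as a system property; the exact property name is not shown in this diff, so the environment-variable form above is the safer illustration.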
