Description
What happened?
i'm trying to add documents to a collection, using an embedding model.
as far as i understand, the collection first creates an instance of this model and tries to save it in a cache directory.
instead of having a way to control the cache directory- it simply uses pathlib Path.home()
in my case, the home directory is full and that causes the code to crash.
the only way i can run the code is by modifying the home directory before running the script.
bug location:
site-packages\chromadb\utils\embedding_functions.py
class ONNXMiniLM_L6_V2(EmbeddingFunction[Documents]):
MODEL_NAME = "all-MiniLM-L6-v2"
DOWNLOAD_PATH = Path.home() / ".cache" / "chroma" / "onnx_models" / MODEL_NAME
EXTRACTED_FOLDER_NAME = "onnx"
ARCHIVE_FILENAME = "onnx.tar.gz"
MODEL_DOWNLOAD_URL = (
"https://chroma-onnx-models.s3.amazonaws.com/all-MiniLM-L6-v2/onnx.tar.gz"
)
_MODEL_SHA256 = "913d7300ceae3b2dbc2c50d1de4baacab4be7b9380491c27fab7418616a16ec3"
Versions
chromadb==0.4.24
Relevant log output
No response