It looks like batching was added in #437 - thank you for implementing this; it's very helpful.
I notice that batching, as implemented there, depends on a fixed batch size. This can be suboptimal for clients submitting a large number of small documents: since there is no way to configure the `ThreadPoolExecutor` size, many small payloads cannot be parallelized, and a client can end up blocked waiting on a long series of small network responses.
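For context, this is roughly the workaround it forces on the client side today (a minimal sketch; `embed_small_docs`, `chunked`, and the parameter defaults are illustrative, and it assumes `embed` accepts a list of documents and returns one embedding per document):

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import islice

def chunked(items, size):
    """Yield successive fixed-size chunks from a list of items."""
    it = iter(items)
    while chunk := list(islice(it, size)):
        yield chunk

def embed_small_docs(documents, batch_size=8, max_workers=16):
    """Fan out many small embed() calls rather than blocking on each one."""
    # embed() is the library's batch-embedding entry point (assumed signature).
    with ThreadPoolExecutor(max_workers=max_workers) as executor:
        # executor.map preserves input order across the parallel calls.
        per_batch = list(executor.map(embed, chunked(documents, batch_size)))
    # Flatten the per-batch results back into one list of embeddings.
    return [emb for batch in per_batch for emb in batch]
```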
Would it be possible to allow clients to configure either the `ThreadPoolExecutor` size or the `embed_batch_size` setting when calling `embed`?
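As a sketch of what either option could look like from the caller's side (both keyword arguments are hypothetical, not part of the current API):

```python
# Option 1: override the batch size per call (embed_batch_size is hypothetical
# as a per-call argument).
embeddings = embed(documents, embed_batch_size=4)

# Option 2: let callers size the internal ThreadPoolExecutor (max_workers is
# a hypothetical argument).
embeddings = embed(documents, max_workers=32)
```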