Releases: svilupp/PromptingTools.jl
PromptingTools v0.48.0
Added
- Implements the new OpenAI structured output mode for `aiextract` (just provide the kwarg `strict=true`). See the reference blog post.
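A minimal sketch of the new mode. The `CurrencyPair` struct and the prompt are hypothetical, and a configured OpenAI API key is assumed; `return_type` is `aiextract`'s existing keyword for the extraction schema.

```julia
using PromptingTools

# Hypothetical extraction schema (illustrative only). With strict=true,
# OpenAI's structured output mode guarantees the response conforms
# to this type's JSON schema.
struct CurrencyPair
    base::String
    counter::String
    rate::Float64
end

msg = aiextract("EUR/USD is trading at 1.0950";
    return_type = CurrencyPair, strict = true)
msg.content  # a CurrencyPair instance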
PromptingTools v0.47.0
PromptingTools v0.46.0
PromptingTools v0.45.0
Breaking Change
- `getindex(::MultiIndex, ::MultiCandidateChunks)` now returns sorted chunks by default (`sorted=true`) to guarantee that the potential context (= chunks) is sorted by descending similarity score across the different sub-indices.
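A hedged sketch of the new default. The `mindex` and `mcandidates` variables stand in for a real `MultiIndex` and the `MultiCandidateChunks` produced by a retrieval step; they are not built here.

```julia
using PromptingTools
const RT = PromptingTools.Experimental.RAGTools

# Assumption: `mindex::RT.MultiIndex` and `mcandidates::RT.MultiCandidateChunks`
# come from a prior build_index / retrieve step (not shown).
# As of v0.45, sorted=true is the default, so both calls return chunks
# ordered by descending similarity score across sub-indices:
chunks = getindex(mindex, mcandidates)                 # sorted by default
chunks = getindex(mindex, mcandidates; sorted = true)  # explicit
```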
Updated
- Updated the `hcat` implementation in `RAGTools.get_embeddings` to reduce memory allocations for large embedding batches (c. 3x fewer allocations, see `hcat_truncate`).
- Updated the `length_longest_common_subsequence` signature to work only for pairs of `AbstractString`, so it does not fail silently when wrong arguments are provided.
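For example (a sketch; the function is accessed via the package module here in case it is not exported in your version, and the strings are illustrative):

```julia
using PromptingTools

# Both arguments must now be AbstractString; passing, say, a Symbol
# throws a MethodError instead of silently returning a wrong result.
n = PromptingTools.length_longest_common_subsequence("animal", "alimony")
# n == 3 (the common subsequence "a", "i", "m")
```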
Fixed
- Changed the default behavior of `getindex(::MultiIndex, ::MultiCandidateChunks)` to always return sorted chunks for consistency with other similar functions and correct `retrieve` behavior. This was accidentally changed in v0.40 and is now reverted to the original behavior.
PromptingTools v0.44.0
Added
- Added Mistral Large 2 and Mistral-Nemo to the model registry (alias `mistral-nemo`).
Fixed
- Fixed a bug where `wrap_string` would not correctly split very long Unicode words.
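For illustration, a sketch assuming `wrap_string`'s `(text, width)` argument order; the repeated Czech word is arbitrary, chosen because it contains multi-byte characters:

```julia
using PromptingTools

# A long Unicode "word" with no spaces; before v0.44 splitting such
# words could misbehave on multi-byte character boundaries.
long_word = repeat("průvodce", 5)
wrapped = PromptingTools.wrap_string(long_word, 10)
println(wrapped)
```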
PromptingTools v0.43.0
Added
- Added Llama 3.1 registry records for Fireworks.ai (aliases `fllama3`, `fllama370`, `fllama3405`, and `fls`, `flm`, `fll` for small/medium/large, similar to the other providers).
PromptingTools v0.42.0
Added
- Registered the new Meta Llama 3.1 models hosted on GroqCloud and Together.ai (eg, the Groq-hosted `gllama370` has been updated to point to the latest available model, and the 405b model now has the alias `gllama3405`). Because that's quite clunky, I've added abbreviations based on sizes small/medium/large (that is, 8b, 70b, 405b) under `gls`/`glm`/`gll` for Llama 3.1 hosted on GroqCloud (similarly, we now have `tls`/`tlm`/`tll` for Llama 3.1 on Together.ai).
- Generic model aliases for Groq and Together.ai Llama 3 models have been updated to point to the latest available models (Llama 3.1).
- Added the Gemma2 9b model hosted on GroqCloud to the model registry (alias `ggemma9`).
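Aliases like these are passed via the `model` keyword of the `ai*` functions. A minimal sketch (assumes the relevant GroqCloud / Together.ai API keys are already configured):

```julia
using PromptingTools

# Llama 3.1 70b ("medium") on GroqCloud via its size alias
msg = aigenerate("Say hi!"; model = "glm")

# Llama 3.1 405b ("large") on Together.ai via its size alias
msg = aigenerate("Say hi!"; model = "tll")
```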
Updated
- Minor optimizations to `SubDocumentTermMatrix` to reduce memory allocations and improve performance.
PromptingTools v0.41.0
Added
- Introduced a "view" of `DocumentTermMatrix` (= `SubDocumentTermMatrix`) to allow views of keyword-based indices (`ChunkKeywordsIndex`). It's not a pure view (the TF matrix is materialized to prevent performance degradation).
Fixed
- Fixed a bug in `find_closest(finder::BM25Similarity, ...)` where the view of `DocumentTermMatrix` (ie, `view(DocumentTermMatrix(...), ...)`) was undefined.
- Fixed a bug where a view of a view of a `ChunkIndex` wouldn't intersect the positions (it was returning only the latest requested positions).
PromptingTools v0.40.0
Added
- Introduces `RAGTools.SubChunkIndex` to allow projecting views of various indices. Useful for pre-filtering your data (faster and more precise retrieval). See `?RT.SubChunkIndex` for more information and how to use it.
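A hedged sketch of the intended pattern, assuming projection works via `Base.view` on an index plus a candidate set (consult `?RT.SubChunkIndex` for the authoritative usage); `index` and `candidates` are placeholders for objects from a real pipeline:

```julia
using PromptingTools
const RT = PromptingTools.Experimental.RAGTools

# Assumption: `index::RT.ChunkIndex` comes from build_index and
# `candidates::RT.CandidateChunks` from a pre-filtering step, e.g.
# tag filters (neither is built here).
# Projecting a view restricts subsequent retrieval to the
# pre-filtered positions -- faster and more precise:
sub_index = view(index, candidates)  # an RT.SubChunkIndex over `index`
```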
Updated
- `CandidateChunks` and `MultiCandidateChunks` intersection methods have been updated to be an order of magnitude faster (useful for large sets like tag filters).
Fixed
- Fixed a bug in `find_closest(finder::BM25Similarity, ...)` where the `minimum_similarity` kwarg was not implemented.
PromptingTools v0.39.0
Breaking Changes
- Changed the default model for the `ai*` chat functions (`PT.MODEL_CHAT`) from `gpt3t` to `gpt4om` (GPT-4o-mini). See the LLM-Leaderboard results and the release blog post.
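In practice the change looks like this (a sketch; an OpenAI API key is assumed, and `PT.MODEL_CHAT` is the global default read by the `ai*` functions):

```julia
using PromptingTools
const PT = PromptingTools

PT.MODEL_CHAT  # the new default (GPT-4o-mini)

# Uses the new default model:
msg = aigenerate("What is the capital of France?")

# Or pin the previous default per-call via its alias:
msg = aigenerate("What is the capital of France?"; model = "gpt3t")
```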
Added
- Added the new GPT-4o-mini to the model registry (alias `gpt4om`). It's the smallest and fastest model based on GPT-4, and it's cheaper than GPT-3.5 Turbo.