Releases: svilupp/PromptingTools.jl
Releases · svilupp/PromptingTools.jl
v0.24.0
PromptingTools v0.24.0
Added
- Added support for DeepSeek models via the
dschat
anddscode
aliases. You can set theDEEPSEEK_API_KEY
environment variable to your DeepSeek API key.
Commits
Merged pull requests:
v0.23.0
PromptingTools v0.23.0
Added
- Added new prompt templates for "Expert" tasks like
LinuxBashExpertAsk
,JavascriptExpertTask
, etc. - Added new prompt templates for self-critiquing agents like
ChiefEditorTranscriptCritic
,JuliaExpertTranscriptCritic
, etc.
Updated
- Extended
aicodefixer_feedback
methods to work withAICode
andAIGenerate
.
Commits
Merged pull requests:
- Add model providers and Supported functions (#134) (@adarshpalaskar1)
- Mention that the account requires funding (#135) (@KronosTheLate)
- Add templates and minor improvements (#142) (@svilupp)
Closed issues:
- Unable to get started (#132)
v0.22.0
PromptingTools v0.22.0
### Added
- Added support for Groq, the fastest LLM provider out there. It's free for now, so you can try it out - you just need to set your GROQ_API_KEY. We've added Llama3 8b (alias "gllama3"), 70b (alias "gllama370") and Mixtral 8x7b (alias "gmixtral"). For the shortcut junkies, we also added a shorthand Llama3 8b = "gl3" (first two letters and the last digit), Llama3 70b = "gl70" (first two letters and the last two digits).
Commits
Merged pull requests:
v0.21.0
PromptingTools v0.21.0
Added
- New models added to the model registry: Llama3 8b on Ollama (alias "llama3" for convenience) and on Together.ai (alias "tllama3", "t" stands for Together.ai), also adding the llama3 70b on Together.ai (alias "tllama370") and the powerful Mixtral-8x22b on Together.ai (alias "tmixtral22").
Fixed
- Fixed a bug where pretty-printing
RAGResult
would forget a newline between the sources and context sections.
### Commits
Merged pull requests:
v0.20.1
PromptingTools v0.20.1
### Fixed
- Fixed
truncate_dimension
to ignore when 0 is provided (previously it would throw an error).
Commits
Merged pull requests:
v0.20.0
PromptingTools v0.20.0
Added
- Added a few new open-weights models hosted by Fireworks.ai to the registry (DBRX Instruct, Mixtral 8x22b Instruct, Qwen 72b). If you're curious about how well they work, try them!
- Added basic support for observability downstream. Created custom callback infrastructure with
initialize_tracer
andfinalize_tracer
and dedicated types areTracerMessage
andTracerMessageLike
. See?TracerMessage
for more information and the correspondingaigenerate
docstring. - Added
MultiCandidateChunks
which can hold candidates for retrieval across many indices (it's a flat structure to be similar toCandidateChunks
and easy to reason about). - JSON serialization support extended for
RAGResult
,CandidateChunks
, andMultiCandidateChunks
to increase observability of RAG systems - Added a new search refiner
TavilySearchRefiner
- it will search the web via Tavily API to try to improve on the RAG answer (see?refine!
). - Introduced a few small utilities for manipulation of nested kwargs (necessary for RAG pipelines), check out
getpropertynested
,setpropertynested
,merge_kwargs_nested
.
Updated
- [BREAKING] change to
CandidateChunks
where it's no longer allowed to be nested (ie,cc.positions
being a list of severalCandidateChunks
). This is a breaking change for theRAGTools
module only. We have introduced a newMultiCandidateChunks
types that can refer toCandidateChunks
across many indices. - Changed default model for
RAGTools.CohereReranker
to "cohere-rerank-english-v3.0".
Fixed
wrap_string
utility now correctly splits only on spaces. Previously it would split on newlines, which would remove natural formatting of prompts/messages when displayed viapprint
Commits
Merged pull requests:
- Correct hero cards (#129) (@cpfiffer)
- Update Hero section in docs (#130) (@svilupp)
- Add TraceMessage for observability (#133) (@svilupp)
- Update binary RAG pipeline (#136) (@svilupp)
Closed issues:
v0.19.0
PromptingTools v0.19.0
Added
- [BREAKING CHANGE] The default GPT-4 Turbo model alias ("gpt4t") now points to the official GPT-4 Turbo endpoint ("gpt-4-turbo").
- Adds references to
mistral-tiny
(7bn parameter model from MistralAI) to the model registry for completeness. - Adds the new GPT-4 Turbo model (
"gpt-4-turbo-2024-04-09"
), but you can simply use alias"gpt4t"
to access it.
Commits
Merged pull requests:
v0.18.0
PromptingTools v0.18.0
Added
- Adds support for binary embeddings in RAGTools (dispatch type for
find_closest
isfinder=BinaryCosineSimilarity()
), but you can also just convert the embeddings to binary yourself (always chooseMatrix{Bool}
for speed, notBitMatrix
) and use without any changes (very little performance difference at the moment). - Added Ollama embedding models to the model registry ("nomic-embed-text", "mxbai-embed-large") and versioned MistralAI models.
- Added template for data extraction with Chain-of-thought reasoning:
:ExtractDataCoTXML
. - Added data extraction support for Anthropic models (Claude 3) with
aiextract
. Try it with Claude-3 Haiku (model="claudeh"
) and Chain-of-though template (:ExtractDataCoTXML
). See?aiextract
for more information and check Anthropic's recommended practices.
Fixed
Commits
Merged pull requests:
v0.17.1
PromptingTools v0.17.1
Fixed
- Fixed a bug in
print_html
where the custom kwargs were not being passed to theHTML
constructor.
Commits
Merged pull requests:
v0.17.0
PromptingTools v0.17.0
Added
- Added support for
aigenerate
with Anthropic API. Preset model aliases areclaudeo
,claudes
, andclaudeh
, for Claude 3 Opus, Sonnet, and Haiku, respectively. - Enabled the GoogleGenAI extension since
GoogleGenAI.jl
is now officially registered. You can useaigenerate
by setting the model togemini
and providing theGOOGLE_API_KEY
environment variable. - Added utilities to make preparation of finetuning datasets easier. You can now export your conversations in JSONL format with ShareGPT formatting (eg, for Axolotl). See
?PT.save_conversations
for more information. - Added
print_html
utility for RAGTools module to print HTML-styled RAG answer annotations for web applications (eg, Genie.jl). See?PromptingTools.Experimental.RAGTools.print_html
for more information and examples.
Commits
Merged pull requests:
- Update CHANGELOG.md (#104) (@svilupp)
- A little README.md correction (#107) (@Muhammad-saad-2000)
- Update RAG Diagram (#108) (@svilupp)
- Add support for Claude API (#109) (@svilupp)
- Enable GoogleGenAI extension (#111) (@svilupp)
- Add ShareGPT template (#113) (@svilupp)
- Increase compat for GoogleGenAI v0.3 (#114) (@svilupp)
- Update html printing (#115) (@svilupp)
Closed issues:
- [FR] Add support for Claude API (#96)