Skip to content

Releases: argilla-io/distilabel

1.0.0

17 Apr 07:42
23c5fe5
Compare
Choose a tag to compare

What's Changed

Read more

0.6.0

01 Mar 17:57
fce6c2d
Compare
Choose a tag to compare

What's Changed

  • Fix typo in docstring of to_argilla metrics_ to metric_ by @burtenshaw in #334
  • Implement a JSON responding OpenAI LLM as JSONOpenAILLM by @burtenshaw in #331
  • Add examples for the deita paper tasks by @plaguss in #329
  • Add checkpoint strategy to automatically push to hub by @plaguss in #321
  • docs: update tutorials avoid argilla installation error by @sdiazlor in #337
  • Fix CustomDataset.load_from_disk with str/Path objects by @plaguss in #341
  • Clalrify number of generations produced when using LLMPool in docs by @davanstrien in #339
  • Refactor _build_dataset piece for speed by @plaguss in #344
  • Fix documentation and type variables in CustomDataset checkpoint methods by @plaguss in #342
  • US Spelling and other typo correction on Distilabel tutorials by @ignacioct in #324
  • docs: add a tutorial for evolinstruct by @sdiazlor in #327
  • Fix Openai api error with OpenAI-compatible providers by @jphme in #351
  • Add fix for labels not returned by openai api by @plaguss in #364
  • Refactor model availability check in is_serverless_endpoint_available by @davanstrien in #363

New Contributors

Full Changelog: 0.5.0...0.6.0

0.5.0

02 Feb 16:21
8ccf116
Compare
Choose a tag to compare

What's Changed

  • fix: Correct import error by @plaguss in #279
  • fix: Filter examples for which len generations != len ratings by @plaguss in #284
  • feat: Add sentence transformers support for the to argilla method by @davidberenstein1957 in #262
  • feat: Add text descriptives support to the to argilla methods by @davidberenstein1957 in #271
  • feat: Add to_argilla method to EvolInstructTask generated datasets by @plaguss in #291
  • docs: Shorten titles tutorials and update core example by @davidberenstein1957 in #289
  • feat: Add new serialization strategy by @plaguss in #288
  • feat: Review OllamaLLM and TogetherInferenceLLM by @alvarobartt in #305
  • refactor: Remove Metadata for Ratings by @ignacioct in #303
  • docs: Add missing VertexAI information within README.md and docs/index.md by @alvarobartt in #308
  • feat: Add functionality to push tasks to the HuggingFace hub and download them automatically. by @plaguss in #297
  • feat: Add ComplexityScorer and QualityScorer tasks from Deita by @plaguss in #302
  • fix: Fix logging visualization of labeller pipelines by @plaguss in #310
  • feat: Add Improving Text Embeddings with LLMs tutorial by @alvarobartt in #313
  • feat: Add EvolComplexity and EvolQuality by @davidberenstein1957 in #299
  • feat: Add validate_prompts method to LLMs to help validating the prompts by @plaguss in #314
  • fix: typo in clean an existing preference dataset by @sdiazlor in #312
  • feat: Add new column for sft fine tuning with prepare_dataset by @plaguss in #309
  • docs: Custom Task Documentation by @ignacioct in #275
  • refactor: Align the LLM subclasses args by @alvarobartt in #315
  • feat: Include rationale of the model responses on prepare_dataset if available by @plaguss in #317
  • feat: Add embedding tutorial to docs by @ignacioct in #319
  • feat: Add MistralAILLM by @plaguss in #293
  • feat: Use ollama Python client within OllamaLLM by @sdiazlor in #307

Full Changelog: 0.4.0...0.5.0

0.4.0

19 Jan 15:20
2abe11a
Compare
Choose a tag to compare

What's Changed

  • docs: Notus end2end example for preference and instruction generation by @ignacioct in #145
  • docs: binders anchors by @ignacioct in #235
  • feat: Add support for dedicated and serverless inference endpoints via inference API by @philschmid in #238
  • docs: Update links to arxiv landing pages rather than PDFs by @davanstrien in #249
  • feat: add ETA to progress bar and fix not showing the progress bar if irrelavant by @ignacioct in #253
  • feat: Add Evol instruct task by @plaguss in #237
  • docs: rename enable_checkpoints to checkpoint_strategy by @davidberenstein1957 in #257
  • feat: Fixing progress bar and ETA by @ignacioct in #260
  • fix: resolved error with self instruct to argilla method by @plaguss in #265
  • chore: Add extra check in llmpool to ensure all the tasks share the same parent class by @plaguss in #266
  • fix: fix for Notus tutorial after bug in record unwrap by @ignacioct in #267
  • feat: add customizable criteria for query generation in SelfInstructTask by @ignacioct in #269
  • docs: add a tutorial on "clean a DPO/preference dataset with distilabel" by @sdiazlor in #270
  • feat: Add new functionality to binarize preference datasets directly from distilabel by @plaguss in #264
  • feat: add support ollama api by @davidberenstein1957 in #250

New Contributors

Full Changelog: 0.3.0...0.4.0

0.3.0

09 Jan 15:34
ba3891a
Compare
Choose a tag to compare

What's Changed

Full Changelog: 0.2.1...0.3.0

0.2.1

27 Dec 13:06
9835760
Compare
Choose a tag to compare

What's Changed

Full Changelog: 0.2.0...0.2.1

0.2.0

21 Dec 15:27
3078da5
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: 0.1.1...0.2.0

0.1.1

11 Dec 08:24
Compare
Choose a tag to compare

What's Changed

  • Template for Documentation Issue created by @ignacioct in #128
  • self.thread_pool_executor can be None, protecting it for print by @ignacioct in #129
  • Use do_sample in transformers example by @dvsrepo in #138
  • Fix llama-cpp and hf-inference-endpoints extras in pyproject.toml by @plaguss in #139
  • Fix llama_cpp_python dependency check by @plaguss in #140

New Contributors

Full Changelog: 0.1.0...0.1.1

0.1.0

29 Nov 10:18
7154396
Compare
Choose a tag to compare

Stable Release - v0.1.0

0.1.0rc2

23 Nov 13:10
d2f9943
Compare
Choose a tag to compare
0.1.0rc2 Pre-release
Pre-release
distilabel 0.1.0rc2