Pull requests: IBM/unitxt
Pull requests list
New metric definitions for llama-3-3-70b as judge in Arena Hard benchmark
#1949
opened Oct 27, 2025 by
kmazrolina
Update rag llm as judge metric to support llama-3-3-70b model on WML.
#1945
opened Oct 9, 2025 by
piotrhelm
Correct reflection based tool calling metrics so valid results will be 1.
#1940
opened Sep 25, 2025 by
yoavkatz
light fast removal of register_all_artifacts for unitxt classes
#1939
opened Sep 20, 2025 by
dafnapension
A potential fix for messy error box when Error is KeyError
#1932
opened Sep 1, 2025 by
dafnapension
Refactor, simplify and unify llm as judges to map reduce metric
#1927
opened Aug 24, 2025 by
elronbandel
Draft
fixed mmmu by cooking options from answer, when options is not given in the instance
#1921
opened Aug 20, 2025 by
dafnapension
Fix bfcl so that instances do not explode in size, and all cards yield recipes that pass _source_to_dataset
#1915
opened Aug 13, 2025 by
dafnapension
Allow using python functions instead of operators (e.g in pre-processing pipeline)
#1845
opened Jun 26, 2025 by
elronbandel
For issue 1575: Eliminating Manual Class Registration in Unitxt, replaced by Import Paths
#1713
opened Apr 5, 2025 by
dafnapension
Momentary fix of 'CriteriaWithOptions is not JSON serializable'
#1694
opened Mar 19, 2025 by
martinscooper
Draft
BugFix: Use dumping of task data and source only when dumping
#1661
opened Mar 9, 2025 by
elronbandel
Add batch size control to Huggingface pipeline based inference engine
#1636
opened Feb 27, 2025 by
elronbandel
Add prediction variable name customization to LLM as Judge
#1622
opened Feb 21, 2025 by
martinscooper
Draft