Yi Cui PRO

onekq

AI & ML interests

Benchmark, Code Generation Model

Recent Activity

updated a model 3 minutes ago
onekq-ai/granite-20b-code-base-8k-bnb-4bit
updated a collection about 23 hours ago
QLora-ready Coding Models

Organizations

MLX Community, ONEKQ AI

onekq's activity

posted an update 14 days ago
πŸ‹ DeepSeek πŸ‹v3 achieves a solid 7 point jump than v2.5, surpassing GPT-4o, but is still behind πŸ“ o1 πŸ“and Claude 3.5.

onekq-ai/WebApp1K-models-leaderboard
posted an update 3 months ago
October version of Claude 3.5 lifts SOTA (set by its June version) by 7 points.
onekq-ai/WebApp1K-models-leaderboard

Closed-source models are widening the gap again.

Note: Our frontier leaderboard now uses double-scenario tests because the single-scenario test suite has been saturated.