BlinkDL

BlinkDL

AI & ML interests

RWKV is all you need

Recent Activity

updated a model 3 days ago
BlinkDL/rwkv-7-world
updated a model 4 days ago
BlinkDL/temp-latest-training-models
updated a Space 4 days ago
BlinkDL/RWKV-Gradio-1
View all activity

Organizations

RWKV's profile picture FreedomAI's profile picture rwkv-x-dev's profile picture Social Post Explorers's profile picture

BlinkDL's activity

posted an update 24 days ago
view post
Post
2151
RWKV-7 "Goose" 0.4B trained w/ ctx4k automatically extrapolates to ctx32k+, and perfectly solves NIAH ctx16k ๐Ÿคฏ 100% RNN and attention-free. Only trained on the Pile. No finetuning. Replicable training runs. tested by our community: https://github.com/Jellyfish042/LongMamba
posted an update about 2 months ago
view post
Post
4669
RWKV-6-world-v3 (+3.1T tokens) is our best multilingual 7B model as of now: BlinkDL/rwkv-6-world

It's 100% RNN and attention-free. MMLU 54.2% (previous world-v2.1 = 47.9%. note: without eval-boosting tricks such as annealing).

RWKV-7-world-v4 soon :)
updated a Space about 2 months ago
posted an update 4 months ago
view post
Post
5482
RWKV-7 "Goose" preview rc2 => Peak RNN architecture?๐Ÿ˜ƒWill try to squeeze more performance for the final release. Preview code & model: https://github.com/BlinkDL/RWKV-LM/tree/main/RWKV-v7
  • 2 replies
ยท
liked a Space 8 months ago