60 4 137

Wolfram Ravenwolf

wolfram

https://ko-fi.com/wolframravenwolf

AI & ML interests

Local LLMs

Recent Activity

new activity 8 days ago

blog-explorers/README:[Support] Community Articles

updated a model 17 days ago

wolfram/QVQ-72B-Preview-4.65bpw-h6-exl2

liked a model 23 days ago

nisten/apollonia-7b

View all activity

Articles

🐺🐦‍⬛ LLM Comparison/Test: Phi-4, Qwen2 VL 72B Instruct, Aya Expanse 32B in my updated MMLU-Pro CS benchmark

about 16 hours ago

• 1

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

9 days ago

• 36

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

Dec 4, 2024

• 76

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)

Apr 24, 2024

• 60

Organizations

wolfram's activity

upvoted an article about 1 month ago

Article

They Said It Couldn’t Be Done

•

Dec 5, 2024

• 76

upvoted an article about 2 months ago

Article

Releasing the largest multilingual open pretraining dataset

•

Nov 13, 2024

• 98

upvoted an article 5 months ago

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22, 2024

• 86

upvoted an article 8 months ago

Article

Let's talk about LLM evaluation

•

May 23, 2024

• 144

Wolfram Ravenwolf

AI & ML interests

Recent Activity

Articles

🐺🐦‍⬛ LLM Comparison/Test: Phi-4, Qwen2 VL 72B Instruct, Aya Expanse 32B in my updated MMLU-Pro CS benchmark

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

Turning Home Assistant into an AI Powerhouse: Amy's Guide

Your AI, Everywhere

The Great LLM Showdown: Amy's Quest for the Perfect LLM

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)

Organizations

wolfram's activity

They Said It Couldn’t Be Done

Releasing the largest multilingual open pretraining dataset

The 5 Most Under-Rated Tools on Hugging Face

Let's talk about LLM evaluation