2 10 82

Alberto Cetoli PRO

fractalego

https://fractalego.social/@alberto

AI & ML interests

Entity/relation extraction, Q&A, Summarisation

Recent Activity

liked a model 2 days ago

stabilityai/stable-video-diffusion-img2vid-xt

reacted to mitkox's post with 🤯 3 days ago

Can it run DeepSeek V3 671B is the new 'can it run Doom'. How minimalistic can I go with on device AI with behemoth models - here I'm running DeepSeek V3 MoE on a single A6000 GPU. Not great, not terrible, for this minimalistic setup. I love the Mixture of Experts architectures. Typically I'm running my core LLM distributed over the 4 GPUs. Make sure you own your AI. AI in the cloud is not aligned with you; it's aligned with the company that owns it.

reacted to mitkox's post with 🔥 3 days ago

View all activity

Articles

Fine-tuning LLMs with Singular Value Decomposition

Jun 2, 2024

• 8

The LASER technique: Evaluating SVD compression

Apr 4, 2024

• 7

Organizations

fractalego's activity

liked a model 2 days ago

stabilityai/stable-video-diffusion-img2vid-xt

Image-to-Video • Updated Jul 10, 2024 • 181k • 2.8k

reacted to mitkox's post with 🤯🔥➕ 3 days ago

Post

2306

Can it run DeepSeek V3 671B is the new 'can it run Doom'.

How minimalistic can I go with on device AI with behemoth models - here I'm running DeepSeek V3 MoE on a single A6000 GPU.

Not great, not terrible, for this minimalistic setup. I love the Mixture of Experts architectures. Typically I'm running my core LLM distributed over the 4 GPUs.

Make sure you own your AI. AI in the cloud is not aligned with you; it's aligned with the company that owns it.

5 replies

liked a dataset 8 days ago

facebook/winoground

Viewer • Updated Oct 22, 2024 • 400 • 353 • 86

liked a dataset 9 days ago

agibot-world/AgiBotWorld-Alpha

Viewer • Updated 3 days ago • 20.1M • 9.56k • 156

updated a model 9 days ago

fractalego/wafl-phi3.5-mini-instruct

Text Generation • Updated 9 days ago • 337

upvoted 3 papers 18 days ago

Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning

Paper • 2411.18203 • Published Nov 27, 2024 • 33

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Paper • 2410.02884 • Published Oct 3, 2024 • 53

Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B

Paper • 2406.07394 • Published Jun 11, 2024 • 26

liked a model 21 days ago

NyxKrage/Microsoft_Phi-4

Updated 29 days ago • 6.96k • 53

updated a dataset 28 days ago

fractalego/wafl-dataset-2.0_sentence_by_sentence

Viewer • Updated 28 days ago • 5.51k • 40

liked a model about 1 month ago

Qwen/Qwen2.5-32B

Text Generation • Updated Sep 20, 2024 • 23.8k • 60

reacted to julien-c's post with 🔥 about 1 month ago

Post

2487

wow 😮

INTELLECT-1 is the first collaboratively trained 10 billion parameter language model trained from scratch on 1 trillion tokens of English text and code.

PrimeIntellect/INTELLECT-1-Instruct

reacted to merve's post with ❤️ about 2 months ago

Post

3156

your hugging face profile now has your recent activities 🤗

liked 2 datasets about 2 months ago

pminervini/HaluEval

Viewer • Updated Dec 7, 2023 • 64.5k • 653 • 12

gorilla-llm/Berkeley-Function-Calling-Leaderboard

Preview • Updated Dec 10, 2024 • 664 • 54

upvoted a collection 2 months ago

Daily Papers

Collection

1 item • Updated Oct 26, 2023 • 66

reacted to chansung's post with 👍 2 months ago

Post

4691

Effortlessly stay up-to-date with AI research trends using a new AI tool, "AI Paper Reviewer" !!

It analyzes a list of Hugging Face Daily Papers(w/ @akhaliq ) and turn them into insightful blog posts. This project leverages Gemini models (1.5 Pro, 1.5 Flash, and 1.5 Flash-8B) for content generation and Upstage Document Parse for parsing the layout and contents.
blog link: https://deep-diver.github.io/ai-paper-reviewer/

Also, here is the link of GitHub repository for parsing and generating pipeline. By using this, you can easily build your own GitHub static pages based on any arXiv papers with your own interest!
: https://github.com/deep-diver/paper-reviewer

liked a dataset 3 months ago

Matthijs/cmu-arctic-xvectors

Viewer • Updated Feb 7, 2023 • 7.93k • 16.6k • 41