Alberto Cetoli PRO

fractalego

AI & ML interests

Entity/relation extraction, Q&A, Summarisation

Recent Activity

Articles

Organizations

Blog-explorers's profile picture Hugging Face Discord Community's profile picture open/ acc's profile picture

fractalego's activity

reacted to mitkox's post with 🤯🔥 3 days ago
view post
Post
2306
Can it run DeepSeek V3 671B is the new 'can it run Doom'.

How minimalistic can I go with on device AI with behemoth models - here I'm running DeepSeek V3 MoE on a single A6000 GPU.

Not great, not terrible, for this minimalistic setup. I love the Mixture of Experts architectures. Typically I'm running my core LLM distributed over the 4 GPUs.

Make sure you own your AI. AI in the cloud is not aligned with you; it's aligned with the company that owns it.
·
reacted to julien-c's post with 🔥 about 1 month ago
view post
Post
2487
wow 😮

INTELLECT-1 is the first collaboratively trained 10 billion parameter language model trained from scratch on 1 trillion tokens of English text and code.

PrimeIntellect/INTELLECT-1-Instruct
reacted to merve's post with ❤️ about 2 months ago
view post
Post
3156
your hugging face profile now has your recent activities 🤗
reacted to chansung's post with 👍 2 months ago
view post
Post
4691
Effortlessly stay up-to-date with AI research trends using a new AI tool, "AI Paper Reviewer" !!

It analyzes a list of Hugging Face Daily Papers(w/ @akhaliq ) and turn them into insightful blog posts. This project leverages Gemini models (1.5 Pro, 1.5 Flash, and 1.5 Flash-8B) for content generation and Upstage Document Parse for parsing the layout and contents.
blog link: https://deep-diver.github.io/ai-paper-reviewer/

Also, here is the link of GitHub repository for parsing and generating pipeline. By using this, you can easily build your own GitHub static pages based on any arXiv papers with your own interest!
: https://github.com/deep-diver/paper-reviewer