Victor Mustar's picture

Victor Mustar PRO

victor

AI & ML interests

Building the UX of this website

Recent Activity

liked a model 24 minutes ago
NovaSky-AI/Sky-T1-32B-Preview
liked a Space about 18 hours ago
fffiloni/Sa2VA-simple-demo
liked a Space about 23 hours ago
declare-lab/TangoFlux
View all activity

Articles

Organizations

Hugging Face's profile picture Google's profile picture Competitions's profile picture Safetensors's profile picture 21 RNN's profile picture Spaces-explorers's profile picture Text Generation Inference's profile picture CVPR Demo Track's profile picture Spaces Examples's profile picture Hugging Chat's profile picture Webhooks Explorers (BETA)'s profile picture lora concepts library's profile picture Scanned Tokens's profile picture Huggingface Projects's profile picture hf admins's profile picture Hugging Face OSS Metrics's profile picture Stable Diffusion Dreambooth Concepts Library's profile picture Core ML Projects's profile picture temp-org's profile picture Blog-explorers's profile picture Mustarz's profile picture Open LLM Leaderboard's profile picture Enterprise Explorers's profile picture The Collectionists's profile picture ZeroGPU Explorers's profile picture Hugging Face Tools's profile picture TstOrg141's profile picture Stable Video benchmark's profile picture Social Post Explorers's profile picture Dev Mode Explorers's profile picture LLHF's profile picture SLLHF's profile picture Self-serve FTW's profile picture

victor's activity

replied to roseking's post 2 days ago
reacted to roseking's post with πŸ€— 2 days ago
view post
Post
1453
πŸŽ‰ Major Updates to HF Daily Paper Newsletter Bot

I'm excited to announce significant improvements to my HF Daily Paper Newsletter Bot! Here are the key updates:

πŸ–ΌοΈ Enhanced Poster Generation
- Implemented dynamic height adjustment for daily paper posters
- Added support for displaying complete paper content without truncation
- Improved Chinese font rendering and text layout
- Integrated Hugging Face logo for better branding
- Enhanced visual aesthetics with optimized card layouts

πŸ“ Content Improvements
- Removed paper count limitations (previously capped at 5 papers)
- Enhanced title and summary extraction algorithms
- Improved text wrapping and spacing for better readability
- Added proper handling of long content with automatic layout adjustments

πŸ› οΈ Technical Enhancements
- Implemented better font loading mechanism with fallback options
- Added support for multiple Chinese font paths
- Improved error handling and logging
- Enhanced memory management for image processing
- Added detailed debugging information

🌟 Visual Design Updates
- Refined color scheme with HF brand colors
- Improved card spacing and padding
- Enhanced typography with better font sizing
- Added smooth transitions between paper cards
- Optimized overall layout for better visual hierarchy

πŸ”§ Infrastructure Updates
- Improved GitHub Actions workflow reliability
- Enhanced error notification system
- Added automatic retries for API calls
- Improved logging and debugging capabilities

The bot now generates more professional and visually appealing daily paper summaries while ensuring complete content display. These updates make the newsletter more readable and informative for our users.

Try it out and let me know what you think! Your feedback helps me make continuous improvements to better serve the AI research community.

#HuggingFace #AI #MachineLearning #ResearchPapers #OpenSource


  • 2 replies
Β·
reacted to mitkox's post with πŸ”₯ 3 days ago
view post
Post
2303
Can it run DeepSeek V3 671B is the new 'can it run Doom'.

How minimalistic can I go with on device AI with behemoth models - here I'm running DeepSeek V3 MoE on a single A6000 GPU.

Not great, not terrible, for this minimalistic setup. I love the Mixture of Experts architectures. Typically I'm running my core LLM distributed over the 4 GPUs.

Make sure you own your AI. AI in the cloud is not aligned with you; it's aligned with the company that owns it.
Β·
reacted to Tonic's post with πŸ”₯ 3 days ago
view post
Post
1494
microsoft just released Phi-4 , check it out here : Tonic/Phi-4

hope you like it :-)
reacted to ngxson's post with πŸ‘€ 3 days ago
reacted to jasoncorkill's post with πŸ‘ 3 days ago
reacted to sequelbox's post with πŸ”₯ 3 days ago
view post
Post
1311
NEW RELEASE: the sequelbox/Tachibana-QVQ dataset is here! Code-reasoning and code-instruct data generated with Qwen/QVQ-72B-Preview

Come check out QVQ's coding skills!

for everyone to use!

more QVQ and Llama 3.1 405b datasets coming soon :)
reacted to danielhanchen's post with πŸ”₯ 3 days ago
reacted to reddgr's post with πŸ‘€ 3 days ago
view post
Post
2244
Major update on the Talking to Chatbots dataset! Expanded the 'wrapped' dataset (one row per chat) to 2.86k records, and the 'unwrapped' version (one row per conversation turn) to 11k records. The main source is my ChatGPT archive with nearly 2 years of chats. It is still a work in progress as I incorporate chats from other sources and qualitative metrics (SCBN) for responses.

reddgr/talking-to-chatbots-unwrapped-chats

reddgr/talking-to-chatbots-chats

reacted to prithivMLmods's post with πŸš€ 3 days ago
view post
Post
5358
Reasoning SmolLM2 πŸš€

🎯Fine-tuning SmolLM2 on a lightweight synthetic reasoning dataset for reasoning-specific tasks. Future updates will focus on lightweight, blazing-fast reasoning models. Until then, check out the blog for fine-tuning details.

πŸ”₯Blog : https://huggingface.co/blog/prithivMLmods/smollm2-ft

πŸ”Ό Models :
+ SmolLM2-CoT-360M : prithivMLmods/SmolLM2-CoT-360M
+ Reasoning-SmolLM2-135M : prithivMLmods/Reasoning-SmolLM2-135M
+ SmolLM2-CoT-360M-GGUF : prithivMLmods/SmolLM2-CoT-360M-GGUF

🀠 Other Details :
+ Demo : prithivMLmods/SmolLM2-CoT-360M
+ Fine-tune nB : prithivMLmods/SmolLM2-CoT-360M




reacted to hexgrad's post with πŸ”₯ 5 days ago
view post
Post
4883
πŸ“£ Looking for labeled, high-quality synthetic audio/TTS data πŸ“£ Have you been or are you currently calling API endpoints from OpenAI, ElevenLabs, etc? Do you have labeled audio data sitting around gathering dust? Let's talk! Join https://discord.gg/QuGxSWBfQy or comment down below.

If your data exceeds quantity & quality thresholds and is approved into the next hexgrad/Kokoro-82M training mix, and you permissively DM me the data under an effective Apache license, then I will DM back the corresponding voicepacks for YOUR data if/when the next Apache-licensed Kokoro base model drops.

What does this mean? If you've been calling closed-source TTS or audio API endpoints to:
- Build voice agents
- Make long-form audio, like audiobooks or podcasts
- Handle customer support, etc
Then YOU can contribute to the training mix and get useful artifacts in return. ❀️

More details at hexgrad/Kokoro-82M#21
Β·
reacted to merve's post with πŸ”₯ 11 days ago
view post
Post
4713
supercharge your LLM apps with smolagents πŸ”₯

however cool your LLM is, without being agentic it can only go so far

enter smolagents: a new agent library by Hugging Face to make the LLM write code, do analysis and automate boring stuff!

Here's our blog for you to get started https://huggingface.co/blog/smolagents
reacted to cfahlgren1's post with πŸš€ 11 days ago
reacted to sequelbox's post with πŸ‘ 11 days ago
reacted to ivanfioravanti's post with πŸ‘ 13 days ago
view post
Post
1542
Probably most of you already knows this trick but just in case:
πŸ€” Unable to connect to Hugging Face Spaces Dev Mode through local Cursor? πŸ’‘ Don't worry there is an easy trick!

- right click Connect with VS Code
- copy link in your browser
- vscode://vscode-remote/...
- replace vscode with cursor and go
- cursor://vscode-remote/...
reacted to hexgrad's post with πŸ”₯ 15 days ago
view post
Post
3944
Merry Christmas! πŸŽ„ Open sourced a small TTS model at hexgrad/Kokoro-82M
  • 2 replies
Β·
reacted to merve's post with πŸ‘ 15 days ago
reacted to AdinaY's post with πŸš€πŸ”₯ 15 days ago
view post
Post
3577
The Chinese community is shipping 🚒

DeepSeek V3 (685 B MoE) has quietly released on the hub!
Base: deepseek-ai/DeepSeek-V3-Base
Instruct: deepseek-ai/DeepSeek-V3

Can’t wait to see what’s next!
  • 1 reply
Β·
reacted to vincentg64's post with πŸ”₯ 15 days ago
view post
Post
2220
LLM 2.0, RAG & Non-Standard Gen AI on GitHub https://mltblog.com/3DsyZSq

In this article, I share my latest Gen AI and LLM advances, featuring innovative approaches radically different from both standard AI and classical ML/NLP. The focus is on doing better with less, using efficient architectures, new algorithms and evaluation metrics. It originates from research that I started long ago. It gained significant momentum in the last two years. See background and history at https://mltblog.com/4g2sKTv.

OpenAI, Perplexity, Anthropic, Llama and others typically follow the trend and implement solutions very similar to mines within 3 to 6 months after I publish new milestones. For instance, multi-tokens, knowledge graph tokens, multi-indexes, real-time fine-tuning, mixtures of experts, LLM routers, small enterprise sub-LLMs, prompt distillation, relevancy scoring engine, deep contextual retrieval, optimum agentic chunking, and modern UI instead of the basic prompt box. I keep adding new features all the time, staying ahead of competition.

➑️ Read full article with links to GitHub, at https://mltblog.com/3DsyZSq
  • 1 reply
Β·