https://huggingface.co/papers/2501.03006
A sample demonstration of building with thinking LLMs
Erase any object just by naming it!
3D generation from text prompts
Automated video and sound synthesis from images
WebGPU text-to-speech powered by OuteTTS and Transformers.js
Experiment with and compare different tokenizers