Quentin Lhoest PRO

lhoestq

AI & ML interests

Maintainer of 🤗 Datasets: NLP, multimodal data processing and sharing

Recent Activity

updated a dataset 8 minutes ago
infinite-dataset-hub/LotteryPatterns
liked a dataset about 13 hours ago
Magpie-Align/Magpie-Reasoning-V1-150K
liked a dataset about 15 hours ago
HumanLLMs/Human-Like-DPO-Dataset

Organizations

Hugging Face
WMT: Workshop on Statistical Machine Translation
BigScience Workshop
Neuropark
Hugging Face Internal Testing Organization
Training Transformers Together
BigScience Catalogue Data
OpenSLR
BigScience Data
Evaluation on the Hub
2023 Jan Offsite hackathon
Datasets Maintainers
Whisper Distillation
Open LLM Leaderboard
huggingPartyParis
CommonCanvas
ZeroGPU Explorers
Datasets examples
Pixel Parsing
HuggingFaceFW-Dev
Infinite Dataset Hub
Hugging Face FineVideo
Dataset ReWriter
Dataset Tools
Rainforest Connection

Posts 3

Post
1694
Made an HF Dataset editor à la Google Sheets here: lhoestq/dataset-spreadsheets

With Dataset Spreadsheets:
✏️ Edit datasets in the UI
🔗 Share link with collaborators
🐍 Use locally in DuckDB or Python

Available for the 100,000+ parquet datasets on HF :)
Post
4104
Hey! I'm working on a 100% synthetic Dataset Hub (you can search for any kind of dataset and the app invents it). The link is here: infinite-dataset-hub/infinite-dataset-hub

Question for the Community:

Which models should I use to generate images and audio samples for those datasets? 🤗