Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1312
160
226
Merve Noyan
merve
Follow
etmeseh's profile picture
mfasialliaqat's profile picture
aakarsh03's profile picture
6038 followers
·
226 following
https://github.com/merveenoyan/smol-vision
mervenoyann
merveenoyan
merve.bsky.social
AI & ML interests
VLMs, vision & co
Recent Activity
posted
an
update
1 day ago
What a beginning to this year in open ML 🤠 Let's unwrap! https://huggingface.co/collections/merve/jan-10-releases-677fe34177759de0edfc9714 Multimodal 🖼️ > ByteDance released SA2VA: a family of vision LMs that can take image, video, text and visual prompts > moondream2 is out with new capabilities like outputting structured data and gaze detection! > Dataset: Alibaba DAMO lab released multimodal textbook — 22k hours worth of samples from instruction videos 🤯 > Dataset: SciCap captioning on scientific documents benchmark dataset is released along with the challenge! LLMs 💬 > Microsoft released Phi-4, sota open-source 14B language model 🔥 > Dolphin is back with Dolphin 3.0 Llama 3.1 8B 🐬🐬 > Prime-RL released Eurus-2-7B-PRIME a new language model trained using PRIME alignment > SmallThinker-3B is a new small reasoning LM based on Owen2.5-3B-Instruct 💭 > Dataset: QWQ-LONGCOT-500K is the dataset used to train SmallThinker, generated using QwQ-32B-preview 📕 > Dataset: @cfahlgren1 released React Code Instructions: a dataset of code instruction-code pairs 📕 > Dataset: Qwen team is on the roll, they just released CodeElo, a dataset of code preferences 👩🏻💻 Embeddings 🔖 > @MoritzLaurer released zero-shot version of ModernBERT large 👏 > KaLM is a new family of performant multilingual embedding models with MIT license built using Qwen2-0.5B Image/Video Generation ⏯️ > NVIDIA released Cosmos, a new family of diffusion/autoregressive World Foundation Models generating worlds from images, videos and texts 🔥 > Adobe released TransPixar: a new text-to-video model that can generate assets with transparent backgrounds (a first!) > Dataset: fal released cosmos-openvid-1m Cosmos-tokenized OpenVid-1M with samples from OpenVid-1M Others > Prior Labs released TabPFNv2, the best tabular transformer is out for classification and regression > Metagene-1 is a new RNA language model that can be used for pathogen detection, zero-shot embedding and genome understanding
liked
a model
1 day ago
vikhyatk/moondream2
updated
a collection
1 day ago
Jan 10 Releases 🌨️
View all activity
Articles
Introducing smolagents: simple agents that write actions in code.
12 days ago
•
380
Welcome PaliGemma 2 – New vision language models by Google
Dec 5, 2024
•
124
SmolVLM - small yet mighty Vision Language Model
Nov 26, 2024
•
152
Llama can now see and run on your device - welcome Llama 3.2
Sep 25, 2024
•
180
Preference Optimization for Vision Language Models
Jul 10, 2024
•
55
Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models
Jun 24, 2024
•
182
PaliGemma – Google's Cutting-Edge Open Vision Language Model
May 14, 2024
•
232
Vision Language Models Explained
Apr 11, 2024
•
241
Introduction to Quantization cooked in 🤗 with 💗🧑🍳
Aug 25, 2023
•
24
Deploy MusicGen in no time with Inference Endpoints
Aug 4, 2023
•
4
Open-Source Text Generation & LLM Ecosystem at Hugging Face
Jul 17, 2023
•
2
Jupyter X Hugging Face
Mar 23, 2023
•
2
Using Machine Learning to Aid Survivors and Race through Time
Mar 3, 2023
•
6
Introducing Skops
Aug 12, 2022
•
1
Announcing the Hugging Face Fellowship Program
May 17, 2022
•
6
Showcase Your Projects in Spaces using Gradio
Oct 5, 2021
•
6
Hosting your Models and Datasets on Hugging Face Spaces using Streamlit
Oct 5, 2021
•
3
Organizations
merve
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
keremberke/yolov5m-smoke
1 day ago
Enable download stats -- fix library name
#1 opened 1 day ago by
merve
New activity in
arnabdhar/YOLOv8-Face-Detection
1 day ago
Fix metadata
#7 opened 1 day ago by
merve
New activity in
Ultralytics/YOLOv5
1 day ago
Enable download stats
#2 opened 1 day ago by
merve
New activity in
Ultralytics/YOLOv8
1 day ago
Enable download stats
#1 opened 1 day ago by
merve
New activity in
Ultralytics/YOLO11
1 day ago
Update library name
#1 opened 1 day ago by
merve
New activity in
StephanST/WALDO30
1 day ago
Add library
#4 opened 1 day ago by
merve
License
5
#3 opened 2 days ago by
merve
New activity in
WHL95/PRIME-RL-Eurus-2-7B-PRIME
2 days ago
Zero A100 Grant
#1 opened 2 days ago by
merve
New activity in
ByteDance/Sa2VA-1B
2 days ago
Demo
1
#2 opened 2 days ago by
merve
New activity in
ByteDance/Sa2VA-8B
2 days ago
Fix model tree
#2 opened 2 days ago by
merve
Fix model tree
#1 opened 2 days ago by
merve
New activity in
ByteDance/Sa2VA-1B
2 days ago
Fix model tree
#1 opened 2 days ago by
merve
New activity in
StephanST/WALDO30
2 days ago
Suspicious Pickle?
4
#2 opened 3 months ago by
JohnEDSAR
New activity in
HuggingFaceTB/SmolVLM-Instruct
23 days ago
Add FT tutorial link
#22 opened 23 days ago by
merve
How to training or fientunee SmolVLM easily?
1
#21 opened 28 days ago by
lucasjin
New activity in
merve/paligemma_vqav2
24 days ago
Update `dataset` to reference to the actual dataset used
#4 opened 24 days ago by
alvarobartt
New activity in
merve/vision_papers
about 1 month ago
Fix streamlit warning
#3 opened about 1 month ago by
lbourdois
Multilingual version
1
#1 opened 4 months ago by
lbourdois
New activity in
google/paligemma2-10b-pt-896
about 1 month ago
About downstream task apply?
3
#2 opened about 1 month ago by
JackWang0601
New activity in
TIGER-Lab/VideoScore-v1.1
about 1 month ago
Fix task tag
#1 opened about 1 month ago by
merve
Load more