Caleb Fahlgren's picture

Caleb Fahlgren PRO

cfahlgren1

AI & ML interests

None yet

Recent Activity

updated a dataset 27 minutes ago
cfahlgren1/react-code-instructions
updated a dataset about 14 hours ago
cfahlgren1/hub-stats
updated a dataset about 16 hours ago
duckdb-nsql-hub/duckdb-nsql-scores
View all activity

Articles

Organizations

Hugging Face's profile picture Datasets Maintainers's profile picture Hugging Face OSS Metrics's profile picture Hugging Face TB Research's profile picture ChatDB's profile picture Cognitive Computations's profile picture nltpt-q's profile picture DuckDB Text-2-SQL Bench's profile picture open/ acc's profile picture Bluesky Community's profile picture

cfahlgren1's activity

reacted to merve's post with ❤️ about 17 hours ago
view post
Post
1280
What a beginning to this year in open ML 🤠
Let's unwrap! merve/jan-10-releases-677fe34177759de0edfc9714

Multimodal 🖼️
> ByteDance released SA2VA: a family of vision LMs that can take image, video, text and visual prompts
> moondream2 is out with new capabilities like outputting structured data and gaze detection!
> Dataset: Alibaba DAMO lab released multimodal textbook — 22k hours worth of samples from instruction videos 🤯
> Dataset: SciCap captioning on scientific documents benchmark dataset is released along with the challenge!

LLMs 💬
> Microsoft released Phi-4, sota open-source 14B language model 🔥
> Dolphin is back with Dolphin 3.0 Llama 3.1 8B 🐬🐬
> Prime-RL released Eurus-2-7B-PRIME a new language model trained using PRIME alignment
> SmallThinker-3B is a new small reasoning LM based on Owen2.5-3B-Instruct 💭
> Dataset: QWQ-LONGCOT-500K is the dataset used to train SmallThinker, generated using QwQ-32B-preview 📕
> Dataset: @cfahlgren1 released React Code Instructions: a dataset of code instruction-code pairs 📕
> Dataset: Qwen team is on the roll, they just released CodeElo, a dataset of code preferences 👩🏻‍💻

Embeddings 🔖
> @MoritzLaurer released zero-shot version of ModernBERT large 👏
> KaLM is a new family of performant multilingual embedding models with MIT license built using Qwen2-0.5B

Image/Video Generation ⏯️
> NVIDIA released Cosmos, a new family of diffusion/autoregressive World Foundation Models generating worlds from images, videos and texts 🔥
> Adobe released TransPixar: a new text-to-video model that can generate assets with transparent backgrounds (a first!)
> Dataset: fal released cosmos-openvid-1m Cosmos-tokenized OpenVid-1M with samples from OpenVid-1M

Others
> Prior Labs released TabPFNv2, the best tabular transformer is out for classification and regression
> Metagene-1 is a new RNA language model that can be used for pathogen detection, zero-shot embedding and genome understanding
posted an update 1 day ago
view post
Post
949
Wow, I just added Langfuse tracing to the Deepseek Artifacts app and it's really nice 🔥

It allows me to visualize and track more things along with the cfahlgren1/react-code-instructions dataset.

It was just added as a one click Docker Space template, so it's super easy to self host 💪
New activity in microsoft/phi-4 3 days ago
New activity in microsoft/phi-4 3 days ago
New activity in microsoft/phi-4 3 days ago

Link Paper to Model?

#8 opened 3 days ago by
cfahlgren1