Ankush Singal

Andyrasika

AI & ML interests

None yet

Recent Activity

reacted to merve's post with ❤️ about 19 hours ago

updated a collection 2 days ago

multimodal

updated a collection 5 days ago

Embedding

Articles

SwanLab and Transformers: Power Up Your NLP Experiments

Jun 17, 2024

• 6

Estimating Memory Consumption of LLMs for Inference and Fine-Tuning for Cohere Command-R+

Apr 26, 2024

• 11

RAG Empowerment: Cohere C4AI Command-R and Transformers Unveiled

Apr 7, 2024

• 10

Elevate Responses: RAG with LlamaIndex & MongoDB

Mar 28, 2024

• 4

Revolutionizing Video Transcription: Unveiling Gemma-2b-it and Langchain in the Era of Transformers

Mar 12, 2024

• 3

Unveiling TinyLlama: An Inspiring Dive into a Revolutionary Small-Scale Language Model

Jan 8, 2024

• 2

Multimodal IDEFICS: Unveiling the Transparency & Power of Open Visual Language Models

Jan 8, 2024

Streamlining Data Management with Hugging Face and DVC: A Seamless Integration

Jan 3, 2024

Leveraging Transformers and PyTorch for Multiple Choice Question Tasks

Dec 25, 2023

• 1

Uniting Forces: Integrating Hugging Face with Langchain for Enhanced Natural Language Processing

Dec 18, 2023

• 4

Detecting the Deceptive: Unmasking Deep Fake Voices

Oct 29, 2023

• 2

Hearing is Believing: Revolutionizing AI with Audio Classification via Computer Vision

Oct 22, 2023

• 1

InfiniText: Empowering Conversations & Content with Mistral-7B-Instruct-v0.1

Oct 12, 2023

Samantha and Mistral 7B: A Powerful and Versatile Language Model Duo

Oct 2, 2023

• 1

Organizations

Andyrasika's activity

reacted to merve's post with ❤️ about 19 hours ago

What a beginning to this year in open ML 🤠
Let's unwrap! https://huggingface.co/collections/merve/jan-10-releases-677fe34177759de0edfc9714

Multimodal 🖼️
> ByteDance released SA2VA: a family of vision LMs that can take image, video, text and visual prompts
> moondream2 is out with new capabilities like outputting structured data and gaze detection!
> Dataset: Alibaba DAMO lab released multimodal textbook — 22k hours' worth of samples from instruction videos 🤯
> Dataset: SciCap, a benchmark dataset for captioning scientific documents, is released along with the challenge!

LLMs 💬
> Microsoft released Phi-4, a SOTA open-source 14B language model 🔥
> Dolphin is back with Dolphin 3.0 Llama 3.1 8B 🐬🐬
> Prime-RL released Eurus-2-7B-PRIME, a new language model trained using PRIME alignment
> SmallThinker-3B is a new small reasoning LM based on Qwen2.5-3B-Instruct 💭
> Dataset: QWQ-LONGCOT-500K is the dataset used to train SmallThinker, generated using QwQ-32B-preview 📕
> Dataset: @cfahlgren1 released React Code Instructions: a dataset of code instruction-code pairs 📕
> Dataset: Qwen team is on a roll, they just released CodeElo, a dataset of code preferences 👩🏻‍💻

Embeddings 🔖
> @MoritzLaurer released a zero-shot version of ModernBERT large 👏
> KaLM is a new family of performant multilingual embedding models with MIT license built using Qwen2-0.5B

Image/Video Generation ⏯️
> NVIDIA released Cosmos, a new family of diffusion/autoregressive World Foundation Models generating worlds from images, videos and texts 🔥
> Adobe released TransPixar: a new text-to-video model that can generate assets with transparent backgrounds (a first!)
> Dataset: fal released cosmos-openvid-1m: Cosmos-tokenized samples from OpenVid-1M

Others
> Prior Labs released TabPFNv2: the best tabular transformer for classification and regression is out
> Metagene-1 is a new RNA language model that can be used for pathogen detection, zero-shot embedding and genome understanding