Stars
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
🐳 Implementation of various Distributional Reinforcement Learning Algorithms using TensorFlow2.
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
Implementation of Axial attention - attending to multi-dimensional data efficiently
sbi is a Python package for simulation-based inference, designed to meet the needs of both researchers and practitioners. Whether you need fine-grained control or an easy-to-use interface, sbi has …
This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mastering CUDA programming. Whether you're just starting or look…
Fully open reproduction of DeepSeek-R1
A curated list of resources about generative flow networks (GFlowNets).
Generative Flow Networks - GFlowNet
Best Practices on Recommendation Systems
Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Google Research
Course materials of "Bayesian Modelling and Probabilistic Programming with Numpyro, and Deep Generative Surrogates for Epidemiology"
A curated list of Large Language Model resources, covering model training, serving, fine-tuning, and building LLM applications.
Datasets with baselines for offline multi-agent reinforcement learning.
🕹️ A diverse suite of scalable reinforcement learning environments in JAX
A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
Official implementation for "CLIP-ReID: Exploiting Vision-Language Model for Image Re-identification without Concrete Text Labels" (AAAI 2023)
Guides, papers, lecture, notebooks and resources for prompt engineering
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
The official gpt4free repository | various collection of powerful language models | o4, o3 and deepseek r1, gpt-4.1, gemini 2.5
Supercharge Your LLM Application Evaluations
[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing