
Di Zhang

qq8933

https://scholar.google.com/citations?user=vxAO250AAAAJ&hl=en

AI & ML interests

AI4Chem, LLM, Green LLM

Recent Activity

updated a dataset 1 day ago

qq8933/MATH12000

upvoted a paper 2 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

new activity 13 days ago

AI4Chem/ChemBench4K: Some issues with the ChemBench dataset


Organizations

Posts 21

Post

1233

News! The ChemVLM code is now open source: https://github.com/AI4Chem/ChemVlm

Post

2593

LLaMA-O1-PRM and LLaMA-O1-Reinforcement will be released this weekend.
We have implemented a novel reinforcement fine-tuning (RFT) pipeline that teaches models reasoning and reward labeling without human annotation.


Collections 2

Papers 6

models 1

qq8933/OpenLongCoT-Base-Gemma2-2B

Updated Oct 29, 2024 • 17 • 8


Collections 2

SimpleBerry/LLaMA-O1-Supervised-1129

SimpleBerry/LLaMA-O1-Base-1127

SimpleBerry/OpenLongCoT-Pretrain-1202

SimpleBerry/OpenLongCoT-SFT

YeungNLP/firefly-train-1.1M

stingning/ultrachat

Open-Orca/OpenOrca

Vezora/Tested-143k-Python-Alpaca

datasets 35

qq8933/MATH12000

qq8933/OpenLongCoT-prm-rectify

qq8933/AIME_1983_2024

qq8933/UltraChat-200k

qq8933/OpenLongCoT-Pretrain-v2-filtered

qq8933/OpenLongCoT-Pretrain-v2

qq8933/OpenLongCoT-SFT-v2

qq8933/OpenLongCoT-SFT-v2-filtered

qq8933/OpenLongCoT-SFT-problems-v2

qq8933/llama_o1_offline_training_data_v1
