An Yang's picture

An Yang

yangapku

·

https://scholar.google.com/citations?user=vO9FZekAAAAJ

AI & ML interests

NLP and Deep Learning

Recent Activity

upvoted a paper 9 days ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

upvoted a paper 23 days ago

Qwen2.5 Technical Report

upvoted a paper about 1 month ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

View all activity

Organizations

yangapku's activity

upvoted a paper 9 days ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 9 days ago • 45

upvoted a paper 23 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 23 days ago • 339

upvoted a paper about 1 month ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 74

updated a model about 2 months ago

Qwen/QwQ-32B-Preview

Text Generation • Updated Nov 29, 2024 • 131k • • 1.52k

updated 2 models 2 months ago

Qwen/Qwen2.5-Coder-7B-Instruct

Text Generation • Updated Nov 18, 2024 • 109k • 383

Qwen/Qwen2.5-Coder-1.5B-Instruct

Text Generation • Updated Nov 18, 2024 • 12.9k • 51

updated a model 4 months ago

Qwen/Qwen2-VL-72B-Instruct-AWQ

Image-Text-to-Text • Updated Sep 25, 2024 • 28.3k • 42

authored 11 papers 4 months ago

Qwen Technical Report

Paper • 2309.16609 • Published Sep 28, 2023 • 35

InterBERT: Vision-and-Language Interaction for Multi-modal Pretraining

Paper • 2003.13198 • Published Mar 30, 2020

ExpertPrompting: Instructing Large Language Models to be Distinguished Experts

Paper • 2305.14688 • Published May 24, 2023

M6: A Chinese Multimodal Pretrainer

Paper • 2103.00823 • Published Mar 1, 2021

M6-T: Exploring Sparse Expert Models and Beyond

Paper • 2105.15082 • Published May 31, 2021 • 1

Prompt Tuning for Generative Multimodal Pretrained Models

Paper • 2208.02532 • Published Aug 4, 2022

Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese

Paper • 2211.01335 • Published Nov 2, 2022 • 1

Transferring General Multimodal Pretrained Models to Text Recognition

Paper • 2212.09297 • Published Dec 19, 2022

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 160

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 140

Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement

Paper • 2409.12122 • Published Sep 18, 2024 • 3

New activity in Qwen/Qwen2.5-Math-RM-72B 4 months ago

Create README.md

#1 opened 4 months ago by

updated a model 4 months ago

Qwen/Qwen2-VL-7B

Image-Text-to-Text • Updated 9 days ago • 11k • 31