Awesome-code-llm

- Survey
- Paper
- Projects
- Other

Survey

If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents, arXiv, 2401.00812, arxiv, pdf, citation: -1

Ke Yang, Jiateng Liu, John Wu, Chaoqi Yang, Yi R. Fung, Sha Li, Zixuan Huang, Xu Cao, Xingyao Wang, Yiquan Wang
A Survey on Language Models for Code, arXiv, 2311.07989, arxiv, pdf, citation: -1

Ziyin Zhang, Chaoyu Chen, Bingchang Liu, Cong Liao, Zi Gong, Hang Yu, Jianguo Li, Rui Wang

Paper

ReGAL: Refactoring Programs to Discover Generalizable Abstractions, arXiv, 2401.16467, arxiv, pdf, citation: -1

Elias Stengel-Eskin, Archiki Prasad, Mohit Bansal
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence, arXiv, 2401.14196, arxiv, pdf, citation: -1

Daya Guo, Qihao Zhu, Dejian Yang, Zhenda Xie, Kai Dong, Wentao Zhang, Guanting Chen, Xiao Bi, Y. Wu, Y. K. Li
Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering, arXiv, 2401.08500, arxiv, pdf, citation: -1

Tal Ridnik, Dedy Kredo, Itamar Friedman

· (AlphaCodium - Codium-ai)
JumpCoder: Go Beyond Autoregressive Coder via Online Modification, arXiv, 2401.07870, arxiv, pdf, citation: -1

Mouxiang Chen, Hao Tian, Zhongxin Liu, Xiaoxue Ren, Jianling Sun · (JumpCoder - Keytoyze)
DebugBench: Evaluating Debugging Capability of Large Language Models, arXiv, 2401.04621, arxiv, pdf, citation: -1

Runchu Tian, Yining Ye, Yujia Qin, Xin Cong, Yankai Lin, Zhiyuan Liu, Maosong Sun
CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution, arXiv, 2401.03065, arxiv, pdf, citation: -1

Alex Gu, Baptiste Rozière, Hugh Leather, Armando Solar-Lezama, Gabriel Synnaeve, Sida I. Wang
AST-T5: Structure-Aware Pretraining for Code Generation and Understanding, arXiv, 2401.03003, arxiv, pdf, citation: -1

Linyuan Gong, Mostafa Elhoushi, Alvin Cheung · (ast_t5 - gonglinyuan)
Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models, arXiv, 2401.00788, arxiv, pdf, citation: -1

Terry Yue Zhuo, Armel Zebaze, Nitchakarn Suppattarachai, Leandro von Werra, Harm de Vries, Qian Liu, Niklas Muennighoff
"I Want It That Way": Enabling Interactive Decision Support Using Large Language Models and Constraint Programming, arXiv, 2312.06908, arxiv, pdf, citation: -1

Connor Lawless, Jakob Schoeffer, Lindy Le, Kael Rowan, Shilad Sen, Cristina St. Hill, Jina Suh, Bahar Sarrafzadeh
Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models, arXiv, 2312.04724, arxiv, pdf, citation: -1

Manish Bhatt, Sahana Chennabasappa, Cyrus Nikolaidis, Shengye Wan, Ivan Evtimov, Dominik Gabi, Daniel Song, Faizan Ahmad, Cornelius Aschermann, Lorenzo Fontana
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator, arXiv, 2312.04474, arxiv, pdf, citation: -1

Chengshu Li, Jacky Liang, Andy Zeng, Xinyun Chen, Karol Hausman, Dorsa Sadigh, Sergey Levine, Li Fei-Fei, Fei Xia, Brian Ichter · (chain-of-code.github)
Magicoder: Source Code Is All You Need, arXiv, 2312.02120, arxiv, pdf, citation: -1

Yuxiang Wei, Zhe Wang, Jiawei Liu, Yifeng Ding, Lingming Zhang · (magicoder - ise-uiuc)
ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks, arXiv, 2311.09835, arxiv, pdf, citation: -1

Yuliang Liu, Xiangru Tang, Zefan Cai, Junjie Lu, Yichi Zhang, Yanjun Shao, Zexuan Deng, Helan Hu, Zengxian Yang, Kaikai An · (ML-bench - gersteinlab) · (drive.google) · (ml-bench.github)
Leveraging Large Language Models for Automated Proof Synthesis in Rust, arXiv, 2311.03739, arxiv, pdf, citation: -1

Jianan Yao, Ziqiao Zhou, Weiteng Chen, Weidong Cui
MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning, arXiv, 2311.02303, arxiv, pdf, citation: -1

Bingchang Liu, Chaoyu Chen, Cong Liao, Zi Gong, Huan Wang, Zhichao Lei, Ming Liang, Dajun Chen, Min Shen, Hailian Zhou · [github]
Personalised Distillation: Empowering Open-Sourced LLMs with Adaptive Learning for Code Generation, arXiv, 2310.18628, arxiv, pdf, citation: -1

Hailin Chen, Amrita Saha, Steven Hoi, Shafiq Joty
CodeFusion: A Pre-trained Diffusion Model for Code Generation, arXiv, 2310.17680, arxiv, pdf, citation: -1

Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Gust Verbruggen
ChatCoder: Chat-based Refine Requirement Improves LLMs' Code Generation, arXiv, 2311.00272, arxiv, pdf, citation: -1

Zejun Wang, Jia Li, Ge Li, Zhi Jin

· (mp.weixin.qq)
Large Language Models for Software Engineering: Survey and Open Problems, arXiv, 2310.03533, arxiv, pdf, citation: 1

Angela Fan, Beliz Gokkaya, Mark Harman, Mitya Lyubarskiy, Shubho Sengupta, Shin Yoo, Jie M. Zhang
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion, arXiv, 2310.11248, arxiv, pdf, citation: -1

Yangruibo Ding, Zijian Wang, Wasi Uddin Ahmad, Hantian Ding, Ming Tan, Nihal Jain, Murali Krishna Ramanathan, Ramesh Nallapati, Parminder Bhatia, Dan Roth
Ranking LLM-Generated Loop Invariants for Program Verification, arXiv, 2310.09342, arxiv, pdf, citation: -1

Saikat Chakraborty, Shuvendu K. Lahiri, Sarah Fakhoury, Madanlal Musuvathi, Akash Lal, Aseem Rastogi, Aditya Senthilnathan, Rahul Sharma, Nikhil Swamy
CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules, arXiv, 2310.08992, arxiv, pdf, citation: -1

Hung Le, Hailin Chen, Amrita Saha, Akash Gokul, Doyen Sahoo, Shafiq Joty
Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation, arXiv, 2310.02304, arxiv, pdf, citation: -1

Eric Zelikman, Eliana Lorch, Lester Mackey, Adam Tauman Kalai · mp.weixin.qq
CodePlan: Repository-level Coding using LLMs and Planning, arXiv, 2309.12499, arxiv, pdf, citation: -1

Ramakrishna Bairi, Atharv Sonwane, Aditya Kanade, Vageesh D C, Arun Iyer, Suresh Parthasarathy, Sriram Rajamani, B. Ashok, Shashank Shet · mp.weixin.qq
Can ChatGPT replace StackOverflow? A Study on Robustness and Reliability of Large Language Model Code Generation, arXiv, 2308.10335, arxiv, pdf, citation: 3

Li Zhong, Zilong Wang · jiqizhixin · mp.weixin.qq
Can Programming Languages Boost Each Other via Instruction Tuning?, arXiv, 2308.16824, arxiv, pdf, citation: -1

Daoguang Zan, Ailun Yu, Bo Shen, Jiaxin Zhang, Taihong Chen, Bing Geng, Bei Chen, Jichuan Ji, Yafen Yao, Yongji Wang
SoTaNa: The Open-Source Software Development Assistant, arXiv, 2308.13416, arxiv, pdf, citation: -1

Ensheng Shi, Fengji Zhang, Yanlin Wang, Bei Chen, Lun Du, Hongyu Zhang, Shi Han, Dongmei Zhang, Hongbin Sun · github
OctoPack: Instruction Tuning Code Large Language Models, arXiv, 2308.07124, arxiv, pdf, citation: 6

Niklas Muennighoff, Qian Liu, Armel Zebaze, Qinkai Zheng, Binyuan Hui, Terry Yue Zhuo, Swayam Singh, Xiangru Tang, Leandro von Werra, Shayne Longpre
Enhancing Network Management Using Code Generated by Large Language Models, arXiv, 2308.06261, arxiv, pdf, citation: -1

Sathiya Kumaran Mani, Yajie Zhou, Kevin Hsieh, Santiago Segarra, Ranveer Chandra, Srikanth Kandula
PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback, arXiv, 2307.14936, arxiv, pdf, citation: 9

Bo Shen, Jiaxin Zhang, Taihong Chen, Daoguang Zan, Bing Geng, An Fu, Muhan Zeng, Ailun Yu, Jichuan Ji, Jingyang Zhao · jiqizhixin
Predicting Code Coverage without Execution, arXiv, 2307.13383, arxiv, pdf, citation: 1

Michele Tufano, Shubham Chandel, Anisha Agarwal, Neel Sundaresan, Colin Clement
Communicative Agents for Software Development, arXiv, 2307.07924, arxiv, pdf, citation: 23

Chen Qian, Xin Cong, Wei Liu, Cheng Yang, Weize Chen, Yusheng Su, Yufan Dang, Jiahao Li, Juyuan Xu, Dahai Li · jiqizhixin
Software Testing with Large Language Models: Survey, Landscape, and Vision, arXiv, 2307.07221, arxiv, pdf, citation: -1

Junjie Wang, Yuchao Huang, Chunyang Chen, Zhe Liu, Song Wang, Qing Wang · (LLM4SoftwareTesting - LLM-Testing) · (qbitai)
RLTF: Reinforcement Learning from Unit Test Feedback, arXiv, 2307.04349, arxiv, pdf, citation: -1

Jiate Liu, Yiqin Zhu, Kaiwen Xiao, Qiang Fu, Xiao Han, Wei Yang, Deheng Ye · github
CodeT5+: Open Code Large Language Models for Code Understanding and Generation, arXiv, 2305.07922, arxiv, pdf, citation: 43

Yue Wang, Hung Le, Akhilesh Deepak Gotmare, Nghi D. Q. Bui, Junnan Li, Steven C. H. Hoi · jiqizhixin
Guiding Language Models of Code with Global Context using Monitors, arXiv, 2306.10763, arxiv, pdf, citation: 3

Lakshya A Agrawal, Aditya Kanade, Navin Goyal, Shuvendu K. Lahiri, Sriram K. Rajamani
RepoFusion: Training Code Models to Understand Your Repository, arXiv, 2306.10998, arxiv, pdf, citation: -1

Disha Shrivastava, Denis Kocetkov, Harm de Vries, Dzmitry Bahdanau, Torsten Scholak
Is Self-Repair a Silver Bullet for Code Generation?, arXiv, 2306.09896, arxiv, pdf, citation: 17

Theo X. Olausson, Jeevana Priya Inala, Chenglong Wang, Jianfeng Gao, Armando Solar-Lezama · mp.weixin.qq
WizardCoder: Empowering Code Large Language Models with Evol-Instruct, arXiv, 2306.08568, arxiv, pdf, citation: 44

Ziyang Luo, Can Xu, Pu Zhao, Qingfeng Sun, Xiubo Geng, Wenxiang Hu, Chongyang Tao, Jing Ma, Qingwei Lin, Daxin Jiang · jiqizhixin
Learning Transformer Programs, arXiv, 2306.01128, arxiv, pdf, citation: 2

Dan Friedman, Alexander Wettig, Danqi Chen · github
Large Language Models of Code Fail at Completing Code with Potential Bugs, arXiv, 2306.03438, arxiv, pdf, citation: 2

Tuan Dinh, Jinman Zhao, Samson Tan, Renato Negrinho, Leonard Lausen, Sheng Zha, George Karypis
Teaching Large Language Models to Self-Debug, arXiv, 2304.05128, arxiv, pdf, citation: 78

Xinyun Chen, Maxwell Lin, Nathanael Schärli, Denny Zhou
InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback, arXiv, 2306.14898, arxiv, pdf, citation: 7

John Yang, Akshara Prabhakar, Karthik Narasimhan, Shunyu Yao · intercode-benchmark.github

Projects

DeciCoder-6B-Demo - Deci 🤗
LLM4SoftwareTesting - LLM-Testing
stable-code-3b - stabilityai 🤗
Mastering-GitHub-Copilot-for-Paired-Programming - microsoft

A 6-lesson course teaching everything you need to know about harnessing GitHub Copilot as an AI paired-programming resource.
sweep - sweepai

Sweep: AI-powered Junior Developer for small features and bug fixes.
wizardlm - nlpxucan

Family of instruction-following LLMs powered by Evol-Instruct: WizardLM, WizardCoder
codellama-13b-oasst-sft-v10 - OpenAssistant 🤗
DeepSeek-Coder - deepseek-ai

DeepSeek Coder: Let the Code Write Itself
CodeT - microsoft
SolidGPT - AI-Citizen

Chat about anything with your code repository: ask repository-level code questions and discuss your requirements. The AI scans and learns your code repository to provide repository-level answers. 🧱🧱
codeshell - WisdomShell

A series of code large language models developed by PKU-KCL · mp.weixin.qq
replit-code-v1_5-3b - replit 🤗
gpt-pilot - Pythagora-io

PoC for a scalable dev tool that writes entire apps from scratch while the developer oversees the implementation
codellama - facebookresearch

Inference code for CodeLlama models · huggingface · huggingface · github · jiqizhixin · huggingface

· (promptingguide)
CodeLlama-70b-hf-4bit-MLX - mlx-community 🤗
sqlcoder - defog-ai

SoTA LLM for converting natural language questions to SQL queries · [jiqizhixin]
DeciCoder-1b - Deci 🤗
MiniChain - srush

A tiny library for coding with large language models.
stablecode-completion-alpha-3b - stabilityai 🤗

· qbitai
continue - continuedev

⏩ the open-source autopilot for software development—a VS Code extension that brings the power of ChatGPT to your IDE · [jiqizhixin]
CodeGeeX2 - THUDM

CodeGeeX2: A More Powerful Multilingual Code Generation Model
codeinterpreter-api - shroominic

Open source implementation of the ChatGPT Code Interpreter 👾
AmadeusGPT - AdaptiveMotorControlLab

We turn natural language descriptions of behaviors into machine-executable code
aider - paul-gauthier

aider is GPT-powered coding in your terminal
gpt-migrate - 0xpayne

Easily migrate your codebase from one framework or language to another.
gpt-engineer - AntonOsika

Specify what you want it to build, the AI asks for clarification, and then builds it.

Other

Personal Copilot: Train Your Own Coding Assistant
Introducing SafeCoder
SafeCoder vs. Closed-source Code Assistants
AGI降临派技术闭门会20230826 (AGI Advent Faction closed-door technical meeting, 2023-08-26) - YouTube
