-
If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents,
arXiv, 2401.00812
, arxiv, pdf, cication: -1Ke Yang, Jiateng Liu, John Wu, Chaoqi Yang, Yi R. Fung, Sha Li, Zixuan Huang, Xu Cao, Xingyao Wang, Yiquan Wang
-
A Survey on Language Models for Code,
arXiv, 2311.07989
, arxiv, pdf, cication: -1Ziyin Zhang, Chaoyu Chen, Bingchang Liu, Cong Liao, Zi Gong, Hang Yu, Jianguo Li, Rui Wang
-
ReGAL: Refactoring Programs to Discover Generalizable Abstractions,
arXiv, 2401.16467
, arxiv, pdf, cication: -1Elias Stengel-Eskin, Archiki Prasad, Mohit Bansal
-
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence,
arXiv, 2401.14196
, arxiv, pdf, cication: -1Daya Guo, Qihao Zhu, Dejian Yang, Zhenda Xie, Kai Dong, Wentao Zhang, Guanting Chen, Xiao Bi, Y. Wu, Y. K. Li
-
Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering,
arXiv, 2401.08500
, arxiv, pdf, cication: -1Tal Ridnik, Dedy Kredo, Itamar Friedman
· (AlphaCodium - Codium-ai)
-
JumpCoder: Go Beyond Autoregressive Coder via Online Modification,
arXiv, 2401.07870
, arxiv, pdf, cication: -1Mouxiang Chen, Hao Tian, Zhongxin Liu, Xiaoxue Ren, Jianling Sun · (JumpCoder - Keytoyze)
-
DebugBench: Evaluating Debugging Capability of Large Language Models,
arXiv, 2401.04621
, arxiv, pdf, cication: -1Runchu Tian, Yining Ye, Yujia Qin, Xin Cong, Yankai Lin, Zhiyuan Liu, Maosong Sun
-
CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution,
arXiv, 2401.03065
, arxiv, pdf, cication: -1Alex Gu, Baptiste Rozière, Hugh Leather, Armando Solar-Lezama, Gabriel Synnaeve, Sida I. Wang
-
AST-T5: Structure-Aware Pretraining for Code Generation and Understanding,
arXiv, 2401.03003
, arxiv, pdf, cication: -1Linyuan Gong, Mostafa Elhoushi, Alvin Cheung · (ast_t5 - gonglinyuan)
-
Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models,
arXiv, 2401.00788
, arxiv, pdf, cication: -1Terry Yue Zhuo, Armel Zebaze, Nitchakarn Suppattarachai, Leandro von Werra, Harm de Vries, Qian Liu, Niklas Muennighoff
-
"I Want It That Way": Enabling Interactive Decision Support Using Large Language Models and Constraint Programming,
arXiv, 2312.06908
, arxiv, pdf, cication: -1Connor Lawless, Jakob Schoeffer, Lindy Le, Kael Rowan, Shilad Sen, Cristina St. Hill, Jina Suh, Bahar Sarrafzadeh
-
Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models,
arXiv, 2312.04724
, arxiv, pdf, cication: -1Manish Bhatt, Sahana Chennabasappa, Cyrus Nikolaidis, Shengye Wan, Ivan Evtimov, Dominik Gabi, Daniel Song, Faizan Ahmad, Cornelius Aschermann, Lorenzo Fontana
-
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator,
arXiv, 2312.04474
, arxiv, pdf, cication: -1Chengshu Li, Jacky Liang, Andy Zeng, Xinyun Chen, Karol Hausman, Dorsa Sadigh, Sergey Levine, Li Fei-Fei, Fei Xia, Brian Ichter · (chain-of-code.github)
-
Magicoder: Source Code Is All You Need,
arXiv, 2312.02120
, arxiv, pdf, cication: -1Yuxiang Wei, Zhe Wang, Jiawei Liu, Yifeng Ding, Lingming Zhang · (magicoder - ise-uiuc)
-
ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks,
arXiv, 2311.09835
, arxiv, pdf, cication: -1Yuliang Liu, Xiangru Tang, Zefan Cai, Junjie Lu, Yichi Zhang, Yanjun Shao, Zexuan Deng, Helan Hu, Zengxian Yang, Kaikai An · (ML-bench - gersteinlab)
· (drive.google) · (ml-bench.github)
-
Leveraging Large Language Models for Automated Proof Synthesis in Rust,
arXiv, 2311.03739
, arxiv, pdf, cication: -1Jianan Yao, Ziqiao Zhou, Weiteng Chen, Weidong Cui
-
MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning,
arXiv, 2311.02303
, arxiv, pdf, cication: -1Bingchang Liu, Chaoyu Chen, Cong Liao, Zi Gong, Huan Wang, Zhichao Lei, Ming Liang, Dajun Chen, Min Shen, Hailian Zhou · [github]
-
Personalised Distillation: Empowering Open-Sourced LLMs with Adaptive Learning for Code Generation, arXiv, 2310.18628, arxiv, pdf, cication: -1
Hailin Chen, Amrita Saha, Steven Hoi, Shafiq Joty
-
CodeFusion: A Pre-trained Diffusion Model for Code Generation, arXiv, 2310.17680, arxiv, pdf, cication: -1
Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Gust Verbruggen
-
ChatCoder: Chat-based Refine Requirement Improves LLMs' Code Generation, arXiv, 2311.00272, arxiv, pdf, cication: -1
Zejun Wang, Jia Li, Ge Li, Zhi Jin
· (mp.weixin.qq)
-
Large Language Models for Software Engineering: Survey and Open Problems, arXiv, 2310.03533, arxiv, pdf, cication: 1
Angela Fan, Beliz Gokkaya, Mark Harman, Mitya Lyubarskiy, Shubho Sengupta, Shin Yoo, Jie M. Zhang
-
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion, arXiv, 2310.11248, arxiv, pdf, cication: -1
Yangruibo Ding, Zijian Wang, Wasi Uddin Ahmad, Hantian Ding, Ming Tan, Nihal Jain, Murali Krishna Ramanathan, Ramesh Nallapati, Parminder Bhatia, Dan Roth
-
Ranking LLM-Generated Loop Invariants for Program Verification, arXiv, 2310.09342, arxiv, pdf, cication: -1
Saikat Chakraborty, Shuvendu K. Lahiri, Sarah Fakhoury, Madanlal Musuvathi, Akash Lal, Aseem Rastogi, Aditya Senthilnathan, Rahul Sharma, Nikhil Swamy
-
CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules, arXiv, 2310.08992, arxiv, pdf, cication: -1
Hung Le, Hailin Chen, Amrita Saha, Akash Gokul, Doyen Sahoo, Shafiq Joty
-
Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation, arXiv, 2310.02304, arxiv, pdf, cication: -1
Eric Zelikman, Eliana Lorch, Lester Mackey, Adam Tauman Kalai · mp.weixin.qq
-
CodePlan: Repository-level Coding using LLMs and Planning, arXiv, 2309.12499, arxiv, pdf, cication: -1
Ramakrishna Bairi, Atharv Sonwane, Aditya Kanade, Vageesh D C, Arun Iyer, Suresh Parthasarathy, Sriram Rajamani, B. Ashok, Shashank Shet · mp.weixin.qq
-
Can ChatGPT replace StackOverflow? A Study on Robustness and Reliability of Large Language Model Code Generation, arXiv, 2308.10335, arxiv, pdf, cication: 3
Li Zhong, Zilong Wang · jiqizhixin · mp.weixin.qq
-
Can Programming Languages Boost Each Other via Instruction Tuning?, arXiv, 2308.16824, arxiv, pdf, cication: -1
Daoguang Zan, Ailun Yu, Bo Shen, Jiaxin Zhang, Taihong Chen, Bing Geng, Bei Chen, Jichuan Ji, Yafen Yao, Yongji Wang
-
SoTaNa: The Open-Source Software Development Assistant, arXiv, 2308.13416, arxiv, pdf, cication: -1
Ensheng Shi, Fengji Zhang, Yanlin Wang, Bei Chen, Lun Du, Hongyu Zhang, Shi Han, Dongmei Zhang, Hongbin Sun · github
-
OctoPack: Instruction Tuning Code Large Language Models, arXiv, 2308.07124, arxiv, pdf, cication: 6
Niklas Muennighoff, Qian Liu, Armel Zebaze, Qinkai Zheng, Binyuan Hui, Terry Yue Zhuo, Swayam Singh, Xiangru Tang, Leandro von Werra, Shayne Longpre
-
Enhancing Network Management Using Code Generated by Large Language Models, arXiv, 2308.06261, arxiv, pdf, cication: -1
Sathiya Kumaran Mani, Yajie Zhou, Kevin Hsieh, Santiago Segarra, Ranveer Chandra, Srikanth Kandula
-
PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback, arXiv, 2307.14936, arxiv, pdf, cication: 9
Bo Shen, Jiaxin Zhang, Taihong Chen, Daoguang Zan, Bing Geng, An Fu, Muhan Zeng, Ailun Yu, Jichuan Ji, Jingyang Zhao · jiqizhixin
-
Predicting Code Coverage without Execution, arXiv, 2307.13383, arxiv, pdf, cication: 1
Michele Tufano, Shubham Chandel, Anisha Agarwal, Neel Sundaresan, Colin Clement
-
Communicative Agents for Software Development, arXiv, 2307.07924, arxiv, pdf, cication: 23
Chen Qian, Xin Cong, Wei Liu, Cheng Yang, Weize Chen, Yusheng Su, Yufan Dang, Jiahao Li, Juyuan Xu, Dahai Li · jiqizhixin
-
Software Testing with Large Language Models: Survey, Landscape, and Vision,
arXiv, 2307.07221
, arxiv, pdf, cication: -1Junjie Wang, Yuchao Huang, Chunyang Chen, Zhe Liu, Song Wang, Qing Wang · (LLM4SoftwareTesting - LLM-Testing)
· (qbitai)
-
RLTF: Reinforcement Learning from Unit Test Feedback, arXiv, 2307.04349, arxiv, pdf, cication: -1
Jiate Liu, Yiqin Zhu, Kaiwen Xiao, Qiang Fu, Xiao Han, Wei Yang, Deheng Ye · github
-
CodeT5+: Open Code Large Language Models for Code Understanding and Generation, arXiv, 2305.07922, arxiv, pdf, cication: 43
Yue Wang, Hung Le, Akhilesh Deepak Gotmare, Nghi D. Q. Bui, Junnan Li, Steven C. H. Hoi · jiqizhixin
-
Guiding Language Models of Code with Global Context using Monitors, arXiv, 2306.10763, arxiv, pdf, cication: 3
Lakshya A Agrawal, Aditya Kanade, Navin Goyal, Shuvendu K. Lahiri, Sriram K. Rajamani
-
RepoFusion: Training Code Models to Understand Your Repository, arXiv, 2306.10998, arxiv, pdf, cication: -1
Disha Shrivastava, Denis Kocetkov, Harm de Vries, Dzmitry Bahdanau, Torsten Scholak
-
Is Self-Repair a Silver Bullet for Code Generation?, arXiv, 2306.09896, arxiv, pdf, cication: 17
Theo X. Olausson, Jeevana Priya Inala, Chenglong Wang, Jianfeng Gao, Armando Solar-Lezama · mp.weixin.qq
-
WizardCoder: Empowering Code Large Language Models with Evol-Instruct, arXiv, 2306.08568, arxiv, pdf, cication: 44
Ziyang Luo, Can Xu, Pu Zhao, Qingfeng Sun, Xiubo Geng, Wenxiang Hu, Chongyang Tao, Jing Ma, Qingwei Lin, Daxin Jiang · jiqizhixin
-
Learning Transformer Programs, arXiv, 2306.01128, arxiv, pdf, cication: 2
Dan Friedman, Alexander Wettig, Danqi Chen · github
-
Large Language Models of Code Fail at Completing Code with Potential Bugs, arXiv, 2306.03438, arxiv, pdf, cication: 2
Tuan Dinh, Jinman Zhao, Samson Tan, Renato Negrinho, Leonard Lausen, Sheng Zha, George Karypis
-
Teaching Large Language Models to Self-Debug, arXiv, 2304.05128, arxiv, pdf, cication: 78
Xinyun Chen, Maxwell Lin, Nathanael Schärli, Denny Zhou
-
InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback, arXiv, 2306.14898, arxiv, pdf, cication: 7
John Yang, Akshara Prabhakar, Karthik Narasimhan, Shunyu Yao · intercode-benchmark.github
-
DeciCoder-6B-Demo - Deci 🤗
-
LLM4SoftwareTesting - LLM-Testing
-
stable-code-3b - stabilityai 🤗
-
Mastering-GitHub-Copilot-for-Paired-Programming - microsoft
A 6 Lesson course teaching everything you need to know about harnessing GitHub Copilot and an AI Paired Programing resource.
-
sweep - sweepai
Sweep: AI-powered Junior Developer for small features and bug fixes.
-
wizardlm - nlpxucan
Family of instruction-following LLMs powered by Evol-Instruct: WizardLM, WizardCoder
-
codellama-13b-oasst-sft-v10 - OpenAssistant 🤗
-
DeepSeek-Coder - deepseek-ai
DeepSeek Coder: Let the Code Write Itself
-
CodeT - microsoft
-
SolidGPT - AI-Citizen
Chat everything with your code repository, ask repository level code questions, and discuss your requirements. AI Scan and learning your code repository, provide you code repository level answer🧱 🧱
-
codeshell - WisdomShell
A series of code large language models developed by PKU-KCL · mp.weixin.qq
-
replit-code-v1_5-3b - replit 🤗
-
gpt-pilot - Pythagora-io
PoC for a scalable dev tool that writes entire apps from scratch while the developer oversees the implementation
-
codellama - facebookresearch
Inference code for CodeLlama models · huggingface · huggingface · github
· jiqizhixin · huggingface
· (promptingguide)
-
CodeLlama-70b-hf-4bit-MLX - mlx-community 🤗
-
sqlcoder - defog-ai
SoTA LLM for converting natural language questions to SQL queries · [jiqizhixin]
-
DeciCoder-1b - Deci 🤗
-
MiniChain - srush
A tiny library for coding with large language models.
-
stablecode-completion-alpha-3b - stabilityai 🤗
· qbitai
-
continue - continuedev
⏩ the open-source autopilot for software development—a VS Code extension that brings the power of ChatGPT to your IDE · [jiqizhixin]
-
CodeGeeX2 - THUDM
CodeGeeX2: A More Powerful Multilingual Code Generation Model
-
codeinterpreter-api - shroominic
Open source implementation of the ChatGPT Code Interpreter 👾
-
AmadeusGPT - AdaptiveMotorControlLab
We turn natural language descriptions of behaviors into machine-executable code
-
aider - paul-gauthier
aider is GPT powered coding in your terminal
-
gpt-migrate - 0xpayne
Easily migrate your codebase from one framework or language to another.
-
gpt-engineer - AntonOsika
Specify what you want it to build, the AI asks for clarification, and then builds it.