Awesome-code-llm

Survey

  • If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents, arXiv, 2401.00812, arxiv, pdf, citation: -1

    Ke Yang, Jiateng Liu, John Wu, Chaoqi Yang, Yi R. Fung, Sha Li, Zixuan Huang, Xu Cao, Xingyao Wang, Yiquan Wang

  • A Survey on Language Models for Code, arXiv, 2311.07989, arxiv, pdf, citation: -1

    Ziyin Zhang, Chaoyu Chen, Bingchang Liu, Cong Liao, Zi Gong, Hang Yu, Jianguo Li, Rui Wang

Paper

  • ReGAL: Refactoring Programs to Discover Generalizable Abstractions, arXiv, 2401.16467, arxiv, pdf, citation: -1

    Elias Stengel-Eskin, Archiki Prasad, Mohit Bansal

  • DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence, arXiv, 2401.14196, arxiv, pdf, citation: -1

    Daya Guo, Qihao Zhu, Dejian Yang, Zhenda Xie, Kai Dong, Wentao Zhang, Guanting Chen, Xiao Bi, Y. Wu, Y. K. Li

  • Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering, arXiv, 2401.08500, arxiv, pdf, citation: -1

    Tal Ridnik, Dedy Kredo, Itamar Friedman

    · (AlphaCodium - Codium-ai)

  • JumpCoder: Go Beyond Autoregressive Coder via Online Modification, arXiv, 2401.07870, arxiv, pdf, citation: -1

    Mouxiang Chen, Hao Tian, Zhongxin Liu, Xiaoxue Ren, Jianling Sun · (JumpCoder - Keytoyze)

  • DebugBench: Evaluating Debugging Capability of Large Language Models, arXiv, 2401.04621, arxiv, pdf, citation: -1

    Runchu Tian, Yining Ye, Yujia Qin, Xin Cong, Yankai Lin, Zhiyuan Liu, Maosong Sun

  • CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution, arXiv, 2401.03065, arxiv, pdf, citation: -1

    Alex Gu, Baptiste Rozière, Hugh Leather, Armando Solar-Lezama, Gabriel Synnaeve, Sida I. Wang

  • AST-T5: Structure-Aware Pretraining for Code Generation and Understanding, arXiv, 2401.03003, arxiv, pdf, citation: -1

    Linyuan Gong, Mostafa Elhoushi, Alvin Cheung · (ast_t5 - gonglinyuan)

  • Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models, arXiv, 2401.00788, arxiv, pdf, citation: -1

    Terry Yue Zhuo, Armel Zebaze, Nitchakarn Suppattarachai, Leandro von Werra, Harm de Vries, Qian Liu, Niklas Muennighoff

  • "I Want It That Way": Enabling Interactive Decision Support Using Large Language Models and Constraint Programming, arXiv, 2312.06908, arxiv, pdf, citation: -1

    Connor Lawless, Jakob Schoeffer, Lindy Le, Kael Rowan, Shilad Sen, Cristina St. Hill, Jina Suh, Bahar Sarrafzadeh

  • Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models, arXiv, 2312.04724, arxiv, pdf, citation: -1

    Manish Bhatt, Sahana Chennabasappa, Cyrus Nikolaidis, Shengye Wan, Ivan Evtimov, Dominik Gabi, Daniel Song, Faizan Ahmad, Cornelius Aschermann, Lorenzo Fontana

  • Chain of Code: Reasoning with a Language Model-Augmented Code Emulator, arXiv, 2312.04474, arxiv, pdf, citation: -1

    Chengshu Li, Jacky Liang, Andy Zeng, Xinyun Chen, Karol Hausman, Dorsa Sadigh, Sergey Levine, Li Fei-Fei, Fei Xia, Brian Ichter · (chain-of-code.github)

  • Magicoder: Source Code Is All You Need, arXiv, 2312.02120, arxiv, pdf, citation: -1

    Yuxiang Wei, Zhe Wang, Jiawei Liu, Yifeng Ding, Lingming Zhang · (magicoder - ise-uiuc)

  • ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks, arXiv, 2311.09835, arxiv, pdf, citation: -1

    Yuliang Liu, Xiangru Tang, Zefan Cai, Junjie Lu, Yichi Zhang, Yanjun Shao, Zexuan Deng, Helan Hu, Zengxian Yang, Kaikai An · (ML-bench - gersteinlab) · (drive.google) · (ml-bench.github)

  • Leveraging Large Language Models for Automated Proof Synthesis in Rust, arXiv, 2311.03739, arxiv, pdf, citation: -1

    Jianan Yao, Ziqiao Zhou, Weiteng Chen, Weidong Cui

  • MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning, arXiv, 2311.02303, arxiv, pdf, citation: -1

    Bingchang Liu, Chaoyu Chen, Cong Liao, Zi Gong, Huan Wang, Zhichao Lei, Ming Liang, Dajun Chen, Min Shen, Hailian Zhou · (github)

  • Personalised Distillation: Empowering Open-Sourced LLMs with Adaptive Learning for Code Generation, arXiv, 2310.18628, arxiv, pdf, citation: -1

    Hailin Chen, Amrita Saha, Steven Hoi, Shafiq Joty

  • CodeFusion: A Pre-trained Diffusion Model for Code Generation, arXiv, 2310.17680, arxiv, pdf, citation: -1

    Mukul Singh, José Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, Gust Verbruggen

  • ChatCoder: Chat-based Refine Requirement Improves LLMs' Code Generation, arXiv, 2311.00272, arxiv, pdf, citation: -1

    Zejun Wang, Jia Li, Ge Li, Zhi Jin

    · (mp.weixin.qq)

  • Large Language Models for Software Engineering: Survey and Open Problems, arXiv, 2310.03533, arxiv, pdf, citation: 1

    Angela Fan, Beliz Gokkaya, Mark Harman, Mitya Lyubarskiy, Shubho Sengupta, Shin Yoo, Jie M. Zhang

  • CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion, arXiv, 2310.11248, arxiv, pdf, citation: -1

    Yangruibo Ding, Zijian Wang, Wasi Uddin Ahmad, Hantian Ding, Ming Tan, Nihal Jain, Murali Krishna Ramanathan, Ramesh Nallapati, Parminder Bhatia, Dan Roth

  • Ranking LLM-Generated Loop Invariants for Program Verification, arXiv, 2310.09342, arxiv, pdf, citation: -1

    Saikat Chakraborty, Shuvendu K. Lahiri, Sarah Fakhoury, Madanlal Musuvathi, Akash Lal, Aseem Rastogi, Aditya Senthilnathan, Rahul Sharma, Nikhil Swamy

  • CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules, arXiv, 2310.08992, arxiv, pdf, citation: -1

    Hung Le, Hailin Chen, Amrita Saha, Akash Gokul, Doyen Sahoo, Shafiq Joty

  • Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation, arXiv, 2310.02304, arxiv, pdf, citation: -1

    Eric Zelikman, Eliana Lorch, Lester Mackey, Adam Tauman Kalai · (mp.weixin.qq)

  • CodePlan: Repository-level Coding using LLMs and Planning, arXiv, 2309.12499, arxiv, pdf, citation: -1

    Ramakrishna Bairi, Atharv Sonwane, Aditya Kanade, Vageesh D C, Arun Iyer, Suresh Parthasarathy, Sriram Rajamani, B. Ashok, Shashank Shet · (mp.weixin.qq)

  • Can ChatGPT replace StackOverflow? A Study on Robustness and Reliability of Large Language Model Code Generation, arXiv, 2308.10335, arxiv, pdf, citation: 3

    Li Zhong, Zilong Wang · (jiqizhixin) · (mp.weixin.qq)

  • Can Programming Languages Boost Each Other via Instruction Tuning?, arXiv, 2308.16824, arxiv, pdf, citation: -1

    Daoguang Zan, Ailun Yu, Bo Shen, Jiaxin Zhang, Taihong Chen, Bing Geng, Bei Chen, Jichuan Ji, Yafen Yao, Yongji Wang

  • SoTaNa: The Open-Source Software Development Assistant, arXiv, 2308.13416, arxiv, pdf, citation: -1

    Ensheng Shi, Fengji Zhang, Yanlin Wang, Bei Chen, Lun Du, Hongyu Zhang, Shi Han, Dongmei Zhang, Hongbin Sun · (github)

  • OctoPack: Instruction Tuning Code Large Language Models, arXiv, 2308.07124, arxiv, pdf, citation: 6

    Niklas Muennighoff, Qian Liu, Armel Zebaze, Qinkai Zheng, Binyuan Hui, Terry Yue Zhuo, Swayam Singh, Xiangru Tang, Leandro von Werra, Shayne Longpre

  • Enhancing Network Management Using Code Generated by Large Language Models, arXiv, 2308.06261, arxiv, pdf, citation: -1

    Sathiya Kumaran Mani, Yajie Zhou, Kevin Hsieh, Santiago Segarra, Ranveer Chandra, Srikanth Kandula

  • PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback, arXiv, 2307.14936, arxiv, pdf, citation: 9

    Bo Shen, Jiaxin Zhang, Taihong Chen, Daoguang Zan, Bing Geng, An Fu, Muhan Zeng, Ailun Yu, Jichuan Ji, Jingyang Zhao · (jiqizhixin)

  • Predicting Code Coverage without Execution, arXiv, 2307.13383, arxiv, pdf, citation: 1

    Michele Tufano, Shubham Chandel, Anisha Agarwal, Neel Sundaresan, Colin Clement

  • Communicative Agents for Software Development, arXiv, 2307.07924, arxiv, pdf, citation: 23

    Chen Qian, Xin Cong, Wei Liu, Cheng Yang, Weize Chen, Yusheng Su, Yufan Dang, Jiahao Li, Juyuan Xu, Dahai Li · (jiqizhixin)

  • Software Testing with Large Language Models: Survey, Landscape, and Vision, arXiv, 2307.07221, arxiv, pdf, citation: -1

    Junjie Wang, Yuchao Huang, Chunyang Chen, Zhe Liu, Song Wang, Qing Wang · (LLM4SoftwareTesting - LLM-Testing) · (qbitai)

  • RLTF: Reinforcement Learning from Unit Test Feedback, arXiv, 2307.04349, arxiv, pdf, citation: -1

    Jiate Liu, Yiqin Zhu, Kaiwen Xiao, Qiang Fu, Xiao Han, Wei Yang, Deheng Ye · (github)

  • CodeT5+: Open Code Large Language Models for Code Understanding and Generation, arXiv, 2305.07922, arxiv, pdf, citation: 43

    Yue Wang, Hung Le, Akhilesh Deepak Gotmare, Nghi D. Q. Bui, Junnan Li, Steven C. H. Hoi · (jiqizhixin)

  • Guiding Language Models of Code with Global Context using Monitors, arXiv, 2306.10763, arxiv, pdf, citation: 3

    Lakshya A Agrawal, Aditya Kanade, Navin Goyal, Shuvendu K. Lahiri, Sriram K. Rajamani

  • RepoFusion: Training Code Models to Understand Your Repository, arXiv, 2306.10998, arxiv, pdf, citation: -1

    Disha Shrivastava, Denis Kocetkov, Harm de Vries, Dzmitry Bahdanau, Torsten Scholak

  • Is Self-Repair a Silver Bullet for Code Generation?, arXiv, 2306.09896, arxiv, pdf, citation: 17

    Theo X. Olausson, Jeevana Priya Inala, Chenglong Wang, Jianfeng Gao, Armando Solar-Lezama · (mp.weixin.qq)

  • WizardCoder: Empowering Code Large Language Models with Evol-Instruct, arXiv, 2306.08568, arxiv, pdf, citation: 44

    Ziyang Luo, Can Xu, Pu Zhao, Qingfeng Sun, Xiubo Geng, Wenxiang Hu, Chongyang Tao, Jing Ma, Qingwei Lin, Daxin Jiang · (jiqizhixin)

  • Learning Transformer Programs, arXiv, 2306.01128, arxiv, pdf, citation: 2

    Dan Friedman, Alexander Wettig, Danqi Chen · (github)

  • Large Language Models of Code Fail at Completing Code with Potential Bugs, arXiv, 2306.03438, arxiv, pdf, citation: 2

    Tuan Dinh, Jinman Zhao, Samson Tan, Renato Negrinho, Leonard Lausen, Sheng Zha, George Karypis

  • Teaching Large Language Models to Self-Debug, arXiv, 2304.05128, arxiv, pdf, citation: 78

    Xinyun Chen, Maxwell Lin, Nathanael Schärli, Denny Zhou

  • InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback, arXiv, 2306.14898, arxiv, pdf, citation: 7

    John Yang, Akshara Prabhakar, Karthik Narasimhan, Shunyu Yao · (intercode-benchmark.github)

Projects

Other