Repository navigation

#

llm4code

[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct

Python
2016
6 个月前

A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering

710
9 个月前

[NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation

Python
304
2 个月前

MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler and analysis tools for LLMs.

Python
174
5 天前

For our CCS24 paper 🏆 "ReSym: Harnessing LLMs to Recover Variable and Data Structure Symbols from Stripped Binaries" by Danning Xie, Zhuo Zhang, Nan Jiang, Xiangzhe Xu, Lin Tan, and Xiangyu Zhang. 🏆 ACM SIGSAC Distinguished Paper Award Winner

Makefile
92
10 天前

✅SRepair: Powerful LLM-based Program Repairer with $0.029/Fixed Bug

Python
62
1 年前

For our ICSE23 paper "Impact of Code Language Models on Automated Program Repair" by Nan Jiang, Kevin Liu, Thibaud Lutellier, and Lin Tan

Python
60
6 个月前

Indexing three datasets for GPTScan

Python
58
10 个月前

For our ISSTA23 paper "How Effective are Neural Networks for Fixing Security Vulnerabilities?" by Yi Wu, Nan Jiang, Hung Viet Pham, Thibaud Lutellier, Jordan Davis, Lin Tan, Petr Babkin, and Sameena Shah.

Java
36
1 年前

Flow Chart Image-to-Code Generation

Python
32
2 年前

Official repo for "HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation Task"

Python
27
13 天前

Collections of research, benchmarks and tools towards more robust and reliable language models for code; LM4Code; LM4SE; reliable LLM; LLM4Code

26
1 年前

Code for ACL (main) paper "JumpCoder: Go Beyond Autoregressive Coder via Online Modification"

Python
25
1 年前

Dataflow-guided retrieval augmentation for repository-level code completion, ACL 2024 (main)

Python
23
1 个月前

Can Language Models Replace Programmers? RepoCod Says ‘Not Yet’ - by Shanchao Liang and Yiran Hu and Nan Jiang and Lin Tan

Python
17
1 个月前

Simultaneous evaluation on both functionality and security of LLM-generated code.

Python
13
3 个月前

[AAAI 2025] The official code of the paper "InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct"(https://arxiv.org/abs/2407.05700).

Python
11
9 个月前

WAFFLE: Multi-Modal Model for Automated Front-End Development - by Shanchao Liang and Nan Jiang and Shangshu Qian and Lin Tan

Python
10
3 个月前

For our AAAI25 paper LATTE: Improving Latex Recognition for Tables and Formulae with Iterative Refinement by Nan Jiang, Shanchao Liang, Chengxiao Wang, Jiannan Wang, and Lin Tan

Python
2
3 个月前

Replication package for the paper: "How Much Do Code Language Models Remember? An Investigation on Data Extraction Attacks before and after Fine-tuning"

Jupyter Notebook
0
3 个月前