ai4code

ise-uiuc / magicoder

[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct

ai4code large-language-models llm llm4code

Python

2024

166

10 months ago

salesforce / CodeTF

CodeTF: One-stop Transformer Library for State-of-the-art Code LLM

ai4code ai4se code-generation code-intelligence code-understanding transformers code-learning-datasets code-representation-learning human-eval Tree-sitter

Python

1480

4 months ago

replit / ReplitLM

Inference code and configs for the ReplitLM model family

artificial-intelligence ai4code llm

Python

991

108

2 years ago

saltudelft / ml4se

A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering

machine-learning software-engineering Code papers datasets tools deep-learning research ai4code ai4se llm4code

715

1 year ago

microsoft / multilspy

multilspy is an LSP client library in Python intended to be used to build applications around language servers.

artificial-intelligence ai4code code-analysis code-completion code-generation codegen huggingface-transformers language-server-protocol llm large-language-models lsp lsp-client neurips program-synthesis transformer

Python

407

15 days ago

microsoft / monitors4codegen

Code and data artifact for the NeurIPS 2023 paper "Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context". `multilspy` is an LSP client library in Python intended to be used to build applications around language servers.

artificial-intelligence ai4code codegen dataset huggingface-transformers language-server-protocol code-completion code-generation large-language-models program-synthesis neurips transformer code-analysis lsp lsp-client llm

Python

270

1 year ago

FSoft-AI4Code / CodeCapybara

Open-source Self-Instruction Tuning Code LLM

ai4code alpaca instruction-tuning llama

Python

169

2 years ago

deep-symbolic-mathematics / LLM-SR

[ICLR 2025 Oral] This is the official repo for the paper "LLM-SR" on Scientific Equation Discovery and Symbolic Regression with Large Language Models

large-language-models llm-agent program-synthesis ai4code ai4science

Python

157

19 days ago

FSoft-AI4Code / TheVault

[EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation

ai4code dataset

Jupyter Notebook

1 year ago

GhabiX / SRepair

✅SRepair: Powerful LLM-based Program Repairer with $0.029/Fixed Bug

llm ai4code large-language-models llm4code

Python

1 year ago

deep-symbolic-mathematics / llm-srbench

[ICML2025 Oral] LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models

ai4code ai4science large-language-models llm-agent

Python

19 days ago

JY0284 / code_completion_as_human_action_prediction

This repository contains the core methods and models described in the paper “Represent Code as Action Sequence for Predicting Next Method Call.” It uses action sequence modeling to predict method calls in Python code based on developer intentions, treating code editing as a sequence of human-like actions.

ai4code llm machine-learning

Python

1 year ago

ALFA-group / adversarial-code-generation

[ICLR 2021] "Generating Adversarial Computer Programs using Optimized Obfuscations" by Shashank Srikant, Sijia Liu, Tamara Mitrovska, Shiyu Chang, Quanfu Fan, Gaoyuan Zhang, and Una-May O'Reilly

combinatorial-optimization differentiable-programming big-code program-analysis adversarial-machine-learning seq2seq ai4code

Python

4 years ago

FSoft-AI4Code / CodeFlow

[FORGE 2025] Predicting Program Behavior with Dynamic Dependencies Learning

ai4code cfg gnn

Python

1 year ago

wyt2000 / InverseCoder

[AAAI 2025] The official code of the paper "InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct" (https://arxiv.org/abs/2407.05700).

llm4code synthetic-data ai4code large-language-models llm finetuning

Python

1 year ago

FSoft-AI4Code / VisualCoder

[NAACL 2025] Guiding Large Language Models in Code Execution with Fine-grained Multimodal Chain-of-Thought Reasoning

ai4code cfg vlms

Jupyter Notebook

6 months ago

HWH-2000 / DynaCode

[ACL'2025 Findings] DynaCode: A Dynamic Complexity-Aware Code Benchmark for Evaluating Large Language Models in Code Generation

ai4code benchmark code-generation llm

Python

1 month ago

Alex-Mathai-98 / Monolith-to-Microservices

This paper explores the idea of using heterogeneous graph neural networks (Het-GNN) to partition old legacy monoliths into candidate microservices. We additionally incorporate membership constraints that come from a subject matter expert who has deep domain knowledge of the application.

ai4code

Python

3 years ago