Repository navigation

mathematical-reasoning

Website
Wikipedia

ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].

autonomous-agents language-model 大语言模型 mathematical-reasoning tool-learning

Python

1094

2 年前

lupantech / dl4math

Resources of deep learning for mathematical reasoning (DL4MATH).

深度学习机器学习 mathematical-reasoning natural-language-procressing papers

366

2 年前

CSfufu / Revisual-R1

🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement learning, and text-only reinforcement learning—to achieve faithful, concise, and self-reflective state-of-the-art performance in visual and textual reasoning.

mathematical-reasoning reinforcement-learning

Python

182

3 个月前

HKUNLP / diffusion-of-thoughts

[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"

diffusion-models 机器学习 mathematical-reasoning 自然语言处理 non-autoregressive PyTorch text-generation

Python

180

7 个月前

akjindal53244 / Arithmo

Small and Efficient Mathematical Reasoning LLMs

large-language-models 大语言模型 mathematical-reasoning mistral-7b

Python

2 年前

OSU-NLP-Group / llm-planning-eval

[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"

large-language-models mathematical-reasoning planning text-to-sql tree-search

Python

2 年前

mukhal / GRACE

[EMNLP '23] Discriminator-Guided Chain-of-Thought Reasoning

chain-of-thought decoding language-model reasoning text-generation 大语言模型 mathematical-reasoning

Python

1 年前

QwenLM / PolyMath

[NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"

large-language-models mathematical-reasoning multilingual qwen3

Python

4 个月前

Alsace08 / OOD-Math-Reasoning

[NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"

mathematical-reasoning out-of-distribution-detection

Python

1 年前

conceptmath / conceptmath

[ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models".

benchmark 大语言模型 mathematical-reasoning

Python

1 年前

adeelahmad / mlx-grpo

🧠 Train your own DeepSeek-R1 style reasoning model on Mac! First MLX implementation of GRPO - the breakthrough technique behind R1's o1-matching performance. Build mathematical reasoning AI without expensive RLHF. Apple Silicon optimized. 🚀 A batteries‑included training & inference framework for **MLX**‑based language models on Apple Silicon.

人工智能 grpo 大语言模型 MLX thinking apple-silicon chain-of-thought llama mathematical-reasoning rlhf deepseek-r1

Python

11 天前

alexanderknop / I2DM

The lecture notes for my discrete mathematics classes.

lecture-notes mathematical-reasoning graph-theory game-theory

TeX

2 年前

RamonKaspar / MathPrompter

MathPrompter Implementation: This repository hosts an implementation based on the 'MathPrompter: Mathematical Reasoning Using Large Language Models' paper by Microsoft Research. The code replicates the methods discussed in the paper.

large-language-models mathematical-reasoning

Python

6 个月前

sparkle-reasoning / sparkle

Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning

large-language-models mathematical-reasoning reinforcement-learning scaling grpo 机器学习 qwen rlhf

Python

3 个月前

JunyiYe / CreativeMath

[AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems

creativity large-language-models mathematical-reasoning benchmarking

Jupyter Notebook

5 个月前

Nativeatom / FRoG

Fuzzy reasoning of Generalized Quantifiers (EMNLP 2024)

自然语言处理 mathematical-reasoning reasoning

Python

9 个月前

SuperBruceJia / GSM8K-Consistency

GSM8K-Consistency is a benchmark database for analyzing the consistency of Arithmetic Reasoning on GSM8K.

mathematical-reasoning foundation-models large-language-models reasoning prompt prompt-engineering prompt-toolkit

2 年前

RamonKaspar / MathDataset-ElementarySchool

This dataset aggregates carefully selected elementary-level math problems from various existing resources, providing an optimal mix for testing and enhancing math-solving chatbots for young learners.

dataset 大语言模型 mathematical-reasoning

Python

3 个月前

ahmedmhussein111 / mlx-grpo

MLX-GRPO allows you to train your own DeepSeek-R1 models directly on your Mac. This implementation simplifies the process of building advanced reasoning AI, making it accessible for developers. 🐙🌟

人工智能 apple-silicon chain-of-thought deepseek-r1 grpo llama 大语言模型 mathematical-reasoning MLX rlhf thinking

Python

3 个月前

RamonKaspar / Math-Capabilities-LLM

We implement and benchmark various prompting techniques for LLMs (i.e. PAL, CoT, PoT, etc.) on a specialized math reasoning dataset (on elementary school grade).

chain-of-thought 大语言模型 mathematical-reasoning sympy

Python

1 年前