Repository navigation

#

mathematical-reasoning

ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].

Python
1094
2 年前

Resources of deep learning for mathematical reasoning (DL4MATH).

366
2 年前

🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement learning, and text-only reinforcement learning—to achieve faithful, concise, and self-reflective state-of-the-art performance in visual and textual reasoning.

Python
182
3 个月前

[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"

Python
180
7 个月前

Small and Efficient Mathematical Reasoning LLMs

Python
72
2 年前

[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"

Python
54
2 年前
Python
49
1 年前

[NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"

Python
30
4 个月前

[NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"

Python
27
1 年前

[ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models".

Python
24
1 年前

🧠 Train your own DeepSeek-R1 style reasoning model on Mac! First MLX implementation of GRPO - the breakthrough technique behind R1's o1-matching performance. Build mathematical reasoning AI without expensive RLHF. Apple Silicon optimized. 🚀 A batteries‑included training & inference framework for **MLX**‑based language models on Apple Silicon.

Python
24
11 天前

The lecture notes for my discrete mathematics classes.

TeX
18
2 年前

MathPrompter Implementation: This repository hosts an implementation based on the 'MathPrompter: Mathematical Reasoning Using Large Language Models' paper by Microsoft Research. The code replicates the methods discussed in the paper.

Python
14
6 个月前

Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning

Python
13
3 个月前

[AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems

Jupyter Notebook
12
5 个月前

Fuzzy reasoning of Generalized Quantifiers (EMNLP 2024)

Python
8
9 个月前

GSM8K-Consistency is a benchmark database for analyzing the consistency of Arithmetic Reasoning on GSM8K.

2
2 年前

This dataset aggregates carefully selected elementary-level math problems from various existing resources, providing an optimal mix for testing and enhancing math-solving chatbots for young learners.

Python
2
3 个月前

MLX-GRPO allows you to train your own DeepSeek-R1 models directly on your Mac. This implementation simplifies the process of building advanced reasoning AI, making it accessible for developers. 🐙🌟

Python
1
3 个月前

We implement and benchmark various prompting techniques for LLMs (i.e. PAL, CoT, PoT, etc.) on a specialized math reasoning dataset (on elementary school grade).

Python
1
1 年前