"topic:mathematical-reasoning" — Search

32 results for “topic:mathematical-reasoning”

ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].

Python · 1.1k · 80 · Updated 2 years ago

autonomous-agents, language-model, llm, mathematical-reasoning, tool-learning

lupantech/dl4math

Resources of deep learning for mathematical reasoning (DL4MATH).

370 · 28 · Updated 2 years ago

deep-learning, machine-learning, mathematical-reasoning, natural-language-procressing, papers

HKUNLP/diffusion-of-thoughts

[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"

Python · 206 · 15 · Updated 1 year ago

chain-of-thought-reasoning, diffusion-lm, diffusion-models, machine-learning, mathematical-reasoning, natural-language-processing, non-autoregressive, pytorch, text-generation

CSfufu/Revisual-R1

[ICLR 2026] 🚀 ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum (cold-start pre-training, multimodal reinforcement learning, and text-only reinforcement learning) to achieve faithful, concise, and self-reflective state-of-the-art performance in visual and textual reasoning.

Python · 202 · 3 · Updated 3 months ago

cold-start-initialization, data-efficiency, efficient-length-reward, mathematical-reasoning, multimodal-large-language-model, open-source-7b-model, prioritized-advantage-distillation, reinforcement-learning, self-reflective-chain-of-thought, visual-reasoning

AMAP-ML/MathForge

[ICLR 2026] Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation

Python · 122 · 2 · Updated 1 month ago

data-augmentation, grpo, mathematical-reasoning

akjindal53244/Arithmo

Small and Efficient Mathematical Reasoning LLMs

Python · 73 · 6 · Updated 2 years ago

gsm8k, large-language-models, llm, mathematical-reasoning, mistral-7b

OSU-NLP-Group/llm-planning-eval

[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"

Python · 54 · 3 · Updated 2 years ago

language-agent, large-language-models, mathematical-reasoning, planning, self-correction, text-to-sql, tree-search

mukhal/GRACE

[EMNLP '23] Discriminator-Guided Chain-of-Thought Reasoning

Python · 50 · 2 · Updated 1 year ago

chain-of-thought, decoding, language-model, llm, mathematical-reasoning, multi-step-reasoning, reasoning, symbolic-reasoning, text-generation

QwenLM/PolyMath

[NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"

Python · 43 · 8 · Updated 10 months ago

large-language-models, mathematical-reasoning, multilingual, qwen3

Alsace08/OOD-Math-Reasoning

[NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"

Python · 28 · 3 · Updated 1 year ago

generative-language-models, mathematical-reasoning, neurips-2024, out-of-distribution-detection

conceptmath/conceptmath

[ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models".

Python · 24 · 0 · Updated 1 year ago

benchmark, finegrained, llm, mathematical-reasoning

alexanderknop/I2DM

The lecture notes for my discrete mathematics classes.

TeX · 19 · 6 · Updated 2 years ago

combinatorial-game-theory, combinatorics, computability-theory, game-theory, graph-theory, lecture-notes, mathematical-logic, mathematical-reasoning, probability-theory, set-theory

sparkle-reasoning/sparkle

[NeurIPS'25] Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning

Python · 16 · 0 · Updated 3 months ago

data-efficient, grpo, interpretability, large-language-models, machine-learning, mathematical-reasoning, qwen, reasoning-language-models, reinforcement-learning, rlhf, scaling

JunyiYe/CreativeMath

[AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems

Jupyter Notebook · 13 · 4 · Updated 10 months ago

benchmarking, creativity, large-language-models, mathematical-reasoning

RamonKaspar/MathPrompter

MathPrompter Implementation: This repository hosts an implementation based on the 'MathPrompter: Mathematical Reasoning Using Large Language Models' paper by Microsoft Research. The code replicates the methods discussed in the paper.

Python · 13 · 4 · Updated 11 months ago

arithmetic-reasoning, large-language-models, mathematical-reasoning, mathprompter

Nativeatom/FRoG

Fuzzy reasoning of Generalized Quantifiers (EMNLP 2024)

Python · 8 · 0 · Updated 1 year ago

fuzzy-reasoning, generalized-quantifiers, mathematical-reasoning, natural-language-processing, reasoning

wzy6642/StepCo

Official implementation for "Enhancing Mathematical Reasoning in LLMs by Stepwise Correction" (ACL 2025)

Python · 8 · 0 · Updated 4 months ago

large-language-model, mathematical-reasoning, prompt-engineering, stepwise-refinement, supervised-finetuning

xingbpshen/agentic-reasoning-vpgm

[AAAI 2026] The official implementation of the paper "BayesAgent: Bayesian Agentic Reasoning Under Uncertainty via Verbalized Probabilistic Graphical Modeling".

Python · 4 · 1 · Updated 3 months ago

agentic-ai, artificial-intelligence, large-language-models, mathematical-reasoning, probabilistic-graphical-models, visual-question-answering

RamonKaspar/MathDataset-ElementarySchool

This dataset aggregates carefully selected elementary-level math problems from various existing resources, providing an optimal mix for testing and enhancing math-solving chatbots for young learners.

Python · 3 · 4 · Updated 8 months ago

dataset, elementary-school-math, k-12-math, llm, mathematical-reasoning, primary-school

sean-lamont/odd

Codebase for Orthogonal Diverse Diffusion. We present a lightweight, training-free method for improving sampling diversity and Pass@k in Diffusion Language Models.

Python · 3 · 0 · Updated 2 weeks ago

code-generation, diffusion-language-models, mathematical-reasoning, pass-at-k, problem-solving, reasoning-language-models, sampling

SuperBruceJia/GSM8K-Consistency

GSM8K-Consistency is a benchmark database for analyzing the consistency of Arithmetic Reasoning on GSM8K.

2 · 0 · Updated 2 years ago

arithmetic-consistency, arithmetic-reasoning, factual-consistency, foundation-models, grade, grade-school-math, gsm8k, large-language-models, logical-consistency, mathematical-reasoning, prompt, prompt-engineering, prompt-perturbation, prompt-toolkit, reasoning, self-consistency, self-consistency-benchmark, semantics-consistency, semantics-preserving-transformations, semantics-similar

LiXin97/WirelessMathLM

WirelessMathLM: Teaching Mathematical Reasoning for LLMs in Wireless Communications with Reinforcement Learning - Official repository for the WirelessMathLM paper

HTML · 2 · 1 · Updated 5 months ago

datasets, large-language-models, machine-learning, mathematical-reasoning, mathmatics, reinforcement-learning, wireless, wireless-communication

Dichotoom/Bachelor-Project

Bachelor thesis codebase: GRPO training for improving mathematical reasoning in small language models using reinforcement learning

Python · 1 · 0 · Updated 9 months ago

group-relative-policy-optimization, language-models, mathematical-reasoning, reinforcement-learning

zoravur/llm-math-reasoning

Synthetic data for LLM math reasoning

Jupyter Notebook · 1 · 0 · Updated 1 year ago

llm, machine-learning, mathematical-reasoning, synthetic-data, transformers

msmrexe/llm-math-reasoning-analysis

An evaluation of prompting techniques (Zero-Shot CoT, Few-Shot, Self-Consistency) on the Mistral-7B model for mathematical reasoning. This project systematically benchmarks 7 distinct methods on the GSM8K dataset.

Python · 1 · 0 · Updated 4 months ago

chain-of-thought, chain-of-thought-reasoning, course-project, deep-learning, few-shot, gsm8k, huggingface-transformers, large-language-models, llm, llm-evaluation, majority-voting, mathematical-reasoning, mistral-7b, prompt-engineering, question-decomposition, reasoning, reasoning-language-models, self-consistency, university-project, zero-shot

guoxinyue1112/MathLM

MathLM: An end-to-end pipeline for training small language models (SLMs) on arithmetic logic. Includes randomized GPT-2 base initialization, SFT with assistant-only loss, and optimized inference for MPS and CUDA.

Python · 1 · 0 · Updated 1 month ago

apple-silicon, gpt2, llm, mathematical-reasoning, pytorch, sft

Starscream-11813/Variational-Mathematical-Reasoning

This repository contains the code, data, and models of the paper titled "Math Word Problem Solving by Generating Linguistic Variants of Problem Statements" published in the Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop).

Jupyter Notebook · 1 · 1 · Updated 1 year ago

acl2023, deberta, gpt-3, gpt-35-turbo, linguistic-variants, math-word-problem, math-word-problem-solving, mathematical-reasoning, mawps, mwp, paramawps, svamp

RamonKaspar/Math-Capabilities-LLM

We implement and benchmark various prompting techniques for LLMs (e.g., PAL, CoT, PoT) on a specialized math reasoning dataset at the elementary-school level.

Python · 1 · 0 · Updated 1 year ago

chain-of-thought, llm, mathematical-reasoning, program-aided-language-model, role-play-prompting, sympy

protagolabs/MathematicalReasoning

No description provided.

Python · 0 · 0 · Updated 4 years ago

deep-neural-network, mathematical-reasoning, transformer

tomoeOOseven/gptoss120b-qlora-mathreasoning

KrackHack 3.0 submission — Domain: Gen AI | PS: Open Innovation — GPT-OSS-120B QLoRA finetuning using Unsloth for mathematical reasoning

Jupyter Notebook · 0 · 0 · Updated 1 month ago

fine-tuning, llm, mathematical-reasoning, open-source-ai, qlora, unsloth
