✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
Updated Apr 25, 2025
TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools
Project page for "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"
Latest Advances on Long Chain-of-Thought Reasoning
Deep Reasoning Translation via Reinforcement Learning (arXiv preprint 2025); DRT: Deep Reasoning Translation via Long Chain-of-Thought (arXiv preprint 2024)
ToolUniverse is a collection of biomedical tools designed for AI agents
a-m-team's exploration in large language modeling
OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement
A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.
Official Implementation of "Reasoning Language Models: A Blueprint"
This repo develops reasoning models for the financial domain, aiming to enhance model capabilities on financial reasoning tasks.
Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".
Pure RL to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with Social IQa dataset.
Pivotal Token Search
To Think or Not to Think: Exploring the Unthinking Vulnerability in Large Reasoning Models
The official implementation of "VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning"
[arXiv:2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents
☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models
Official code for "Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning", ICLR 2025.
An effective weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study uncovering how reasoning length is encoded in the model’s representation space.