Repository navigation

#

openai-o1

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python
8060
12 天前

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6835
1 个月前

The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".

27
1 个月前

Awesome LLM papers, news and projects about learning to reason with LLM, OpenAI o1, reasonning techniques, chain-of-thought (COT), Large Language Model, Straberry

26
1 年前

A demo of OpenAI o1 and Next.js, used to automate the import of user-provided contacts file.

TypeScript
24
9 个月前

A demo of OpenAI o1 and Next.js, used to automate the import of user-provided contacts file.

TypeScript
24
9 个月前

Explore DeepSeek R1🚀: reproduction guides, papers, insightful tweets&blogs to explore and learn. 🌟

4
8 个月前

A Python wrapper that enables large language models (LLMs) to simulate the step-by-step thinking process of OpenAI’s o1 model, providing users with detailed reasoning and comprehensive answers.

Python
3
1 年前

Explore DeepSeek R1🚀: reproduction guides, papers, insightful tweets&blogs to explore and learn. 🌟

3
7 个月前

OpenAI-powered tool for bulk processing of Google Ads search queries to identify and filter irrelevant keywords

Python
1
8 个月前

openai o1 is A new series of reasoning models for solving hard problems. Available starting

0
1 年前