#qwq

🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability

Python
1216
20 days ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Python
1013
3 months ago

An OpenAI-compatible LLM service with higher performance than vLLM serve: a pure C++ service implemented with GPRS+TensorRT-LLM+Tokenizers.cpp, supporting chat and function calling, AI agents, distributed multi-GPU inference, multimodal capabilities, and a Gradio chat interface.

Python
147
3 months ago

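Ollama load-balancing server | A high-performance, easy-to-configure open-source load balancer that optimizes Ollama workloads. It helps improve your application's availability and response time while making efficient use of system resources.

128
5 months ago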

Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache

Python
120
6 days ago

Deploy open-source LLMs on AWS in minutes — with OpenAI-compatible APIs and a powerful CLI/SDK toolkit.

Python
71
2 days ago

To Think or Not to Think: Exploring the Unthinking Vulnerability in Large Reasoning Models

Python
31
3 months ago

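Encrypts multiple files with multiple keys into one large file. Supplying a correct key extracts the corresponding file. Intended for hiding files under coercion.

C++
13
3 years ago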

A Golang SDK for calling models such as DeepSeek, Qwen3, ChatGPT, and Ollama.

Go
9
4 months ago

A modern, silky-smooth UI framework built on RsPack.

TypeScript
7
1 day ago

Splits a large file into volumes and disguises them as small images.

Python
1
4 years ago

I created a better operating system, except it doesn't work...

HTML
0
6 months ago

Qwen3 is the large language model series developed by the Qwen team, Alibaba Cloud.

Shell
0
4 months ago

Breaking long thought processes of o1-like LLMs, such as DeepSeek-R1, QwQ

Python
0
5 months ago

Can only compress ASCII data...

C++
0
4 years ago

Simple obfuscation that can help keep files on cloud drives from being blocked.

C++
0
3 years ago

Generates n keyfiles from one key/file; any m of them are enough to recover the key or decrypt the original file.

Python
0
4 years ago