Repository navigation

#

大语言模型

large-language-model logo

大型语言模型(large language model,简称LLM)是一种专为理解、生成和处理人类语言而设计的深度学习模型。这些模型通过在海量文本数据(如书籍、文章、网页等)上进行训练,学习语言的模式、语境和语义。大型语言模型广泛应用于聊天机器人、代码生成、翻译、文本摘要等领域,是生成式人工智能的核心技术,通常基于Transformer架构构建。

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python
27040
5 个月前

KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge bases. It can effectively overcome the shortcomings of the traditional RAG vector similarity calculation model.

Python
7954
13 天前

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python
7063
2 个月前

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python
6125
6 天前

Enchanted is iOS and macOS app for chatting with private self hosted language models such as Llama2, Mistral or Vicuna using Ollama.

Swift
5656
7 个月前

⚙️🦀 Build modular and scalable LLM Applications in Rust

Rust
4602
12 小时前

[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

Jupyter Notebook
3369
9 个月前

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python
3284
9 天前

Implementation for MatMul-free LM.

Python
3032
2 个月前

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python
2428
8 个月前

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Python
2165
1 年前

Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates. - Professor Yu Liu

Jupyter Notebook
1647
3 个月前

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

1539
12 天前