fp8

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.

CUDA deep-learning gpu machine-learning Python PyTorch fp8 jax

Python

2660

487

3 hours ago

Azure / MS-AMP

Microsoft Automatic Mixed Precision Library

amp deep-learning fp8 gpu PyTorch transformer

Python

616

1 year ago

intel / neural-speed

An innovative library for efficient LLM inference via low-bit quantization

cpu fp8 gpu int8 llm-inference sparsity llamacpp

C++

349

1 year ago

aredden / flux-fp8-api

Flux diffusion model implementation that uses quantized fp8 matmul, with the remaining layers using faster half-precision accumulation, which is ~2x faster on consumer devices.

diffusion flux fp8 PyTorch quantization

Python

274

10 months ago

graphcore-research / jax-scalify

JAX Scalify: end-to-end scaled arithmetics

fp8 large-language-models jax low-precision

Python

10 months ago

zerfoo / zerfoo

A modular, accelerator-ready machine learning framework built in Go that speaks float8/16/32/64. Designed with clean architecture, strong typing, and native concurrency for scalable, production-ready AI systems. Ideal for engineers who value simplicity, speed, and maintainability.

autodiff deep-learning distributed-training float16 float8 fp8 Go machine-learning neural-network onnx transformer

16 days ago

zsxkib / cog-step-video-t2v

Cog Single GPU Quantized Implementation of Step-Video-T2V

fp8 replicate

Python

6 months ago

MurrellGroup / Microfloats.jl

Slow, low-precision floating point types

floating-point fp8

Julia

2 days ago

mukullokhande99 / XR-NPE

Python implementations for multi-precision quantization in computer vision and sensor fusion workloads, targeting the XR-NPE Mixed-Precision SIMD Neural Processing Engine. The code includes visual inertial odometry (VIO), object classification, and eye gaze extraction code in FP4, FP8, Posit4, Posit8, and BF16 formats.

fp8 object-detection posit quantization visual-inertial-odometry

Jupyter Notebook

3 days ago

umangyadav / py_fp8

FP8 dtypes enumeration in python

fp8

C++

2 years ago