LightLLM-Kernel

LightLLM-Kernel is a high-performance CUDA kernel library powering the LightLLM inference system. It provides optimized GPU implementations for critical operations in large language model (LLM) inference, delivering significant performance improvements through carefully crafted CUDA kernels.

Project Overview

LightLLM-Kernel serves as the computational backbone for LightLLM framework, offering:

Custom CUDA Kernels: Highly optimized implementations for transformer-based model operations
Memory Efficiency: Reduced memory footprint through advanced quantization techniques
Scalability: Support for large model architectures including MoE (Mixture-of-Experts) models

Key Features

Core Modules

Module	Description
Attention	Optimized Multi-Head Attention kernels with fused QKV operations and efficient softmax
MoE	Expert routing and computation kernels for Mixture-of-Experts architectures
Quant	Low-precision quantization support (INT8/INT4) for weights and activations
Extensions	Continuous expansion of optimized operations for emerging model architectures

Installation

System Requirements

NVIDIA GPU with Compute Capability ≥ 7.0 (Volta+)
CUDA 11.8 or higher
Python 3.8+

Installation Methods

Static Compilation (Recommended)

pip install .

Build only a wheel package

python -m build --wheel

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
benchmark		benchmark
csrc		csrc
include		include
lightllm_kernel		lightllm_kernel
test		test
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
Makefile		Makefile
README-CH.md		README-CH.md
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

LightLLM-Kernel

Project Overview

Key Features

Core Modules

Installation

System Requirements

Installation Methods

Static Compilation (Recommended)

Build only a wheel package

About

Uh oh!

Releases

Packages

Languages

License

ModelTC/light_ops

Folders and files

Latest commit

History

Repository files navigation

LightLLM-Kernel

Project Overview

Key Features

Core Modules

Installation

System Requirements

Installation Methods

Static Compilation (Recommended)

Build only a wheel package

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages