frozenleaves/kernels

mirror of https://github.com/huggingface/kernels.git synced 2025-10-20 20:56:31 +08:00

Go to file

Nicolas Patry b6ae897c4d Fix all occurrences.

2025-01-20 12:55:22 +01:00

.github/workflows

Reduce the test surface.

2025-01-20 11:55:59 +01:00

Fix all occurrences.

2025-01-20 12:55:22 +01:00

fix: adjust example and docker for name

2025-01-15 23:09:28 +00:00

Rename tool.kernels to tool.hf-kernels

2025-01-20 12:54:27 +01:00

Fix all occurrences.

2025-01-20 12:55:22 +01:00

.gitignore

feat: proof of concept

2024-11-29 17:43:30 +01:00

pyproject.toml

Some fixup.

2025-01-14 16:30:25 +01:00

README.md

Fix all occurrences.

2025-01-20 12:55:22 +01:00

README.md

hf-kernels

Make sure you have torch==2.5.1+cu124 installed.

import torch

from hf_kernels import get_kernel

# Download optimized kernels from the Hugging Face hub
activation = get_kernel("kernels-community/activation")

# Random tensor
x = torch.randn((10, 10), dtype=torch.float16, device="cuda")

# Run the kernel
y = torch.empty_like(x)
activation.gelu_fast(y, x)

print(y)

Docker Reference

build and run the reference example/basic.py in a Docker container with the following commands:

docker build --platform linux/amd64 -t kernels-reference -f docker/Dockerfile.reference .
docker run --gpus all -it --rm -e HF_TOKEN=$HF_TOKEN kernels-reference