v0.1.3
Latest · February 27, 2026
72B Model Benchmark Support
Verified 72B model benchmarks showing a 79× faster cold start than bitsandbytes.
Verified Qwen 72B benchmark: 6.5 s cold start (79× faster than bitsandbytes)
llama.cpp GGUF comparison: ZSE 1.6× faster on 72B models
Updated website with H200 GPU benchmark results
New benchmark scripts for 70B+ models