Hacker News: Front Page 2026-06-16 16:12

GateGPT: 56k tokens per second Transformer (KV cache) on FPGA at 80 MHz

Article URL: https://twitter.com/fguzmanai/status/2065832668172845209 Comments URL: https://news.ycombinator.com/item?id=48557535 Points: 28 # Comments: 9

Add Comment

Comments

No comments yet.

Recommended Similar Articles

Real-time LLM Inference on Standard GPUs: 3k tokens/s per request

Hacker News: Front Page • similarity 0.506

MiMo-v2.5-Pro-UltraSpeed: 1T model with 1000 tokens per second

Hacker News: Front Page • similarity 0.480

Can I Buy Your KV Cache?

Hacker News: Front Page • similarity 0.428

On CPU Physics and CPU Cycles

Hacker News: Front Page • similarity 0.384

Raspberry Pi 5 – 16GB RAM

Hacker News: Front Page • similarity 0.377