I wrote a custom CUDA inference engine to run Qwen3.5-27B on $130 mining cards

(news.ycombinator.com)

3 points | by Haru-neo 10 hours ago ago

1 comments