MindfulReturn 身心修复局

MindfulReturn

@ruby_runner retweeted 我艹，这个牛逼啊，把qwen直接烧录进芯片，本地每秒处理10000个token.“中型”LLM燃烧器即将推出！ 🔥 这可能使本地 HyperToken 生成成为现实。 📷 NVIDIA 最担心的事发生了。📷应用专用硬件 Taala 的新型 PCIe ASIC 板会将整个中型 Qwen 3.5-27B LLM 直接烧录到硅片中📷 Taalos表示，到2026年春季，他们的实验室将提供基于ASIC的中型模型。不再增加配重 📷本地每秒处理约 10,000 个令牌（Llama 3.1 8B 已经达到每秒 17,000 个令牌） 📷标准PC插槽，超低功耗(10x更少） 📷 100%离线，无云，无GPU集群 📷 Reddit 传闻单价 300 至 400 美元📷思维速度堪比光速的人工智能体。 📷你准备好了吗？ David Hendrickson @TeksEdge 🎗️ "Medium-Sized" LLM Burners Coming Soon! 🔥This Could Make Local HyperToken Generation a Reality. ⚡️ NVIDIA’s worst nightmare? 😱⚙️ Application-Specific HardwareTaalas new PCIe ASIC board would burn the entire medium-sized Qwen 3.5-27B LLM straight into silicon 🤯 (already doing it with small models)Taalos said medium models on ASIC would be available in their lab by Spring '26.💭Imagine:🚫 No more loading weights🚀 ~10,000 Tokens Per Second locally (Llama 3.1 8B already @ 17,000 tps)💻 Standard PC slot, ultra-low power (10x less) 🔋🌍 100% offline with no cloud, no GPU farm💰 Reddit unit cost rumor $300 to $400🖥️ Imagine HyperToken generation on your desktop.🤖 AI agents that think at light speed. ⚡️ Are you ready? 👀 Posted Mar 27, 2026 at 5:07AM Posted Mar 27, 2026 at 12:14PM