Personal Assistant
Home Settings
Daily Digest Newsletters Papers Ruby Posts AI Posts Ruby: Blogs and News AI: Blogs and News Gem Updates Gem Discoveries Digest Tweets
Twitter Lists Bluesky Lists RSS Lists Tracked Gems
Sign in Explore
M

MindfulReturn 身心修复局

MindfulReturn

@ruby_runner retweeted 我艹,这个牛逼啊,把qwen直接烧录进芯片,本地每秒处理10000个token.“中型”LLM燃烧器即将推出! 🔥 这可能使本地 HyperToken 生成成为现实。 📷 NVIDIA 最担心的事发生了。📷应用专用硬件 Taala 的新型 PCIe ASIC 板会将整个中型 Qwen 3.5-27B LLM 直接烧录到硅片中📷 Taalos表示,到2026年春季,他们的实验室将提供基于ASIC的中型模型。 不再增加配重 📷本地每秒处理约 10,000 个令牌(Llama 3.1 8B 已经达到每秒 17,000 个令牌) 📷标准PC插槽,超低功耗(10x更少) 📷 100%离线,无云,无GPU集群 📷 Reddit 传闻单价 300 至 400 美元📷思维速度堪比光速的人工智能体。 📷你准备好了吗? David Hendrickson @TeksEdge 🎗️ "Medium-Sized" LLM Burners Coming Soon! 🔥This Could Make Local HyperToken Generation a Reality. ⚡️ NVIDIA’s worst nightmare? 😱⚙️ Application-Specific HardwareTaalas new PCIe ASIC board would burn the entire medium-sized Qwen 3.5-27B LLM straight into silicon 🤯 (already doing it with small models)Taalos said medium models on ASIC would be available in their lab by Spring '26.💭Imagine:🚫 No more loading weights🚀 ~10,000 Tokens Per Second locally (Llama 3.1 8B already @ 17,000 tps)💻 Standard PC slot, ultra-low power (10x less) 🔋🌍 100% offline with no cloud, no GPU farm💰 Reddit unit cost rumor $300 to $400🖥️ Imagine HyperToken generation on your desktop.🤖 AI agents that think at light speed. ⚡️ Are you ready? 👀 Posted Mar 27, 2026 at 5:07AM Posted Mar 27, 2026 at 12:14PM

8:30 PM · Mar 29, 2026