FlashQLA: High-Performance Linear Attention Kernels on TileLang

GitHub · @Alibaba_Qwen · Apr 29, 2026

🚀 Introducing FlashQLA: high-performance linear attention kernels built on TileLang. ⚡ 2–3× forward speedup. 2× backward speedup. 💻 Purpose-built for agentic AI on your personal devices. 💡 Key insights: 1. Gate-driven automatic intra-card CP. 2. Hardware-friendly algebraic https://t.co/4Vhyyw5RuB

Views: 75.0k

Comments: 38

Reposts: 107

Likes: 936

Launched Apr 29, 2026
