Pull requests: fla-org/flash-linear-attention

Pull requests list

[Cache] FLA autotune cache
#798 opened Mar 29, 2026 by sBobHuang

[GDN] Tricked kernels: ungated KKT + fused inference via similarity transform
#797 opened Mar 28, 2026 by hypnopump (Contributor)

[Layernorm] Fix autotuner crash and OOB writes in layer_norm_bwd on high-SM GPUs
#796 opened Mar 28, 2026 by mpurland (Contributor)

Add LinOSS model (ICLR 2025 oral)
#749 opened Feb 17, 2026 by Phoenix8215

[Cache] Cache Triton Autotune
#705 opened Dec 30, 2025 by zhiyuan1i (Collaborator), draft

Add fused short convolution kernel with L2 norm
#661 opened Nov 24, 2025 by sustcsonglin (Collaborator)

[kda] add recursive block intra implementation
#656 opened Nov 22, 2025 by sustcsonglin (Collaborator)

Update README.md of ops delta_rule
#595 opened Sep 17, 2025 by SeepingFragranceLock (Contributor)

Cached inference for NSA
#574 opened Aug 22, 2025 by mutiann (Contributor)

Modify output shape in nsa for decoding
#565 opened Aug 14, 2025 by Espere-1119-Song

Updated the Technical Note for WY of DPLR
#562 opened Aug 12, 2025 by phnazari

Delta Product Rule Backwards Kernel
#526 opened Jul 14, 2025 by phi-jkim