Gated Delta Networks: Improving Mamba2 with Delta Rule Mamba2: St=αtSt−1+vtkt⊤, where αt∈(0,1). Gated Delta Rule: St=St−1(αt(I−βtktkt⊤))+βtvtkt⊤