Mamba2 Scan OptimizationopenOptimize a CUDA/Triton implementation of the Mamba2 sequential scan recurrence.cudatritonscanmamba2-scan-frontier-cs-mamba2-scan