Cross Entropy Kernel OptimizationopenOptimize a Triton cross-entropy loss kernel against PyTorch GPU baselines.cudatritoncross-entropyfrontier-cscross-entropy-kernel-frontier-cs-cross-entropy