CUDA-Attention CUDA Implementation of Naive Attention & Flash Attention Environment conda create -n attn python=3.8 conda activate attn pip install torch --index-url https://download.pytorch.org/whl/cu121 How To Run cd src python bench.py