CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication Through RL
(news.ycombinator.com)