Computing Sharding with Einsum
(news.ycombinator.com)
A Trick for Backpropagation of Linear Transformations
(news.ycombinator.com)