DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers

Published in

TBD: describe paper