Fisher-Orthogonal Projection Methods for Gradient Descent with Large Batches

(arxiv.org)

2 points | by sorenjan 7 hours ago

No comments yet.