Ben Chuanlong Du's Blog

It is never too late to learn.

Reduce Memory Needed to Train Deep Learning Models

checkmate

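Checkmate breaks the GPU memory wall by enabling researchers to train large state-of-the-art models that do not fit in GPU memory. It applies optimal tensor rematerialization (as detailed in the paper): instead of keeping every intermediate activation in memory for the backward pass, it frees some of them and recomputes them when they are needed, trading extra computation time for lower memory use.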
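To make the space/time trade-off concrete, here is a minimal pure-Python sketch of rematerialization on a toy chain of scalar layers. It does not use Checkmate's API and it is not Checkmate's algorithm: it uses simple fixed-size segment checkpointing, whereas Checkmate solves for an optimal schedule. The names `layer`, `layer_grad`, `backward_store_all` and `backward_with_checkpoints` are hypothetical helpers made up for this illustration.

```python
import math


def layer(x):
    # Toy "layer": a scalar tanh (stand-in for a real network layer).
    return math.tanh(x)


def layer_grad(x):
    # Derivative of the toy layer with respect to its input x.
    return 1.0 - math.tanh(x) ** 2


def backward_store_all(x0, n_layers):
    """Baseline: keep every activation from the forward pass."""
    acts = [x0]
    for _ in range(n_layers):
        acts.append(layer(acts[-1]))
    forward_calls = n_layers
    grad = 1.0
    for i in reversed(range(n_layers)):
        grad *= layer_grad(acts[i])  # chain rule, last layer first
    return grad, len(acts), forward_calls


def backward_with_checkpoints(x0, n_layers, segment):
    """Keep only one activation per segment; recompute the rest on demand."""
    checkpoints = {}
    x = x0
    forward_calls = 0
    for i in range(n_layers):
        if i % segment == 0:
            checkpoints[i] = x  # input of the first layer in each segment
        x = layer(x)
        forward_calls += 1

    grad = 1.0
    peak = len(checkpoints)
    for start in reversed(range(0, n_layers, segment)):
        end = min(start + segment, n_layers)
        # Rematerialize the inputs of layers start..end-1 from the checkpoint.
        seg = [checkpoints[start]]
        for _ in range(start, end - 1):
            seg.append(layer(seg[-1]))
            forward_calls += 1
        # seg[0] is already counted among the checkpoints.
        peak = max(peak, len(checkpoints) + len(seg) - 1)
        for a in reversed(seg):
            grad *= layer_grad(a)
    return grad, peak, forward_calls


if __name__ == "__main__":
    print(backward_store_all(0.5, 16))
    print(backward_with_checkpoints(0.5, 16, 4))
```

With 16 layers and segments of 4, the checkpointed version holds at most 7 activations instead of 17, at the cost of 28 layer evaluations instead of 16, and both versions return the same gradient. An optimal rematerialization schedule chooses which tensors to keep and which to recompute rather than using a fixed segment size.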

