Contents:
Bases: DataParallel
DataParallel
Wrapper class for multi-gpu training.
from https://github.com/pytorch/tutorials/issues/836