A talk shared by Sebastian Ruder. Source: https://www.slideshare.net/SebastianRuder/optimization-for-deep-learning