v1.3.7
Lots of documentation, squashed bugs, and miscellaneous changes.
Features
- Load checkpoint for fine-tuning in 0349c3f by @henrykmichalewski !
- Added generic input pipeline for GLUE tasks in 2f490e8 by @henrykmichalewski - thanks a lot!
- Some nascent support for bfloat16s!
Models
- Performer's Favor and CausalFavor - 77db199 thanks @lukaszkaiser !
- Funnel-Transformer in #1156 thanks a lot @mvxxx !
- BERT for Trax in #1254, #1223, etc. by @piotrekp1 - thanks a lot!
- Residual Exchange Network by @kkanska in 3a8f402 ! Thanks!
PRs Merged
- Fixing broken example links in #1263 thanks @amtagrwl !
- Added WideResnet, Deconv, etc. example notebooks in #1259, #1202, #1232 - thanks a lot @SauravMaheshkar !
- Remove implicit `object` from the base class in #1228 thanks @HarshCasper !
- Fashion MNIST example in #1199 thanks @jimexist !
- Fix PretrainedBERT init in #1135 by @hepaajan !
- Typo in the TransformerDecoder input parameters description in #1100 by @kujaomega !