345M model support
- Support the 345M model (thanks to Neil Shepperd for the gradient checkpointing implementation!)
- Support model_name in the CLI for above support
- Support run_name in the CLI
- Support
.csv
files as an input dataset tofinetune
(will parse the CSV as if it was done viaencode_csv()
). - Fix one off issues (#21)