Skip to content

Release 0.2.1

Compare
Choose a tag to compare
@workingloong workingloong released this 11 Oct 09:38
· 1168 commits to master since this release
48fa032

DLRover:

ElasticJob:

  • Autotuning batch size without restarting the job.
  • Automatically detect the straggler (slow worker).

TFPlus

TFPlus 0.1.0 has been released, see detail in https://github.com/intelligent-machine-learning/dlrover/tree/master/tfplus

Kv Variable (Core Embedding Capability)

  • High-performance Embedding Ops
  • Kv Variable low level APIs (4 in total)
    • tfplus.get_kv_variable
    • embedding_lookup
    • embedding_lookup_sparse
    • safe_embedding_lookup_sparse
  • Dynamic expansion and partitioning of Embedding weights
  • Support for both single-machine training and PS/Worker cluster training

High-performance Optimizers

  • Common optimizers compatible with Kv Variable
    • Adam
    • Adagrad
  • In-house deep learning optimizers based on Sparse Group Lasso
    • Group Adam
    • Group Adagrad