r1.15.5-deeprec2204
liutongxuan released this 07 Apr 12:27
Major Features and Improvements
Embedding
- Support hybrid storage of EmbeddingVariable (DRAM, PMEM, LevelDB).
- Support memory-contiguous storage of multi-slot EmbeddingVariable.
- Optimize beta1_power and beta2_power slots of EmbeddingVariable.
- Support restoring the frequency of features in EmbeddingVariable.
Distributed Training
- Integrate SOK in DeepRec.
Graph Optimization
- Auto Graph Fusion: support float32/int32/int64 types for select fusion.
- SmartStage: fix a bug where the graph contained a cycle when SmartStage optimization was enabled.
Runtime Optimization
- GPUTensorPoolAllocator, which reduces GPU memory usage and improves performance.
- PMEMAllocator, which supports allocation in persistent memory.
Optimizer
- Optimize AdamOptimizer performance.
Op & Hardware Acceleration
- Change the fused MatMul layout type and number of threads for small inputs.
IO & Dataset
- KafkaGroupIODataset, which supports consumer rebalancing.
Model Save/Restore
- Support dumping incremental graph info.
Serving
- Add a serving module (ODL processor), which supports Online Deep Learning (ODL).
More details on these features: https://deeprec.readthedocs.io/zh/latest/
Release Images
CPU Image
registry.cn-shanghai.aliyuncs.com/pai-dlc-share/deeprec-training:deeprec2204-cpu-py36-ubuntu18.04
GPU Image
registry.cn-shanghai.aliyuncs.com/pai-dlc-share/deeprec-training:deeprec2204-gpu-py36-cu110-ubuntu18.04
Known Issue
Some users have reported issues when using EmbeddingVariable, such as #167. The bug is fixed in r1.15.5-deeprec2204u1.