Skip to content

r1.15.5-deeprec2204

Compare
Choose a tag to compare
@liutongxuan liutongxuan released this 07 Apr 12:27
· 783 commits to main since this release
31f8362

Major Features and Improvements

Embedding

  • Support hybrid storage of EmbeddingVariable (DRAM, PMEM, LevelDB)
  • Support memory-continuous storage of multi-slot EmbeddingVariable.
  • Optimize beta1_power and beta2_power slots of EmbeddingVariable.
  • Support restore frequency of features in EmbeddingVariable.

Distributed Training

  • Integrate SOK in DeepRec.

Graph Optimization

  • Auto Graph Fusion, support float32/int32/int64 type for select fusion.
  • SmartStage, fix graph contains circle bug when enable SmartStage optimization.

Runtime Optimization

  • GPUTensorPoolAllocator, which reduce GPU memory usage and improve performance.
  • PMEMAllocator, support allocation in persistent memory.

Optimizer

  • Optimize AdamOptimizer performance.

Op & Hardware Acceleration

  • Change fused MatMul layout type and number thread for small size inputs.

IO & Dataset

  • KafkaGroupIODataset, support consumer rebalance.

Model Save/Restore

  • Support dump incremental graph info.

Serving

  • Add serving module (ODL processor), which support Online Deep Learning (ODL).

More details of features: https://deeprec.readthedocs.io/zh/latest/

Release Images

CPU Image

registry.cn-shanghai.aliyuncs.com/pai-dlc-share/deeprec-training:deeprec2204-cpu-py36-ubuntu18.04

GPU Image

registry.cn-shanghai.aliyuncs.com/pai-dlc-share/deeprec-training:deeprec2204-gpu-py36-cu110-ubuntu18.04

Known Issue

Some user report issue when use Embedding Variable, such as #167. The bug is fixed in r1.15.5-deeprec2204u1.