OpenNMT-py v3.5.0
3.5.0 (2024-02-22)
- Further improvements and fixes
- Suport for AWQ models
- Add n_best for topp/topk generation
- Support MoE (Mixtral) inference
- Extend HF models converter
- use flash_attn_with_kvcache for faster inference
- Add wikitext2 PPL computation
- Support for Phi-2 models