Commit

Update Release 0.4.1 changelog
piotrm-nvidia committed Nov 8, 2023
1 parent a744f0e commit 0bc9ea5
Showing 2 changed files with 4 additions and 2 deletions.
2 changes: 1 addition & 1 deletion CHANGELOG.md
@@ -16,7 +16,7 @@ limitations under the License.

# Changelog

## Unreleased
## 0.4.1 (2023-11-09)

- New: The location of workspaces holding temporary Triton model repositories and communication file sockets can be configured with the `$PYTRITON_HOME` environment variable
- Fix: Restore handling of `KeyboardInterrupt` in `triton.serve()`
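
As an illustration of the `$PYTRITON_HOME` entry, the following is a minimal sketch of setting the variable from Python. The directory name is hypothetical, and the sketch assumes the variable has to be in the environment before PyTriton creates its workspace; the changelog does not state when the value is read.

```python
import os
import tempfile

# Hypothetical location for PyTriton workspaces; any writable directory would do.
workspace_root = tempfile.mkdtemp(prefix="pytriton-home-")

# Assumption: the variable should be set before the PyTriton server is created,
# so that temporary model repositories and socket files land under this path.
os.environ["PYTRITON_HOME"] = workspace_root

print(f"PYTRITON_HOME set to {os.environ['PYTRITON_HOME']}")
```

Exporting the variable in the shell that launches the script should have the same effect.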
4 changes: 3 additions & 1 deletion docs/known_issues.md
@@ -20,4 +20,6 @@ limitations under the License.
- Support is currently limited to the x86-64 instruction set architecture.
- Running multiple scripts hosting PyTriton on the same machine or container is not feasible.
- Deadlocks may occur in some models when employing the NCCL communication library and multiple Inference Callables are triggered concurrently. This issue can be observed when deploying multiple instances of the same model or multiple models within a single server script. Additional information about this issue can be found [here](https://docs.nvidia.com/deeplearning/nccl/user-guide/docs/usage/communicators.html#using-multiple-nccl-communicators-concurrently).
- Enabling verbose logging may cause a significant performance drop in model inference.
- Creating a Triton object leaks a single instance of the ModelClient object. This is a known issue and will be fixed in a future release. It may cause gevent warnings to be printed to the console.
- GRPC ModelClient doesn't support timeouts for model configuration and model metadata requests due to a limitation in the underlying tritonclient library.
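
For the missing timeouts in the last item, one generic way to bound the wait on the caller's side is to run the blocking request in a worker thread. This is only a sketch of that general pattern, not a PyTriton or tritonclient feature: `fetch_model_config` is a hypothetical stand-in for the blocking call, and the underlying request is not cancelled when the timeout expires.

```python
import concurrent.futures
import time


def fetch_model_config():
    """Hypothetical stand-in for a blocking request that accepts no timeout."""
    time.sleep(0.2)
    return {"name": "example"}


def call_with_timeout(fn, timeout_s):
    """Run fn in a worker thread and wait at most timeout_s seconds for it."""
    executor = concurrent.futures.ThreadPoolExecutor(max_workers=1)
    future = executor.submit(fn)
    try:
        # Raises concurrent.futures.TimeoutError if fn is still running.
        return future.result(timeout=timeout_s)
    finally:
        # Do not block on the worker; a timed-out call keeps running in the background.
        executor.shutdown(wait=False)


if __name__ == "__main__":
    print(call_with_timeout(fetch_model_config, timeout_s=1.0))
```

The trade-off is that a timed-out call still occupies its thread until it returns, so this limits how long the caller waits rather than how long the request runs.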
