gpud-v0.1.5
GPUd release notes (2024-11-08T02:20:32Z)
Welcome to this new release!
What's Changed
- fix(join): user detected provider by default if not provided by @cardyok in #146
- fix(components): do not attempt register if already registered by @gyuho in #148
- fix(nvidia): remove init func for nvidia packages (do not check nvidia if not needed) by @gyuho in #150
- nit(nvidia/nvml): log nvml call failures in critical paths, add xid 119 test cases by @gyuho in #149
- nits(accelerator/nvidia): rename reason fields, Xid/SXid detail fields for clarification by @gyuho in #151
- nits(accelerator): remove redundant criticality, suggested action fields, define sxid reason struct only by @gyuho in #152
- feat(components/os): count process counts per status (e.g., zombie, detached) by @gyuho in #147
- feat(nvidia/xid,sxid): more accurate criticality, suggested actions by GPUd, catch all Xids by @gyuho in #145
- feat(components): rename action name REPAIR_HARDWARE to HARDWARE_INSPECTION by @gyuho in #153
- feat(nvidia/xid, sxid): catch all events, return critical for /states, non-critical ones for /events by @gyuho in #155
- chore(fix): fix typo corrent -> correct by @Yangqing in #158
- feat(package): support deleting packages on session delete by @cardyok in #154
- nits(pkg): move "go-pkg" ones to "pkg" by @gyuho in #156
- feat(nvidia/xid, sxid): return all via /events by @gyuho in #159
- feat(nvidia): add/document xid 94, sxid 20009 by @gyuho in #160
- feat(nvidia/remapped-rows): suggest reboot/inspection on row remapping by @gyuho in #161
- feat(notify): support sending notification to control plane by @cardyok in #163
- feat(nvidia/gsp-firmware-mode): initial commit to track GSP modes by @gyuho in #162
New Contributors
Full Changelog: v0.1.2...v0.1.5