Skip to content

gpud-v0.1.5

Compare
Choose a tag to compare
@github-actions github-actions released this 08 Nov 02:21
· 59 commits to main since this release
c9fba83

GPUd release notes (2024-11-08T02:20:32Z)

Welcome to this new release!

What's Changed

  • fix(join): user detected provider by default if not provided by @cardyok in #146
  • fix(components): do not attempt register if already registered by @gyuho in #148
  • fix(nvidia): remove init func for nvidia packages (do not check nvidia if not needed) by @gyuho in #150
  • nit(nvidia/nvml): log nvml call failures in critical paths, add xid 119 test cases by @gyuho in #149
  • nits(accelerator/nvidia): rename reason fields, Xid/SXid detail fields for clarification by @gyuho in #151
  • nits(accelerator): remove redundant criticality, suggested action fields, define sxid reason struct only by @gyuho in #152
  • feat(components/os): count process counts per status (e.g., zombie, detached) by @gyuho in #147
  • feat(nvidia/xid,sxid): more accurate criticality, suggested actions by GPUd, catch all Xids by @gyuho in #145
  • feat(components): rename action name REPAIR_HARDWARE to HARDWARE_INSPECTION by @gyuho in #153
  • feat(nvidia/xid, sxid): catch all events, return critical for /states, non-critical ones for /events by @gyuho in #155
  • chore(fix): fix typo corrent -> correct by @Yangqing in #158
  • feat(package): support deleting packages on session delete by @cardyok in #154
  • nits(pkg): move "go-pkg" ones to "pkg" by @gyuho in #156
  • feat(nvidia/xid, sxid): return all via /events by @gyuho in #159
  • feat(nvidia): add/document xid 94, sxid 20009 by @gyuho in #160
  • feat(nvidia/remapped-rows): suggest reboot/inspection on row remapping by @gyuho in #161
  • feat(notify): support sending notification to control plane by @cardyok in #163
  • feat(nvidia/gsp-firmware-mode): initial commit to track GSP modes by @gyuho in #162

New Contributors

Full Changelog: v0.1.2...v0.1.5