Issues: ml-energy/zeus
Issues list
Incorporate Zeusd for CPU and DRAM monitoring in ZeusMonitor
cpu
enhancement
New feature or request
#146
opened Dec 11, 2024 by
wbjin
Investigate Intel PCM support
cpu
enhancement
New feature or request
#142
opened Dec 8, 2024 by
jaywonchung
Subclass torch.distributed.pipelining.PipelineStage for PFO
enhancement
New feature or request
#131
opened Oct 13, 2024 by
jaywonchung
Add CPU support for the PowerMonitor
enhancement
New feature or request
#128
opened Sep 23, 2024 by
sharonsyh
[Testing] A simple mock device implementation for testing
enhancement
New feature or request
#127
opened Sep 20, 2024 by
jaywonchung
[RFC] Integration of Prometheus Push Gateway and Energy Metrics Collection in Zeus
#125
opened Sep 15, 2024 by
sharonsyh
Lazily initialize RAPL wraparound monitor processes
enhancement
New feature or request
good first issue
Good for newcomers
#121
opened Sep 10, 2024 by
jaywonchung
Integration with IPMI metrics
enhancement
New feature or request
#112
opened Aug 25, 2024 by
jaywonchung
Support for NVIDIA Jetson platforms
enhancement
New feature or request
#103
opened Jul 26, 2024 by
jaywonchung
6 tasks
[Zeusd] Better failure handling and testing
enhancement
New feature or request
#88
opened May 30, 2024 by
jaywonchung
Training framework integration opportunities
integration
roadmap
#77
opened May 16, 2024 by
jaywonchung
Test and verify nvmlDeviceSetAPIRestriction
enhancement
New feature or request
#59
opened May 3, 2024 by
jaywonchung
Carbon-aware Zeus (Chase) as an optimizer
enhancement
New feature or request
#53
opened Apr 28, 2024 by
jaywonchung
GlobalPowerLimitOptimizer for distributed data parallel training
enhancement
#43
opened Mar 13, 2024 by
jaywonchung
3 tasks
Cluster-wide energy metric aggregation
enhancement
New feature or request
#30
opened Oct 27, 2023 by
jaywonchung
OperationProfiler and PerseusOptimizer server and client
enhancement
#21
opened Oct 8, 2023 by
jaywonchung