Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Moneo refresh #74

Merged
merged 1 commit into from
Jan 10, 2024
Merged

Moneo refresh #74

merged 1 commit into from
Jan 10, 2024

Conversation

rafsalas19
Copy link
Collaborator

  • Update fix nvidia-exporter sample rates
  • Updated node-exporter to have eth device naming argument
  • refreshed dashboard and ifra deployment
  • Add quick deploy script for linux services
  • refreshed linux service deployment
  • Bug fixes and doc updates

@RyoYang
Copy link
Collaborator

RyoYang commented Jan 10, 2024

  1. Could you also update the ETH device argument and sample rate in the container configuration here?https://github.com/Azure/Moneo/blob/70c0a2d75355d82909784c886ed0fc169a49a033/dockerfile/moneo-exporter-nvidia_entrypoint.sh#L17C1-L17C37?
  2. In the current configure_service.sh, if we use the managed Prometheus method, which means the argument should be empty, it still installs some unnecessary packages related to Azure Monitor. You can check here https://github.com/Azure/Moneo/blob/70c0a2d75355d82909784c886ed0fc169a49a033/linux_service/configure_service.sh#L35C1-L36C60.

refresh dashboards

Add front-end network naming option

dashboard fix

formatting

lint

addressing pr comments
@rafsalas19
Copy link
Collaborator Author

  1. Could you also update the ETH device argument and sample rate in the container configuration here?https://github.com/Azure/Moneo/blob/70c0a2d75355d82909784c886ed0fc169a49a033/dockerfile/moneo-exporter-nvidia_entrypoint.sh#L17C1-L17C37?
  2. In the current configure_service.sh, if we use the managed Prometheus method, which means the argument should be empty, it still installs some unnecessary packages related to Azure Monitor. You can check here https://github.com/Azure/Moneo/blob/70c0a2d75355d82909784c886ed0fc169a49a033/linux_service/configure_service.sh#L35C1-L36C60.

Addressed

@rafsalas19 rafsalas19 merged commit c7dc48b into Azure:main Jan 10, 2024
5 checks passed
@afragop72
Copy link

Hello

I am new on using Moneo for monitoring our Azure HPC cluster.
We are using Headless Managed Grafana to visualize metrics from GPUs.

How do we enable GPU profiling metrics collection?

Thanks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants