
Alloy does not work on Windows 11 #2225

Open
igolets opened this issue Dec 6, 2024 · 6 comments
Labels
bug Something isn't working

Comments

@igolets

igolets commented Dec 6, 2024

What's wrong?

I've installed Alloy on my Windows 11 machine using the script provided by the configuration master. "C:\Program Files\GrafanaLabs\Alloy\config.alloy" was created and contains my token, but Test Connection fails ("No metrics were found for this integration.").
I did some basic checks and here is what I've found:

  1. Service is running.
  2. In the Windows Application log I see one error message: ts=2024-12-06T09:11:45.650179Z level=error source=/go/pkg/mod/github.com/!kim!machine!gun/[email protected]/memlimit/memlimit.go:117 msg="failed to set GOMEMLIMIT" package=github.com/KimMachineGun/automemlimit/memlimit error="failed to set GOMEMLIMIT: cgroups is not supported on this system"

Googling did not give any clues.

Steps to reproduce

  1. Get the setup script using the configuration master (cd "%TEMP%" && powershell -c Invoke-WebRequest "https://storage.googleapis.com/cloud-onboarding/alloy/scripts/install-windows.ps1" -OutFile "install-windows.ps1" && powershell -executionpolicy Bypass -File ".\install-windows.ps1"....)
  2. Run it in an elevated CMD prompt.
  3. Check the Windows event log (Application).

System information

Windows 11 x64 PRO 24H2

Software version

Grafana Alloy 1.5.1

Configuration

remotecfg {
	url            = "https://******.grafana.net"
	id             = "******"
	poll_frequency = "60s"

	basic_auth {
		username = "******"
		password = "glc_******"
	}
}

Logs

ts=2024-12-06T09:11:45.650179Z level=error source=/go/pkg/mod/github.com/!kim!machine!gun/[email protected]/memlimit/memlimit.go:117 msg="failed to set GOMEMLIMIT" package=github.com/KimMachineGun/automemlimit/memlimit error="failed to set GOMEMLIMIT: cgroups is not supported on this system"
ts=2024-12-06T09:11:45.8593396Z level=info msg="no peer discovery configured: both join and discover peers are empty" service=cluster
ts=2024-12-06T09:11:45.8593396Z level=info msg="running usage stats reporter"
ts=2024-12-06T09:11:45.8593396Z level=info msg="starting complete graph evaluation" controller_path=/ controller_id="" trace_id=8916bce04d8fa5b0e0d3de118bd490fa
ts=2024-12-06T09:11:45.8593396Z level=info msg="enabled windows_exporter collectors" component_path=/ component_id=prometheus.exporter.windows.integrations_windows_exporter collectors=cpu,cs,diskdrive,logical_disk,net,os,service,system,time
ts=2024-12-06T09:11:45.8593396Z level=warn msg="No where-clause specified for service collector. This will generate a very large number of metrics!" component_path=/ component_id=prometheus.exporter.windows.integrations_windows_exporter collector=service
ts=2024-12-06T09:11:45.8593396Z level=info msg="finished node evaluation" controller_path=/ controller_id="" trace_id=8916bce04d8fa5b0e0d3de118bd490fa node_id=prometheus.exporter.windows.integrations_windows_exporter duration=191.5318ms
ts=2024-12-06T09:11:45.8593396Z level=info msg="finished node evaluation" controller_path=/ controller_id="" trace_id=8916bce04d8fa5b0e0d3de118bd490fa node_id=discovery.relabel.integrations_windows_exporter duration=0s
ts=2024-12-06T09:11:45.8593396Z level=info msg="finished node evaluation" controller_path=/ controller_id="" trace_id=8916bce04d8fa5b0e0d3de118bd490fa node_id=tracing duration=0s
ts=2024-12-06T09:11:45.8593396Z level=info msg="finished node evaluation" controller_path=/ controller_id="" trace_id=8916bce04d8fa5b0e0d3de118bd490fa node_id=logging duration=0s

@igolets igolets added the bug Something isn't working label Dec 6, 2024
@naZzethe3rd

We confirmed the same error with Windows Server 2019.

@dehaansa
Contributor

dehaansa commented Dec 9, 2024

While that error log is annoying, it should be a red herring, as it only affects the automatic setting of the GOMEMLIMIT variable.

I see that you mentioned the test connection failed in Fleet Management; however, if you go look at your collector fleet, does the Alloy instance appear? I believe FM is working on some issues where the test connection fails but the collector is actually successfully connected.

@humbberto

Facing the same issue here.

.\alloy-windows-amd64.exe run .\config.alloy --stability.level public-preview
ts=2024-12-10T21:31:05.266222Z level=info "boringcrypto enabled"=false
ts=2024-12-10T21:31:04.9237752Z level=error source=/go/pkg/mod/github.com/!kim!machine!gun/[email protected]/memlimit/memlimit.go:117 msg="failed to set GOMEMLIMIT" package=github.com/KimMachineGun/automemlimit/memlimit error="failed to set GOMEMLIMIT: cgroups is not supported on this system"
ts=2024-12-10T21:31:05.2667357Z level=info msg="no peer discovery configured: both join and discover peers are empty" service=cluster
ts=2024-12-10T21:31:05.2667357Z level=info msg="running usage stats reporter"
ts=2024-12-10T21:31:05.2667357Z level=info msg="starting complete graph evaluation" controller_path=/ controller_id="" trace_id=196bc9dce13d4fdea8cc515903d8c143
ts=2024-12-10T21:31:05.2667357Z level=info msg="finished node evaluation" controller_path=/ controller_id="" trace_id=196bc9dce13d4fdea8cc515903d8c143 node_id=remotecfg duration=340.2089ms
ts=2024-12-10T21:31:05.2667357Z level=info msg="applying non-TLS config to HTTP server" service=http
ts=2024-12-10T21:31:05.2667357Z level=info msg="finished node evaluation" controller_path=/ controller_id="" trace_id=196bc9dce13d4fdea8cc515903d8c143 node_id=http duration=0s
ts=2024-12-10T21:31:05.267259Z level=info msg="finished node evaluation" controller_path=/ controller_id="" trace_id=196bc9dce13d4fdea8cc515903d8c143 node_id=cluster duration=0s
ts=2024-12-10T21:31:05.267259Z level=info msg="finished node evaluation" controller_path=/ controller_id="" trace_id=196bc9dce13d4fdea8cc515903d8c143 node_id=livedebugging duration=0s
ts=2024-12-10T21:31:05.267259Z level=info msg="finished node evaluation" controller_path=/ controller_id="" trace_id=196bc9dce13d4fdea8cc515903d8c143 node_id=ui duration=0s
ts=2024-12-10T21:31:05.267259Z level=info msg="finished node evaluation" controller_path=/ controller_id="" trace_id=196bc9dce13d4fdea8cc515903d8c143 node_id=labelstore duration=0s
ts=2024-12-10T21:31:05.267259Z level=info msg="finished node evaluation" controller_path=/ controller_id="" trace_id=196bc9dce13d4fdea8cc515903d8c143 node_id=tracing duration=0s
ts=2024-12-10T21:31:05.267259Z level=info msg="finished node evaluation" controller_path=/ controller_id="" trace_id=196bc9dce13d4fdea8cc515903d8c143 node_id=logging duration=1.037ms
ts=2024-12-10T21:31:05.267804Z level=info msg="finished node evaluation" controller_path=/ controller_id="" trace_id=196bc9dce13d4fdea8cc515903d8c143 node_id=otel duration=0s
ts=2024-12-10T21:31:05.267804Z level=info msg="finished complete graph evaluation" controller_path=/ controller_id="" trace_id=196bc9dce13d4fdea8cc515903d8c143 duration=342.3349ms
ts=2024-12-10T21:31:05.267804Z level=info msg="scheduling loaded components and services"
ts=2024-12-10T21:31:05.267804Z level=info msg="starting cluster node" service=cluster peers_count=0 peers="" advertise_addr=127.0.0.1:12345

@erikbaranowski
Contributor

That error message typically appears when Alloy runs on Windows and still needs to be addressed, but I believe it is unrelated.

There are a few things going on here. I think the question lies more with Grafana Cloud than with Alloy from here.

  • It looks like you have installed Alloy with the Enable Remote Configuration option.
  • The Test Connection button looks for the existence of a specific Alloy metric in order to succeed.
  • Using Fleet Management should create some enabled pipelines which match your collector, such as self_monitoring_metrics.
  • This self_monitoring_metrics pipeline should create the metric that you need for Test Connection (see the sketch after this list).
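
For orientation, here is a simplified sketch of what that generated self_monitoring_metrics pipeline boils down to. This is not the exact generated config (the full version is pasted in a later comment in this thread), and the endpoint URL and username are placeholders:

// Simplified sketch of the generated self_monitoring_metrics pipeline.
// <your-prometheus-endpoint> and <your-instance-id> are placeholders.

// Expose Alloy's own metrics in memory.
prometheus.exporter.self "alloy_health" { }

// Scrape those metrics under the job label the health integration expects.
prometheus.scrape "alloy_health" {
	targets    = prometheus.exporter.self.alloy_health.targets
	forward_to = [prometheus.remote_write.default.receiver]
	job_name   = "integrations/alloy"
}

// Push the scraped metrics to your Grafana Cloud Prometheus instance.
prometheus.remote_write "default" {
	endpoint {
		url = "https://<your-prometheus-endpoint>/api/prom/push"

		basic_auth {
			username = "<your-instance-id>"
			password = sys.env("GCLOUD_RW_API_KEY")
		}
	}
}

If metrics from this pipeline never arrive, Test Connection has nothing to find, which matches the symptom above.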

Debug

Navigate here:

[screenshot]

Does this pipeline exist, and is it enabled?

[screenshot]

Is your collector showing here, and is its status green?

[screenshot]

When you click on your collector, does any health data show?

[screenshot]

What remote configuration is being served?

[screenshot]

Additional Thoughts

Each remote configuration is an independent pipeline. We are working on a long-term solution so that there is a native Fleet Management integration workflow, but for now, if you create a remote configuration by copy/pasting integration config, you will need to add the remote write component yourself so that the data is forwarded. This may be relevant if you are, for example, setting up the windows integration using Fleet Management.
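
As a hypothetical illustration, assuming you pasted a windows integration snippet into a new remote configuration, the exporter and scrape components come from the copied config, and the prometheus.remote_write block (endpoint URL and username are placeholders here) is the part you currently have to add yourself:

// Copied from the integration config: collect and scrape Windows metrics.
prometheus.exporter.windows "integrations_windows_exporter" { }

prometheus.scrape "integrations_windows_exporter" {
	targets    = prometheus.exporter.windows.integrations_windows_exporter.targets
	forward_to = [prometheus.remote_write.default.receiver]
	job_name   = "integrations/windows_exporter"
}

// The part you currently have to add when copy/pasting integration config,
// so the scraped metrics are actually forwarded to Grafana Cloud.
prometheus.remote_write "default" {
	endpoint {
		url = "https://<your-prometheus-endpoint>/api/prom/push"

		basic_auth {
			username = "<your-instance-id>"
			password = sys.env("GCLOUD_RW_API_KEY")
		}
	}
}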

Alternatively, you can go back to the install steps for Alloy and uncheck the Enable Remote Configuration option to use the original Alloy Grafana Cloud workflow.

@humbberto

humbberto commented Dec 11, 2024 via email

@igolets
Author

igolets commented Dec 12, 2024

Thanks for pointing out this screen.

I see my 2 connections in Fleet Management, but there is no data from either (Win11 and CentOS). As I understand the config, I should at least see Alloy logs. Could you give me any clues about what to try to see something?

Win11 config:
declare "self_monitoring_logs_windows" {

// THIS IS A GENERATED REMOTE CONFIGURATION.
// 
//   * You can edit the contents and matchers for this configuration without them being overwritten.
//   * If you delete ALL generated configurations, the latest default versions will be recreated.
//   * This configuration requires the following environment variables to be set wherever alloy is running:
//     * GCLOUD_RW_API_KEY: The Grafana Cloud API key with write access to Loki.
//     * GCLOUD_FM_COLLECTOR_ID: A unique collector ID matching the remotecfg id argument value.

// Write logs to your Grafana Cloud Loki instance.
loki.write "grafana_cloud_loki" {
	endpoint {
		url = "https://logs-prod-025.grafana.net/loki/api/v1/push"

		basic_auth {
			username = "*******"
			password = sys.env("GCLOUD_RW_API_KEY")
		}
	}
}

// Read Alloy logs from the Windows Event Log with the following additional labels:
//   * job: "integrations/alloy" is compatible with Grafana Cloud's Alloy Health Integrations.
//   * collector_id: The unique collector ID matching the remotecfg id argument value.
//                   Used to match collector-specific metrics to power the 'Collector
//                   Health' section of the Fleet Management UI.
loki.source.windowsevent "alloy_logs" {
	xpath_query = `<QueryList>
*[System[Provider[@name='Alloy']]] `
	forward_to = [loki.write.grafana_cloud_loki.receiver]
	labels     = {"job" = "integrations/alloy", "collector_id" = sys.env("GCLOUD_FM_COLLECTOR_ID")}
}

}

self_monitoring_logs_windows "default" { }

declare "self_monitoring_metrics" {

// THIS IS A GENERATED REMOTE CONFIGURATION.
//
//   * You can edit the contents and matchers for this configuration without them being overwritten.
//   * If you delete ALL generated configurations, the latest default versions will be recreated.
//   * This configuration requires the following environment variables to be set wherever alloy is running:
//     * GCLOUD_RW_API_KEY: The Grafana Cloud API key with write access to Loki.
//     * GCLOUD_FM_COLLECTOR_ID: A unique collector ID matching the remotecfg id argument value.

// Export Alloy metrics in memory.
prometheus.exporter.self "integrations_alloy_health" { }

// Target Alloy metrics with the following additional labels:
//   * job: "integrations/alloy" is compatible with Grafana Cloud's Alloy Health Integrations.
//   * collector_id: The unique collector ID matching the remotecfg id argument value.
//                   Used to match collector-specific metrics to power the 'Collector
//                   Health' section of the Fleet Management UI.
//   * instance: The hostname of the machine running Alloy.
discovery.relabel "integrations_alloy_health" {
	targets = prometheus.exporter.self.integrations_alloy_health.targets

	rule {
		action       = "replace"
		target_label = "collector_id"
		replacement  = sys.env("GCLOUD_FM_COLLECTOR_ID")
	}

	rule {
		target_label = "instance"
		replacement  = constants.hostname
	}

	rule {
		target_label = "job"
		replacement  = "integrations/alloy"
	}
}

// Scrape Alloy metrics and forward them to the remote write component.
prometheus.scrape "integrations_alloy_health" {
	targets = array.concat(
		discovery.relabel.integrations_alloy_health.output,
	)
	forward_to = [prometheus.remote_write.default.receiver]
	job_name   = "integrations/alloy"
}

// Write metrics to your Grafana Cloud Prometheus instance.
prometheus.remote_write "default" {
	endpoint {
		url = "https://prometheus-prod-39-prod-eu-north-0.grafana.net/api/prom/push"

		basic_auth {
			username = "*******"
			password = sys.env("GCLOUD_RW_API_KEY")
		}
	}
}

}

self_monitoring_metrics "default" { }

Linux config:
declare "self_monitoring_logs_linux" {

// THIS IS A GENERATED REMOTE CONFIGURATION.
// 
//   * You can edit the contents and matchers for this configuration without them being overwritten.
//   * If you delete ALL generated configurations, the latest default versions will be recreated.
//   * This configuration requires the following environment variables to be set wherever alloy is running:
//     * GCLOUD_RW_API_KEY: The Grafana Cloud API key with write access to Loki.
//     * GCLOUD_FM_COLLECTOR_ID: A unique collector ID matching the remotecfg id argument value.

// Write logs to your Grafana Cloud Loki instance.
loki.write "grafana_cloud_loki" {
	endpoint {
		url = "https://logs-prod-025.grafana.net/loki/api/v1/push"

		basic_auth {
			username = "******"
			password = sys.env("GCLOUD_RW_API_KEY")
		}
	}
}

// Read Alloy logs when running as a systemd service with the following additional labels:
//   * job: "integrations/alloy" is compatible with Grafana Cloud's Alloy Health Integrations.
//   * collector_id: The unique collector ID matching the remotecfg id argument value.
//                   Used to match collector-specific metrics to power the 'Collector
//                   Health' section of the Fleet Management UI.
loki.source.journal "alloy_logs_unit" {
	matches    = "_SYSTEMD_UNIT=alloy.service"
	forward_to = [loki.write.grafana_cloud_loki.receiver]
	labels     = {"job" = "integrations/alloy", "collector_id" = sys.env("GCLOUD_FM_COLLECTOR_ID")}
}

// Read Alloy logs from syslog with the following additional labels:
//   * job: "integrations/alloy" is compatible with Grafana Cloud's Alloy Health Integrations.
//   * collector_id: The unique collector ID matching the remotecfg id argument value.
//                   Used to match collector-specific metrics to power the 'Collector
//                   Health' section of the Fleet Management UI.
loki.source.journal "alloy_logs_tag" {
	matches    = "SYSLOG_IDENTIFIER=alloy"
	forward_to = [loki.write.grafana_cloud_loki.receiver]
	labels     = {"job" = "integrations/alloy", "collector_id" = sys.env("GCLOUD_FM_COLLECTOR_ID")}
}

}

self_monitoring_logs_linux "default" { }

declare "self_monitoring_metrics" {

// THIS IS A GENERATED REMOTE CONFIGURATION.
//
//   * You can edit the contents and matchers for this configuration without them being overwritten.
//   * If you delete ALL generated configurations, the latest default versions will be recreated.
//   * This configuration requires the following environment variables to be set wherever alloy is running:
//     * GCLOUD_RW_API_KEY: The Grafana Cloud API key with write access to Loki.
//     * GCLOUD_FM_COLLECTOR_ID: A unique collector ID matching the remotecfg id argument value.

// Export Alloy metrics in memory.
prometheus.exporter.self "integrations_alloy_health" { }

// Target Alloy metrics with the following additional labels:
//   * job: "integrations/alloy" is compatible with Grafana Cloud's Alloy Health Integrations.
//   * collector_id: The unique collector ID matching the remotecfg id argument value.
//                   Used to match collector-specific metrics to power the 'Collector
//                   Health' section of the Fleet Management UI.
//   * instance: The hostname of the machine running Alloy.
discovery.relabel "integrations_alloy_health" {
	targets = prometheus.exporter.self.integrations_alloy_health.targets

	rule {
		action       = "replace"
		target_label = "collector_id"
		replacement  = sys.env("GCLOUD_FM_COLLECTOR_ID")
	}

	rule {
		target_label = "instance"
		replacement  = constants.hostname
	}

	rule {
		target_label = "job"
		replacement  = "integrations/alloy"
	}
}

// Scrape Alloy metrics and forward them to the remote write component.
prometheus.scrape "integrations_alloy_health" {
	targets = array.concat(
		discovery.relabel.integrations_alloy_health.output,
	)
	forward_to = [prometheus.remote_write.default.receiver]
	job_name   = "integrations/alloy"
}

// Write metrics to your Grafana Cloud Prometheus instance.
prometheus.remote_write "default" {
	endpoint {
		url = "https://prometheus-prod-39-prod-eu-north-0.grafana.net/api/prom/push"

		basic_auth {
			username = "******"
			password = sys.env("GCLOUD_RW_API_KEY")
		}
	}
}

}

self_monitoring_metrics "default" { }
