Temperatures for cores not showing #1284
Comments
What does the full output for |
Log File
|
The server setup I have is 2x E5-2680 v2, which should be two threads per core, so I'm assuming there should only be 20 sensors. Oddly enough, |
My problem is that none show at all, even though I still get a valid sensors reading. |
@Kharlenkow :
Please provide the output as plain text. While images are fine for pointing at UI issues or conveying what the display looks like, they usually aren't very accessible or easy to process further. Also, your screenshot is missing the (interesting) part of the
Glad to hear.
Will have to see if we find a solution to properly process the available information and correlate it with our internal view of the system. This is not the first report regarding CPU sensor stuff – and likely not the last. That stuff is strange at times.

@AnOpenSauceDev Can you provide the full contents of

Also, if you want to help a bit with investigations: Can you try to establish some kind of mapping of physical cores to the temperature sensor cores by putting some load on individual CPU threads (affinity binding) and checking which temperature follows the load? TIA.

@Kharlenkow In case you have a different CPU, having the same information (cpuinfo, sensors reading, physical<-->sensors mapping) available would be nice. |
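The load test suggested above could be run roughly like this. It is a minimal sketch, not anything from htop itself: it assumes Linux, the usual hwmon sysfs layout (`name`, `temp*_label`, `temp*_input` files under `/sys/class/hwmon/hwmon*`), and the helper names `read_coretemp`, `burn` and `hottest_after_load` are made up for illustration.

```python
import glob
import os
import time


def read_coretemp(root="/sys/class/hwmon"):
    """Return {(hwmon_dir, label): degrees_C} for all coretemp sensors under root."""
    readings = {}
    for hwmon in sorted(glob.glob(os.path.join(root, "hwmon*"))):
        try:
            with open(os.path.join(hwmon, "name")) as f:
                if f.read().strip() != "coretemp":
                    continue  # skip other drivers such as acpitz
        except OSError:
            continue
        for label_path in sorted(glob.glob(os.path.join(hwmon, "temp*_label"))):
            input_path = label_path[: -len("_label")] + "_input"
            try:
                with open(label_path) as f:
                    label = f.read().strip()
                with open(input_path) as f:
                    millideg = int(f.read().strip())  # sysfs reports millidegrees
            except (OSError, ValueError):
                continue
            readings[(os.path.basename(hwmon), label)] = millideg / 1000.0
    return readings


def burn(cpu, seconds):
    """Busy-loop on one logical CPU, then restore the previous affinity (Linux only)."""
    previous = os.sched_getaffinity(0)
    os.sched_setaffinity(0, {cpu})
    try:
        deadline = time.monotonic() + seconds
        while time.monotonic() < deadline:
            pass
    finally:
        os.sched_setaffinity(0, previous)


def hottest_after_load(cpu, seconds=10.0, root="/sys/class/hwmon"):
    """Load one logical CPU and return the sensor key whose reading rose the most."""
    before = read_coretemp(root)
    burn(cpu, seconds)
    after = read_coretemp(root)
    deltas = {key: after[key] - before[key] for key in before if key in after}
    return max(deltas, key=deltas.get) if deltas else None
```

Calling `hottest_after_load(n)` for each logical CPU `n` in turn gives a rough physical-to-sensor mapping. Temperatures lag behind load and neighbouring cores heat each other, so the result is only a hint and is best repeated a few times with pauses in between.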
It might take a while to benchmark every core, but so far nothing seems off. |
Thank you for that info. Seems this strange core ID counting is in the CPU info as well. At least makes things consistent. :) |
Thank you for paying attention to my feedback! Here is the entire output:
I am absolutely sure the two CPUs are the same, because I personally installed them into the sockets, unless I was cheated by the seller. |
Thank you for the quick feedback. I did some study of the documentation of the coretemp stuff and it seems the main issue in htop comes down to how the sensors are mapped onto the actual CPU cores. This will likely take a bit of work, as currently the information related to the cpuinfo (and thus core layout) is not kept for correlation in the libsensors code. Also, the libsensors code assumes the core IDs to be contiguous, which is clearly not the case with the example by @AnOpenSauceDev.

The second issue arises with multiple coretemp instances due to multiple CPUs present in the system. Both issues can be resolved by properly mapping the core IDs of the coretemp instances to the physical CPU cores available from

@cgzones Can you please take a look at refactoring the libsensors code? It would be nice if we could implement some proper mapping of sensors to their physical cores. The heuristic could still remain similar to what it is now, with all cores inheriting Tctl, Tdie, followed by Tccd{X}, and with only parts of the information cleared out if multiple readings are available on the same core (e.g. acpitz + coretemp). If acpitz gives temperatures for cores not covered by coretemp, those should still keep the acpitz readings.

References: |
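The kind of mapping described here can be sketched as follows. This is an illustration only, not htop's libsensors code: it assumes cpuinfo-style text with `processor`, `physical id` and `core id` fields, and coretemp labels of the form `Package id N` (or `Physical id N` on older kernels) and `Core N`. The function names and the `chips` structure are hypothetical.

```python
import re
from collections import defaultdict


def parse_cpuinfo(text):
    """Map (physical id, core id) -> sorted list of logical CPU numbers."""
    topology = defaultdict(list)
    for block in text.strip().split("\n\n"):
        fields = {}
        for line in block.splitlines():
            key, sep, value = line.partition(":")
            if sep:
                fields[key.strip()] = value.strip()
        try:
            cpu = int(fields["processor"])
            key = (int(fields["physical id"]), int(fields["core id"]))
        except (KeyError, ValueError):
            continue  # block without topology fields
        topology[key].append(cpu)
    return {key: sorted(cpus) for key, cpus in topology.items()}


def assign_temperatures(topology, chips):
    """Return {logical_cpu: degrees_C}.

    chips is a list with one {label: degrees_C} dict per coretemp instance.
    The package is taken from the instance's own "Package id N" label, so
    neither contiguous core IDs nor a single instance is assumed.
    """
    result = {}
    for labels in chips:
        package = None
        for label in labels:
            m = re.fullmatch(r"(?:Package|Physical) id (\d+)", label)
            if m:
                package = int(m.group(1))
        if package is None:
            continue  # cannot tell which socket this instance belongs to
        for label, temp in labels.items():
            m = re.fullmatch(r"Core (\d+)", label)
            if not m:
                continue
            # Every hardware thread of the core shares the core's reading.
            for cpu in topology.get((package, int(m.group(1))), []):
                result[cpu] = temp
    return result
```

Keying on the (package, core) pair rather than on the position of a sensor in the list is what makes gaps in the core ID sequence and multiple sockets harmless. Logical CPUs that receive no coretemp reading are simply absent from the result, so a fallback source such as acpitz could still fill them in.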
Thanks again for your attention!
CAUSES:
I referred to the code in
Here is the output (run on another dual-socket server with dual Xeon E5-2643 v3 CPUs, which has the same problem):
There are two SENSORs named
SOLUTION
step 0:
step 1:
step 2:
step 3:
Since I have been a little busy at work recently, the code has not yet been implemented in the original project (I am so sorry, TOT). |
That's still incomplete, because your solution does not properly track which instance of |
In the past two days, I have consulted the source code of the hwmon subsystem and lm-sensors, and tested it with numactl (which can force a task to run on a certain CPU core). First of all, the way lm-sensors reads each hwmon group, with the tempX suffix number increasing, corresponds exactly to increasing core IDs (at least on my three machines); I found no out-of-order correspondence like the one you described. In addition, for multi-socket motherboards, I also tested and verified the one-to-one correspondence between the CPU socket ID number and

To sum up, we could first use the

I hope my suggestions will be adopted! |
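The order-based correspondence reported here could be expressed as the small sketch below. It only encodes the observation from this comment (verified on the commenter's machines, not guaranteed in general). It assumes the package-level sensor has already been excluded from the tempX indices, and `pair_by_order` is a hypothetical helper name.

```python
def pair_by_order(temp_indices, core_ids):
    """Pair the n-th smallest tempX index with the n-th smallest core ID.

    temp_indices: tempX suffix numbers of the per-core sensors in ONE hwmon group.
    core_ids:     core IDs of ONE socket (duplicates from hyperthreads are fine).
    """
    temps = sorted(set(temp_indices))
    cores = sorted(set(core_ids))
    if len(temps) != len(cores):
        raise ValueError("sensor count does not match core count")
    return dict(zip(temps, cores))


# Example with non-contiguous core IDs; each core ID appears once per thread.
mapping = pair_by_order([2, 3, 10, 11], [0, 1, 8, 9, 0, 1, 8, 9])
```

Raising on a count mismatch keeps the heuristic from silently shifting every reading by one when a sensor is missing.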
@BenBE @AnOpenSauceDev I browsed the pull request list and found that she has already done what I wanted: #1352. I also tested her fork using the same method. As shown below, the problem has been solved, and each core ID is correctly matched with its temperature. I hope this PR will be accepted! |
When using htop via SSH on my Ubuntu server, I notice that even if I enable "Also show CPU temperature" (libsensors5 is installed), no temperature reading appears. I'm unsure whether this is because of my core count (40 threads total), but no matter what I do, nothing shows up alongside the usage reading.