WindowsPerf is a Visual Studio Code extension that integrates the WindowsPerf Windows on Arm performance profiling tool. It provides you with a simple way of viewing the results of a run of the wperf command line tool.
You can use the WindowsPerf extension to optimize for performance on Windows on Arm. WindowsPerf uses a top-down performance analysis methodology that enables you to start with more general performance counters and then drill down into the microarchitecture to find lines in the code that correspond to events, run hot spot analysis, and view code disassembly.
The Sampling Settings tab enables you to configure a run of wperf
to perform hot spot analysis of an executable. Alternatively, you can record events in a JSON file.
You can then analyze the data from the run in the WindowsPerf tree view.
NOTE If you are using a non-Windows on Arm machine, you can analyze results recorded in a JSON file, but you cannot record a run or use the Sampling Settings interface.
Before you start using the WindowsPerf extension, you must install the WindowsPerf command-line tool and the kernel driver. Follow the instructions in the WindowsPerf Install Guide.
To use the features of the WindowsPerf extension that are specific to Windows on Arm, such as recording a run, you must install the Arm64 build of Visual Studio Code.
The WindowsPerf extension is available from the Visual Studio Code Extension Marketplace. To install the WindowsPerf extension in Visual Studio Code, follow the instructions in the Visual Studio Code Extension Marketplace documentation.
Either add wperf.exe
to the system PATH or configure it in the windowsPerf.wperfPath
VS Code setting.
You can check your installation is set up correctly by using the WindowsPerf: Check WindowsPerf Installation command from the Command Palette.
In Visual Studio Code, open the WindowsPerf tab in the bottom panel. Click if the panel is not visible.
There are 2 ways to use the WindowsPerf Sampling Results Explorer.
- In the WindowsPerf tab, click Record. The Sampling Settings tab opens.
- In the Executable path field, specify the path to the executable file that you want to sample. You can either browse to the location where your code has been built, or enter the location directly.
- Enter optional arguments to pass to the executable file.
- Select an event to sample. You can select an event from the drop-down list or search for a specific event.
- Optionally specify the frequency at which the event should be sampled. For example, if you sample the
st_spec
hardware event at a frequency of1000
and there are 256 hits, that represents 256,000 hits of that event in your code. - Select a CPU core to monitor from the drop-down list.
- Optionally specify a Timeout.
The WindowsPerf extension updates the Command Line Preview as you specify your sampling settings. Click Record to run the command. Windows Perf runs the command for the length of time that you specified in the Timeout field, or until you click Cancel. The results display in the tree view.
- Add the flags
--annotate --disassemble --json --output out.json
to yourwperf record ...
command to capture events in a JSON file. - In the WindowsPerf tab, click Open File. Navigate to the location of your JSON file, select the file, and then click Open. The WindowsPerf tree view opens.
- For files in your code with hits, the tree view shows the line where the issue is and the number of times that a particular counter was hit. Every event has an entry in the tree view indicating the total number of hits for that event. Click the arrows next to an entry to drill down into individual functions.
- Click a line in the tree view to open that file. Lines of code with event hits are colour-coded to enable you to perform hot spot analysis.
- If you have enabled the disassembly view in the Sampling Settings page, you can move your mouse over a highlighted line to see the disassembly view. The disassembly view shows you the number of hits for the specific event, the instructions for the function, and the performance metrics for each instruction.
- To rerun the recording with the same settings and add any new event hits to the results, click [Rerun Record]{.ui} next to an entry in the tree view.
If you want to change any of your settings, and the Sampling Settings page is not visible, click [Show Sampling Settings]{.ui} .
If you encounter any errors while running the WindowsPerf extension, you can click Check WindowsPerf Installation on the error page to detect any issues with your setup and provide suggestions on how to fix them, with links to further information.
The WindowsPerf extension collects usage telemetry and sends it to Arm to help improve our products. Such data can include the time when a recording starts, the events that you want to monitor, or any incorrect results. Arm does not collect file names or locations, or any arguments when running a recording.
This extension respects the Visual Studio Code telemetry settings described in the Visual Studio Code telemetry documentation. To change your telemetry settings, follow the instructions in Disable telemetry reporting.
- Arm Community blog post on performance analysis methodology, including links to a complete list of Performance Monitoring Unit (PMU) events
- WindowsPerf command-line tool documentation
Perf is an abbreviation of Performance. Windows and Visual Studio are trademarks of the Microsoft group of companies.