Skip to content

VS Code extension for performance analysis using WindowsPerf

License

Notifications You must be signed in to change notification settings

arm-developer-tools/vscode-windowsperf

Repository files navigation

WindowsPerf Extension for Visual Studio Code

WindowsPerf is a Visual Studio Code extension that integrates the WindowsPerf Windows on Arm performance profiling tool. It provides you with a simple way of viewing the results of a run of the wperf command line tool.

WindowsPerf Sampling Results Explorer showing hot spot highlighting

You can use the WindowsPerf extension to optimize for performance on Windows on Arm. WindowsPerf uses a top-down performance analysis methodology that enables you to start with more general performance counters and then drill down into the microarchitecture to find lines in the code that correspond to events, run hot spot analysis, and view code disassembly.

The Sampling Settings tab enables you to configure a run of wperf to perform hot spot analysis of an executable. Alternatively, you can record events in a JSON file.

You can then analyze the data from the run in the WindowsPerf tree view.

NOTE If you are using a non-Windows on Arm machine, you can analyze results recorded in a JSON file, but you cannot record a run or use the Sampling Settings interface.

Before you begin

Before you start using the WindowsPerf extension, you must install the WindowsPerf command-line tool and the kernel driver. Follow the instructions in the WindowsPerf Install Guide.

To use the features of the WindowsPerf extension that are specific to Windows on Arm, such as recording a run, you must install the Arm64 build of Visual Studio Code.

Installation

The WindowsPerf extension is available from the Visual Studio Code Extension Marketplace. To install the WindowsPerf extension in Visual Studio Code, follow the instructions in the Visual Studio Code Extension Marketplace documentation.

Either add wperf.exe to the system PATH or configure it in the windowsPerf.wperfPath VS Code setting.

You can check your installation is set up correctly by using the WindowsPerf: Check WindowsPerf Installation command from the Command Palette.

Usage

In Visual Studio Code, open the WindowsPerf tab in the bottom panel. Click VS Code toggle panel icon if the panel is not visible.

There are 2 ways to use the WindowsPerf Sampling Results Explorer.

Record events using the Sampling Settings tab

  1. In the WindowsPerf tab, click Record. The Sampling Settings tab opens.
  2. In the Executable path field, specify the path to the executable file that you want to sample. You can either browse to the location where your code has been built, or enter the location directly.
  3. Enter optional arguments to pass to the executable file.
  4. Select an event to sample. You can select an event from the drop-down list or search for a specific event.
  5. Optionally specify the frequency at which the event should be sampled. For example, if you sample the st_spec hardware event at a frequency of 1000 and there are 256 hits, that represents 256,000 hits of that event in your code.
  6. Select a CPU core to monitor from the drop-down list.
  7. Optionally specify a Timeout.

The WindowsPerf extension updates the Command Line Preview as you specify your sampling settings. Click Record to run the command. Windows Perf runs the command for the length of time that you specified in the Timeout field, or until you click Cancel. The results display in the tree view.

Record events in a JSON file

  1. Add the flags --annotate --disassemble --json --output out.json to your wperf record ... command to capture events in a JSON file.
  2. In the WindowsPerf tab, click Open File. Navigate to the location of your JSON file, select the file, and then click Open. The WindowsPerf tree view opens.

Use the WindowsPerf tree view

  1. For files in your code with hits, the tree view shows the line where the issue is and the number of times that a particular counter was hit. Every event has an entry in the tree view indicating the total number of hits for that event. Click the arrows next to an entry to drill down into individual functions.
  2. Click a line in the tree view to open that file. Lines of code with event hits are colour-coded to enable you to perform hot spot analysis.
  3. If you have enabled the disassembly view in the Sampling Settings page, you can move your mouse over a highlighted line to see the disassembly view. The disassembly view shows you the number of hits for the specific event, the instructions for the function, and the performance metrics for each instruction.

WindowsPerf code disassembly view

  1. To rerun the recording with the same settings and add any new event hits to the results, click [Rerun Record]{.ui} Rerun Record button next to an entry in the tree view.

If you want to change any of your settings, and the Sampling Settings page is not visible, click [Show Sampling Settings]{.ui} Show Sampling Settings.

Troubleshooting

If you encounter any errors while running the WindowsPerf extension, you can click Check WindowsPerf Installation on the error page to detect any issues with your setup and provide suggestions on how to fix them, with links to further information.

Telemetry

The WindowsPerf extension collects usage telemetry and sends it to Arm to help improve our products. Such data can include the time when a recording starts, the events that you want to monitor, or any incorrect results. Arm does not collect file names or locations, or any arguments when running a recording.

This extension respects the Visual Studio Code telemetry settings described in the Visual Studio Code telemetry documentation. To change your telemetry settings, follow the instructions in Disable telemetry reporting.

Additional information

Trademarks

Perf is an abbreviation of Performance. Windows and Visual Studio are trademarks of the Microsoft group of companies.

About

VS Code extension for performance analysis using WindowsPerf

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published