[hal] Fix potential race in CANAPI #6819

rzblue · 2024-07-09T18:51:33Z

Currently, the call to HAL_CAN_SendMessage is not synchronized with updates to periodicSends (the HAL-side record of the netcomm sender thread's internal state).

Now, the mutex is locked before HAL_CAN_SendMessage is called to ensure the update is atomic.
periodicSends and receives also now have their own mutexes to reduce unnecessary contention between send and receive functions.
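A minimal sketch of the locking scheme described above, not the actual CANAPI.cpp code. FakeNetcomm, CanHandleSketch, and every member name are hypothetical stand-ins; the real HAL calls HAL_CAN_SendMessage where the sketch calls FakeNetcomm::Send. The point is only that the send and the map update happen under one lock, and that the receive path has its own mutex.

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <mutex>
#include <thread>

// Hypothetical stand-in for the netcomm sender thread. It only remembers
// which apiIds are currently repeating.
class FakeNetcomm {
 public:
  void Send(int32_t apiId, int32_t periodMs) {
    std::lock_guard<std::mutex> lock(mutex_);
    if (periodMs > 0) {
      repeating_[apiId] = periodMs;
    } else {
      repeating_.erase(apiId);
    }
  }
  bool IsRepeating(int32_t apiId) {
    std::lock_guard<std::mutex> lock(mutex_);
    return repeating_.count(apiId) != 0;
  }

 private:
  std::mutex mutex_;
  std::map<int32_t, int32_t> repeating_;
};

// Illustrative handle; names are assumptions, not the real HAL structures.
class CanHandleSketch {
 public:
  void WriteRepeating(int32_t apiId, int32_t repeatMs) {
    assert(repeatMs > 0);
    // Lock BEFORE sending so the send and the map update are one atomic step.
    std::lock_guard<std::mutex> lock(periodicSendsMutex_);
    netcomm_.Send(apiId, repeatMs);
    periodicSends_[apiId] = repeatMs;
  }
  void StopRepeating(int32_t apiId) {
    std::lock_guard<std::mutex> lock(periodicSendsMutex_);
    netcomm_.Send(apiId, 0);
    periodicSends_.erase(apiId);
  }
  void RecordReceive(int32_t apiId, uint64_t timestamp) {
    // Separate mutex: receives do not contend with the send path.
    std::lock_guard<std::mutex> lock(receivesMutex_);
    receives_[apiId] = timestamp;
  }
  bool InSync(int32_t apiId) {
    std::lock_guard<std::mutex> lock(periodicSendsMutex_);
    return (periodicSends_.count(apiId) != 0) == netcomm_.IsRepeating(apiId);
  }

 private:
  FakeNetcomm netcomm_;
  std::mutex periodicSendsMutex_;
  std::map<int32_t, int32_t> periodicSends_;
  std::mutex receivesMutex_;
  std::map<int32_t, uint64_t> receives_;
};

// Two threads fight over the same apiId. Because each send/update pair is
// atomic, the tracked map and the fake sender agree once both finish.
bool HammerStaysInSync(int iterations) {
  CanHandleSketch can;
  std::thread writer([&can, iterations] {
    for (int i = 0; i < iterations; ++i) can.WriteRepeating(0, 10);
  });
  std::thread stopper([&can, iterations] {
    for (int i = 0; i < iterations; ++i) can.StopRepeating(0);
  });
  writer.join();
  stopper.join();
  return can.InSync(0);
}
```

Calling HammerStaysInSync with a few thousand iterations exercises the contended path. Whichever thread wins the last lock determines the final state, but the two records cannot disagree.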

Example:
1. Thread A calls HAL_StopCANPacketRepeating with apiId 0
2. Thread B calls HAL_WriteCANPacketRepeating with apiId 0 and repeatMs 10
3. Inside HAL_StopCANPacketRepeating, Thread A calls HAL_CAN_SendMessage, which updates netcomm's state to not repeat the packet
4. Thread A is paused
5. Inside HAL_WriteCANPacketRepeating, Thread B calls HAL_CAN_SendMessage, which updates netcomm's state to repeat the packet
6. Thread B locks the mutex
7. Thread B updates the map to indicate the new state (packet is repeating)
8. Thread B exits HAL_WriteCANPacketRepeating and unlocks the mutex
9. Thread A resumes
10. Thread A locks the mutex
11. Thread A updates the map with what it thinks the new state is (packet is not repeating)
12. Thread A exits HAL_StopCANPacketRepeating and unlocks the mutex
13. Thread A calls HAL_CleanCAN, which doesn't stop the repeating packet because the state has diverged.
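The interleaving above can be replayed deterministically on a single thread. The sketch below is a simplified model, not HAL code: ReplayState and the function names are hypothetical, and Clean assumes that HAL_CleanCAN only stops packets the map marks as repeating.

```cpp
#include <cassert>

// Minimal model of one apiId. Both fields are hypothetical simplifications.
struct ReplayState {
  bool netcommRepeating = false;  // what the sender thread is actually doing
  bool trackedRepeating = false;  // what the periodicSends map believes
};

// Assumed cleanup behavior: stop only what the map says is repeating.
void Clean(ReplayState& s) {
  if (s.trackedRepeating) {
    s.netcommRepeating = false;
    s.trackedRepeating = false;
  }
}

// Replays the unsynchronized order: both sends happen first, then the map
// updates land in the opposite order. Returns true if the packet is still
// repeating after cleanup.
bool ReplayUnsynchronized() {
  ReplayState s;
  s.netcommRepeating = false;  // Thread A: send "stop"
  s.netcommRepeating = true;   // Thread B: send "repeat every 10 ms"
  s.trackedRepeating = true;   // Thread B: map update under the mutex
  s.trackedRepeating = false;  // Thread A: stale map update under the mutex
  assert(s.netcommRepeating != s.trackedRepeating);  // state has diverged
  Clean(s);
  return s.netcommRepeating;
}

// Replays the locked order: each thread's send and map update are adjacent.
// Returns true if the packet is still repeating after cleanup.
bool ReplayLocked(bool stopRunsFirst) {
  ReplayState s;
  if (stopRunsFirst) {
    s.netcommRepeating = false;  // Thread A: send + map update as one step
    s.trackedRepeating = false;
    s.netcommRepeating = true;   // Thread B: send + map update as one step
    s.trackedRepeating = true;
  } else {
    s.netcommRepeating = true;   // Thread B first
    s.trackedRepeating = true;
    s.netcommRepeating = false;  // Thread A second
    s.trackedRepeating = false;
  }
  assert(s.netcommRepeating == s.trackedRepeating);  // records agree
  Clean(s);
  return s.netcommRepeating;
}
```

In this model ReplayUnsynchronized returns true (the packet leaks past cleanup), while ReplayLocked returns false for either ordering of the two threads.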

rzblue · 2024-07-09T18:52:36Z

hal/src/main/native/athena/CANAPI.cpp

  if (*status != 0) {
    return;
  }


Question for @ThadHouse: will netcomm still update its internal state if it returns an error?

ThadHouse · 2024-07-18

I am unsure. But I do seem to remember a mention from NI that those functions basically can't fail when any of the periodic flags are set.

rzblue · 2024-07-27T07:07:10Z

I've removed the status check on tx functions, making the assumption that even if the function returns a bad status, the periodic state will still have been updated.
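To illustrate why the early return matters, here is a hedged sketch. FakeSender and Tracker are hypothetical, locking is omitted for brevity, and the fake deliberately encodes the assumption stated above: the sender updates its periodic state even when it reports a bad status. Whether netcomm really behaves this way is not confirmed in this thread.

```cpp
#include <cassert>
#include <cstdint>
#include <map>

// Hypothetical sender that applies the periodic update and then reports
// whatever status the caller configured (the assumption under discussion).
struct FakeSender {
  std::map<int32_t, int32_t> repeating;  // apiId -> period in ms
  int32_t nextStatus = 0;

  void Send(int32_t apiId, int32_t periodMs, int32_t* status) {
    assert(status != nullptr);
    if (periodMs > 0) {
      repeating[apiId] = periodMs;
    } else {
      repeating.erase(apiId);
    }
    *status = nextStatus;
  }
};

struct Tracker {
  FakeSender sender;
  std::map<int32_t, int32_t> periodicSends;

  // Earlier shape: bail out on error before recording the new state.
  void WriteRepeatingWithCheck(int32_t apiId, int32_t repeatMs,
                               int32_t* status) {
    sender.Send(apiId, repeatMs, status);
    if (*status != 0) {
      return;  // map is never updated, so it can fall behind the sender
    }
    periodicSends[apiId] = repeatMs;
  }

  // Shape after the change: record the new state regardless of status.
  void WriteRepeatingNoCheck(int32_t apiId, int32_t repeatMs,
                             int32_t* status) {
    sender.Send(apiId, repeatMs, status);
    periodicSends[apiId] = repeatMs;
  }

  bool InSync(int32_t apiId) const {
    return (periodicSends.count(apiId) != 0) ==
           (sender.repeating.count(apiId) != 0);
  }
};
```

Under that assumption, a failing send leaves the checked variant out of sync, while the unchecked variant still records the packet as repeating and the caller still sees the error in status.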

PeterJohnson · 2024-07-28T03:19:08Z

Needs conflicts resolved.

rzblue requested a review from a team as a code owner July 9, 2024 18:51

PeterJohnson requested a review from ThadHouse July 13, 2024 14:53

rzblue requested a review from a team as a code owner July 27, 2024 07:27

rzblue added 4 commits July 28, 2024 14:51

cc08f98 fix potential race in CANAPI
62cea64 iwyu
20fba6b remove status check in tx functions
1a0c448 Update sim too

rzblue force-pushed the canapi-periodic-race branch from 40fc634 to 1a0c448 Compare July 28, 2024 18:52

PeterJohnson approved these changes Jul 29, 2024

PeterJohnson merged commit 8c06ef6 into wpilibsuite:main Jul 29, 2024
36 checks passed

rzblue deleted the canapi-periodic-race branch August 23, 2024 04:45
