Restructure stepper/transporter/state/params #1556

sethrj · 2024-12-30T02:41:17Z

The Stepper should be refactored since it's grown from a single-purpose "do one step" to a general interface:

I don't think it's right that it owns the state, since we have to extract data from it occasionally. We also set params.set_state(stream_id.get(), step_->sp_state()) in accel so we can do data reductions. (In total there are three accessors to get the state.)
It's no longer a function-like object since we have so many other functions attached to it.
The step(primaries) is a little odd.
The stepper owns an action sequence. Although the action times currently are "local", we ought to have one action sequence per Params.
We should have an "end run" called over all states, and "begin run" should be over all states as well. This is necessary for global reductions.
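
As a rough illustration of the first two points, here is a minimal sketch of a function-like stepper that acts on a state it does not own. Track, State, Stepper, and the energy-halving "physics" are hypothetical stand-ins for this discussion, not the existing Celeritas classes.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-ins for illustration; not the Celeritas types.
struct Track
{
    double energy;
};

// The state is owned by the caller, so reductions can read it directly.
struct State
{
    std::vector<Track> tracks;  // active tracks
    std::size_t num_steps{0};
};

// Function-like stepper: holds only immutable configuration.
class Stepper
{
  public:
    explicit Stepper(double cutoff) : cutoff_(cutoff) { assert(cutoff_ > 0); }

    // Take one step over all active tracks; return the number still alive.
    std::size_t operator()(State& state) const
    {
        for (Track& t : state.tracks)
        {
            t.energy *= 0.5;  // toy "physics": lose half the energy per step
        }
        auto dead = std::remove_if(
            state.tracks.begin(), state.tracks.end(), [this](Track const& t) {
                return t.energy < cutoff_;
            });
        state.tracks.erase(dead, state.tracks.end());
        ++state.num_steps;
        return state.tracks.size();
    }

  private:
    double cutoff_;
};
```

With this shape, whoever owns the State decides its lifetime and thread, and a single Stepper can be shared across streams.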

Current construction order relating to actions:

Core params input. Some of the components (e.g. physics) add actions.
Core params, which also adds actions.
During stepper construction, the action sequence is built
Then state, which initializes auxiliary data
Then begin_run is called on each state

Use cases:

Geant4 offloading: LocalTransporter registers a pointer to the state with the global parameters
Celer-sim: Runner creates multiple transporters which are used to step in a parallel openmp loop

sethrj · 2024-12-30T20:30:45Z

Inputs:

Imported data from Geant4
Callback functions to generate input functions: physics processes
Options to construct built-in Celeritas actions (e.g. SD callback, calo, diagnostics)
Callback to generate additional actions, including along-step (if we do an along-step action manager, it should include a mapping of {region, particle} -> along_step; that stuff shouldn't be done in the CoreParams)
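
One possible shape for the {region, particle} -> along_step mapping is sketched below. The class name AlongStepManager, its methods, and the integer ID aliases are assumptions made for illustration; none of them exist in Celeritas under these names as far as this thread states.

```cpp
#include <cassert>
#include <map>
#include <optional>
#include <utility>

// Hypothetical integer IDs; the real code would use its own ID types.
using RegionId = int;
using ParticleId = int;
using ActionId = int;

class AlongStepManager
{
  public:
    // Register an along-step action for a {region, particle} pair.
    // Returns false (and changes nothing) if the pair is already mapped.
    bool insert(RegionId region, ParticleId particle, ActionId action)
    {
        assert(region >= 0 && particle >= 0);
        return map_.emplace(std::make_pair(region, particle), action).second;
    }

    // Look up the along-step action, if one was registered.
    std::optional<ActionId> find(RegionId region, ParticleId particle) const
    {
        auto iter = map_.find(std::make_pair(region, particle));
        if (iter == map_.end())
        {
            return std::nullopt;
        }
        return iter->second;
    }

  private:
    std::map<std::pair<RegionId, ParticleId>, ActionId> map_;
};
```

Keeping the lookup in its own object would let user callbacks register along-step actions without touching CoreParams.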

Parallelism/multithread requirements:

Be able to do parallel (all track slots, streams/CPU threads, MPI processes) operations at beginning, event boundaries (or "batches" of histories in parallel), and end
Match Geant4 allocation requirements: state creation, use, and destruction must be on same thread
(??) In thread-independent model, allow different memory management models so that one shared params can be used by different states

During construction

On the [M]ain thread or in [P]arallel:

BeginOfRunAction (main)

[M] Load import data
[M] Create global objects that don't have any associated actions (geometry, material, particle)
[M] Create core params and incorporate any additional user actions
[M] Create action sequence from core params, which should "finalize" the number of actions and anything action-related
[M] Create state collection(s) with unallocated per-stream states, based on maximum local threads; potentially have CPU and GPU state collections side by side, incorporate knowledge of parallel MPI processes

BeginOfRunAction (worker)

[P] Allocate & construct state from core params/action sequence
[P] (??) State registers itself with state collection or params for parallel operations, or done implicitly as part of construction
[P] State allocates and constructs auxiliary data
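
A minimal sketch of the "unallocated per-stream states" idea follows, assuming one slot per stream that is filled by the worker thread itself. StateCollection, State, and their members are hypothetical names; recording the owning thread is only there to make the Geant4 "same thread" requirement checkable.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <thread>
#include <vector>

// Hypothetical stand-in for a per-stream state; not the Celeritas type.
struct State
{
    std::size_t stream_id;
    std::thread::id owner;  // thread that allocated the state
};

class StateCollection
{
  public:
    // [M] Reserve one empty slot per stream; nothing is allocated yet.
    explicit StateCollection(std::size_t max_streams) : slots_(max_streams) {}

    // [P] Allocate the state on the calling worker thread.
    State& allocate(std::size_t stream_id)
    {
        assert(stream_id < slots_.size() && !slots_[stream_id]);
        slots_[stream_id] = std::make_unique<State>(
            State{stream_id, std::this_thread::get_id()});
        return *slots_[stream_id];
    }

    // [P] Destroy the state; must be called by the thread that allocated it.
    void deallocate(std::size_t stream_id)
    {
        assert(stream_id < slots_.size() && slots_[stream_id]);
        assert(slots_[stream_id]->owner == std::this_thread::get_id());
        slots_[stream_id].reset();
    }

    std::size_t size() const { return slots_.size(); }

    // Count allocated slots; call only while no worker is (de)allocating.
    std::size_t num_allocated() const
    {
        std::size_t count = 0;
        for (auto const& slot : slots_)
        {
            count += (slot ? 1 : 0);
        }
        return count;
    }

  private:
    std::vector<std::unique_ptr<State>> slots_;
};
```

Because each stream writes only its own slot, allocate needs no lock as long as stream IDs are unique; any operation that walks all slots still needs the workers to be quiescent.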

Not currently done in Geant4

[M] Register signal handler on main thread; if called, it sets "abort" flags on all states (note: currently done inside stepper, which is wrong in MT mode)
[M] Call begin_run on all states (not yet implemented)
[P] Warm up (note: this is done in celer-sim)
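
The signal handling item could look roughly like the sketch below: the handler only sets a lock-free atomic flag, and the main thread later copies that flag onto every state. The names (g_interrupted, install_abort_handler, propagate_abort, State::abort) are hypothetical, and SIGINT is just an example signal.

```cpp
#include <atomic>
#include <cassert>
#include <csignal>
#include <vector>

// Set by the handler; a lock-free atomic is safe to touch in a signal handler.
std::atomic<bool> g_interrupted{false};
static_assert(std::atomic<bool>::is_always_lock_free,
              "signal handlers may only use lock-free atomics");

void handle_interrupt(int)
{
    g_interrupted.store(true);
}

// Hypothetical per-stream state with an abort flag read by its worker.
struct State
{
    std::atomic<bool> abort{false};
};

// [M] Register the handler once, on the main thread.
void install_abort_handler()
{
    auto previous = std::signal(SIGINT, handle_interrupt);
    assert(previous != SIG_ERR);
    (void)previous;
}

// [M] If a signal arrived, flag every state; returns whether it did.
bool propagate_abort(std::vector<State>& states)
{
    if (!g_interrupted.load())
    {
        return false;
    }
    for (State& s : states)
    {
        s.abort.store(true);
    }
    return true;
}
```

Each worker would then check its own state's abort flag between steps and kill its active tracks, rather than every stepper installing its own handler.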

Not currently used for anything interesting

celeritas::TrackingManagerOffload via G4VPhysicsConstructor::ConstructProcess via G4RunManagerKernel::InitializePhysics
TrackingManagerOffload::BuildPhysicsTable, TrackingManagerOffload::PreparePhysicsTable

During execution

At runtime we have two different use cases:

Synchronized events/batches across all threads; this would be for dosimetry, reactor applications, optical maps. We could have a single "event" and distribute it across all states on all threads and all processes.
Independent events on each thread; this is for Geant4.

BeginOfEventAction (worker)

Asynchronous events:

[P] Reseed with event ID
[P] Call begin_event on each state independently?

Not currently done

Synchronized event (batch) at beginning:

[P] Distribute primaries/initializers/generators across states
[M] Call begin_event on all states (?)

PreUserTrackingAction or HandOverOneTrack

[P] Push a track onto the stack

EndOfEventAction or FlushEvent

During event/batch, repeat:

[P] Step
[P] Kill active tracks if requested (e.g., user abort)
Don't accumulate counters as part of the stepper: that should be a step action

Synchronized event (batch) at end:

[M] Call end_event on all states

EndOfRunAction (worker)

At end of run:

[M] Call end_run on all states
[P] Deallocate states
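
For reference, the end_run reduction over all states might be as simple as the following sketch. State, StateVector, RunTotals, and the free function end_run are hypothetical names, and the two tallied quantities are only examples.

```cpp
#include <cassert>
#include <cstddef>
#include <numeric>
#include <vector>

// Hypothetical per-stream tallies; not the Celeritas state.
struct State
{
    double energy_deposition{0};
    std::size_t num_steps{0};
};

// One element per stream/CPU thread.
using StateVector = std::vector<State>;

struct RunTotals
{
    double energy_deposition{0};
    std::size_t num_steps{0};
};

// [M] Reduce the per-stream tallies into run totals.
// Call only after all workers have finished stepping.
RunTotals end_run(StateVector const& states)
{
    assert(!states.empty());
    return std::accumulate(
        states.begin(),
        states.end(),
        RunTotals{},
        [](RunTotals acc, State const& s) {
            acc.energy_deposition += s.energy_deposition;
            acc.num_steps += s.num_steps;
            return acc;
        });
}
```

This is a serial reduction on the main thread; an MPI or device-side reduction would replace the std::accumulate call but could keep the same interface.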

EndOfRunAction (main)

Free Celeritas objects and memory

TODO:

Refactor streams so that each state holds a Stream object rather than having to redirect into the Device object and allocate those streams there
Eliminate max_streams once we get rid of the stream store.
State counter diagnostic should be part of an action rather than hardcoded into the stepper
Initializer/generator interface should let us accumulate the number of expected primaries (?)
Along-step manager
User-supplied callbacks to generate additional actions; core params setup/initialization should be more consolidated

sethrj · 2024-12-30T21:28:58Z

I'd be grateful for input on this discussion from a parallelism standpoint (@amandalund) and Geant4 mechanics standpoint (@drbenmorgan). I'd like to have a structure that is compatible with all use cases and targeted at the Geant4 use case. The main questions I have are:

Is it too restrictive to have a StateVector where each element corresponds to a stream/CPU thread? That would make it really easy to pass into "parallel reduce" methods at the beginning/end of run to (e.g.) sum energy deposition across threads, or action times, and output them.
Where should I put begin_run so that it's after all threads have been allocated? Maybe just on the last thread to call BeginOfRunAction? (But I assume one thread could call "begin event" before another calls "begin of run"... and we don't want to force a synchronization.) Or do we just try to eliminate begin-of-run action? (Currently its primary use is for "lazy" initialization of params that depend on the number of actions. If we "hardcode" the use of such objects so that they're added after user actions, we could get away with this.)
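
To make the "last thread to call BeginOfRunAction" option concrete, here is a minimal sketch using an atomic arrival counter. ArrivalCounter is a hypothetical helper, not something in Celeritas or Geant4. It does not block anyone, so it does not address the concern above that another thread may already have reached "begin event".

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>

// Hypothetical helper: lets exactly one stream (the last to arrive) run
// a global begin_run without forcing the other streams to wait.
class ArrivalCounter
{
  public:
    explicit ArrivalCounter(std::size_t num_streams) : expected_(num_streams)
    {
        assert(expected_ > 0);
    }

    // Called once per stream; returns true only for the last arrival.
    bool arrive()
    {
        std::size_t count
            = arrived_.fetch_add(1, std::memory_order_acq_rel) + 1;
        assert(count <= expected_);
        return count == expected_;
    }

  private:
    std::size_t expected_;
    std::atomic<std::size_t> arrived_{0};
};
```

Each worker would call arrive() at the end of its BeginOfRunAction, and the one that gets true would call begin_run over the whole state vector.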

sethrj added the "physics" (particles, processes, and stepping algorithms) and "minor" (minor internal changes or fixes) labels on Dec 30, 2024

sethrj mentioned this issue Dec 30, 2024

Fix accel examples and related CI issues #1557

Merged

sethrj mentioned this issue Jan 6, 2025

Add global Celeritas input definition #1562

Open
