Refactoring and support for QD #65

sk1p · 2024-05-16T11:51:42Z

The idea is to extract the common parts into a shared crate, have common traits etc. and then implement QD Merlin as a proof of concept

Fixes #61

Doesn't build yet, very WIP

* generic background thread trait: works * generic connection types: works * generic cam client types: WIP

* Fix stats: properly count the last frame stack * Fix `impl_py_connection` close implementation to actually `Option::take` the connection implementation and call `close` on it * Add a lot of debug logging to make it easier to diagnose issues

* Fix off-by-one in decoding function * Implement dispatch by dtype in decoding logic (`Complex{32,64}` still missing) * Python interface for decoding parts of a stack

* Extract helper `try_cast_if_safe` * Add support for explicitly free'ing empty stack frames * Implement methods for (unsafe) zero-copy data access * Simplify: move `GetStats` impl to common `frame_stack` module * Add `_Py*Connection::get_meta` for access to full `FrameMeta` vector * Use generic connection and background thread infrastructure for ASI MPX3 * libertem_dectris: rename misleading `common` modules to `base_types`

…tion warnings

* Extract `num_from_byte_slice` helper * Build up basic infrastructure and types * Scaffolding for connection, decoder, cam client etc. * `background_thread`: work on header parsing, reading and peeking

* Extract `three_way_shift` helper * Add `SharedSlabAllocator.try_get_mut` which has a `Result` instead of an `Option`, so we can easily convert it into other error types, instead of having to `ok_or(..)` or similar. * Fix peek: don't retry infinitely, as it's possible the requested buffer size is larger than the underlying socket buffer, in which case we will never peek enough! Some retrying in case there is really not enough data yet * Parses for the line protocol: MPX prefix, acquisition header, frame header * `recv_frame` that implements logic for receiving into either a primary stack frame or a spare (can be extracted into a generic function in the future! very similar for other impls) * `acquisition` function: receive all the frames * Properly send `ReceiverMsg::ReceiverArmed` when we have connected (which is what `start_passive` is waiting for)

* Intruduce `DecoderTargetPixelType` trait and implement that for all the common int/float formats * Add some more test cases for frame/acquisition header parsing * Fix nasty off-by-one for some test cases * Start implementing decoding * u1/u8/u16/u32/u64 for single and quad * r1/r6/r12 for single, raw formats are still missing for quad * Decoders are still to be validated!

* Add a generic `RawType` trait and `R1`, `R6`, `R12` implementations * Fix decoding of interleaved quad raw format * One generic impl that uses the underlying `decode_chunk`/`decode_all` functions on the `RawType` trait * Add hacky Python interface for decoders for integration testing; not for production use (later: for offline decoding? needs better interface though...) * Make `QdFrameMeta` constructible from parts (only for testing) * Add tests for `decode_ints_be` and `try_cast_if_safe` helpers * Add another helper: `try_cast_primitive`; useful for decoders etc.

* Check input/output buffer sizes when decoding integer formatted frames * `start_passive` * short-circuit if we are already Armed * drop the GIL while waiting for the message from the background thread * Allow to customize the timeout for waiting for the status change; this is useful for example in case we know that the bg thread first has to drain stuff * Implement draining (mostly for backwards-compat.) * Add numerous logging messages * `QdFrameMeta::parse_bytes`: ignore additional data after the header * Add `QdAcquisitionHeader::frames_in_acquisition` Python method

Default to the old `RecoveryStrategy::ImmediateReconnect`, but allow to switch to `RecoveryStrategy::DrainThenReconnect`. Need to experiment if the drain strategy needs to become the default.

On the slow Mac OS 12 workers, there were some spurious timeouts. Let's see if this is enough.

The test is based on some assumptions, but this should cover the non-raw use-case.

Right now, we only support the "pass through the raw data" option, meaning we don't zero out the border pixels. But we do insert a two-pixel gap between the sensors, meaning `Layout::L2x2G` results in a 514x514 output.

In quad raw mode, we need to somehow map the input size (1024x256) to a sensible size, which depends on the layout and thus also the gap mode. This is implemented here for the normal quad setup; the EELS layout might need a fix in the future, too.

* All of them are now >= 1GiB/s on my system w/ AVX2 enabled, most of them >= 6GiB/s, greatly increasing efficiency and freeing up resources for actual work * Instead of using a reversible iterator for output, decode into temporary array and reverse that * Provide const length guarantees for input and output of the `decode_chunk` function * Provide more guarantees to the generic `Decoder` impls (like being able to convert using `_ as OutputType` from `u8` and `u16`) * More comprehensive benchmarks

AVX2 looks to be most important; this also adds a more complete list of Zen2-ish features as one version.

Directly use the macro generated type instead.

This is a combination of the acquisition header and the first frame header, meaning we can accurately get the detector shape Still exporting the `QdAcquisitionHeader` to Python, which can be used to parse from raw bytes

When calling `wait_for_arm`, we must first cancel any already running acquisitions, wait for the system to be idle, and then can arm it again.

This reverts commit 2e3d116.

sk1p added 5 commits April 30, 2024 18:03

Add initial libertem_qd_mpx crate

a672c02

Add libertem_qd_mpx to workspace and fix pyo3 version

815d8f2

Extract low-level frame-stack stuff into common crate

4eb49da

WIP: start to refactor chunked iterator

7bbe60c

WIP: generic frame iterator/receiver infra

812cd46

Doesn't build yet, very WIP

sk1p added the enhancement New feature or request label May 16, 2024

sk1p added 24 commits June 7, 2024 20:57

WIP: extract GenericConnection thingy

78fd50c

WIP

f17bb82

* generic background thread trait: works * generic connection types: works * generic cam client types: WIP

WIP

fb65e5e

WIP

511c9ea

WIP: dectris impl almost done

a9484a5

Misc. fixes for correctness and completeness

c51b5c4

* Fix off-by-one in decoding function * Implement dispatch by dtype in decoding logic (`Complex{32,64}` still missing) * Python interface for decoding parts of a stack

WIP: PyDeprecationWarning, fix tests etc.

4c5340b

libertem_asi_mpx3: improved error handling and logging

93a7798

Properly expose functionality on ServalConnection; fix some depreca…

2d5bf78

…tion warnings

WIP qd support

58240b8

* Extract `num_from_byte_slice` helper * Build up basic infrastructure and types * Scaffolding for connection, decoder, cam client etc. * `background_thread`: work on header parsing, reading and peeking

Properly drop the GIL in wait_for_arm

88fdea1

Extract tcp helpers and use interruptible reads in asi_mpx

83c2303

Encoders, tests and benches

c4369a6

2x2 raw encoder and roundtrip tests

cbab024

Add benchmarks for quad encoding/decoding

c514587

Support for timeout=None in wait_for_arm and start_passive

fa255f1

Export QdAcquisitionHeader class

c814f6d

Add libertem_qd_mpx and common crates to CI pipeline; fix licensing

99c0ba3

sk1p added 5 commits July 23, 2024 16:27

Add a RecoveryStrategy

0f820ee

Default to the old `RecoveryStrategy::ImmediateReconnect`, but allow to switch to `RecoveryStrategy::DrainThenReconnect`. Need to experiment if the drain strategy needs to become the default.

wait_for_arm: properly retry without timeout

6bfe1fb

Add convenience nav_shape method to QdAcquisitionHeader

3759fe0

Increase timeouts in test cases

6c6bf9a

On the slow Mac OS 12 workers, there were some spurious timeouts. Let's see if this is enough.

Support for non-raw Nx1 layout

67c1662

The test is based on some assumptions, but this should cover the non-raw use-case.

sk1p mentioned this pull request Jul 24, 2024

Data backend updates LiberTEM/LiberTEM-live#161

Merged

9 tasks

sk1p added 22 commits July 24, 2024 17:18

Enable back traces when running tests

d81e476

Try again: set RUST_BACKTRACE=1 in CI

5d2b484

Use DrainThenReconnect method for test_confused_by_diff_length

db8edca

Remove some unused code and fix warnings

5ceade8

Fix numerous clippy warnings

05ef6b5

Fix more clippy warnings

3cbdcec

Notes on the generic infrastructure

80f2798

Implement quad raw w/ gap support for 2x2 layout

165da72

Right now, we only support the "pass through the raw data" option, meaning we don't zero out the border pixels. But we do insert a two-pixel gap between the sensors, meaning `Layout::L2x2G` results in a 514x514 output.

Increase timeout in slightly flaky test case

edcbe7e

Fix frame size in quad raw mode

8a288c2

In quad raw mode, we need to somehow map the input size (1024x256) to a sensible size, which depends on the layout and thus also the gap mode. This is implemented here for the normal quad setup; the EELS layout might need a fix in the future, too.

More benchmarks; measure input throughput

1e68374

Use multiversion to accelerate decoding in production

5529b49

AVX2 looks to be most important; this also adds a more complete list of Zen2-ish features as one version.

Remove unnecessary QdFrameStack wrapper

bb449bf

Directly use the macro generated type instead.

Add QdAcquisitionConfig

b1c275c

This is a combination of the acquisition header and the first frame header, meaning we can accurately get the detector shape Still exporting the `QdAcquisitionHeader` to Python, which can be used to parse from raw bytes

qd: log the first frame header

da69578

Add explicit cancellation

58d8e3a

When calling `wait_for_arm`, we must first cancel any already running acquisitions, wait for the system to be idle, and then can arm it again.

Move the wait-for-idle code into GenericConnection::cancel

c594d09

Bump versions and update READMEs

26f89e0

Build libertem_asi_mpx3 in CI

2e3d116

Revert "Build libertem_asi_mpx3 in CI"

1e47630

This reverts commit 2e3d116.

Tweak changelog (libertem_asi_mpx3 is not ready for release yet)

fb1a1e8

sk1p merged commit 0f1683b into LiberTEM:main Sep 17, 2024
33 checks passed

sk1p deleted the qd-merlin branch September 17, 2024 15:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactoring and support for QD #65

Refactoring and support for QD #65

sk1p commented May 16, 2024

Refactoring and support for QD #65

Refactoring and support for QD #65

Conversation

sk1p commented May 16, 2024