
Float timeseries #98

Merged
merged 15 commits into from
May 10, 2024

Conversation

KaraMelih
Collaborator

@KaraMelih KaraMelih commented Apr 8, 2024

Aimed at fixing #87.
Also fixes #97 (moved this fix to #99).

Previously, we expected ISO-formattable time strings for all entries in timing_series.

However, this list can easily become too large. As suggested by others, this PR implements a float conversion.

It first checks if all the values are strings or floats/integers.

  • If they are strings, it converts them to numpy datetime objects, sorts them by value, and computes the relative time deltas from the first value in the list.
  • If they are floats/integers, it also checks whether the "neutrino_time" is explicitly given.
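
The string branch described above can be sketched roughly as follows. This is a minimal illustration with a hypothetical helper name (to_relative_ns), not the PR's actual code; it only assumes numpy is available:

```python
import numpy as np

def to_relative_ns(timing_series):
    """Parse ISO time strings, sort them, and return nanosecond
    offsets relative to the earliest entry (which becomes 0)."""
    times = np.sort(np.array(timing_series, dtype="datetime64[ns]"))
    return (times - times[0]).astype(int).tolist()

# The first value is always 0 after sorting; later entries are ns deltas.
print(to_relative_ns(["2024-01-01T12:00:00.000000001",
                      "2024-01-01T12:00:00.000000000"]))
```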

Right now, this part of the logic is missing. I do not know whether we should always require the initial neutrino time as well, or whether this should be fetched from the Coincidence Tier by Time Tier users.

I imagine a time series starting at 0 is not very useful if we do not know what the 0 refers to.

@KaraMelih
Collaborator Author

Reverted the firedrill fix (ee7f062), as I added that in a minor patch

@KaraMelih
Collaborator Author

@sybenzvi @mcolomermolla

I want to ask about the expected information in the timing tier. Currently, this PR allows for relative-to-first neutrino times with ns precision. Under this logic, the first value is always zero and the rest are relative to it.

In this case, do we also want to require the initial "neutrino_time" as a string (corresponding to the zeroth time), or do we expect that people looking at the TimeTier will fetch this from the CoincidenceTier?

Right now, if you pass

neutrino_time = "2024-01-01T12:00:00.00000000"
timing_series = [0, 119781135000, 119881135000, 179781135000, 248890124000]

the neutrino_times will be parsed and sent under CoincidenceTier message (timing series ignored) and

the timing_series will be parsed and sent under TimingTier message (neutrino time ignored).

Is this what we want, or do we want to also always see the neutrino_time in the timing tier too?

@KaraMelih KaraMelih marked this pull request as ready for review May 7, 2024 13:40
@KaraMelih KaraMelih requested a review from sybenzvi May 7, 2024 13:41
@KaraMelih
Collaborator Author

I added 'neutrino_time' as a required field for the time tier messages.

Now, it also allows either a list of strings with individual neutrino times, or a list of integers indicating the relative times from the initial neutrino time with nanosecond precision.

In the former case, it computes the relative times from the initial neutrino time itself and creates a list of relative times. So the submitted message always contains:

  • 'neutrino_time' : string, time of the first neutrino event
  • 'timing_series' : list of integers, nanosecond precision time differences from the initial neutrino time
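
Both accepted input forms normalizing to that shape can be sketched like this. This is a rough illustration with a hypothetical helper name (normalize_timing), not the snews_pt implementation:

```python
import numpy as np

def normalize_timing(neutrino_time, timing_series):
    """Normalize both accepted forms of timing_series to nanosecond
    offsets from the initial neutrino_time."""
    if all(isinstance(t, str) for t in timing_series):
        # Strings: parse and subtract the initial neutrino time.
        t0 = np.datetime64(neutrino_time, "ns")
        times = np.array(timing_series, dtype="datetime64[ns]")
        offsets = (times - t0).astype(int).tolist()
    else:
        # Numbers: taken to already be relative offsets in nanoseconds.
        offsets = [int(t) for t in timing_series]
    return {"neutrino_time": neutrino_time, "timing_series": offsets}

msg = normalize_timing("2024-01-01T12:00:00.000000000",
                       ["2024-01-01T12:00:00.000000000",
                        "2024-01-01T12:00:00.000000100"])
```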

@Storreslara Storreslara merged commit 4da6619 into main May 10, 2024
2 checks passed
@KaraMelih KaraMelih deleted the float_timeseries branch May 10, 2024 12:34
@habig
Contributor

habig commented May 10, 2024

Have we been able to reproduce #87? If there's an unexpected hard limit on message size, we need to take that to scimma as a bug, since the original specs were "sure, you guys can move anything around, including big skymaps".

@KaraMelih
Collaborator Author

I just tried, and this fails:

import numpy as np
numbers = np.linspace(1e9, 9e9, 100000).astype(int)

tims = SNEWSMessageBuilder(detector_name='XENONnT',
                           neutrino_time='2012-06-09T15:31:08.109876',
                           timing_series=numbers.tolist(),
                           machine_time='2012-06-09T15:30:00.009876',
                           firedrill_mode=False, is_test=True)
tims.send_messages()

Here is the error that it raises:

KafkaException: KafkaError{code=MSG_SIZE_TOO_LARGE,val=10,str="Unable to produce message: Broker: Message size too large"}

It was fine with 10k integers; it crashes at 100k.

@habig
Contributor

habig commented May 10, 2024

Great, thanks. If we convert that to bytes (since we're sending in ASCII, one number is a lot of characters), what does that look like? I've got the attention of the scimma devs at the moment.
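
A quick back-of-envelope for that question, using the failing 100k-entry series from the example above (this assumes the payload is JSON-encoded text, which is an assumption about the wire format):

```python
import json
import numpy as np

# The 100k-entry series that triggered MSG_SIZE_TOO_LARGE above.
numbers = np.linspace(1e9, 9e9, 100_000).astype(int)

# As ASCII/JSON: each 10-digit number plus separators costs ~12 bytes,
# so the list alone lands around 1.2 MB.
ascii_size = len(json.dumps(numbers.tolist()).encode())

# As packed binary: a fixed 8 bytes per int64, i.e. exactly 800 kB.
binary_size = len(numbers.astype(np.int64).tobytes())

print(ascii_size, binary_size)
```

So the ASCII encoding alone costs roughly 50% more than a packed-int64 representation for this series, before any broker or header overhead.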

@habig
Contributor

habig commented May 10, 2024

They're not opposed to discussing it on Slack, but suggested their ticketing system: https://support.scimma.org/

@justinvasel
Contributor

See #87 (comment) for a potential solution.

@habig
Contributor

habig commented May 13, 2024

The scimma people report that the default max message size is 1 MB. They're leery of increasing it, to head off future scaling problems, and are working on a large-file offload service. But now that we know this, we can plan for it, with efficiencies like this PR helping. We could imagine daisy-chaining things together like SMS messages that go over 140 characters, too.
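
The SMS-style daisy-chaining idea could look something like this. A minimal sketch with a hypothetical helper (chunk_timing_series) and an assumed part/of envelope; snews_pt has no such mechanism today:

```python
def chunk_timing_series(series, max_len=10_000):
    """Split a long timing_series into consecutive chunks, tagging each
    with its position so the receiver can reassemble them in order."""
    chunks = [series[i:i + max_len] for i in range(0, len(series), max_len)]
    return [{"part": i + 1, "of": len(chunks), "timing_series": chunk}
            for i, chunk in enumerate(chunks)]

# 25 entries with max_len=10 -> three parts of sizes 10, 10, 5.
parts = chunk_timing_series(list(range(25)), max_len=10)
```

The max_len here is a placeholder; a real cutoff would be derived from the broker's 1 MB limit and the serialized size per entry.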

Successfully merging this pull request may close these issues.

Corrections on the CLI
4 participants