-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Write out simulation truths to jsonl files #70
Comments
The simulation truth of CORSIKA is in separate files which have to contain at least the headers of the CORSIKA output. This was done on purpose to avoid mental-mapping and not remove any simulation-truth only it is not used at the moment. import photon_stream as ps
import pandas as pd
sim_reader = ps.SimulationReader(
photon_stream_path='tests/resources/011014.phs.jsonl.gz',
mmcs_corsika_path='tests/resources/011014.ch'
)
for event in sim_reader:
# process event ...
# extract Hillas and other features ....
# do deep learning ...
pass
thrown_events = pd.DataFrame(sim_reader.thrown_events()) There is a reader which merges this information: reader = photon_stream.SimulationReader(photon_stream_path, mmcs_corsika_path) |
I know this! I think it is a huge advantage, to not have to give people the corsika files so they can do event reconstruction. That's the point.
A little mental mapping for us is much less work than explaining every new bachelor student what Corsika is, why there are these strange other files are, why things are called |
The CORSIKA files are tiny when the photon-blocks are removed, as I did it for the simulation sample here https://ihp-pc41.ethz.ch/public/phs/sim/. So I do not see a problem to give the CORSIKA files to the users. |
No, I just went through an hour explaining someone why these files exist, what How is that easier than just providing 5 additional numbers in the jsonl???? |
It is not easier, and yes I know that it takes hours. But solving this is beyond the scope of the photon-stream. For a specific task, five numbers with key-names known by two or three people are fine. But in general we want to have the full simulation truth. We do not know what piece of the simulation truth might be relevant for the user. The user might know the CORSIKA manual. For our stuff, there does not even exist a manual. |
I agree. Providing some method to perform the transformation might be helpful. Or even do it 'on-the-fly' as you mention. |
see issue #71 for the discussion on the different reference-frames |
I whole-heartedly disagree. Having energy, pointing direction and source direction directly at hand and in a well defined coordinate system is such huge usability boost that we shouldn't say
That's insane. |
Can we compromise that we agree to find ways to provide all pointing in one 'well defined' reference-frame, but that we will not put it into the 'phs' files? Can we decouple the reference-frame issue from the format-issue? |
Important simulation truths is missing in the output
at least these 3 are needed to use the files for training of an energy estimator or source reconstruction algorithm
The text was updated successfully, but these errors were encountered: