Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add db view unpacking #1321

Open
wants to merge 1 commit into
base: master
Choose a base branch
from
Open

add db view unpacking #1321

wants to merge 1 commit into from

Conversation

jstucke
Copy link
Collaborator

@jstucke jstucke commented Dec 16, 2024

No description provided.

@jstucke jstucke self-assigned this Dec 16, 2024
@codecov-commenter
Copy link

codecov-commenter commented Dec 16, 2024

Codecov Report

Attention: Patch coverage is 84.84848% with 5 lines in your changes missing coverage. Please review.

Project coverage is 91.91%. Comparing base (058e49f) to head (0b7cb80).
Report is 10 commits behind head on master.

Files with missing lines Patch % Lines
src/storage/graphql/hasura/init_hasura.py 72.72% 3 Missing ⚠️
src/init_postgres.py 0.00% 2 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##           master    #1321      +/-   ##
==========================================
- Coverage   92.42%   91.91%   -0.52%     
==========================================
  Files         379      378       -1     
  Lines       23661    21126    -2535     
==========================================
- Hits        21869    19417    -2452     
+ Misses       1792     1709      -83     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@jstucke jstucke marked this pull request as ready for review December 19, 2024 14:52
@maringuu
Copy link
Collaborator

What is this used for?

@jstucke
Copy link
Collaborator Author

jstucke commented Jan 16, 2025

What is this used for?

I should probably have included a description 😅
I only talked with @dorpvom about this. The problem this PR addresses is the following: The metadata from unpacking (entropy, extracted file count and other stuff that lands in fo.processed_analysis["unpacking"]) cannot be used together with numerical comparison operators (like "greather than" or "less equal than"). The reason for this is simple: GraphQL does not support this on JSON data. This is the simplest workaround for using this data without creating more tables/etc.: Creating a "view" for this data, which is a PostgreSQL construct, that allows using the data that is stored in a JSONB column as if it was stored as a number in a regular table in queries (and also GraphQL).

Note: in PostgreSQL we can absolutely formulate such a query, only GraphQL (or more specifically the default schema created by Hasura) is the problem

What can we do with this? We can use this to e.g. run queries like "find all files with an entropy greater or equal 0.8 from which no files were unpacked".

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants