Improve querying #264

olejandro · 2024-12-26T22:31:16Z

This PR allows other fields (in addition to process and commodity) to be passed as lists to query. It also allows column-wise control of which dataframe entries should be exploded if a comma is found.

siddharth-krishna

Thanks, Olex, this looks good.

I'm wondering if we need to remove the old match_uc_wildcards test though? It's common practice to test a function by comparing its results to a slower-but-simpler implementation, and I think we don't have unit tests that test the new code on a series of small examples, do we? So why not leave it in.

(Ideally, eventually, we would have unit tests for more transforms using small examples.)

siddharth-krishna · 2024-12-28T08:37:16Z

Wow, this PR also improves runtime by around 50%! Do you have any idea what is responsible for the speedup? 🤩

olejandro · 2024-12-28T12:29:42Z

Wow, this PR also improves runtime by around 50%! Do you have any idea what is responsible for the speedup? 🤩

Good question! TIMES-NZ runs are responsible for the reported wild improment. It's runtimes have varied a lot before. As far as I can see from the log, the time it takes to run convert_to_string transform went down significantly with the changes in this PR. Let me check what changed for it...

olejandro · 2024-12-28T12:36:42Z

I'm wondering if we need to remove the old match_uc_wildcards test though? It's common practice to test a function by comparing its results to a slower-but-simpler implementation, and I think we don't have unit tests that test the new code on a series of small examples, do we? So why not leave it in.

The changes in this PR brake the test, because some inputs change. Since we have had the faster version running for a while now, without ever having any issues with it, I thought it was okay just to delete the test. I could try to fix it instead, if you think it is worth keeping it?

olejandro · 2024-12-28T13:05:50Z

Wow, this PR also improves runtime by around 50%! Do you have any idea what is responsible for the speedup? 🤩

Good question! TIMES-NZ runs are responsible for the reported wild improment. It's runtimes have varied a lot before. As far as I can see from the log, the time it takes to run convert_to_string transform went down significantly with the changes in this PR. Let me check what changed for it...

I've timed it: convert_to_string take less time on the mig table now. It is probably because the table has fewer rows, since we keep previously exploded entries as lists for querying.

siddharth-krishna · 2024-12-28T16:51:34Z

Thanks for looking into it! And no, not worth spending too much time on the test. It might be easier to write unit tests once we've converted many transforms into methods of the TimesModel class, perhaps.

olejandro added 10 commits December 26, 2024 10:17

Specify types in Tag.has_tag

5ed2b44

Include tmf_ava_c in Tag

9aed6b9

Change the type of has_tag inputs

54ee75c

Refactor

e5085a7

Generate model.topology earlier

8f458d5

Remove _match_uc_wildcards_old and adjust the test

5479981

Mostly fix topology

b24b6bb

Remove debug print

b90eda7

Adjust constraints

71eb4c9

Identify timeslice as comma-separated-list for some tags

71af09b

olejandro marked this pull request as ready for review December 27, 2024 05:20

olejandro requested a review from siddharth-krishna December 27, 2024 05:27

siddharth-krishna approved these changes Dec 28, 2024

View reviewed changes

Merge branch 'main' into olex/improve-querying

f7f09bf

olejandro merged commit 5302aa5 into main Dec 28, 2024
2 checks passed

olejandro deleted the olex/improve-querying branch December 28, 2024 18:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve querying #264

Improve querying #264

olejandro commented Dec 26, 2024 •

edited

Loading

siddharth-krishna left a comment

siddharth-krishna commented Dec 28, 2024

olejandro commented Dec 28, 2024

olejandro commented Dec 28, 2024

olejandro commented Dec 28, 2024

siddharth-krishna commented Dec 28, 2024

Improve querying #264

Improve querying #264

Conversation

olejandro commented Dec 26, 2024 • edited Loading

siddharth-krishna left a comment

Choose a reason for hiding this comment

siddharth-krishna commented Dec 28, 2024

olejandro commented Dec 28, 2024

olejandro commented Dec 28, 2024

olejandro commented Dec 28, 2024

siddharth-krishna commented Dec 28, 2024

olejandro commented Dec 26, 2024 •

edited

Loading