add scripting tasks config vignette #86

zkamvar · 2024-11-25T23:12:48Z

I've moved the scripting tasks configuration chapter of the user guide to here because as @annakrystalli correctly pointed out, it makes much more sense to link to this vignette where we can dynamically execute code.

That being said, I migrated this over and made the following changes:

removed the screenshots and mostly redundant text
added an overview of the structure of a tasks.json file (this is a variant of the cake-first approach)
used the development version of pkgdown to incorporate Retain forward slashes in HTML img src paths r-lib/pkgdown#2811
updated the pkgdown workflow to get the dev preview
added a narrative throughout
added a validation step

I had some challenges when writing this because the original script did not actually produce a valid configuration file (there were duplicate round IDs). There were also some minor issues that didn't really make sense once you put a narrative to it.

Specifically, why would we have two modeling tasks that have identical task IDs but share only a "mean" output type?

I was able to use the extra round to highlight the need for validation after write, but it would be good to get a modeler's eyes on this.

I'm also pretty sure I got the structure of the tasks.json wrong, but I am le tired. To do so, I created a list of names and used lobstr::tree() to create the tree diagram.

github-actions · 2024-11-25T23:14:53Z

🚀 Deployed on https://676b177219894b4a167b8ec5--hubadmin-pr-previews.netlify.app

annakrystalli

Overall a great start with lots of useful demos and context. I've made quite a few suggestions in that name of clarifying a few things and I'd especially like to see the comments contained in code blocks moved to narrative text.

One really useful topic that hasn't been discussed is that the functions default to the latest schema version and demonstration of the use of hubAdmin.schema_version option to override this behaviour and protect workflows from breaking on new schema releases. I think this vignette would be an excellent place to discuss this.

vignettes/articles/scripting-tasks-config.Rmd

codecov · 2024-12-10T23:29:46Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 88.88%. Comparing base (b809161) to head (f5ae2b1).
Report is 51 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #86      +/-   ##
==========================================
- Coverage   89.24%   88.88%   -0.36%     
==========================================
  Files          29       29              
  Lines        2185     2322     +137     
==========================================
+ Hits         1950     2064     +114     
- Misses        235      258      +23

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

I want to include images and r-lib/pkgdown#2811 is standing in my way until it is released

The original tutorial was a bit lacking and I wanted to take the opportunity to give more context, show intermediate steps, and show errors

Co-authored-by: Anna Krystalli <[email protected]>

zkamvar · 2024-12-11T16:40:53Z

a note about the force push: I rebased this branch onto the main branch.

A weird bug just popped up and I don't know where it's coming from, but effectively, when a value with a class of "error" is passed to `encodeString()`, it fails with: Error in UseMethod("conditionMessage") : no applicable method for 'conditionMessage' applied to an object of class "error" This never used to happen, so I'm fixing it here.

vignettes/articles/scripting-tasks-config.Rmd

tests/testthat/test-validate_model_metadata_schema.R

annakrystalli

Thanks for responding to all the previous comments! I have a couple of minor additional comments/suggestions.

However, the more I thought about the example of adding a second round, the more I felt we need to modify it. I think the second round is both a bit unrealistic but more importantly probably bad practice to demonstrate. While it does pass validation and is technically allowed, we are repeating round IDs in the second round that exist in the first but just getting round this by setting round_id_from_variable: false. This doesn't feel like something we want to encourage in our docs.

Instead why don't we create a whole new round with new values in origin date but largely reusing previously created objects and code and a different submission window (still relative to origin date) due to...(some reason, any good ideas?). Then we could append the round via append_round(). I know this will add a lot of repeated code to the vignette but I think this would still be preferable because:

it avoids demonstrating hacky practice
might be more realistic
demonstrates the value of re-using previous code and objects created
demonstrates the use of append_round()

For a validation error, you could leave a round_id that already exists in the previous round's origin_date, catch it with the validation and correct it.

vignettes/articles/scripting-tasks-config.Rmd

Co-authored-by: Anna Krystalli <[email protected]>

in reference to https://github.com/hubverse-org/hubAdmin/pull/86/files#r1889979644

zkamvar · 2024-12-24T15:40:42Z

However, the more I thought about the example of adding a second round, the more I felt we need to modify it. I think the second round is both a bit unrealistic but more importantly probably bad practice to demonstrate. While it does pass validation and is technically allowed, we are repeating round IDs in the second round that exist in the first but just getting round this by setting round_id_from_variable: false. This doesn't feel like something we want to encourage in our docs.

Trying to work around duplicated round IDs was not my intention, but now that you mention it, I can see why it seems that way. My intention was to demonstrate a stand-alone round (hence round_id_from_variable: false).

I think your suggestions make sense, but instead of adding another round of review, I am going to cut off that last section and open an issue to amend the vignette. This vignette will replace the scripting tasks configuration chapter from the hubverse documentation, which currently will produce an invalid configuration file. The vignette without that last section is still not perfect, but it's better than what we have at the moment.

This section had brought up concerns [1] about the kind of scripting behaviour this was encouraging. I have decided to remove it to move this forward [2]. To incorporate a section in here, the narrative needs to be changed so that it has a clear goal that a hub administrator can identify with. [1]: #86 (review) [2]: #86 (comment)

zkamvar · 2024-12-24T16:10:10Z

Note: there is a small bug in pillar that's causing weird output: r-lib/pillar#720, specifically, the section demonstrating an error in creation of a target metadata item has this error from pillar.

#> Error in get(paste0(generic, ".", class), envir = get_method_env()) : 
#>   object 'type_sum.accel' not found

Given that the CRAN team are on break, it may be mid-January until it's fixed.

vignettes/articles/scripting-tasks-config.Rmd

elray1

This is a great thing to have, and overall very clear! I made a few fairly minor comments throughout.

Co-authored-by: Evan Ray <[email protected]>

elray1

this looks good enough to merge in to me, with plans to revisit the example soon.

See #86 (comment).

I have addressed the minor issues with this, but the narrative issues require a deeper rewrite. Since the goal of this was to get a "good enough" replacement for the currently broken scripting tasks config chapter, I am going to push this through and open an issue with the concerns that everyone (myself included) raised about the narrative structure (or lack thereof).

zkamvar marked this pull request as ready for review December 6, 2024 20:41

annakrystalli requested changes Dec 10, 2024

View reviewed changes

zkamvar and others added 13 commits December 11, 2024 08:38

migrate scripting config tutorial to here

53b1396

use dev version of pkgdown

23f72ec

I want to include images and r-lib/pkgdown#2811 is standing in my way until it is released

halfway through first pass

3ac39d1

The original tutorial was a bit lacking and I wanted to take the opportunity to give more context, show intermediate steps, and show errors

dangit linter

7fc2cd6

finalize vignette

1c5114a

fix loooooong comments

e3871a0

update pkgdown workflow

f29a854

Apply suggestions from code review

9614b2c

Co-authored-by: Anna Krystalli <[email protected]>

minor typo fixes

7eb711b

remove comments

cf8e27a

add todo

c061b97

remove second paragraph

e7b7fb7

Update vignettes/articles/scripting-tasks-config.Rmd

90f6018

Co-authored-by: Anna Krystalli <[email protected]>

zkamvar force-pushed the znk/task-config-vignette branch from de2c750 to 90f6018 Compare December 11, 2024 16:38

zkamvar added 3 commits December 11, 2024 09:05

add version option; mention output type id datatype arg

cf519c0

no need for new pkgdown

1f810c2

zkamvar commented Dec 11, 2024

View reviewed changes

vignettes/articles/scripting-tasks-config.Rmd Outdated Show resolved Hide resolved

Apply suggestions from code review

03ff6a9

zkamvar requested a review from annakrystalli December 11, 2024 18:53

zkamvar commented Dec 12, 2024

View reviewed changes

tests/testthat/test-validate_model_metadata_schema.R Show resolved Hide resolved

annakrystalli previously requested changes Dec 18, 2024

View reviewed changes

This was referenced Dec 18, 2024

Only allow a single target_keys key value pair #90

Merged

Add round_id expected pattern match check #88

Merged

Apply suggestions from code review

1c4c8bc

Co-authored-by: Anna Krystalli <[email protected]>

zkamvar and others added 4 commits December 23, 2024 15:27

add notes to the unclassed tests

b78bc89

clarify structure of the round objects.

0ebd33d

in reference to https://github.com/hubverse-org/hubAdmin/pull/86/files#r1889979644

Merge branch 'main' into znk/task-config-vignette

99d2e95

fix round id thing

30ad619