Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Modelling for Test Assertions #18787

Merged
merged 3 commits into from
Sep 17, 2024
Merged

Conversation

jmchilton
Copy link
Member

@jmchilton jmchilton commented Sep 6, 2024

Summary

The source of truth for the documentation around test assertions from the Galaxy Tool XSD file (including semantically important things like are parameters required and their xsd datatype) has been pulled out of the XSD and put into the Python modules via docstrings and Annotated parameters (stack overflow on how to use Annotated). This now centralized documentation and typing information is used to generate Pydantic models for assertion lists. The JSON versions of assertion lists is used by the Planemo test format (https://planemo.readthedocs.io/en/latest/test_format.html) and in experimental YAML tool definitions. Validating these is an important part of a hardened workflow tool chain.

Why not XSD?

Out-of-sync

I've always been a little uncomfortable with the assertion documentation in galaxy.xsd. The plugins were meant to be defined as isolated Python modules. Spreading out the documentation in this other format is a bit problematic because the two sources of truth can easily become out of sync - and indeed had in some ways (see bug fixes below).

Developer Familiarity

While I do think XSD is very clever and kind of wonderful to develop against - it is a bit of a dying format and is almost certainly less familiar to nearly anyone interested in developing test assertions than Python. I hope centralizing these things makes it easier for new developers to more rapidly develop and document test assertions. This point pairs well with the new test cases that make it really easy to test assertion parsing. The developer experience should be a lot better despite the assertions themselves being more generically useful.

Not Intrinsically Tied to XML

These test assertions are very readily ported to JSON and useful there. Indeed, the Pydantic models provide stronger validation and typing. For this reason, I think placing the "source of truth" documentation in XSD is a bit less than ideal.

But...

I am very confident this switch is the right choice, but I will admit there is a "but". We are losing some minor things by not using hand crafted XSD. The structured of the XSD itself and how XSD abstractions are used is a bit less than with the auto-generated code. The file is a bit bigger now and a bit more redundant. This is the nature of auto-generated code and I think is fine - the benefits of centralization described above are worth the cost and then some.

Typing Improvements

In addition to improving the documentation of the Python code, I also made a pass at making all the types more specific where it makes sense. For instance, various image assertion XSD attributes could have used the type xs:nonNegativeInteger and were instead using xs:integer. I also added validators to get similar specificity out of Pydantic. In some places, I was able to go even farther with Pydantic. One instance is the center_of_mass parameter that has the format <float>, <float>. It is trivial to validate this format in Pydantic but would likely be more difficult to validate in XSD. Additionally, the Pydantic version of validation compiles all embedded regular expressions to ensure they are correct. Despite the differences in validation quality between XML and JSON - all the declaration of typing used to do this is centralized in the module definitions themselves. Maintaining the single source of truth I would hope for from a "plugin".

Tests & Bug Fixes

I've added a file that has positive and negative validation test cases for I think every assertion test case in both JSON (for the generated pydantic) and XML (for the generated XSD). These test cases are really trivial to write - one just needs to add new snippets to either positive or negative validation lists.

These test cases found a few issues around XSD. Recursive XML testing (from asserts/xml.py ) definitions were documented in the XSD and would work in tools but I think would not validate before. Only the archive assertion was setup correctly in the XSD for these recursive assertions.

Linting the XSD

In order to do automatic code generation for the XSD well, I've auto formatted it with xmllint. The project is various familiar with code formatting and the advantages it provides for Python and Typescript - I just wanted to note that the XSD is now being formatted in the some way. The Makefile target I added is format-xsd

Applications

I've started work on Pydnatic models for the Planemo test format (as an alternative approach to galaxyproject/planemo#1417) at https://github.com/jmchilton/galaxy/pull/new/test_format. This PR doesn't include that work but it provides the infrastructure to do the hardest parts and ensure the test format stays in line and on par with the tool XML assertions over the long term. I think the validation that we do using Pydantic goes beyond what is done for XSD so I think the validation available via this approach will be stronger that that going down the XSD -> jsonschema route. JSON schema for these models can still be generated from the Pydantic using the following command:

. .venv/bin/activate; PYTHONPATH=lib python -c "from galaxy.tool_util.verify.assertion_models import assertion_list; import json; print(json.dumps(assertion_list.model_json_schema(), indent=2))"

Additionally, there are bits and pieces of progress toward dynamic tools in Galaxy (for workflows) and YAML tools. For the dynamic tools - we probably want to use validated YAML - so I think these models will be very important to ensure those things are hardened and can be maintained long term.

Resyncing

The XSD and Pydantic can be resync-d against the Python implementation of the assert plugins using.

. .venv/bin/activate; PYTHONPATH=lib python lib/galaxy/tool_util/verify/codegen.py

These validation artifacts can be validated against the new tests using the pytest command:

pytest test/unit/tool_util/test_assertion_models.py

How to test the changes?

(Select all options that apply)

  • I've included appropriate automated tests.
  • This is a refactoring of components with existing test coverage.

License

  • I agree to license these and all my past contributions to the core galaxy codebase under the MIT license.

@jmchilton jmchilton added kind/enhancement kind/refactoring cleanup or refactoring of existing code, no functional changes area/testing area/workflows area/tool-framework labels Sep 6, 2024
@jmchilton jmchilton force-pushed the assertion_overhaul branch 3 times, most recently from 8fc578f to 347639a Compare September 6, 2024 21:00
Copy link
Contributor

@bernt-matthias bernt-matthias left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pretty cool.

return v


has_line_line_description = """The full line of text to search for in the output."""
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What is the intention of defining the description text separately?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I thought the generated code would read more clean if the potentially large strings were defined on their own. Probably a pretty arbitrary choice either way.


has_line_max_description = """Maximum number (default: infinity), can be suffixed by ``(k|M|G|T|P|E)i?``"""

has_line_negate_description = """A boolean that can be set to true to negate the outcome of the assertion."""
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This description should be the same for all assertions that implement negate.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The docs have a single source of truth in the parameters types - I don't think it is worth while to try to de-duplicate the strings here. Does that make sense?

<xs:group ref="TestAssertionsJson" minOccurs="0" maxOccurs="unbounded"/>
<xs:group ref="TestAssertionsH5" minOccurs="0" maxOccurs="unbounded"/>
<xs:group ref="TestAssertionsImage" minOccurs="0" maxOccurs="unbounded"/>
<xs:group ref="TestAssertion"/>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is there a way to ensure that this part is not edited?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'll see if I can regenerated these with some comments.

Copy link
Member Author

@jmchilton jmchilton Sep 12, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also we could put the definitions in their own file. That would probably be a better design in the abstract.

Edit: I don't know what ramifications this would have - it is nice to have a single URL to download for tooling. I'll add the comments.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've regenerated the docs with a lot more warnings about things being auto-generated and not to modify them. I've also rearranged things so the XSD has comments about where modules are defined so the docs can be updated.

@jmchilton jmchilton force-pushed the assertion_overhaul branch 3 times, most recently from 7032070 to b93e8a9 Compare September 13, 2024 14:37
@jmchilton jmchilton force-pushed the assertion_overhaul branch 3 times, most recently from 2ca12f9 to b7e9957 Compare September 13, 2024 20:06
@jmchilton jmchilton marked this pull request as ready for review September 15, 2024 15:48
@github-actions github-actions bot added this to the 24.2 milestone Sep 15, 2024
@jmchilton jmchilton merged commit 5acc518 into galaxyproject:dev Sep 17, 2024
52 of 55 checks passed
@jmchilton jmchilton mentioned this pull request Jan 23, 2025
4 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/testing area/tool-framework area/workflows kind/enhancement kind/refactoring cleanup or refactoring of existing code, no functional changes
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants