Add the ability to normalize wrapped objects #138

gaurav · 2024-01-17T06:44:50Z

Some wrapped objects -- in particular TaxonomicUnitWrappers, TaxonConceptWrappers and TaxonNameWrappers -- may have multiple representations that result in identical definitions -- for example, a TaxonNameWrapper may wrap a taxon name in the form {"nameComplete": "Caiman crocodilus", ...} while another represents this as {"nameComplete": "Caiman crocodilus", "genus": "Caiman", "specificEpithet": "crocodilus", ...}. This is the underlying cause of phyloref/klados#263.

While we could fix this specifically for the three wrappers listed above in Klados, it seems like a better idea to add static methods for PhyxWrapper.normalize(phyxDocument), which could recursively normalize its subcomponents. This would keep the normalize code next to their wrappers, which will make it easier to update in the future if/when the Phyx format changes. It also adds a test for normalizing phylorefs with multiple normalization files -- every phyloreference that ends with _same is expected to be different but to normalize to the same phyloreference, while every phyloreference that ends with _different is expected to normalize to a different phyloreference. Phyx files and Phylogenies are not currently tested, because they don't cause phyloref/klados#263 and so are not a priority for us right now. I've filed an issue to write those tests as #140.

Also includes a few minor changes needed to make this PR work and pass testing:

There is a known issue with Node 21 and 22 and esm (see regression in 21.4.0 when using esm package nodejs/node#51081), so I've added a .node-version file to indicate that we should use Node 20 until that issue is fixed.
Includes fixes for some typos in TaxonConceptWrapper.js.
Increases the JPhyloRef version from 0.4.0 to 1.1.1 to support more modern versions of Java.
The normalization checks are in the test/examples/correct directory, which causes a discrepancy in the expected and actual count of the phyx2owl.js check. This can be fixed by calculating the expected file count with the recursion flag.

hlapp

Sorry for the long delay. Looks fine to me. The one hesitation that comes to mind is that with enumerating the properties in creating the "normalized" object every added property will have be replicated in the normalization code too. But perhaps that's for now just unavoidable? And if someone were to forget to add a new property to the normalization code, would the test(s) catch it?

gaurav · 2024-06-18T06:05:47Z

Sorry for the long delay. Looks fine to me. The one hesitation that comes to mind is that with enumerating the properties in creating the "normalized" object every added property will have be replicated in the normalization code too. But perhaps that's for now just unavoidable? And if someone were to forget to add a new property to the normalization code, would the test(s) catch it?

Hm, that's a good point. There are two kinds of normalization code here: those that keep all un-normalized properties in the normalized object, and those that only use a specific subset of properties to generate the normalized object. I have some ideas on testing these that I've just written up (#141). If I understand your comment correctly, we're okay with the first category (although a few explicit tests would be nice), but the second category needs some additional tests to make sure we handle the use-case where someone adds a new property to, say, SpecimenWrapper (which we plan to do for UUIDs, see phyloref/klados#320). I'm not sure if this will catch the case where we add a new property, but it will make it explicit in the tests which properties we preserve -- would that address your question? Are there other tests we need to add?

gaurav and others added 12 commits January 17, 2024 01:25

First stab at normalize() static methods for all wrappers.

e2b9c64

Merge branch 'master' into add-normalize

5b431ef

Fixed linting errors in CitationWrapper.

8bc0c32

Fixed linting issues.

5ae8e6b

Fixed linting issues.

43aa198

Upgraded packages.

1298d60

Updated JPhyloRef to v1.1.1.

d4d1271

Added test for normalization.

0f9af87

Added normalization code to multiple wrappers.

12185da

Fixed an issue in another test.

aa34a5e

Upgraded packages further.

4ee588c

Added a .node-version before of esm issues.

a16f16e

gaurav mentioned this pull request May 8, 2024

Add tests for normalizing Phyx files and phylogenies #140

Open

Added a _different test in addition to the _same test.

2f7ac95

gaurav requested a review from hlapp May 8, 2024 05:56

gaurav marked this pull request as ready for review May 8, 2024 05:56

gaurav requested review from hlapp and removed request for hlapp June 3, 2024 21:16

hlapp approved these changes Jun 3, 2024

View reviewed changes

gaurav merged commit abed61c into master Jun 18, 2024
4 checks passed

gaurav deleted the add-normalize branch June 18, 2024 06:07

gaurav mentioned this pull request Jun 19, 2024

Release phyx.js v1.2.0 #142

Draft

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add the ability to normalize wrapped objects #138

Add the ability to normalize wrapped objects #138

gaurav commented Jan 17, 2024 •

edited

Loading

hlapp left a comment

gaurav commented Jun 18, 2024

Add the ability to normalize wrapped objects #138

Add the ability to normalize wrapped objects #138

Conversation

gaurav commented Jan 17, 2024 • edited Loading

hlapp left a comment

Choose a reason for hiding this comment

gaurav commented Jun 18, 2024

gaurav commented Jan 17, 2024 •

edited

Loading