-
Notifications
You must be signed in to change notification settings - Fork 111
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HED Validator integration into bids schema based validator. #1648
Conversation
…ion. Add hed validator dep built from current master.
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## master #1648 +/- ##
=======================================
Coverage 85.75% 85.75%
=======================================
Files 91 91
Lines 3785 3785
Branches 1218 1218
=======================================
Hits 3246 3246
Misses 453 453
Partials 86 86 ☔ View full report in Codecov by Sentry. |
@rwblair --- should we be nervous that this is PR is still a draft? Is there anything that we need to prepare for? Thx! VisLab |
…. add logger debug for response from hed validator
I know you have been swamped, but could you give us an update on this @nellh? Thx. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this is ready to merge. I ran through some test cases and was able to match the failures to the legacy validator. ds004395 runs out of memory with the default heap size but does pass increasing the size to 8GB. There could be some memory optimization to be done because this doesn't happen without the HED patches but that dataset is an exception, every other test case passed with the default heap size.
@nellh ... thx for update. I am not sure what is going on with ds004395
-- I looked at it on openNeuro and it doesn't have any HED in it. The new
HED validator works directly from the list of .tsv files and their
associated sidecars as produced from the BIDS side. I wonder if we are
somehow copying the data inadvertently before determining whether to even
look at the files.
@alexander Jones ***@***.***> let's look at this more closely.
Version 3.13.3 should be released in the next day or so. It has some minor
bug fixes.
…On Thu, Jan 11, 2024 at 11:58 AM Nell Hardcastle ***@***.***> wrote:
***@***.**** approved this pull request.
I think this is ready to merge. I ran through some test cases and was able
to match the failures to the legacy validator. ds004395 runs out of memory
with the default heap size but does pass increasing the size to 8GB. There
could be some memory optimization to be done because this doesn't happen
without the HED patches but that dataset is an exception, every other test
case passed with the default heap size.
—
Reply to this email directly, view it on GitHub
<https://github.com/bids-standard/bids-validator/pull/1648#pullrequestreview-1816172159>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAJCJOWF6OFLORSN5CB55JTYOAR4BAVCNFSM6AAAAAAWJW4A2SVHI2DSMVQWIX3LMV43YUDVNRWFEZLROVSXG5CSMV3GSZLXHMYTQMJWGE3TEMJVHE>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
I should finish profiling before running my mouth. |
…l hed-validator call commented out for testing memory usage
I'm not sure what's going on with the Deno build. The lines in the errors do not appear in the version of that file in our code base, and we don't use |
@rwblair Has there been any further progress on this? |
… schema/hedIntegration
I believe the failing test is because of an off-by-one error (you're converting the TSV data to the old format, which you shouldn't ( |
… schema/hedIntegration
… hedvalidator build tsv constructor.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Overall looks reasonable.
Co-authored-by: Chris Markiewicz <[email protected]>
Co-authored-by: Chris Markiewicz <[email protected]>
Co-authored-by: Chris Markiewicz <[email protected]>
…-validator into schema/hedIntegration
bids-validator/src/validators/hed.ts
Outdated
const tsvData = await context.file.text() | ||
const eventFile = new hedValidator.bids.BidsTsvFile( | ||
context.path, | ||
tsvData, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Would passing the ColumnMap
(or a copy) work here, or would that expose too much internal data? The parameter type in hed-validator is Map<string, string[]>
, which appears to be ColumnMap
's parent type.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Using the columns map here generates an error:
TypeError: Cannot read properties of undefined (reading 'forEach')
This is coming from convertParsedTSVData
:
https://github.com/hed-standard/hed-javascript/blob/3be833f929cd3db5e259b315ba6cff6e1c677f59/bids/tsvParser.js#L45
via:
https://github.com/hed-standard/hed-javascript/blob/3be833f929cd3db5e259b315ba6cff6e1c677f59/bids/types/tsv.js#L53
I think this is because Map objects pass the check meant to catch the old style tsv data:
> let x = new Map()
undefined
> x === Object(x)
true
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@happy5214 Does it make sense to go ahead as-is and then submit a PR once hed-validator can handle a ColumnMap
directly?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
#2041 should fix that issue. The type check was a bug that proved easy to fix.
For #1639.
bids-validator/src/deps/hed-validatordependency was generated by running
npm run build` on latest commit for hed-javascript:hed-standard/hed-javascript@bba927bTwo main functions located inbids-validator/src/validators/hed.ts
arehedAccumulator
andhedValidate
hedAccumulator
is called once per context like other checks (filename validation, schema checks) inbids-validator/src/validators/bids.ts
. It catches any events.tsv files and json files and builds up an object to be passed to the actual validator call.Instead of recreating the old data structures that were passed intobids-validator/validators/events/hed.js
I tried to get everything in shape a bit closer to where the outgoing calls were made, not sure if I got it quite right. The contexts have the proper sidecar data in them already so no need to mess withpotentialSidecars
.hedValidate
is called once, after the context walk is complete.bids-validator/src/issues/list.ts
as needed.detectHed
function to skip calls when no HED data present.Currently am getting"ERROR: [HED_SCHEMA_LOAD_FAILED] The supplied schema specification is invalid. Specification: no sche..."
when run onbids-examples/eeg_ds003645s_hed
Latest commits incorporate the hed validator api changes and follow the lead of https://github.com/bids-standard/bids-validator/pull/2014
Validation is done per file as a context level check instead of original accumulator/dataset level check.