[Fix] check_outliers() expose 'xi' parameter for OPTICS #785
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
I have a nice new dataset of a questionnaire validation that included attention checks. I have been playing with check_outliers(), and the OPTICS method. Interestingly, the current rule for its threshold that we have didn't work at all, + I had to change
xi
to actually make it work (hence this PR that allows to do that).Interestingly though, we typically consider outliers observations with a large distance, but my data shows that people that have failed attention checks (i.e., "objective" outliers) tend to have a low distance:
(the logistic model failed ~ distance_optics makes it a significant predictor of failure)
I wouldn't recommend any action for now, maybe we can simply mention this observation (that it is possible that low distances are also problematic) once my paper is published and link to it