Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MDS implementation with missing and censored values #175

Open
adalisan opened this issue May 8, 2024 · 4 comments
Open

MDS implementation with missing and censored values #175

adalisan opened this issue May 8, 2024 · 4 comments

Comments

@adalisan
Copy link

adalisan commented May 8, 2024

In the documentation, lack of an MDS implementation for this type of data is mentioned. By "MDS", are you referring to metric MDS specifically (which assume the input matrix is a distance matrix that is based on a metric)? There are relaxed non-metric variants of MDS (like ordinal MDS in smacof package) which would be more robust to outliers as well. I don't know whether the specific R implementation I mentioned support both missing and censored dissimilarities, but ordinal MDS in general should. The limitations of those variants are that they are more expensive to optimize usually.

@shwilks
Copy link
Member

shwilks commented May 8, 2024

Yes, you're right, that question is about metric MDS specifically, but a non-metric variant of MDS that allows for both missing values and right-censored distances would be interesting, the smacof MDS options certainly seem quite comprehensive thanks for sharing that.

It would be nice to have a test set of different types of antigenic map data against which the performance of possible alternative approaches could be easily compared.

@adalisan
Copy link
Author

adalisan commented May 9, 2024

I am relatively new to antigenic cartography, and have some experience in dissimilarity and embedding methodologies. As I understand it, there can be different levels of uncertainty and outliers in the source titer data. in comparing methods, robustness to those may be more important than other factors (like computational efficiency). If there is no reason that you would expect to have the data to have a better fit for metric MDS, other approaches may work better overall.

@adalisan
Copy link
Author

adalisan commented May 9, 2024

As you mentioned, the bottleneck is always test data, which are harder to come by or may not exist at all in some cases.

@adalisan
Copy link
Author

adalisan commented May 9, 2024

I think one of these unfolding functions could be used in smacof https://www.rdocumentation.org/packages/smacof/versions/2.1-6/topics/unfolding

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants