This repository has been archived by the owner on Jun 15, 2021. It is now read-only.

Correlations tool fails with a large corpus #471

Open
susanmonnelly opened this issue Feb 11, 2020 · 2 comments

Comments

@susanmonnelly

https://voyant-tools.org/tool/Correlations?corpus=288cf2a4af5b205c07ab68c8294e725a&view=correlations

This is an XML document that is about 29 MB in size.

Although slow to load, it works in other tools.

@sgsinclair
Owner

Right, I'm not entirely sure what to do for this case, since an open-ended (i.e., no query) request for correlations has to compare every word to every other word in a gigantic matrix. I'm tempted to say that the tool should only work for either 1) a query or 2) some threshold of top-frequency terms. Any thoughts on this?
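To make option 2 concrete, here is a minimal Python sketch of the idea. It is not Voyant's actual code: the function names, the `limit` parameter, and the whitespace tokenization are all assumptions made for illustration. It counts terms per segment, keeps only the most frequent terms, and computes Pearson correlations for pairs of those terms only.

```python
from collections import Counter
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences; None if either is constant."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    dx = [x - mean_x for x in xs]
    dy = [y - mean_y for y in ys]
    denom = sqrt(sum(d * d for d in dx) * sum(d * d for d in dy))
    if denom == 0:
        return None  # correlation is undefined for a constant series
    return sum(a * b for a, b in zip(dx, dy)) / denom


def top_term_correlations(segments, limit=50):
    """Correlate only the `limit` most frequent terms (hypothetical threshold)."""
    # Naive tokenization: lowercase and split on whitespace (an assumption).
    counts = [Counter(seg.lower().split()) for seg in segments]
    totals = Counter()
    for c in counts:
        totals.update(c)
    top = [term for term, _ in totals.most_common(limit)]

    results = []
    # Only limit * (limit - 1) / 2 pairs instead of every pair in the vocabulary.
    for a, b in combinations(top, 2):
        r = pearson([c[a] for c in counts], [c[b] for c in counts])
        if r is not None:
            results.append((a, b, r))
    return results
```

With `limit=50` this is 1,225 comparisons regardless of corpus size, whereas a hypothetical vocabulary of 100,000 distinct terms would need roughly 5 billion.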

@susanmonnelly
Author

I'm not sure what to do either, but could you at least inform the user that there's a problem and tell them how to fix it?

Here's a screenshot:

Screen Shot 2020-04-02 at 10 57 02 AM

I have also seen a similar problem in the Phrases tool, specifically when selecting different words in the drop-down:

Screen Shot 2020-04-02 at 10 59 13 AM
