You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Right now we're just dumping everything into nutch. Well, we're already reconfiguring Nutch for each crawl when we do visualization, should I have each nutch crawl dump into a different index?
The text was updated successfully, but these errors were encountered:
Discussion with Katrina in Flowdock. Ideal would be one index per project, and then a crawl_id field. I don't think Nutch can do the latter, but I'll look at what options are available to the indexer.
Right now we're just dumping everything into
nutch
. Well, we're already reconfiguring Nutch for each crawl when we do visualization, should I have each nutch crawl dump into a different index?The text was updated successfully, but these errors were encountered: