Skip to content
This repository has been archived by the owner on Jun 26, 2020. It is now read-only.

Crosswalk performance for large tabular input files #78

Open
gregjan opened this issue Oct 13, 2011 · 1 comment
Open

Crosswalk performance for large tabular input files #78

gregjan opened this issue Oct 13, 2011 · 1 comment

Comments

@gregjan
Copy link
Contributor

gregjan commented Oct 13, 2011

The Research Labs of Archaeology has a tabular input file of 40,000 records. Each crosswalk run in these circumstances takes about ten minutes and leaves the user will little feedback. (A rolling beach ball on the Mac)

Warn users about the size of a large input file, suggest that they take an action:

  • limit the crosswalk to a number range of lines
@gregjan
Copy link
Contributor Author

gregjan commented Oct 13, 2011

Create separate files for each crosswalked MODS record. This keeps a large number of records out of the in-memory workbench METS model.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

1 participant