Carnegie Hall Archives maintains a series of small, portable scripts for matching performance data to files. Expediting batch description procedures on our Digital Collections allows us to work quickly and with accuracy.
These scripts have benefitted immensely from a wide community of archiving, preservation, and programming experts who share their code and troubleshooting techniques online. We are excited by the opportunity to participate in this community and have our methods improve through open collaboration and mutual exchange.
For more information on Carnegie Hall's performance history database and publicly-accessible Performance History Search, please see our linked-data overview.
Script Name | Purpose |
---|---|
idMatching_flyers.py | Matches performance database IDs/information to filenames based on filenaming convention and matching logic (for Archives' flyer collections) |
idMatching_photos.py | Matches asset IDs to scrape updated filenames to create metadata spreadsheet (for an Archives' subset of public relations' photographs) |
idMatching_programs.py | Matches performance database IDs/information to filenames based on filenaming convention and matching logic (for Archives' concert program collections) |
idMatching_video.py | Matches performance database IDs/information to filenames based on filenaming convention and matching logic (for Archives' Robert Shaw Workshop and other video collections) |
This code is provided “as is” and for you to use at your own risk. The information included in the contents of this repository is not necessarily complete. Carnegie Hall offers the scripts as-is and makes no representations or warranties of any kind.
The MIT License (MIT)
Copyright (c) 2016 Carnegie Hall
All contents are released under the terms described in the MIT License included in this repository.
We plan to update the scripts regularly. CH Archives welcomes your thoughts, questions, and recommendations on our evolving quality control strategies.
Anyone is welcome to start a new topic ("issue") by selecting the Issues
tab in GitHub and clicking the green New Issue button in the upper right.
All existing issues, open and closed, may be reviewed or commented upon in the Issues section.
Email your thoughts to the Carnegie Hall Archives at [email protected] with the subject line Digital Collections: Metadata Matching.