-
Notifications
You must be signed in to change notification settings - Fork 24
[Experimental] Add sample datapackage.jsonld #101
base: master
Are you sure you want to change the base?
Conversation
Since |
rm -r archive.org | ||
", | ||
"size": "1 GB", | ||
"resources": { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"resources"
is the nomenclature used in datapackage.json
.
@@ -0,0 +1,38 @@ | |||
{ | |||
"name": "scholarpedia.org", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
REQUIRED in all of the references.
The most effective way to find the common ground, I think, is by packaging datasets that have already been packaged in these various standards and see which choice covers the most cases. |
@flyingzumwalt since you're the captain of this repo, RFCR? Is this not in the intended direction? |
I put 2 version of manifest file. One is the output of |
All of the issues you're closing pre-date my involvement. I will have to do a bit of reading in order to provide comments. That will have to wait until after the data.gov sprint. I'm currently maxed out dealing with that work. |
This PR packages the data with the scale of 10e5 orders of magnitude smaller than data.gov. Delivering data.gov depends on having the manifest/datapackage.json/packfile implemented, which can be done in parallel with smaller datasets. |
The issues this PR close basically contain which datasets have been published to ipfs. The main concern is the datapackage.json format and the packmanifest format. I don't know how to make this simpler to put: I decouple the task of having to spec these format from having to prepare a 300TB data in the first place. |
Just as a comment, here is the {
"last-synch": "2016-12-27T16:04:20.322511",
"name": "ipfs-ietf-rfc-archive",
"license": {
"url": "http://trustee.ietf.org/license-info/IETF-TLP-4.pdf",
"type": "other-open"
},
"title": "IETF RFC Archive",
"sources": [
{
"web": "https://www.rfc-editor.org/retrieve/",
"name": "RFC Editor"
}
],
"resources": [
{
"path": "rfc-index.txt",
"name": "rfc-index"
},
{
"path": "rfc-data/",
"name": "rfc-data"
},
{
"path": "update.py",
"name": "update-script"
}
],
"ipfs-github-issue": "https://github.com/ipfs/archives/issues/18"
} I based my format on http://specs.frictionlessdata.io/data-package/. What are your thoughts about this format, or the differences between this and your proposal? |
resources
is normalizeddatapackage.json
Closes: #86, #18, #32, #35