Skip to content

Document Store Design

fccoelho edited this page Jul 12, 2012 · 7 revisions

The document store will be based on the following schema:

Collections (or tables if on a relational db)

Documents:

=========

{"_id":"ObjectId",
 "text":"raw utf8 text of the document",
 "filename":"filename as stored in the GRIDFS",
}

Analyses:

========

{"text":"raw utf8 text of the document",
 "filename":"filename as stored in the GRIDFS",
}
{"text":"raw utf8 text of the document",
 "filename":"filename as stored in the GRIDFS",
}
{"text":"raw utf8 text of the document",
 "filename":"filename as stored in the GRIDFS",
}
Clone this wiki locally