Skip to content

Utility for generating Lucene indexes for collections of emails.

License

Notifications You must be signed in to change notification settings

ArchitecturalKnowledgeAnalysis/EmailIndexer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

EmailIndexer

Utility for generating Lucene indexed datasets for collections of emails. Includes the following components:

  • Dataset generation (including indexing)
  • Dataset data access
  • Lucene searching
  • Data exporting to a variety of formats.

Dataset Structure

The datasets generated by this library are structured simply as a directory containing multiple files and/or subdirectories:

  • The index directory contains all files used by Apache Lucene for creating and searching over indexes.
  • The database.mv.db file is the self-contained H2 relational database that contains all emails and any associated tags.
  • A metadata.properties file contains meta information about the dataset. Currently stores the version number.

About

Utility for generating Lucene indexes for collections of emails.

Resources

License

Stars

Watchers

Forks

Packages