NERDetection

Project NE Detection as DK Pro Component

This (plain Java Project-no Maven - no Ant - no Eclipse plugin) can be used as a UIMA component to detect names in German novels. It uses a MaxEnt-Classifier ( which showed to perform better than a Linear Chain CRF ) to do so.

The required model is stored in the resource Folder ( however it is expected that there will be changes during the next days !!!) This Project comes with all its dependend jars included.

-(Its only requirements are Mallet 2.07 RC2 , UIMA-Core and UIMA-Fit).

Basic usage ( in accordance with DKPro):

public static void main(String[] args) throws Exception {

CollectionReaderDescription cr = createReaderDescription(TextReader.class,
        TextReader.PARAM_PATH,
        "<Input-File>",
        TextReader.PARAM_LANGUAGE, "de");

AnalysisEngineDescription segmenter = createEngineDescription(OpenNlpSegmenter.class);

AnalysisEngineDescription tagger = createEngineDescription(OpenNlpPosTagger.class);

// ========PARAMS FOR THIS ANALYSIS ENGINE it requires to have POS-tags and Sentences!! ======

String modelLocation = "resources\\modelNERRegular.bin";
String featuresFile = "resources\\features.txt";

AnalysisEngineDescription neDetection = createEngineDescription(RomaneNERAnnotator.class,
        RomaneNERAnnotator.PARAM_FEATURE_FILE_LOCATION, featuresFile,
        RomaneNERAnnotator.PARAM_MODEL_LOCATION, modelLocation);

// =========

AnalysisEngineDescription cc = createEngineDescription(CasDumpWriter.class,
        CasDumpWriter.PARAM_OUTPUT_FILE, "<outputfile>");

runPipeline(cr, segmenter, tagger, neDetection, cc);

}

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
neDetectionProject		neDetectionProject
.gitattributes		.gitattributes
.gitignore		.gitignore
License.txt		License.txt
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NERDetection

Basic usage ( in accordance with DKPro):

About

Releases

Packages

Languages

License

MarkusKrug/NERDetection

Folders and files

Latest commit

History

Repository files navigation

NERDetection

Basic usage ( in accordance with DKPro):

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages