Skip to content

Latest commit

 

History

History
43 lines (27 loc) · 1.17 KB

README.md

File metadata and controls

43 lines (27 loc) · 1.17 KB

Wikipedia Mining

Readme en français

Presentation

This project aim at analysing a french Wikipedia Dump, using two different approaches :

  • text-mining : building a vector representation of the corpus, using well-known VSM and word embedding method.
  • graph-mining : build an atlas based on the cross references.

Installation

Prerequisites

Before installing the project, you'll need

You can check your current versions of the two softwares using the linux commands :

mvn --version
java -version

Building

Building the Maven project :

mvn clean install

Authors

ArcToScience Team, M2 Data Mining, University Lyon 2, France :

License

This project is licensed under the MIT License - see the LICENSE.md file for details