Skip to content

AntoineGourru/wikipedia-mining

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Wikipedia Mining

Readme en français

Presentation

This project aim at analysing a french Wikipedia Dump, using two different approaches :

  • text-mining : building a vector representation of the corpus, using well-known VSM and word embedding method.
  • graph-mining : build an atlas based on the cross references.

Installation

Prerequisites

Before installing the project, you'll need

You can check your current versions of the two softwares using the linux commands :

mvn --version
java -version

Building

Building the Maven project :

mvn clean install

Authors

ArcToScience Team, M2 Data Mining, University Lyon 2, France :

License

This project is licensed under the MIT License - see the LICENSE.md file for details

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published