Skip to content

VipulMascarenhas/DocSummarizer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DocSummarizer

Node.js Document Summary Statistics Application

Task:

Utilize Node.js to create a simple command line application that provides summary statistics for a given dataset. This will not require using Node.js as a web server, but rather will use it as a command line application only. The application will take the relative path to a text file (.txt) as a command line argument. The application then processes the text file to generate the following descriptive statistics about the document:

1.	The number of words in the document
2.	The number of lines
3.	A list of all the words with the count of their occurrence, in descending order
4.	A list of the ten most common trigrams
5.	A listing of the edit distances between the most common trigram and the next ten most common trigrams, in order of distance value. This includes the trigrams being compared and the edit distance value.

The output of is a properly formatted HTML file that can be viewed in any modern browser. The name of the HTML report includes the name of the data file provided as input.

About

Node.js Document Summary Statistics Application

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published