MapReduceCode:

This project focuses on comparing the performance of MapReduce and spark on a Hadoop Cluster for the same sufficiently large dataset. (which can be found here: https://archive.ics.uci.edu/ml/datasets/Poker+Hand )

MapReduceCode:

- Contains the MapReduce code written in Java.

SparkAppCode:

- Contains code written in Scala that can be run on a Cluster. 
  Add relevant `hdfs` or `s3` paths for the testing and training data.

- The app writes the classes of the Test Data to a local `.txt` file on the Master Node.

AccuracyTest

- Use `accuracyTest.java` to check the accuracy of the predicted classes.

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
AccuracyTest		AccuracyTest
MapReduceCode		MapReduceCode
SampleData		SampleData
SparkAppCode		SparkAppCode
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MapReduceCode:

SparkAppCode:

AccuracyTest

About

Releases

Packages

Languages

vinitS101/knn

Folders and files

Latest commit

History

Repository files navigation

MapReduceCode:

SparkAppCode:

AccuracyTest

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages