Analysis and clustering of big data with Hadoop and Flink. K-means is applied to flow cytometry data to cluster cells that exhibit similar behaviour and to remove outliers, with the aim of identifying characteristics of cells that may lead to cancer.
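As a rough illustration of the clustering and outlier-removal idea, here is a minimal single-machine sketch in plain Python. It is not the Hadoop or Flink implementation from this project. The function names (`nearest`, `kmeans`, `split_outliers`), the `max_distance` threshold, the choice of the first `k` points as initial centroids, and the toy `events` values are all assumptions made for the example. Real flow cytometry events would have more channels and far more rows.

```python
import math


def nearest(point, centroids):
    """Return the index of the centroid closest to `point` (Euclidean distance)."""
    return min(range(len(centroids)), key=lambda c: math.dist(point, centroids[c]))


def kmeans(points, k, max_iterations=50):
    """Lloyd's algorithm. Assumption: the first k points seed the centroids."""
    centroids = list(points[:k])
    for _ in range(max_iterations):
        # Assignment step: attach every point to its nearest centroid.
        labels = [nearest(p, centroids) for p in points]
        # Update step: move each centroid to the mean of its members.
        updated = []
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:
                updated.append(tuple(sum(axis) / len(members) for axis in zip(*members)))
            else:
                updated.append(centroids[c])  # keep an empty cluster's centroid in place
        if updated == centroids:  # no centroid moved, so the clustering has converged
            break
        centroids = updated
    labels = [nearest(p, centroids) for p in points]
    return centroids, labels


def split_outliers(points, centroids, labels, max_distance):
    """Separate points lying farther than `max_distance` from their own centroid."""
    kept, outliers = [], []
    for p, label in zip(points, labels):
        target = outliers if math.dist(p, centroids[label]) > max_distance else kept
        target.append(p)
    return kept, outliers


# Synthetic two-channel "events" (made-up values, not real cytometry data).
events = [
    (1.0, 1.0), (9.0, 9.0), (1.2, 0.8), (0.8, 1.2),
    (9.2, 8.8), (8.8, 9.2), (9.0, 15.0),
]
centroids, labels = kmeans(events, k=2)
kept, outliers = split_outliers(events, centroids, labels, max_distance=3.0)
print("labels:", labels)
print("outliers:", outliers)
```

On this toy input the two tight groups form the two clusters, and the single far-away event is flagged as an outlier. A fixed distance threshold is only one possible outlier rule; a per-cluster rule based on the spread of distances may suit real data better.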
The test data is too large to upload and needs to be curated first. Coming soon.