Skip to content

A scalable, distributed, and Dockerized pipeline for weather prediction using Kafka, Cassandra, and Spark.

Notifications You must be signed in to change notification settings

alex-weichun-huang/weather_app

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Weather App

A scalable, distributed, and Dockerized pipeline for weather prediction using Kafka, Cassandra, and Spark.

Note: To customize the port numbers, host names, replication factor, et cetera, please modify the config file.

Instructions

  1. Start the Weather App

Note: running this App will take up roughly 20G memory!

make start
  1. Train the model with Spark ML
make trainer
  1. Inference on stream data and save to Cassandra
make predictor
  1. View the data stored in Cassandra
make viewer
  1. Clean up
make clean

Appendix

  1. To generate the gRPC proto
python3 -m grpc_tools.protoc -I=. --python_out=. --grpc_python_out=. report.proto

About

A scalable, distributed, and Dockerized pipeline for weather prediction using Kafka, Cassandra, and Spark.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published