This project uses Apache Spark to process the data and trains Linear Regression and Random Forest models to estimate the fare and duration of taxi trips.

armankuyucu/taxi-fare-and-trip-duration-estimation

Taxi Fare and Trip Duration Estimation

This project uses Apache Spark to process the data and trains Linear Regression and Random Forest models to estimate the fare and duration of taxi trips. You can examine the data dictionary of the dataset here.

DataFrame Schema

The table below shows the schema of the DataFrame used in this project.

[Image: DataFrame_Schema]

Fare Estimation Results

[Image: Fare_Estimation]

Trip Duration Results

[Image: Trip_Duration]
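
Regression results such as the fare and trip duration estimates are commonly summarized with error metrics. The README does not state which metrics the project reports, so the sketch below assumes three common choices (RMSE, MAE, and R²) and computes them in plain Python rather than Spark. The function name `regression_metrics` and the sample fares are hypothetical and serve only as illustration.

```python
import math


def regression_metrics(actual, predicted):
    """Return RMSE, MAE, and R^2 for paired actual/predicted values."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")

    n = len(actual)
    errors = [p - a for a, p in zip(actual, predicted)]

    # Residual sum of squares and mean absolute error.
    ss_res = sum(e * e for e in errors)
    mae = sum(abs(e) for e in errors) / n

    # Total sum of squares around the mean of the actual values.
    mean_actual = sum(actual) / n
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)

    # R^2 is undefined when all actual values are identical.
    r2 = 1.0 - ss_res / ss_tot if ss_tot > 0 else float("nan")

    return {"rmse": math.sqrt(ss_res / n), "mae": mae, "r2": r2}


# Made-up fares in dollars, not taken from the project's dataset.
actual_fares = [10.0, 20.0, 30.0, 40.0]
predicted_fares = [12.0, 18.0, 33.0, 39.0]

metrics = regression_metrics(actual_fares, predicted_fares)
print(metrics)
```

Applying the same function to the predictions of each model on a held-out set gives a like-for-like comparison between Linear Regression and Random Forest, for both the fare target and the duration target.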
