Skip to content

Latest commit

 

History

History
2 lines (2 loc) · 295 Bytes

File metadata and controls

2 lines (2 loc) · 295 Bytes

NBA_playerInfo_Sparks

This project extracts list, information and statistics from Wikipedia articles of current and past NBA players. I used Spark SQL to extract information from html documents and save it to a csv file. In the nearby future, I will post the same objective achieved using Pig