Skip to content
This repository has been archived by the owner on Oct 2, 2020. It is now read-only.

Commit

Permalink
changed tokenize.py to score.py
Browse files Browse the repository at this point in the history
  • Loading branch information
kevin-wittmer committed Apr 18, 2016
1 parent 98b8392 commit 97f7602
Showing 1 changed file with 18 additions and 0 deletions.
18 changes: 18 additions & 0 deletions score.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,18 @@
import csv

# Split each tweet into a list of words
def stemmify(tweet_string):
word_list = tweet_string.split()
return word_list

# Open csv file containing tweets
csvfile = open('clean_data/collection3.csv')
csvreader = csv.DictReader(csvfile)

# Split each tweet into a list of words and print the list
for tweet in csvreader:
stemmed_words = stemmify(tweet['text'])
print stemmed_words

# Close the csv file
csvfile.close()

0 comments on commit 97f7602

Please sign in to comment.