Sentiment-analysis

This project applies NLP at the level of individual words, not whole sentences.

A few questions you might have while reading the code

Why are we using utf-8 while reading the file?

utf-8 is a character encoding, not encryption. Text copied from a site or blog often contains characters outside plain ASCII, such as curly quotes or accented letters. Passing utf-8 when opening the file tells Python how to decode those bytes, so the text is read correctly instead of coming out garbled or raising a decoding error.
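As a minimal sketch of the idea above (not necessarily what main.py does), the snippet below writes a small sample file and reads it back with an explicit encoding. The helper name `read_text` and the sample string are assumptions made for illustration.

```python
import os
import tempfile


def read_text(path):
    """Read a text file, decoding its bytes as UTF-8."""
    # Naming the encoding avoids relying on the platform default,
    # which differs between operating systems.
    with open(path, encoding="utf-8") as f:
        return f.read()


# Hypothetical sample containing curly quotes and an accented letter,
# the kind of characters that often come along with copied web text.
sample = "\u201cGreat\u201d caf\u00e9 scene"

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "read.txt")
    with open(path, "w", encoding="utf-8") as f:
        f.write(sample)
    text = read_text(path)
```

Reading the same bytes with a different encoding would typically either produce garbled characters or raise a `UnicodeDecodeError`, depending on the encoding chosen.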

Why should we change all the letters or characters in the text file to lowercase?

Movie != movie, because string comparison is case-sensitive. Converting everything to lowercase makes both forms count as the same word, which gives a more accurate picture of the sentiment in the text.
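A short illustration of why this matters, using a made-up sample string: before lowercasing, the three spellings are different strings; afterwards they collapse into one word.

```python
# Hypothetical sample text with the same word in three different cases.
text = "Movie and movie and MOVIE"

# Case-sensitive comparison treats these as different strings.
same_before = "Movie" == "movie"

# str.lower() returns a new string with every cased character lowercased.
lowered = text.lower()

# After lowercasing, only two distinct words remain.
unique_words = set(lowered.split())
```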

What does tokenization mean?

Tokenization means breaking the text into individual words and saving each word in a list.
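One simple way to do this, sketched under the assumption that splitting on whitespace is good enough: strip punctuation first, then call `split()`. The function name `tokenize` is hypothetical, and the project's own code may tokenize differently.

```python
import string


def tokenize(text):
    """Lowercase the text, drop ASCII punctuation, and split on whitespace."""
    # str.maketrans with a third argument builds a table that deletes
    # every character listed in string.punctuation.
    table = str.maketrans("", "", string.punctuation)
    cleaned = text.lower().translate(table)
    return cleaned.split()


tokens = tokenize("I loved the movie, truly!")
```

Whitespace splitting is a rough approach: it does not handle contractions or hyphenated words specially, which a dedicated tokenizer would.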

What are stop words?

Stop words are common words such as "the", "is", and "and" that carry little meaning on their own, so they can be removed from the token list after tokenization.
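The filtering step can be sketched as below. The `STOP_WORDS` set here is a tiny hand-picked example, not the list the project uses, and `remove_stop_words` is a hypothetical helper name.

```python
# A deliberately small, illustrative stop-word set (an assumption,
# real lists contain a hundred or more entries).
STOP_WORDS = {"i", "the", "a", "an", "is", "was", "and", "to", "of"}


def remove_stop_words(tokens):
    """Return the tokens that are not in STOP_WORDS, keeping their order."""
    return [token for token in tokens if token not in STOP_WORDS]


filtered = remove_stop_words(["i", "loved", "the", "movie"])
```

Using a set for the stop words keeps each membership check fast even when the list grows large.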

annapurna2003/Sentiment-analysis
