GitHub - Razany98/Sentiment-analysis

Sentiment analysis is a A natural language processing technique used to determine the data belongs to which class/type of emotions. Sentiment analysis is often performed on textual data to help businesses monitor brand and product sentiment in customer feedback and understand customer needs.

Dataset:

The dataset contains 20,000 tweets that were collected through API for classifying emotions.

It contains 6 classes of emotions:

Anger
joy
fear
love
surprise
sadness

The dataset can be found at:

https://huggingface.co/datasets/dair-ai/emotion

or

https://paperswithcode.com/dataset/emotion

The Pipeline:

1 - Text Preprocessing Dataset Cleaning: Check for null values Check for duplicated values Remove unwanted patterns as: #, &, punctuation, ... etc.
2 - Text Normalization Used NLTK Library and includes 3 Techniques: Tokenization: Converting sequence of texts into smaller parts. Slemming: Reducing a word to its word stem (root). Lemmatization: Reduce the word better to its root word, or lemme, good.
3 - Word Embedding BOW (Bag Of Words): Accuracy 80% TF-IDF: Accuracy 77% Word2Vec with LSTM: 64%
4 - Modeling & Evaluation

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Dataset		Dataset
README.md		README.md
Sentiment_Analysis.ipynb		Sentiment_Analysis.ipynb
Sentiment_Analysis_LSTM.ipynb		Sentiment_Analysis_LSTM.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dataset:

The Pipeline:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Dataset:

The Pipeline:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages