Projet_TA

This project aims at tackling the problem of sentiment analysis of tweets in the context of the COVID-19 pandemic.

Description

Natural Language Processing (NLP) is a very hot field these days, especially with social networks. In the IFT712 - Learning Techniques course at the University of Sherbrooke we will study this field applied to tweets.

In concrete terms, the project selected is the classification of tweets on COVID-19. The complete dataset is available in the references at the end of the report.

The objective is therefore to produce several efficient classification algorithms. The objective will be achieved by different important steps such as visualization and data preprocessing.

Our database consists of five classes: Extremely Negative, Negative, Neutral, Positive and Extremely Positive. Our goal is to classify these tweets in order to know which type of sentiment it belongs to. We decided to keep only 3 classes: Negative, Neutral and Positive.

Getting Started

Dependencies

Libraries: requirements.txt
Tested on Linux and Windows operating systems.

Structure of the project

Root : Where the main_notebook.ipynb and python scripts are located.
notebooks_for_test : Where the notebooks for specific tests are located.
out : Where the outputs of the visualizations outputs are located.

Installing

clone the repository:
install the dependencies:

pip install -r requirements.txt

If you want to use nltk lemmatizer :
```
import nltk
nltk.download('wordnet')
```

Executing program

run the notebook main_notebook.ipynb
Only run first cell of the notebook if inside a google colab environment.

Help

Recommended to run in a google colab notebook :
- Access to GPU
- Better display

Authors

Contributors names and contact info

CHANTRE Honorine CHAH2807 : https://github.com/ChantreHonorine

THOMAS Eliott THOE2303 : https://github.com/eliottthomas99

Acknowledgments

Inspiration, code snippets, etc.

Name		Name	Last commit message	Last commit date
Latest commit History 81 Commits
notebooks_for_test		notebooks_for_test
out		out
.gitignore		.gitignore
Corona_NLP_test.csv		Corona_NLP_test.csv
Corona_NLP_train.csv		Corona_NLP_train.csv
Projet_TA_Rapport.pdf		Projet_TA_Rapport.pdf
README.md		README.md
RNN.py		RNN.py
analysing.py		analysing.py
data.json		data.json
data_visualisation.ipynb		data_visualisation.ipynb
hyperparameters.py		hyperparameters.py
main_notebook.ipynb		main_notebook.ipynb
optimising.py		optimising.py
preprocessing.py		preprocessing.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Projet_TA

Description

Getting Started

Dependencies

Structure of the project

Installing

Executing program

Help

Authors

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Projet_TA

Description

Getting Started

Dependencies

Structure of the project

Installing

Executing program

Help

Authors

Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages