Machine learning methods to predict the positivity/sentiment of an IMDb movie review.
- Clone the repository: `git clone https://github.com/RylanSteinkey/imdbSentimentAnalysis.git`
- Change directories into the project folder: `cd imdbSentimentAnalysis`
- Download Anaconda or Miniconda (Python 3.7), following the official installation instructions
- Install dependencies: `conda env create -f envi.yaml`
- Run `snakemake`
- Check results.txt for accuracy and a ranked list of important words
After the above has successfully run, you can execute other models by running models.py as:
- `python models.py XGB`: XGBoost (68.2% accuracy with 1000 samples)
- `python models.py SVM`: Support Vector Machine (68.4% accuracy with 1000 samples)
- `python models.py MNB`: Multinomial Naive Bayes (68.4% accuracy with 1000 samples)
- `python models.py ANN`: Artificial Neural Network (78.8% accuracy with 1000 samples)
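
For a rough idea of what a model such as MNB does, and of what "accuracy and a ranked list of important words" can mean, here is a minimal, dependency-free sketch of Multinomial Naive Bayes. It is not the code in models.py: the toy reviews are invented for illustration, and the function names (`train_mnb`, `predict`, `accuracy`, `ranked_words`) are hypothetical.

```python
import math
from collections import Counter

# Toy, invented reviews for illustration only (not the IMDb data set).
# Label 1 = positive review, 0 = negative review.
TRAIN = [
    ("a wonderful film with great acting", 1),
    ("great story and wonderful cast", 1),
    ("loved it a great movie", 1),
    ("terrible plot and awful acting", 0),
    ("awful film a boring mess", 0),
    ("boring story terrible movie", 0),
]
TEST = [
    ("wonderful movie great cast", 1),
    ("boring and awful plot", 0),
]


def train_mnb(samples, alpha=1.0):
    """Fit Multinomial Naive Bayes with add-alpha (Laplace) smoothing."""
    word_counts = {0: Counter(), 1: Counter()}
    doc_counts = Counter()
    for text, label in samples:
        word_counts[label].update(text.split())
        doc_counts[label] += 1
    vocab = set(word_counts[0]) | set(word_counts[1])
    log_prior = {c: math.log(doc_counts[c] / len(samples)) for c in (0, 1)}
    log_lik = {}
    for c in (0, 1):
        total = sum(word_counts[c].values()) + alpha * len(vocab)
        log_lik[c] = {
            w: math.log((word_counts[c][w] + alpha) / total) for w in vocab
        }
    return log_prior, log_lik


def predict(model, text):
    """Return the class with the highest log-posterior score."""
    log_prior, log_lik = model
    scores = {}
    for c in (0, 1):
        # Words never seen during training are skipped.
        scores[c] = log_prior[c] + sum(
            log_lik[c][w] for w in text.split() if w in log_lik[c]
        )
    return max(scores, key=scores.get)


def accuracy(model, samples):
    """Fraction of samples whose predicted label matches the true label."""
    correct = sum(predict(model, text) == label for text, label in samples)
    return correct / len(samples)


def ranked_words(model, top=3):
    """Rank words by log-likelihood ratio, most positive first."""
    _, log_lik = model
    ratio = {w: log_lik[1][w] - log_lik[0][w] for w in log_lik[1]}
    # Ties are broken alphabetically so the ranking is deterministic.
    return sorted(ratio, key=lambda w: (-ratio[w], w))[:top]


model = train_mnb(TRAIN)
print("accuracy:", accuracy(model, TEST))
print("most positive words:", ranked_words(model))
```

Ranking words by the difference in per-class log-likelihood is only one possible notion of "important words"; the ranking written to results.txt may be computed differently.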