- Using tfidf vectorizer, Logistic Regression, and other Machine Learning models aside with preprocessing techniques used for Arabic Language Tweets to classify Arabic Dialect from text.
- Using AraBERT model version 2 for Deep Learning approach and comparing results with Machine Learning Approach using Confussion Matrix, F1-score.
for more info about the repo please check pdf slides, and check Models Directory for results
- First need to download related packages in conda envirnoment
PyTorch Pandas matplotlib scikit-learn transformers pyarabic emoji nltk
- Make sure you activate env where all packages are downloaded.
- Run ModelTraining_ML.ipynb,ModelPrediction-AraBert.ipynbto get models pickle files
- After that go the saved pickle files and copy (or cut) paste to static folder for the FastAPI server within folders for ML models, and other for AraBERT model.
static
│   ├───ML_models
│   └───output_dir- Run python main.py
- After running your server go to localhost:5000/docsin browser, and try out different POST methods with different text
- Enjoy your server app

