GUILD is an unsupervised data categorization framework that clusters unorganized raw text data and assigns topics to it.
Overleaf document: https://www.overleaf.com/6685394279ybbxvcmbgysv
git clone https://github.com/joshianirudh/DocumentClustering.git
cd DocumentClustering
pip install -r requirements.txt
The code is in Notebooks/GUILD.ipynb.
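
To give a feel for what "cluster and assign a topic" means, here is a minimal standard-library sketch. It is not GUILD's algorithm (that lives in the notebook); it is an illustrative stand-in that groups texts by cosine similarity of word counts and labels each group with its most frequent term. The names cluster_and_label, STOPWORDS, and the 0.25 threshold are all hypothetical choices made for this example.

```python
import math
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption, not taken from GUILD).
STOPWORDS = {"a", "an", "and", "another", "as", "on", "the"}


def tokenize(text):
    """Lowercase the text and keep alphabetic words that are not stopwords."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]


def cosine(a, b):
    """Cosine similarity between two term-count Counters."""
    dot = sum(count * b[term] for term, count in a.items())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def cluster_and_label(docs, threshold=0.25):
    """Single-pass clustering: each document joins the most similar existing
    cluster if the similarity reaches the threshold, otherwise it starts a
    new cluster. Each cluster's topic is its most frequent term."""
    centroids = []  # one Counter of summed term counts per cluster
    labels = []     # cluster index assigned to each document
    for doc in docs:
        vec = Counter(tokenize(doc))
        sims = [cosine(vec, c) for c in centroids]
        best = max(range(len(sims)), key=sims.__getitem__, default=None)
        if best is not None and sims[best] >= threshold:
            centroids[best].update(vec)
        else:
            centroids.append(vec)
            best = len(centroids) - 1
        labels.append(best)
    topics = [c.most_common(1)[0][0] if c else "" for c in centroids]
    return labels, topics


docs = [
    "the cat sat on the mat",
    "a cat chased another cat",
    "stocks fell and tech stocks closed lower",
    "markets rallied and stocks rose",
]
labels, topics = cluster_and_label(docs)
print(labels, topics)
```

On these four sentences the sketch puts the two cat sentences in one cluster and the two finance sentences in another. Results depend on document order and on the threshold, which is one reason a real framework would use a more robust method.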