NLP enabled supershell
Install scrapy and nltk prior to running this program.
-
Clone the repo
-
make setup.sh executable
-
Run setup.sh
-
That creates a folder named ManPageCrawler
-
cd ManPageCrawler
-
run getmanpages.sh
-
Done! That fetches all man pages
-
In order to process the data, cd into programs directory
-
First run doc_vector.py and then run clusterer2.py
-
Processed result gets stored in clusterdata folder situated in the ManPageCrawler folder