Skip to content

implementation of continuous space topic model for modeling text through gaussian processes

Notifications You must be signed in to change notification settings

seiichiinoue/cstm

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Continuous Space Topic Model

Environment

  • C++ 14+
  • clang++ 9.0
  • boost 1.71.0
  • glog 0.4.0
  • gflag 2.2.2
  • boost-python3
  • python3

Usage

  • process text data with mecab-python3
$ python3 utils/process.py --tar_path data/raw/ --save_path data/train/
  • training CSTM with MH algorithm
$ make
$ ./cstm -ndim_d=20 -ignore_word_count=4 -epoch=100 -num_threads=1 -data_path=./data/train/ -validation_data_path=./data/validation/ -model_path=./model/cstm.model
  • load CSTM model and plot vector
$ make install
$ python3 utils/plot_doc.py
$ python3 utils/plot_word.py
  • caluculation cosine similarity between words/docs
$ python3 utils/similarity.py -word TARGET_WORD

Reference

About

implementation of continuous space topic model for modeling text through gaussian processes

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published