Why not choose the breakpoint with lowest test perplexity? #5

MrSeven77 · 2019-07-23T06:10:46Z

Recently, I've been reproducing the paper's result using the oringinal data and this code.
The picture is a visulization from tensorboard, with record of testing perplexity from every 500 step. And from this picture, I noticed that the last step (ppl:37.76) doesn't have the lowest test perplexity (ppl:25.11). However, the value of the last step consists with paper's result(ppl:36.9) .
So, why not choose the breakpoint with lowest test perplexity? Or, what is the criterion of the convergence of the model?

claude-zhou · 2019-07-28T23:00:37Z

Criterion of convergence is that the perplexity has stabilized not that it reaches a lowest point.

MrSeven77 · 2019-07-29T06:32:54Z

Criterion of convergence is that the perplexity has stabilized not that it reaches a lowest point.

Thanks for reply. However, when testing perplexity is stablized, is there a chance that this model is overfitted?

More over, when i changed the code for NLPCC 2017 dataset and trained the CVAE model, I got extremely large testing perplexty and after training for a while, the losses became NaN. What problem did I encounter? Are there any tutorials of how to train a CVAE model?

Note that the CVAE model is initialized with a pretrained seq2seq model, as the paper said.

MrSeven77 · 2019-07-29T06:50:07Z

Thanks a lot for your reply.

hqlin2018 · 2019-11-05T00:57:37Z

Criterion of convergence is that the perplexity has stabilized not that it reaches a lowest point.

Thanks for reply. However, when testing perplexity is stablized, is there a chance that this model is overfitted?

More over, when i changed the code for NLPCC 2017 dataset and trained the CVAE model, I got extremely large testing perplexty and after training for a while, the losses became NaN. What problem did I encounter? Are there any tutorials of how to train a CVAE model?

Note that the CVAE model is initialized with a pretrained seq2seq model, as the paper said.

hello , do you solve the problem that using the NLPCC 2017 dataset to trained the CVAE model?
and also that is i want to do now.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why not choose the breakpoint with lowest test perplexity? #5

Why not choose the breakpoint with lowest test perplexity? #5

MrSeven77 commented Jul 23, 2019 •

edited

Loading

claude-zhou commented Jul 28, 2019

MrSeven77 commented Jul 29, 2019

MrSeven77 commented Jul 29, 2019

hqlin2018 commented Nov 5, 2019

Why not choose the breakpoint with lowest test perplexity? #5

Why not choose the breakpoint with lowest test perplexity? #5

Comments

MrSeven77 commented Jul 23, 2019 • edited Loading

claude-zhou commented Jul 28, 2019

MrSeven77 commented Jul 29, 2019

MrSeven77 commented Jul 29, 2019

hqlin2018 commented Nov 5, 2019

MrSeven77 commented Jul 23, 2019 •

edited

Loading