Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reproduce reported results #35

Open
trannhuthuat96 opened this issue Nov 1, 2020 · 4 comments
Open

Reproduce reported results #35

trannhuthuat96 opened this issue Nov 1, 2020 · 4 comments

Comments

@trannhuthuat96
Copy link

Hi @hugochan,

Could you please public the actual 20 news data that you used to train model and get the results reported in paper?

I downloaded data from the link http://qwone.com/~jason/20Newsgroups/ (filename is 20news-bydate.tar.gz) and used your code but cannot reproduce your reported results. (I also used your processed data but didn't work as well)

Thanks,

@hugochan
Copy link
Owner

hugochan commented Nov 7, 2020

Hi @hugochan,

Could you please public the actual 20 news data that you used to train model and get the results reported in paper?

I downloaded data from the link http://qwone.com/~jason/20Newsgroups/ (filename is 20news-bydate.tar.gz) and used your code but cannot reproduce your reported results. (I also used your processed data but didn't work as well)

Thanks,

Hi @trannhuthuat96 , that is exactly the data we used in our experiments. You should be able to reproduce the results following the instructions in README. Can you report your results here? Also I am not sure if the issue is related to different TF versions, so please check #24 as well.

@trannhuthuat96
Copy link
Author

trannhuthuat96 commented Nov 18, 2020

Sorry @hugochan for late response.

I checked issued #23 and #24 as well and changed TF version to 1.2.1 but still didn't reproduce your results.

For document classification on 20 news data, I got 62% accuracy with 128 topics with KATE. Moreover, using your processed data mentioned in #23, the results is 73% instead of (74.4%) as reported.

Do you have any ideas/comments to reproduce result?

@hugochan hugochan reopened this Nov 30, 2020
@hugochan
Copy link
Owner

Sorry @hugochan for late response.

I checked issued #23 and #24 as well and changed TF version to 1.2.1 but still didn't reproduce your results.

For document classification on 20 news data, I got 62% accuracy with 128 topics with KATE. Moreover, using your processed data mentioned in #23, the results is 73% instead of (74.4%) as reported.

Do you have any ideas/comments to reproduce result?

@trannhuthuat96 I am not sure what will be other reasons besides package (e.g., TF, Keras) versions. BTW, did you use cpu or gpu version of TF? I used cpu version when doing the experiments. Not sure if this is the cause.

You might also try running the experiments multiple times with different random seeds, and report the avg/std scores.

@trannhuthuat96
Copy link
Author

Thanks @hugochan for your suggestions.
I used both GPU and CPU for training but cannot reproduce your reported results.

May be it is needed further exploration to reproduce the results.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants