[WordSeg] Alphabet can only contain characters from training #599

Open
texasmichelle opened this issue Jun 10, 2020 · 0 comments

Comments

@texasmichelle
Member

Currently, the WordSeg dataset uses characters from all datasets to create an instance of Alphabet, but only the training set should be used.

Fixing this may require architectural changes to CharacterSequence, though we might instead be able to change how failures on characters missing from the Alphabet are handled.
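
To make the second option concrete, here is a minimal sketch of one way failures could be handled without touching CharacterSequence: build the alphabet from the training set only and map any character it has never seen to a placeholder. Everything in it is an assumption for illustration. `SketchAlphabet`, `UNK`, and `encode` are hypothetical names, not the project's actual `Alphabet` API, and an unknown-character fallback is only one possible approach.

```python
from typing import Dict, Iterable, List

# Hypothetical placeholder for characters that never appear in training.
UNK = "<unk>"


class SketchAlphabet:
    """Illustrative stand-in; not the project's actual Alphabet class."""

    def __init__(self, training_sequences: Iterable[str]) -> None:
        # Index 0 is reserved for the unknown-character placeholder.
        self.index: Dict[str, int] = {UNK: 0}
        # Only training sequences contribute characters to the alphabet.
        for sequence in training_sequences:
            for char in sequence:
                if char not in self.index:
                    self.index[char] = len(self.index)

    def encode(self, sequence: str) -> List[int]:
        # Characters absent from training map to UNK instead of raising.
        return [self.index.get(char, self.index[UNK]) for char in sequence]


train = ["abc", "cab"]
test = ["abd"]  # 'd' never appears in the training data

alphabet = SketchAlphabet(train)
encoded_test = [alphabet.encode(sequence) for sequence in test]
```

With this approach the validation and test sets never influence the alphabet, at the cost of collapsing all unseen characters into a single index.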
