[BERT-CoLA] Data source resolution #488

Open
texasmichelle opened this issue May 5, 2020 · 2 comments
Labels
good first issue (Good for newcomers), help wanted (Extra attention is needed)

Comments

@texasmichelle
Member

texasmichelle commented May 5, 2020

The CoLA dataset download relies on a Firebase URL with an embedded token. Identify where this URL comes from and determine whether we should host these files on GCS instead.

/cc @BradLarson @eaplatanios

@texasmichelle added the "good first issue" and "help wanted" labels May 5, 2020
@texasmichelle
Member Author

@eaplatanios do you know where this link came from?

@Shashi456
Contributor

@texasmichelle The link comes from here, which is the original script for downloading the GLUE dataset, while the original CoLA dataset resides on this website. The difference between the two is that the original CoLA release hosts its test set separately, as a Kaggle competition, whereas the GLUE download bundles the test set with the rest of the data.
