-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use open-source Snorkel to create labelling functions to expand our training dataset. #65
Comments
Labelling functions for labels with < 10 examples Sense temperature cues: |
Another idea for a labelling function is to take our ground truth labelled papers, and use the MAG API to find papers related to each one. Each related paper gets assigned the same label as the ground truth paper it's related to. Here's how to get related papers from MAG (kudos to @dsmith111 for figuring this out!):
|
https://www.snorkel.org/get-started/
https://github.com/snorkel-team/snorkel
Many of our labelling functions will use MAG topics. For this we will use the free MAG APIs 'evaluate' method. I will provide an API key for this.
‘Evaluate’ method ’try it out'
https://msr-apis.portal.azure-api.net/docs/services/academic-search-api/operations/565d753be597ed16ac3ffc03?
API limits
10,000 transactions per month, 3 per second for interpret, 1 per second for evaluate, 6 per minute for calcHistogram.
API Documentation
https://docs.microsoft.com/en-us/academic-services/project-academic-knowledge/reference-query-expression-syntax
https://docs.microsoft.com/en-us/academic-services/project-academic-knowledge/reference-evaluate-method
List of Microsoft Academic topics
https://academic.microsoft.com/topics/100858432?fullPath=false
Example API request to get ids, dois, titles, abstract, topics, authors, venue, and references labelled with 'biology' OR in the Biomimetics journal AND labelled with 'wind stress' OR 'wind engineering'. This would be used for a labelling function for 'protect from wind'.
The text was updated successfully, but these errors were encountered: