Add taxon disjoints to subset #75
Awesome initiative! I am too young (imagine that) to know the exact nature of the disjoint subset, so I want to get eyes on this from someone who has all the context:
Did you test with the current release files? https://github.com/obophenotype/ncbitaxon/releases/tag/v2023-02-24
But we still need to fix the pipeline, right?
I can download the file now; maybe it was only a network issue.
I'm having memory issues generating the disjoint file for the complete taxonomy. I allocated 16G of memory to owltools.
I can imagine. Let's drop the full one for now and make a comment on the issue.
Looks good. I think it's fine to just create it for the taxslim for now. I'll make another issue to get rid of owltools; I think we can do this in less memory with Jena SPARQL.
Fixes #72
This includes:
(1) (in_taxon some X) DisjointWith (in_taxon some (not X)) for every taxon X
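For anyone reading along, here is a minimal sketch of what pattern (1) expands to per taxon, written as OWL functional-syntax strings. It is only an illustration, not the code in this PR: the function names are hypothetical, and it assumes in_taxon corresponds to obo:RO_0002162 and that taxon classes use the obo:NCBITaxon_<id> form.

```python
# Assumption: in_taxon is the Relation Ontology property RO:0002162.
RO_IN_TAXON = "obo:RO_0002162"


def taxon_disjoint_axiom(taxon_curie: str) -> str:
    """Build the axiom (in_taxon some X) DisjointWith (in_taxon some (not X)).

    taxon_curie is a CURIE such as "NCBITaxon:9606"; the result is a single
    OWL functional-syntax DisjointClasses axiom as a string.
    """
    # Assumption: OBO-style local names, e.g. NCBITaxon:9606 -> obo:NCBITaxon_9606
    taxon = "obo:" + taxon_curie.replace(":", "_")
    return (
        "DisjointClasses("
        f"ObjectSomeValuesFrom({RO_IN_TAXON} {taxon}) "
        f"ObjectSomeValuesFrom({RO_IN_TAXON} ObjectComplementOf({taxon})))"
    )


def taxon_disjoint_axioms(taxon_curies):
    """Return one axiom string per taxon, in input order."""
    return [taxon_disjoint_axiom(curie) for curie in taxon_curies]


if __name__ == "__main__":
    # Example taxa chosen only for illustration.
    for axiom in taxon_disjoint_axioms(["NCBITaxon:9606", "NCBITaxon:10090"]):
        print(axiom)
```

Since each taxon yields exactly one axiom, the output grows linearly with the number of taxa, which is consistent with the full taxonomy being much heavier to process than the taxslim.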
I could not test because I don't have the ncbitaxon.obo file, and the pipeline needs to download a file from a host that could not be resolved:
curl: (6) Could not resolve host: ftp.ncbi.nih.gov