A small project to improve anvi'o, based upon feedback/ideas @FlorianTrigodet and I heard from our colleagues at the QIB in Norwich.
The need
If you install function databases in a non-default location, as is often done on HPCs, it could be nice to set an environment variable for your entire working group so that everyone can use these function databases without having to always specify their location on the command line with parameters like --kegg-data-dir. For example, we already have one of these for --cog-data-dir, which is accessed via the environment variable $ANVIO_COG_DATA_DIR.
Here is how we use that variable in cogs.py:
elif 'ANVIO_COG_DATA_DIR' in os.environ:
    self.COG_base_dir = os.environ['ANVIO_COG_DATA_DIR']
The existence of this variable is not documented on the anvi-setup-ncbi-cogs help page (or elsewhere on COGs-related help pages), so one issue is that very few people know about it.
But adding these sorts of environment variables for the other programs that use data directories (i.e., --kegg-data-dir, --pfam-data-dir, --scgs-taxonomy-data-dir, etc.) could be valuable :) As long as they are properly documented, of course.
The solution
Add documentation for $ANVIO_COG_DATA_DIR on the related help pages. Create new variables to be accessed via os.environ for the other data directories (and document them).
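To make the idea concrete, here is a minimal sketch of a shared helper that each data-directory class could call. It is not existing anvi'o code: the function name `resolve_data_dir`, the variable name `ANVIO_KEGG_DATA_DIR`, and the paths are all hypothetical, and the precedence (command-line value first, then environment variable, then default) is an assumption based on the `elif` in the cogs.py snippet above.

```python
import os


def resolve_data_dir(cli_value, env_var, default_dir, environ=None):
    """Pick a data directory (hypothetical helper, not part of anvi'o).

    Assumed precedence: explicit command-line value, then the named
    environment variable, then the built-in default location.
    """
    # Allow a custom mapping to be passed in so the function is easy to test.
    environ = os.environ if environ is None else environ

    if cli_value:
        return cli_value
    if environ.get(env_var):
        return environ[env_var]
    return default_dir


# Example with a made-up variable name and made-up paths.
fake_env = {"ANVIO_KEGG_DATA_DIR": "/shared/databases/KEGG"}
kegg_dir = resolve_data_dir(None, "ANVIO_KEGG_DATA_DIR", "/default/KEGG", fake_env)
# kegg_dir is "/shared/databases/KEGG" because no command-line value was given
```

With something like this, a working group would only need to export the relevant variables once (for instance in a shared shell profile), and individual users could still override them on the command line.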
Beneficiaries
Anyone who has to install databases that anvi'o depends on in non-default locations (especially HPC users).