SchemaRefinery - A Tool for Refining Genomic Schemas

Description

The SchemaRefinery repository contains tools and modules for refining genomic schemas. These tools help in identifying paralogous loci, spurious genes, and annotating schemas. The repository supports various genomic data processing tasks and provides configurable parameters for different processes.

Installation

Current implementation of SchemaRefinery is available as a Python package available at SchemaRefinery GitHub repository. To install the package, run the following command:

Install Python: Ensure you have Python 3.9 or higher installed. You can download Python from the official website.

Install Git: If you don't have Git installed, you can install it using the following command:

sudo apt-get install git # For Debian-based systems
sudo yum install git # For Red Hat-based systems
brew install git # For macOS

Clone the Repository: Clone the SchemaRefinery repository from GitHub:

git clone https://github.com/MForofontov/Schema_Refinery.git # Clone the repository

Change Directory: Navigate to the cloned repository:

cd Schema_Refinery # Change directory to the repository

Install Dependencies: Install the required Python packages:
```
pip install biopython
```
Install the Package: Install the SchemaRefinery package:
```
python setup.py install
```
Verify Installation: Verify the installation by running the following command:
```
SR --help
```

Modules

The repository includes the following main modules:

IdentifyParalogousLoci: Identifies paralogous loci in a schema.
IdentifySpuriousGenes: Identifies spurious genes in a schema.
SchemaAnnotation: Annotates schemas with additional information.
MatchSchemas: Matches schemas in a directory.
DownloadAssemblies: Downloads genomic assemblies from various databases.
AdaptLoci: Adapts loci in fasta format to a schema format.

Dependencies

Python 3.9 or higher
Biopython library (pip install biopython)
NCBI datasets (NCBI datasets)

Modules Usage

Each module can be used independently by running the corresponding script with the required command-line arguments. Below are examples for each module:

IdentifyParalogousLoci

```bash
SR IdentifyParalogousLoci --help
```

IdentifySpuriousGenes

```bash
SR IdentifySpuriousGenes --help
```

SchemaAnnotation

```bash
SR SchemaAnnotation --help
```

MatchSchemas

```bash
SR MatchSchemas --help
```

DownloadAssemblies

```bash
SR DownloadAssemblies --help
```

AdaptLoci

```bash
SR AdaptLoci --help
```

Troubleshooting

If you encounter issues while using the modules, consider the following troubleshooting steps:

Verify that the paths to the schema, output, and other directories are correct.
Check the output directory for any error logs or messages.
Increase the number of CPUs using the -c or --cpu option if the process is slow.
Ensure that you have a stable internet connection.

if the issue persists, please report it to the development team using github issues.

Contributing

We welcome contributions to the SchemaRefinery project. If you would like to contribute, please follow these steps:

Fork the repository on GitHub.
Create a new branch for your feature or bugfix.
Make your changes and commit them with a clear message.
Push your changes to your forked repository.
Create a pull request to the main repository.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Contact Information

For support or to report issues, please contact the development team at GitHub issues in SchemaRefinery GitHub repository.

Name		Name	Last commit message	Last commit date
Latest commit History 1,035 Commits
SchemaRefinery		SchemaRefinery
scripts		scripts
tests		tests
.editorconfig		.editorconfig
.gitignore		.gitignore
.readthedocs.yaml		.readthedocs.yaml
.travis.yml		.travis.yml
CONTRIBUTING.rst		CONTRIBUTING.rst
HISTORY.rst		HISTORY.rst
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
Makefile		Makefile
README.md		README.md
README.rst		README.rst
requirements_dev.txt		requirements_dev.txt
setup.cfg		setup.cfg
setup.py		setup.py
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SchemaRefinery - A Tool for Refining Genomic Schemas

Description

Installation

Modules

Dependencies

Modules Usage

IdentifyParalogousLoci

IdentifySpuriousGenes

SchemaAnnotation

MatchSchemas

DownloadAssemblies

AdaptLoci

Troubleshooting

Contributing

License

Contact Information

About

Releases 2

Packages

Contributors 6

Languages

License

B-UMMI/Schema_Refinery

Folders and files

Latest commit

History

Repository files navigation

SchemaRefinery - A Tool for Refining Genomic Schemas

Description

Installation

Modules

Dependencies

Modules Usage

IdentifyParalogousLoci

IdentifySpuriousGenes

SchemaAnnotation

MatchSchemas

DownloadAssemblies

AdaptLoci

Troubleshooting

Contributing

License

Contact Information

About

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 6

Languages

Packages