LBDM-project

Repository for the Laboratory of Biological Data Mining project, Oct. 2021.

Installation

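Simply clone the repository. The search_significant.py script uses pandas, so pandas must be available in your Python environment.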

Search significant

The search_significant.py script performs an outer merge (using pandas) of the csv files containing the expansion lists of a gene of interest, as retrieved from the gene@home portal. The merge is keyed on the gene name to reduce complexity, so you must take into account all the isoforms listed in the final dataset when proceeding with the analysis. Run the script with python. You must provide:

(-i, --inputf) folder containing the csv files (not zipped)
(-l, --list) .txt file containing the list of genes to look for in the merged dataframe

Example: python search_significant.py -i /home/elisa/LBDM-project/ACLY -l /home/elisa/LBDM-project/EMTgenes.txt
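
The merge step described above can be sketched roughly as follows. This is a minimal illustration, not the actual code of search_significant.py: the function name merge_expansion_lists, the key column name "gene", and the choice to suffix the remaining columns with the file name are all assumptions made for the example.

```python
from functools import reduce
from pathlib import Path

import pandas as pd


def merge_expansion_lists(folder, key="gene"):
    """Outer-merge every csv file in `folder` on the `key` column.

    `key` is a hypothetical column name; the real expansion lists may
    label the gene-name column differently.
    """
    frames = []
    for path in sorted(Path(folder).glob("*.csv")):
        frame = pd.read_csv(path)
        # Suffix the non-key columns with the file stem so that columns
        # coming from different expansion lists stay distinguishable.
        frame = frame.rename(
            columns={c: f"{c}_{path.stem}" for c in frame.columns if c != key}
        )
        frames.append(frame)
    if not frames:
        raise ValueError(f"no csv files found in {folder}")
    # how="outer" keeps genes that appear in only some of the lists;
    # the missing values are filled with NaN.
    return reduce(
        lambda left, right: left.merge(right, on=key, how="outer"), frames
    )
```

With an outer merge, a gene that appears in only one expansion list still gets a row in the result, which is why the merged dataframe can be searched for any gene seen in at least one file.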

Output

a "significant.csv" file containing the merged dataframe
a list, printed at the command line, of the genes from the provided text file that are present in the merged dataframe
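
As a rough sketch of how the two outputs could be produced from an already merged dataframe: the helper name genes_present, the "gene" column name, and the one-gene-per-line layout of the .txt file are assumptions for illustration, not details confirmed by the script itself.

```python
import pandas as pd


def genes_present(merged, gene_list_path, key="gene"):
    """Return the genes listed in the text file that occur in merged[key].

    Assumes one gene name per line; blank lines are ignored.
    """
    with open(gene_list_path) as handle:
        wanted = [line.strip() for line in handle if line.strip()]
    found = set(merged[key].dropna().astype(str))
    # Preserve the order of the text file in the reported list.
    return [gene for gene in wanted if gene in found]


def write_outputs(merged, gene_list_path, out_path="significant.csv"):
    """Write the merged dataframe to csv and print the matching genes."""
    merged.to_csv(out_path, index=False)
    print(genes_present(merged, gene_list_path))
```

The match here is an exact string comparison on the gene name, so aliases or isoform-specific identifiers in the text file would not be found unless they are spelled as in the merged dataframe.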
