-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Difference between ICE and KR biases. Setting up the bias limits. #55
Comments
....with that, I am always getting an error when using bias file with -1 values and have to remove them manually before running fithic.py. Is t possible to fix in the code?
|
You should be removing the chromosomes/scaffolds that have ALL of their bins with -1 bias values or bias values outside the 0.5-2 range in general, not the specific bins with -1 values. Hope this clarifies |
I see. I used awk to remove lines with -1 values, but cannot figure out how
to find out which scaffolds have biases outside the range. Please help me
with the code. Could this step be included in the tool?
Thank you!
Pavla
…On Mon, 1 Aug 2022, 20:01 ay-lab, ***@***.***> wrote:
You should be removing the chromosomes/scaffolds that have ALL of their
bins with -1 bias values or bias values outside the 0.5-2 range in general,
not the specific bins with -1 values. Hope this clarifies
—
Reply to this email directly, view it on GitHub
<#55 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABYK2UTGMBHUGXUSGBFS2QLVXAGH3ANCNFSM53VAB5VQ>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
I did remove the scaffolds having all of their bins with -1 bias outsider or outside of the range using this R script:
However, I am still getting the same error when using the good biases file.
Thank you for help, Pavla |
the program is trying to throw an ERROR but in there there is a little bug about int to str conversion (we can fix). But the error that is important tells you that you haven't removed all problematic chrs or contigs properly. I suggest you look at the number of bins for each non-filtered chr or contig and there is likely one or more with zero or one such bins |
Hi,
I am processing a HiC-Pro results from a small genome (in scaffolds) at 5000bp resolution. I have used HiCPro2FitHiC utility to convert data and also generated the KR biases file using HiCKRy.py.
For example, I see 2 extremely high values in the ICE:
HiC_scaffold_83 6297500 88.3104735696205
HiC_scaffold_20 672500 43.444427428612
that are far from corresponding KR values:
HiC_scaffold_83 6297500 1.71088667499396
HiC_scaffold_20 672500 0.824151827905017
...otherwise, the distributions are similar, with a number of -1 values.
Which of the bias version is preferable?
My other question is in which cases the -bL and -bU need to be modified and whether it is appropriate to adjust them to the bias method or other genome/data-specific factors.
Thank you!
The text was updated successfully, but these errors were encountered: