-
Notifications
You must be signed in to change notification settings - Fork 47
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DMRfind generates thousands of chunk.tsv files #94
Comments
Yes the behavior of generating thousands of chunk files is expected. The chunk files are expected to be removed at the end of the run but it does not seem to be the case. I don't know what went wrong. One potential explanation is that the program died without error message. If you run it again, do you still see the chunk files? |
I am experiencing the same issue. I'm re-running DMRFind on allc files we previously used and I'm no longer getting the appropriate output files and the chunk files remain. The rms_results.tsv is empty Filtering allc files using 6 node(s). Splitting allc files for chromosome Chr1 Running rms tests for chromosome Chr1 Splitting allc files for chromosome Chr2 Running rms tests for chromosome Chr2 Splitting allc files for chromosome Chr3 Running rms tests for chromosome Chr3 Splitting allc files for chromosome Chr4 Running rms tests for chromosome Chr4 Splitting allc files for chromosome Chr5 Running rms tests for chromosome Chr5 Merging sorted test_rms_results.tsv files. Begin FDR Correction m0 estimate for iteration 0: 0 Calculating Residual Cutoff There are no null residuals to calculate resid_cutoff. Using 2 as the cutoff. Adding Methylation Levels Done |
Could you put together a small example to reproduce the error? |
Hi Yupeng, thanks for your help. I'm attaching small subsets of the data set. I'm also attaching the yml for the methylpy conda environment. This is the command I ran: Splitting allc files for chromosome Chr1 Running rms tests for chromosome Chr1 Merging sorted /opt/afernabio/sequencing_data/mkramer/projects/emseq/data/test_at_data/DMRfind/test.sample_rms_results.tsv files. Begin FDR Correction m0 estimate for iteration 0: 0 Calculating Residual Cutoff There are no null residuals to calculate resid_cutoff. Using 2 as the cutoff. Adding Methylation Levels Done In the output there are many intermediate files, and an empty rms_results.tsv file. Thanks for your help |
Weird. I cannot reproduce the error. Here was the log I got.
|
Could you try one more thing? If you just run For example
When you get the path, could you try the below command?
Are you able to see the below output?
|
How strange... Yes, I get that output. /home/ubuntu/miniconda3/envs/methylpy_env/lib/python3.10/site-packages/methylpy/run_rms_tests.out |
I'm trying to create a new conda env to ensure that isn't the issue, but it's taking a long time. Which download method did you do/do you recommend? |
Have you tried mamba? I found it much better than conda,. |
Hi Yupeng,
I'm running into some issues with the results generated by DMRfind. I have EM-seq reads from two different organisms to use in my DMR analysis. I've run DMRfind (methylpy 1.4.7) on Arabidopsis with the simplified command below with success:
Results:
However, running this similar command on another organism (with a scaffold genome) led to the generation of thousands of scaffold#_chunk#.tsv files in the results directory:
Results:
It's interesting because there was no obvious error in the output file and some of the DMRs seem to have compiled in the top four files.
Thank you for your help, and please let me know if you need any more information for troubleshooting this issue.
The text was updated successfully, but these errors were encountered: