Skip to content

Commit

Permalink
Updated README
Browse files Browse the repository at this point in the history
  • Loading branch information
GallVp committed Jul 2, 2024
1 parent 864b564 commit f829ed1
Show file tree
Hide file tree
Showing 12 changed files with 58 additions and 24 deletions.
2 changes: 1 addition & 1 deletion .nf-core.yml
Original file line number Diff line number Diff line change
Expand Up @@ -23,7 +23,7 @@ nf_core_version: 2.14.2.dev0
repository_type: pipeline
template:
author: Usman Rashid
description: A pipeline to preprocess raw NGS data for downstream reuse across various
description: A pipeline to pre-process raw NGS data for downstream reuse across various
pipelines
force: true
is_nfcore: false
Expand Down
61 changes: 46 additions & 15 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -12,14 +12,34 @@

**gallvp/prepngs** is a bioinformatics pipeline that pre-processes NGS data for downstream reuse across various pipelines.

<!-- TODO nf-core: Include a figure that guides the user through the major workflow steps. Many nf-core
workflows use the "tube map" design for that. See https://nf-co.re/docs/contributing/design_guidelines#examples for examples. -->
<!-- TODO nf-core: Fill in short bullet-pointed list of the default steps in the pipeline -->
```mermaid
%%{init: {
'theme': 'base',
'themeVariables': {
'fontSize': '52px",
'primaryColor': '#9A6421',
'primaryTextColor': '#ffffff',
'primaryBorderColor': '#9A6421',
'lineColor': '#B180A8',
'secondaryColor': '#455C58',
'tertiaryColor': '#ffffff'
}
}}%%
flowchart LR
samplesheet(samplesheet.csv) ==> BAM2FASTQ
samplesheet(samplesheet.csv) ==> FASTQC
BAM2FASTQ ==> FASTQC
FASTQC ==> FASTP
FASTP ==> FASTQC2[FASTQC]
FASTQC2 ==> MULTIQC
```

1. Raw read QC ([`FastQC`](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/))
2. Adapter trimming ([`FASTP`](https://github.com/OpenGene/fastp))
3. Trimmed read QC ([`FastQC`](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/))
4. Present QC for raw reads ([`MultiQC`](http://multiqc.info/))
1. BAM to FASTQ ([`PBTK`](https://github.com/PacificBiosciences/pbtk)), `optional`
2. Raw read QC ([`FastQC`](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/))
3. Adapter trimming ([`FASTP`](https://github.com/OpenGene/fastp))
4. Trimmed read QC ([`FastQC`](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/))
5. Present QC for raw reads ([`MultiQC`](http://multiqc.info/))

## Usage

Expand All @@ -31,16 +51,15 @@ First, prepare a samplesheet with your input data that looks as follows:
`samplesheet.csv`:

```csv
sample,fastq_1,fastq_2
CONTROL_REP1,AEG588A1_S1_L002_R1_001.fastq.gz,AEG588A1_S1_L002_R2_001.fastq.gz
sample,reads_1,reads_2
test1,SRR4292758_00_R1.fastq.gz,SRR4292758_00_R2.fastq.gz
test2,alz.ccs.bam
```

Each row represents a fastq file (single-end) or a pair of fastq files (paired end).
Each row represents a fastq/bam file (single-end) or a pair of fastq files (paired end).

Now, you can run the pipeline using:

<!-- TODO nf-core: update the following command to include all required parameters for a minimal example -->

```bash
nextflow run gallvp/prepngs \
-profile <docker/singularity/.../institute> \
Expand All @@ -55,9 +74,21 @@ nextflow run gallvp/prepngs \

gallvp/prepngs was originally written by Usman Rashid.

We thank the following people for their extensive assistance in the development of this pipeline:

<!-- TODO nf-core: If applicable, make list of people who have also contributed -->
The pipeline uses nf-core modules contributed by following authors:

<a href="https://github.com/drpatelh"><img src="https://github.com/drpatelh.png" width="50" height="50"></a>
<a href="https://github.com/adamrtalbot"><img src="https://github.com/adamrtalbot.png" width="50" height="50"></a>
<a href="https://github.com/grst"><img src="https://github.com/grst.png" width="50" height="50"></a>
<a href="https://github.com/gallvp"><img src="https://github.com/gallvp.png" width="50" height="50"></a>
<a href="https://github.com/maxulysse"><img src="https://github.com/maxulysse.png" width="50" height="50"></a>
<a href="https://github.com/robsyme"><img src="https://github.com/robsyme.png" width="50" height="50"></a>
<a href="https://github.com/mbeavitt"><img src="https://github.com/mbeavitt.png" width="50" height="50"></a>
<a href="https://github.com/kevinmenden"><img src="https://github.com/kevinmenden.png" width="50" height="50"></a>
<a href="https://github.com/jfy133"><img src="https://github.com/jfy133.png" width="50" height="50"></a>
<a href="https://github.com/felixkrueger"><img src="https://github.com/felixkrueger.png" width="50" height="50"></a>
<a href="https://github.com/ewels"><img src="https://github.com/ewels.png" width="50" height="50"></a>
<a href="https://github.com/bunop"><img src="https://github.com/bunop.png" width="50" height="50"></a>
<a href="https://github.com/abhi18av"><img src="https://github.com/abhi18av.png" width="50" height="50"></a>

## Contributions and Support

Expand Down
5 changes: 5 additions & 0 deletions docs/contributors.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
#!/usr/bin/env bash

module_authors=$(find ./modules -name meta.yml | xargs -I {} grep -A20 'authors:' {} | grep '\- ' | tr -d '"' | tr '[:upper:]' '[:lower:]' | awk '{print $2}')
workflow_authors=$(find ./subworkflows -name meta.yml | xargs -I {} grep -A20 'authors:' {} | grep '\- ' | tr -d '"' | tr '[:upper:]' '[:lower:]' | awk '{print $2}')
echo -e "${module_authors}\n${workflow_authors}" | sort -V | uniq -c | sort -k1,1 -r | awk '{print $2}' | sed -n 's|@\(.*\)|<a href="https://github.com/\1"><img src="https://github.com/\1.png" width="50" height="50"></a>|p'
6 changes: 3 additions & 3 deletions modules.json
Original file line number Diff line number Diff line change
Expand Up @@ -2,12 +2,12 @@
"name": "gallvp/prepngs",
"homePage": "https://github.com/gallvp/prepngs",
"repos": {
"https://github.com/PlantandFoodResearch/nxf-modules.git": {
"https://github.com/GallVp/nxf-components.git": {
"modules": {
"pfr": {
"gallvp": {
"pbtk/pbindex": {
"branch": "main",
"git_sha": "429f064fb5fdf155405dda82a0334b8172f360e8",
"git_sha": "b1b21af9d0298a87a328c3e5664eea7904538cfb",
"installed_by": ["modules"]
}
}
Expand Down
File renamed without changes.
File renamed without changes.
File renamed without changes.
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@ nextflow_process {
process "PBTK_PBINDEX"

tag "modules"
tag "modules_pfr"
tag "modules_gallvp"
tag "pbtk"
tag "pbtk/pbindex"

Expand Down
2 changes: 0 additions & 2 deletions modules/pfr/pbtk/pbindex/tests/tags.yml

This file was deleted.

2 changes: 1 addition & 1 deletion nextflow_schema.json
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
"$schema": "http://json-schema.org/draft-07/schema",
"$id": "https://raw.githubusercontent.com/gallvp/prepngs/master/nextflow_schema.json",
"title": "gallvp/prepngs pipeline parameters",
"description": "A pipeline to preprocess raw NGS data for downstream reuse across various pipelines",
"description": "A pipeline to pre-process raw NGS data for downstream reuse across various pipelines",
"type": "object",
"definitions": {
"input_output_options": {
Expand Down
2 changes: 1 addition & 1 deletion workflows/prepngs.nf
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@
*/
include { paramsSummaryMap } from 'plugin/nf-validation'

include { PBTK_PBINDEX } from '../modules/pfr/pbtk/pbindex/main'
include { PBTK_PBINDEX } from '../modules/gallvp/pbtk/pbindex/main'
include { PBTK_BAM2FASTQ } from '../modules/nf-core/pbtk/bam2fastq/main'
include { MULTIQC } from '../modules/nf-core/multiqc/main'

Expand Down

0 comments on commit f829ed1

Please sign in to comment.