Skip to content

Commit

Permalink
Add files to Nextclade Dataset
Browse files Browse the repository at this point in the history
  • Loading branch information
kimandrews committed May 23, 2024
1 parent 751cb5c commit c211eea
Show file tree
Hide file tree
Showing 5 changed files with 2,150 additions and 829 deletions.
3 changes: 3 additions & 0 deletions nextclade_dataset/CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
## Unreleased

Initial release.
31 changes: 31 additions & 0 deletions nextclade_dataset/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,31 @@
# Measles dataset

| Key | Value |
| ----------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| authors | [Nextstrain](https://nextstrain.org) |
| reference | NC_001498.1 |
| workflow | https://github.com/nextstrain/measles/tree/main/nextclade |
| path | `nextstrain/measles` |


## Scope of this dataset

This dataset assigns genotypes to measles samples based on [criteria outlined by the WHO](https://www.who.int/publications/i/item/WER8709).

The WHO has defined 24 measles genotypes based on N gene and H gene sequences from 28 reference strains. For new measles samples, genotypes can be assigned based on genetic similarity to the reference strains at the "N450" region (a 450 bp region of the N gene).

The reference tree used in this dataset includes N450 sequences for the 28 reference strains, along with other representative strains for each genotype.

This dataset can be used to assign genotypes to any sequence that includes at least 400 bp of the N450 region, including whole genome sequences. Sequence data beyond the N450 region will be reported as an insertion in the Nextclade output.

## Features

This dataset supports:

- Assignment of genotypes
- Phylogenetic placement
- Sequence quality control (QC)

## What are Nextclade datasets

Read more about Nextclade datasets in the Nextclade documentation: https://docs.nextstrain.org/projects/nextclade/en/stable/user/datasets.html
Loading

0 comments on commit c211eea

Please sign in to comment.