Feat/parse nextclade annotation #1263

Open
wants to merge 16 commits into
base: master
from

rneher
Member

@rneher rneher commented Jul 26, 2023

Description of proposed changes

Parse genome annotation output from Nextclade (v3) in the genemap format.

Checklist

  • Add a message in CHANGES.md summarizing the changes in this PR that are end user focused. Keep headers and formatting consistent with the rest of the file.

rneher added 10 commits July 22, 2023 14:41
augur ancestral so far only dealt with nucleotide sequences, while
translations were done codon by codon from the nucleotide sequences in
later steps. If translations for tips are available, the ancestral
amino acid sequences can be determined analogously via ancestral
reconstruction. This is implemented in this commit.

For each node, the node data structure will contain a `muts` and an
`aa_muts` field. The latter is a dict with gene/cds names as keys and
`<ancestral><pos><derived>` mutations as values (same as in augur
translate). In addition, an annotation and reference sequences are
derived and stored in the node-data-json.
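
As a rough illustration of the `muts` / `aa_muts` layout described above, the sketch below builds one node entry by comparing a parent sequence to a child sequence. It is not the code from this PR: the helper names `diff_sequences` and `parse_mutation`, the gene name `HA`, the toy sequences, and the use of 1-based positions are all assumptions made for the example.

```python
import re

# Assumed mutation format: one ancestral character, a position, one derived character.
MUTATION_PATTERN = re.compile(r"^(?P<ancestral>[A-Z*\-])(?P<pos>\d+)(?P<derived>[A-Z*\-])$")


def diff_sequences(parent, child):
    """Return `<ancestral><pos><derived>` strings where child differs from parent.

    Positions are 1-based here (an assumption for this sketch).
    """
    if len(parent) != len(child):
        raise ValueError("sequences must be aligned to the same length")
    return [
        f"{anc}{pos}{der}"
        for pos, (anc, der) in enumerate(zip(parent, child), start=1)
        if anc != der
    ]


def parse_mutation(mutation):
    """Split a mutation string into (ancestral, position, derived)."""
    match = MUTATION_PATTERN.match(mutation)
    if match is None:
        raise ValueError(f"not a valid mutation string: {mutation!r}")
    return match["ancestral"], int(match["pos"]), match["derived"]


# Hypothetical node entry: nucleotide mutations plus per-gene amino acid mutations.
node = {
    "muts": diff_sequences("ATGGCC", "ATGACC"),
    "aa_muts": {"HA": diff_sequences("MKTA", "MRTA")},
}

print(node)
print(parse_mutation(node["aa_muts"]["HA"][0]))
```

Keeping amino acid mutations keyed by gene/cds name means a consumer can look up changes for one gene without re-translating the nucleotide sequence.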
huddlej and others added 6 commits July 27, 2023 13:46
Reorganize functional tests for augur ancestral to follow the standard
pattern with "cram" and "data" subdirectories and separate cram files
for individual functional tests. Paves the way for adding another
functional test for amino acid sequence reconstruction.
Adds amino acid sequences for the corresponding nucleotide alignment in
the ancestral data directory (created from augur translate but mimicking
output from Nextalign) and adds a functional test for the new interface
that allows reconstruction of amino acid sequences for internal nodes
from the given tip sequences per gene.
@huddlej huddlej force-pushed the feat/aa-ancestral branch from c4c2dec to 7cfbd9b Compare August 11, 2023 18:31
Base automatically changed from feat/aa-ancestral to master August 11, 2023 18:49

2 participants