feat: Automatic GEM scoring using custom memote integration #252

Open · JonathanRob opened this issue Apr 21, 2021 · 10 comments

@JonathanRob (Collaborator)

Description of the issue:

The memote package provides a really nice standardized tool for evaluating the quality of a GEM, and has been used during the curation of Human-GEM to identify problems or weak points in the model. Although I feel a complete integration of memote may be overkill and potentially incompatible with our current repo framework, I think a custom lightweight integration using GitHub Actions could be a nice way to automatically track some scores of interest, such as % reactions balanced, annotation coverage, etc.

The implementation would be very straightforward, and would return a JSON file that could be parsed and presented/stored in whichever format we choose (a parsing sketch follows the list below). However, some questions remain:

  1. When should the scoring action be run?
    • On every PR (to devel/master/etc)?
    • Only with new releases?
  2. Where should the scores be shown?
    • As a comment on a PR?
    • In a newly generated text/markdown file somewhere in the repo?
  3. Should the scores be stored in some sort of log file?
    • Keep a historical log of scores over time
    • Would likely be a headache since we may change which tests are run
    • Maybe replace the existing "score file" with a new one anytime the action is run?
  4. How should the output be formatted?
    • Will depend on some of the questions above
    • tsv, markdown, etc.
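
For illustration, a minimal sketch of the parsing step mentioned above, assuming memote's results have already been stored to a file. The file name, and the assumption that each entry under a top-level "tests" key carries "title" and numeric "metric" fields, are guesses about memote's output layout rather than guaranteed behavior:

```python
import json

# Hypothetical file name; depends on how memote is invoked in the action.
with open("memote-result.json") as handle:
    result = json.load(handle)

# Pull out a few scores of interest. The "tests"/"metric" layout is an
# assumption about memote's result JSON, not a documented contract.
for test_id, test in result.get("tests", {}).items():
    metric = test.get("metric")
    if isinstance(metric, (int, float)):  # some tests report non-scalar data
        print(f"{test.get('title', test_id)}: {metric:.1%}")
```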

Expected feature/value/output:

A lightweight, automated scoring script to periodically report a few model statistics of interest.

I hereby confirm that I have:

  • Checked that a similar issue does not exist already

@mihai-sysbio (Member)

I'd suggest having a look at how this is set up for Yeast-GEM in this PR. It uses GH Actions to do both a simple memote run and also a memote history run. I imagine these workflows can be copy-pasted to a large extent.

@haowang-bioinfo (Member)

@JonathanRob very good point, getting memote testing integrated would be valuable.

@mihai-sysbio indeed, it would be ideal to maintain a similar (or the same) GH action as Yeast-GEM.

@mihai-sysbio (Member)

Initial thoughts:

  1. on every PR (as long as the output is compact)
  2. as a PR comment (to keep the repo clean)
  3. don't keep anything, as memote's history report can comb through the entire history when producing the report
  4. in the PR comment, something markdown-like and easy to read

A longer output can be printed out in the Action run, where it will be stored for 90 days (I think that's the default setting). One could also generate a full html report, and store that in the main branch, but that is outside the scope as stated:

A lightweight, automated scoring script to periodically report a few model statistics of interest.

@haowang-bioinfo (Member) commented Apr 24, 2021

Some comments on your thoughts, @mihai-sysbio:

  1. on every PR (as long as the output is compact)

Let's start with the PR from develop to master.

  2. as a PR comment (to keep the repo clean)

Agree, this seems infeasible.

  3. don't keep anything, as memote's history report can comb through the entire history when producing the report
  4. in the PR comment, something markdown-like and easy to read

A markdown report with a few statistics sounds good.
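
(For illustration, a minimal sketch of assembling such a compact markdown report, assuming the scores of interest have already been extracted; the metric names and values below are made up:)

```python
# Purely illustrative scores; in practice these would come from the
# parsed memote results.
scores = {
    "Reactions mass-balanced": 0.87,
    "Metabolites annotated": 0.93,
    "Reactions annotated": 0.78,
}

# Build a small markdown table that an action could post as a PR comment.
rows = ["| Metric | Score |", "| --- | --- |"]
rows += [f"| {name} | {value:.1%} |" for name, value in scores.items()]
print("\n".join(rows))
```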

@mihai-sysbio (Member)

let's start with the PR from develop to master

This is going to be weird to test and merge before it gets to master.

@haowang-bioinfo (Member) commented Apr 24, 2021

This is going to be weird to test and merge before it gets to master.

@mihai-sysbio not sure what you mean. Fun fact: memote only accepts models in xml format, which exists only on master and is updated only once a PR from develop is merged into master.

@mihai-sysbio (Member) commented Apr 24, 2021

@Hao-Chalmers I think memote requires cobra to be installed, in which case maybe the xml format can be obtained with cobra.io.write_sbml_model.

Edit: this approach might yield low scores on the annotations, so these should not be included in the PR comments.
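
(A minimal sketch of that export step, assuming the model is loaded from the repo's YAML distribution; the file paths here are assumptions:)

```python
import cobra

# Path is an assumption; Human-GEM ships the model in several formats.
model = cobra.io.load_yaml_model("model/Human-GEM.yml")

# Write a temporary SBML (xml) file for memote to consume. Annotation
# fields present only in the curated xml on master may be missing here,
# which is why annotation scores could come out low.
cobra.io.write_sbml_model(model, "Human-GEM-tmp.xml")
```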

@haowang-bioinfo (Member)

Yes, there will be low scores in any PR, because an xml file with integrated annotation fields is updated only in the master branch, to which a markdown report might be added. Not sure whether that should be a standalone file, an integrated section of the README, or something else?

@mihai-sysbio (Member)

@Hao-Chalmers, with the help of an Action runner with Matlab, the model could be exported via RAVEN in xml format, including all the annotations, exactly like on master, before running memote. These temporary files would not be committed, but some memote scores could be kept in a file or (my preference) in the PR comments.

@mihai-sysbio (Member) commented Aug 12, 2021

A workflow to run memote on a PR has been merged and released. It shouldn't be too time-consuming to adopt this, or other memote actions, in Human-GEM. However, in line with previous comments, several questions need clear answers:

  • under which conditions should the action be triggered?
  • what parameters should be used in the memote command?
  • which parts of the report should be posted as PR comments (e.g. a small selection of scores)?
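
(For reference, a hedged sketch of driving memote from Python inside such a workflow, using memote.suite.api.test_model; the exact signature and result layout may differ between memote versions:)

```python
import cobra
from memote.suite.api import test_model

# File name is an assumption; see the export sketch above.
model = cobra.io.read_sbml_model("Human-GEM-tmp.xml")

# Run the memote test suite; `results` holds the data memote serializes
# for its reports, from which a small selection of scores could be
# extracted and posted as a PR comment.
code, results = test_model(model, results=True)
```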
