Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

evcomplex scoring model #234

Open
cross12tamu opened this issue Apr 17, 2020 · 6 comments
Open

evcomplex scoring model #234

cross12tamu opened this issue Apr 17, 2020 · 6 comments

Comments

@cross12tamu
Copy link

cross12tamu commented Apr 17, 2020

I have ran different jobs using two primary scoring models, skewnormal and evcomplex. My question is regarding results from evcomplex jobs, which is, what does it mean to have a probability > 1. We have numerous pairs resulting in scores > 1, and want to verify with y'all the meaning.

@jrr-cpt

@jrr-cpt
Copy link

jrr-cpt commented Apr 17, 2020

Jobs identical in all parameters but the scoring_model (using skewnormal) do not yield probabilities greater than 1. Given that we are analyzing two proteins, the evcomplex algorithm seemed the appropriate choice. I was surprised to see that the scoring model used on the webserver for couplings jobs is skewnormal. Since every output we have generated locally using evcomplex has probabilities greater than one, in addition to the above question, is evcomplex not the best scoring model for this type of analysis?

@cross12tamu
Copy link
Author

Hey @thomashopf , I hope you are doing well!

I'm just wondering if you have had any time to take a look at this?

Thanks as always for your time and brain energy 😄

@thomashopf
Copy link
Contributor

@aggreen Could you maybe have a look at this?

@cross12tamu
Copy link
Author

Let me know if you need any more information @aggreen

@aggreen
Copy link
Contributor

aggreen commented Apr 27, 2020 via email

@cross12tamu
Copy link
Author

Thanks @aggreen

For the "probability (model_score)" column, is there a standard or simply a method, on understanding the values (specifically, the range of values)?

The contact map generation confuses me a bit, with default settings being 0.95/0.99 cutoffs, and me thinking of that cutoff being "probability" driven and not "model_score" driven. Although, those values are what it uses (right?) when selecting the plot.

For context this column (probability) is of interest to us, due to the resultant cn values not "jumping out". But, ec pairs from the probability/model_score column that rank high, are biologically intriguing, although again, with low cn scores.

Let me know if any of my questions are confusing, and I can reiterate/clarify. Thanks as always.

ASIDE I saw the evcomplex2 PR. 👍

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants