Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

OWLDAvis #731

Merged
merged 25 commits into from
Nov 15, 2021
Merged

OWLDAvis #731

merged 25 commits into from
Nov 15, 2021

Conversation

ajdapretnar
Copy link
Collaborator

@ajdapretnar ajdapretnar commented Oct 18, 2021

Issue

Widget for LDA topic model visualization, inspired by pyLDAvis: https://github.com/bmabey/pyLDAvis

Description of changes

Add Relevant Terms widget for interactive exploration of relevant terms per topic.

Includes
  • Code changes
  • Tests
  • Documentation

@ajdapretnar
Copy link
Collaborator Author

ajdapretnar commented Oct 18, 2021

TODO:

  • test the widget (GUI)
  • test the method (should yield same results as pyLDAvis)
  • implement bar charts instead of Distributions column (remove weights, too)
  • documentation
  • tests
  • visual settings

@ajdapretnar
Copy link
Collaborator Author

Here's how the widget would look like. The distribution bars would be red (current gray) and blue (current white).

ldavis

@ajdapretnar ajdapretnar changed the title OWRelevant Terms OWLDAvis Oct 26, 2021
@ajdapretnar
Copy link
Collaborator Author

ajdapretnar commented Oct 26, 2021

To discuss:

  • display only words, not POS tags (actually, there's no way around it as the model should distinguish between work_NOUN and work_VERB)
  • check weights, because they seem wrong (they are also correct since they push up the topic_prob:marginal_prob for more frequent topics - could be an optional parameter)

@codecov-commenter
Copy link

codecov-commenter commented Nov 3, 2021

Codecov Report

Merging #731 (2eefd5c) into master (ee5709e) will increase coverage by 0.37%.
The diff coverage is 89.20%.

❗ Current head 2eefd5c differs from pull request most recent head fea8f68. Consider uploading reports for the commit fea8f68 to get more accurate results

@@            Coverage Diff             @@
##           master     #731      +/-   ##
==========================================
+ Coverage   74.24%   74.62%   +0.37%     
==========================================
  Files          72       74       +2     
  Lines        9591     9872     +281     
  Branches     1307     1331      +24     
==========================================
+ Hits         7121     7367     +246     
- Misses       2222     2250      +28     
- Partials      248      255       +7     

@ajdapretnar ajdapretnar merged commit 1dfa334 into biolab:master Nov 15, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants