Migrate to mkdocs #81

thomasmarwitz · 2024-08-14T13:32:43Z

Demonstrate how mkdocs-jupyter plugin can be used to execute and render Jupyter Notebooks in a similar way to MyST.

Output: https://metalearners--81.org.readthedocs.build/en/81/

TODO:

Fix enumeration in background.md
Docstrings can / should be adapted to mkdocs style
Jupyter notebooks should be adapted to mkdocs style (remove cell tags)
Discuss whether jupyter notebooks should be built once and checked into git for fast serving of docs (better DX than slow optuna notebook) => now we have an additional CI action
~~Wait for feature: Add option s.t. overridden members are able to "inherit" docstrings from corresponding members in parent classes mkdocstrings/python#194~~ => moved to griffe extension: https://github.com/mkdocstrings/griffe-inherited-docstrings (now works with version 1.1.1) TBD: maybe not include in copier template!
~~Look into ruff pydocstyle (replacement to pre-commit-hook docformatter)~~ => The docs migration is already huge

codecov · 2024-08-14T13:48:11Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 94.73%. Comparing base (8b371de) to head (1bf0671).

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #81      +/-   ##
==========================================
- Coverage   94.73%   94.73%   -0.01%     
==========================================
  Files          15       15              
  Lines        1806     1805       -1     
==========================================
- Hits         1711     1710       -1     
  Misses         95       95

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

kklein · 2024-08-27T07:59:32Z

Thanks for looking into this @thomasmarwitz !
Is it possible that the subscripts no longer need escaping?

See

from this PR compared to

from https://metalearners.readthedocs.io/en/latest/background.html#x-learner

thomasmarwitz · 2024-08-27T08:27:08Z

@kklein Thanks for pointing that out, looks like I completely missed this. You are right, you don't need to escape subscript anymore.

kklein · 2024-08-27T09:46:39Z

Aside from the math and enumeration topic mentioned above, what do you see as hurdles and next steps?
Might it make sense to test whether our readthedocs setup works fine with mkdocs?

pavelzw · 2024-08-27T09:59:23Z

BTW an alternative to readthedocs could be github pages in combination with https://github.com/jimporter/mike. Disadvantage would be that there is no preview for PRs. Advantage would be that there is no external configuration necessary.

kklein · 2024-08-27T11:53:50Z

Sounds interesting! Can we have branch-specific deployments with GitHub-pages? Given our somewhat heavy use of formulas, we rely a lot on rendered docs in PRs.

thomasmarwitz · 2024-08-28T12:18:20Z

I don't know whether branch based deployments can be accomplished out of the box with mike and gh-pages, that would be interesting to keep an eye on.

I just tried out to host the mkdocs documentation with readthedocs and that worked: https://metalearners--81.org.readthedocs.build/en/81/

kklein · 2024-08-29T10:37:32Z

But perhaps this is even desired behavior? In the sense that if you change the implementation, you should also adapt the docstring?

I can totally see that one might want to ask that question. In our case we mostly described the 'contract' of a given method/function in the docstrings, i.e. what a user may provide as input and what they can expect as an output. We don't usually say much about the 'how's.

At times, changing the implementation does not change this contract.

E.g. one might think of the folllowing:

class Shape:
    contour: list[Point]
    def surface_area(self):
        """Return the surface area in square meters."""
        return numerical_integration(self.contour)
        
class Rectange(Shape)
    def surface_area(self):
        return distance(self.cotour[2], self.contour[0]) * distance(self.contour[3], self.contour[1])

In this case we used said sphinx feature in order to DRY and reduce the risk of inconsistencies due to redundancy.

pavelzw · 2024-08-29T11:23:47Z

Can we have branch-specific deployments with GitHub-pages? Given our somewhat heavy use of formulas, we rely a lot on rendered docs in PRs.

I don't think so, so in this case readthedocs might be more fitting.
Note though, that with mkdocs you can easily spin up a dev docs instance using pixi run docs or pixi run mkdocs serve.

mkdocs.yml

docs/styles/custom.css

pixi.toml

kklein · 2024-12-02T12:49:44Z

Thanks for your work @thomasmarwitz ! A couple of observations on the rendered docs:

We have quite a few diagrams with a transparent background and black or dark grey font color. Given the dark background of the mkdocs theme, these are not so easily legible, e.g. https://metalearners--81.org.readthedocs.build/en/81/motivation/#accessing-base-models.
Some of the math isn't rendered as intended, see e.g. the X-Learner paragraph, the R-Learner paragraph or the DR-Learner paragraph.
For some reason 'Covariates' in the glossary is treated surprisingly, see here.
Some cross-references aren't displayed as intended yet, see e.g. the second bullet in the example on using a first MetaLearner.
The examples no longer show the output of the code cells, e.g. here.
The Table of Content of this Linear Regression paragraph has three colon-subparagraphs which I don't see in the content.
Some cross-references aren't picked up, see e.g. the first usage of {term} and {ref} in the known propensity example .
Some notes aren't picked up properly, see e.g. the {note} reference in this paragraph.
RST Urls from docstring aren't rendered as intended, see e.g. the arxiv links in the docs of the DR-Learner class.
I can't find the parameter and return value type hints in the API docs, e.g. in the evaluate method docs.

thomasmarwitz · 2024-12-03T12:29:15Z

@kklein Thanks for examining the current progress so carefully. I checked each point to see whether there is some way of solving it. I haven't implemented everything as changing all URLs, adjusting all docstrings etc. takes some time.

We have quite a few diagrams with a transparent background and black or dark grey font color. Given the dark background of the mkdocs theme, these are not so easily legible, e.g. metalearners--81.org.readthedocs.build/en/81/motivation#accessing-base-models.

We can disable the dark and auto-mode as a temporary work-around. The site will then be always displayed in light mode.
A solution can be the neat conditional rendering of images depending on light/dark mode, this would require a dark-mode friendly version of the images, though.

Some of the math isn't rendered as intended, see e.g. the X-Learner paragraph, the R-Learner paragraph or the DR-Learner paragraph.

Thanks for pointing that out. As it turned out, prettier adjusted the tabwidth in markdown, that's why the version on github was always broken. I found a way to override the tabwidth for markdown with 4.

For some reason 'Covariates' in the glossary is treated surprisingly, see here.

Fixed, typo in heading.

Some cross-references aren't displayed as intended yet, see e.g. the second bullet in the example on using a first MetaLearner.

I have to replace all sphinx-like references with mkdocs / normal links. Just not there yet.

The examples no longer show the output of the code cells, e.g. here.

I think, this point needs some discussion.

During development, I turned off the execute option during docs build as the building of the optuna notebook (through mkdocs) takes >20 min on my mac (even by excluding optuna, the remaning notebooks need 5 min to be all converted). This leads to all notebooks being used in their current state, some may have no output, but some have e.g. optuna.

When building the documentation in CI, we could execute all notebooks beforehand to ensure everything is up to date and then build the documentation. Executing all notebooks everytime, makes the mkdocs serve rather inconvenient as I expect high iteration speeds from such a live server.

WDYT? (also @pavelzw as you are a huge fan of the serve option)

The Table of Content of this Linear Regression paragraph has three colon-subparagraphs which I don't see in the content.

Appears to be a bug, as a work-around we can remove Headings using "``" to wrap code-like elements. If we use that feature frequently, I can open an issue.

Some cross-references aren't picked up, see e.g. the first usage of {term} and {ref} in the known propensity example .

This is also sphinx-specific terminology that I haven't replaced completely with the mkdocs equivalent.

Some notes aren't picked up properly, see e.g. the {note} reference in this paragraph.

Mkdocs has nice admonition rendering: https://squidfunk.github.io/mkdocs-material/reference/admonitions/#+type:note. In jupyter notebooks, the syntactic sugar to render those admonitions is not (sadly) not available. A solution I found is to refer to the "compiled" html elements directly e.g.

<!-- mkdocs note -->
<div class="admonition note">
    <p class="admonition-title">Note</p>
    <p style="margin-top: 0.6rem">The fact that we have a fixed propensity score for all observations is not true for this dataset, we just use it for illustrational purposes.</p>
</div>

This produces a note, similar to sphinx:

Again, this is not really pretty. If we find ourselves using that a lot, we can also think about opening an issue to support the more concise markdown syntax.

RST Urls from docstring aren't rendered as intended, see e.g. the arxiv links in the docs of the DR-Learner class.

This is also sphinx-specific terminology that I haven't replaced completely with the mkdocs equivalent.

I can't find the parameter and return value type hints in the API docs, e.g. in the evaluate method docs.

I found a setting to render the type hints:

If we add black as a dependency in the docs feature, we get a formatted signature (that looks much better imo):

For a table like display (there are also other display options) like this:

The function needs a numpy, google or sphinx docstring: https://mkdocstrings.github.io/python/usage/configuration/docstrings/#docstring_style. There is a tendency that google docstrings has more features coming.

The nice thing here is that we don't need this google style or whatever everywhere. One can sprinkle that in if e.g. a param needs some explanation as in the screenshot. In all other cases, the param names and types seem pretty solid.

thomasmarwitz · 2024-12-03T12:32:45Z

I'm afraid this is taking so long - that's mainly because migration all sphinx specific syntax to mkdocs (even in docstrings or jupyter notebooks) takes some time and I can only spend a certain amount of time per week on this.

@kklein I'll ping you here explicitly once I've migrated all docstrings and jupyter notebooks.

kklein · 2024-12-05T10:07:35Z

I think, this point needs some discussion.

I totally second your take that the runtime caused by the execution of the notebooks upon building of the docs is a pain.

At the same time, I think that the output of the cells creates a lot of value for a reader of the docs.

If we can find a solution which provides the outputs while reducing the amount of time used for docs-building that'd be great.

To give you some context: We didn't start off by using jupyter notebooks at all for these code examples. Rather, we executed the corresponding code blocks by hand and mode the code blocks as well as the outputs into rst. This approach clearly has the downside of

there being no alerting mechanism if code doesn't run (anymore)
there being a lot of development overhead

If we add black as a dependency in the docs feature, we get a formatted signature (that looks much better imo):

Looks great; I think adding black as a docs dependency is not problem at all. :)

thomasmarwitz · 2024-12-10T14:45:27Z

If we can find a solution which provides the outputs while reducing the amount of time used for docs-building that'd be great.

@kklein I talked w/ @pavelzw about how we could address this. I agree that having outputs is very valuable to the reader.

Our idea is:

Have an additional check here in CI that checks whether the execution count for each cell of the jupyter notebooks is not null, i.e. the notebook has been executed and, fail if we encounter an unexecuted cell. Thus, it becomes the task of the person changing something in the jupyter notebook or adding a new one to execute it at least once.
In the docs build, we can just copy over the output of all already executed cells.

kklein · 2024-12-11T17:08:32Z

@thomasmarwitz sgtm :)

…ution.

Forgot during rebase.

Co-authored-by: Pavel Zwerschke <[email protected]>

This is necessary for prettier not to break math rendering which only works when indented 0 or 4 spaces.

Check via execution count in a bash script.

thomasmarwitz changed the title ~~Remove sphinx specific formatting, adjust heading syntax~~ Migrate to mkdocs Aug 21, 2024

This comment was marked as resolved.

Sign in to view

This comment was marked as outdated.

Sign in to view

This comment was marked as resolved.

Sign in to view

thomasmarwitz commented Nov 19, 2024

View reviewed changes

mkdocs.yml Outdated Show resolved Hide resolved

pavelzw reviewed Nov 19, 2024

View reviewed changes

docs/styles/custom.css Outdated Show resolved Hide resolved

docs/styles/custom.css Outdated Show resolved Hide resolved

pixi.toml Outdated Show resolved Hide resolved

thomasmarwitz force-pushed the mkdocs branch 2 times, most recently from 0bcbb39 to cb6c281 Compare November 26, 2024 14:31

thomasmarwitz marked this pull request as ready for review November 29, 2024 14:47

thomasmarwitz requested a review from kklein as a code owner November 29, 2024 14:47

thomasmarwitz marked this pull request as draft November 29, 2024 14:47

thomasmarwitz added 4 commits December 13, 2024 10:26

Remove sphinx specific formatting, adjust heading syntax

6986e8b

Add mkdocs deps and 'mkdocs' task.

42e6728

Add configuration to demonstrate rendering a jupyter notebook

820cec2

Reorder examples to match order in sphinx. Skip costly optuna jn exec…

c969414

…ution.

thomasmarwitz and others added 24 commits December 13, 2024 10:26

Fix links

dd612a0

Adapt pixi doc-related tasks

41b7067

Fix links

c73af9d

Enable sphinx-like behavior through mkdocstrings-python extension

6ffc86a

Remove custom target css, add note what this css selector does

bcb8a96

Fix math and math in enumerations (with indentation)

d3ea328

Remove index.rst again.

d31a6f3

Forgot during rebase.

Use list like format as in sphinx

2f97cde

Mirror sphinx layout

776776c

Actual indent

a612331

Add checkboxes in motivation part

2950de8

Revert

a5a1340

Indent

b2a4ff4

Prettier

bbffee8

Update docs/styles/custom.css

ff0f33a

Co-authored-by: Pavel Zwerschke <[email protected]>

Use lower case bool values

79d26f5

Adjust pixi docs tasks

ba4a38e

Resolve link to notebook

fad29ae

Remove target css selector

dc4b9d7

Fixes

40023fb

Adapt to mkdocs style (links, code, math)

412ba6e

Set indent to 4 in prettier for markdown.

4ab14a3

This is necessary for prettier not to break math rendering which only works when indented 0 or 4 spaces.

--wip-- [skip ci]

a5a6fc3

Check output cells of notebooks

2bf8bd1

thomasmarwitz force-pushed the mkdocs branch 2 times, most recently from a506cd4 to ff0a208 Compare December 13, 2024 09:33

Check outputs of example notebooks in CI.

7fbabd5

Check via execution count in a bash script.

thomasmarwitz force-pushed the mkdocs branch from ff0a208 to 7fbabd5 Compare December 13, 2024 09:39

thomasmarwitz added 2 commits December 13, 2024 11:45

Add outputs

2ef89f1

Fix bash for loop with default option

1bf0671

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Migrate to mkdocs #81

Migrate to mkdocs #81

thomasmarwitz commented Aug 14, 2024 •

edited

Loading

codecov bot commented Aug 14, 2024 •

edited

Loading

This comment was marked as resolved.

kklein commented Aug 27, 2024

thomasmarwitz commented Aug 27, 2024

kklein commented Aug 27, 2024

pavelzw commented Aug 27, 2024 •

edited

Loading

kklein commented Aug 27, 2024

thomasmarwitz commented Aug 28, 2024

This comment was marked as outdated.

This comment was marked as resolved.

kklein commented Aug 29, 2024 •

edited

Loading

pavelzw commented Aug 29, 2024

kklein commented Dec 2, 2024 •

edited

Loading

thomasmarwitz commented Dec 3, 2024 •

edited

Loading

thomasmarwitz commented Dec 3, 2024

kklein commented Dec 5, 2024

thomasmarwitz commented Dec 10, 2024 •

edited

Loading

kklein commented Dec 11, 2024

Migrate to mkdocs #81

Are you sure you want to change the base?

Migrate to mkdocs #81

Conversation

thomasmarwitz commented Aug 14, 2024 • edited Loading

TODO:

codecov bot commented Aug 14, 2024 • edited Loading

Codecov Report

This comment was marked as resolved.

kklein commented Aug 27, 2024

thomasmarwitz commented Aug 27, 2024

kklein commented Aug 27, 2024

pavelzw commented Aug 27, 2024 • edited Loading

kklein commented Aug 27, 2024

thomasmarwitz commented Aug 28, 2024

This comment was marked as outdated.

This comment was marked as resolved.

kklein commented Aug 29, 2024 • edited Loading

pavelzw commented Aug 29, 2024

kklein commented Dec 2, 2024 • edited Loading

thomasmarwitz commented Dec 3, 2024 • edited Loading

thomasmarwitz commented Dec 3, 2024

kklein commented Dec 5, 2024

thomasmarwitz commented Dec 10, 2024 • edited Loading

kklein commented Dec 11, 2024

thomasmarwitz commented Aug 14, 2024 •

edited

Loading

codecov bot commented Aug 14, 2024 •

edited

Loading

pavelzw commented Aug 27, 2024 •

edited

Loading

kklein commented Aug 29, 2024 •

edited

Loading

kklein commented Dec 2, 2024 •

edited

Loading

thomasmarwitz commented Dec 3, 2024 •

edited

Loading

thomasmarwitz commented Dec 10, 2024 •

edited

Loading