Never copy and paste code into documentation again!
π See a live example of a post made with notebooks π π
This repo provides an example of how to create blog posts and/or documentation with Docusaurus using Jupyter Notebooks. When you create documentation with notebooks:
- Your documentation is fully testable with unit tests
- No more copy / pasting code into your blog posts and docs
- The output of code is always up to date, since it is generated by actually running code, rather than copying and pasting it from somewhere else.
- You can author technical posts and docs in a WYSIWYG manner, which allows you to iterate faster.
- You can still write markdown posts if you want to.
- Background
- Getting Started
- Using Notebooks
- Using Markdown
- Running the documentation locally
- Automatic publishing
- About
This project assumes some familiarity with static site generators. It is also very helpful to gain some familiarity with Docusaurus and it's general features.
To create your own blog or docs site, you can start by forking this repo, or creating a new repo by using this one as a repository template.
This repo has both blog and docs, but you might want to enable blog only mode or docs only mode
After that, you should change the follwing lines in the settings.ini file. The keys are described below:
[DEFAULT]
nbs_path = .
recursive = True
tst_flags = notest
-user = outerbounds
+user = <your github username>
-doc_host = https://outerbounds.github.io
+doc_host = https://<your github user name>.github.io or your custom domain
-doc_baseurl = /nbdoc-docusaurus
+doc_baseurl = /<name of your repo> or path
-module_baseurls = metaflow=https://github.com/Netflix/metaflow/tree/master/
+module_baseurls = <your python module>=https://github.com/a/path/tree/master/
The definintions of these fields are as follows:
nbs_path
: the top most in your directory that contains notebooks to be converted to markdown files. The default value for this.
, which just means the root of the repo.recursive
: set toTrue
if you want to recurisvely find all notebooks undernbs_path
, andFalse
otherwise. Defaults toTrue
tst_flags
: special cell comment that will allow yout skip cell execution and test. For example, adding the comment#notest
in a code cell would result skip cell execution for that cell. You can add mulitple values in this field seperated by a pipe|
doc_host
: this is the domain where your docs are served. If your docs are served on GitHub Pages, this is typicallyyour-username.github.io
doc_baseurl
: the path on your domain that has the docs. This is usually the root/
so doesn't normally need to be changed.module_baseurl
: This is optional for a situations where you want to document python apis. This allows documentation to include links to source code.
-
Before you get started, you need to make sure you have node and npm installed.
-
Next, create an isolated python environment using your favorite tool such as
conda
,pipenv
,poetry
etc. Then, from the root of this repo run this command in the terminal:make install
Note that this references requirements.txt
, which you will have to update appropriately anytime you include addtional python dependencies in your documentation.
-
You need to open 3 different terminal windows (I recommend using split panes), and run the following commands in three seperate windows:
Note: we tried to use docker-compose but had trouble getting some things to run on Apple Silicon, so this will have to do for now.
Start the docs server:
make docs
Watch for changes to notebooks:
make watch
Start Jupyter Lab:
make nb
-
Open a browser window for the authoring guide in the docs. You may have to hard-refresh the first time you make a change, but hot-reloading generally works.
To go through the tutorial, open the rendered authoring guide in one window, and the notebook docs/nb.ipynb in another window. Study the notebook cells and the corresponding rendered page carefully. You will see many options for:
- How to deal with scripts vs interactive code cells
- How to run bash commands / scripts
- How to setup front matter for page configuration
- Hiding/Showing cell inputs, outputs or both
- Selectively displaying Metaflow logs
- How to setup tests
- How to Author API docs
After you complete the tutorial, you should run through this list and make sure you know how to do all of the above tasks.
To test the notebooks, run
make test
from the root of this repo. This will execute all notebooks in parallel and report an error if there are any errors found.
If you want to skip certain cells from running in tests because they take a really long time, you can place the comment
#notest
at the top of the cell. You can add additional flags by adding pipe-delimited values into thetst_flags
field of settings.ini.
You can refresh all notebooks in place with
make update
. This runs all notebooks programatically, skipping over cells with#notest
(or othertst_flags
flags you set) and saves the result to the same filename. This will also re-generate the markdown files from the updated notebooks.
The reason for providing
make update
seperately frommake test
is so you can have the option of first testing your notebooks without overwriting them.
We have previously discussed that you can skip tests or notebook refreshes in specific cells with the flag
#notest
(or other flags you set in settings.ini). You can force these cells to be run or refreshed with the--flags
argument. For example, let's say we want to run tests or refresh notebooks and do not want to ignore cells with the#notest
flag:
- For testing:
nboc_test --flags #notest
- For refreshing notebooks:
nbdoc_update --flags #notest
People complain about "state" in Jupyter. This can be easily avoided by frequently restarting the kernel and running all cells from the top. Thankfully, you can set a hotkey that allows you to do this effortlessly. In Jupyter Lab, go to
Settings
thenAdvanced Settings Editor
. Copy and paste the below json into theUser Prefences
pane. If you already have user-defined shortcuts, modify this appropriately.
{
"shortcuts": [
{
"command": "notebook:restart-run-all",
"keys": [
"Ctrl R",
"R"
],
"selector": "body",
},
{
"command": "notebook:restart-and-run-to-selected",
"keys": [
"Ctrl R",
"S"
],
"selector": "body",
},
...
}
You do not have to use notebooks to update documentation. You can use markdown as you would normally do with Docusaurus. This is a good option when there isn't much code in the document you are trying to write.
- Clone this repo
cd nbdoc-docusarus
make install
make docs
Any saved changes that you make to the
.md
files in thedocs
orblogs
directory will automatically be reflected at the local preview page.
Any pushes to the
master
branch will automatically publish to the live documentation pages . The publishing uses GitHub Actions and Github pages. You can see the progress of the publish action by going to theActions
tab of this repository.
We use nbdoc as a notebook converter and testing utility, as well as docusaurus for the static site generator. This technology is built using nbdev, a literate programming system.