.. _`nbconvert script`:
Converting notebooks to other formats
=====================================
Newly added in the 1.0 release of IPython is the ``nbconvert`` tool, which
allows you to convert an ``.ipynb`` notebook document file into various static
formats.
Currently, ``nbconvert`` is provided as a command line tool, run as a script
using IPython. In the future, a direct export capability from within the
IPython Notebook web app is planned.
The command-line syntax to run the ``nbconvert`` script is::
$ ipython nbconvert --format=FORMAT notebook.ipynb
This will convert the IPython document file ``notebook.ipynb`` into the output
format given by the ``FORMAT`` string.
The default output format is HTML, for which the ``--format`` modifier may be
omitted::
$ ipython nbconvert notebook.ipynb
The currently supported export formats are the following:
* HTML:
- **full_html**:
Standard HTML
- **simple_html**:
Simplified HTML
- **reveal**:
HTML slideshow presentation for use with the ``reveal.js`` package
* PDF:
- **sphinx_howto**:
The format for Sphinx_ HOWTOs; similar to an ``article`` in LaTeX
- **sphinx_manual**:
The format for Sphinx_ manuals; similar to a ``book`` in LaTeX
- **latex**:
An article formatted completely using LaTeX
* Markup:
- **rst**:
reStructuredText_ markup
- **markdown**:
Markdown_ markup
.. _Sphinx: http://sphinx-doc.org/
.. _reStructuredText: http://docutils.sourceforge.net/rst.html
.. _Markdown: http://daringfireball.net/projects/markdown/syntax
* Python:
Comments out all the non-Python code to produce a ``.py`` Python
script with just the code content. Currently the output includes IPython
magics, and so can be run with ``ipython``, after changing the extension
of the script to ``.ipy``.
The files output file created by ``nbconvert`` will have the same base name as
the notebook and will be placed in the current working directory. Any
supporting files (graphics, etc) will be placed in a new directory with the
same base name as the notebook, suffixed with ``_files``::
$ ipython nbconvert notebook.ipynb
$ ls
notebook.ipynb notebook.html notebook_files/
Each of the options for PDF export produces as an intermediate step a LaTeX
``.tex`` file with the same basename as the notebook, as well as individual
files for each figure, and ``.text`` files with textual output from running
code cells.
To actually produce the final PDF file, run the following commands::
$ ipython nbconvert --format=latex notebook.ipynb
$ pdflatex notebook
This requires a local installation of LaTeX on your machine.
The output is a PDF file ``notebook.pdf``, also placed inside the
``nbconvert_build`` subdirectory.
Alternatively, the output may be sent to standard output with::
$ ipython nbconvert notebook.ipynb --stdout
Multiple notebooks can be specified from the command line::
$ ipython nbconvert notebook*.ipynb
$ ipython nbconvert notebook1.ipynb notebook2.ipynb
or via a list in a configuration file, say ``mycfg.py``, containing the text::
c = get_config()
c.NbConvertApp.notebooks = ["notebook1.ipynb", "notebook2.ipynb"]
and using the command::
$ ipython nbconvert --config mycfg.py
Extracting standard Python files from notebooks
-----------------------------------------------
``.ipynb`` notebook document files are plain text files which store a
representation in JSON format of the contents of a notebook space. As such,
they are not valid ``.py`` Python scripts, and so can be neither imported
directly with ``import`` in Python, nor run directly as a standard Python
script (though both of these are possible with simple workarounds).
To extract the Python code from within a notebook document, the simplest
method is to use the ``File | Download as | Python (.py)`` menu item; the
resulting ``.py`` script will be downloaded to your browser's default
download location.
An alternative is to pass an argument to the IPython Notebook, from the moment
when it is originally started, specifying that whenever it saves an ``.ipynb``
notebook document, it should, at the same time, save the corresponding
``.py`` script. To do so, you can execute the following command::
$ ipython notebook --script
or you can set this option permanently in your configuration file with::
c = get_config()
c.NotebookManager.save_script=True
The result is that standard ``.py`` files are also now generated, which
can be ``%run``, imported from regular IPython sessions or other notebooks, or
executed at the command line, as usual. Since the raw code you have typed is
exported, you must avoid using syntax such as IPython magics and other
IPython-specific extensions to the language for the files to be able to be
successfully imported.
.. or you can change the script's extension to ``.ipy`` and run it with::
..
.. $ ipython script.ipy
In normal Python practice, the standard way to differentiate importable code
in a Python script from the "executable" part of a script is to use the
following idiom at the start of the executable part of the code::
if __name__ == '__main__'
# rest of the code...
Since all cells in the notebook are run as top-level code, you will need to
similarly protect *all* cells that you do not want executed when other scripts
try to import your notebook. A convenient shortand for this is to define
early on::
script = __name__ == '__main__'
Then in any cell that you need to protect, use::
if script:
# rest of the cell...
.. _notebook_format:
Notebook JSON file format
-------------------------
Notebook documents are JSON files with an ``.ipynb`` extension, formatted
as legibly as possible with minimal extra indentation and cell content broken
across lines to make them reasonably friendly to use in version-control
workflows. You should be very careful if you ever manually edit this JSON
data, as it is extremely easy to corrupt its internal structure and make the
file impossible to load. In general, you should consider the notebook as a
file meant only to be edited by the IPython Notebook app itself, not for
hand-editing.
.. note::
Binary data such as figures are also saved directly in the JSON file.
This provides convenient single-file portability, but means that the
files can be large; a ``diff`` of binary data is also not very
meaningful. Since the binary blobs are encoded in a single line, they
affect only one line of the ``diff`` output, but they are typically very
long lines. You can use the ``Cell | All Output | Clear`` menu option to
remove all output from a notebook prior to committing it to version
control, if this is a concern.
The notebook server can also generate a pure Python version of your notebook,
using the ``File | Download as`` menu option. The resulting ``.py`` file will
contain all the code cells from your notebook verbatim, and all Markdown cells
prepended with a comment marker. The separation between code and Markdown
cells is indicated with special comments and there is a header indicating the
format version. All output is removed when exporting to Python.
As an example, consider a simple notebook called ``simple.ipynb`` which
contains one Markdown cell, with the content ``The simplest notebook.``, one
code input cell with the content ``print "Hello, IPython!"``, and the
corresponding output.
The contents of the notebook document ``simple.ipynb`` is the following JSON
container::
{
"metadata": {
"name": "simple"
},
"nbformat": 3,
"nbformat_minor": 0,
"worksheets": [
{
"cells": [
{
"cell_type": "markdown",
"metadata": {},
"source": "The simplest notebook."
},
{
"cell_type": "code",
"collapsed": false,
"input": "print \"Hello, IPython\"",
"language": "python",
"metadata": {},
"outputs": [
{
"output_type": "stream",
"stream": "stdout",
"text": "Hello, IPython\n"
}
],
"prompt_number": 1
}
],
"metadata": {}
}
]
}
The corresponding Python script is::
# -*- coding: utf-8 -*-
# 3.0
#
# The simplest notebook.
#
print "Hello, IPython"
Note that indeed the output of the code cell, which is present in the JSON
container, has been removed in the ``.py`` script.