User Tools

Site Tools


tipsntricks:ipython_notebook

Jupyter / Ipython Notebook

Jupyter is installed in most of the python modules. To make use of it you would do something like:

module add chpc/python/3.6.0_gcc-6.3.0
jupyter-notebook

However using it effectively on the cluster is a bit more complicated…

Security

First things first – you are not the only person on the system, so it is important to set up authentication on your notebook so that not everyone gets access to your notebook (and worse – your data).

So first one needs a configuration file, this can be done by passing the generate-config parameter to jupyter as follows:

[USERNAME@cnode0010 ~]$ jupyter-notebook --generate-config
Writing default config to: /home/USERNAME/.jupyter/jupyter_notebook_config.py

Next you need to generate your password (remember it – you'll need it when you connect later):

python
from notebook.auth import passwd
passwd()
Enter password:
Verify password:
'sha1:f27008fdb0eb:4c2f305d5e230edca16c7059882ba3ba63bee03b'

Now edit the jupyter_notebook_config.py file, specifically edit the c.NotebookApp.passwd line (remember to uncomment it, and don't just copy and paste my hash in):

c.NotebookApp.password = 'sha1:f27008fdb0eb:4c2f305d5e230edca16c7059882ba3ba63bee03b'

Starting a notebook inside a job

There might be a cleaner way of doing this… Please let me know if you have one!

First you need to edit your ~/.ssh/config file on your local machine and add in the lines:

Host cnode*
    Hostname %h
    User YOURUSERNAME
    ProxyCommand ssh YOURUSERNAME@lengau.chpc.ac.za nc %h 22
    LocalForward 8838 localhost:8838

What this will do is allow you to ssh directly to a compute node (note you can only do this to nodes where you currently have a job running).

The jobscript looks something like:

jupyter.qsub
#!/bin/bash
#PBS -P SHORTNAME
#PBS -q serial
#PBS -l select=1:ncpus=8:mpiprocs=1
#PBS -l walltime=08:00:00
#PBS -N Jupyter
#PBS -m abe
#PBS -M YOUR@EMAIL.ADDRESS
 
module add chpc/python/3.6.0_gcc-6.3.0
 
JUPYTERPORT=8838  # you could change this too, if you wanted to.
 
hostname > ~/jupyter.host
 
jupyter-notebook --port=${JUPYTERPORT} --no-browser

If you submit that job and wait for it to start running then you can check which host the session is running on with:

cat ~/jupyter.host

Then, again on your local machine, you need to connect to the compute node, i.e. ssh cnode0101.

Finally in your browser go to: http://localhost:8838

Starting a parallel session

/var/www/wiki/data/pages/tipsntricks/ipython_notebook.txt · Last modified: 2019/08/23 11:47 by ccrosby