User Tools

Site Tools


quick:start

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
quick:start [2019/09/09 11:02]
wikiadmin [Shared Filesystems]
quick:start [2020/10/22 16:22] (current)
kevin [Queues]
Line 15: Line 15:
  
  
-* Maximum available memory on each type of node: ''​mem=125gb''​ (regular) or ''​mem=61gb''​ (regular with only 64GiB), and ''​mem=1007gb''​ (fat).+* Maximum available memory on each type of node: ''​mem=124gb''​ (regular) or ''​mem=61gb''​ (regular with only 64GiB), and ''​mem=1007gb''​ (fat).
  
  
Line 37: Line 37:
  
 Once you have logged in, give some consideration to how you will be using your session on the login node.  If you are going to spend a long time logged in, doing a variety of tasks, it is best to get yourself [[http://​wiki.chpc.ac.za/​quick:​start#​example_interactive_job_request|an interactive PBS session]] to work in.  This way, if you need to do something demanding, it will not conflict with other users logged into the login node. Once you have logged in, give some consideration to how you will be using your session on the login node.  If you are going to spend a long time logged in, doing a variety of tasks, it is best to get yourself [[http://​wiki.chpc.ac.za/​quick:​start#​example_interactive_job_request|an interactive PBS session]] to work in.  This way, if you need to do something demanding, it will not conflict with other users logged into the login node.
 +
 +====Trouble Logging in?====
 +Many users have their login blocked at some point. Usually this is because an incorrect password was entered more times than permitted (5 times). This restriction was put in place to prevent brute-force attacks by malicious individuals who want to gain access to your account.
 +
 +  * If you cannot log in, the first step is to make sure that you typed your username, hostname (lengau.chpc.ac.za or scp.chpc.ac.za) and password correctly. It sounds stupid, but this is often the problem. It happens to CHPC staff too...
 +  * Next, check that you are not experiencing a network problem. If you see a message along the lines of "​cannot resolve hostname",​ then your network is probably at fault (assuming that your spelling is correct).
 +  * If your network connection is fine, wait 30 minutes before attempting to log in again. After this period, the block is supposed to be automatically removed.
 +  * If for some reason this does not work, you should go to your user page on users.chpc.ac.za. There is a link at that address, to the left, which allows you to change your password and also edit other details for your entry on our user database (email addresses, qualifications,​ institution,​ etc.) **Be sure that your password conforms to all requirements**
 +  * If even changing the password does not help, please contact our helpdesk, and ask for our assistance.
 +
  
 ==== Transferring Data ==== ==== Transferring Data ====
Line 62: Line 72:
 subdirectory of your scratch directory ///​mnt/​lustre/​users/​yourusername///  ​ subdirectory of your scratch directory ///​mnt/​lustre/​users/​yourusername///  ​
 (where //​yourusername//​ is replaced by your user name on the CHPC cluster). (where //​yourusername//​ is replaced by your user name on the CHPC cluster).
 +
 +
 +=== Downloading files from other servers ===
 +You may need to download data from a server at another site.  Do not do this on ** //login2// **!  Use ** // scp.chpc.ac.za//​ ** for this purpose. ​ The easiest way of doing this is with the **wget** command:
 +
 +<​code>​
 +wget http://​someserver.someuni.ac.za/​pub/​somefile.tgz
 +</​code>​
 +
 +Very large files may be transferred more quickly by using a multi-threaded downloader. The easiest of these is **axel**, see [[https://​github.com/​axel-download-accelerator/​axel|axel'​s GitHub page]]. ​ The syntax is very simple:
 +
 +<​code>​
 +module load chpc/​compmech/​axel/​2.17.6
 +axel -n 4 -a http://​someserver.someuni.ac.za/​pub/​somefile.tgz
 +</​code>​
 +
 +
  
 [[guide:​connect|Read more on connecting to the CHPC...]] [[guide:​connect|Read more on connecting to the CHPC...]]
Line 203: Line 230:
 ^ Queue Name  ^ Max. cores  ^ Min. cores  ^  Max. jobs  ^^  Max. time  ^  Notes  ^ Access ​ ^ ^ Queue Name  ^ Max. cores  ^ Min. cores  ^  Max. jobs  ^^  Max. time  ^  Notes  ^ Access ​ ^
 ^ :::  ^  per job  ^^  in queue  ^  running ​ ^  hrs  ^ :::  ^ :::  ^ ^ :::  ^  per job  ^^  in queue  ^  running ​ ^  hrs  ^ :::  ^ :::  ^
-| serial ​ |  23 |  1 |  ​??? |  ​??? |  48 | For single-node non-parallel jobs.  |  | +| serial ​ |  23 |  1 |  ​24 |  ​10 |  48 | For single-node non-parallel jobs.  |  | 
-| seriallong ​ |  12 |  1 |  ​??? |  ​??? |  144 | For very long sub 1-node jobs.  |  |+| seriallong ​ |  12 |  1 |  ​24 |  ​10 |  144 | For very long sub 1-node jobs.  |  |
 | smp  |  24 |  24 |  20 |  10 |  96 | For single-node parallel jobs.  |  | | smp  |  24 |  24 |  20 |  10 |  96 | For single-node parallel jobs.  |  |
-^ normal ​ ^  240 ^  ​48 ^  20 ^  10 ^  48 ^ The standard queue for parallel jobs ^  ^+^ normal ​ ^  240 ^  ​25 ^  20 ^  10 ^  48 ^ The standard queue for parallel jobs ^  ^
 | large  |  2400 |  264 |  10 |  5 |  48 | For large parallel runs  | //​Restricted// ​ | | large  |  2400 |  264 |  10 |  5 |  48 | For large parallel runs  | //​Restricted// ​ |
 | express ​ |  2400 |  25 |  N/A |  100 total nodes |  96 | For paid commercial use only  | //​Restricted// ​ | | express ​ |  2400 |  25 |  N/A |  100 total nodes |  96 | For paid commercial use only  | //​Restricted// ​ |
Line 212: Line 239:
 | vis  |  12 |  1 |  1 |  1 |  3 | Visualisation node  |  | | vis  |  12 |  1 |  1 |  1 |  3 | Visualisation node  |  |
 | test  |  24 |  1 |  1 |  1 |  3 | Normal nodes, for testing only  |  | | test  |  24 |  1 |  1 |  1 |  3 | Normal nodes, for testing only  |  |
 +| gpu_1 |  10 |  1 |    |  2 |  12 | Up to 10 cpus, 1 GPU        |  |
 +| gpu_2 |  20 |  1 |    |  2 |  12 | Up to 20 cpus, 2 GPUs        |  |
 +| gpu_3 |  36 |  1 |    |  2 |  12 | Up to 36 cpus, 3 GPUs        |  |
 +| gpu_4 |  40 |  1 |    |  2 |  12 | Up to 40 cpus, 4 GPUs        |  |
 +
 +
  
 ===Notes:​=== ===Notes:​===
Line 302: Line 335:
 #PBS -e /​mnt/​lustre/​users/​USERNAME/​OMP_test/​test1.err #PBS -e /​mnt/​lustre/​users/​USERNAME/​OMP_test/​test1.err
 #PBS -m abe #PBS -m abe
-#PBS -M your.email@address+#PBS -WMail_Users=youremail@ddress
 ulimit -s unlimited ulimit -s unlimited
  
Line 328: Line 361:
 #PBS -e /​mnt/​lustre/​users/​USERNAME/​WRF_Tests/​WRFV3/​run2km_100/​wrf.err #PBS -e /​mnt/​lustre/​users/​USERNAME/​WRF_Tests/​WRFV3/​run2km_100/​wrf.err
 #PBS -m abe #PBS -m abe
-#PBS -M your.email@address+#PBS -WMail_Users=youremail@ddress
 ulimit -s unlimited ulimit -s unlimited
 . /​apps/​chpc/​earth/​WRF-3.7-impi/​setWRF . /​apps/​chpc/​earth/​WRF-3.7-impi/​setWRF
/var/www/wiki/data/attic/quick/start.1568019726.txt.gz · Last modified: 2019/09/09 11:02 by wikiadmin