University of Alberta banner
Academic Information and Communication TechnologiesUniversity of Alberta
About AICT | Getting Help | Online Tools | Services | Research | Teaching & Learning | Computer Security
Site Specific Information
The WestGrid SGI Origin machines at the University of Alberta

New User Message as of July 23, 2004



Greetings,

We have set up a WestGrid account for you
on nexus.westgrid.ca.

This is the gateway to WestGrid's large
shared-memory parallel computer facility
located at the University of Alberta.

You can use grid-enabled tools to access this
site's resources or login directly with the
username $USERNAME and password you supplied
during registration.

Please visit http://www.westgrid.ca for
general information about using WestGrid and
follow the links for site-specific details.
Contact support@westgrid.ca support@westgrid.ca
Thank you for your interest in WestGrid.

Regards,
   WestGrid Account Management Team
                                            

 

Here are some UofA Site-specific Details:

This local Westgrid facility consists of six SGI Origin machines with a current total of 452 processors:

  • nexus 8 MIPS R16000 700 MHz processors with 8 GB of RAM
  • arcturus 256 MIPS R16000 700 MHz processors with 256 GB of RAM
  • helios 32 MIPS R14000 500 MHz processors with 16 GB of RAM
  • corona* 32 MIPS R14000 500 MHz processors with 16 GB of RAM
  • australis 64 MIPS R12000 400 MHz processors with 32 GB of RAM
  • borealis 64 MIPS R12000 400 MHz processors with 16 GB of RAM

(*At the present time usage of corona is reserved for exclusive use of Dr. Rankin and his group.)

All of these machines may be used in batch mode, but only one, nexus, is available for interactive use. Nexus is the only one users are allowed to log on to, the rest are used by submitting a batch job to a PBS queue.

Getting Started, Logging In, and Submitting Batch Jobs:

You may access nexus interactively using ssh, sftp, or scp, but not using telnet or ftp, for security reasons.

ssh nexus.westgrid.ca

Interactive usage is allowed so that users may develop and submit batch jobs, and do limited testing. It is expected that users will run their jobs in batch mode.

Interactive users on nexus are asked to use at most 2 processors, and to limit their walltime to 1 hour. Interactive use is monitored.

Most of the processors are available only to users who submit their jobs to the batch queueing system (PBS). Using PBS is not difficult, however you will need to write and submit PBS job scripts. To learn how to do this and how to monitor your job's status please read our local users guide on:

PBS (Portable Batch System)

PBS will schedule your job according to your priority, so there will be a delay before your job starts running, but when your job does run it will have exclusive use of the processors it is running on.

The queues on all UofA WestGrid machines have a walltime limit of 24 hours. The other limits (such as minimum number of processors used) vary among the machines.

If you estimate that a particular job will run longer than the 24 hours, then you will need to incorporate checkpointing and/or signal handling into your program. We can help you with this; please feel free to contact us with any special requirements you might have.

There are also other limits, such as a limit of 6 simultaneous jobs running per user id. These are intended to increase the fairness of the system.

The Westgrid e-mail address for support is: support@westgrid.ca

Your $HOME disk space and working disk:

Storage space is provided through CXFS - a high performance filesystem shared by all the UofA Westgrid machines. CXFS is available on Nexus, Australis, and Arcturus. For Borealis, Corona, and Helios this space is served by NFS. You maight experience I/O performance degradation on NFS served space.

Please remember to remove files when they are no longer needed.

Regarding Back-ups:

We back up all files on a regular basis. However, clients must be aware that as with all file systems and backup facilities, problems can still occur. Users should still ensure that their critical files are archived in a separate location.


More site specific information is available at: http://www.ualberta.ca/AICT/RESEARCH/WestGrid/

Updated: July 23, 2004.

 
 

University of Alberta

© University of Alberta
AICT Privacy Policy