• NUS Information Technology | NUS IT Services, Solutions & Governance
  • NUS Information Technology | NUS IT Services, Solutions & Governance
  • Get Quote
  • NUS Information Technology | NUS IT Services, Solutions & Governance
    NUS Information Technology | NUS IT Services, Solutions & Governance
    • Milestones
    • Getting Started
      • HPC Portal
      • Register for HPC
      • Registration Guide
      • Introductory Guide for New HPC Users
      • How to Run Batch Job
    • Services
      • Data Processing, Storage & Management
      • Application Software
      • HPC Consulting Service
      • HPC GPU
      • Parallel Computing
      • Scientific Visualisation
  • myEmail    Staff Portal    Student Portal    A.S.K.
Home Services HPC Newsletter » Handy and Free to Use GPU Systems for Machine/Deep Learning

HANDY AND FREE TO USE GPU SYSTEMS FOR MACHINE/DEEP LEARNING

By Wang Junhong, Research Computing, NUS IT on 18 May, 2019
What Needs to be Saved

From time to time, you may hear or read words like “Artificial Intelligence (AI)”, “Deep Learning (DL)” and “Machine Learning (ML)” which are related to the Smart Nation Initiatives since it was introduced in November 2014. If you happen be one of the researchers working on AI/DL/ML, you would be very keen to run your machine learning or deep learning on a powerful GPU system with multiple GPU cards and thousands of Tensor cores.
So, here are two ready GPU resources you can consider to run your large and computing intensive machine learning or deep learning workloads.

• On-Premise GPU system maintained by NUS Information Technology (Volta)
• Remote GPU system maintained by National Supercomputing Centre (NSCC) (AI System)

A comparison of the technical details of the Volta GPU system at NUS and the AI system at NSCC is tabulated below.

Table 1.  Technical Specifications for NUS-Volta System and NSCC-AI System

Technical Specifications Volta GPU System at NUS AI System at NSCC
GPU Server Model Dell PowerEdge C4140 Nvidia DGX-1
Number of GPUs Per Server 4 8
Max Workload Per Server 4 8
GPU Model Nvidia Tesla® V100-32GB Nvidia Tesla® V100-16GB
GPU Interconnect NVlink NVlink
GPU Capacity 20 GPU Cards 48 GPU Cards
Supported Containers Singularity Singularity, Docker
Supported ML/DL Platform Tensorflow, Caffe, Caffe2, PyTorch, Torch7, etc. Tensorflow, Caffe, Caffe2, PyTorch, cntk, Theano, etc.
Job Management & Scheduler PBS Pro PBS Pro
Run Time Limit 48 hours (extendible) 24 hours
Eligible Users NUS staff, NUS Students, and Research Collaborators NUS staff, NUS Students, and Research Collaborators
Cost to Use Free and Fair share Free and Fair free
Application to Start Link Link
User Guide Online User Guide Online User Guide

You can choose to use either one or both systems to run your deep learning. Since the same job scheduler, PBS Pro, is used on both system, job submission procedures on both the NUS’s Volta system and NSCC’s AI system are very much similar.

Please feel free to contact us via A.S.K. or email at gs.ude.sun@gnireenignEataD if you need any assistance for running machine/deep learning on the two GPU systems.

NUS Information Technology | NUS IT Services, Solutions & Governance > Services > HPC Newsletter > » Handy and Free to Use GPU Systems for Machine/Deep Learning
ALCA
National University of Singapore
  • 2 Engineering Drive 4
  • NUS Information Technology
  • Singapore 117584
  • 6516 2080
  • © National University of Singapore. All Rights Reserved.
       Legal        Branding guidelines