By Choong Wey Yeh, Lucas on 21 Oct, 2019
The High Performance Computing (HPC) team at NUS IT provides services and resources for users to run large-scale computational jobs. These jobs, submitted by users from various departments, range from machine learning programs to simulations. However, users may not always know the right amount of computational resources to request for their jobs, so the resources they request often end up underutilised. When jobs request more resources than they use, the surplus stays reserved but idle on the HPC clusters, which results in longer queueing times for other users waiting for their turn to run jobs.