Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Table of Contents
stylenone

Overview

In collaboration with Dr. Mert Sabuncu from Radiology, the ITS team has established the framework for a new high-performance computing (HPC) cluster dedicated to AI/ML type workflows, like training neural networks for imaging, LLMs and so on.

This cluster features high-memory nodes, Nvidia GPU servers (A100, A40 and L40), InfiniBand interconnect, and specialized storage designed for AI workloads.

Getting Started on the AI cluster

Login: Use SSH to log in to the cluster. Replace <cwid> with your credentials.

Code Block
ssh <cwid>@ai-login01.med.cornell.edu

Once logged on:

Code Block
Last login: Fri Jan  3 11:35:53 2025 from 157.000.00.00
<cwid>@ai-login01:~$
<cwid>@ai-login01:~$ pwd
/home/<cwid>
<cwid>@ai-login01:~$ 

Home directories have limited space. It is only used for small files. All data should be stored within your designated scratch space.

...

Storing Data

All lab's central HPC scratch storage is mounted under /midtier/<lab>/scratch/

Code Block
cwid@ai-login01:~# cd /midtier/labname/scratch/cwid
  • Please note scratch storage is intended for data that is actively being used or processed.

Common File Management

Code Block
# List all files and directories in the scratch directory
ls /midtier/labname/scratch/

# Navigate to a specific subdirectory
cd /midtier/labname/scratch/cwid

# Copy a file from the current directory to another directory
cp data.txt /midtier/labname/scratch/cwid/

# Move the copied file to a different directory
mv /midtier/labname/scratch/cwid/data.txt /midtier/labname/scratch/backup/

# Create a new directory
mkdir /midtier/labname/scratch/cwid/new_project/