Skip to content

Threadripper workstation

Brian M. Schilder edited this page May 18, 2023 · 19 revisions

The following are ways to interact with and use Nathan Skene's Threadripper workstation (or really any computer) remotely.

Saving your password

Rather than entering your password every time you ssh login to the Threadripper, you can simply use ssh-keygen to do this once:

ssh-copy-id <username>@129.31.68.96

This will ask you for your Threadripper password. Afterwards you will no longer have to enter your password from your local computer. For a more detailed explanation, see here.

Preventing crashes

Certain programs within applications (e.g. Rstudio/R/Python) will try to parallelise many processes. But an ongoing issue seems to be that the Threadripper doesn't always close these processes (e.g. when you force  an R command to stop early). This means that processes can accumulate, causing applications like Rstudio to crash, even or cause the whole machine to crash/restart!

To avoid this, you have to pay close attention to how many resources you're using with a manager software like htop. Simply open a command line window on the Threadripper (via Teamviewer or via a remote ssh session) and enter:

htop

This will show all of the processes currently running on the Threadripper.

You can then kill the ones that are no longer needed.


Remote ssh

Connect to the Threadripper via the command line (Mac Terminal) on your local machine. Ask Nathan Skene to create a username/password for you first.

ssh <username>@wmcr-nskene.med.ic.ac.uk


Rstudio server

You can interact with Rstudio on the Threadripper via your web browser, by accessing a Docker image that's running on the Threadripper.

Connecting to the Docker image

Brian Schilder has already created an Rstudio Docker image and launched it so it can be used by anyone in the lab.

Creating the Docker image

If you ever need to create a new Docker image, you can follow these steps beforehand.

Here we will use the scfdev Docker container, which is stored on Docker Hub courtesy of Alan Murphy (see here for more details).

To create the image, run the following in the Threadripper command line.

docker run \
 -e PASSWORD=scflow \
 -d \
 -v /mnt/raid1/:/home/rstudio/raid1 \
 -e ROOT=true \
 -p 8787:8787 almurphy/scfdev

Notes:

  • Make sure you mount whatever folders you need access to from within Rstudio with one or more -v arguments.
  • Make sure you set your mounted file/folder permissions to be accessible by via the Rstudio server, e.g.: chmod -R u=rwx,go=rwx <my_subfolder>/
  • Note that we mount the folder /mnt/raid1 which is an 8TB hard drive connected to the Threadripper. It's preferable to use this as it has much more storage capacity than the Threadripper's main hard drive (~750GB).

Troubleshooting:

  • If you encounter errors, check that you can run docker run hello-world successfully.
  • If you get the error docker: Got permission denied while trying to connect to the Docker daemon socket at unix:///var/run/docker.sock: Post http://%2Fvar%2Frun%2Fdocker.sock/v1.24/containers/create: dial unix /var/run/docker.sock: connect: permission denied. See 'docker run --help'. then run newgrp docker first.
  • By default, Docker will stores all images within the main HD at: /var/lib/docker. However, this partition has limited storage. To change Docker's storage location to the much larger raid1 HD, have someone with admin permissions (i.e. Nathan Skene) follow these steps.

Changing the port number

This -p 8787:8787bit forwards the port from inside of the docker image to the workstation, so it's -p <workstation_port>:<docker_port>.  If you wanted to run a new docker image then you could, for example, change this to : -p 8786:8787and then connect to http://129.31.68.96:8786/ etc.

Notes:

  • You may need to sudo ufw enable 8786 first to ensure you can use that port.

Jupyter server

Connect to a remote Jupyter Notebook/Lab session on a remote server (e.g. the Threadripper) as if it were on your local computer.

For detailed instructions, follow this Tutorial. To summarise you:

  1. If you haven't done so already, create a password for  Jupyter  on the remote machine:
jupyter notebook password

Or, if you prefer using Jupyter Lab:

jupyter lab password
  1. Start a Jupyter instance on the remote machine via command line. Optionally, you can start a tmux session first to ensure that Jupyter will keep running even if the terminal on your local machine is closed (see here for further details).
tmux new -s jupyter

Now start the Jupyter Notebook:

jupyter notebook --no-browser

Alternatively, you can use Jupyter Lab:

jupyter lab --no-browser --ip "*" --notebook-dir </your/dir>
  1. NOTE: make sure you change <username> to your actual remote server username first. Access it using the following bash command on your local computer saved in your ~/.bashrc file.
function tr_jupyter(){
    # Forwards port $1 into port $2 and listens to it
    ### Example of what the final command looks like ###
    ## ssh -N -f -L localhost:1080:localhost:1070 bschilder@146.169.8.44
    ### A more flexible function to use any port you want
    ssh -N -f -L localhost:$2:localhost:$1 <username>@<hostname> 
    open http://localhost:$2/
}
  1. Make sure the  function is loaded (you may need to start a new bash session if you just amended the ~/.bashrc file).
  2. In command line, run:
tr_jupyter <port1> <port2>
  1. This will open a Jupyter notebook folder in your web browser, which you can then use to open a Jupyter notebook. Importantly, this notebook has access to files on the remote machine and runs commands using the remote machine's computing resources (as opposed to your local machine).

  2. You may notice that your conda environments don't automatically appear as options when selecting your kernel.

You can setup each conda env to be detected like so:

conda activate base
conda install -c conda-forge nb_conda_kernels
conda activate <env_name>
ipython kernel install --user --name=<env_name>

Alternatively, you can use the mamba_gator extension to manage your conda environments directly from within Jupyter Notebook/Lab. Install jupyterlab and mamba_gatorto your "base" conda env like so:

conda activate base
conda install -c conda-forge jupyterlab mamba_gator

Now when you go to your Settings tab, you'll see a new option called "Conda Packages Manager".

Note: The first time you click this, it may take a long time for it to find all your environments and the packages within them (note the spinning arrows). Wait until this is done and then you can begin managing your conda envs. If it continues to take a very long time, you may need to restart your Jupyter session and reconnect (the issue should be resolved automatically after this and you should see all of your conda envs listed).

Screenshot 2023-05-18 at 12 44 15

macFUSE mounting

Mount your remote computer to access files like they’re on your local machine.

  1. Install macFUSE.
  2. Create the folder that will be used as an access point to your remote computer's folder system.
sudo mkdir /Volumes/TR
sudo chown ${USER}:staff /Volumes/TR
  1. Mount the remote computer:
sshfs <username>@wmcr-nskene.med.ic.ac.uk:/home/<username> /Volumes/TR -o allow_other -o noappledouble -o volname=TR -o follow_symlinks
  1. [
ln -s /Volumes/TR ~/Desktop

Automated setup

The mounted subfolder will erase itself each time you umount it. To make this easier to remount each time, you can turn these commands into a function and add it to your  ~/.bash_profile  or ~/.bashrc .

nano ~/.bashrc
mountTR(){
        diskutil umount force /Volumes/TR 
        
        sudo mkdir /Volumes/TR
        sudo chown ${USER}:staff /Volumes/TR
        sshfs <username>@wmcr-nskene.med.ic.ac.uk:/home/<username> /Volumes/TR -o allow_other -o noappledouble -o volname=TR -o follow_symlinks
}

The next step assumes your bash_profile/bashrc  have been sourced upon login.

If they haven't been, run source ~/.bash_profile  or source ~/.bashrc  first.

Then, simply run the function we created in command line.

mountTR()

The mounted folder should now appear, and be accessible via the link you created on your Desktop (if you did that step).

Troubleshooting

When setting this up, you might run into this error:

mount_macfuse: mount point /Volumes/TR is itself on a macFUSE volume

This arises when you've previously (unsuccessfully) tried mounting the directory of the same name.

To overcome this, simply run the following (Note; make sure you type umount, not unmount)

umount /Volumes/TR

If the above approaches don't work for you, you can also try the suggestion here:

https://unix.stackexchange.com/questions/62677/best-way-to-mount-remote-folder


Remote desktop access

Sometimes it's necessary to use the graphical interface for a machine, as opposed to the Terminal alone. TeamViewer can do this, but has some security issues. Therefore we recommend you use TigerVNC instead. You can find a setup guide here, but here is a summarized version:

On your remote computer

All commands are to be executed in the remote computer's Terminal.

  1. Install TigerVNC on the remote computer (if not already installed): sudo apt-get install tigervnc
  2. Create a TigerVNC config file: nano ~/.vnc/xstartup and add the following to it:
#!/bin/sh
# Start Gnome 3 Desktop 
[ -x /etc/vnc/xstartup ] && exec /etc/vnc/xstartup
[ -r $HOME/.Xresources ] && xrdb $HOME/.Xresources
vncconfig -iconic &
dbus-launch --exit-with-session gnome-session &
  1. Start TigerVNC: vncserver
  2. Verify that TigerVNC has launched successfully:
pgrep Xtigervnc
ss -tulpn | egrep -i 'vnc|590'

On your local computer

  1. Install TigerVNC on your local computer. For MacOS, use the TigerVNC-1.12.0.dmg: https://sourceforge.net/projects/tigervnc/files/stable/1.12.0/
  2. Connect to the remote computer using your username on the remote computer (<user>) and the IP address <remote.server>: ssh <user>@<remote-server> -L 5901:127.0.0.1:5901
  3. Open the TigerVNC application, enter the address 127.0.0.1:5901 and click "Connect":
Screenshot 2022-06-27 at 13 42 51
  1. This should launch a new window giving you access to your remote desktop. Enter your password for your account on the remote computer.
  2. You can now interact with the remote desktop. This will look quite bare at first, but you can begin to customise it and open applications by clicking "Activities" in the upper left corner:
Screenshot 2022-06-27 at 13 46 02
  1. Launch the Terminal by clicking its application icon, or use the hotkeys CTRL+CMD+T on the desktop. You can use this to install a useful app for app installation/management: apt-get update & apt-get -y install gnome-software. The "Software" app will now appear under "Activities" --> "Show Applications".

Core / memory management

Managing core and memory usage is important when using the Threadripper.

Programs like Rstudio are not very good at cancelling processes on the machine that you ended with Rstudio (e.g. via CTRL+C or ESC). This means that you can easily clog up the Threadripper with jobs you thought you cancelled already, but are actually still running indefinitely in the background.

htop

The easiest way to kill processes on the TR is to use htop.

In command line:

  1. htop
  2. Hit spacebar to select the processes you want to affect (use the up/down keys or click to move around). Hit spacebar again on the same item to deselect. Use SHIFT+U to deselect all items.
    1. /usr/lib/rstudio/libexec/QtWebEngineProcess... When trying to kill Rstudio processes, look for rows with something similar to the following in the
  3. You can sort the processes according to different columns (including
    1. %MEM
  4. Hit k and then 9 on your keyboard. Then hit Enter. This will kill the selected processes.
    1. Note, unless you have root privileges, you can only kill processes that you started (i.e.

htop is also quite handy for Macs, and can be easily installed via brew here.

TR Start-up in Emergency Mode

TR can start up in emergency mode after a reboot because of an issue in the /etc/fstab file. To get around this:

hit CTRL+D... then "sudo vi /etc/fstab" and commented out the line causing the issue

It should then restart as normal

Clone this wiki locally