Project:Infrastructure/SPARC server recovery

This document covers how to recover from hard failure on the Sun Fire T2000 development servers, bender.sparc.dev.gentoo.org and totoro.sparc.dev.gentoo.org.

Thanks to User:Iamben for writing this document. Wikified and edited by User:Robbat2.

Short version

 * 1) SSH to ALOM
 * 2) Press  at SILO prompt:
 * 3) Press,  to disconnect console
 * 1) Press  at SILO prompt:
 * 2) Press,  to disconnect console
 * 1) Press,  to disconnect console

Connecting to ALOM
First, login to something with access to the Gentoo LAN subnet at OSUOSL (another host or the OSL VPN).

Then SSH to the ALOM (SPARC Out Of Band management system), ensuring you tell SSH to use legacy options, as newer SSH security is not supported by the ALOM.

You should now have the ALOM console, denoted by :

ALOM: Manual host poweroff
From ALOM console, run. You will be prompted for confirmation, and then it will return to the prompt. You need to wait for shutdown confirmation!

ALOM: Manual host poweron
From ALOM console, run.

ALOM: connect to host console
From ALOM console, run. The  option is needed in case there is a stale connection to the console, as sometimes happens if SSH is disconnected without an explicit. You will be prompted to disconnect the stale connection.

Host console: POST output
Review the POST output; it might contain hardware faults (unlikely, and should pause).

Host console: SILO bootloader
Press at SILO prompt to boot the default Gentoo Linux kernel.

Boot device: disk File and args: SILO Version 1.4.14_git20170829 boot: boot: Allocated 64 Megs of memory at 0x40000000 for kernel Uncompressing image... Loaded kernel version 4.20.2

[   0.000028] PROMLIB: Sun IEEE Boot Prom 'OBP 4.25.0 2006/11/07 23:24' [   0.000037] PROMLIB: Root node compatible: sun4v [   0.000062] Linux version 4.20.2-gentoo (root@bender) (gcc version 8.2.0 (Gentoo 8.2.0-r6 p1.7)) #1 SMP Wed Jan 16 14:16:59 -00 2019 [   1.797025] printk: bootconsole [earlyprom0] enabled [   2.037192] ARCH: SUN4V ... This is bender.gentoo.osuosl.org (Linux sparc64 4.20.2-gentoo) 22:37:04

bender login:

Host console: exit from host console to ALOM prompt
Press, to disconnect console and return to   prompt.

This is bender.gentoo.osuosl.org (Linux sparc64 4.20.2-gentoo) 22:37:04 bender login: sc>

ALOM: logout
Properly logout from the ALOM console.