Summary: V1280 boot hang

2007-12-25 5:51:00

I disconnected the two SCSI cables from a Dual LVD HBA
and was able to boot without any hangs. SunService
replaced the HBA and the server is back up.

Tom Jones

----- Original Message -----
From: "Tom Jones" <tjones at statesman.com>
To: "Sun Managers List" <codeprof at codeprof.com>
Sent: Monday, February 20, 2006 2:23 PM
Subject: V1280 boot hang

> Hardware: V1280, two system boards, 24GB RAM, 3 attached
> 3510FC arrays, 2nd node of a two node SC3.1 cluster
>
> Software: Solaris 9, Volume manager mirrored boot volume
>
> Our V1280 cluster was powered down yesterday for a planned
> power outage. The second node of the cluster hangs
> at boot time. I've tried booting from an emergency boot
> disk and it hangs at the same place. The server will
> boot from CDROM.
>
> I have a recent backup tape but I suspect that it's corrupted
> as well, just like the boot volume and the emergency
> boot disk (emergency boot disk is created daily as
> a backup)
>
> I suspect there's a corrupted or missing file needed
> during the boot process that's causing the hang.
>
> Can anyone shed light on the boot process at the time
> of the hang. If I knew what files/directories are needed,
> I can restore them from the other cluster node. Is
> there any way to trace the process?
>
> Here's the messages when I boot the V1280:
> ==========================================
> {0} ok boot -sxv
> Size: 0x5fe8b+0x252cd+0x66aa7 Bytes
> SunOS Release 5.9 Version Generic_117171-08 64-bit
> Copyright 1983-2003 Sun Microsystems, Inc. All rights reserved.
> Use is subject to license terms.
> Ethernet address = 0:3:ba:3f:97:df
> mem = 25165824K (0x600000000)
> avail mem = 24716288000
> root nexus = Sun Fire V1280
> ssm0 at root: SSM Node 0
> ssm0 is /ssm at 0,0
> pci108e,80010 at ssm0: Node 0 Safari id 24 0xc700000
> pcisch0 is /ssm at 0,0/pci at 18,700000
> pci108e,80011 at ssm0: Node 0 Safari id 24 0xc600000
> pcisch1 is /ssm at 0,0/pci at 18,600000
> pci108e,80012 at ssm0: Node 0 Safari id 25 0xcf00000
> pcisch2 is /ssm at 0,0/pci at 19,700000
> pci108e,80013 at ssm0: Node 0 Safari id 25 0xce00000
> pcisch3 is /ssm at 0,0/pci at 19,600000
> /ssm at 0,0/pci at 18,600000/scsi at 2 (glm0):
> glm0 supports power management.
> /ssm at 0,0/pci at 18,600000/scsi at 2 (glm0):
> Rev. 0 Symbios 53c1010-33/66 found.
> PCI-device: scsi at 2, glm0
> glm0 is /ssm at 0,0/pci at 18,600000/scsi at 2
> /ssm at 0,0/pci at 18,600000/scsi at 2,1 (glm1):
> glm1 supports power management.
> /ssm at 0,0/pci at 18,600000/scsi at 2,1 (glm1):
> Rev. 0 Symbios 53c1010-33/66 found.
> PCI-device: scsi at 2,1, glm1
> glm1 is /ssm at 0,0/pci at 18,600000/scsi at 2,1
> PCI-device: ide at 3, uata0
> uata0 is /ssm at 0,0/pci at 18,700000/ide at 3
> PCI-device: pci at 3, pci_pci1
> pci_pci1 is /ssm at 0,0/pci at 19,700000/pci at 3
> PCI-device: scsi at 4, qus0
> qus0 is /ssm at 0,0/pci at 19,700000/pci at 3/scsi at 4
> /ssm at 0,0/pci at 19,700000/pci at 3/scsi at 4 (isp0):
> Firmware Version: v10.04.37, Customer: 0
> PCI-device: scsi at 5, qus1
> qus1 is /ssm at 0,0/pci at 19,700000/pci at 3/scsi at 5
> /ssm at 0,0/pci at 19,700000/pci at 3/scsi at 5 (isp1):
> Firmware Version: v10.04.37, Customer: 0
> sd0 at glm0: target 0 lun 0
> sd0 is /ssm at 0,0/pci at 18,600000/scsi at 2/sd at 0,0
> sd15 at glm1: target 0 lun 0
> sd15 is /ssm at 0,0/pci at 18,600000/scsi at 2,1/sd at 0,0
> sd16 at glm1: target 1 lun 0
> sd16 is /ssm at 0,0/pci at 18,600000/scsi at 2,1/sd at 1,0
> sd1 at glm0: target 1 lun 0
> sd1 is /ssm at 0,0/pci at 18,600000/scsi at 2/sd at 1,0
>
>
> The boot hangs after the sd1 message.
>
> Thanks,
> Tom Jones

Comments

Got something to say?

You must be logged in to post a comment.