TWO Netra T1 AC200 unable to boot. Don't see the SCSI bus or network interface

2007-12-24 21:30:00

Hello,

I have a problem with *TWO* Netra T1 AC200 machines. Both of them
show exactly the same problem after about a month of running, and
they failed at the same time.

When resetting, the machines are unable to boot, giving a "Data
Access Exception" after probing the memory slots.

----- log snip 1 (after a reset-all; power-cycling does exactly the
same)
Resett
LOM event: +1d+22h24m28s host reset
ing ...

`
Processor Speed = 500 MHz
Baud rate is 9600
8 Data bits, 1 stop bits, no parity (configured from lom)

Firmware CORE Sun Microsystems, Inc.
@(#) core 1.0.3 2001/01/03 13:54
Software Power ON
Verifying NVRAM...Done
Bootmode is 0
[New I2C DIMM address]
MCR0 = 36a0bc04
MCR1 = c0804000
MCR2 = f3000bb
MCR3 = cf
Ecache Size = 256 KB
Clearing E$ Tags Done
Clearing I/D TLBs Done
Probing memory
Done
MEMBASE=0x20000000
MEMSIZE=0x10000000
Clearing memory...Done
Turning ON MMUs Done
Copy ROM to RAM (168992 bytes) Done
Orig PC=0x1fff0007e48 New PC=0xf0f07ea0
Processor Speed=500MHz
Looking for Dropin FVM ... found
Decompressing Client Done
Transferring control to Client...

ttya initialized
Reset Control: BXIR:0 BPOR:0 SXIR:0 SPOR:1 POR:0
Probing upa at 1f,0 pci pci pci
Probing upa at 0,0 SUNW,UltraSPARC-IIe SUNW,UltraSPARC-IIe (256 Kb)
Loading Support Packages: kbd-translator
Loading onboard drivers: ebus flashprom eeprom idprom SUNW,lomh
Probing /pci@1,1 Device 3 pmu i2c temperature dimm dimm i2c-nvram
idprom motherboard-fru fan-control
lomp
Probing Memory Bank #0 256 Megabytes
Probing Memory Bank #1 256 Megabytes
Probing Memory Bank #2 0 Megabytes
Probing Memory Bank #3 0 Megabytes
Data Access Exception
ok

-------

If I run "probe-scsi-all" in this state, the "ok" prompt comes back
in less than a second and nothing is probed. The device tree does
*not* show the SCSI or network subsystems:

---- snip 2
ok show-devs
/SUNW,UltraSPARC-IIe@0,0
/pci@1f,0
/virtual-memory
/memory@0,0
/aliases
/options
/openprom
/chosen
/packages
/pci@1f,0/pci@1
/pci@1f,0/pci@1,1
/pci@1f,0/pci@1,1/lomp@3
/pci@1f,0/pci@1,1/pmu@3
/pci@1f,0/pci@1,1/ebus@c
/pci@1f,0/pci@1,1/pmu@3/fan-control@0,c8
/pci@1f,0/pci@1,1/pmu@3/i2c@0,0
/pci@1f,0/pci@1,1/pmu@3/i2c@0,0/motherboard-fru@0,a2
/pci@1f,0/pci@1,1/pmu@3/i2c@0,0/i2c-nvram@0,a0
/pci@1f,0/pci@1,1/pmu@3/i2c@0,0/dimm@0,aa
/pci@1f,0/pci@1,1/pmu@3/i2c@0,0/dimm@0,a8
/pci@1f,0/pci@1,1/pmu@3/i2c@0,0/temperature@0,30
/pci@1f,0/pci@1,1/pmu@3/i2c@0,0/i2c-nvram@0,a0/idprom@1fd8
/pci@1f,0/pci@1,1/ebus@c/SUNW,lomh@14,200000
/pci@1f,0/pci@1,1/ebus@c/idprom
/pci@1f,0/pci@1,1/ebus@c/eeprom@14,0
/pci@1f,0/pci@1,1/ebus@c/flashprom@10,0
/openprom/client-services
/packages/kbd-translator
/packages/dropins
/packages/SUNW,builtin-drivers
/packages/disk-label
/packages/obp-tftp
/packages/deblocker
/packages/terminal-emulator
ok

------

If I use "probe-all", it seems to detect the SCSI and network devices:

------ snip 3
ok probe-all
Probing /pci@1,1 Device 7 isa power serial serial
Probing /pci@1,1 Device c network firewire usb
Probing /pci@1,1 Device 3
Probing /pci@1,1 Device d ide disk cdrom
Probing /pci@1,1 Device 5 pci108e,1100 network firewire usb
Probing /pci@1 Device 8 scsi disk tape scsi disk tape
Probing /pci@1 Device 5 Nothing there
Probing /pci@1 Device 6 Nothing there
Probing /pci@1 Device 7 Nothing there

------

And the SCSI bus now works:

------ snip 4

ok probe-scsi-all
/pci@1f,0/pci@1/scsi@8,1

/pci@1f,0/pci@1/scsi@8
Target 0
Unit 0 Disk SEAGATE ST318404LSUN18G 4203
Target 1
Unit 0 Disk SEAGATE ST318404LC 0006

-------

The device tree shows all devices now:

------- snip 5

ok show-devs
/SUNW,UltraSPARC-IIe@0,0
/pci@1f,0
/virtual-memory
/memory@0,0
/aliases
/options
/openprom
/chosen
/packages
/pci@1f,0/pci@1
/pci@1f,0/pci@1,1
/pci@1f,0/pci@1/scsi@8,1
/pci@1f,0/pci@1/scsi@8
/pci@1f,0/pci@1/scsi@8,1/tape
/pci@1f,0/pci@1/scsi@8,1/disk
/pci@1f,0/pci@1/scsi@8/tape
/pci@1f,0/pci@1/scsi@8/disk
/pci@1f,0/pci@1,1/usb@5,3
/pci@1f,0/pci@1,1/network@5,1
/pci@1f,0/pci@1,1/ide@d
/pci@1f,0/pci@1,1/usb@c,3
/pci@1f,0/pci@1,1/network@c,1
/pci@1f,0/pci@1,1/isa@7
/pci@1f,0/pci@1,1/lomp@3
/pci@1f,0/pci@1,1/pmu@3
/pci@1f,0/pci@1,1/ebus@c
/pci@1f,0/pci@1,1/ide@d/cdrom
/pci@1f,0/pci@1,1/ide@d/disk
/pci@1f,0/pci@1,1/isa@7/serial@0,2e8
/pci@1f,0/pci@1,1/isa@7/serial@0,3f8
/pci@1f,0/pci@1,1/isa@7/power@0,2000
/pci@1f,0/pci@1,1/pmu@3/fan-control@0,c8
/pci@1f,0/pci@1,1/pmu@3/i2c@0,0
/pci@1f,0/pci@1,1/pmu@3/i2c@0,0/motherboard-fru@0,a2
/pci@1f,0/pci@1,1/pmu@3/i2c@0,0/i2c-nvram@0,a0
/pci@1f,0/pci@1,1/pmu@3/i2c@0,0/dimm@0,aa
/pci@1f,0/pci@1,1/pmu@3/i2c@0,0/dimm@0,a8
/pci@1f,0/pci@1,1/pmu@3/i2c@0,0/temperature@0,30
/pci@1f,0/pci@1,1/pmu@3/i2c@0,0/i2c-nvram@0,a0/idprom@1fd8
/pci@1f,0/pci@1,1/ebus@c/SUNW,lomh@14,200000
/pci@1f,0/pci@1,1/ebus@c/idprom
/pci@1f,0/pci@1,1/ebus@c/eeprom@14,0
/pci@1f,0/pci@1,1/ebus@c/flashprom@10,0
/openprom/client-services
/packages/ufs-file-system
/packages/kbd-translator
/packages/dropins
/packages/SUNW,builtin-drivers
/packages/disk-label
/packages/obp-tftp
/packages/deblocker
/packages/terminal-emulator
ok

-------

But when I try to boot, I get a "Fast Data Access MMU Miss":

------- snip 6
ok boot disk
Boot device: /pci@1f,0/pci@1/scsi@8/disk@0,0 File and args:
Loading ufs-file-system package 1.4 04 Aug 1995 13:02:54.
FCode UFS Reader 1.11 97/07/10 16:19:15.
Loading: /platform/SUNW,UltraAX-i2/ufsboot
Loading: /platform/sun4u/ufsboot
Fast Data Access MMU Miss

-------

These machines had been working without problems for about a month.
In fact, one of them has Oracle 9i installed. I have not been able
to find an OpenBoot manual for version 4.0.

It seems strange that two machines are showing exactly the same
symptoms. I have already tried resetting the environment variables
to their default values, with no success.

Any ideas? Of course, I will summarize to the list. BTW, where can I
find documentation for OpenBoot 4.0?

Thank you,

Borja.
