Wednesday 30 March 2016

Physical processor administration in Solaris 10

 # fmadm faulty
--------------- ------------------------------------  -------------- ---------
TIME            EVENT-ID                              MSG-ID         SEVERITY
--------------- ------------------------------------  -------------- ---------
Feb 20 14:41:13 350e799e-27c7-6621-d897-dbab2fbe0efb  SUN4U-8001-0J  Major
Host        : p10app04
Platform    : SUNW,Sun-Fire-V490        Chassis_id  :
Product_sn  :
Fault class : fault.cpu.ultraSPARC-IVplus.l3cachedata
Affects     : cpu:///cpuid=2/serial=80020228CE6C1347
              cpu:///cpuid=18/serial=80020228CE6C1347
                  faulted but still in service
FRU         : "Slot A" (hc://:product-id=SUNW,Sun-Fire-V490:server-id=p10app04/c
                  faulty
Description : The number of errors associated with this CPU has exceeded
              acceptable levels.  Refer to http://sun.com/msg/SUN4U-8001-0J for
              more information.
Response    : The fault manager will attempt to remove the affected CPU from
              service.
Impact      : System performance may be affected.
Action      : Schedule a repair procedure to replace the affected CPU, the
              identity of which can be determined using fmdump -v -u
              <EVENT_ID>.


 # fmdump -v -u 350e799e-27c7-6621-d897-dbab2fbe0efb
TIME                 UUID                                 SUNW-MSG-ID
Feb 20 14:41:14.2577 350e799e-27c7-6621-d897-dbab2fbe0efb SUN4U-8001-0J
  100%  fault.cpu.ultraSPARC-IVplus.l3cachedata
        Problem in: -
           Affects: cpu:///cpuid=2/serial=80020228CE6C1347
               FRU: hc://:product-id=SUNW,Sun-Fire-V490:server-id=p10app04/component=Slot A
          Location: -
  100%  fault.cpu.ultraSPARC-IVplus.l3cachedata
        Problem in: -
           Affects: cpu:///cpuid=18/serial=80020228CE6C1347
               FRU: hc://:product-id=SUNW,Sun-Fire-V490:server-id=p10app04/component=Slot A
          Location: -
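
The two "Affects" entries above carry the same serial number, which suggests that cpuid 2 and cpuid 18 are two cores of one physical module in Slot A. A minimal sketch for pulling the affected cpuids out of saved `fmdump -v` text follows; the function name `affected_cpuids` is hypothetical and the pattern assumes the "Affects: cpu:///cpuid=N/..." layout shown in this transcript.

```shell
# Print the unique cpuid values found on "Affects:" lines.
# Reads `fmdump -v -u <uuid>` text on stdin, so it also works on saved output.
affected_cpuids() {
    sed -n 's|.*Affects: *cpu:///cpuid=\([0-9][0-9]*\)/.*|\1|p' | sort -n -u
}
```

For example, `fmdump -v -u 350e799e-27c7-6621-d897-dbab2fbe0efb | affected_cpuids` should list the ids that psrinfo later reports as faulted.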

 # psrinfo
0       on-line   since 02/20/2013 14:40:34
1       on-line   since 02/20/2013 14:40:34
2       faulted   since 02/20/2013 14:41:14
3       on-line   since 02/20/2013 14:40:32
16      on-line   since 02/20/2013 14:40:34
17      on-line   since 02/20/2013 14:40:34
18      faulted   since 02/20/2013 14:41:14
19      on-line   since 02/20/2013 14:40:34

 # psrinfo -p
4

 # psrinfo |wc -l
       8
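
The two counts above (4 physical processors, 8 lines of psrinfo output) imply two virtual processors per physical chip. A tiny sketch of that arithmetic, with a hypothetical helper name `per_chip`:

```shell
# Virtual processors per physical chip.
# Arguments: <virtual count> <physical count>
per_chip() {
    echo $(( $1 / $2 ))
}
```

With the values from this host, `per_chip 8 4` prints 2. On a live system the arguments could come from `psrinfo | wc -l` and `psrinfo -p`; `psrinfo -pv` should also show which virtual processors belong to each physical one.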


 # psradm -f 2
psradm: processor 2 in faulted state; add -F option to force change


 # psradm -f -F 2


 # psradm -f -F 18
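
The forced off-line step can be derived from psrinfo output instead of being typed per processor. The sketch below only prints the commands so they can be reviewed first; `offline_cmds` is a hypothetical name, and it assumes the second psrinfo column is the state, as in the listing above.

```shell
# Turn `psrinfo` output (stdin) into forced off-line commands,
# one per processor whose state column reads "faulted".
# Nothing is executed here; the commands are only printed.
offline_cmds() {
    awk '$2 == "faulted" { print "psradm -f -F " $1 }'
}
```

Run `psrinfo | offline_cmds` to inspect the result, and pipe it to `sh` only once the list looks right.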


# psrinfo
0       on-line   since 02/20/2013 14:40:34
1       on-line   since 02/20/2013 14:40:34
2       off-line  since 02/20/2013 19:54:54
3       on-line   since 02/20/2013 14:40:32
16      on-line   since 02/20/2013 14:40:34
17      on-line   since 02/20/2013 14:40:34
18      off-line  since 02/20/2013 19:55:18
19      on-line   since 02/20/2013 14:40:34


 # fmadm repaired 350e799e-27c7-6621-d897-dbab2fbe0efb
fmadm: failed to record repair to 350e799e-27c7-6621-d897-dbab2fbe0efb: specified resource is not known to be faulty
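
The failure above is probably caused by passing the event UUID. As far as I understand fmadm(1M), `fmadm repaired` expects a resource FMRI or label, while the older `fmadm repair` form accepts a UUID; treat both statements as assumptions and check `man fmadm` on the host. Under that assumption, this sketch builds one `fmadm repaired` command per affected resource from `fmdump -v` text. The name `repaired_cmds` is hypothetical, and the commands are printed rather than run.

```shell
# Read `fmdump -v -u <uuid>` text on stdin and print one
# `fmadm repaired "<fmri>"` command per "Affects:" line.
# Assumption: `fmadm repaired` takes the cpu:/// FMRI, not the UUID.
repaired_cmds() {
    sed -n 's|.*Affects: *\(cpu:[^ ]*\).*|fmadm repaired "\1"|p'
}
```

Example: `fmdump -v -u 350e799e-27c7-6621-d897-dbab2fbe0efb | repaired_cmds`.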


 # cd /var/fm/fmd
 # >errlog
 # >fltlog

# cd rsrc

# ls
479acae3-52ce-41be-92fb-ae8517bf4657  c784c664-1ea7-c332-814d-f76e939c2db3


 # file *
479acae3-52ce-41be-92fb-ae8517bf4657:  extended accounting file
c784c664-1ea7-c332-814d-f76e939c2db3:  extended accounting file

 # rm *

 # pwd
/var/fm/fmd/rsrc


# cd ..

 # ls
errlog  fltlog  rsrc    xprt
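
The manual cleanup in this part of the transcript (empty errlog and fltlog, remove everything under rsrc) can be sketched as one function. `reset_fmd_state` is a hypothetical helper, not a Solaris command; it takes the state directory as its argument (/var/fm/fmd here, per the pwd output) and refuses to run on a directory that does not have the expected layout. It discards fault history, so it is meant as an illustration of the steps rather than a recommended tool.

```shell
# Clear fmd's on-disk state the way the transcript does by hand.
# $1 is the fmd state directory (/var/fm/fmd in the transcript).
reset_fmd_state() {
    dir=$1
    # Refuse to touch a directory that lacks the expected files.
    if [ ! -f "$dir/errlog" ] || [ ! -f "$dir/fltlog" ] || [ ! -d "$dir/rsrc" ]; then
        echo "reset_fmd_state: $dir does not look like an fmd state directory" >&2
        return 1
    fi
    : > "$dir/errlog"       # truncate the error log
    : > "$dir/fltlog"       # truncate the fault log
    rm -f "$dir"/rsrc/*     # remove the saved resource files
}
```

The function does not touch the fmd service itself; the transcript restarts it separately with `svcadm restart fmd`.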

 # psradm -n 2
 # psradm -n 18

 # svcadm restart fmd

 # fmadm faulty

# psrinfo
0    on-line   since 02/20/2013 14:40:34
1    on-line   since 02/20/2013 14:40:34
2    on-line   since 02/20/2013 19:59:36
3    on-line   since 02/20/2013 14:40:32
16   on-line   since 02/20/2013 14:40:34
17   on-line   since 02/20/2013 14:40:34
18   on-line   since 02/20/2013 19:59:54
19   on-line   since 02/20/2013 14:40:34
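
A quick way to confirm the end state is to check that no processor reports anything other than on-line. A minimal sketch, assuming the psrinfo column layout shown above; `all_online` is a hypothetical name.

```shell
# Succeed (exit 0) only if every line of `psrinfo` output on stdin
# has "on-line" in the state column; otherwise exit 1.
all_online() {
    awk '$2 != "on-line" { bad = 1 } END { exit (bad + 0) }'
}
```

Usage: `psrinfo | all_online && echo "all processors on-line"`.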
