Hardware failures in 2017 - Post yours!

Discussion in 'General Chat' started by Patrick, Jan 4, 2017.

  1. alex_stief

    alex_stief Active Member

    Joined:
    May 31, 2016
    Messages:
    406
    Likes Received:
    104
    Nothing spectacular: Two 8GB DDR3 RDIMMs from Samsung and two Hard drives: One Seagate, one Toshiba. Sooner or later all of our HDDs will be WD.
     
    #41
  2. Fritz

    Fritz Well-Known Member

    Joined:
    Apr 6, 2015
    Messages:
    1,867
    Likes Received:
    367
    Just had a Supermicro BPN-SAS2-846EL1 screw the pooch. :( All drives dropped never to be seen again. I had a spare SAS1 BP I replaced it with and since box only houses 2TB drives I'll be leaving it in.
     
    #42
  3. csc2ya

    csc2ya New Member

    Joined:
    Feb 1, 2018
    Messages:
    1
    Likes Received:
    0
    Hi all....first post...good to be here. I'm male, 33 years of age, from the UK. I've always loved servers and computers.

    Had 2 hardware failiures recently, both on my servers. My supermicro box at OVH running windows had it's motherboard fail at the end of last year. Second mirrored drive went first, OVH replaced it. I rebuilt it's mirror...but still had issues. OVH checked it again, and ended up replacing the motherboard. I updated the MAC address in windows according to their instructions, and it was back online. That same machine has been nearly entirely replaced over the 3 years i've had it (2 motherboards, several harddrives, memory, the power supply more than once...all with no data loss though...thank god for mirrored drives)

    My second supermicro box at online.net (also running windows) had some sort of failiure at the beginning of this year. I'm unsure what was wrong (possibly the mobo), but they replaced that machine (a blade in a supermicro microcloud) entirely.

    Fortunately no important data was lost in either case. The online.net box was a complete loss, but I had local backups of that one.
     
    #43
    Last edited: Feb 1, 2018
  4. TeeJayHoward

    TeeJayHoward Active Member

    Joined:
    Feb 12, 2013
    Messages:
    374
    Likes Received:
    106
    Well, it's technically 2018... But I've had my first hardware failure of the year, which somehow prompted an upgrade. It started out with me trying to migrate from vSphere to RHEV. I fired up a node I hadn't used in a while, and experienced a lot of issues with the ConnectX-2 card. Well, here's the reason:
    [​IMG]

    Popped it back on, and good to go. On another project, I was ripping some BluRays, and suddenly started seeing errors. Turns out I had a dead disk. Well, what can you do? Spend the $35 to replace it?
    [​IMG]

    Of course you could... OR, you could use it as an opportunity to upgrade. 6x12TB disks to add to the array, a card to turn one of the 4Us into a JBOD, a cable to attach them, and a bunch of leftover hardware from two lab revisions ago... Presto!
    [​IMG]
    [​IMG]

    Hrm, while I'm in here, how about tidying up the cables a bit for better airflow?
    [​IMG]

    ...Okay, I still need to finish that project. But it's time to stop the work that I pay for, and start the work that pays me. And if I get a chance tonight, maybe I'll try and find out who makes lacing cable and replace all those twist ties.
     
    #44
    MikeWebb, T_Minus, cactus and 2 others like this.
  5. T_Minus

    T_Minus Moderator

    Joined:
    Feb 15, 2015
    Messages:
    6,783
    Likes Received:
    1,457
    Did you just pop it back on or secure it this time?

    I'd say about 20-30% of the NICs I get in the mail are popped, honestly just pop them back here... but after hearing yours popped in a server I think I'll be securing them different now :D
     
    #45
  6. pricklypunter

    pricklypunter Well-Known Member

    Joined:
    Nov 10, 2015
    Messages:
    1,521
    Likes Received:
    432
    That's usually a symptom of the little brass rivets having lost their tension due to heating/ cooling. I usually just remove the heatsink, clean it and the chip and replace it with a dab of arctic mx something or other. I also splay the rivets a little with a thin spudger to re-tension them. After having done that I have never had one detach on its own again :)
     
    #46
  7. Patrick

    Patrick Administrator
    Staff Member

    Joined:
    Dec 21, 2010
    Messages:
    11,545
    Likes Received:
    4,467
    I have to admit, I wish they just used some sort of thermal tape/ cement on NIC heatsinks. This happens too often.
     
    #47
  8. pricklypunter

    pricklypunter Well-Known Member

    Joined:
    Nov 10, 2015
    Messages:
    1,521
    Likes Received:
    432
    In days gone by, manufacturers used to use chip bond, which was a sort of metal loaded epoxy, to directly bond heatsinks to the plastic or ceramic chip carrier. Unfortunately, miniaturisation, new soldering and machining processes, modern chip packaging, cost etc all played a part in the demise of such practises. Well that and folks also wanted to be able to pop a device, rework a board and replace it easily without resorting to using a crow bar to get the device off the board. It is also conductive, so being careful with it on a modern board is an understatement. Still I keep some around for just those times when nothing else will do :)
     
    #48
  9. TeeJayHoward

    TeeJayHoward Active Member

    Joined:
    Feb 12, 2013
    Messages:
    374
    Likes Received:
    106
    I just popped it back on and used a pocket knife to spread the tabs out an absurd amount. Not sayin' it can't come back off, just that it'd be impressive if it did.
     
    #49
  10. DRAGONKZ

    DRAGONKZ Member

    Joined:
    Apr 9, 2018
    Messages:
    61
    Likes Received:
    3
    I had the CIMC fail on my Cisco C220 M3 and it now refuses to boot.

    It’s impossible to get a board 2nd hand in AUS so I had to order one from the USA.

    Playing the homelab game is so much easier and cheaper over there!!
     
    #50
  11. Jaket

    Jaket Member

    Joined:
    Jan 4, 2017
    Messages:
    69
    Likes Received:
    11
    Not a 2017 but 2018 fail.
    We had a new custom built deep learning system built out shipped, couldn't get everything working after over a week of troubleshooting turns out we had to RMA the whole system was a 30k system wasn't a fun process.
     
    #51
Similar Threads: Hardware failures
Forum Title Date
General Chat Hardware suggestions to reduce power usage Nov 7, 2019
General Chat New guy intro, and hardware suggestions Oct 31, 2019
General Chat Looking for suggestions on how to sort hardware accessories (screws etc.) Jul 11, 2019
General Chat What does everyone do with old hardware? Jan 5, 2019
General Chat Dell hardware forwarding to EU Feb 1, 2018

Share This Page