Search results

  1. I

    bad ssd in nexus n9k

    I had to replace a boot ssd in a n9k-c93108 but i can't find any working images? Could anyone help me with such a thing? Thanks!
  2. I

    NVLINK bridge use in datacenter?

    I really like the HGX SXM4 A100 setups instead of PCIE AIC with a bridge. If your purchasing a whole server with everything I don't think there is much cost difference.
  3. I

    A100 vs A6000 vs 3090 for DL and FP32/FP64

    our 3090s running in a server room so the heat issue is not bad for us have the lowest latency with bs1 inference
  4. I

    3090 driver handicap?

    Machine (252GB total) Package L#0 NUMANode L#0 (P#0 126GB) L3 L#0 (16MB) L2 L#0 (512KB) + L1d L#0 (32KB) + L1i L#0 (32KB) + Core L#0 PU L#0 (P#0) PU L#1 (P#32) L2 L#1 (512KB) + L1d L#1 (32KB) + L1i L#1 (32KB) + Core L#1 PU L#2 (P#1) PU L#3...
  5. I

    3090 driver handicap?

    Right, we were investigating numa domains and such. Also correct that our 3090 test rig is a single processor i9 9900k, so much simpler topology. And we're talking about ~10ms/req, it seems much higher than could be possible with a numa issue and the machine not being under significant load.
  6. I

    3090 driver handicap?

    No, we are trying to figure out if its just too early with pytorch or a driver issue or something else? And by faster I ment lower latency per frame processed in small batch sizes, not more frames processed in big batches...
  7. I

    Best motherboard for many memory slots?

    The use case is storing a buffer of all frames from a bunch of cameras that infeed into a vision system in a factory so we can scrub through the data if we want, it uses about 1-2tb per 24h of run time. * all means frames that actually matter, not frames where for example no motion happened...
  8. I

    Best motherboard for many memory slots?

    I would like to stand up 2-4TB of memory for a rotating image cache not writing to a disk? These images are not critical to store for long periods, right now we just store a few minutes worth of content. Suggestions?
  9. I

    FS: Ubiquiti Unifi Stuff - 2x US‑48‑500W, 2x UAP‑AC‑HD, 2x UAP-IW-HD, US-8-150W, 2x UAP-AC-M

    I am interested in the 2x 48-500W and US-8-150W switches and 2x UAP‑AC‑HD I'm not local, shipping to CA would be required, but I will take both switches at once? Let me know.
  10. I

    3090 driver handicap?

    We are doing awesome with the RTX 3090 for pytorch and vision. It seems to be faster than RTX Titan and A100 SXM
  11. I

    A100 poor performance?

    Is anyone else doing inferance on a100 with pytorch? We run a number of vision machines with the following cards RTX Titan, RTX 3090, SXM A100 The important metric for us is frame to class latency with bursts of frames, we batch results up to N frames per batch and count the round trip time...
  12. I

    Wanted - V100 PCIE AIC

    Anyone getting rid of V100s yet? Let me know, I could use a few.
  13. I

    Best chassis for RTX Titans

    What kind of server chassis are people using for 4x or 8x RTX Titan configurations? I know there are a number of things built up for V100
  14. I

    HPE SAS SSD drives fail @ 32,768 hours

    Anyone have the firmware to update these?
  15. I

    When to switch from flat to routed?

    I've exaggerated the spanning tree event frequency, it just worries me that the network has become too large.
  16. I

    When to switch from flat to routed?

    How big is too big for people setting up flat networks? I'm getting to the point where I think there are a lot of spanning tree events and now I've had a situation with a Ubiquiti ES-16-XG 10gb switch that failed to block a spanning tree loop causing a forwarding loop, it took me a little bit...
  17. I

    Containers with macvlan network AND bridge

    I have a number of vlans and want to spin up containers that provide vlan specific services, this works awesome and is super easy with Docker. Now I want to also drop that container on the bridge 172.17/16 network and pull data with telegraf, this works with an error when I try to add the...
  18. I

    Patching Intel X520 EEPROM to unlock all SFP+ transceivers

    This is awesome, I have looked into this problem just enough to become frustrated and switch to MNX QSFP cards and have a pile of cards on a shelf that do not accept "locked"/unapproved SFP+ modules for SM LR 10k. I felt like I had done something wrong here and just needed to take more time on...
  19. I

    Mellanox 40gbe tuning windows? I have terrible performance.

    Yes, I think its quite important to decouple the storage performance and network performance. Most of the testing I have done with these MNX CX3 cards gets about 22gig/sec from host to switch to host For me, this is a very typical iperf3 output for a single thread from a xeon 2697v2 to another...
  20. I

    Need help with making reflashed HP / MCX354A work at 40Gb/s

    Have you tried multiple iperf3 processes on different ports at the same time?