1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

New 2011 v3 system with ASUS Z10PE-D16 WS and 2x e5-2686 v3

Discussion in 'DIY Server and Workstation Builds' started by wildpig1234, Aug 28, 2017.

  1. TLN

    TLN Member

    Joined:
    Feb 26, 2016
    Messages:
    221
    Likes Received:
    21
    What is the result I should be looking at? PI Calculation? I.e. my result is 220?
     
    #21
  2. Patriot

    Patriot Moderator

    Joined:
    Apr 18, 2011
    Messages:
    1,126
    Likes Received:
    588
    ooou oou can I play?
    [​IMG]
     
    #22
    TType85, wildpig1234 and Marsh like this.
  3. Marsh

    Marsh Moderator

    Joined:
    May 12, 2013
    Messages:
    1,253
    Likes Received:
    498
    Wow, :cool:
    I lookup the price of each retail E5-4669 v4 is $5K, my down payment for my first house was $2.5K.
     
    #23
  4. Patriot

    Patriot Moderator

    Joined:
    Apr 18, 2011
    Messages:
    1,126
    Likes Received:
    588
    The ebayed 4699s are often less than 2699s and they are 135w chips instead of 145w.
     
    #24
  5. wildpig1234

    wildpig1234 Active Member

    Joined:
    Aug 22, 2016
    Messages:
    564
    Likes Received:
    49
    You should submit the result so you can be on that page! your benchmark fits right on with the other two 2683 v3 on there! looks like it narrowly beat the dual epyc 7601!


    :) sure. download the y cruncher and crunch away... You should get some impressive time with that 44C system!
     
    #25
  6. TLN

    TLN Member

    Joined:
    Feb 26, 2016
    Messages:
    221
    Likes Received:
    21
    I'll be rebuilding my rig soon, and will try that test on bare metal. Today I was running with my Desktop VM in the background (and 10 other VMs). I can estimate 10-15% performance increase.
     
    #26
  7. wildpig1234

    wildpig1234 Active Member

    Joined:
    Aug 22, 2016
    Messages:
    564
    Likes Received:
    49
    TLN, if you got a chance and don't mind helping me out, much appreciated if you can test out the dual cpu with only 32GB ram or only 1 dimm per cpu and see what result you get for 5 billions digits. I am wondering if it's b/c I am running single channel that is limiting my time. thanks.
     
    #27
  8. TLN

    TLN Member

    Joined:
    Feb 26, 2016
    Messages:
    221
    Likes Received:
    21
    If you can wait a little bit - I can do that for sure. I'm getting new case tomorrow (according to UPS). Will start moving things into new enclosure later this week. During that time I wanna do a clean install of Windows and test some equipment that I have here. I'll be able to remove memory and run that test for you.
     
    #28
  9. wildpig1234

    wildpig1234 Active Member

    Joined:
    Aug 22, 2016
    Messages:
    564
    Likes Received:
    49
    thanks a lot. i won't get back til the end of the week to be able to run this test on my dual e5-2670 but i figure it would be more realistic to run this on a v3 system. still really bothered on why my time to 5 billion digits is so slow with the dual cpu.
     
    #29
  10. biorpg

    biorpg New Member

    Joined:
    May 21, 2017
    Messages:
    7
    Likes Received:
    2
    Thanks for the info on y-cruncher, I had not used it before. Here are the results I got on my similar system:

    Z10PE-D16 WS
    2x Xeon E5-2689 v4
    128GB(8x16GB) DDR4 2133MHz
    2x Geforce 1070 GTX

    These benchmarks were performed with a bclk of 103.5, NUMA mode memory(4 nodes), Cluster-on-Die QPI, and SLI disabled.

    Single-core benchmarks included to highlight the E5-2689 v4's outstanding 3.7GHz all-core turbo. (3.83Ghz with 103.5 bclk) The Pi-5b benchmark is the multi-core run, and the Pi-500m benchmark is with a single core.
    The 3694 Cinebench score shown above the current (bright orange) score was achieved with SLI enabled.(This is the only differing factor that I can think of.)

    [​IMG]
    [​IMG]
     
    #30
    Last edited: Sep 14, 2017
    wildpig1234 likes this.
  11. biorpg

    biorpg New Member

    Joined:
    May 21, 2017
    Messages:
    7
    Likes Received:
    2
    Fujitsu has some excellent documentation for their Primergy servers that is generally applicable to all systems using the same chipset and CPU architecture.
    The following table shows the relative performance of using less than 1 DPC (DIMM per Channel) which is the 'minimum' 4-way configuration. You are using 0.25 DPC(1-way), and as you can see, this scales almost linearly with the performance impact:
    [​IMG]
    (https://sp.ts.fujitsu.com/dmsp/Publications/public/wp-broadwell-ep-memory-performance-ww-en.pdf)

    In essence, its amazing you are even able to boot :D
     
    #31
    Last edited: Sep 14, 2017
    wildpig1234 likes this.
  12. wildpig1234

    wildpig1234 Active Member

    Joined:
    Aug 22, 2016
    Messages:
    564
    Likes Received:
    49
    So my first y cruncher run was with 1 cpu and 2 dimm for that one cpu and my 2nd run of y cruncher was with 2 cpu but only 1 dimm per cpu. I guess not having all 4 channels really limiting the performance?
     
    #32
  13. biorpg

    biorpg New Member

    Joined:
    May 21, 2017
    Messages:
    7
    Likes Received:
    2
    Sorry, a couple slight mistakes. That table and document are for Broadwell-EP (E5-2600 v4). And the table shows performance impact of actually setting the memory interleaving to 1-way within the BIOS, while actually populating 1 DPC (8 dimms).
    Here is the information for Haswell-EP (E5-2600 v3) along with more of the text explaining the table, and an excerpt from later on in the document that actually mentions your configuration.

    [​IMG]

    (https://sp.ts.fujitsu.com/dmsp/Publications/public/wp-haswell-ep-memory-performance-ww-en.pdf)
     
    #33
    Last edited: Sep 14, 2017
  14. wildpig1234

    wildpig1234 Active Member

    Joined:
    Aug 22, 2016
    Messages:
    564
    Likes Received:
    49
    Guess I ll have to get the other 6 dimm asap
     
    #34
  15. biorpg

    biorpg New Member

    Joined:
    May 21, 2017
    Messages:
    7
    Likes Received:
    2
    Is there any way to play DirectX12 games on 2008R2?
     
    #35
  16. wildpig1234

    wildpig1234 Active Member

    Joined:
    Aug 22, 2016
    Messages:
    564
    Likes Received:
    49
    #36
  17. wildpig1234

    wildpig1234 Active Member

    Joined:
    Aug 22, 2016
    Messages:
    564
    Likes Received:
    49
    with regard to the topic of memory bandwidth and number of channels and performance hit, I am assuming the above would also apply to 2011 v1? I still have to save up for the six 16gb ddr4 dimm which are not cheap at all ;(. but I am wondering if I can simulate this effect on my old 2011-v1 system with dual cpu and 16x 8GB dimm by changing the installed ram configuration accordingly.
     
    #37
  18. wildpig1234

    wildpig1234 Active Member

    Joined:
    Aug 22, 2016
    Messages:
    564
    Likes Received:
    49
    So I tested out using various different amount of memory channel on my e5-2670 system:
    2.5 billions digits pi using y cruncher which use 13gb ram

    1 cpu 2x 8gb (dual channel) : 481 sec
    2 cpu 2x 8gb (single channel, 1 dimm/cpu): 478 sec
    2 cpu 4x 8gb (dual channel , 2 dimm/cpu): 377 sec
    2 cpu 8x 8gb (quad channel, 4 dimm/cpu): 288 sec

    so seems like the number of memory channel can have very big impact
     
    #38
    Last edited: Sep 17, 2017 at 9:56 PM
    TType85 likes this.
  19. TLN

    TLN Member

    Joined:
    Feb 26, 2016
    Messages:
    221
    Likes Received:
    21
    It's not necessary channels, but memory bandwidth. And in that specific application: Easy to calculate, so you need as much mem bandwidth as possible.

    Also, desktop memory works at higher speeds: For example DDR4-4800 will give you more throughput then server DDR4-2133, even if you can compare 1CPU to 2CPU. You cannot use i7 in dual-cpu configurations.
     
    #39
  20. wildpig1234

    wildpig1234 Active Member

    Joined:
    Aug 22, 2016
    Messages:
    564
    Likes Received:
    49
    It seems like y cruncher uses the memory bandwidth differently than cinebench. Cinebench scales correctly even though I was only using single channel memory config with the dual 2686v3 but y cruncher was not scalung correctly when you use single channel.... y cruncher needs lots of memory bandwidth
     
    #40
Similar Threads: 2011 system
Forum Title Date
DIY Server and Workstation Builds Help requested: Quad LGA2011 Build Feb 19, 2017
DIY Server and Workstation Builds Unbranded LGA2011 motherboard Feb 11, 2017
DIY Server and Workstation Builds Intel S2600CP2J Motherboard Dual 2011 Socket 2 x E5-2670 Dec 23, 2016
DIY Server and Workstation Builds LGA 2011-3 Workstation with Xeon E5 1600v4, ECC RAM, and two GeForces 1080 Dec 8, 2016
DIY Server and Workstation Builds SBS 2011 Server Veeam backup fail Sep 11, 2015

Share This Page