ANSYS 17.2 CFX Benchmark External Flow Over a LeMans Car

Wow, yet another ANSYS benchmarking blog post? I know, but I have had four blog posts in limbo for months. There is no better time than now, and it is Friday, so it is time to knock another one of these fine-looking ANSYS 17.2 benchmarking results off my list!

The ANSYS 17.2 CFX External Flow Over a LeMans Car Test Case

…dun dun dah!

On The Fast Track! ANSYS 17.2

The ANSYS CFX test case:

  • Approximately 1.8 million nodes
  • 10 million elements, all tetrahedral
  • Solves compressible fluid flow with heat transfer using the k-epsilon turbulence model.

ANSYS Benchmark Test Case Information

  • ANSYS HPC Licensing Packs required for this benchmark
    • I used (3) HPC Packs to unlock all 56 cores of the CUBE a56i.
    • The fastest solve time goes to the CUBE a56i – Boom!
      • From start to finish a total of forty-six (46) ticks on the clock on the wall occurred.
      • A total of fifty-five (55) cores in use between two twenty-eight (28) core nodes.
      • Windows 2012 R2 Standard Edition w/HPC update 3
      • MS-MPI v7.1
      • ANSYS CFX 17.2
  • Please contact your local ANSYS Software Sales Representative for more information on purchasing ANSYS HPC Packs. You too may be able to speed up your solve times by unlocking additional compute power!
  • What is a CUBE? For more information regarding our Numerical Simulation workstations and clusters, please contact our CUBE Hardware Sales Representative at SALES@PADTINC.COM. Every CUBE is designed, tested and configured within your budget. We are happy to help and to listen to your specific needs.

Figure 1 – ANSYS CFX benchmark data for the tetrahedral, 10 million elements External Flow Over a LeMans Car Test Case

ANSYS CFX Benchmark Data

ANSYS CFX Test Case Details – Click Here for more information on this benchmark

External Flow Over a LeMans Car
Number of nodes: 1,864,025
Element type: Tetrahedral
Models: k-epsilon Turbulence, Heat Transfer
Solver: Coupled Implicit

The CPU Information

The benchmark data comes from running the ANSYS CFX External Flow Over a LeMans Car test case. Take a minute or three to look at how these CPUs perform with two of the very latest ANSYS releases, ANSYS Release 17.1 and ANSYS Release 17.2.

Wall Clock Time!

I have tuned our numerical simulation machines with a focus on wall clock time for years now. Funnily enough, if you ask Eric Miller, we were talking about wall clock times just this morning.

What is wall clock time? Simply put: how does the solve time FEEL to the engineer? Yes, I just equated a feeling to a non-human event. Ah yes, to feel... Oh, and I was reminded of an old Van Halen song where David Lee Roth says:

“Oh man, I think the clock is slow.

I don’t feel tardy.

Class Dismissed!”
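To make the wall clock idea concrete, here is a minimal Python sketch that times a task two ways: elapsed wall clock time and CPU time consumed by the process. It is only an illustration, not how ANSYS CFX reports its timings; the helper name `timed` is hypothetical, and `time.sleep` stands in for any job where the engineer waits while the CPU sits idle.

```python
import time


def timed(fn, *args, **kwargs):
    """Run fn and return (result, wall_seconds, cpu_seconds).

    wall_seconds is what the clock on the wall shows.
    cpu_seconds is CPU time used by this Python process only
    (child processes and sleeping are not counted).
    """
    wall_start = time.perf_counter()
    cpu_start = time.process_time()
    result = fn(*args, **kwargs)
    cpu_seconds = time.process_time() - cpu_start
    wall_seconds = time.perf_counter() - wall_start
    return result, wall_seconds, cpu_seconds


if __name__ == "__main__":
    # Sleeping burns wall clock time but almost no CPU time.
    _, wall, cpu = timed(time.sleep, 0.25)
    print(f"wall clock: {wall:.3f} s, CPU: {cpu:.3f} s")
```

The gap between the two numbers is the point: the engineer feels the wall clock figure, whatever the CPU counters say.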

The CUBE phenomenon

CUBE a56i Appliance – Windows 2012 R2 Standard w/HPC
1U CUBE APPLIANCE (2 x 28)
4 x 14c @2.6GHz/ea – Intel® Xeon® e5-2690 V4
Dual Socket motherboard
256GB DDR4-2400 MHz LRDIMM
4 x 600GB SAS3 15k RPM
56Gbps Infiniband FDR CPU High Speed Interconnect
10Gbps Ethernet Low Latency

CUBE w32i Workstation – Windows 10 Professional
1 x 4U CUBE APPLIANCE
2 x 16c @2.6GHz/ea – Intel® Xeon® e5-2697a V4
Dual Socket motherboard
256GB DDR4-2400 MHz LRDIMM
2 x 600GB SAS3 15k RPM
NVIDIA QUADRO M4000

It Is All About The Data

 11/17/2016

PADT, Inc. – Tempe, AZ

Total wall clock time

Cores    CUBE w32i         CUBE a56i         CUBE a56i
         ANSYS CFX 17.1    ANSYS CFX 17.1    ANSYS CFX 17.2
2        555               636               609
4        304               332               332
8        153               191               191
16       105               120               120
24       78                84                84
32       73                68                68
38       0                 61                59
42       0                 55                55
48       0                 51                51
52       0                 52                48
55       0                 47                46
56       0                 52                51
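For anyone who wants to turn these wall clock numbers into speedup and parallel efficiency, here is a rough Python sketch. The times are copied from the table above. Two things are my assumptions rather than part of the benchmark: a value of 0 is treated as "not run at that core count", and the 2-core run is used as the baseline. The function names `scaling` and `fastest` are made up for this example.

```python
# Wall clock times copied from the table above (units as reported there).
CORES = [2, 4, 8, 16, 24, 32, 38, 42, 48, 52, 55, 56]
WALL_CLOCK = {
    "CUBE w32i, CFX 17.1": [555, 304, 153, 105, 78, 73, 0, 0, 0, 0, 0, 0],
    "CUBE a56i, CFX 17.1": [636, 332, 191, 120, 84, 68, 61, 55, 51, 52, 47, 52],
    "CUBE a56i, CFX 17.2": [609, 332, 191, 120, 84, 68, 59, 55, 51, 48, 46, 51],
}


def scaling(times, cores=CORES):
    """Return (cores, time, speedup, efficiency) rows.

    Speedup is relative to the first (2-core) run; efficiency is
    speedup divided by the increase in core count. Entries with a
    time of 0 are skipped (assumed to mean "not run").
    """
    base_cores, base_time = cores[0], times[0]
    rows = []
    for n, t in zip(cores, times):
        if t <= 0:
            continue
        speedup = base_time / t
        efficiency = speedup / (n / base_cores)
        rows.append((n, t, speedup, efficiency))
    return rows


def fastest(times, cores=CORES):
    """Return (cores, time) for the shortest run that actually happened."""
    t, n = min((t, n) for n, t in zip(cores, times) if t > 0)
    return n, t


if __name__ == "__main__":
    for name, times in WALL_CLOCK.items():
        n, t = fastest(times)
        print(f"{name}: fastest at {n} cores ({t})")
        for cores_used, t, s, e in scaling(times):
            print(f"  {cores_used:>2} cores  time {t:>3}  speedup {s:5.2f}  efficiency {e:4.0%}")
```

Run against the table, `fastest` picks out the 55-core run on the CUBE a56i with CFX 17.2, which matches the fastest solve time called out earlier.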

Picture Sharing Time!

Check out the pictures below of the Microsoft Server 2012 R2 HPC Cluster Manager.

I used Windows Server 2012 R2 on both of the compute nodes that make up the CUBE a56i.

Microsoft 2012 R2 w/HPC is very quick, and oh so very powerful!

[Image: Microsoft Windows 2012 R2 HPC. It is time…]

[Image: The INTEL XEON e5-2690 v4 loves the turbo mode. Vrrooom!]

Please be safe out there in the wilds, you are all dismissed for the weekend!

Part 2: ANSYS FLUENT Performance Comparison: AMD Opteron vs. Intel XEON

AMD Opteron 6308, INTEL XEON e5-2690 & INTEL XEON e5-2667V2 Comparison using ANSYS FLUENT 14.5.7

Note: The information and data contained in this article were compiled and generated on September 12, 2013 by PADT, Inc. on CUBE HVPC hardware using FLUENT 14.5.7. Please remember that hardware and software change with new releases and you should always try to run your own benchmarks, on your own typical problems, to understand how performance will impact you.

By David Mastel

Due to the response to the original article on this subject, I thought it would be good to do a quick follow-up using one of our latest CUBE HVPC builds. Again, the ANSYS Fluent standard benchmarks were used to gather the stats on this dual socket INTEL XEON e5-2667V2 configuration.

CUBE HVPC Test configurations (Same as in last comparison)

Server 1: CUBE HVPC c16

  • CPU: 4, AMD Opteron 6308 @ 3.5GHz (Quad Core)
  • Memory: 256GB (32x8G) DDR3-1600 ECC Reg. RAM (1600MHz)
  • Hardware RAID Controller: Supermicro AOC-S2208L-H8iR 6Gbps, PCI-e x 8 Gen3
  • Hard Drives: Supermicro HDD-A0600-HUS156060VLS60 – Hitachi 600G SAS2.0 15K RPM 3.5″
  • OS: Linux 64-bit / Kernel 2.6.32-358.18.1.el6.x86_64
  • App: ANSYS FLUENT 14.5.7
  • MPI: Platform MPI
  • HCA: SMC AOC-UIBQ-M2 – QDR Infiniband
    • The IB card was installed; however, solves were run distributed locally
  • Switch: MELLANOX IS5023 Non-Blocking 18-port switch

Server 2: CUBE HVPC c16i (Intel server from last comparison)

  • CPU: 2, INTEL XEON e5-2690 @ 2.9GHz (Octa Core)
  • Memory: 128GB (16x8G) DDR3-1600 ECC Reg. RAM (1600MHz)
  • RAID Controller: Supermicro AOC-S2208L-H8iR 6Gbps, PCI-e x 8 Gen3
  • Hard Drives: Supermicro HDD-A0600-HUS156060VLS60 – Hitachi 600G SAS2.0 15K RPM 3.5″
  • OS: Windows 7 Professional 64-bit
  • App: ANSYS FLUENT 14.5.7
  • MPI: Platform MPI

Server 3: CUBE HVPC c16ivy (New “Ivy” based Intel server)

  • CPU: 2, INTEL XEON e5-2667V2 @ 3.3GHz (Octa Core)
  • Memory: 128GB (16x8G) DDR3-1600 ECC Reg. RAM (1600MHz)
  • RAID Controller: Supermicro AOC-S2208L-H8iR 6Gbps, PCI-e x 8 Gen3
  • Hard Drives: Supermicro HDD-A0600-HUS156060VLS60 – Hitachi 600G SAS2.0 15K RPM 3.5″
  • OS: Linux 64-bit / Kernel 2.6.32-358.18.1.el6.x86_64
  • App: ANSYS FLUENT 14.5.7
  • MPI: Platform MPI
  • HCA: SMC – QDR Infiniband
    • The IB card was installed; however, solves were run distributed locally

ANSYS FLUENT 14.5.7 Performance using the ANSYS FLUENT Benchmark suite provided by ANSYS, Inc.

ANSYS Fluent Benchmark page link: http://www.ansys.com/Support/Platform+Support/Benchmarks+Overview/ANSYS+Fluent+Benchmarks

Release ANSYS FLUENT 14.5.7 Test Cases
(20 Iterations each)

  • Reacting Flow with Eddy Dissipation Model (eddy_417k)
  • Single-stage Turbomachinery Flow (turbo_500k)
  • External Flow Over an Aircraft Wing (aircraft_2m)
  • External Flow Over a Passenger Sedan (sedan_4m)
  • External Flow Over a Truck Body with a Polyhedral Mesh (truck_poly_14m)
  • External Flow Over a Truck Body 14m (truck_14m)

Here are the results for all three machines, total and average time:

[Charts: Intel-AMD-Fluent-Part2-Chart1, Intel-AMD-Fluent-Part2-Chart2]

Summary: Are you sure? Part 2

So I didn’t have to have the “Are you sure?” conversation with Eric this time, and I didn’t bother triple-checking the results because, indeed, the Ivy Bridge-EP Socket 2011 is one fast CPU! Combine that with a 0.022 micron manufacturing process and the data speaks for itself. For example, let’s dig back into the data for the External Flow Over a Truck Body with a Polyhedral Mesh (truck_poly_14m) benchmark and see what we find:

[Image: Intel-AMD-FLUENT-Details]

[Image: Intel-AMD-FLUENT-summary]

Current Pricing of INTEL® and AMD® CPUs

Here is the up-to-the-minute pricing for each CPU. I took these prices from the NewEgg and IngramMicro websites on October 4, 2013.

Note that AMD’s price per CPU went up and the INTEL XEON e5-2690 went down. Again, these prices are based on today’s pricing, October 4, 2013.

AMD Opteron 6308 Abu Dhabi 3.5GHz 4MB L2 Cache 16MB L3 Cache Socket G34 115W Quad-Core Server Processor OS6308WKT4GHKWOF

  • $501 x 4 = $2004.00

Intel Xeon E5-2690 2.90 GHz Processor – Socket LGA-2011, L2 Cache 2MB, L3 Cache 20 MB, 8 GT/s QPI

  • $1986.48 x 2 = $3972.96

Intel Xeon E5-2667V2 3.3 GHz Processor – Socket LGA-2011, L2 Cache 2MB, L3 Cache 25 MB, 8 GT/s QPI

  • $1933.88 x 2 = $3867.76
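Since all three servers end up with 16 cores (four quad-core Opterons versus two octa-core Xeons), it can be handy to look at the CPU spend per core as well as per server. The short Python sketch below redoes the arithmetic above using the October 4, 2013 prices. The per-core figure is my own derived number, not something quoted by the vendors, and the helper name `cpu_cost` is hypothetical.

```python
from decimal import Decimal

# (name, unit price in USD, CPUs per server, cores per CPU), taken from this post
CPUS = [
    ("AMD Opteron 6308", Decimal("501.00"), 4, 4),
    ("Intel Xeon E5-2690", Decimal("1986.48"), 2, 8),
    ("Intel Xeon E5-2667V2", Decimal("1933.88"), 2, 8),
]


def cpu_cost(unit_price, cpu_count, cores_per_cpu):
    """Return (total CPU cost per server, cost per core rounded to cents)."""
    total = unit_price * cpu_count
    per_core = (total / (cpu_count * cores_per_cpu)).quantize(Decimal("0.01"))
    return total, per_core


if __name__ == "__main__":
    for name, price, count, cores in CPUS:
        total, per_core = cpu_cost(price, count, cores)
        print(f"{name}: ${price} x {count} = ${total} (${per_core} per core)")
```

`Decimal` is used instead of floats so the dollar amounts come out exactly as written in the list above.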

REFERENCES:
http://www.ingrammicro.com
http://www.newegg.com

INTEL XEON e5-2667V2
http://ark.intel.com/products/75273/Intel-Xeon-Processor-E5-2667-v2-25M-Cache-3_30-GHz

INTEL XEON e5-2690
http://ark.intel.com/products/64596/

AMD Opteron 6308
http://www.amd.com/us/Documents/Opteron_6300_QRG.pdf

http://en.wikipedia.org/wiki/Double-precision_floating-point_format

http://en.wikipedia.org/wiki/Central_processing_unit#Integer_range

http://en.wikipedia.org/wiki/Floating_point

STEP OUT OF THE BOX, STEP INTO A CUBE

PADT offers a line of high performance computing (HPC) systems specifically designed for CFD and FEA number crunching aimed at a balance between cost and performance. We call this concept High Value Performance Computing, or HVPC. These systems have allowed PADT and our customers to carry out larger simulations, with greater accuracy, in less time, at a lower cost than name-brand solutions. This leaves you more cash to buy more hardware or software.

Let CUBE HVPC by PADT, Inc. quote you a configuration today!