Greetings from the HPC numerical simulation proving grounds of PADT, Inc. in Tempe, Arizona. While bench marking the very latest version of ANSYS® Mechanical™ I learned something very significant and I need to share this information with you right now.As I gazed down on the data outputs from the new solve.out files, I began to notice something. Yes change indeed, something was different, something had changed.
A brief pause for emphasis, in regards in overall ANSYS® productivity and amazing improvements please read this post.
However, pertaining to this blog post, I am focusing on one very important HPC performance metric to me. It is one of the many HPC performance metrics that I have used when creating a balanced HPC server for engineering simulation.. But wait there is more! so please wait just a little bit longer, for very soon I will post even more juicy pieces of data garnered from taken from these new ANSYS® benchmark solver files.
To recap in all of its bullets points & glories:
- For today and just for today, we are focusing on just one of the performance metrics.
- The Time Spent Computing The Solution!
- This 1.3x speedup in solve times was achieved using just one CUBE workstation and with just one click!
- Open ANSYS®and while you are creating your solve.
- Select, withjust one click either the INTEL MPI or IBM Platform MPI.
- Next, run your test repeat as necessary using whichever MPI version that you did not start your test with.
The ANSYS® Mechanical™ Benchmark Description:
- V15sp-5
- Sparse solver, symmetric matrix, 6000k DOFs, transient, nonlinear, structural analysis with 1 iteration
- GPU Accelerator or Co-Processor enabled for: NVIDIA and Intel Phi
- A large sized job for direct solvers, should run incore on machines with 128 GB or more of memory, good test of processor flop speed if running incore and I/O if running out-of-core
CUBE ANSYS Numerical Simulation Appliance Used:
- CUBE w16i-v4
- Just One – Black 4U Supermicro Whisper Quiet Workstation Chassis
- CPU: Dual Intel® Xeon® e5-2667 V4’s
- RAM: DDR4-2400 MHz LRDIMM
- HDD: SAS3 15k RPM
- GPU/ACCELERATOR: NVIDIA QUADRO K6000
- OS: Windows 10 Professional 64-bit
- SOFTWARE:
- ANSYS® Mechanical™ 17.1
- INTEL® MPI v5.0.3.048
- Platform MPI v9.1.3.1
The ANSYS® Mechanical™ Benchmark Results:
|
TIME SPENT COMPUTING THE SOLUTION | TIME SPENT COMPUTING THE SOLUTION | |
IBM Platform MPI | INTEL MPI | ||
Cores | 2016 CUBE w16i-v4 | 2016 CUBE w16i-v4 | This Speedup is…X faster! |
2 | 396.1 | 380.9 | 1.04 |
4 | 239.7 | 229.6 | 1.04 |
6 | 210.1 | 196.7 | 1.07 |
8 | 182.9 | 168.7 | 1.08 |
10 | 167.2 | 161.4 | 1.04 |
12 | 167.1 | 160.7 | 1.04 |
14 | 196.1 | 151.3 | 1.30 |
16 | 184.7 | 161.7 | 1.14 |
Wow! using these latest 14nm INTEL® XEON® CPU’s, phew, I have been forever changed! As you can see from the data above, in just one simple click, changing from the IBM Platform MPI to using INTEL MPI and look! the benchmark time spent computing times are faster! A 1.3x Speedup!
Now in this specific benchmark example along with the use of the latest ANSYS® Mechanical achieving a 1.3x speedup without spending another penny is very wise and not so foolish.
Disclaimer: Please check with your ANSYS Software Sales Representative for the very latest on solver updates and information. Because some of the models and compatibility can very on the . You may need to use the MS-MPI, INTEL-MPI or IBM Platform MPI for your distributed solving. If you are not sure please contact your local ANSYS® Corporate Software Sales or ANSYS® Software Channel Partner that was assigned specifically to you and/or your company.
References:
http://www.ansys.com/Solutions/Solutions-by-Role/IT-Professionals/Platform-Support/Benchmarks-Overview/ANSYS-Mechanical-Benchmarks
You must be logged in to post a comment.