算流体力学相关解决.ppt
,TESLA,Computational Fluid Dynamics Module,Computational Fluid Dynamics,GPU Perf compared against Multi-core x86 CPU socket,features and may be a kernel to kernel perf comparison,GPU Value to EngineeringComputational Fluid Dynamics,FluiDyna LBultra20 x acceleration with 4 GPUs vs.2 x 6 core CPUs,CPU Intel Xeon X5670 2.93 GHz;GPU Tesla M2070,GPU READY APPLICATIONSAltair AcuSolveAutodesk MoldflowFluiDyna Culises for OpenFOAMFluiDyna LBultraVratis SpeedIT for OpenFOAMPrometech ParticleworksSandia NL and ORNLS3DSD+(SU-Jameson)FEFLO(GMU-Lohner)Turbostream,ANSYS CFD preliminary results of radiation heat transfer view-factor computations on GPUs vs.CPUsRHT on GPUs will release in 14.0 as betaRadiation HT Applications:,NOTE:Growing CPU time of view-factor computations inhibit proper inclusion of radiation HT effects,NOTE:GPU time remains low even as view-factor computations grow very large,ANSYS CFD 14.0 Offers First GPU Capability,Underhood coolingCabin comfort HVACFurnace simulationsSolar loads on buildingsCombustor in turbineElectronics passive cooling,Other ANSYS CFD Evaluations:,Models(e.g.disperse phase)Implicit equation solvers,OpenFOAM on GPUs ISVs FluiDyna and Vratis,Prometech and Particle-based CFD for Multi-GPUs,MPS-based method developed at the University of TokyoProf.Koshizuka Results shown for Particleworks 2.5 released in 2011Performance is relative to 4 cores of Intel i7 CPUContact Prometech for license details,http:/www.prometech.co.jp,Turbostream CFD for Gas Turbine Engines,Turbostream Simulation Speed-up 19x,19x,www.turbostream-|,www.many-core.group.cam.ac.uk/ukgpucc2/talks/Brandvik.pdf|,Sources:,www.hpc.cam.ac.uk/services/darwin.html,University of Cambridge DARWIN Cluster,CUDA Center of Excellence Since 2008GPU sub-cluster:Dell T5500 servers,32 dual-socket CPUsTesla S1070 GPUs,4 GPUs per socketfor total 128 GPUs,Sample Turbostream GPU Simulations,Typical Routine Simulation,Large-scale Simulation,19x Speedup,http:/www.turbostream-,Source:,Sample Turbostream GPU Simulations,Turbostream:CFD for Turbomachinery,Tokyo Institute of Technology AOKI LaboratoryCFD Research on#5 of Top 500 TSUBAME 2.0Simulations that scale to 4000 Fermi GPUsPresentation at Supercomputing 2010 Conference:“Large-scale CFD Applications on TSUBAME 2”Dr.Takayuki Aoki,Global Scientific Information and Computing Center(GSIC)of Tokyo Institute of Technology(Tokyo Tech)http:/,GPU Highlights on CFD Applications From the Top 500,FEFLO:Porting of an Edge-Based CFD Solver to GPUs AIAA-2010-0523 Andrew Corrigan,Ph.D.,Naval Research Lab;Rainald Lohner,Ph.D.,GMUFAST3D:Using GPU on HPC Applications to Satisfy Low Power Computational Requirement AIAA-2010-0524 Gopal Patnaik,Ph.D.,US Naval Research LabOVERFLOW:Rotor Wake Modeling with a Coupled Eulerian and Vortex Particle Method AIAA-2010-0312 Chris Stone,Ph.D.,Intelligent LightSOLAR:Unstructured CFD Solver on GPUs Jamil Appa,Ph.D.,BAE Systems Advanced Technology CentreelsA:Recent Results with elsA on Many-Cores Michel Gazaix and Steve Champagneux,ONERA/Airbus FranceTurbostream:Turbostream:A CFD Solver for Many-Core Processors Tobias Brandvik,Ph.D.,Whittle Lab,University of CambridgeOVERFLOW:Acceleration of a CFD Code with a GPU Dennis Jespersen,NASA Ames Research Center,48th AIAA Aerospace Sciences Meeting|Jan 2010|Orlando,FL,USA,CFD on Future Architectures|Oct 2009|DLR Braunschweig,DE,Parallel CFD 2009|May 2009|NASA Ames,Moffett Field,CA,USA,Published CFD Developments on Tesla GPU,Total 110 technical papers:32 or 30%included GPU-developments,up from 12 papers in 2010(Taipei,TW)and 4 papers in 2009(NASA,US)Included an invited full-day workshop on CUDA and GPUs for CFD Applications attended by more than 100 delegatesGPUs in talks from 6 of 7 plenary speakers:GPU-specific CFD:Aoki Tokyo Inst Tech,Lohner GMU,Barber BU GPU evaluation:Chalot Dassault Aviation,Gonzlez Next Limit,Jgerskpper DLR,GPUs Highlights from ParCFD 2011,23rd ParCFD 2011|16 20 May 2011|Barcelona,ES,GPU ApplicationJameson-developed CFD software SD+for high order method aerodynamic simulations,GPU BenefitUse of 16 x Tesla M2070:15 hrs vs.202 hrs for 16 x Xeon X5670Fast turnaround of complex LES simulations that would otherwise be impractical for CPU-only use,Stanford UniversityAerospace Computing Lab Prof.Antony Jameson,High-Order Aerodynamics Research on GPUs,15 hours on 16 x M2070s202 hours(one week)on 16 Xeon x5670 CPUs,Transitional flow over SD70053 airfoil,21M DOF,Ma=.2,Re=60K,AoA=4,4th order,400K RK iters,GPU ApplicationSJTU-developed CFD software NUS3D for aerodynamic simulations of wing shapes,GPU BenefitUse of Tesla C2070:20 x 37x vs.single core Intel core i7 CPUFaster simulations for more wing design candidates vs.wind tunnel testingExpanding to multi-GPU and full aircraft,COMAC and SJTUCommercial Aircraft Corporation of China,COMAC Wing Candidate,ONERA M6 WingCFD Simulation,Commercial Aircraft Wing Design on GPUs,GPU ApplicationBAE-developed CFD software Veloxi for aerodynamic simulation of aircraft,GPU BenefitUse of 2 x Tesla C2050:15x vs.QC Intel i7 CPUFaster simulations enabled design exploration of full aerodynamic envelope,GPU Speed-upvs.Multi-core,BAE SystemsTechnology and Engineering Services,Defense Aerodynamic Applications on GPUs,GPU ApplicationNRL-developed CFD software JENRE for simulation of jet engine acoustics,GPU BenefitUse of Tesla M2070:3x vs.Hex core Intel(Westmere)CPUMore detailed mesh simulations possible for longer durations of jet engine transient conditions,U.S.DoD Naval Research LabLab for Computational Physics and Fluid Dynamics,Fighter Jet Engine Noise Reduction on GPUs,GPU ApplicationEM Photonics-developed CFD software for unsteady aerodynamic simulations,GPU BenefitUse of Tesla C2070:54x for CFD kernel vs.8 core Intel i7 CPUsFast turnaround of simulations enables more flight conditions and aircraft approach directions,NAVAIR and EM PhotonicsU.S.DoD Naval Air Weapons Center,Pax River MD,Aircraft Carrier Landing Effects using GPUs,54xfaster with 1 GPU,