小程序中英文外文文献翻译.docx

上传人:b****9 文档编号:137040 上传时间:2022-10-04 格式:DOCX 页数:20 大小:22.75KB
下载 相关 举报
小程序中英文外文文献翻译.docx_第1页
第1页 / 共20页
小程序中英文外文文献翻译.docx_第2页
第2页 / 共20页
小程序中英文外文文献翻译.docx_第3页
第3页 / 共20页
小程序中英文外文文献翻译.docx_第4页
第4页 / 共20页
小程序中英文外文文献翻译.docx_第5页
第5页 / 共20页
点击查看更多>>
下载资源
资源描述

小程序中英文外文文献翻译.docx

《小程序中英文外文文献翻译.docx》由会员分享,可在线阅读,更多相关《小程序中英文外文文献翻译.docx(20页珍藏版)》请在冰豆网上搜索。

小程序中英文外文文献翻译.docx

本科毕业设计(论文)

中英文对照翻译

(此文档为word格式,下载后您可任意修改编辑!

外文文献翻译原文及译文

标题:

ENHANCINGAPPLICATIONPERFORMANCEUSINGMINI-APPS:

COMPARISONOFHYBRIDPARALLELPROGRAMMINGPARADIGMS

作者:

GaryLawsonMichaelPoteatMashaSosonkinaRobertBaurle

期刊:

ComputerScience

年份:

2016原文

ENHANCINGAPPLICATIONPERFORMANCEUSINGMINI-APPS:

COMPARISONOFHYBRIDPARALLELPROGRAMMINGPARADIGMS

GaryLawsonMichaelPoteatMashaSosonkinaRobertBaurle

ABSTRACT

Inmanyfields,real-worldapplicationsforHighPerformanceComputinghavealreadybeendeveloped.Fortheseapplicationstostayup-to-date,newparallelstrategiesmustbeexploredtoyieldthebestperformance;however,restructuringormodifyingareal-worldapplicationmaybedauntingdependingonthesizeofthecode.Inthiscase,amini-appmaybeemployedtoquicklyexploresuchoptionswithoutmodifyingtheentirecode.Inthiswork,severalmini-appshavebeencreatedtoenhanceareal-worldapplicationperformance,namelytheVULCANcodeforcomplexflowanalysisdevelopedattheNASALangleyResearchCenter.Thesemini-appsexplorehybridparallelprogrammingparadigmswithMessagePassingInterface(MPI)fordistributedmemoryaccessandeitherSharedMPI(SMPI)orOpenMPforsharedmemoryaccesses.PerformancetestingshowsthatMPI+SMPIyieldsthebestexecutionperformance,whilerequiringthelargestnumberofcodechanges.Amaximumspeedupof

23wasmeasuredforMPI+SMPI,butonly10wasmeasuredforMPI+OpenMP.Keywords:

Mini-apps,Performance,VULCAN,Shared

Memory,MPI,OpenMP1INTRODUCTION

Inmanyfields,real-worldapplicationshavealreadybeendeveloped.Forestablishedapplicationstostayup-to-date,newparallelstrategiesmustbeexploredtodeterminewhichmayyieldthebestperformance,especiallywithadvancesincomputinghardware.However,restructuringormodifyingareal-worldapplicationincursincreasedcostdependingonthesizeofthecodeandchangestobemade.Amini-appmaybecreatedtoquicklyexploresuchoptionswithoutmodifyingtheentirecode.Mini-appsreducetheoverheadofapplyingnewstrategies,thusvariousstrategiesmaybeimplementedandcompared.Thisworkpresentstheauthorsexperienceswhenfollowingthisstrategyforareal-worldapplicationdevelopedbyNASA.

VULCAN(ViscousUpwindAlgorithmforComplexFlowAnalysis)isaturbulent,noequilibrium,finite-ratechemicalkinetics,Navier-Stokesflowsolverforstructured,cell-centered,multiblockgridsthatismaintainedanddistributedbytheHypersonicAirBreathingPropulsionBranchoftheNASALangleyResearchCenter(NASA2016).Themini-appdevelopedinthisworkusestheHouseholderReflectorkernelforsolvingsystemsoflinearequations.Thiskernelisusedoftenbydifferentworkloads,andisagoodcandidatetodecidewhatstrategytypetoapply

toVULCAN.VULCANisbuiltonasingle-layerofMPIandthecodehasbeenoptimizedtoobtainperfectvectorization,thereforetwo-levelsofparallelismarecurrentlyused.Thisworkinvestigatestwoflavorsofshared-memoryparallelism,OpenMPandSharedMPI,whichwillprovidethethird-levelofparallelismfortheapplication.Athird-levelofparallelismincreasesperformance,whichdecreasesthetime-to-solution.

MPIhasextendedthestandardtoMPIversion3.0,whichincludestheSharedMemory(SHM)model(MikhailB.(Intel)2015,MessagePassingInterfaceForum2012),knowninthisworkasSharedMPI(SMPI).ThisextensionallowsMPItocreatememorywindowsthataresharedbetweenMPItasksonthesamephysicalnode.Inthisway,MPItasksareequivalenttothreads,exceptSharedMPIismoredifficultforaprogrammertoimplement.OpenMPisthemostcommonshared-memorylibraryusedtodatebecauseofitsease-of-use(OpenMP2016).Inmostcases,onlyafewOpenMPpragmasarerequiredtoparallelizealoop;however,OpenMPissubjecttoincreasedoverhead,whichmaydecreaseperformanceifnotproperlytuned.

Asearlyastheyear2000,theauthorsin(CappelloandEtiemble2000)foundthatlatencysensitivecodesseemtobenefitfrompureMPIimplementationswhereasbandwidthsensitivecodesbenefitfromhybridMPI+OpenMP.Also,theauthorsfoundthatfasterprocessorswillbenefithybridMPI+OpenMPcodesifdatamovementisnotanoverwhelming

bottleneck(CappelloandEtiemble2000).Sincethistime,hybridMPI+OpenMPimplementationshaveimproved,butnotwithoutdifficulties.In(Drosi-nosandKoziris2004,ChorleyandWalker2010),itwasfoundthatOpenMPincursmanyperformancereductions,including:

overhead(fork/join,atomics,etc),falsesharing,imbalancedmessagepassing,andasensitivitytoprocessormapping.However,OpenMPoverheadmaybehiddenwhenusingmorethreads.In(Rabenseifner,Hager,andJost2009),theauthorsfoundthatsimplyusingOpenMPcouldincurper-formancepenaltiesbecausethecompileravo

展开阅读全文
相关资源
猜你喜欢
相关搜索
资源标签

当前位置:首页 > 总结汇报 > 学习总结

copyright@ 2008-2022 冰豆网网站版权所有

经营许可证编号:鄂ICP备2022015515号-1