深度学习大数据分析中英文外文文献翻译.docx

上传人:b****1 文档编号:164949 上传时间:2022-10-04 格式:DOCX 页数:18 大小:21.89KB
下载 相关 举报
深度学习大数据分析中英文外文文献翻译.docx_第1页
第1页 / 共18页
深度学习大数据分析中英文外文文献翻译.docx_第2页
第2页 / 共18页
深度学习大数据分析中英文外文文献翻译.docx_第3页
第3页 / 共18页
深度学习大数据分析中英文外文文献翻译.docx_第4页
第4页 / 共18页
深度学习大数据分析中英文外文文献翻译.docx_第5页
第5页 / 共18页
点击查看更多>>
下载资源
资源描述

深度学习大数据分析中英文外文文献翻译.docx

《深度学习大数据分析中英文外文文献翻译.docx》由会员分享,可在线阅读,更多相关《深度学习大数据分析中英文外文文献翻译.docx(18页珍藏版)》请在冰豆网上搜索。

深度学习大数据分析中英文外文文献翻译.docx

本科毕业设计(论文)

中英文对照翻译

(此文档为word格式,下载后您可任意修改编辑!

标题:

PrototypingaGPGPUNeuralNetworkforDeep-LearningBigDataAnalysis

作者:

AlcidesFonseca, BrunoCabral

期刊:

BigDataResearch,卷8:

50-56页年份:

2017

原文

PrototypingaGPGPUNeuralNetworkforDeep-LearningBigDataAnalysis

AlcidesFonseca, BrunoCabral

Abstract

BigDataconcernswithlarge-volumecomplexgrowingdata.Giventhefastdevelopmentofdatastorageandnetwork,organizationsarecollectinglargeever-growingdatasetsthatcanhaveusefulinformation.Inordertoextractinformationfromthesedatasetswithinusefultime,itisimportanttousedistributedandparallelalgorithms.Onecommonusageofbigdataismachinelearning,inwhichcollecteddataisusedtopredictfuturebehavior.Deep-LearningusingArtificialNeuralNetworksisoneofthepopularmethodsforextractinginformationfromcomplexdatasets.Deep-learningiscapableofmorecreatingcomplexmodelsthantraditionalprobabilisticmachinelearningtechniques.

Thisworkpresentsastep-by-stepguideonhowtoprototypeaDeep-LearningapplicationthatexecutesbothonGPUandCPUclusters.PythonandRedisarethecoresupportingtoolsofthisguide.ThistutorialwillallowthereadertounderstandthebasicsofbuildingadistributedhighperformanceGPUapplicationinafewhours.Sincewedonotdependonanydeep-learningapplicationorframework—weuselow-levelbuilding

blocks—thistutorialcanbeadjustedforanyotherparallelalgorithmthereadermightwanttoprototypeonBigData.Finally,wewilldiscusshowtomovefromaprototypetoafullyblownproductionapplication.

Keywords:

Big-data; Deep-learning; Prototyping; GPGPU; Cluster; Parallelprogramming

Introduction

DeepLearningreferstotheusageofArtificialNeuralNetworks(ANNorNN)withseveralhiddenlayersusedfordatawithahighdimensionality.AcommonexampleandbenchmarkforDeepLearningisimageclassificationfromtheImageNetdataset.ANNscanbeusedforclassificationtasks,withseveralapplicationsinindustry,businessandscience.Examplesofapplicationsincludecharacterrecognitioninscanneddocuments,predictingbankruptcyorhealthcomplications.AutonomousdrivingalsomakesheavyuseofANNs.AnANNbeginswithrandomweights,practicallydecidingeverythingatrandom.BytrainingtheANNwithseveralexistinginstancesoftheproblem,onecanevaluatetheerrorproduced.Weightsarethenadjusted,takingintoaccountifitoverlyorunderlyestimatedthefinalvalue.

Inordertopredictvalues,ANNsarebuiltconnectinglayersofneurons.ANNsusethefirstlayerofneuronsforeachinputfeature,andthe

finallayerfortheclassificationoutput.Fig.1showsanexampleofanANNwithfourinputneurons,fourneuronsinthehiddenlayerandtwooutputneurons.Allneuronsinonelayerareconnectedtoalltheneuronsinthefollowinglayer.

Whenthenumberoffeaturesincreases(highdimensionality),thenumberofneuronsinthehiddenlayersincreasesaswell,inordertocompensateforthepossibleinteractionsofinputneurons.However,aruleofthumbistouseonlyonehiddenlayerwiththesamenumberofhiddenneuronsasthereareinputneurons.ThesecondscalabilityissuewithANNsisthatforahighaccuracy,theyhavetobetrainedwithalargedataset.Typically,toachieveagoodaccuracyscore,thenumberofinstancesshouldbethreeordersofmagnitudehigherthanthenumberoffeatures.Thus,wereachapointinwhichweneedtotrainanANNoverseveraliterations,usingahighnumberoffeaturesandinstances.Intheseconditions,traininganANNbecomesacomputationallyintensiveoperation,highlydemandingintermsofprocessing,memoryanddiskusage.Astheamountofdataavailablefortraininggoesaboveaterabyte,itbecomesBigDataproblem.

ThesolutionforBigDataprocessingistodistributethecomputationacrossdifferentmachines,splittingdataamongthemandmergingresultsafterwards.InMap-Reduceapproaches,itispossibletodividethecomputationintoindependentsub-problemsthatcanbecombinedto

produceafinalresult.HadoopandSparkarethemostusedframeworksforBigDataprocessing.

ANNsaredescribedbythefollowingcharacteristics:

layout(thenumberoflayersandneuronsoneachlayer)andtheweightsofconnectionsbetweenneurons(thesecondattributeisdependentonthefirst).WhentraininganANNforaspecificproblemdataset,theweightsarebeingadjustedtominimizetheoutputerror.BecausethepredictionofANNscanbedescribedasmatrixoperations(wearemultiplyingthesameweightstoaeachrowoffeaturesofprobleminstances),graphicalprocessingunits(GPUs)areusuallyagoodsolutionforimprovingperformanceandreducetrainingtimes.GPUsweredesignedtoperformmatrixoperationsinthecontextofvideoprocessing,buthave

展开阅读全文
相关资源
猜你喜欢
相关搜索

当前位置:首页 > 党团工作 > 党团建设

copyright@ 2008-2022 冰豆网网站版权所有

经营许可证编号:鄂ICP备2022015515号-1