计算机专业文献翻译面向数字图书馆的海量信息管理体系结构研究Word格式.docx

上传人:b****1 文档编号:12943298 上传时间:2022-09-30 格式:DOCX 页数:14 大小:25.82KB
下载 相关 举报
计算机专业文献翻译面向数字图书馆的海量信息管理体系结构研究Word格式.docx_第1页
第1页 / 共14页
计算机专业文献翻译面向数字图书馆的海量信息管理体系结构研究Word格式.docx_第2页
第2页 / 共14页
计算机专业文献翻译面向数字图书馆的海量信息管理体系结构研究Word格式.docx_第3页
第3页 / 共14页
计算机专业文献翻译面向数字图书馆的海量信息管理体系结构研究Word格式.docx_第4页
第4页 / 共14页
计算机专业文献翻译面向数字图书馆的海量信息管理体系结构研究Word格式.docx_第5页
第5页 / 共14页
点击查看更多>>
下载资源
资源描述

计算机专业文献翻译面向数字图书馆的海量信息管理体系结构研究Word格式.docx

《计算机专业文献翻译面向数字图书馆的海量信息管理体系结构研究Word格式.docx》由会员分享,可在线阅读,更多相关《计算机专业文献翻译面向数字图书馆的海量信息管理体系结构研究Word格式.docx(14页珍藏版)》请在冰豆网上搜索。

计算机专业文献翻译面向数字图书馆的海量信息管理体系结构研究Word格式.docx

(DepartmentofComputerScienceandTechnology,TsinghuaUniversity,Beijing100084,China)

XINGChun-Xiao+,ZENGChun,LIChao,ZHOULi-Zhu

Abstract

Thispaperinvestigatesthechallengingissuesandtechnologiesinmanagingverylargedigitalcontentsandcollections,andgivesanoverviewoftheworksandenablingtechnologiesintherelatedareas.Basedontheanalysisandcomparisonoftherelatedwork,anovelarchitectureofmassiveinformationmanagementfordigitallibraryisdesigned.Thekeycomponentsandcoreservicesaredescribedindetail.Finally,acasestudyTHADL(TsinghuaUniversityarchitecturedigitallibrary)thatcomplieswiththearchitecturalframeworkispresented.

Keywords:

digitallibrary;

architecture;

massiveinformationmanagement;

interoperability;

metadata

1Introducn

Intherecordedhitiostoryofhumanbeing,theprintedmaterialsusedtoplayadominantroleinthepreservationandpervasionofhumaninformationandknowledge.However,withtherapiddevelopmentoftechnologiesincomputer,communication,multimediaandstorage,thisroleisgivingawaytothedigitalresourcesinthenewera.Theexplosivegrowthofinformationindigitalformshasposedchallengesnotonlytotraditionalarchivesandtheirinformationproviders,butalsotoorganizationsinthegovernment,commercialandnon-profitsectors.AccordingtothelatestreportbyLymanandVarian,theworld’stotalyearlyproductionofprint,film,optical,andmagneticcontentwouldrequireroughly1.5billiongigabytesofstoragewhichisroughly250megabytesforeverypersonontheearth.Printeddocumentsofallkindscompriseonly0.003%ofthetotal.Magneticstorageisbyfarthelargestmediumforstoringinformationandisthemostrapidlygrowingsection,withashippedharddrivecapacitydoublingeveryyear.Thetypesofdigitalresourcesarediverse.Theyincludedigitaltexts,documents,scientificdata,images,animation,video,audioetc.Theapplicationsofthedigitalresourcesarequitebroad,includingDL(digitallibrary),movie/videocenter,otherpublicmedia(television,broadcast,newspaper,etc.),museum,andnationalorcooperativeinformationcenter.Atthesametimetheinformationhighway,whichisrepresentedbyInternet,hasbeenanimportanttoolofthepervasionofdigitalresources.Thegovernments,companies,groups,researchinstitutes,non-governmentorganizations,educationinstitutesallovertheworldputmassiveinformationontheWeb.

Technologychallengesandkeyissues

Thesemassivedigitalresourcespresentmanychallengingissuesindatamanagementtechnologyarea.Thefollowingaresomeexamples.

(1)Datamodel.

Traditionaldatamodeltheoriesareonlyapplicabletostructureddata,butnotforthemassivedigitalresourcesofvarioustypesandtheyaremostlysemi-structuredorunstructured.Thus,newdatamodelsaredemanded.

(2)Systemarchitecture.

Traditionaldatabasemanagementsystemsaredesignedforbusinessdataprocessingfeaturedbyconcurrent,short,andupdatetransactions.Thereforetransactionmanagementandconcurrentcontrolremainsasthecenterofsystemarchitecture.Thearchitectureisnotsuitableforthemanagementofdigitalresourcesasclassicaltransactionconceptisbecominglessimportantintheseresources.Weneedtopursuenovelanduniversalframeworksformassivedigitalresourcesmanagement.

(3)Massiveinformationstorage.

Thevolumeofdigitaldataresourcesiscountedbyterabytesorpetabytes.TraditionalstoragedevicesusingSCSIcannotworkforefficientstorage,onlinemigrationandpersistentarchiveofsuchmassivedigitalresources.Sotheresearchofmulti-levelstoragesystems,SAN(StorageAreaNetworks)andothertechnologyareinevitable.

(4)Queryprocessing.

Intraditionaldatabasesystems,queriesareexpressedinquerylanguagesuchasSQL,butinthequeryandsearchofmassivedigitalresources,manynewmechanismsshouldbeused,suchaskeywordsearch,full-textsearch,similarityquery,andcontent-basedmultimediaretrieval.Howtointegratethequerymethods(includingSQL,OQL,anddifferentXMLquerylanguages,e.g.,XQL,XML-QL,XML-GL)efficientlytobuildanefficientandflexiblequeryprocessingmethodhasnotbeensatisfactorilysolvedyet.

Tosolvetheproblemsmentionedabovewillremainasamajorgoaltoresearchersinthenextfewyears.Tofulfillthisend,wepresentanovelarchitectureformassiveinformationmanagementofdigitalresourcesinthispaper.Thisarchitectureisintendedtomeettherequirementsofmanagingdigitalresourcescharacterizedbydistributed,dynamic,massiveandheterogeneousproperties.

2OverviewoftheRelatedWork

TheIEEESTD610.12[2]definesarchitectureasthestructureofcomponents,theirrelat

展开阅读全文
相关资源
猜你喜欢
相关搜索

当前位置:首页 > 医药卫生 > 中医中药

copyright@ 2008-2022 冰豆网网站版权所有

经营许可证编号:鄂ICP备2022015515号-1