搜索引擎中英文对照外文翻译文献.docx
《搜索引擎中英文对照外文翻译文献.docx》由会员分享,可在线阅读,更多相关《搜索引擎中英文对照外文翻译文献.docx(33页珍藏版)》请在冰豆网上搜索。
搜索引擎中英文对照外文翻译文献
中英文资料对照外文翻译
InvestigatingtheQueryingandBrowsingBehaviorof
AdvancedSearchEngineUsers
BSTRACT
OnewaytohelpallusersofcommercialWebsearchenginesbemoresuccessfulintheirsearchesistobetterunderstandwhatthoseuserswithgreatersearchexpertisearedoing,andusethisknowledgetobenefiteveryone.Inthispaperwestudytheinteractionlogsofadvancedsearchengineusers(andthosenotsoadvanced)tobetterunderstandhowtheseusergroupssearch.Theresultsshowthattherearemarkeddifferencesinthequeries,resultclicks,post-querybrowsing,andsearchsuccessofusersweclassifyasadvanced(basedontheiruseofqueryoperators),relativetothoseclassifiedasnon-advanced.Ourfindingshaveimplicationsforhowadvancedusersshouldbesupportedduringtheirsearches,andhowtheirinteractionscouldbeusedtohelpsearchersofallexperiencelevelsfindmorerelevantinformationandlearnimprovedsearchingstrategies.
CategoriesandSubjectDescriptors
H.3.3[InformationSearchandRetrieval]:
queryformulation,searchprocess,relevancefeedback.
GeneralTerms
Experimentation,HumanFactors.
Keywords
Querysyntax,advancedsearchfeatures,expertsearching.
1.INTRODUCTION
TheformulationofquerystatementsthatcaptureboththesalientaspectsofinformationneedsandaremeaningfultoInformationRetrieval(IR)systemsposesachallengeformanysearchers[3].CommercialWebsearchenginessuchasGoogle,Yahoo!
andWindowsLiveSearchofferuserstheabilitytoimprovethequalityoftheirqueriesusingqueryoperatorssuchasquotationmarks,plusandminussigns,andmodifiersthatrestrictthesearchtoaparticularsiteortypeoffile.Thesetechniquescanbeusefulinimprovingresultprecisionyet,otherthanvialoganalyses(e.g.,[15][27]),theyhavegenerallybeenoverlookedbytheresearchcommunityinattemptstoimprovethequalityofsearchresults.
IRresearchhasgenerallyfocusedonalternativewaysforuserstospecifytheirneedsratherthanincreasingtheuptakeofadvancedsyntax.Researchonpracticaltechniquestosupplementexisting
Permissiontomakedigitalorhardcopiesofallorpartofthisworkforpersonalorclassroomuseisgrantedwithoutfeeprovidedthatcopiesarenotmadeordistributedforprofitorcommercialadvantageandthatcopiesbearthisnoticeandthefullcitationonthefirstpage.Tocopyotherwise,orrepublish,topostonserversortoredistributetolists,requirespriorspecificpermissionand/orafee.
SIGIR’07,July23–27,2007,Ámsterdam,TheNetherlands.Copyright2007ACM978-1-59593-597-7/07/0007...$5.00.searchtechnologyandsupportusershasbeenintensifyinginrecentyears(e.g.[18][34]).However,itischallengingtoimplementsuchtechniquesatlargescalewithtolerablelatencies.
TypicalqueriessubmittedtoWebsearchenginestaketheformofaseriesoftokensseparatedbyspaces.ThereisgenerallyanimpliedBooleanANDoperatorbetweentokensthatrestrictssearchresultstodocumentscontainingallqueryterms.DeLimaandPedersen[7]investigatedtheeffectofparsing,phraserecognition,andexpansiononWebsearchqueries.TheyshowedthattheautomaticrecognitionofphrasesinqueriescanimproveresultprecisioninWebsearch.However,thevalueofadvancedsyntaxfortypicalsearchershasgenerallybeenlimited,sincemostusersdonotknowaboutadvancedsyntaxordonotunderstandhowtouseit[15].Sinceitappearsoperatorscanhelpretrieverelevantdocuments,furtherinvestigationoftheiruseiswarranted.
Inthispaperweexploretheuseofqueryoperatorsinmoredetailandproposealternativeapplicationsthatdonotrequirealluserstouseadvancedsyntaxexplicitly.Wehypothesizethatsearcherswhouseadvancedquerysyntaxdemonstrateadegreeofsearchexpertisethatthemajorityoftheuserpopulationdoesnot;anassertionsupportedbypreviousresearch[13].Studyingthebehavioroftheseadvancedsearchengineusersmayyieldimportantinsightsaboutsearchingandresultbrowsingfromwhichothersmaybenefit.
Usinglogsgatheredfromalargenumberofconsentingusers,weinvestigatedifferencesbetweenthesearchbehaviorofthosethatuseadvancedsyntaxandthosethatdonot,anddifferencesintheinformationthoseuserstarget.Weareinterestedinansweringthreeresearchquestions:
(i)Istherearelationshipbetweentheuseofadvancedsyntaxandothercharacteristicsofasearch?
(ii)Istherearelationshipbetweentheuseofadvancedsyntaxandpost-querynavigationbehaviors?
(iii)Istherearelationshipbetweentheuseofadvancedsyntaxandmeasuresofsearchsuccess?
Throughanexperimentalstudyandanalysis,weofferpotentialanswersforeachofthesequestions.Arelationshipbetweentheuseofadvancedsyntaxandanyofthesefeaturescouldsupportthedesignofsystemstailoredtoadvancedsearchengineusers,oruseadvancedusers’interactionstohelpnon-advancedusersbemoresuccessfulintheirsearches.
WedescriberelatedworkinSection2,thedataweusedinthislog-basedstudyinSection3,thesearchcharacteristicsonwhichwefocusouranalysisinSection4,andthefindingsofthisanalysisinSection5.InSection6wediscusstheimplicationsofthisresearch,andweconcludeinSection7.
2.RELATEDWORK
Factorssuchaslackofdomainknowledge,poorunderstandingofthedocumentcollectionbeingsearched,andapoorlydevelopedinformationneedcanallinfluencethequalityofthequeriesthatuserssubmittoIRsystems([24],[28]).Therehasbeenavarietyofresearchintodifferentwaysofhelpingusersspecifytheirinformationneedsmoreeffectively.Belkinetal.[4]experimentedwithprovidingadditionalspaceforuserstotypeamoreverbosedescriptionoftheirinformationneeds.AsimilarapproachwasattemptedbyKellyetal.[18],whousedclarificationformstoelicitadditionalinformationaboutthesearchcontextfromusers.Theseapproacheshavebeenshowntobeeffectiveinbest-matchretrievalsystemswherelongerqueriesgenerallyleadtomorerelevantsearchresults[4].However,inWebsearch,wheremanyofthesystemsarebasedonanextendedBooleanretrievalmodel,longerqueriesmayactuallyhurtretrievalperformance,leadingtoasmallnumberofpotentiallyirrelevantresultsbeingretrieved.Itisnotsimplysufficienttorequestmoreinformationfromusers;thisinformationmustbeofbetterquality.
RelevanceFeedback(RF)[22]andinteractivequeryexpansion[9]arepopulartechniquesthathavebeenusedtoimprovethequalityofinformationthatusersprovidetoIRsystemsregardingtheirinformationneeds.InthecaseofRF,theuserpresentsthesystemwithexamplesofrelevantinformationthatarethenusedtoformulateanimprovedqueryorretrieveanewsetofdocuments.IthasprovendifficulttogetuserstouseRFintheWebdomainduetodifficultyinconveyingthemeaningandthebenefitofRFtotypicalusers[17].Querysuggestionsofferedbasedonquerylogshavethepotentialtoimproveretrievalperformancewithlimiteduserburden.Thisapproachislimitedtore-executingpopularqueries,andsearchersoftenignorethesuggestionspresentedtothem[1].Inaddition,bothofthesetechniquesdonothelpuserslearntoproducemoreeffectivequeries.
Mostcommercialsearchenginesprovideadvancedquerysyntaxthatallowsuserstospecifytheirinformationneedsinmoredetail.Querymodifierssuchas‘+’(plus),‘’(minus),and‘“”’(doublequotes)canbeusedtoemphasize,deemphasize,andgroupqueryterms.Booleanoperators(AND,OR,andNOT)canjointermsandphrases,andmodifierssuchas“site:
”and“link:
”canbeusedtorestrictthesearchspace.Queriescreatedwiththesetechniquescanbepowerful.However,thisfunctionalityisoftenhiddenfromtheimmediateviewofthesearcher,andunlesssheknowsthesyntax,shemustusetextfields,pull-downmenusandcomboboxesavailableviaadedicated“advancedsearch”interfacetoaccessthesefeatures.
Log-basedanalysisofusers’interactionswiththeExciteandAltaVistasearchengineshasshownthatonly10-20%ofqueriescontainedanyadvancedsyntax[14][25].ThisanalysiscanbeausefulwayofcapturingcharacteristicsofusersinteractingwithIRsystems.Researchinusermodeling[6]andpersonalization[30]hasshownthatgatheringmoreinformationaboutuserscanimprovetheeffectivenessofsearches,butrequiremoreinformationaboutusersthanistypicallyavailablefrominteractionlogsalone.Unlesscoupledwithaqualitativetechnique,suchasapost-sessionquestionnaire[23],itcanbedifficulttoassociateinteractionswithusercharacteristics.Inourstudyweconjecturethatgiventhedifficultyinlocatingadvancedsearchfeatureswithinthetypicalsearchinterface,andthepotentialproblemsinunderstandingthesyntax,thoseusersthatdouseadvancedsyntaxregularlyrepresentadistinctclassofsearcherswhowillexhibitothercommonsearchbehaviors.
Otherstudiesofadvancedsearchers’searchbehaviorshaveattemptedtobetterunderstandthestrategicknowledgetheyhaveacquired.However,suchstudiesaregenerallylimitedinsize(e.g.,[13][19])orfocusondomainexpertiseinareassuchashealthcareore-commerce(e.g.,[5]).Nonetheless,theycangivevaluableinsightaboutthebehaviorsofuserswithdomain,system,orsearchexpertisethatexceedsthatoftheaverageuser.Queryingbehaviorinparticularhasbeenstudiedextensivelytobetterunderstandusers[31]andsupportotherusers[16].
Inthispaperwestudyothersearchcharacteristicsofusersofadvancedsyntaxinanattempttodeterminewhetherthereisanythingdifferentabouthowthesesearchengineuserssearch,andwhethertheirsearchescanbeusedto