Information Management System: Chinese-English Translation
Foreign-language source material:
Information management system
Wiliam K. Thomson, U.S.A.
Abstract: An information storage, searching and retrieval system for large (gigabytes) domains of archived textual data. The system includes multiple query generation processes, a search process, and a presentation of search results that is sorted by category or type and that may be customized based on the professional discipline (or analogous personal characteristic) of the user, thereby reducing the amount of time and cost required to retrieve relevant results.
Keywords: Information management; Retrieval system; Object-oriented
1. INTRODUCTION
This invention relates to an information storage, searching and retrieval system that incorporates a novel organization for presentation of search results from large (gigabytes) domains of archived textual data.
2. BACKGROUND OF THE INVENTION
On-line information retrieval systems are utilized for searching and retrieving many kinds of information. Most systems used today work in essentially the same manner; that is, users log on (through a computer terminal or personal microcomputer, and typically from a remote location), select a source of information (i.e., a particular database), which is usually something less than the complete domain, formulate a query, launch the search, and then review the search results displayed on the terminal or microcomputer, typically with documents (or summaries of documents) displayed in reverse chronological order. This process must be repeated each time another source (database) or group of sources is selected (which is frequently necessary in order to ensure all relevant documents have been found).
Additionally, this process places on the user the burden of organizing and assimilating the multiple results generated from the launch of the same query in each of the multiple sources (databases) that the user needs (or wants) to search. Present systems that allow searching of large domains require persons seeking information in these domains to attempt to modify their queries to reduce the search results to a size that the user can assimilate by browsing through them (thus potentially eliminating relevant results). In many cases end users have been forced to use an intermediary (i.e., a professional searcher) because the current collections of sources are both complex and extensive, and effective search strategies often vary significantly from one source to another. Even with such guidance, potentially relevant answers are missed because not all potentially relevant databases or information sources are searched on every query.
Much effort has been expended on refining and improving source selection by grouping sources or database files together. Significant effort has also been expended on query formulation through the use of knowledge bases and natural language processing. However, as the groupings of sources become larger, and the responses to more comprehensive search queries become more complete, the person seeking information is often faced with the daunting task of sifting through large unorganized answer sets in an attempt to find the most relevant documents or information.
3. SUMMARY OF THE INVENTION
The invention provides an information storage, searching and retrieval system for a large domain of archived data of various types, in which the results of a search are organized into discrete types of documents and groups of document types so that users may identify relevant information more efficiently and more conveniently than with systems currently in use. The system of the invention includes means for storing a large domain of data contained in multiple source records, at least some of the source records being comprised of individual documents of multiple document types; means for searching substantially all of the domain with a single search query to identify documents responsive to the query;
and means for categorizing documents responsive to the query based on document type, including means for generating a summary of the number of documents responsive to the query which fall within various predetermined categories of document types.
The query generation process may contain a knowledge base including a thesaurus that has predetermined and embedded complex search queries, or use natural language processing, fuzzy logic, tree structures, hierarchical relationships, or a set of commands that allow persons seeking information to formulate their queries.
The search process can utilize any index and search engine techniques, including Boolean, vector, and probabilistic, as long as a substantial portion of the entire domain of archived textual data is searched for each query and all documents found are returned to the organizing process.
The sorting/categorization process prepares the search results for presentation by assembling the various document types retrieved by the search engine and then arranging these basic document types into sometimes broader categories that are readily understood by and relevant to the user. The search results are then presented to the user
and arranged by category, along with an indication of the number of relevant documents found in each category. The user may then examine search results in multiple formats, allowing the user to view as much of the document as the user deems necessary.
4. BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a block diagram illustrating an information retrieval system of the invention; FIG. 2 is a diagram illustrating a query formulation and search process utilized in the invention; FIG. 3 is a diagram illustrating a sorting process for organizing and presenting search results.
5. BEST MODE FOR CARRYING OUT THE INVENTION
As is illustrated in the block diagram of FIG. 1, the information retrieval system of the invention includes an input/output process, a query generation process, a search process that involves a large domain of textual data (typically in the multiple gigabyte range), an organizing
process, presentation of the information to the user, and a process to identify and characterize the types of documents contained in the large domain of data.
Turning now to FIG. 2, the query generation process preferably includes a knowledge base containing a thesaurus and a note pad, and preferably utilizes embedded predefined complex Boolean strategies. Such a system allows users to enter their description of the information needed using simple natural-language words and phrases, and to rely on the system to assist in generating the full search query, which would include, e.g., synonyms and alternate phraseology. The user can then request, by a command such as VI CO 1, to view the complete document selected from the list, giving, in this case, complete information about the identity and credentials of the expert.
FIG. 3 illustrates how five typical sources of information (i.e.,
source records) can be sorted into many document types and then subsequently into categories. For example, a typical trade magazine may contain several types of information such as editorials, regular columns, feature articles, news, product announcements, and a calendar of events. Thus, the trade magazine (i.e., the source record) may be sorted into these various document types, and these document types in turn may be categorized or grouped into categories contained in one or more sets of categories; each document type typically will be sorted into one category within a set of categories, but the individual categories within each set will vary from one set to another. For example, one set of categories may be established for a first characteristic type of user, and a different set of categories may be established for a second characteristic type of user. When a user corresponding to type #1 executes a search, the system automatically utilizes the categories of set #1, corresponding to that particular type of user, in organizing the results of the search for review by the user. When a user of type #2 executes a search, however, the system automatically utilizes the categories of
set #2 in presenting the search results to the user.
The information storage, searching and retrieval system of the invention resolves the common difficulties in typical on-line information retrieval systems that operate on large (e.g., 2 gigabytes or more) domains of textual data: query generation, source selection, and organizing search results. The information base with the thesaurus and embedded search strategies allows users to generate expert search queries in their own natural language. Source (i.e., database) selection is not an issue because the search engines are capable of searching
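substantially the entire domain on every query. Moreover, the unique presentation of search results by category set substantially reduces the time and cost of performing repetitive searches in multiple databases and therefore of efficiently retrieving relevant search results.
While a preferred embodiment of the present invention has been described, it should be understood that various changes, adaptations and modifications may be made therein without departing from the spirit of the invention and the scope of the appended claims.
Chinese translation:
Information management system
Wiliam K. Thomson, U.S.A.
Abstract: An information storage, query and retrieval system intended mainly for large (gigabyte-scale) domains of text that need to be archived. The system includes multiple query generation processes and a search process. Query results are generally sorted by category and type, and the retrieval fields are chosen by the individual; during a query, several related pieces of information (or a similar description of the user's personal characteristics) may be viewed on the basis of the search results, which reduces the time and cost required to obtain search results.
Keywords: Information management; Retrieval system; Object-oriented
1. Introduction
The information storage, query and retrieval system is applied mainly to documents whose source data is relatively large; using search conditions and index fields, results can be queried quickly.
2. Development background
On-line query systems are used mainly to query and retrieve all kinds of on-line information. Most systems in use today work in essentially the same way. That is, the user logs on (through a computer terminal or personal microcomputer, or remotely), selects an information source (for example, a particular database), usually with somewhat incomplete search conditions, begins the query, launches the search, and the results are then displayed on the computer terminal or personal microcomputer, generally in chronological order. During the query process, each data source or group of data sources is queried over and over; this repetition is necessary to ensure that all relevant documents are found. In addition, the query process places a certain burden on the user, who must organize and summarize the multiple results returned from the same data source. Current systems can search large volumes of data, but in doing so they require people seeking information to try to modify their query conditions in order to cut down unnecessary search results (eliminating potentially relevant results), so that users find the data they are actually looking for. In many cases users are forced to use an intermediary (for example, a professional search engine), because the current collections of sources are complex and extensive, and effective search strategies often from one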
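The query generation, search, and categorization steps described in the summary of the invention can be illustrated with a minimal Python sketch. It is not the system's actual implementation: the thesaurus entries, the two user types ("engineer" and "marketer"), the category names, and the sample records are all hypothetical, and a simple keyword match stands in for whichever Boolean, vector, or probabilistic engine is used. The sketch only shows the flow: expand the user's terms through a thesaurus, search the whole domain with one query, then count the matching documents per category using the category set for the user's type.

```python
from collections import Counter

# Hypothetical thesaurus: maps a user term to synonyms added to the query.
THESAURUS = {"car": {"automobile", "vehicle"}}

# Hypothetical category sets, one per characteristic type of user.
# Each set maps a document type to a category shown to that user.
CATEGORY_SETS = {
    "engineer": {
        "feature article": "Technical articles",
        "product announcement": "Products",
        "news": "Industry news",
        "editorial": "Opinion",
    },
    "marketer": {
        "feature article": "Background reading",
        "product announcement": "Competitor products",
        "news": "Market news",
        "editorial": "Market news",
    },
}

# Hypothetical domain: documents drawn from several source records,
# each already tagged with its document type.
DOMAIN = [
    {"source": "Trade Magazine A", "doc_type": "news",
     "text": "New vehicle plant opens"},
    {"source": "Trade Magazine A", "doc_type": "editorial",
     "text": "Why the automobile market is changing"},
    {"source": "Trade Magazine B", "doc_type": "product announcement",
     "text": "Compact car released"},
    {"source": "Trade Magazine B", "doc_type": "feature article",
     "text": "Advances in battery chemistry"},
]


def expand_query(terms):
    """Add thesaurus synonyms to the user's plain-language terms."""
    expanded = set()
    for term in terms:
        key = term.lower()
        expanded.add(key)
        expanded |= THESAURUS.get(key, set())
    return expanded


def search(domain, terms):
    """Search the entire domain with one query; return every match."""
    expanded = expand_query(terms)
    return [doc for doc in domain
            if expanded & set(doc["text"].lower().split())]


def summarize(results, user_type):
    """Count matching documents per category for this type of user."""
    mapping = CATEGORY_SETS[user_type]
    return Counter(mapping.get(doc["doc_type"], "Other") for doc in results)


results = search(DOMAIN, ["car"])
for user_type in CATEGORY_SETS:
    print(user_type, dict(summarize(results, user_type)))
```

With this sample data, the single query "car" matches three documents across both hypothetical magazines. The same result set is then summarized differently for each user type, which is the point of keeping the category sets separate from the search step.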