Introduction to Information Retrieval

Introduction to Information Retrieval

PDF Introduction to Information Retrieval Download

  • Author: Christopher D. Manning
  • Publisher: Cambridge University Press
  • ISBN: 1139472100
  • Category : Computers
  • Languages : en
  • Pages :

Class-tested and coherent, this textbook teaches classical and web information retrieval, including web search and the related areas of text classification and text clustering from basic concepts. It gives an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections. All the important ideas are explained using examples and figures, making it perfect for introductory courses in information retrieval for advanced undergraduates and graduate students in computer science. Based on feedback from extensive classroom experience, the book has been carefully structured in order to make teaching more natural and effective. Slides and additional exercises (with solutions for lecturers) are also available through the book's supporting website to help course instructors prepare their lectures.


Document Retrieval Index

Document Retrieval Index

PDF Document Retrieval Index Download

  • Author:
  • Publisher:
  • ISBN:
  • Category : Criminal justice, Administration of
  • Languages : en
  • Pages : 640


Encyclopedia of Database Systems

Encyclopedia of Database Systems

PDF Encyclopedia of Database Systems Download

  • Author: Ling Liu
  • Publisher:
  • ISBN: 9781489979933
  • Category : Database management
  • Languages : en
  • Pages :


Modern Information Retrieval

Modern Information Retrieval

PDF Modern Information Retrieval Download

  • Author: Yates
  • Publisher: Pearson Education India
  • ISBN: 9788131709771
  • Category :
  • Languages : en
  • Pages : 540


Information Retrieval: Uncertainty and Logics

Information Retrieval: Uncertainty and Logics

PDF Information Retrieval: Uncertainty and Logics Download

  • Author: Cornelis Joost van Rijsbergen
  • Publisher: Springer Science & Business Media
  • ISBN: 1461556171
  • Category : Computers
  • Languages : en
  • Pages : 332

In recent years, there have been several attempts to define a logic for information retrieval (IR). The aim was to provide a rich and uniform representation of information and its semantics with the goal of improving retrieval effectiveness. The basis of a logical model for IR is the assumption that queries and documents can be represented effectively by logical formulae. To retrieve a document, an IR system has to infer the formula representing the query from the formula representing the document. This logical interpretation of query and document emphasizes that relevance in IR is an inference process. The use of logic to build IR models enables one to obtain models that are more general than earlier well-known IR models. Indeed, some logical models are able to represent within a uniform framework various features of IR systems such as hypermedia links, multimedia data, and user's knowledge. Logic also provides a common approach to the integration of IR systems with logical database systems. Finally, logic makes it possible to reason about an IR model and its properties. This latter possibility is becoming increasingly more important since conventional evaluation methods, although good indicators of the effectiveness of IR systems, often give results which cannot be predicted, or for that matter satisfactorily explained. However, logic by itself cannot fully model IR. The success or the failure of the inference of the query formula from the document formula is not enough to model relevance in IR. It is necessary to take into account the uncertainty inherent in such an inference process. In 1986, Van Rijsbergen proposed the uncertainty logical principle to model relevance as an uncertain inference process. When proposing the principle, Van Rijsbergen was not specific about which logic and which uncertainty theory to use. As a consequence, various logics and uncertainty theories have been proposed and investigated. The choice of an appropriate logic and uncertainty mechanism has been a main research theme in logical IR modeling leading to a number of logical IR models over the years. Information Retrieval: Uncertainty and Logics contains a collection of exciting papers proposing, developing and implementing logical IR models. This book is appropriate for use as a text for a graduate-level course on Information Retrieval or Database Systems, and as a reference for researchers and practitioners in industry.


Readings in Information Retrieval

Readings in Information Retrieval

PDF Readings in Information Retrieval Download

  • Author: Karen Sparck Jones
  • Publisher: Morgan Kaufmann
  • ISBN: 9781558604544
  • Category : Computers
  • Languages : en
  • Pages : 614

This compilation of original papers on information retrieval presents an overview, covering both general theory and specific methods, of the development and current status of information retrieval systems. Each chapter contains several papers carefully chosen to represent substantive research work that has been carried out in that area, each is preceded by an introductory overview and followed by supported references for further reading.


Per-document index for semantic searching

Per-document index for semantic searching

PDF Per-document index for semantic searching Download

  • Author:
  • Publisher:
  • ISBN:
  • Category :
  • Languages : en
  • Pages :


Introduction to Modern Information Retrieval

Introduction to Modern Information Retrieval

PDF Introduction to Modern Information Retrieval Download

  • Author: Gerard Salton
  • Publisher: New York ; Montreal : McGraw-Hill
  • ISBN:
  • Category : Computers
  • Languages : en
  • Pages : 470

Examines Concepts, Functions & Processes of Information Retrieval Systems


Language Modeling for Information Retrieval

Language Modeling for Information Retrieval

PDF Language Modeling for Information Retrieval Download

  • Author: W. Bruce Croft
  • Publisher: Springer Science & Business Media
  • ISBN: 9401701717
  • Category : Computers
  • Languages : en
  • Pages : 253

A statisticallanguage model, or more simply a language model, is a prob abilistic mechanism for generating text. Such adefinition is general enough to include an endless variety of schemes. However, a distinction should be made between generative models, which can in principle be used to synthesize artificial text, and discriminative techniques to classify text into predefined cat egories. The first statisticallanguage modeler was Claude Shannon. In exploring the application of his newly founded theory of information to human language, Shannon considered language as a statistical source, and measured how weH simple n-gram models predicted or, equivalently, compressed natural text. To do this, he estimated the entropy of English through experiments with human subjects, and also estimated the cross-entropy of the n-gram models on natural 1 text. The ability of language models to be quantitatively evaluated in tbis way is one of their important virtues. Of course, estimating the true entropy of language is an elusive goal, aiming at many moving targets, since language is so varied and evolves so quickly. Yet fifty years after Shannon's study, language models remain, by all measures, far from the Shannon entropy liInit in terms of their predictive power. However, tbis has not kept them from being useful for a variety of text processing tasks, and moreover can be viewed as encouragement that there is still great room for improvement in statisticallanguage modeling.


Indexing and Retrieval of Non-Text Information

Indexing and Retrieval of Non-Text Information

PDF Indexing and Retrieval of Non-Text Information Download

  • Author: Diane Rasmussen Neal
  • Publisher: Walter de Gruyter
  • ISBN: 3110260581
  • Category : Language Arts & Disciplines
  • Languages : en
  • Pages : 440

The scope of this volume will encompass a collection of research papers related to indexing and retrieval of online non-text information. In recent years, the Internet has seen an exponential increase in the number of documents placed online that are not in textual format. These documents appear in a variety of contexts, such as user-generated content sharing websites, social networking websites etc. and formats, including photographs, videos, recorded music, data visualizations etc. The prevalence of these contexts and data formats presents a particularly challenging task to information indexing and retrieval research due to many difficulties, such as assigning suitable semantic metadata, processing and extracting non-textual content automatically, and designing retrieval systems that "speak in the native language" of non-text documents.