CMPUT 361 - Introduction to Information Retrieval

Overview

Most of the knowledge we acquire, use, and share is expressed in natural language, and preserved as primarily textual documents. This course introduces the fundamental algorithms and data structures for organizing and searching through large collections of documents, and the techniques for evaluating the quality of search results. The course also covers practical Machine Learning algorithms that apply to text and foundational technologies used by Web search engines.

Objectives

  • Boolean retrieval
  • Ranked retrieval
  • Index construction
  • Evaluating search effectiveness
  • Relevance feedback
  • Probabilistic IR and Retrieval with Language Models
  • Text clustering and classification
  • Web search

Course Work

  • Programming assignments
  • Problem sets
  • Midterms
  • Final Exam