CMPUT 692 - Information Extraction Meets Databases

Overview

Information extraction from unstructured text has much in common with querying in databases systems. Despite some differences on how data is modeled or represented, the general goal remains the same, i.e. to retrieve data or tag elements that satisfy some user-specified constraints. In recent years, the two paradigms have become much closer thanks to the large volume of data on the Web and the need for more automated search tools. In this course, we study the areas where information extraction meets databases. In particular, we review the roots from a database perspective and some of the major related works that have emerged.

Objectives

In this course, you will gain an understanding of:

Relational querying foundations including conjunctive queries, negation and recursion
Web data wrapping and solution approaches for wrapper induction and maintenance
Information extraction and named-entity recognition
Relationship extraction and question answering

Course Work

Assignments
Presentation
Project

About

Research

Undergraduate

Graduate

Resources

CMPUT 692 - Information Extraction Meets Databases

Overview

Objectives

Course Work

Course Details