CMPUT 692 - Information Extraction Meets Databases
Overview
Information extraction from unstructured text has much in common with querying in databases systems. Despite some differences on how data is modeled or represented, the general goal remains the same, i.e. to retrieve data or tag elements that satisfy some user-specified constraints. In recent years, the two paradigms have become much closer thanks to the large volume of data on the Web and the need for more automated search tools. In this course, we study the areas where information extraction meets databases. In particular, we review the roots from a database perspective and some of the major related works that have emerged.
Objectives
In this course, you will gain an understanding of:
- Relational querying foundations including conjunctive queries, negation and recursion
- Web data wrapping and solution approaches for wrapper induction and maintenance
- Information extraction and named-entity recognition
- Relationship extraction and question answering
Course Work
- Assignments
- Presentation
- Project