CSI4107: Information Retrieval and the Internet
Description
Basic principles of Information Retrieval. Indexing methods. Query processing. Linguistic aspects of Information Retrieval. Agents and artificial intelligence approaches to Information Retrieval. Relation of Information Retrieval to the World Wide Web. Search engines. Servers and clients. Browser and server side programming for Information Retrieval. Source
Content
Professor: Caroline Barrière
Grade: A
Projects
The focus of the course was to learn in a hands-on way by using the lecture content to build our own search engine. This included using vector space and boolean models of information retrieval, implmenting user-facing features such as spelling correction, and more. My continued work on this project lives here.
Tests
There were four quizzes throughout the course, covering:
- Boolean and Vector Space retrieval models
- Language models, Probabilistic models, Query Completion, Model evaluation
- Machine learning (supervised and unsupervised), Query completion
- PageRank, Web crawling
Research Project
The final component of the course was an independent research project, within which we presented a 6 minute talk and authored a written report. The page for my talk is here (though most of the slide content is visual).