Instructor
Asst. Prof. Dr. Bundit Manaskasemsak
Time and Place
Thursday 9:30-12:30 @Room E501, 5th Fl., Computer Engineering Building, KU.
Course Description
The course studies the theory, design, and implementation of Information Retrieval (IR) techniques
in text-based information systems. The theoretical component of the course focuses on IR methods for
the processing, storing, accessing, organization, and classification of textual documents, including
hypertext documents available on the World Wide Web, as well as various techniques for evaluation of
complete retrieval system. The practical components of the course address the design and implementation
of high-capacity text retrieval system such as web search engine. These components cover the web
crawling, web indexing and retrieving, and web ranking. A variety of current research topics are also
covered, including statistical study of the web, link analysis, spam filtering, web refresh model, and
web resource discovery and mining.
Students are expected to perform individual studies of literature on relevant topics. You will also
be working on some individual assignments and/or implementation. Each assignment/project must be
submit within the deadline.
References / Text Books
The course presents selected topics from the books:
- "Modern Information Retrieval", R. Baeza-Yates and B. Ribeiro-Neto, Addison Wesley, 1999.
- "Mining the Web: Discovering Knowledge from Hypertext Data", S. Chakrabati, MKP, 2002.
- "Introduction to Information Retrieval", C. D. Manning, P. Raghavan, and H. Schütze, Cambridge University Press, 2008.
Course Syllabus
See more detail ... HERE