01204453 Web Information Retrieval and Mining
Instructor
Asst. Prof. Dr. Bundit Manaskasemsak
Time and Place
Thursday 9:30-12:30 @Room E501, 5th Fl., Computer Engineering Building, KU.
Course Description
The course studies the theory, design, and implementation of Information Retrieval (IR) techniques in text-based information systems. The theoretical component focuses on IR methods for processing, storing, accessing, organizing, and classifying textual documents, including hypertext documents available on the World Wide Web, as well as techniques for evaluating complete retrieval systems. The practical component addresses the design and implementation of high-capacity text retrieval systems such as web search engines, covering web crawling, web indexing and retrieval, and web ranking. A variety of current research topics are also covered, including statistical studies of the web, link analysis, spam filtering, web refresh models, and web resource discovery and mining.
Students are expected to study the literature on relevant topics individually. They will also work on individual assignments and/or implementation projects. Each assignment or project must be submitted by its deadline.
References / Text Books
The course presents selected topics from the books:
  • "Modern Information Retrieval", R. Baeza-Yates and B. Ribeiro-Neto, Addison Wesley, 1999.
  • "Mining the Web: Discovering Knowledge from Hypertext Data", S. Chakrabarti, Morgan Kaufmann Publishers, 2002.
  • "Introduction to Information Retrieval", C. D. Manning, P. Raghavan, and H. Schütze, Cambridge University Press, 2008.
Course Syllabus
See more details ... HERE