Information retrieval is a branch of artificial intelligence that typically deals with text documents. In some applications, such as text classification and text clustering, information retrieval uses machine learning algorithms. This course covers the methods, algorithms, and applications of information retrieval.
Course Contents:
- Introduction to Information Retrieval
- Models of Information Retrieval: Boolean Model, Inverted Index, Vector Space Model, Text Preprocessing
- Weighting Schemes
- Text Classification: Naive Bayes and Support Vector Machine
- Feature Selection: Information Gain, Chi-Square, Term Distribution-Based
- Text Clustering: Types of Clustering, Text Labeling
- Word Embedding
- Relevance Feedback and Query Expansion
- Web Mining
- Text Summarization: Feature-Based and Semantic-Based
- Automatic Short Answer Grading
- PageRank Algorithm
Textbooks:
- Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze, Introduction to Information Retrieval, Cambridge University Press, 2008.