Personal tools

251-0341-01L Information Retrieval

News

 
21/02: To see your exam papers, you can come to CAB F78 during following hours:
  • Tuesday, 22 February: 10:00 – 12:00
  • Thursday, 24 February: 14:00 – 16:00
 
14/12: Last project's spec now contains the second phase too.
 
9/12: Last project specification is added now. Notice that doing this project is not mandatory if you have handed in all three previous projects.
 
24/11: Specification of the third project's second phase (clustering task) is now available.
 
17/11: Third project's specification is available now.
 
04/11: In next two exercise classes we will be covering basics of probability theory. Moreover, third project's specification will be released on November the 17th and the due date for its first phase is the week after.
 

27/10: Updated specification of the second project is available now.

20/10: Specification of the second project is added now.

06/10: Specification of the first project now includes the 2nd phase too.

29/09: Check out the course materials' page for the specification of the first project. Notice that the deadline for the first phase of this project is next Wednesday.

10/09: Happy new semester!


Course Description

The course presents an introduction to the field of information retrieval and discusses automated techniques to effectively handle and manage unstructured and semi-structured information. This includes methods and principles that are at the heart of various systems for information access, such as Web or enterprise search engines, categorization and recommender systems, as well as information extraction and knowledge management tools.

The plan for exercise classes: there will be 3 or 4 small programming projects which one of them is optional and the rest are mandatory. In other words, in order to pass the course you need to hand in all, except one, of the projects. It's up to you to choose which project to skip. Projects will be done by groups of 2 or 3 students. Please form (or join)  a group as soon as possible and let the TAs know. The projects results and artifacts will be discussed face to face in exercise classes. Moreover, some of the exercise sessions are reserved for giving tutorials on the subjects that are not fully covered in lectures (mainly, the necessary mathematical basics). The exact  schedule for these tutorials will be announced in advance.

 

Course Materials

See here 


Lecturer

Joachim Buhmann

Donald Kossmann

 

TAs


Language

English


Syllabus

  1. Introduction
  2. Vocabulary and Dictionaries
  3. Boolean Retrieval Model
  4. Index Construction and Compression
  5. Vector Space Model
  6. IR Evaluation
  7. Relevance Feedback and Query Expansion
  8. Text Categorization
  9. Document Clustering
  10. Web Search
  11. Link Analysis
 

Schedule

  • Lecture:   Wed     09:00 - 11:00   ML F 34
  • Exercise:  Wed     11:00 - 12:00   ML F 34


Literature

The main textbook of the course is:

Document Actions