AD Teaching Wiki:

Lecture 1, Thursday October 22, 2009

Part 1: Introduction (30 minutes)

Deutsch or English. Short introduction of myself. Demo of CompleteSearch, explain components. Difference between web search and search in homogenous collections. Comment on style of this course: exercises, Wiki, etc. Talk about block project at the end.

Part 2: Parsing / Tokenization (10 minutes)

Give examples, where this is not trivial. Chinese. UTF8. Compound words. Stemming.

Part 3: Inverted Index (30 minutes)

Why Indexing. Grep. Inverted Index. Building it. Querying it. List intersection. Quick analysis of index construction time and space and of query time.

Part 4: Exercises (20 minutes)

Go through exercises one by one. Explain about Wiki. Will be used throughout semester. For uploading exercises. For asking questions. For collaboration.

AD Teaching Wiki: SearchEnginesWS0910/Lecture1 (last edited 2009-10-20 22:56:32 by HSI-KBW-078-042-028-218)