== Overview of the lectures (tentative) ==

Here is an overview of the lectures that will be given. If I have the dates right, there will be 16 lectures overall: 9 before the Christmas break and 7 after it. The following list is tentative and may (and probably will) change as the semester evolves.

'''Lecture 1:''' German / English, Introduction, Demos, Parsing, Grep, Inverted Index, List Intersection, Exercises + Wiki HowTo. Goal: after this lecture you should be able to download a collection, parse it, build an index for it, and query that index.

'''Lecture 2:''' HTML, form tag, web server, Apache, TCP / IP, socket communication. Goal: after this lecture you should be able to write a simple search engine user interface that sends a query to the backend, gets the answer back, and displays it.

'''Lecture 3:''' Ranking, Vector Space Model, Relevance Assessment. Goal: enhance your index with scores and your search engine with ranking. Automatic query evaluation.

'''Lecture 4:''' Efficiency 1: compression, IO-efficiency / cache-efficiency, analysis. Goal: speed up query processing and be able to analyze it.

'''Lecture 5:''' Efficiency 2: top-k retrieval, list intersection. Goal: further speed-up of query processing.

'''Lecture 6:''' Wildcard search, synonym search. Goal: ditto.

'''Lecture 7:''' DOM, JavaScript, AJAX, jQuery. Goal: build an interactive user interface, e.g., search as you type.

'''Lecture 8:''' Error-correcting search, "did you mean ...?". Goal: ditto.

'''Lecture 9:''' Suffix Arrays. Goal: ditto.

CHRISTMAS BREAK (2 weeks)

'''Lecture 10:''' Entity recognition.

'''Lecture 11:''' XML retrieval: what is XML, XQuery, XPath, NEXI, XML retrieval with an inverted index, XML retrieval via suffix arrays.

'''Lecture 12:''' Semantic Web, SPARQL.

'''Lecture 13:''' Programming languages. DO THAT EARLIER!

'''Lecture 14:''' Clustering.

'''Lecture 15:''' Machine Learning, SVMs.

'''Lecture 16:''' Overflow buffer. Talk about the upcoming 2-week block project.

Other topics: Internationalization / UTF-8, MapReduce, Web crawling.

END OF SEMESTER

Right after the lectures, a 2-week block project on the topic of the lectures will take place.