Welcome to the Wiki of the course "Information Retrieval" in the winter semester 2022/2023
The course is taught by Prof. Dr. Hannah Bast, assisted by Natalie Prange. It takes place every Tuesday from 14:05 to 15:50 in seminar room 101-1-9/13 and on Zoom (meeting link, meeting ID: 668 8383 5704, passcode: IRWS22/23). The first lecture is on Tuesday 18.10.2022 and the last lecture is on Tuesday 07.02.2023. There will be no lectures on the Tuesdays 01.11.2022 (All Saints), 06.12.2022, 27.12.2022 and 03.01.2023 (Christmas break).
The tutors for this course are Daniel Bindemann, Patrick Brosi, Johannes Kalmbach, Elias Kempf, Robin Textor-Falconi and Sebastian Walter. The administrator of the supporting systems (Daphne, SVN, Forum, Jenkins) is Axel Lehmann.
Our course management system Daphne (log in with your RZ user name and password).
The forum for announcements and questions.
The manual for how to ask questions on the forum.
Our rules for the exercise sheets. Please read them carefully and completely before you start working on the exercises. They apply to all exercise sheets of this course.
Our CodingStandards for C++, Java and Python 3.
Information about Subversion (SVN) can be found here (in German) and here (in English); about some editors (including Vim) here (in German); and about the installation of gtest (for C++ only) here (in English).
The courses from previous semesters: Information Retrieval WS 21/22, Information Retrieval WS 19/20, Information Retrieval WS 18/19, Information Retrieval WS 17/18, Information Retrieval WS 16/17, Information Retrieval WS 15/16, Information Retrieval WS 13/14, Information Retrieval WS 12/13.
Here you can download our Linux Image (for VirtualBox or VMware).
A cheat sheet for NumPy and SciPy can be found here.
Lecture Slides, Video Recordings, Exercise Sheets, and Code
For visitors from outside the University of Freiburg: You can freely access all the course materials below, except the solutions, by replacing /svn/ with /svn-public/ in the URL.
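The URL rewrite described above can be sketched in a few lines of Python. This is only an illustration: the helper name `to_public_url` and the `example.org` address are made-up placeholders, not actual course links.

```python
def to_public_url(url: str) -> str:
    """Return the public variant of a course-material URL.

    Hypothetical helper: it swaps the first "/svn/" path segment
    for "/svn-public/" and leaves all other URLs unchanged.
    """
    return url.replace("/svn/", "/svn-public/", 1)


# Placeholder URL for illustration only (not a real course link).
example = "https://example.org/svn/teaching/lecture-01/slides.pdf"
print(to_public_url(example))
```

The same substitution can of course be done by hand in the browser's address bar; solutions remain inaccessible either way.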
Lecture 1, Tuesday, October 18, 2022 (Introduction, Inverted Index, Zipf's Law):
Video Recording (MP4 Download), Slides, Exercise Sheet 1, Code from the lecture + unit tests for Exercise Sheet 1, Dataset for ES1 (104,609 movies with title + description + 3 columns with scores that you can ignore for this sheet), Solution. Note: If the dataset download does not start automatically when you click the link, copy the link address and paste it into a new tab, or download it using the wget command.
Lecture 2, Tuesday, October 25, 2022 (Ranking and Evaluation):
Video Recording (MP4 Download), Slides, Exercise Sheet 2, Code template + unit tests for Exercise Sheet 2, Movies Training Benchmark, Movies Testing Benchmark, Table for your ranking results, Solution.
Lecture 5, Tuesday, November 22, 2022 (Fuzzy Search, Prefix Edit Distance, q-Gram Index):
Video Recording (MP4 Download), Slides, Exercise Sheet 5, Code template + unit tests for Exercise Sheet 5, Wikidata entities (2,627,825 entities from Wikidata with name + popularity score + description + additional columns that you can use optionally), Wikidata Entities SMALL (100,000 entities), Table for your results, Solution.
Lecture 6, Tuesday, November 29, 2022 (Web applications, Part 1):
Video Recording (MP4 Download), Slides, Exercise Sheet 6, Code template + test queries for Exercise Sheet 6, Wikidata entities (same as for ES5), Solution.
Lecture 7, Tuesday, December 13, 2022 (Web applications, Part 2):
Video Recording (MP4 Download), Slides, Exercise Sheet 7, Code template + test queries for Exercise Sheet 7, Wikidata entities, Wikidata Entities SMALL (updated datasets!), Solution.
Lecture 8, Tuesday, December 20, 2022 (Vector Space Model):
Video Recording (MP4 Download), Slides, Exercise Sheet 8, Code from the lecture + code template with unit tests for ES8, movies.tsv (same as for ES1+ES2), Movies Testing Benchmark (same as for ES2), Table for your ranking results, Solution.
Lecture 10, Tuesday, January 17, 2023 (Classification, Naive Bayes):
Video Recording (MP4 Download), Slides, Exercise Sheet 10, Code template with unit tests for ES10, Movie Genres (train), Movie Genres (test), Table for your results, Solution.
Lecture 11, Tuesday, January 24, 2023 (Linear Classifiers, Perceptrons, Logistic Regression):
Video Recording (MP4 Download), Slides, Exercise Sheet 11, Code template with unit tests for ES11, datasets (same as for ES10): Movie Genres (train), Movie Genres (test), Table for your results, Solution.
Lecture 12, Tuesday, January 31, 2023 (Knowledge Graphs, SPARQL, Translation to SQL):
Video Recording (MP4 Download), Slides, Exercise Sheet 12, Code template with unit tests for ES12, wikidata.tsv (ca. 38M triples from Wikidata), wikidata.small.tsv (smaller dataset with ca. 4.5M triples in case you have performance issues with the bigger one), Table for your results, Solution.