#acl Claudius Korzen:read,write Patrick Brosi:read,write Axel Lehmann:read,write Natalie Prange:read,write All:read This page gives an overview of available, ongoing and completed Bachelor's and Master's Projects and Theses at the Chair for Algorithms and Data Structures. <> = Available projects and theses = ||'''Description''' ||'''Your interests or skills''' ||'''Supervisor''' || = Ongoing projects and theses = ||'''Description''' ||'''Your interests or skills''' ||'''Supervisor''' || = Completed projects and theses = For a detailed description of completed projects see our [[https://ad-blog.informatik.uni-freiburg.de/|AD Blog]]. For details about completed theses see our [[https://ad.informatik.uni-freiburg.de/publikationen/bachelor_master_arbeiten|Website]]. ||'''Description''' ||'''Your interests or skills''' ||'''Supervisor''' || ||'''Merging Overlapping GTFS Feeds (Bachelor project or thesis):''' Many transportation companies publish their timetable data either directly as GTFS feeds or in formats that can be converted to GTFS. As soon as you have two GTFS feeds (two sets of timetable data) that cover either the same or adjacent areas, the problem of duplicate trips arises. You should develop a tool that merges two or more GTFS feeds and solves duplication / fragmentation issues. As a bachelor project or thesis. ||blabla would be helpful, interest in bla ||[[https://ad.informatik.uni-freiburg.de/staff/brosi|Patrick Brosi]] || ||'''Extract and Analyze Scientist's Homepages (project and/or thesis):''' Extract a large number of scientist's homepages from the [[http://commoncrawl.org/|CommonCrawl]] web crawl. Extract the central information from these pages, including: name, profession, gender, affiliation. It will be relativel straightforward to get results of medium quality. The challenge is to achieve results of high quality. Machine learning will be crucial to achieve that. Exploring suitable methods is part of the challenge. || ||[[https://ad.informatik.uni-freiburg.de/staff/bast|Hannah Bast]] || ||'''Tokenization Repair (project and/or thesis):'''Interesting and well-defined problem, the solution of which is relevant in a variety of information retrieval scenarios. Simple rule-based solutions come to mind easily, but machine learning is key to get very good results. A background in machine learning, or a strong willingness to aquire one as part of the project/thesis, is therefore mandatory for this project. || ||[[https://ad.informatik.uni-freiburg.de/staff/bast|Hannah Bast]] ||