As a combination of bachelor project / bachelor thesis.
Goal: Given a collection of PDF timetables like this one, extract machine-readable schedule data (in the GTFS format) from it.
Requirements:
- Your tool should not require any additional user input.
- You should write a tool that can be used from the command line.
- The output format should be valid GTFS.