1877
Comment:
|
10320
|
Deletions are marked like this. | Additions are marked like this. |
Line 1: | Line 1: |
#acl -All:write | |
Line 4: | Line 5: |
1. (only necessary once, before you upload something for the first time) Assume your name is Donald Duck. (1) Type the following URL in your browser: http://ad-wiki.informatik.uni-freiburg.de/teaching/SearchEnginesWS0910/DonaldDuckExercises. (2) Click on "create new empty page". (3) Add the following line as the first line of this page: #acl DonaldDuck:read,write -All:read. (Without the final dot.) This will ensure that only yourself and the organizers of the course can see your solutions to the exercises, the number of points you got, etc. (4) Save the page. | 1. (only necessary once, before you upload something for the first time) Assume your name is Donald Duck. (0) If you haven't already done so, create a Wiki account with your name DonaldDuck (click on "Login" on the top left, then click on "you can create one now"). Always be logged in when you are about to change anything on the Wiki. (1) Type the following URL in your browser: http://ad-wiki.informatik.uni-freiburg.de/teaching/SearchEnginesWS0910/DonaldDuckExercises. (2) Click on "create new empty page" and save the empty page. (3) We will then add asap the following line to your page: #acl DonaldDuck:read,write -All:read. This will ensure that only yourself and the organizers of the course can see your solutions to the exercises, the number of points you got, etc. |
Line 6: | Line 7: |
2. (assuming you already have created your page http://ad-wiki.informatik.uni-freiburg.de/teaching/SearchEnginesWS0910/DonaldDuckExercises as described in 0.) (1) Recall that your name is not Donald Duck. (2) Go to your page FirstnameLastnameExercises. (2) Upload your solutions there as PDF (no other formats allowed), giving your file the name firstname_lastname_ex1.pdf. (3) Upload your code separately as ZIP or GZIPPED TAR archive, giving your file the name firstname_lastname.zip or firstname_lastname.tgz. (4) Put the corresponding links in the table below, as well as the other information requested. Follow the pattern of the lines already there. ||'''Name''' ||'''Link to uploaded solution''' ||'''Link to uploaded code''' ||'''Name of collection''' ||'''#Docs in collection''' ||'''Zipf epsilon''' || ||[[SearchEnginesWS0910/JohannesStorkExercises|Johannes Stork]] ||TODO||TODO ||RFCs and german news websites ||5540 and 1836 ||0.5562 and 0.2462 || ||[[SearchEnginesWS0910/ChristianSimonExercises|Christian Simon]] ||[[attachment:SearchEnginesWS0910/ChristianSimonExercises/christian_simon_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/ChristianSimonExercises/christian_simon_ex1.zip|PDF]] ||selected archives from www.textfiles.com ||2865 ||0.052 || |
2. (assuming you already have created your page http://ad-wiki.informatik.uni-freiburg.de/teaching/SearchEnginesWS0910/DonaldDuckExercises as described above) (1) Recall that your name is not Donald Duck. (2) Go to your page DonaldDuckExercises. (2) Upload your solutions there as PDF (no other formats allowed), giving your file the name donald_duck_ex1.pdf. (3) Upload your code separately as ZIP or GZIPPED TAR archive, giving your file the name donald_duck_ex1.zip or donald_duck_ex1.tgz. (4) Put the corresponding links in the table below, as well as the other information requested. Follow the pattern of the lines already there. '''PLEASE UPLOAD SOLUTIONS (PDF) AND CODE (ZIP OR TGZ) SEPARATELY !''' ||<tablewidth="828px" tableheight="764px">'''Name''' ||'''Link to uploaded solution''' ||'''Link to uploaded code''' ||'''Name of collection''' ||'''#Docs in collection''' ||'''Zipf epsilon''' || ||[[SearchEnginesWS0910/JohannesStorkExercises|Johannes Stork]] ||[[attachment:johannes_stork_ex1.pdf|PDF]] ||[[attachment:johannes_stork_ex1.tgz|TARGZ]] ||RFCs and german news websites and www.textfiles.com ||5540 and 5415 and 48799 ||0.1052 and 0.0762 and 0.0762 || ||[[SearchEnginesWS0910/ChristianSimonExercises|Christian Simon]] ||[[attachment:SearchEnginesWS0910/ChristianSimonExercises/christian_simon_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/ChristianSimonExercises/christian_simon_ex1.zip|ZIP]] ||selected archives from www.textfiles.com ||2865 ||0.052 || ||[[SearchEnginesWS0910/MatthiasSauerExercises|Matthias Sauer]] ||Included in Code zip ||[[attachment:SearchEnginesWS0910/MatthiasSauerExercises/matthias_sauer_ex1.zip|ZIP]] ||non-selected archives from www.textfiles.com ||4328 ||0.788 || ||[[SearchEnginesWS0910/ZhongjieCaiExercises|Zhongjie Cai]] ||[[attachment:SearchEnginesWS0910/ZhongjieCaiExercises/zhongjie_cai_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/ZhongjieCaiExercises/zhongjie_cai_ex1.zip|ZIP]] ||RFC Documents and Text Stories ||5549 and 1255 ||0.6364 and 0.5137 || ||[[SearchEnginesWS0910/WaldemarWittmannExercises|Waldemar Wittmann]] ||[[attachment:SearchEnginesWS0910/WaldemarWittmannExercises/waldemar_wittmann_ex1_update.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/WaldemarWittmannExercises/waldemar_wittmann_ex1_update2.tar.gz|TARGZ]] ||RFC Documents ||1459 ||0.08396 || ||[[SearchEnginesWS0910/FlorianBaeurleExercises|Florian Bäurle]] ||[[attachment:SearchEnginesWS0910/FlorianBaeurleExercises/florian_baeurle_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/FlorianBaeurleExercises/florian_baeurle_ex1.zip|ZIP]] ||RFCs and selected files from www.textfiles.com ||44618 ||0.08243 || ||[[SearchEnginesWS0910/MariusGreitschusExercises|Marius Greitschus]] ||[[attachment:SearchEnginesWS0910/MariusGreitschusExercises/marius_greitschus_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/MariusGreitschusExercises/marius_greitschus_ex1.tar.gz|.tar.gz]] ||GNU Man-Pages ||5051 ||0.098 || ||[[SearchEnginesWS0910/MarkusGruetznerExercises|Markus Gruetzner]] ||[[attachment:SearchEnginesWS0910/MarkusGruetznerExercises/markus_gruetzner_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/MarkusGruetznerExercises/markus_gruetzner_ex1.zip|ZIP]] ||RFC ||~5500 ||0.01299 || ||[[SearchEnginesWS0910/ThomasLiebetrautExercises|Thomas Liebetraut]] ||[[attachment:SearchEnginesWS0910/ThomasLiebetrautExercises/thomas_liebetraut_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/ThomasLiebetrautExercises/thomas_liebetraut_ex1.tgz|tgz]] ||IRC logs ||~3800 ||0.122 || ||[[SearchEnginesWS0910/ClaudiusKorzenExercises|Claudius Korzen]] ||[[attachment:SearchEnginesWS0910/ClaudiusKorzenExercises/claudius_korzen_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/ClaudiusKorzenExercises/claudius_korzen_ex1.zip|ZIP]] ||RFC's ||1460 ||0.031 || ||[[SearchEnginesWS0910/DanielSchauenbergExercises|Daniel Schauenberg]] ||[[attachment:SearchEnginesWS0910/DanielSchauenbergExercises/daniel_schauenberg_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/DanielSchauenbergExercises/daniel_schauenberg_ex1.tar.gz|tar.gz]] ||Excerpt from RFCs ||2000 ||0.0164 || ||[[SearchEnginesWS0910/AlexanderGutjahrExercises|Alexander Gutjahr]] ||[[attachment:SearchEnginesWS0910/AlexanderGutjahrExercises/alexander_gutjahr_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/AlexanderGutjahrExercises/alexander_gutjahr_ex1.tar.gz|tar.gz]] ||RFCs 1- 2000 ||ca. 2000 ||0.06095 || ||[[SearchEnginesWS0910/BjörnBuchholdExercises|Björn Buchhold]] ||[[attachment:SearchEnginesWS0910/BjörnBuchholdExercises/björn_buchhold_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/BjörnBuchholdExercises/björn_buchhold_ex1.zip|ZIP]] ||some RFCs ||3100 ||0.017163 || ||[[SearchEnginesWS0910/nibblerExercises|Ivo M.]] ||[[attachment:SearchEnginesWS0910/nibblerExercises/Blatt1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/nibblerExercises/abgabe2.src|tar]] ||RFCs ||5520 ||0.94 || ||[[SearchEnginesWS0910/MirkoBrodesserExercises|Mirko Brodesser]] ||[[attachment:SearchEnginesWS0910/MirkoBrodesserExercises/mirko_brodesser_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/MirkoBrodesserExercises/mirko_brodesser_ex1.zip|ZIP]] ||some humor/fun files from textfiles.com ||~1000 ||0.1 || ||[[SearchEnginesWS0910/TriatmokoExercises|Triatmoko]] ||[[attachment:SearchEnginesWS0910/TriatmokoExercises/Triatmoko_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/TriatmokoExercises/Triatmoko_ex1.rar|RAR]] ||Archives from www.textfiles.com ||ca 2000 ||0.016 || ||[[SearchEnginesWS0910/AlexanderNutzExercises|AlexanderNutz]] ||[[attachment:SearchEnginesWS0910/AlexanderNutzExercises/alexander_nutz_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/AlexanderNutzExercises/alexander_nutz_ex1.zip|ZIP]] ||html-dateien von fünf-filmfreunde.de (blog über filme..) ||~ 5000 ||~ 0.022 || ||[[SearchEnginesWS0910/JonasKrischExercises|Jonas Krisch]] ||[[attachment:SearchEnginesWS0910/JonasKrischExcersises/jonas_krisch_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/JonasKrischExercises/jonas_krisch_ex1.zip|ZIP]] ||textfiles ||~1500 ||0.154 || ||[[SearchEnginesWS0910/AndreBorgeatExercises|Andre Borgeat]] ||[[attachment:SearchEnginesWS0910/AndreBorgeatExercises/andre_borgeat_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/AndreBorgeatExercises/andre_borgeat_ex1.zip|ZIP]] ||Reuters-21578 ||~20000 || || ||[[SearchEnginesWS0910/JonasKoenemannExercises|Jonas Koenemann]] ||[[attachment:SearchEnginesWS0910/JonasKoenemannExercises/Jonas_Koenemann_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/JonasKoenemannExercises/Jonas_Koenemann_ex1.zip|ZIP]] ||wegt from different pages ||~1500 || || ||[[SearchEnginesWS0910/PareshParadkarExcercises|Paresh Paradkar]] ||[[attachment:SearchEnginesWS0910/PareshParadkarExcercises/Paresh_Paradkar_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/PareshParadkarExcercises/Paresh_Paradkar_ex1.zip|ZIP]] ||Selective archives from www.ibibo.org ||~1600 ||0.05966 || ||[[SearchEnginesWS0910/AlexanderSchneiderExercises|AlexanderSchneider]] ||[[attachment:SearchEnginesWS0910/AlexanderSchneiderExercises/alexander_schneider_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/AlexanderSchneiderExercises/alexander_schneider_ex1.zip|ZIP]] ||selected archives from http://textfiles.com/ ||~ 1000 ||~ 0.023 || ||[[SearchEnginesWS0910/JensSilvaSantistebanExercises|JensSilvaSantisteban]] ||[[attachment:SearchEnginesWS0910/JensSilvaSantistebanExercises/Jens_SilvaSantisteban_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/JensSilvaSantistebanExercises/Jens_SilvaSantisteban_ex1.zip|ZIP]] ||RFCs and some other files from the web ||~ 1400 ||~ 0.084 || ||[[SearchEnginesWS0910/DanielFreyExercises|Daniel Frey]] ||[[attachment:SearchEnginesWS0910/DanielFreyExercises/blatt1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/DanielFreyExercises/source.zip|ZIP]] ||archives from http://textfiles.com/ ||~ 50000 ||n.a. || ||[[SearchEnginesWS0910/JohannBetzExercises|JohannBetz]] ||[[attachment:SearchEnginesWS0910/JohannBetzExercises/johann_betz_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/JohannBetzExercises/johann_betz_ex1.zip|ZIP]] ||All textual RFCs ||5536 ||n.a. || ||[[SearchEnginesWS0910/MatthiasFrorathExercises|Matthias Frorath]] ||[[attachment:SearchEnginesWS0910/MatthiasFrorathExercises/Matthias_Frorath_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/MatthiasFrorathExercises/Matthias_Frorath_ex1.zip|ZIP]] ||Some files from textfiles.com ||~ 1300 || || ||[[SearchEnginesWS0910/IvoChichkovExercises|Ivo Chichkov]] ||[[attachment:SearchEnginesWS0910/IvoChichkovExercises/ivo_chichkov_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/IvoChichkovExercises/ivo_chichkov_ex1.zip|ZIP]] ||text converted HTML files - eNews ||~1500 ||0.154 || ||[[SearchEnginesWS0910/ManuelaOrtliebExercises|Manuela Ortlieb]] ||[[attachment:SearchEnginesWS0910/ManuelaOrtliebExercises/Manuela_Ortlieb_ex1.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/ManuelaOrtliebExercises/Manuela_Ortlieb_ex1.zip|ZIP]] ||text converted different eBooks ||2288 ||0.001 || ||[[SearchEnginesWS0910/JonasSterniskoExercises|Jonas Sternisko]] ||[[attachment:SearchEnginesWS0910/JonasSterniskoExercises/jonas_sternisko_ex01.pdf|PDF]] ||[[attachment:SearchEnginesWS0910/JonasSterniskoExercises/jonas_sternisko_ex01.tar.gz|.tgz]] ||text mined with wget from different sources ||27k+ ||0.223 || ||[[SearchEnginesWS0910/EricLacherExercises|Eric Lacher]] || [[http://data.lacher.name/blatt1.zip|blatt1]] ||[[http://data.lacher.name/InvIndex.zip|InvIndex.zip]] ||RFCs ||about 6000 ||0.912232 || |
Exercise Sheet 1
Instructions:
1. (only necessary once, before you upload something for the first time) Assume your name is Donald Duck. (0) If you haven't already done so, create a Wiki account with your name DonaldDuck (click on "Login" on the top left, then click on "you can create one now"). Always be logged in when you are about to change anything on the Wiki. (1) Type the following URL in your browser: http://ad-wiki.informatik.uni-freiburg.de/teaching/SearchEnginesWS0910/DonaldDuckExercises. (2) Click on "create new empty page" and save the empty page. (3) We will then add asap the following line to your page: #acl DonaldDuck:read,write -All:read. This will ensure that only yourself and the organizers of the course can see your solutions to the exercises, the number of points you got, etc.
2. (assuming you already have created your page http://ad-wiki.informatik.uni-freiburg.de/teaching/SearchEnginesWS0910/DonaldDuckExercises as described above) (1) Recall that your name is not Donald Duck. (2) Go to your page DonaldDuckExercises. (2) Upload your solutions there as PDF (no other formats allowed), giving your file the name donald_duck_ex1.pdf. (3) Upload your code separately as ZIP or GZIPPED TAR archive, giving your file the name donald_duck_ex1.zip or donald_duck_ex1.tgz. (4) Put the corresponding links in the table below, as well as the other information requested. Follow the pattern of the lines already there.
PLEASE UPLOAD SOLUTIONS (PDF) AND CODE (ZIP OR TGZ) SEPARATELY !
Name |
Link to uploaded solution |
Link to uploaded code |
Name of collection |
#Docs in collection |
Zipf epsilon |
RFCs and german news websites and www.textfiles.com |
5540 and 5415 and 48799 |
0.1052 and 0.0762 and 0.0762 |
|||
selected archives from www.textfiles.com |
2865 |
0.052 |
|||
Included in Code zip |
non-selected archives from www.textfiles.com |
4328 |
0.788 |
||
RFC Documents and Text Stories |
5549 and 1255 |
0.6364 and 0.5137 |
|||
RFC Documents |
1459 |
0.08396 |
|||
RFCs and selected files from www.textfiles.com |
44618 |
0.08243 |
|||
GNU Man-Pages |
5051 |
0.098 |
|||
RFC |
~5500 |
0.01299 |
|||
IRC logs |
~3800 |
0.122 |
|||
RFC's |
1460 |
0.031 |
|||
Excerpt from RFCs |
2000 |
0.0164 |
|||
RFCs 1- 2000 |
ca. 2000 |
0.06095 |
|||
some RFCs |
3100 |
0.017163 |
|||
RFCs |
5520 |
0.94 |
|||
some humor/fun files from textfiles.com |
~1000 |
0.1 |
|||
Archives from www.textfiles.com |
ca 2000 |
0.016 |
|||
html-dateien von fünf-filmfreunde.de (blog über filme..) |
~ 5000 |
~ 0.022 |
|||
textfiles |
~1500 |
0.154 |
|||
Reuters-21578 |
~20000 |
|
|||
wegt from different pages |
~1500 |
|
|||
Selective archives from www.ibibo.org |
~1600 |
0.05966 |
|||
selected archives from http://textfiles.com/ |
~ 1000 |
~ 0.023 |
|||
RFCs and some other files from the web |
~ 1400 |
~ 0.084 |
|||
archives from http://textfiles.com/ |
~ 50000 |
n.a. |
|||
All textual RFCs |
5536 |
n.a. |
|||
Some files from textfiles.com |
~ 1300 |
|
|||
text converted HTML files - eNews |
~1500 |
0.154 |
|||
text converted different eBooks |
2288 |
0.001 |
|||
text mined with wget from different sources |
27k+ |
0.223 |
|||
RFCs |
about 6000 |
0.912232 |