1755
Comment:
|
1531
|
Deletions are marked like this. | Additions are marked like this. |
Line 19: | Line 19: |
||Samuel ||0.2, 0.2, 0.4, 0.6, 0.8 ||0.4, 0.2, 0.4, 0.8, 0.6 ||29.0% ||86.7% ||28.3% || ---- /!\ '''Edit conflict - other version:''' ---- |
||Samuel ||0.2, 0.2, 0.4, 0.6, 0.8 ||0.4, 0.2, 0.4, 0.8, 0.6 ||28.9% ||86.7% ||28.3% || |
Line 23: | Line 21: |
---- /!\ '''Edit conflict - your version:''' ---- ||Ramin ||0, 0, 0.4, 0.3, 1||0, 0, 0.6, 0.2, 1 ||6.6% ||97.7% ||8.2% || ---- /!\ '''End of edit conflict''' ---- |
Results for Exercise Sheet 13 (Statistical Significance)
Please add your row to the table below, following the examples already there:
- The five precision numbers for TF-IDF.
- The five precision numbers for BM25.
- The p-value according to Student's T-Test, in percent with one digit after the dot.
- The p-value according to Fisher's Randomization Test, in percent with one digit after the dot.
- The p-value according to the Z-Test, in percent with one digit after the dot.
Note that your figures can (and probably will) vary, because of differences in how exactly you parsed the text collections, and how exactly you determined relevance of a returned document for a query.
Name |
Precision TF-IDF |
Precision BM25 |
p-value T-Test |
p-value R-Test |
p-value Z-Test |
Björn |
0.2, 0.2, 0.2, 0.4, 0.6 |
0.6, 0.2, 0.2, 0.6, 0.4 |
17.8% |
55.3% |
14.7% |
Janosch |
0.2, 0.4, 0.4, 0.8, 0.4 |
0.4, 0.4, 0.4, 0.8, 0.6 |
19.8% |
58.9% |
16.8% |
MartinM |
0.4, 0.2, 0.6, 0.8, 0.8 |
0.2, 0.2, 0.4, 0.8, 0.8 |
34.7% |
69.7% |
32.4% |
Jens |
0.2, 0.4, 0.4, 0.8, 1.0 |
0.6, 0.4, 0.4, 0.8, 1.0 |
18.4% |
39.4% |
17.3% |
Nico |
0.2, 0.2, 0.4, 0.6, 0.8 |
0.4, 0.2, 0.4, 0.6, 0.8 |
|
|
|
Pat |
0.4, 0.6, 0.6, 0.8, 1.0 |
1.0, 0.6, 0.6, 0.8, 0.8 |
9.9% |
30.0% |
8.4% |
Samuel |
0.2, 0.2, 0.4, 0.6, 0.8 |
0.4, 0.2, 0.4, 0.8, 0.6 |
28.9% |
86.7% |
28.3% |
Ramin |
0, 0, 0.4, 0.3, 1 |
0, 0, 0.6, 0.2, 1 |
6.6% |
97.7% |
8.2% |