= Results for Exercise Sheet 12 (statistical significance tests) =
Please add your row to the table below, following the examples already there. Report test results for each of the three test datasets in increasing size (50 examples/200 examples/all examples). Specify all numbers in percent, rounded to the first decimal place. Briefly specify your variant and any refinements you might have implemented.
||<tablewidth="100%" tableheight="auto" style="text-align:center" |2> '''Name''' |||||| '''50 Examples''' |||||| '''200 Examples''' |||||| '''3140 Examples''' ||<style="text-align:center" |2>'''Parameters''' ||
||<:> '''Baseline''' ||<:> '''Variant''' ||<:> '''p-Value''' ||<:> '''Baseline''' ||<:> '''Variant''' ||<:> '''p-Value''' ||<:> '''Baseline''' ||<:> '''Variant''' ||<:> '''p-Value''' ||
||Elmar ||<:> 74.0% ||<:> 80.0% ||<:> 47.7% ||<:> 66.0% ||<:> 68.0% ||<:> 67.1% ||<:> 69.8% ||<:> 73.2% ||<:> 0.2% || Variant: logistic regression without batches, 10 iterations ||
||Elias & Maxi ||<:> 86.4% ||<:> 84.6% ||<:> 82.7% ||<:> 63.8% ||<:> 63.9% ||<:> 83.7% ||<:> 71.4% ||<:> 72.1% ||<:> 26.9% || Variant: logistic regression (no batches), 30 iterations ||
||ES ||<:> 68.0% ||<:> 74.0% ||<:> 64.0% ||<:> 64.0% ||<:> 69.0% ||<:> 45.3% ||<:> 71.1% ||<:> 73.1% ||<:> 21.0% || Variant: logistic regression (no batches), averaging, stop if less than 7% of documents wrong ||
||ID ||<:> 68.0% ||<:> 76.0% ||<:> 52.8% ||<:> 65.0% ||<:> 66.0% ||<:> 88.2% ||<:> 68.2% ||<:> 72.4% ||<:> 1.04% || Variant: logistic regression without batches, 10 iterations + averaging ||
||Hui Hui ||<:> 74.0% ||<:> 78.0% ||<:> 69.5% ||<:> 66.0% ||<:> 67.0% ||<:> 81.8% ||<:> 69.8% ||<:> 71.3% ||<:> 15.9% || Variant: logistic regression ||
||Perspective Daily ||<:> 74.0% ||<:> 76.0% ||<:> 43.5% ||<:> 66.0% ||<:> 68.0% ||<:> 38.1% ||<:> 69.8% ||<:> 72.8% ||<:> 2.7% || Variant: Averaging ||
||Alex ||<:> 74.0% ||<:> 76.0% ||<:> 87.1% ||<:> 67.0% ||<:> 70.0% ||<:> 70.4% ||<:> 69.0% ||<:> 74.0% ||<:> 0.71% || Variant: logistic regression with averaging, 10 Iterations ||
||David ||<:> 75.2% ||<:> 76.9% ||<:> 47.3% ||<:> 67.0% ||<:> 67.3% ||<:> 49.2% ||<:> 69.7% ||<:> 72.8% ||<:> 16.9% || Variant: Averaging, 10 Iterations ||
||Robin ||<:> 70.0% ||<:> 72.0% ||<:> 82.6% ||<:> 64.0% ||<:> 68.5% ||<:> 34.1% ||<:> 70.4% ||<:> 72.0% ||<:> 15.4% || Variant: Averaging, 10 Iterations ||
||Daniel ||<:> 73.6% ||<:> 72.7% ||<:> 66.8% ||<:> 67.3% ||<:> 67.6% ||<:> 47.5% ||<:> 70.4% ||<:> 72.5% ||<:> 0.0% || Variant: Averaging, Logistic Regression, 10 Iterations ||
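
The page does not say which significance test produced the p-values in the table, so the following is only a rough sketch of one way such a number could be obtained from a baseline accuracy, a variant accuracy, and the test set size. It assumes a two-sided two-proportion Z-test with pooled variance that treats the two runs as independent samples; that choice, and the helper names `two_proportion_z_test` and `as_percent`, are assumptions made for illustration and are not taken from the exercise sheet.

```python
import math


def two_proportion_z_test(acc_a, acc_b, n_a, n_b):
    """Two-sided p-value for the difference between two accuracies.

    Assumption: both accuracies are fractions in [0, 1] measured on
    independent samples of sizes n_a and n_b (pooled-variance Z-test).
    """
    # Pooled estimate of the success probability under the null hypothesis.
    pooled = (acc_a * n_a + acc_b * n_b) / (n_a + n_b)
    # Standard error of the difference of the two proportions.
    se = math.sqrt(pooled * (1.0 - pooled) * (1.0 / n_a + 1.0 / n_b))
    if se == 0.0:
        # Both accuracies are exactly 0 or exactly 1: no evidence of a difference.
        return 1.0
    z = (acc_b - acc_a) / se
    # Two-sided tail probability of the standard normal distribution.
    return math.erfc(abs(z) / math.sqrt(2.0))


def as_percent(fraction):
    """Format a fraction as a percentage rounded to one decimal place."""
    return f"{100.0 * fraction:.1f}%"


if __name__ == "__main__":
    # Hypothetical usage with accuracies 74.0% and 80.0% on 50 examples each.
    p_value = two_proportion_z_test(0.74, 0.80, 50, 50)
    print(as_percent(p_value))
```

A paired test on the per-example outcomes would normally be more appropriate when the baseline and the variant are evaluated on the same documents; the unpaired form is used here only because it needs nothing beyond the numbers reported in a table row.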