Differences between revisions 28 and 120 (spanning 92 versions)

Results for Exercise Sheet 12 (statistical significance tests)

Please add your row to the table below, following the examples already there. Report test results for each of the three test datasets in order of increasing size (50 examples, 200 examples, all examples). Give all numbers in percent, rounded to one decimal place. Briefly describe your variant and any refinements you implemented.
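A row in the wiki markup used by this table (`||Name ||<:> value ||...`) can be generated instead of typed by hand. The following is a minimal sketch, assuming accuracies and p-values are available as fractions in [0, 1]; `format_percent` and `format_row` are hypothetical helper names and not part of the exercise code.

```python
def format_percent(fraction: float) -> str:
    """Render a fraction in [0, 1] as a percentage with one decimal place."""
    return f"{fraction * 100:.1f}%"


def format_row(name: str, results, parameters: str) -> str:
    """Build one table row in the wiki markup used on this page.

    results: three (baseline, variant, p_value) tuples of fractions,
    ordered as 50 examples, 200 examples, all examples.
    """
    if len(results) != 3:
        raise ValueError("expected results for exactly three datasets")
    cells = []
    for baseline, variant, p_value in results:
        for value in (baseline, variant, p_value):
            # "<:>" centers the cell, as in the existing rows.
            cells.append(f"||<:> {format_percent(value)} ")
    return f"||{name} " + "".join(cells) + f"|| {parameters} ||"


# Example with the numbers from the "Hui Hui" row.
print(format_row(
    "Hui Hui",
    [(0.74, 0.78, 0.695), (0.66, 0.67, 0.818), (0.698, 0.713, 0.159)],
    "Variant: logistic regression",
))
```

Formatting through one helper keeps every cell at one decimal place, which the instructions above ask for.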

Name	50 Examples			200 Examples			3140 Examples			Parameters
Name	Baseline	Variant	p-Value	Baseline	Variant	p-Value	Baseline	Variant	p-Value	Parameters
Elmar	74.0%	80.0%	47.7%	66.0%	68.0%	67.1%	69.8%	73.2%	0.2%	Variant: logistic regression without batches, 10 iterations
Elias & Maxi	86.4%	84.6%	82.7%	63.8%	63.9%	83.7%	71.4%	72.1%	26.9%	Variant: logistic regression (no batches), 30 iterations
ES	68.0%	74.0%	64.0%	64.0%	69.0%	45.3%	71.1%	73.1%	21.0%	Variant: logistic regression (no batches), averaging, stop if less than 7% of documents are wrong
ID	68.0%	76.0%	52.8%	65.0%	66.0%	88.2%	68.2%	72.4%	1.04%	Variant: logistic regression without batches, 10 iterations + averaging
Hui Hui	74.0%	78.0%	69.5%	66.0%	67.0%	81.8%	69.8%	71.3%	15.9%	Variant: logistic regression
Perspective Daily	74.0%	76.0%	43.5%	66.0%	68.0%	38.1%	69.8%	72.8%	2.7%	Variant: Averaging
Alex	74.0%	76.0%	87.1%	67.0%	70.0%	70.4%	69.0%	74.0%	0.71%	Variant: logistic regression with averaging, 10 Iterations
David	75.2%	76.9%	47.3%	67.0%	67.3%	49.2%	69.7%	72.8%	16.9%	Variant: Averaging, 10 Iterations
Robin	70.0%	72.0%	82.6%	64.0%	68.5%	34.1%	70.4%	72.0%	15.4%	Variant: Averaging, 10 Iterations
Daniel	73.6%	72.7%	66.8%	67.3%	67.6%	47.5%	70.4%	72.5%	0.0%	Variant: Averaging, Logistic Regression, 10 Iterations
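
The page does not say which test produced the p-values in the table; an earlier revision of the table had separate Z-test and T-test columns. As a minimal sketch of one common option, the code below computes a two-sided p-value with a pooled two-proportion z-test. It assumes the two result sets are independent samples of equal size, which is a simplification when both classifiers are evaluated on the same examples (a paired test would then be the usual choice). The function name `two_proportion_z_test` is hypothetical.

```python
import math


def two_proportion_z_test(correct_a: int, correct_b: int, n: int) -> float:
    """Two-sided p-value for the difference between two accuracies.

    Assumption: each classifier was scored on n examples and the two sets
    of outcomes are treated as independent samples (pooled z-test).
    """
    if n <= 0:
        raise ValueError("n must be positive")
    p_a = correct_a / n
    p_b = correct_b / n
    pooled = (correct_a + correct_b) / (2 * n)
    std_err = math.sqrt(pooled * (1 - pooled) * 2 / n)
    if std_err == 0.0:
        # Both classifiers are all-correct or all-wrong: no evidence of a difference.
        return 1.0
    z = (p_b - p_a) / std_err
    # 2 * (1 - Phi(|z|)) equals erfc(|z| / sqrt(2)) for the standard normal CDF Phi.
    return math.erfc(abs(z) / math.sqrt(2))


# 37 of 50 correct (74.0%) versus 40 of 50 correct (80.0%).
p_value = two_proportion_z_test(37, 40, 50)
print(f"{p_value * 100:.1f}%")
```

With a fixed accuracy gap, the standard error shrinks as the number of examples grows, so the same difference yields a smaller p-value on the larger test sets.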

- Revision 28 as of 2016-01-25 21:03:56
  Size: 840
  Editor: adpult
+ Revision 120 as of 2016-02-02 12:05:22
  Size: 2764
  Editor: 10
-Deletions are marked like this.
+Additions are marked like this.
 Line 1:
-Describe InformationRetrievalWS1516/ResultsES12 here.
+#acl All:read,write
= Results for Exercise Sheet 12 (statistical significance tests) =
Please add your row to the table below, following the examples already there. Report test results for each of the three test datasets in order of increasing size (50 examples, 200 examples, all examples). Give all numbers in percent, rounded to one decimal place. Briefly describe your variant and any refinements you implemented.
-Line 3:
+Line 5:
-'''Precision''': the precision in percent rounded to integer
'''p-value (Z)''': the p-value of the Z-tests in percent rounded to integer
'''p-value (T)''': the p-value of the T-tests in percent rounded to integer

||<tablewidth="100%" tableheight="auto" style="text-align:center" |2> '''Name''' ||<:> '''Baseline''' |||||| '''Variant 1''' |||||| '''Variant 2'''  ||<style="text-align:center" |2>'''Parameters''' ||
||<:>   '''Precision''' ||<:>   '''Precision''' ||<:> '''p-value (Z)''' ||<:> '''p-value (T)''' ||<:>   '''Precision''' ||<:> '''p-value (Z)''' ||<:> '''p-value (T)''' ||
||Elmar         ||<:>50 / 60 / 70||<:>50 / 60 / 72||<:>1.5 / 4.6 / 2 ||<:>1.5 / 4.6 / 2 ||<:>50 / 60 / 73||<:>1.5 / 4.6 / 0 ||<:>1.5 / 4.6 / 0 || V1: avg. perceptron, V2: log. regression ||
+||<tablewidth="100%" tableheight="auto" style="text-align:center" |2> '''Name''' |||||| '''50 Examples''' |||||| '''200 Examples''' |||||| '''3140 Examples'''  ||<style="text-align:center" |2>'''Parameters''' ||
||<:>   '''Baseline''' ||<style="text-align:center"> '''Variant''' ||<:> '''p-Value''' ||<:>   '''Baseline''' ||<:> '''Variant''' ||<:> '''p-Value''' ||<:>   '''Baseline''' ||<:> '''Variant''' ||<:> '''p-Value''' ||
||Elmar      ||<:>    74.0%        ||<:>   80.0%        ||<:>     47.7%       ||<:>   66.0%     ||<:>    68.0%     ||<:>   67.1%        ||<:>    69.8%      ||<:>    73.2%       ||<:>   0.2%           || Variant: logistic regression without batches, 10 iterations ||
||Elias & Maxi  ||<:> 86.4% ||<:> 84.6% ||<:> 82.7% ||<:> 63.8% ||<:> 63.9% ||<:> 83.7% ||<:> 71.4% ||<:> 72.1% ||<:> 26.9% || Variant: logistic regression (no batches), 30 iterations ||
||ES            ||<:> 68.0% ||<:> 74.0% ||<:> 64.0% ||<:> 64.0% ||<:> 69.0% ||<:> 45.3% ||<:> 71.1% ||<:> 73.1% ||<:> 21.0% || Variant: logistic regression (no batches), averaging, stop if less than 7% of documents are wrong ||
||ID            ||<:> 68.0% ||<:> 76.0% ||<:> 52.8% ||<:> 65.0% ||<:> 66.0% ||<:> 88.2% ||<:> 68.2% ||<:> 72.4% ||<:> 1.04% || Variant: logistic regression without batches, 10 iterations + averaging ||
||Hui Hui       ||<:> 74.0% ||<:> 78.0% ||<:> 69.5% ||<:> 66.0% ||<:> 67.0% ||<:> 81.8% ||<:> 69.8% ||<:> 71.3% ||<:> 15.9% || Variant: logistic regression ||
||Perspective Daily  ||<:> 74.0% ||<:> 76.0% ||<:> 43.5% ||<:> 66.0% ||<:> 68.0% ||<:> 38.1% ||<:> 69.8% ||<:> 72.8% ||<:> 2.7% || Variant: Averaging ||
||Alex          ||<:> 74.0% ||<:> 76.0% ||<:> 87.1% ||<:> 67.0% ||<:> 70.0% ||<:> 70.4% ||<:> 69.0% ||<:> 74.0% ||<:> 0.71% || Variant: logistic regression with averaging, 10 Iterations ||
||David         ||<:> 75.2% ||<:> 76.9% ||<:> 47.3% ||<:> 67.0% ||<:> 67.3% ||<:> 49.2% ||<:> 69.7% ||<:> 72.8% ||<:> 16.9% || Variant: Averaging, 10 Iterations ||
||Robin         ||<:> 70.0% ||<:> 72.0% ||<:> 82.6% ||<:> 64.0% ||<:> 68.5% ||<:> 34.1% ||<:> 70.4% ||<:> 72.0% ||<:> 15.4% || Variant: Averaging, 10 Iterations ||
||Daniel        ||<:> 73.6% ||<:> 72.7% ||<:> 66.8% ||<:> 67.3% ||<:> 67.6% ||<:> 47.5% ||<:> 70.4% ||<:> 72.5% ||<:> 0.0%  || Variant: Averaging, Logistic Regression, 10 Iterations ||