The rows are the txt words. The columns are hyp words.
Resource summary:Txt: An Italian became the world's greatest tenor.
Hyp: There was an Italian who became the world's greatest tenor. (yes)
There EX |
was VBD |
an DT |
Italian JJ |
who WP |
became VBD |
the DT |
world NN |
greatest JJS |
tenor NN |
|
An:DT | 1.31 | 1.21 | 0.13 | 1.44 | 1.16 | 1.25 | 1.31 | 1.00 | 1.18 | 1.00 |
Italian:JJ | 1.02 | 1.22 | 1.02 | 0.74 | 1.30 | 1.22 | 1.02 | 1.13 | 1.18 | 1.13 |
became:VBD | 0.74 | 0.93 | 0.76 | 1.28 | 0.54 | 1.32 | 0.76 | 0.91 | 0.99 | 0.89 |
the:DT | 1.20 | 1.25 | 1.31 | 1.44 | 1.09 | 1.25 | 2.42 | 1.00 | 1.18 | 1.00 |
world:NN | 0.71 | 0.82 | 0.71 | 1.24 | 1.58 | 0.82 | 0.71 | 0.28 | 0.38 | 0.84 |
greatest:JJS | 0.74 | 0.96 | 0.76 | 1.18 | 1.05 | 0.93 | 0.76 | 0.27 | 0.74 | 0.72 |
tenor:NN | 0.66 | 0.82 | 0.71 | 1.24 | 1.58 | 0.80 | 0.71 | 0.84 | 0.83 | 0.28 |
NO_WORD | 0.29 | 0.17 | 0.82 | 0.23 | 0.92 | 0.25 | 0.82 | 0.12 | 0.11 | 0.10 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.34 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 10.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "greatest" modifying "tenor"
0.00 1.00 NegPolarity.hypNegWord : "greatest": tag "JJS" is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : became
-1.00 1.00 NullPunisher.other : tenor
-0.10 1.00 NullPunisher.functionWord : who
-0.10 1.00 NullPunisher.functionWord : There
-1.00 1.00 NullPunisher.other : greatest
-0.10 1.00 NullPunisher.article : an
-0.10 1.00 NullPunisher.article : the
-3.00 1.00 NullPunisher.entity : Italian
-0.05 1.00 NullPunisher.aux : was
-1.00 1.00 NullPunisher.other : world
-2.00 1.00 RootEntailment.unalignedRoot : "was" not aligned to anything
Hand-tuned score (dot product of above): -11.6465
Threshold: -9.4738
Txt: Every Italian man wants to be a great tenor. Some Italian men are great tenors.
Hyp: There are Italian men who want to be a great tenor. (yes)
There EX |
are VBP |
Italian JJ |
men NNS |
who WP |
want VBP |
to TO |
be VB |
a DT |
great JJ |
tenor NN |
|
Every:DT | 1.22 | 1.25 | 1.44 | 1.00 | 1.16 | 1.25 | 1.31 | 1.25 | 1.31 | 1.14 | 0.95 |
Italian:JJ | 1.02 | 1.22 | 0.74 | 1.13 | 1.30 | 1.22 | 1.02 | 1.22 | 1.02 | 1.19 | 1.13 |
man:NN | 0.71 | 0.82 | 1.24 | 0.01 | 1.58 | 0.72 | 0.71 | 0.82 | 0.71 | 0.85 | 0.80 |
wants:VBZ | 0.76 | 0.80 | 1.28 | 0.82 | 0.54 | 1.10 | 0.76 | 0.80 | 0.76 | 0.93 | 0.86 |
to:TO | 1.31 | 1.25 | 1.44 | 1.00 | 1.12 | 1.25 | 2.42 | 1.25 | 1.31 | 1.18 | 0.96 |
be:VB | 0.76 | 0.14 | 1.28 | 0.86 | 0.54 | 0.88 | 0.76 | 1.32 | 0.76 | 1.02 | 0.91 |
a:DT | 1.31 | 1.25 | 1.44 | 1.00 | 1.16 | 1.25 | 1.31 | 1.25 | 2.42 | 1.18 | 1.00 |
great:JJ | 0.72 | 0.90 | 1.19 | 0.88 | 1.05 | 0.93 | 0.76 | 0.96 | 0.76 | 0.74 | 0.60 |
tenor:NN | 0.66 | 0.82 | 1.24 | 0.79 | 1.58 | 0.79 | 0.67 | 0.82 | 0.71 | 0.70 | 0.28 |
Some:DT | 1.28 | 1.22 | 1.44 | 0.96 | 1.16 | 1.25 | 1.31 | 1.25 | 1.31 | 1.18 | 1.00 |
Italian:JJ | 1.02 | 1.22 | 0.74 | 1.13 | 1.30 | 1.22 | 1.02 | 1.22 | 1.02 | 1.19 | 1.13 |
men:NNS | 0.71 | 0.82 | 1.24 | 0.28 | 1.58 | 0.72 | 0.71 | 0.77 | 0.71 | 0.99 | 0.79 |
are:VBP | 0.70 | 1.32 | 1.28 | 0.91 | 0.54 | 0.85 | 0.76 | 0.14 | 0.76 | 0.96 | 0.91 |
great:JJ | 0.72 | 0.90 | 1.19 | 0.88 | 1.05 | 0.93 | 0.76 | 0.96 | 0.76 | 0.74 | 0.60 |
tenors:NNS | 0.65 | 0.82 | 1.23 | 0.66 | 1.58 | 0.69 | 0.71 | 0.82 | 0.71 | 0.99 | 0.99 |
NO_WORD | 0.29 | 0.17 | 0.11 | 0.28 | 0.92 | 0.25 | 1.55 | 0.34 | 0.82 | 0.11 | 0.10 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.58 Alignment.score
1.00 0.27 Alignment.isGood
-1.00 0.71 Alignment.isBad
-0.10 8.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.27 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "great" modifying "tenor"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "great" modifying "tenors" is dropped on aligned hypothesis word "tenor"
-0.05 1.00 NullPunisher.aux : are
-3.00 1.00 NullPunisher.entity : Italian
-0.10 1.00 NullPunisher.functionWord : who
-0.10 1.00 NullPunisher.article : a
-0.10 1.00 NullPunisher.functionWord : to
-0.10 1.00 NullPunisher.functionWord : There
-1.00 1.00 NullPunisher.other : great
-0.05 1.00 NullPunisher.aux : be
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -7.5273
Threshold: -9.4738
Txt: All Italian men want to be a great tenor. Some Italian men are great tenors.
Hyp: There are Italian men who want to be a great tenor. (yes)
There EX |
are VBP |
Italian JJ |
men NNS |
who WP |
want VBP |
to TO |
be VB |
a DT |
great JJ |
tenor NN |
|
All:DT | 1.31 | 1.18 | 1.44 | 1.00 | 1.16 | 1.22 | 1.31 | 1.25 | 1.31 | 1.18 | 1.00 |
Italian:JJ | 1.02 | 1.22 | 0.74 | 1.13 | 1.30 | 1.22 | 1.02 | 1.22 | 1.02 | 1.19 | 1.13 |
men:NNS | 0.71 | 0.82 | 1.24 | 0.28 | 1.58 | 0.72 | 0.71 | 0.77 | 0.71 | 0.99 | 0.79 |
want:VBP | 0.76 | 0.77 | 1.28 | 0.81 | 0.51 | 1.32 | 0.76 | 0.80 | 0.76 | 0.99 | 0.88 |
to:TO | 1.31 | 1.25 | 1.44 | 1.00 | 1.12 | 1.25 | 2.42 | 1.25 | 1.31 | 1.18 | 0.96 |
be:VB | 0.76 | 0.14 | 1.28 | 0.86 | 0.54 | 0.88 | 0.76 | 1.32 | 0.76 | 1.02 | 0.91 |
a:DT | 1.31 | 1.25 | 1.44 | 1.00 | 1.16 | 1.25 | 1.31 | 1.25 | 2.42 | 1.18 | 1.00 |
great:JJ | 0.72 | 0.90 | 1.19 | 0.88 | 1.05 | 0.93 | 0.76 | 0.96 | 0.76 | 0.74 | 0.60 |
tenor:NN | 0.66 | 0.82 | 1.24 | 0.79 | 1.58 | 0.79 | 0.67 | 0.82 | 0.71 | 0.70 | 0.28 |
Some:DT | 1.28 | 1.22 | 1.44 | 0.96 | 1.16 | 1.25 | 1.31 | 1.25 | 1.31 | 1.18 | 1.00 |
Italian:JJ | 1.02 | 1.22 | 0.74 | 1.13 | 1.30 | 1.22 | 1.02 | 1.22 | 1.02 | 1.19 | 1.13 |
men:NNS | 0.71 | 0.82 | 1.24 | 0.28 | 1.58 | 0.72 | 0.71 | 0.77 | 0.71 | 0.99 | 0.79 |
are:VBP | 0.70 | 1.32 | 1.28 | 0.91 | 0.54 | 0.85 | 0.76 | 0.14 | 0.76 | 0.96 | 0.91 |
great:JJ | 0.72 | 0.90 | 1.19 | 0.88 | 1.05 | 0.93 | 0.76 | 0.96 | 0.76 | 0.74 | 0.60 |
tenors:NNS | 0.65 | 0.82 | 1.23 | 0.66 | 1.58 | 0.69 | 0.71 | 0.82 | 0.71 | 0.99 | 0.99 |
NO_WORD | 0.29 | 0.17 | 0.11 | 0.28 | 0.92 | 0.25 | 1.55 | 0.34 | 0.82 | 0.11 | 0.10 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.48 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 10.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.09 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "great" modifying "tenor"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "great" modifying "tenors" is dropped on aligned hypothesis word "tenor"
-0.10 1.00 NullPunisher.functionWord : There
-1.00 1.00 NullPunisher.other : great
-1.00 1.00 NullPunisher.other : men
-0.10 1.00 NullPunisher.article : a
-3.00 1.00 NullPunisher.entity : Italian
-0.05 1.00 NullPunisher.aux : are
-1.00 1.00 NullPunisher.other : want
-0.10 1.00 NullPunisher.functionWord : who
-0.05 1.00 NullPunisher.aux : be
-0.10 1.00 NullPunisher.functionWord : to
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -9.8908
Threshold: -9.4738
Txt: Each Italian tenor wants to be great. Some Italian tenors are great.
Hyp: There are Italian tenors who want to be great. (yes)
There EX |
are VBP |
Italian JJ |
tenors RB |
who WP |
want VBP |
to TO |
be VB |
great JJ |
|
Each:DT | 1.31 | 1.22 | 1.44 | 1.45 | 1.16 | 1.20 | 1.31 | 1.25 | 1.16 |
Italian:JJ | 1.02 | 1.22 | 0.74 | 1.25 | 1.30 | 1.22 | 1.02 | 1.22 | 1.19 |
tenor:NN | 0.66 | 0.82 | 1.24 | 0.48 | 1.58 | 0.79 | 0.67 | 0.82 | 0.70 |
wants:VBZ | 0.76 | 0.80 | 1.28 | 1.05 | 0.54 | 1.10 | 0.76 | 0.80 | 0.93 |
to:TO | 1.31 | 1.25 | 1.44 | 1.45 | 1.12 | 1.25 | 2.42 | 1.25 | 1.18 |
be:VB | 0.76 | 0.14 | 1.28 | 1.11 | 0.54 | 0.88 | 0.76 | 1.32 | 1.02 |
great:JJ | 0.72 | 0.90 | 1.19 | 1.01 | 1.05 | 0.93 | 0.76 | 0.96 | 0.74 |
Some:DT | 1.28 | 1.22 | 1.44 | 1.45 | 1.16 | 1.25 | 1.31 | 1.25 | 1.18 |
Italian:JJ | 1.02 | 1.22 | 0.74 | 1.25 | 1.30 | 1.22 | 1.02 | 1.22 | 1.19 |
tenors:NNS | 0.65 | 0.82 | 1.23 | 2.58 | 1.58 | 0.69 | 0.71 | 0.82 | 0.99 |
are:VBP | 0.70 | 1.32 | 1.28 | 1.11 | 0.54 | 0.85 | 0.76 | 0.14 | 0.96 |
great:JJ | 0.72 | 0.90 | 1.19 | 1.01 | 1.05 | 0.93 | 0.76 | 0.96 | 0.74 |
NO_WORD | 0.29 | 0.17 | 0.09 | 0.04 | 1.07 | 0.31 | 1.55 | 1.43 | 0.32 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.91 Alignment.score
1.00 0.34 Alignment.isGood
-1.00 0.64 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.22 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "Italian" modifying "tenors" is dropped on aligned hypothesis word "tenors"
-0.10 1.00 NullPunisher.functionWord : who
-0.10 1.00 NullPunisher.functionWord : There
-0.05 1.00 NullPunisher.aux : be
-1.00 1.00 NullPunisher.other : great
-0.05 1.00 NullPunisher.aux : are
-3.00 1.00 NullPunisher.entity : Italian
-0.10 1.00 NullPunisher.functionWord : to
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -5.8723
Threshold: -9.4738
Txt: The really ambitious tenors are Italian.
Hyp: There are really ambitious tenors who are Italian. (yes)
There EX |
are VBP |
really RB |
ambitious JJ |
tenors RB |
who WP |
are VBP |
Italian JJ |
|
The:DT | 1.20 | 1.18 | 1.45 | 1.18 | 1.45 | 1.09 | 1.18 | 1.44 |
really:RB | 1.09 | 0.91 | 1.23 | 1.05 | 0.70 | 1.05 | 0.91 | 1.25 |
ambitious:JJ | 0.76 | 0.96 | 1.01 | 0.74 | 0.82 | 1.05 | 0.96 | 1.19 |
tenors:NNS | 0.65 | 0.82 | 1.15 | 0.79 | 2.58 | 1.58 | 0.82 | 1.23 |
are:VBP | 0.70 | 1.32 | 1.11 | 1.02 | 1.11 | 0.54 | 1.32 | 1.28 |
Italian:JJ | 1.02 | 1.22 | 1.22 | 1.19 | 1.25 | 1.30 | 1.22 | 0.74 |
NO_WORD | 0.29 | 0.17 | 0.04 | 0.09 | 0.04 | 1.07 | 0.31 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.53 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.12 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "really" modifying "ambitious"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "ambitious" modifying "tenors" is dropped on aligned hypothesis word "tenors"
-1.00 1.00 NullPunisher.other : ambitious
-1.00 1.00 NullPunisher.other : really
-0.05 1.00 NullPunisher.aux : are
-0.05 1.00 NullPunisher.aux : are
-0.10 1.00 NullPunisher.functionWord : There
-3.00 1.00 NullPunisher.entity : Italian
-0.10 1.00 NullPunisher.functionWord : who
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -8.3104
Threshold: -9.4738
Txt: No really great tenors are modest.
Hyp: There are really great tenors who are modest. (don't know)
There EX |
are VBP |
really RB |
great JJ |
tenors RB |
who WP |
are VBP |
modest JJ |
|
No:DT | 1.31 | 1.25 | 1.45 | 1.18 | 1.45 | 1.12 | 1.25 | 1.18 |
really:RB | 1.09 | 0.91 | 1.23 | 0.58 | 0.70 | 1.05 | 0.91 | 1.05 |
great:JJ | 0.72 | 0.90 | 0.54 | 0.74 | 1.01 | 1.05 | 0.90 | 0.70 |
tenors:NNS | 0.65 | 0.82 | 1.15 | 0.99 | 2.58 | 1.58 | 0.82 | 0.89 |
are:VBP | 0.70 | 1.32 | 1.11 | 0.96 | 1.11 | 0.54 | 1.32 | 1.02 |
modest:JJ | 0.74 | 0.96 | 1.01 | 0.70 | 0.92 | 1.05 | 0.96 | 0.74 |
NO_WORD | 0.29 | 0.17 | 0.04 | 0.09 | 0.04 | 1.07 | 0.31 | 0.09 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.53 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.12 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "really" modifying "great"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "great" modifying "tenors" is dropped on aligned hypothesis word "tenors"
-0.05 1.00 NullPunisher.aux : are
-0.10 1.00 NullPunisher.functionWord : There
-0.10 1.00 NullPunisher.functionWord : who
-1.00 1.00 NullPunisher.other : really
-1.00 1.00 NullPunisher.other : modest
-1.00 1.00 NullPunisher.other : great
-0.05 1.00 NullPunisher.aux : are
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -6.3104
Threshold: -9.4738
Txt: Some great tenors are Swedish.
Hyp: There are great tenors who are Swedish. (yes)
There EX |
are VBP |
great JJ |
tenors JJ |
who WP |
are VBP |
Swedish NNP |
|
Some:DT | 1.28 | 1.22 | 1.18 | 1.18 | 1.16 | 1.22 | 1.25 |
great:JJ | 0.72 | 0.90 | 0.74 | 0.93 | 1.05 | 0.90 | 1.13 |
tenors:NNS | 0.65 | 0.82 | 0.99 | 1.93 | 1.58 | 0.82 | 0.97 |
are:VBP | 0.70 | 1.32 | 0.96 | 1.02 | 0.54 | 1.32 | 1.16 |
Swedish:NNP | 0.96 | 1.07 | 1.24 | 1.23 | 1.84 | 1.07 | 0.28 |
NO_WORD | 0.29 | 0.17 | 0.09 | 0.29 | 0.54 | 0.36 | 0.10 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.47 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.14 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "great" modifying "tenors" is dropped on aligned hypothesis word "tenors"
-0.05 1.00 NullPunisher.aux : are
-0.05 1.00 NullPunisher.aux : are
-3.00 1.00 NullPunisher.entity : Swedish
-0.10 1.00 NullPunisher.functionWord : who
-0.10 1.00 NullPunisher.functionWord : There
-1.00 1.00 NullPunisher.other : great
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -6.2944
Threshold: -9.4738
Txt: Many great tenors are German.
Hyp: There are great tenors who are German. (yes)
There EX |
are VBP |
great JJ |
tenors RB |
who WP |
are VBP |
German JJ |
|
Many:JJ | 0.76 | 0.93 | 0.93 | 1.01 | 1.05 | 0.93 | 1.15 |
great:JJ | 0.72 | 0.90 | 0.74 | 1.01 | 1.05 | 0.90 | 1.09 |
tenors:NNS | 0.65 | 0.82 | 0.99 | 2.58 | 1.58 | 0.82 | 1.21 |
are:VBP | 0.70 | 1.32 | 0.96 | 1.11 | 0.54 | 1.32 | 1.28 |
German:JJ | 1.00 | 1.22 | 1.09 | 1.23 | 1.30 | 1.22 | 0.74 |
NO_WORD | 0.29 | 0.17 | 0.09 | 0.04 | 1.07 | 0.31 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.61 Alignment.score
1.00 0.27 Alignment.isGood
-1.00 0.70 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.14 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "great" modifying "tenors" is dropped on aligned hypothesis word "tenors"
-3.00 1.00 NullPunisher.entity : German
-0.05 1.00 NullPunisher.aux : are
-0.10 1.00 NullPunisher.functionWord : There
-0.10 1.00 NullPunisher.functionWord : who
-1.00 1.00 NullPunisher.other : great
-0.05 1.00 NullPunisher.aux : are
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -6.1099
Threshold: -9.4738
Txt: Several great tenors are British.
Hyp: There are great tenors who are British. (yes)
There EX |
are VBP |
great JJ |
tenors RB |
who WP |
are VBP |
British JJ |
|
Several:JJ | 0.72 | 0.96 | 0.90 | 0.96 | 1.05 | 0.96 | 1.19 |
great:JJ | 0.72 | 0.90 | 0.74 | 1.01 | 1.05 | 0.90 | 1.19 |
tenors:NNS | 0.65 | 0.82 | 0.99 | 2.58 | 1.58 | 0.82 | 1.23 |
are:VBP | 0.70 | 1.32 | 0.96 | 1.11 | 0.54 | 1.32 | 1.28 |
British:JJ | 1.02 | 1.22 | 1.19 | 1.25 | 1.30 | 1.22 | 0.74 |
NO_WORD | 0.29 | 0.17 | 0.09 | 0.04 | 1.07 | 0.31 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.61 Alignment.score
1.00 0.27 Alignment.isGood
-1.00 0.70 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.14 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "great" modifying "tenors" is dropped on aligned hypothesis word "tenors"
-0.10 1.00 NullPunisher.functionWord : There
-0.05 1.00 NullPunisher.aux : are
-0.05 1.00 NullPunisher.aux : are
-1.00 1.00 NullPunisher.other : great
-0.10 1.00 NullPunisher.functionWord : who
-3.00 1.00 NullPunisher.entity : British
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -6.1099
Threshold: -9.4738
Txt: Most great tenors are Italian.
Hyp: There are great tenors who are Italian. (yes)
There EX |
are VBP |
great JJ |
tenors RB |
who WP |
are VBP |
Italian JJ |
|
Most:JJS | 0.76 | 0.96 | 0.91 | 1.01 | 1.05 | 0.96 | 1.19 |
great:JJ | 0.72 | 0.90 | 0.74 | 1.01 | 1.05 | 0.90 | 1.19 |
tenors:NNS | 0.65 | 0.82 | 0.99 | 2.58 | 1.58 | 0.82 | 1.23 |
are:VBP | 0.70 | 1.32 | 0.96 | 1.11 | 0.54 | 1.32 | 1.28 |
Italian:JJ | 1.02 | 1.22 | 1.19 | 1.25 | 1.30 | 1.22 | 0.74 |
NO_WORD | 0.29 | 0.17 | 0.09 | 0.04 | 1.07 | 0.31 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.61 Alignment.score
1.00 0.27 Alignment.isGood
-1.00 0.70 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.14 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "great" modifying "tenors" is dropped on aligned hypothesis word "tenors"
-3.00 1.00 NullPunisher.entity : Italian
-0.10 1.00 NullPunisher.functionWord : who
-0.10 1.00 NullPunisher.functionWord : There
-0.05 1.00 NullPunisher.aux : are
-1.00 1.00 NullPunisher.other : great
-0.05 1.00 NullPunisher.aux : are
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -6.1099
Threshold: -9.4738
Txt: A few great tenors sing popular music. Some great tenors like popular music.
Hyp: There are great tenors who sing popular music. (yes)
There EX |
are VBP |
great JJ |
tenors RB |
who WP |
sing VBP |
popular_music NN |
|
A:DT | 1.31 | 1.25 | 1.18 | 1.45 | 1.16 | 1.25 | 1.00 |
few:JJ | 0.76 | 0.96 | 0.93 | 1.01 | 1.05 | 0.96 | 0.88 |
great:JJ | 0.72 | 0.90 | 0.74 | 1.01 | 1.05 | 0.87 | 0.62 |
tenors:NNS | 0.65 | 0.82 | 0.99 | 2.58 | 1.58 | 0.47 | 0.55 |
sing:VBP | 0.76 | 0.72 | 0.93 | 0.77 | 0.54 | 1.32 | 0.44 |
popular_music:NN | 0.71 | 0.82 | 0.73 | 1.11 | 1.58 | 0.35 | 0.28 |
Some:DT | 1.28 | 1.22 | 1.18 | 1.45 | 1.16 | 1.20 | 1.00 |
great:JJ | 0.72 | 0.90 | 0.74 | 1.01 | 1.05 | 0.87 | 0.62 |
tenors:NNS | 0.65 | 0.82 | 0.99 | 2.58 | 1.58 | 0.47 | 0.55 |
like:VBP | 0.73 | 0.76 | 0.61 | 1.11 | 0.54 | 0.13 | 0.66 |
popular_music:NN | 0.71 | 0.82 | 0.73 | 1.11 | 1.58 | 0.35 | 0.28 |
NO_WORD | 0.29 | 0.17 | 0.09 | 0.04 | 1.07 | 0.31 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.63 Alignment.score
1.00 0.28 Alignment.isGood
-1.00 0.70 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.14 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "great" modifying "tenors" is dropped on aligned hypothesis word "tenors"
-1.00 1.00 NullPunisher.other : great
-0.10 1.00 NullPunisher.functionWord : who
-1.00 1.00 NullPunisher.other : popular_music
-0.05 1.00 NullPunisher.aux : are
-0.10 1.00 NullPunisher.functionWord : There
-1.00 1.00 NullPunisher.other : sing
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -5.0239
Threshold: -9.4738
Txt: Both leading tenors are excellent. Leading tenors who are excellent are indispensable.
Hyp: Both leading tenors are indispensable. (yes)
Both DT |
leading JJ |
tenors NNS |
are VBP |
indispensable JJ |
|
Both:DT | 2.42 | 1.18 | 1.00 | 1.25 | 1.18 |
leading:JJ | 0.76 | 0.74 | 0.86 | 0.96 | 0.75 |
tenors:NNS | 0.71 | 0.97 | 0.28 | 0.82 | 0.83 |
are:VBP | 0.76 | 1.02 | 0.91 | 1.32 | 1.02 |
excellent:JJ | 0.76 | 0.91 | 0.88 | 0.96 | 0.66 |
Leading:VBG | 0.76 | 0.84 | 0.89 | 0.81 | 1.02 |
tenors:NNS | 0.71 | 0.97 | 0.28 | 0.82 | 0.83 |
who:WP | 1.11 | 1.39 | 1.30 | 0.97 | 1.39 |
are:VBP | 0.76 | 1.02 | 0.91 | 1.32 | 1.02 |
excellent:JJ | 0.76 | 0.91 | 0.88 | 0.96 | 0.66 |
are:VBP | 0.76 | 1.02 | 0.91 | 1.32 | 1.02 |
indispensable:JJ | 0.76 | 0.75 | 0.72 | 0.96 | 0.74 |
NO_WORD | 0.82 | 0.11 | 0.28 | 1.43 | 0.16 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.59 Alignment.score
1.00 0.27 Alignment.isGood
-1.00 0.70 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.20 Alignment.txtSpan
-0.05 1.00 NullPunisher.aux : are
-1.00 1.00 NullPunisher.other : Both
-1.00 1.00 NullPunisher.other : indispensable
-1.00 1.00 NullPunisher.other : tenors
-2.00 1.00 RootEntailment.unalignedRoot : "indispensable" not aligned to anything
-1.00 1.00 Structure.relMismatch : hypothesis noun modifier "leading" and aligned text noun modifier "tenors" bear the same relation ("amod") but different parents ("tenors" vs. "tenors")
Hand-tuned score (dot product of above): -6.1702
Threshold: -9.4738
Txt: Neither leading tenor comes cheap. One of the leading tenors is Pavarotti.
Hyp: Pavarotti is a leading tenor who comes cheap. (don't know)
Pavarotti NNP |
is VBZ |
a DT |
leading VBG |
tenor NN |
who WP |
comes VBZ |
cheap JJ |
|
Neither:DT | 1.25 | 1.25 | 1.31 | 1.22 | 0.96 | 1.16 | 1.25 | 1.18 |
leading:JJ | 1.13 | 0.96 | 0.76 | 1.16 | 0.88 | 1.05 | 0.95 | 0.93 |
tenor:NN | 1.07 | 0.82 | 0.71 | 0.82 | 0.28 | 1.58 | 0.82 | 0.99 |
comes:VBZ | 1.16 | 0.81 | 0.76 | 0.67 | 0.91 | 0.54 | 1.32 | 0.81 |
cheap:JJ | 1.13 | 0.96 | 0.76 | 0.96 | 0.88 | 1.05 | 0.75 | 0.74 |
One:CD | 1.24 | 1.30 | 0.89 | 1.30 | 1.33 | 1.32 | 1.25 | 1.00 |
the:DT | 1.25 | 1.25 | 1.31 | 1.25 | 1.00 | 1.09 | 1.25 | 1.13 |
leading:JJ | 1.13 | 0.96 | 0.76 | 1.16 | 0.88 | 1.05 | 0.95 | 0.93 |
tenors:NNS | 1.07 | 0.82 | 0.71 | 0.80 | 0.99 | 1.58 | 0.80 | 0.99 |
is:VBZ | 1.16 | 1.32 | 0.76 | 0.89 | 0.91 | 0.54 | 0.90 | 1.02 |
Pavarotti:NNP | 0.28 | 1.07 | 0.96 | 1.07 | 0.99 | 1.84 | 1.07 | 1.24 |
NO_WORD | 0.28 | 1.43 | 0.82 | 0.12 | 0.28 | 0.92 | 0.25 | 0.09 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.75 Alignment.score
1.00 0.30 Alignment.isGood
-1.00 0.67 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "comes" modifying "tenor"
-1.00 1.00 NullPunisher.other : cheap
-0.10 1.00 NullPunisher.article : a
-3.00 1.00 NullPunisher.entity : Pavarotti
-1.00 1.00 NullPunisher.other : comes
-0.10 1.00 NullPunisher.functionWord : who
-0.05 1.00 NullPunisher.aux : is
1.00 1.00 Quant.equivalent : Replacing the quantifier "the" by an equivalent quantifier "a" preserves truth.
Hand-tuned score (dot product of above): -5.2429
Threshold: -9.4738
Txt: At least three tenors will take part in the concert.
Hyp: There are tenors who will take part in the concert. (yes)
There EX |
are VBP |
tenors JJ |
who WP |
will MD |
take_part VB |
the DT |
concert NN |
|
At:IN | 1.05 | 1.20 | 0.77 | 1.57 | 1.05 | 1.24 | 1.05 | 1.31 |
least:JJS | 0.76 | 0.96 | 0.91 | 1.05 | 0.76 | 0.93 | 0.76 | 0.88 |
three:CD | 0.76 | 1.25 | 0.83 | 1.32 | 0.89 | 1.16 | 0.78 | 1.33 |
tenors:NNS | 0.65 | 0.82 | 1.93 | 1.58 | 0.71 | 0.77 | 0.71 | 0.55 |
will:MD | 1.31 | 1.60 | 1.18 | 1.13 | 2.42 | 1.37 | 1.31 | 1.00 |
take_part:VB | 0.73 | 0.75 | 0.97 | 0.54 | 0.87 | 1.32 | 0.76 | 0.88 |
the:DT | 1.20 | 1.18 | 1.18 | 1.09 | 1.31 | 1.25 | 2.42 | 1.00 |
concert:NN | 0.67 | 0.82 | 0.73 | 1.58 | 0.71 | 0.79 | 0.71 | 0.28 |
NO_WORD | 0.29 | 0.17 | 0.09 | 0.54 | 1.55 | 0.36 | 0.82 | 0.07 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.70 Alignment.score
1.00 0.29 Alignment.isGood
-1.00 0.68 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.12 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "concert" modifying "take_part"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "three" modifying "tenors" is dropped on aligned hypothesis word "tenors"
-1.00 1.00 NullPunisher.other : take_part
-0.10 1.00 NullPunisher.functionWord : There
-0.10 1.00 NullPunisher.article : the
-0.10 1.00 NullPunisher.functionWord : who
-0.05 1.00 NullPunisher.aux : will
-1.00 1.00 NullPunisher.other : concert
-0.05 1.00 NullPunisher.aux : are
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -5.1777
Threshold: -9.4738
Txt: An Irishman won the Nobel prize for literature.
Hyp: An Irishman won a Nobel prize. (yes)
An DT |
Irishman NNP |
won VBD |
a DT |
Nobel_prize NN |
|
An:DT | 2.42 | 1.25 | 1.21 | 1.14 | 1.00 |
Irishman:NNP | 0.96 | 0.28 | 1.07 | 0.96 | 1.07 |
won:VBD | 0.71 | 1.16 | 1.32 | 0.76 | 0.57 |
the:DT | 1.31 | 1.25 | 1.25 | 1.31 | 1.00 |
Nobel_prize:NN | 0.71 | 1.07 | 0.49 | 0.71 | 0.28 |
literature:NN | 0.71 | 1.07 | 0.82 | 0.71 | 0.38 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.25 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.36 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : Nobel_prize
-0.10 1.00 NullPunisher.article : An
-1.00 1.00 NullPunisher.other : won
-0.10 1.00 NullPunisher.article : a
-3.00 1.00 NullPunisher.entity : Irishman
-2.00 1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -7.8694
Threshold: -9.4738
Txt: Every European has the right to live in Europe. Every European is a person. Every person who has the right to live in Europe can travel freely within Europe.
Hyp: Every European can travel freely within Europe. (yes)
Every DT |
European NNP |
can MD |
travel VB |
freely RB |
Europe NNP |
|
Every:DT | 2.42 | 1.24 | 1.31 | 1.23 | 1.39 | 1.23 |
European:NNP | 0.95 | 0.28 | 0.96 | 1.04 | 1.53 | 0.28 |
has:VBZ | 0.76 | 1.16 | 0.93 | 0.94 | 1.11 | 1.16 |
the:DT | 1.31 | 1.25 | 1.31 | 1.23 | 1.45 | 1.25 |
right:NN | 0.71 | 1.18 | 0.71 | 0.80 | 1.19 | 1.17 |
to:TO | 1.31 | 1.25 | 1.31 | 1.25 | 1.45 | 1.25 |
live:VB | 0.73 | 1.16 | 0.87 | 0.59 | 0.96 | 1.16 |
Europe:NNP | 0.94 | 0.28 | 0.96 | 1.04 | 1.56 | 0.28 |
Every:DT | 2.42 | 1.24 | 1.31 | 1.23 | 1.39 | 1.23 |
European:NNP | 0.95 | 0.28 | 0.96 | 1.04 | 1.53 | 0.28 |
is:VBZ | 0.76 | 1.16 | 1.06 | 1.00 | 1.11 | 1.16 |
a:DT | 1.31 | 1.25 | 1.31 | 1.25 | 1.45 | 1.25 |
person:NN | 0.69 | 1.11 | 0.71 | 0.59 | 1.22 | 1.17 |
Every:DT | 2.42 | 1.24 | 1.31 | 1.23 | 1.39 | 1.23 |
person:NN | 0.69 | 1.11 | 0.71 | 0.59 | 1.22 | 1.17 |
who:WP | 1.11 | 1.55 | 1.18 | 0.97 | 1.05 | 1.55 |
has:VBZ | 0.76 | 1.16 | 0.93 | 0.94 | 1.11 | 1.16 |
the:DT | 1.31 | 1.25 | 1.31 | 1.23 | 1.45 | 1.25 |
right:NN | 0.71 | 1.18 | 0.71 | 0.80 | 1.19 | 1.17 |
to:TO | 1.31 | 1.25 | 1.31 | 1.25 | 1.45 | 1.25 |
live:VB | 0.73 | 1.16 | 0.87 | 0.59 | 0.96 | 1.16 |
Europe:NNP | 0.94 | 0.28 | 0.96 | 1.04 | 1.56 | 0.28 |
can:MD | 1.31 | 1.25 | 2.42 | 1.51 | 1.45 | 1.25 |
travel:VB | 0.74 | 1.13 | 1.02 | 1.32 | 0.90 | 1.13 |
freely:RB | 1.03 | 1.44 | 1.09 | 0.70 | 1.23 | 1.47 |
Europe:NNP | 0.94 | 0.28 | 0.96 | 1.04 | 1.56 | 0.28 |
NO_WORD | 0.82 | 0.28 | 1.55 | 0.17 | 0.04 | 0.33 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.53 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
2.00 1.00 Modal.yes : actual -> possible
-1.00 1.00 NullPunisher.other : travel
-0.05 1.00 NullPunisher.aux : can
-1.00 1.00 NullPunisher.other : Every
-1.00 1.00 NullPunisher.other : freely
-2.00 1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
Hand-tuned score (dot product of above): -4.2688
Threshold: -9.4738
Txt: All Europeans have the right to live in Europe. Every European is a person. Every person who has the right to live in Europe can travel freely within Europe.
Hyp: All Europeans can travel freely within Europe. (yes)
All DT |
Europeans NNPS |
can MD |
travel VB |
freely RB |
Europe NNP |
|
All:DT | 2.42 | 1.25 | 1.31 | 1.23 | 1.45 | 1.25 |
Europeans:NNPS | 0.96 | 0.28 | 0.96 | 1.06 | 1.55 | 0.46 |
have:VBP | 0.73 | 1.16 | 0.97 | 0.27 | 1.11 | 1.16 |
the:DT | 1.31 | 1.25 | 1.31 | 1.23 | 1.45 | 1.25 |
right:NN | 0.71 | 1.18 | 0.71 | 0.80 | 1.19 | 1.17 |
to:TO | 1.31 | 1.25 | 1.31 | 1.25 | 1.45 | 1.25 |
live:VB | 0.76 | 1.16 | 0.87 | 0.59 | 0.96 | 1.16 |
Europe:NNP | 0.96 | 0.46 | 0.96 | 1.04 | 1.56 | 0.28 |
Every:DT | 1.31 | 1.25 | 1.31 | 1.23 | 1.39 | 1.23 |
European:NNP | 0.96 | 1.37 | 0.96 | 1.04 | 1.53 | 0.28 |
is:VBZ | 0.76 | 1.16 | 1.06 | 1.00 | 1.11 | 1.16 |
a:DT | 1.31 | 1.25 | 1.31 | 1.25 | 1.45 | 1.25 |
person:NN | 0.71 | 1.12 | 0.71 | 0.59 | 1.22 | 1.17 |
Every:DT | 1.31 | 1.25 | 1.31 | 1.23 | 1.39 | 1.23 |
person:NN | 0.71 | 1.12 | 0.71 | 0.59 | 1.22 | 1.17 |
who:WP | 1.11 | 1.55 | 1.18 | 0.97 | 1.05 | 1.55 |
has:VBZ | 0.76 | 1.16 | 0.93 | 0.94 | 1.11 | 1.16 |
the:DT | 1.31 | 1.25 | 1.31 | 1.23 | 1.45 | 1.25 |
right:NN | 0.71 | 1.18 | 0.71 | 0.80 | 1.19 | 1.17 |
to:TO | 1.31 | 1.25 | 1.31 | 1.25 | 1.45 | 1.25 |
live:VB | 0.76 | 1.16 | 0.87 | 0.59 | 0.96 | 1.16 |
Europe:NNP | 0.96 | 0.46 | 0.96 | 1.04 | 1.56 | 0.28 |
can:MD | 1.31 | 1.25 | 2.42 | 1.51 | 1.45 | 1.25 |
travel:VB | 0.73 | 1.15 | 1.02 | 1.32 | 0.90 | 1.13 |
freely:RB | 1.09 | 1.46 | 1.09 | 0.70 | 1.23 | 1.47 |
Europe:NNP | 0.96 | 0.46 | 0.96 | 1.04 | 1.56 | 0.28 |
NO_WORD | 0.82 | 0.28 | 1.55 | 0.17 | 0.04 | 0.33 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.69 Alignment.score
1.00 0.29 Alignment.isGood
-1.00 0.68 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.50 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
2.00 1.00 Modal.yes : actual -> possible
-1.00 1.00 NullPunisher.other : freely
-0.05 1.00 NullPunisher.aux : can
-1.00 1.00 NullPunisher.other : All
1.00 1.00 Quant.equivalent : Replacing the quantifier "every" by an equivalent quantifier "all" preserves truth.
-3.00 1.00 Structure.argsMismatch : args have different parents but same relations: text "European" <-nsubj-- "person" vs. hyp "Europeans" <-nsubj-- "travel", which aligned to text "have"
Hand-tuned score (dot product of above): -2.9090
Threshold: -9.4738
Txt: Each European has the right to live in Europe. Every European is a person. Every person who has the right to live in Europe can travel freely within Europe.
Hyp: Each European can travel freely within Europe. (yes)
Each DT |
European NNP |
can MD |
travel VB |
freely RB |
Europe NNP |
|
Each:DT | 2.42 | 1.00 | 1.53 | 1.51 | 1.71 | 1.17 |
European:NNP | 0.71 | 0.28 | 0.96 | 1.04 | 1.53 | 0.28 |
has:VBZ | 0.98 | 1.16 | 0.93 | 0.94 | 1.11 | 1.16 |
the:DT | 1.56 | 1.25 | 1.31 | 1.23 | 1.45 | 1.25 |
right:NN | 0.94 | 1.18 | 0.71 | 0.80 | 1.19 | 1.17 |
to:TO | 1.56 | 1.25 | 1.31 | 1.25 | 1.45 | 1.25 |
live:VB | 1.01 | 1.16 | 0.87 | 0.59 | 0.96 | 1.16 |
Europe:NNP | 0.87 | 0.28 | 0.96 | 1.04 | 1.56 | 0.28 |
Every:DT | 1.54 | 1.24 | 1.31 | 1.23 | 1.39 | 1.23 |
European:NNP | 0.71 | 0.28 | 0.96 | 1.04 | 1.53 | 0.28 |
is:VBZ | 1.01 | 1.16 | 1.06 | 1.00 | 1.11 | 1.16 |
a:DT | 1.56 | 1.25 | 1.31 | 1.25 | 1.45 | 1.25 |
person:NN | 0.96 | 1.11 | 0.71 | 0.59 | 1.22 | 1.17 |
Every:DT | 1.54 | 1.24 | 1.31 | 1.23 | 1.39 | 1.23 |
person:NN | 0.96 | 1.11 | 0.71 | 0.59 | 1.22 | 1.17 |
who:WP | 1.37 | 1.55 | 1.18 | 0.97 | 1.05 | 1.55 |
has:VBZ | 0.98 | 1.16 | 0.93 | 0.94 | 1.11 | 1.16 |
the:DT | 1.56 | 1.25 | 1.31 | 1.23 | 1.45 | 1.25 |
right:NN | 0.94 | 1.18 | 0.71 | 0.80 | 1.19 | 1.17 |
to:TO | 1.56 | 1.25 | 1.31 | 1.25 | 1.45 | 1.25 |
live:VB | 1.01 | 1.16 | 0.87 | 0.59 | 0.96 | 1.16 |
Europe:NNP | 0.87 | 0.28 | 0.96 | 1.04 | 1.56 | 0.28 |
can:MD | 1.53 | 1.25 | 2.42 | 1.51 | 1.45 | 1.25 |
travel:VB | 1.01 | 1.13 | 1.02 | 1.32 | 0.90 | 1.13 |
freely:RB | 1.35 | 1.44 | 1.09 | 0.70 | 1.23 | 1.47 |
Europe:NNP | 0.87 | 0.28 | 0.96 | 1.04 | 1.56 | 0.28 |
NO_WORD | 0.82 | 0.28 | 1.55 | 0.17 | 0.04 | 0.33 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.53 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
2.00 1.00 Modal.yes : actual -> possible
-1.00 1.00 NullPunisher.other : travel
-0.05 1.00 NullPunisher.aux : can
-1.00 1.00 NullPunisher.other : freely
-3.00 1.00 NullPunisher.entity : Each
-2.00 1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
Hand-tuned score (dot product of above): -6.2688
Threshold: -9.4738
Txt: The residents of member states have the right to live in Europe. All residents of member states are individuals. Every individual who has the right to live in Europe can travel freely within Europe.
Hyp: The residents of member states can travel freely within Europe. (yes)
The DT |
residents NNS |
member NN |
states NNS |
can MD |
travel VB |
freely RB |
Europe NNP |
|
The:DT | 2.42 | 1.00 | 1.00 | 0.97 | 1.31 | 1.23 | 1.45 | 1.25 |
residents:NNS | 0.71 | 0.28 | 0.79 | 0.87 | 0.71 | 0.69 | 1.27 | 1.09 |
member:NN | 0.71 | 0.79 | 0.28 | 0.80 | 0.71 | 0.75 | 1.27 | 1.13 |
states:NNS | 0.68 | 0.87 | 0.80 | 0.28 | 0.71 | 0.60 | 1.20 | 1.15 |
have:VBP | 0.73 | 0.91 | 0.91 | 0.86 | 0.97 | 0.27 | 1.11 | 1.16 |
the:DT | 0.01 | 1.00 | 1.00 | 0.97 | 1.31 | 1.23 | 1.45 | 1.25 |
right:NN | 0.71 | 0.87 | 0.93 | 0.99 | 0.71 | 0.80 | 1.19 | 1.17 |
to:TO | 1.26 | 1.00 | 1.00 | 1.00 | 1.31 | 1.25 | 1.45 | 1.25 |
live:VB | 0.73 | 0.58 | 0.91 | 0.80 | 0.87 | 0.59 | 0.96 | 1.16 |
Europe:NNP | 0.96 | 1.09 | 1.13 | 1.15 | 0.96 | 1.04 | 1.56 | 0.28 |
All:DT | 1.31 | 1.00 | 1.00 | 1.00 | 1.31 | 1.23 | 1.45 | 1.25 |
residents:NNS | 0.71 | 0.28 | 0.79 | 0.87 | 0.71 | 0.69 | 1.27 | 1.09 |
member:NN | 0.71 | 0.79 | 0.28 | 0.80 | 0.71 | 0.75 | 1.27 | 1.13 |
states:NNS | 0.68 | 0.87 | 0.80 | 0.28 | 0.71 | 0.60 | 1.20 | 1.15 |
are:VBP | 0.68 | 0.91 | 0.91 | 0.88 | 1.06 | 0.98 | 1.09 | 1.14 |
individuals:NNS | 0.71 | 0.74 | 0.85 | 0.94 | 0.71 | 0.66 | 0.97 | 1.21 |
Every:DT | 1.31 | 1.00 | 0.94 | 1.00 | 1.31 | 1.23 | 1.39 | 1.23 |
individual:NN | 0.71 | 0.86 | 0.88 | 0.99 | 0.71 | 0.82 | 1.07 | 1.21 |
who:WP | 1.04 | 1.30 | 1.30 | 1.30 | 1.18 | 0.97 | 1.05 | 1.55 |
has:VBZ | 0.76 | 0.91 | 0.91 | 0.88 | 0.93 | 0.94 | 1.11 | 1.16 |
the:DT | 0.01 | 1.00 | 1.00 | 0.97 | 1.31 | 1.23 | 1.45 | 1.25 |
right:NN | 0.71 | 0.87 | 0.93 | 0.99 | 0.71 | 0.80 | 1.19 | 1.17 |
to:TO | 1.26 | 1.00 | 1.00 | 1.00 | 1.31 | 1.25 | 1.45 | 1.25 |
live:VB | 0.73 | 0.58 | 0.91 | 0.80 | 0.87 | 0.59 | 0.96 | 1.16 |
Europe:NNP | 0.96 | 1.09 | 1.13 | 1.15 | 0.96 | 1.04 | 1.56 | 0.28 |
can:MD | 1.31 | 1.00 | 1.00 | 1.00 | 2.42 | 1.51 | 1.45 | 1.25 |
travel:VB | 0.73 | 0.78 | 0.84 | 0.69 | 1.02 | 1.32 | 0.90 | 1.13 |
freely:RB | 1.09 | 1.18 | 1.18 | 1.11 | 1.09 | 0.70 | 1.23 | 1.47 |
Europe:NNP | 0.96 | 1.09 | 1.13 | 1.15 | 0.96 | 1.04 | 1.56 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.05 | 0.04 | 1.55 | 0.17 | 0.04 | 0.33 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.49 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 3.00 Alignment.hypSpan
0.10 0.50 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
2.00 1.00 Modal.yes : actual -> possible
-0.10 1.00 NullPunisher.article : The
-0.05 1.00 NullPunisher.aux : can
-1.00 1.00 NullPunisher.other : freely
2.00 1.00 Location.match : Locations match between txt and hyp; both have lemma "Europe"
Hand-tuned score (dot product of above): 1.9116
Threshold: -9.4738
Txt: No delegate finished the report on time.
Hyp: No delegate finished the report. (don't know)
No DT |
delegate NN |
finished VBD |
the DT |
report NN |
|
No:DT | 2.42 | 1.00 | 1.25 | 1.31 | 1.00 |
delegate:NN | 0.71 | 0.28 | 0.74 | 0.71 | 0.74 |
finished:VBD | 0.76 | 0.83 | 1.32 | 0.76 | 0.86 |
the:DT | 1.31 | 1.00 | 1.25 | 2.42 | 1.00 |
report:NN | 0.71 | 0.74 | 0.77 | 0.71 | 0.28 |
time:NN | 0.71 | 0.86 | 0.68 | 0.61 | 0.91 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.32 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : finished
-1.00 1.00 NullPunisher.other : report
-1.00 1.00 NullPunisher.other : No
-1.00 1.00 NullPunisher.other : delegate
-0.10 1.00 NullPunisher.article : the
-2.00 1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -6.8134
Threshold: -9.4738
Txt: Some delegates finished the survey on time.
Hyp: Some delegates finished the survey. (yes)
Some DT |
delegates NNS |
finished VBD |
the DT |
survey NN |
|
Some:DT | 2.42 | 1.00 | 1.25 | 1.28 | 0.95 |
delegates:NNS | 0.71 | 0.28 | 0.78 | 0.71 | 0.53 |
finished:VBD | 0.76 | 0.87 | 1.32 | 0.76 | 0.91 |
the:DT | 1.28 | 1.00 | 1.25 | 2.42 | 1.00 |
survey:NN | 0.66 | 0.53 | 0.82 | 0.71 | 0.28 |
time:NN | 0.60 | 0.86 | 0.68 | 0.61 | 0.84 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.32 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : survey
-1.00 1.00 NullPunisher.other : Some
-1.00 1.00 NullPunisher.other : delegates
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : finished
-2.00 1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -6.8134
Threshold: -9.4738
Txt: Many delegates obtained interesting results from the survey.
Hyp: Many delegates obtained results from the survey. (yes)
Many JJ |
delegates NNS |
obtained VBD |
results NNS |
the DT |
survey NN |
|
Many:JJ | 0.74 | 0.88 | 0.96 | 0.88 | 0.76 | 0.88 |
delegates:NNS | 0.99 | 0.28 | 0.80 | 0.63 | 0.71 | 0.53 |
obtained:VBD | 1.02 | 0.89 | 1.32 | 0.85 | 0.76 | 0.61 |
interesting:JJ | 0.93 | 0.77 | 0.90 | 0.71 | 0.76 | 0.74 |
results:NNS | 0.99 | 0.63 | 0.76 | 0.28 | 0.71 | 0.63 |
the:DT | 1.18 | 1.00 | 1.25 | 1.00 | 2.42 | 1.00 |
survey:NN | 0.99 | 0.53 | 0.52 | 0.63 | 0.71 | 0.28 |
NO_WORD | 0.11 | 0.28 | 0.17 | 0.09 | 0.82 | 0.15 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.13 Alignment.score
1.00 0.19 Alignment.isGood
-1.00 0.79 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "survey" modifying "obtained"
-1.00 1.00 NullPunisher.other : results
-1.00 1.00 NullPunisher.other : delegates
-1.00 1.00 NullPunisher.other : obtained
-1.00 1.00 NullPunisher.other : Many
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : survey
-2.00 1.00 RootEntailment.unalignedRoot : "obtained" not aligned to anything
Hand-tuned score (dot product of above): -9.1748
Threshold: -9.4738
Txt: Several delegates got the results published in major national newspapers.
Hyp: Several delegates got the results published. (yes)
Several JJ |
delegates NNS |
got VBD |
the DT |
results NNS |
published VBD |
|
Several:JJ | 0.74 | 0.82 | 0.96 | 0.76 | 0.84 | 0.96 |
delegates:NNS | 0.93 | 0.28 | 0.82 | 0.71 | 0.63 | 0.71 |
got:VBD | 1.02 | 0.91 | 1.32 | 0.76 | 0.91 | 0.74 |
the:DT | 1.18 | 1.00 | 1.25 | 2.42 | 1.00 | 1.25 |
results:NNS | 0.95 | 0.63 | 0.82 | 0.71 | 0.28 | 0.64 |
published:VBN | 1.02 | 0.80 | 0.74 | 0.76 | 0.73 | 2.02 |
major:JJ | 0.93 | 0.88 | 0.96 | 0.76 | 0.88 | 0.95 |
national:JJ | 0.89 | 0.75 | 0.96 | 0.76 | 0.88 | 0.83 |
newspapers:NNS | 0.97 | 0.56 | 0.77 | 0.71 | 0.82 | 0.56 |
NO_WORD | 0.11 | 0.28 | 0.17 | 0.82 | 0.28 | 0.36 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.49 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.33 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Several" modifying "delegates"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "newspapers" modifying "published" is dropped on aligned hypothesis word "published"
-1.00 1.00 NullPunisher.other : delegates
-1.00 1.00 NullPunisher.other : Several
-1.00 1.00 NullPunisher.other : got
-0.10 1.00 NullPunisher.article : the
-2.00 1.00 RootEntailment.unalignedRoot : "got" not aligned to anything
Hand-tuned score (dot product of above): -5.7572
Threshold: -9.4738
Txt: Most Europeans are resident in Europe. All Europeans are people. All people who are resident in Europe can travel freely within Europe.
Hyp: Most Europeans can travel freely within Europe. (yes)
Most JJS |
Europeans NNPS |
can MD |
travel VB |
freely RB |
Europe NNP |
|
Most:JJS | 0.74 | 1.13 | 0.76 | 0.96 | 1.01 | 1.13 |
Europeans:NNPS | 1.24 | 0.28 | 0.96 | 1.06 | 1.55 | 0.46 |
are:VBP | 1.02 | 1.16 | 1.06 | 0.98 | 1.09 | 1.14 |
resident:VBN | 1.02 | 1.12 | 0.76 | 0.23 | 1.08 | 1.16 |
Europe:NNP | 1.24 | 0.46 | 0.96 | 1.04 | 1.56 | 0.28 |
All:DT | 1.18 | 1.25 | 1.31 | 1.23 | 1.45 | 1.25 |
Europeans:NNPS | 1.24 | 0.28 | 0.96 | 1.06 | 1.55 | 0.46 |
are:VBP | 1.02 | 1.16 | 1.06 | 0.98 | 1.09 | 1.14 |
people:NNS | 0.99 | 1.21 | 0.71 | 0.66 | 1.22 | 1.14 |
All:DT | 1.18 | 1.25 | 1.31 | 1.23 | 1.45 | 1.25 |
people:NNS | 0.99 | 1.21 | 0.71 | 0.66 | 1.22 | 1.14 |
who:WP | 1.39 | 1.55 | 1.18 | 0.97 | 1.05 | 1.55 |
are:VBP | 1.02 | 1.16 | 1.06 | 0.98 | 1.09 | 1.14 |
resident:NN | 0.99 | 0.89 | 0.71 | 0.49 | 1.27 | 1.09 |
Europe:NNP | 1.24 | 0.46 | 0.96 | 1.04 | 1.56 | 0.28 |
can:MD | 1.18 | 1.25 | 2.42 | 1.51 | 1.45 | 1.25 |
travel:VB | 1.02 | 1.15 | 1.02 | 1.32 | 0.90 | 1.13 |
freely:RB | 1.05 | 1.46 | 1.09 | 0.70 | 1.23 | 1.47 |
Europe:NNP | 1.24 | 0.46 | 0.96 | 1.04 | 1.56 | 0.28 |
NO_WORD | 0.11 | 0.28 | 1.55 | 0.17 | 0.04 | 0.33 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.47 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "Most" modifying "Europeans" is dropped on aligned hypothesis word "Europe"
2.00 1.00 Modal.yes : actual -> possible
0.00 1.00 NegPolarity.hypNegWord : "Most": tag "JJS" is in NegPolarityMarkers list
-0.05 1.00 NullPunisher.aux : can
-1.00 1.00 NullPunisher.other : freely
-1.00 1.00 NullPunisher.other : travel
-1.00 1.00 NullPunisher.other : Most
-2.00 1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
Hand-tuned score (dot product of above): -3.8498
Threshold: -9.4738
Txt: A few committee members are from Sweden. All committee members are people. All people who are from Sweden are from Scandinavia.
Hyp: At least a few committee members are from Scandinavia. (yes)
At IN |
least JJS |
a DT |
few JJ |
committee_members NNS |
are VBP |
Scandinavia NNP |
|
A:DT | 1.00 | 1.18 | 0.13 | 1.18 | 1.00 | 1.25 | 1.25 |
few:JJ | 1.05 | 0.93 | 0.76 | 0.74 | 0.88 | 0.96 | 1.13 |
committee_members:NNS | 0.50 | 0.99 | 0.71 | 0.99 | 0.28 | 0.82 | 1.06 |
are:VBP | 1.11 | 1.02 | 0.76 | 1.02 | 0.91 | 1.32 | 1.16 |
Sweden:NNP | 0.75 | 1.22 | 0.96 | 1.24 | 1.07 | 1.07 | 0.76 |
All:DT | 1.00 | 1.18 | 1.31 | 1.18 | 1.00 | 1.18 | 1.25 |
committee_members:NNS | 0.50 | 0.99 | 0.71 | 0.99 | 0.28 | 0.82 | 1.06 |
are:VBP | 1.11 | 1.02 | 0.76 | 1.02 | 0.91 | 1.32 | 1.16 |
people:NNS | 0.50 | 0.97 | 0.71 | 0.99 | 0.84 | 0.82 | 1.13 |
All:DT | 1.00 | 1.18 | 1.31 | 1.18 | 1.00 | 1.18 | 1.25 |
people:NNS | 0.50 | 0.97 | 0.71 | 0.99 | 0.84 | 0.82 | 1.13 |
who:WP | 1.05 | 1.39 | 1.11 | 1.39 | 1.30 | 0.97 | 1.55 |
are:VBP | 1.11 | 1.02 | 0.76 | 1.02 | 0.91 | 1.32 | 1.16 |
Sweden:NNP | 0.75 | 1.22 | 0.96 | 1.24 | 1.07 | 1.07 | 0.76 |
are:VBP | 1.11 | 1.02 | 0.76 | 1.02 | 0.91 | 1.32 | 1.16 |
Scandinavia:NNP | 0.75 | 1.24 | 0.96 | 1.24 | 1.06 | 1.07 | 0.28 |
NO_WORD | 0.73 | 0.29 | 0.82 | 0.11 | 0.28 | 0.17 | 0.15 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.24 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Scandinavia" modifying "are"
0.00 1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
0.00 1.00 NegPolarity.hypNegWord : "few": is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : least
-0.05 1.00 NullPunisher.aux : are
-3.00 1.00 NullPunisher.entity : Scandinavia
-1.00 1.00 NullPunisher.other : At
-1.00 1.00 NullPunisher.other : committee_members
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : few
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -11.1707
Threshold: -9.4738
Txt: Few committee members are from Portugal. All committee members are people. All people who are from Portugal are from southern Europe.
Hyp: There are few committee members from southern Europe. (don't know)
There EX |
are VBP |
few JJ |
committee_members NNS |
southern JJ |
Europe NNP |
|
Few:JJ | 0.76 | 0.96 | 0.76 | 0.88 | 0.93 | 1.13 |
committee_members:NNS | 0.71 | 0.82 | 0.99 | 0.28 | 0.99 | 1.11 |
are:VBP | 0.70 | 1.32 | 1.02 | 0.91 | 1.02 | 1.14 |
Portugal:NNP | 0.96 | 1.07 | 1.24 | 1.07 | 1.19 | 0.80 |
All:DT | 1.31 | 1.18 | 1.18 | 1.00 | 1.18 | 1.25 |
committee_members:NNS | 0.71 | 0.82 | 0.99 | 0.28 | 0.99 | 1.11 |
are:VBP | 0.70 | 1.32 | 1.02 | 0.91 | 1.02 | 1.14 |
people:NNS | 0.69 | 0.82 | 0.99 | 0.84 | 0.99 | 1.14 |
All:DT | 1.31 | 1.18 | 1.18 | 1.00 | 1.18 | 1.25 |
people:NNS | 0.69 | 0.82 | 0.99 | 0.84 | 0.99 | 1.14 |
who:WP | 1.11 | 0.97 | 1.39 | 1.30 | 1.39 | 1.55 |
are:VBP | 0.70 | 1.32 | 1.02 | 0.91 | 1.02 | 1.14 |
Portugal:NNP | 0.96 | 1.07 | 1.24 | 1.07 | 1.19 | 0.80 |
are:VBP | 0.70 | 1.32 | 1.02 | 0.91 | 1.02 | 1.14 |
southern:NNP | 0.62 | 0.82 | 0.99 | 0.82 | 1.87 | 1.07 |
Europe:NNP | 0.94 | 1.05 | 1.24 | 1.11 | 1.24 | 0.28 |
NO_WORD | 0.29 | 0.17 | 0.11 | 0.28 | 0.11 | 0.15 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.45 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Europe" modifying "committee_members"
0.00 1.00 NegPolarity.hypNegWord : "few": is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : committee_members
-0.05 1.00 NullPunisher.aux : are
-0.10 1.00 NullPunisher.functionWord : There
-3.00 1.00 NullPunisher.entity : Europe
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
-1.00 1.00 Structure.relMismatch : hypothesis noun modifier "southern" and aligned text noun modifier "Europe" bear the same relation ("amod") but different parents ("Europe" vs. "Europe") hypothesis noun modifier "few" and aligned text noun modifier "committee_members" bear the same relation ("amod") but different parents ("committee_members" vs. "committee_members")
Hand-tuned score (dot product of above): -8.4799
Threshold: -9.4738
Txt: Both commissioners used to be leading businessmen.
Hyp: Both commissioners used to be businessmen. (yes)
Both DT |
commissioners NNS |
used VBD |
to TO |
be VB |
businessmen NNS |
|
Both:DT | 2.42 | 1.00 | 1.25 | 1.31 | 1.25 | 1.00 |
commissioners:NNS | 0.71 | 0.28 | 0.82 | 0.71 | 0.82 | 0.76 |
used:VBD | 0.76 | 0.91 | 1.32 | 0.76 | 0.93 | 0.91 |
to:TO | 1.31 | 1.00 | 1.25 | 2.42 | 1.25 | 1.00 |
be:VB | 0.76 | 0.91 | 0.93 | 0.76 | 1.32 | 0.91 |
leading:VBG | 0.76 | 0.91 | 0.63 | 0.76 | 0.81 | 0.80 |
businessmen:NNS | 0.71 | 0.76 | 0.82 | 0.71 | 0.82 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.17 | 1.55 | 1.43 | 0.28 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.66 Alignment.score
1.00 0.28 Alignment.isGood
-1.00 0.69 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : businessmen
-0.10 1.00 NullPunisher.functionWord : to
-1.00 1.00 NullPunisher.other : Both
-1.00 1.00 NullPunisher.other : used
-1.00 1.00 NullPunisher.other : commissioners
-0.05 1.00 NullPunisher.aux : be
-2.00 1.00 RootEntailment.unalignedRoot : "used" not aligned to anything
Hand-tuned score (dot product of above): -6.4921
Threshold: -9.4738
Txt: Neither commissioner spends a lot of time at home.
Hyp: Neither commissioner spends time at home. (don't know)
Neither DT |
commissioner NN |
spends VBZ |
time NN |
at_home IN |
|
Neither:DT | 2.42 | 0.98 | 1.25 | 0.98 | 1.05 |
commissioner:NN | 0.69 | 0.28 | 0.77 | 0.87 | 0.50 |
spends:VBZ | 0.76 | 0.86 | 1.32 | 0.59 | 1.07 |
a:DT | 1.31 | 1.00 | 1.25 | 1.00 | 1.05 |
lot:NN | 0.71 | 0.87 | 0.70 | 0.62 | 0.45 |
of:IN | 1.05 | 1.31 | 1.24 | 1.31 | 1.05 |
time:NN | 0.69 | 0.87 | 0.50 | 0.28 | 0.30 |
at_home:IN | 1.05 | 1.31 | 1.15 | 1.11 | 1.59 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.09 | 0.99 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.36 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : at_home
-1.00 1.00 NullPunisher.other : spends
-1.00 1.00 NullPunisher.other : Neither
-1.00 1.00 NullPunisher.other : commissioner
-1.00 1.00 NullPunisher.other : time
-2.00 1.00 RootEntailment.unalignedRoot : "spends" not aligned to anything
Hand-tuned score (dot product of above): -7.6657
Threshold: -9.4738
Txt: At least three commissioners spend a lot of time at home.
Hyp: At least three commissioners spend time at home. (yes)
At IN |
least JJS |
three CD |
commissioners NNS |
spend VBP |
time NN |
at_home IN |
|
At:IN | 1.59 | 0.74 | 1.30 | 1.31 | 1.24 | 1.31 | 1.05 |
least:JJS | 1.01 | 0.74 | 1.13 | 0.88 | 0.96 | 0.88 | 1.05 |
three:CD | 1.30 | 1.00 | 0.85 | 1.18 | 1.22 | 1.01 | 1.23 |
commissioners:NNS | 0.50 | 0.99 | 1.15 | 0.28 | 0.82 | 0.87 | 0.50 |
spend:VBP | 1.15 | 1.02 | 1.31 | 0.91 | 1.32 | 0.51 | 1.07 |
a:DT | 1.00 | 1.18 | 1.18 | 1.00 | 1.25 | 1.00 | 1.05 |
lot:NN | 0.45 | 0.93 | 1.27 | 0.87 | 0.53 | 0.62 | 0.45 |
time:NN | 0.50 | 0.99 | 0.98 | 0.87 | 0.42 | 0.28 | 0.30 |
at_home:IN | 1.05 | 0.77 | 1.23 | 1.31 | 1.15 | 1.11 | 1.59 |
NO_WORD | 0.84 | 0.29 | 0.52 | 0.28 | 0.17 | 0.09 | 1.16 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.30 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.29 Alignment.txtSpan
0.00 1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : time
-1.00 1.00 NullPunisher.other : least
-1.00 1.00 NullPunisher.other : spend
-1.00 1.00 NullPunisher.other : At
-1.00 1.00 NullPunisher.other : at_home
-2.00 1.00 RootEntailment.unalignedRoot : "spend" not aligned to anything
Hand-tuned score (dot product of above): -7.5125
Threshold: -9.4738
Txt: At most ten commissioners spend a lot of time at home.
Hyp: At most ten commissioners spend time at home. (don't know)
At IN |
most JJS |
ten NN |
commissioners NNS |
spend VBP |
time NN |
at_home IN |
|
At:IN | 1.59 | 0.77 | 1.31 | 1.31 | 1.24 | 1.31 | 1.05 |
most:RBS | 1.05 | 1.44 | 1.21 | 1.21 | 0.91 | 1.21 | 1.05 |
ten:NN | 0.50 | 0.99 | 0.28 | 0.71 | 0.75 | 0.71 | 0.50 |
commissioners:NNS | 0.50 | 0.99 | 0.71 | 0.28 | 0.82 | 0.87 | 0.50 |
spend:VBP | 1.15 | 1.02 | 0.84 | 0.91 | 1.32 | 0.51 | 1.07 |
a:DT | 1.00 | 1.18 | 1.00 | 1.00 | 1.25 | 1.00 | 1.05 |
lot:NN | 0.45 | 0.89 | 0.76 | 0.87 | 0.53 | 0.62 | 0.45 |
time:NN | 0.50 | 0.99 | 0.71 | 0.87 | 0.42 | 0.28 | 0.30 |
at_home:IN | 1.05 | 0.77 | 1.31 | 1.31 | 1.15 | 1.11 | 1.59 |
NO_WORD | 1.16 | 0.29 | 0.05 | 0.28 | 0.17 | 0.09 | 1.16 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.63 Alignment.score
1.00 0.28 Alignment.isGood
-1.00 0.70 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 3.00 Alignment.hypSpan
0.10 0.43 Alignment.txtSpan
0.00 1.00 NegPolarity.hypNegWord : "most": tag "JJS" is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : time
-1.00 1.00 NullPunisher.other : At
-1.00 1.00 NullPunisher.other : at_home
-1.00 1.00 NullPunisher.other : spend
-2.00 1.00 RootEntailment.unalignedRoot : "spend" not aligned to anything
Hand-tuned score (dot product of above): -5.8497
Threshold: -9.4738
Txt: An Irishman won a Nobel prize.
Hyp: An Irishman won the Nobel prize for literature. (don't know)
An DT |
Irishman NNP |
won VBD |
the DT |
Nobel_prize NN |
literature NN |
|
An:DT | 2.42 | 1.25 | 1.21 | 1.31 | 1.00 | 1.00 |
Irishman:NNP | 0.96 | 0.28 | 1.07 | 0.96 | 1.07 | 1.07 |
won:VBD | 0.71 | 1.16 | 1.32 | 0.76 | 0.57 | 0.91 |
a:DT | 1.14 | 1.25 | 1.25 | 1.31 | 1.00 | 1.00 |
Nobel_prize:NN | 0.71 | 1.07 | 0.49 | 0.71 | 0.28 | 0.38 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.09 | 0.05 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.28 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "literature" modifying "won"
-1.00 1.00 NullPunisher.other : Nobel_prize
-1.00 1.00 NullPunisher.other : won
-1.00 1.00 NullPunisher.other : literature
-0.10 1.00 NullPunisher.article : An
-3.00 1.00 NullPunisher.entity : Irishman
-0.10 1.00 NullPunisher.article : the
-2.00 1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -10.0749
Threshold: -9.4738
Txt: Every European can travel freely within Europe. Every European is a person. Every person who has the right to live in Europe can travel freely within Europe.
Hyp: Every European has the right to live in Europe. (don't know)
Every DT |
European NNP |
has VBZ |
the DT |
right NN |
to TO |
live VB |
Europe NNP |
|
Every:DT | 2.42 | 1.24 | 1.25 | 1.31 | 1.00 | 1.31 | 1.23 | 1.23 |
European:NNP | 0.95 | 0.28 | 1.07 | 0.96 | 1.18 | 0.96 | 1.07 | 0.28 |
can:MD | 1.31 | 1.25 | 1.43 | 1.31 | 1.00 | 1.31 | 1.37 | 1.25 |
travel:VB | 0.74 | 1.13 | 0.94 | 0.73 | 0.89 | 0.76 | 0.59 | 1.13 |
freely:RB | 1.03 | 1.44 | 0.91 | 1.09 | 1.10 | 1.09 | 0.76 | 1.47 |
Europe:NNP | 0.94 | 0.28 | 1.07 | 0.96 | 1.17 | 0.96 | 1.07 | 0.28 |
Every:DT | 2.42 | 1.24 | 1.25 | 1.31 | 1.00 | 1.31 | 1.23 | 1.23 |
European:NNP | 0.95 | 0.28 | 1.07 | 0.96 | 1.18 | 0.96 | 1.07 | 0.28 |
is:VBZ | 0.76 | 1.16 | 1.01 | 0.76 | 0.91 | 0.76 | 0.39 | 1.16 |
a:DT | 1.31 | 1.25 | 1.25 | 1.31 | 1.00 | 1.31 | 1.25 | 1.25 |
person:NN | 0.69 | 1.11 | 0.82 | 0.71 | 0.84 | 0.71 | 0.65 | 1.17 |
Every:DT | 2.42 | 1.24 | 1.25 | 1.31 | 1.00 | 1.31 | 1.23 | 1.23 |
person:NN | 0.69 | 1.11 | 0.82 | 0.71 | 0.84 | 0.71 | 0.65 | 1.17 |
who:WP | 1.11 | 1.55 | 0.97 | 1.04 | 1.30 | 1.06 | 0.97 | 1.55 |
has:VBZ | 0.76 | 1.16 | 1.32 | 0.76 | 0.91 | 0.76 | 0.75 | 1.16 |
the:DT | 1.31 | 1.25 | 1.25 | 2.42 | 1.00 | 1.26 | 1.22 | 1.25 |
right:NN | 0.71 | 1.18 | 0.82 | 0.71 | 0.28 | 0.71 | 0.79 | 1.17 |
to:TO | 1.31 | 1.25 | 1.25 | 1.26 | 1.00 | 2.42 | 1.25 | 1.25 |
live:VB | 0.73 | 1.16 | 0.75 | 0.73 | 0.88 | 0.76 | 1.32 | 1.16 |
Europe:NNP | 0.94 | 0.28 | 1.07 | 0.96 | 1.17 | 0.96 | 1.07 | 0.28 |
can:MD | 1.31 | 1.25 | 1.43 | 1.31 | 1.00 | 1.31 | 1.37 | 1.25 |
travel:VB | 0.74 | 1.13 | 0.94 | 0.73 | 0.89 | 0.76 | 0.59 | 1.13 |
freely:RB | 1.03 | 1.44 | 0.91 | 1.09 | 1.10 | 1.09 | 0.76 | 1.47 |
Europe:NNP | 0.94 | 0.28 | 1.07 | 0.96 | 1.17 | 0.96 | 1.07 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.09 | 1.55 | 0.12 | 0.07 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.52 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.12 Alignment.txtSpan
-0.10 1.00 NullPunisher.functionWord : to
-1.00 1.00 NullPunisher.other : Every
-0.05 1.00 NullPunisher.aux : has
-1.00 1.00 NullPunisher.other : right
-1.00 1.00 NullPunisher.other : live
-0.10 1.00 NullPunisher.article : the
-2.00 1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -5.6849
Threshold: -9.4738
Txt: All Europeans can travel freely within Europe. Every European is a person. Every person who has the right to live in Europe can travel freely within Europe.
Hyp: All Europeans have the right to live in Europe. (don't know)
All DT |
Europeans NNPS |
have VBP |
the DT |
right NN |
to TO |
live VB |
Europe NNP |
|
All:DT | 2.42 | 1.25 | 1.22 | 1.31 | 1.00 | 1.31 | 1.25 | 1.25 |
Europeans:NNPS | 0.96 | 0.28 | 1.07 | 0.96 | 1.18 | 0.96 | 1.07 | 0.46 |
can:MD | 1.31 | 1.25 | 1.47 | 1.31 | 1.00 | 1.31 | 1.37 | 1.25 |
travel:VB | 0.73 | 1.15 | 0.27 | 0.73 | 0.89 | 0.76 | 0.59 | 1.13 |
freely:RB | 1.09 | 1.46 | 0.91 | 1.09 | 1.10 | 1.09 | 0.76 | 1.47 |
Europe:NNP | 0.96 | 0.46 | 1.07 | 0.96 | 1.17 | 0.96 | 1.07 | 0.28 |
Every:DT | 1.31 | 1.25 | 1.23 | 1.31 | 1.00 | 1.31 | 1.23 | 1.23 |
European:NNP | 0.96 | 1.37 | 1.07 | 0.96 | 1.18 | 0.96 | 1.07 | 0.28 |
is:VBZ | 0.76 | 1.16 | 0.98 | 0.76 | 0.91 | 0.76 | 0.39 | 1.16 |
a:DT | 1.31 | 1.25 | 1.25 | 1.31 | 1.00 | 1.31 | 1.25 | 1.25 |
person:NN | 0.71 | 1.12 | 0.82 | 0.71 | 0.84 | 0.71 | 0.65 | 1.17 |
Every:DT | 1.31 | 1.25 | 1.23 | 1.31 | 1.00 | 1.31 | 1.23 | 1.23 |
person:NN | 0.71 | 1.12 | 0.82 | 0.71 | 0.84 | 0.71 | 0.65 | 1.17 |
who:WP | 1.11 | 1.55 | 0.97 | 1.04 | 1.30 | 1.06 | 0.97 | 1.55 |
has:VBZ | 0.76 | 1.16 | 0.19 | 0.76 | 0.91 | 0.76 | 0.75 | 1.16 |
the:DT | 1.31 | 1.25 | 1.22 | 2.42 | 1.00 | 1.26 | 1.22 | 1.25 |
right:NN | 0.71 | 1.18 | 0.82 | 0.71 | 0.28 | 0.71 | 0.79 | 1.17 |
to:TO | 1.31 | 1.25 | 1.25 | 1.26 | 1.00 | 2.42 | 1.25 | 1.25 |
live:VB | 0.76 | 1.16 | 0.64 | 0.73 | 0.88 | 0.76 | 1.32 | 1.16 |
Europe:NNP | 0.96 | 0.46 | 1.07 | 0.96 | 1.17 | 0.96 | 1.07 | 0.28 |
can:MD | 1.31 | 1.25 | 1.47 | 1.31 | 1.00 | 1.31 | 1.37 | 1.25 |
travel:VB | 0.73 | 1.15 | 0.27 | 0.73 | 0.89 | 0.76 | 0.59 | 1.13 |
freely:RB | 1.09 | 1.46 | 0.91 | 1.09 | 1.10 | 1.09 | 0.76 | 1.47 |
Europe:NNP | 0.96 | 0.46 | 1.07 | 0.96 | 1.17 | 0.96 | 1.07 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.09 | 1.55 | 0.12 | 0.07 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.64 Alignment.score
1.00 0.28 Alignment.isGood
-1.00 0.70 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.38 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "Europe" modifying "travel" is dropped on aligned hypothesis word "have"
-2.00 1.00 Modal.dontKnow : possible -> actual
-1.00 1.00 NullPunisher.other : All
-1.00 1.00 NullPunisher.other : live
-1.00 1.00 NullPunisher.other : right
-0.10 1.00 NullPunisher.article : the
-0.10 1.00 NullPunisher.functionWord : to
1.00 1.00 Quant.equivalent : Replacing the quantifier "every" by an equivalent quantifier "all" preserves truth.
-3.00 1.00 Structure.argsMismatch : args have different parents but same relations: text "European" <-nsubj-- "person" vs. hyp "Europeans" <-nsubj-- "have", which aligned to text "travel"
Hand-tuned score (dot product of above): -6.7407
Threshold: -9.4738
Txt: Each European can travel freely within Europe. Every European is a person. Every person who has the right to live in Europe can travel freely within Europe.
Hyp: Each European has the right to live in Europe. (don't know)
Each DT |
European NNP |
has VBZ |
the DT |
right NN |
to TO |
live VB |
Europe NNP |
|
Each:DT | 2.42 | 1.00 | 1.48 | 1.56 | 1.23 | 1.56 | 1.51 | 1.17 |
European:NNP | 0.71 | 0.28 | 1.07 | 0.96 | 1.18 | 0.96 | 1.07 | 0.28 |
can:MD | 1.53 | 1.25 | 1.43 | 1.31 | 1.00 | 1.31 | 1.37 | 1.25 |
travel:VB | 1.01 | 1.13 | 0.94 | 0.73 | 0.89 | 0.76 | 0.59 | 1.13 |
freely:RB | 1.35 | 1.44 | 0.91 | 1.09 | 1.10 | 1.09 | 0.76 | 1.47 |
Europe:NNP | 0.87 | 0.28 | 1.07 | 0.96 | 1.17 | 0.96 | 1.07 | 0.28 |
Every:DT | 1.54 | 1.24 | 1.25 | 1.31 | 1.00 | 1.31 | 1.23 | 1.23 |
European:NNP | 0.71 | 0.28 | 1.07 | 0.96 | 1.18 | 0.96 | 1.07 | 0.28 |
is:VBZ | 1.01 | 1.16 | 1.01 | 0.76 | 0.91 | 0.76 | 0.39 | 1.16 |
a:DT | 1.56 | 1.25 | 1.25 | 1.31 | 1.00 | 1.31 | 1.25 | 1.25 |
person:NN | 0.96 | 1.11 | 0.82 | 0.71 | 0.84 | 0.71 | 0.65 | 1.17 |
Every:DT | 1.54 | 1.24 | 1.25 | 1.31 | 1.00 | 1.31 | 1.23 | 1.23 |
person:NN | 0.96 | 1.11 | 0.82 | 0.71 | 0.84 | 0.71 | 0.65 | 1.17 |
who:WP | 1.37 | 1.55 | 0.97 | 1.04 | 1.30 | 1.06 | 0.97 | 1.55 |
has:VBZ | 0.98 | 1.16 | 1.32 | 0.76 | 0.91 | 0.76 | 0.75 | 1.16 |
the:DT | 1.56 | 1.25 | 1.25 | 2.42 | 1.00 | 1.26 | 1.22 | 1.25 |
right:NN | 0.94 | 1.18 | 0.82 | 0.71 | 0.28 | 0.71 | 0.79 | 1.17 |
to:TO | 1.56 | 1.25 | 1.25 | 1.26 | 1.00 | 2.42 | 1.25 | 1.25 |
live:VB | 1.01 | 1.16 | 0.75 | 0.73 | 0.88 | 0.76 | 1.32 | 1.16 |
Europe:NNP | 0.87 | 0.28 | 1.07 | 0.96 | 1.17 | 0.96 | 1.07 | 0.28 |
can:MD | 1.53 | 1.25 | 1.43 | 1.31 | 1.00 | 1.31 | 1.37 | 1.25 |
travel:VB | 1.01 | 1.13 | 0.94 | 0.73 | 0.89 | 0.76 | 0.59 | 1.13 |
freely:RB | 1.35 | 1.44 | 0.91 | 1.09 | 1.10 | 1.09 | 0.76 | 1.47 |
Europe:NNP | 0.87 | 0.28 | 1.07 | 0.96 | 1.17 | 0.96 | 1.07 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.09 | 1.55 | 0.12 | 0.07 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.52 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.12 Alignment.txtSpan
-0.10 1.00 NullPunisher.functionWord : to
-1.00 1.00 NullPunisher.other : live
-1.00 1.00 NullPunisher.other : right
-3.00 1.00 NullPunisher.entity : Each
-0.10 1.00 NullPunisher.article : the
-0.05 1.00 NullPunisher.aux : has
-2.00 1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -7.6849
Threshold: -9.4738
Txt: The residents of member states can travel freely within Europe. All residents of member states are individuals. Every individual who has the right to live anywhere in Europe can travel freely within Europe.
Hyp: The residents of member states have the right to live anywhere in Europe. (don't know)
The DT |
residents NNS |
member NN |
states NNS |
have VBP |
the DT |
right NN |
to TO |
live VB |
anywhere RB |
Europe NNP |
|
The:DT | 2.42 | 1.00 | 1.00 | 0.97 | 1.22 | 0.01 | 1.00 | 1.26 | 1.22 | 1.45 | 1.25 |
residents:NNS | 0.71 | 0.28 | 0.79 | 0.87 | 0.82 | 0.71 | 0.87 | 0.71 | 0.49 | 1.29 | 1.09 |
member:NN | 0.71 | 0.79 | 0.28 | 0.80 | 0.82 | 0.71 | 0.93 | 0.71 | 0.82 | 1.27 | 1.13 |
states:NNS | 0.68 | 0.87 | 0.80 | 0.28 | 0.77 | 0.68 | 0.99 | 0.71 | 0.71 | 1.31 | 1.15 |
can:MD | 1.31 | 1.00 | 1.00 | 1.00 | 1.47 | 1.31 | 1.00 | 1.31 | 1.37 | 1.45 | 1.25 |
travel:VB | 0.73 | 0.78 | 0.84 | 0.69 | 0.27 | 0.73 | 0.89 | 0.76 | 0.59 | 1.11 | 1.13 |
freely:RB | 1.09 | 1.18 | 1.18 | 1.11 | 0.91 | 1.09 | 1.10 | 1.09 | 0.76 | 0.86 | 1.47 |
Europe:NNP | 0.96 | 1.09 | 1.13 | 1.15 | 1.07 | 0.96 | 1.17 | 0.96 | 1.07 | 1.56 | 0.28 |
All:DT | 1.31 | 1.00 | 1.00 | 1.00 | 1.22 | 1.31 | 1.00 | 1.31 | 1.25 | 1.45 | 1.25 |
residents:NNS | 0.71 | 0.28 | 0.79 | 0.87 | 0.82 | 0.71 | 0.87 | 0.71 | 0.49 | 1.29 | 1.09 |
member:NN | 0.71 | 0.79 | 0.28 | 0.80 | 0.82 | 0.71 | 0.93 | 0.71 | 0.82 | 1.27 | 1.13 |
states:NNS | 0.68 | 0.87 | 0.80 | 0.28 | 0.77 | 0.68 | 0.99 | 0.71 | 0.71 | 1.31 | 1.15 |
are:VBP | 0.68 | 0.91 | 0.91 | 0.88 | 0.89 | 0.68 | 0.91 | 0.76 | 0.36 | 1.09 | 1.14 |
individuals:NNS | 0.71 | 0.74 | 0.85 | 0.94 | 0.82 | 0.71 | 1.02 | 0.71 | 0.80 | 1.31 | 1.21 |
Every:DT | 1.31 | 1.00 | 0.94 | 1.00 | 1.23 | 1.31 | 1.00 | 1.31 | 1.23 | 1.44 | 1.23 |
individual:NN | 0.71 | 0.86 | 0.88 | 0.99 | 0.82 | 0.71 | 0.95 | 0.71 | 0.82 | 1.31 | 1.21 |
who:WP | 1.04 | 1.30 | 1.30 | 1.30 | 0.97 | 1.04 | 1.30 | 1.06 | 0.97 | 1.05 | 1.55 |
has:VBZ | 0.76 | 0.91 | 0.91 | 0.88 | 0.19 | 0.76 | 0.91 | 0.76 | 0.75 | 1.11 | 1.16 |
the:DT | 0.01 | 1.00 | 1.00 | 0.97 | 1.22 | 2.42 | 1.00 | 1.26 | 1.22 | 1.45 | 1.25 |
right:NN | 0.71 | 0.87 | 0.93 | 0.99 | 0.82 | 0.71 | 0.28 | 0.71 | 0.79 | 1.31 | 1.17 |
to:TO | 1.26 | 1.00 | 1.00 | 1.00 | 1.25 | 1.26 | 1.00 | 2.42 | 1.25 | 1.45 | 1.25 |
live:VB | 0.73 | 0.58 | 0.91 | 0.80 | 0.64 | 0.73 | 0.88 | 0.76 | 1.32 | 1.11 | 1.16 |
anywhere:RB | 1.09 | 1.20 | 1.18 | 1.21 | 0.91 | 1.09 | 1.21 | 1.09 | 0.91 | 1.23 | 1.47 |
Europe:NNP | 0.96 | 1.09 | 1.13 | 1.15 | 1.07 | 0.96 | 1.17 | 0.96 | 1.07 | 1.56 | 0.28 |
can:MD | 1.31 | 1.00 | 1.00 | 1.00 | 1.47 | 1.31 | 1.00 | 1.31 | 1.37 | 1.45 | 1.25 |
travel:VB | 0.73 | 0.78 | 0.84 | 0.69 | 0.27 | 0.73 | 0.89 | 0.76 | 0.59 | 1.11 | 1.13 |
freely:RB | 1.09 | 1.18 | 1.18 | 1.11 | 0.91 | 1.09 | 1.10 | 1.09 | 0.76 | 0.86 | 1.47 |
Europe:NNP | 0.96 | 1.09 | 1.13 | 1.15 | 1.07 | 0.96 | 1.17 | 0.96 | 1.07 | 1.56 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.05 | 0.04 | 0.17 | 0.82 | 0.09 | 1.55 | 0.12 | 0.04 | 0.07 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.47 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 4.00 Alignment.hypSpan
0.10 0.36 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Europe" modifying "live"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "Europe" modifying "travel" is dropped on aligned hypothesis word "have"
-2.00 1.00 Modal.dontKnow : possible -> actual
-0.10 1.00 NullPunisher.functionWord : to
-1.00 1.00 NullPunisher.other : right
-0.10 1.00 NullPunisher.article : The
-1.00 1.00 NullPunisher.other : anywhere
-0.10 1.00 NullPunisher.article : the
-3.00 1.00 NullPunisher.entity : Europe
-1.00 1.00 NullPunisher.other : live
Hand-tuned score (dot product of above): -9.0804
Threshold: -9.4738
Txt: No delegate finished the report.
Hyp: Some delegate finished the report on time. (don't know)
Some DT |
delegate NN |
finished VBD |
the DT |
report NN |
time NN |
|
No:DT | 1.31 | 1.00 | 1.25 | 1.31 | 1.00 | 1.00 |
delegate:NN | 0.71 | 0.28 | 0.74 | 0.71 | 0.74 | 0.86 |
finished:VBD | 0.76 | 0.83 | 1.32 | 0.76 | 0.86 | 0.77 |
the:DT | 1.28 | 1.00 | 1.25 | 2.42 | 1.00 | 0.90 |
report:NN | 0.71 | 0.74 | 0.77 | 0.71 | 0.28 | 0.91 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.09 | 0.02 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.27 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "time" modifying "finished"
-1.00 1.00 NullPunisher.other : delegate
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : time
-1.00 1.00 NullPunisher.other : report
-1.00 1.00 NullPunisher.other : finished
-1.00 1.00 NullPunisher.other : Some
-2.00 1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -8.9897
Threshold: -9.4738
Txt: Some delegates finished the survey.
Hyp: Some delegates finished the survey on time. (don't know)
Some DT |
delegates NNS |
finished VBD |
the DT |
survey NN |
time NN |
|
Some:DT | 2.42 | 1.00 | 1.25 | 1.28 | 0.95 | 0.89 |
delegates:NNS | 0.71 | 0.28 | 0.78 | 0.71 | 0.53 | 0.86 |
finished:VBD | 0.76 | 0.87 | 1.32 | 0.76 | 0.91 | 0.77 |
the:DT | 1.28 | 1.00 | 1.25 | 2.42 | 1.00 | 0.90 |
survey:NN | 0.66 | 0.53 | 0.82 | 0.71 | 0.28 | 0.84 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.09 | 0.02 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.27 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "time" modifying "finished"
-1.00 1.00 NullPunisher.other : time
-1.00 1.00 NullPunisher.other : Some
-1.00 1.00 NullPunisher.other : delegates
-1.00 1.00 NullPunisher.other : finished
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : survey
-2.00 1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -8.9897
Threshold: -9.4738
Txt: Many delegates obtained results from the survey.
Hyp: Many delegates obtained interesting results from the survey. (don't know)
Many JJ |
delegates NNS |
obtained VBD |
interesting JJ |
results NNS |
the DT |
survey NN |
|
Many:JJ | 0.74 | 0.88 | 0.96 | 0.93 | 0.88 | 0.76 | 0.88 |
delegates:NNS | 0.99 | 0.28 | 0.80 | 0.88 | 0.63 | 0.71 | 0.53 |
obtained:VBD | 1.02 | 0.89 | 1.32 | 0.96 | 0.85 | 0.76 | 0.61 |
results:NNS | 0.99 | 0.63 | 0.76 | 0.82 | 0.28 | 0.71 | 0.63 |
the:DT | 1.18 | 1.00 | 1.25 | 1.18 | 1.00 | 2.42 | 1.00 |
survey:NN | 0.99 | 0.53 | 0.52 | 0.85 | 0.63 | 0.71 | 0.28 |
NO_WORD | 0.11 | 0.28 | 0.17 | 0.11 | 0.09 | 0.82 | 0.15 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.13 Alignment.score
1.00 0.19 Alignment.isGood
-1.00 0.79 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "survey" modifying "obtained"
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : results
-1.00 1.00 NullPunisher.other : survey
-1.00 1.00 NullPunisher.other : delegates
-1.00 1.00 NullPunisher.other : Many
-1.00 1.00 NullPunisher.other : obtained
-1.00 1.00 NullPunisher.other : interesting
-2.00 1.00 RootEntailment.unalignedRoot : "obtained" not aligned to anything
Hand-tuned score (dot product of above): -10.2775
Threshold: -9.4738
Txt: Several delegates got the results published.
Hyp: Several delegates got the results published in major national newspapers. (don't know)
Several JJ |
delegates NNS |
got VBD |
the DT |
results NNS |
published VBN |
major JJ |
national JJ |
newspapers NNS |
|
Several:JJ | 0.74 | 0.82 | 0.96 | 0.76 | 0.84 | 0.96 | 0.93 | 0.89 | 0.86 |
delegates:NNS | 0.93 | 0.28 | 0.82 | 0.71 | 0.63 | 0.71 | 0.99 | 0.86 | 0.56 |
got:VBD | 1.02 | 0.91 | 1.32 | 0.76 | 0.91 | 0.74 | 1.02 | 1.02 | 0.86 |
the:DT | 1.18 | 1.00 | 1.25 | 2.42 | 1.00 | 1.25 | 1.18 | 1.18 | 1.00 |
results:NNS | 0.95 | 0.63 | 0.82 | 0.71 | 0.28 | 0.64 | 0.99 | 0.99 | 0.82 |
published:VBD | 1.02 | 0.80 | 0.74 | 0.76 | 0.73 | 2.02 | 1.01 | 0.89 | 0.65 |
NO_WORD | 0.11 | 0.28 | 0.17 | 0.82 | 0.09 | 0.16 | 0.11 | 0.11 | 0.07 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.34 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 8.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.11 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "newspapers" modifying "published"
-1.00 1.00 NullPunisher.other : Several
-1.00 1.00 NullPunisher.other : delegates
-1.00 1.00 NullPunisher.other : national
-1.00 1.00 NullPunisher.other : got
-1.00 1.00 NullPunisher.other : major
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : results
-1.00 1.00 NullPunisher.other : newspapers
-2.00 1.00 RootEntailment.unalignedRoot : "got" not aligned to anything
Hand-tuned score (dot product of above): -10.9745
Threshold: -9.4738
Txt: Most Europeans can travel freely within Europe. All Europeans are people. All people who are resident in Europe can travel freely within Europe.
Hyp: Most Europeans are resident in Europe. (don't know)
Most JJS |
Europeans NNPS |
are VBP |
resident VBN |
Europe NNP |
|
Most:JJS | 0.74 | 1.13 | 0.96 | 0.96 | 1.13 |
Europeans:NNPS | 1.24 | 0.28 | 1.07 | 1.03 | 0.46 |
can:MD | 1.18 | 1.25 | 1.55 | 1.25 | 1.25 |
travel:VB | 1.02 | 1.15 | 0.98 | 0.23 | 1.13 |
freely:RB | 1.05 | 1.46 | 0.89 | 0.88 | 1.47 |
Europe:NNP | 1.24 | 0.46 | 1.05 | 1.07 | 0.28 |
All:DT | 1.18 | 1.25 | 1.18 | 1.25 | 1.25 |
Europeans:NNPS | 1.24 | 0.28 | 1.07 | 1.03 | 0.46 |
are:VBP | 1.02 | 1.16 | 1.32 | 0.56 | 1.14 |
people:NNS | 0.99 | 1.21 | 0.82 | 0.60 | 1.14 |
All:DT | 1.18 | 1.25 | 1.18 | 1.25 | 1.25 |
people:NNS | 0.99 | 1.21 | 0.82 | 0.60 | 1.14 |
who:WP | 1.39 | 1.55 | 0.97 | 0.97 | 1.55 |
are:VBP | 1.02 | 1.16 | 1.32 | 0.56 | 1.14 |
resident:NN | 0.99 | 0.89 | 0.82 | 2.71 | 1.09 |
Europe:NNP | 1.24 | 0.46 | 1.05 | 1.07 | 0.28 |
can:MD | 1.18 | 1.25 | 1.55 | 1.25 | 1.25 |
travel:VB | 1.02 | 1.15 | 0.98 | 0.23 | 1.13 |
freely:RB | 1.05 | 1.46 | 0.89 | 0.88 | 1.47 |
Europe:NNP | 1.24 | 0.46 | 1.05 | 1.07 | 0.28 |
NO_WORD | 0.11 | 0.15 | 1.57 | 0.17 | 0.07 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.99 Alignment.score
1.00 0.35 Alignment.isGood
-1.00 0.62 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.40 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Most" modifying "Europeans"
0.00 1.00 NegPolarity.hypNegWord : "Most": tag "JJS" is in NegPolarityMarkers list
-0.05 1.00 NullPunisher.aux : are
-1.00 1.00 NullPunisher.other : Most
2.00 1.00 Location.match : Locations match between txt and hyp; both have lemma "Europe"
Hand-tuned score (dot product of above): 0.7143
Threshold: -9.4738
Txt: A few committee members are from Scandinavia. All committee members are people. All people who are from Sweden are from Scandinavia.
Hyp: At least a few committee members are from Sweden. (don't know)
At IN |
least JJS |
a DT |
few JJ |
committee_members NNS |
are VBP |
Sweden NNP |
|
A:DT | 1.00 | 1.18 | 0.13 | 1.18 | 1.00 | 1.25 | 1.25 |
few:JJ | 1.05 | 0.93 | 0.76 | 0.74 | 0.88 | 0.96 | 1.13 |
committee_members:NNS | 0.50 | 0.99 | 0.71 | 0.99 | 0.28 | 0.82 | 1.07 |
are:VBP | 1.11 | 1.02 | 0.76 | 1.02 | 0.91 | 1.32 | 1.16 |
Scandinavia:NNP | 0.75 | 1.24 | 0.96 | 1.24 | 1.06 | 1.07 | 0.76 |
All:DT | 1.00 | 1.18 | 1.31 | 1.18 | 1.00 | 1.18 | 1.25 |
committee_members:NNS | 0.50 | 0.99 | 0.71 | 0.99 | 0.28 | 0.82 | 1.07 |
are:VBP | 1.11 | 1.02 | 0.76 | 1.02 | 0.91 | 1.32 | 1.16 |
people:NNS | 0.50 | 0.97 | 0.71 | 0.99 | 0.84 | 0.82 | 1.16 |
All:DT | 1.00 | 1.18 | 1.31 | 1.18 | 1.00 | 1.18 | 1.25 |
people:NNS | 0.50 | 0.97 | 0.71 | 0.99 | 0.84 | 0.82 | 1.16 |
who:WP | 1.05 | 1.39 | 1.11 | 1.39 | 1.30 | 0.97 | 1.55 |
are:VBP | 1.11 | 1.02 | 0.76 | 1.02 | 0.91 | 1.32 | 1.16 |
Sweden:NNP | 0.75 | 1.22 | 0.96 | 1.24 | 1.07 | 1.07 | 0.28 |
are:VBP | 1.11 | 1.02 | 0.76 | 1.02 | 0.91 | 1.32 | 1.16 |
Scandinavia:NNP | 0.75 | 1.24 | 0.96 | 1.24 | 1.06 | 1.07 | 0.76 |
NO_WORD | 0.73 | 0.29 | 0.82 | 0.11 | 0.28 | 0.17 | 0.15 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.24 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Sweden" modifying "are"
0.00 1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
0.00 1.00 NegPolarity.hypNegWord : "few": is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : least
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : At
-1.00 1.00 NullPunisher.other : few
-0.05 1.00 NullPunisher.aux : are
-1.00 1.00 NullPunisher.other : committee_members
-3.00 1.00 NullPunisher.entity : Sweden
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -11.1707
Threshold: -9.4738
Txt: Few committee members are from southern Europe. All committee members are people. All people who are from Portugal are from southern Europe.
Hyp: There are few committee members from Portugal. (yes)
There EX |
are VBP |
few JJ |
committee_members NNS |
Portugal NNP |
|
Few:JJ | 0.76 | 0.96 | 0.76 | 0.88 | 1.13 |
committee_members:NNS | 0.71 | 0.82 | 0.99 | 0.28 | 1.07 |
are:VBP | 0.70 | 1.32 | 1.02 | 0.91 | 1.16 |
southern:JJ | 0.68 | 0.96 | 0.93 | 0.88 | 1.08 |
Europe:NNP | 0.94 | 1.05 | 1.24 | 1.11 | 0.80 |
All:DT | 1.31 | 1.18 | 1.18 | 1.00 | 1.25 |
committee_members:NNS | 0.71 | 0.82 | 0.99 | 0.28 | 1.07 |
are:VBP | 0.70 | 1.32 | 1.02 | 0.91 | 1.16 |
people:NNS | 0.69 | 0.82 | 0.99 | 0.84 | 1.14 |
All:DT | 1.31 | 1.18 | 1.18 | 1.00 | 1.25 |
people:NNS | 0.69 | 0.82 | 0.99 | 0.84 | 1.14 |
who:WP | 1.11 | 0.97 | 1.39 | 1.30 | 1.55 |
are:VBP | 0.70 | 1.32 | 1.02 | 0.91 | 1.16 |
Portugal:NNP | 0.96 | 1.07 | 1.24 | 1.07 | 0.28 |
are:VBP | 0.70 | 1.32 | 1.02 | 0.91 | 1.16 |
southern:NNP | 0.62 | 0.82 | 0.99 | 0.82 | 1.02 |
Europe:NNP | 0.94 | 1.05 | 1.24 | 1.11 | 0.80 |
NO_WORD | 0.29 | 0.17 | 0.11 | 0.28 | 0.15 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.16 Alignment.score
1.00 0.19 Alignment.isGood
-1.00 0.79 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.20 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Portugal" modifying "committee_members"
0.00 1.00 NegPolarity.hypNegWord : "few": is in NegPolarityMarkers list
-0.05 1.00 NullPunisher.aux : are
-1.00 1.00 NullPunisher.other : committee_members
-3.00 1.00 NullPunisher.entity : Portugal
-0.10 1.00 NullPunisher.functionWord : There
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
-1.00 1.00 Structure.relMismatch : hypothesis noun modifier "few" and aligned text noun modifier "committee_members" bear the same relation ("amod") but different parents ("committee_members" vs. "committee_members")
Hand-tuned score (dot product of above): -8.8627
Threshold: -9.4738
Txt: Both commissioners used to be businessmen.
Hyp: Both commissioners used to be leading businessmen. (don't know)
Both DT |
commissioners NNS |
used VBD |
to TO |
be VB |
leading VBG |
businessmen NNS |
|
Both:DT | 2.42 | 1.00 | 1.25 | 1.31 | 1.25 | 1.25 | 1.00 |
commissioners:NNS | 0.71 | 0.28 | 0.82 | 0.71 | 0.82 | 0.82 | 0.76 |
used:VBD | 0.76 | 0.91 | 1.32 | 0.76 | 0.93 | 0.63 | 0.91 |
to:TO | 1.31 | 1.00 | 1.25 | 2.42 | 1.25 | 1.25 | 1.00 |
be:VB | 0.76 | 0.91 | 0.93 | 0.76 | 1.32 | 0.89 | 0.91 |
businessmen:NNS | 0.71 | 0.76 | 0.82 | 0.71 | 0.82 | 0.71 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.17 | 1.55 | 1.43 | 0.12 | 0.28 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.59 Alignment.score
1.00 0.27 Alignment.isGood
-1.00 0.71 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "leading" modifying "businessmen"
-1.00 1.00 NullPunisher.other : commissioners
-1.00 1.00 NullPunisher.other : leading
-0.05 1.00 NullPunisher.aux : be
-1.00 1.00 NullPunisher.other : used
-0.10 1.00 NullPunisher.functionWord : to
-1.00 1.00 NullPunisher.other : Both
-1.00 1.00 NullPunisher.other : businessmen
-2.00 1.00 RootEntailment.unalignedRoot : "used" not aligned to anything
Hand-tuned score (dot product of above): -8.7006
Threshold: -9.4738
Txt: Neither commissioner spends time at home.
Hyp: One of the commissioners spends a lot of time at home. (don't know)
One CD |
the DT |
commissioners NNS |
spends VBZ |
a DT |
lot NN |
of IN |
time NN |
at_home IN |
|
Neither:DT | 1.18 | 1.26 | 1.00 | 1.25 | 1.31 | 1.00 | 1.05 | 0.98 | 1.05 |
commissioner:NN | 1.29 | 0.71 | 1.47 | 0.77 | 0.71 | 0.87 | 0.50 | 0.87 | 0.50 |
spends:VBZ | 1.39 | 0.76 | 0.91 | 1.32 | 0.76 | 0.79 | 1.15 | 0.59 | 1.07 |
time:NN | 1.26 | 0.61 | 0.87 | 0.50 | 0.71 | 0.62 | 0.50 | 0.28 | 0.30 |
at_home:IN | 1.30 | 1.00 | 1.31 | 1.15 | 1.05 | 1.26 | 1.05 | 1.11 | 1.59 |
NO_WORD | 0.79 | 0.82 | 0.04 | 0.17 | 0.82 | 0.09 | 1.16 | 0.25 | 0.99 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.55 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.71 Alignment.isBad
-0.10 8.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.11 Alignment.txtSpan
-3.00 1.00 NullPunisher.entity : One
-1.00 1.00 NullPunisher.other : lot
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : of
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : at_home
-1.00 1.00 NullPunisher.other : time
-1.00 1.00 NullPunisher.other : spends
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '1.0' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "spends" not aligned to anything
Hand-tuned score (dot product of above): -16.7851
Threshold: -9.4738
Txt: At least three commissioners spend time at home.
Hyp: At least three commissioners spend a lot of time at home. (don't know)
At IN |
least JJS |
three CD |
commissioners NNS |
spend VBP |
a DT |
lot NN |
time NN |
at_home IN |
|
At:IN | 1.59 | 0.74 | 1.30 | 1.31 | 1.24 | 1.00 | 1.27 | 1.31 | 1.05 |
least:JJS | 1.01 | 0.74 | 1.13 | 0.88 | 0.96 | 0.76 | 0.82 | 0.88 | 1.05 |
three:CD | 1.30 | 1.00 | 0.85 | 1.18 | 1.22 | 0.89 | 1.31 | 1.01 | 1.23 |
commissioners:NNS | 0.50 | 0.99 | 1.15 | 0.28 | 0.82 | 0.71 | 0.87 | 0.87 | 0.50 |
spend:VBP | 1.15 | 1.02 | 1.31 | 0.91 | 1.32 | 0.76 | 0.62 | 0.51 | 1.07 |
time:NN | 0.50 | 0.99 | 0.98 | 0.87 | 0.42 | 0.71 | 0.62 | 0.28 | 0.30 |
at_home:IN | 1.05 | 0.77 | 1.23 | 1.31 | 1.15 | 1.05 | 1.26 | 1.11 | 1.59 |
NO_WORD | 0.84 | 0.29 | 0.52 | 0.28 | 0.17 | 0.82 | 0.09 | 0.04 | 0.99 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.31 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.22 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "time" modifying "lot"
0.00 1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : at_home
-1.00 1.00 NullPunisher.other : time
-1.00 1.00 NullPunisher.other : At
-1.00 1.00 NullPunisher.other : spend
-1.00 1.00 NullPunisher.other : lot
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : least
-2.00 1.00 RootEntailment.unalignedRoot : "spend" not aligned to anything
Hand-tuned score (dot product of above): -9.8066
Threshold: -9.4738
Txt: At most ten commissioners spend time at home.
Hyp: At most ten commissioners spend a lot of time at home. (yes)
At IN |
most RBS |
ten NN |
commissioners NNS |
spend VBP |
a DT |
lot NN |
time NN |
at_home IN |
|
At:IN | 1.59 | 1.23 | 1.31 | 1.31 | 1.24 | 1.00 | 1.27 | 1.31 | 1.05 |
most:JJS | 1.05 | 1.65 | 0.88 | 0.88 | 0.96 | 0.76 | 0.78 | 0.88 | 1.05 |
ten:NN | 0.50 | 1.31 | 0.28 | 0.71 | 0.75 | 0.71 | 0.76 | 0.71 | 0.50 |
commissioners:NNS | 0.50 | 1.31 | 0.71 | 0.28 | 0.82 | 0.71 | 0.87 | 0.87 | 0.50 |
spend:VBP | 1.15 | 1.11 | 0.84 | 0.91 | 1.32 | 0.76 | 0.62 | 0.51 | 1.07 |
time:NN | 0.50 | 1.31 | 0.71 | 0.87 | 0.42 | 0.71 | 0.62 | 0.28 | 0.30 |
at_home:IN | 1.05 | 1.23 | 1.31 | 1.31 | 1.15 | 1.05 | 1.26 | 1.11 | 1.59 |
NO_WORD | 0.99 | 0.04 | 0.05 | 0.28 | 0.17 | 0.82 | 0.09 | 0.04 | 0.99 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.57 Alignment.score
1.00 0.27 Alignment.isGood
-1.00 0.71 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 3.00 Alignment.hypSpan
0.10 0.33 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "time" modifying "lot"
0.00 1.00 NegPolarity.txtNegWord : "most": tag "JJS" is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : at_home
-1.00 1.00 NullPunisher.other : time
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : lot
-1.00 1.00 NullPunisher.other : spend
-1.00 1.00 NullPunisher.other : At
-2.00 1.00 RootEntailment.unalignedRoot : "spend" not aligned to anything
Hand-tuned score (dot product of above): -8.2409
Threshold: -9.4738
Txt: A Swede won a Nobel prize. Every Swede is a Scandinavian.
Hyp: A Scandinavian won a Nobel prize. (yes)
A DT |
Scandinavian NNP |
won VBD |
a DT |
Nobel_prize NN |
|
A:DT | 2.42 | 1.25 | 1.25 | 0.13 | 1.00 |
Swede:NN | 0.96 | 0.56 | 1.07 | 0.96 | 1.07 |
won:VBD | 0.76 | 1.16 | 1.32 | 0.76 | 0.57 |
a:DT | 0.13 | 1.25 | 1.25 | 2.42 | 1.00 |
Nobel_prize:NN | 0.71 | 1.06 | 0.49 | 0.71 | 0.28 |
Every:DT | 1.31 | 1.25 | 1.25 | 1.31 | 1.00 |
Swede:NN | 0.96 | 0.56 | 1.07 | 0.96 | 1.07 |
is:VBZ | 0.76 | 1.16 | 0.88 | 0.76 | 0.91 |
a:DT | 0.13 | 1.25 | 1.25 | 2.42 | 1.00 |
Scandinavian:NNP | 0.96 | 0.28 | 1.07 | 0.96 | 1.06 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.25 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.36 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : Scandinavian
-1.00 1.00 NullPunisher.other : won
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : Nobel_prize
-0.10 1.00 NullPunisher.article : A
-2.00 1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -7.8694
Threshold: -9.4738
Txt: Every Canadian resident can travel freely within Europe. Every Canadian resident is a resident of the North American continent.
Hyp: Every resident of the North American continent can travel freely within Europe. (don't know)
Every DT |
resident NN |
the DT |
North_American NN |
continent NN |
can MD |
travel VB |
freely RB |
Europe NNP |
|
Every:DT | 2.42 | 0.98 | 1.31 | 1.25 | 1.00 | 1.31 | 1.23 | 1.39 | 1.23 |
Canadian:JJ | 1.02 | 1.11 | 1.02 | 0.86 | 1.07 | 1.00 | 1.22 | 1.27 | 1.05 |
resident:NN | 0.69 | 0.28 | 0.71 | 0.91 | 0.65 | 0.71 | 0.49 | 1.27 | 1.09 |
can:MD | 1.31 | 1.00 | 1.31 | 1.25 | 1.00 | 2.42 | 1.51 | 1.45 | 1.25 |
travel:VB | 0.74 | 0.58 | 0.73 | 1.16 | 0.67 | 1.02 | 1.32 | 0.90 | 1.13 |
freely:RB | 1.03 | 1.18 | 1.09 | 1.47 | 0.95 | 1.09 | 0.70 | 1.23 | 1.47 |
Europe:NNP | 0.94 | 1.09 | 0.96 | 0.99 | 0.84 | 0.96 | 1.04 | 1.56 | 0.28 |
Every:DT | 2.42 | 0.98 | 1.31 | 1.25 | 1.00 | 1.31 | 1.23 | 1.39 | 1.23 |
Canadian:JJ | 1.02 | 1.11 | 1.02 | 0.86 | 1.07 | 1.00 | 1.22 | 1.27 | 1.05 |
resident:NN | 0.69 | 0.28 | 0.71 | 0.91 | 0.65 | 0.71 | 0.49 | 1.27 | 1.09 |
is:VBZ | 0.76 | 0.91 | 0.76 | 1.16 | 0.91 | 1.06 | 1.00 | 1.11 | 1.16 |
a:DT | 1.31 | 1.00 | 1.31 | 1.25 | 1.00 | 1.31 | 1.25 | 1.45 | 1.25 |
resident:NN | 0.69 | 0.28 | 0.71 | 0.91 | 0.65 | 0.71 | 0.49 | 1.27 | 1.09 |
the:DT | 1.31 | 1.00 | 2.42 | 1.25 | 1.00 | 1.31 | 1.23 | 1.45 | 1.25 |
North_American:NNP | 0.96 | 0.91 | 0.96 | 2.22 | 1.08 | 0.96 | 1.07 | 1.56 | 0.99 |
continent:NNP | 0.71 | 0.65 | 0.71 | 1.08 | 3.06 | 0.71 | 0.58 | 1.04 | 0.93 |
NO_WORD | 0.82 | 0.28 | 0.82 | 0.05 | 0.04 | 1.55 | 0.17 | 0.04 | 0.33 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 1.01 Alignment.score
1.00 0.36 Alignment.isGood
-1.00 0.61 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.33 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
2.00 1.00 Modal.yes : actual -> possible
-1.00 1.00 NullPunisher.other : travel
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : freely
-1.00 1.00 NullPunisher.other : Every
-1.00 1.00 NullPunisher.other : resident
-0.05 1.00 NullPunisher.aux : can
-2.00 1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
2.00 1.00 Location.match : Locations match between txt and hyp; both have lemma "Europe"
4.00 1.00 Location.match&OK.Root.poorlyOrUnAligned :
Hand-tuned score (dot product of above): 1.2367
Threshold: -9.4738
Txt: All Canadian residents can travel freely within Europe. Every Canadian resident is a resident of the North American continent.
Hyp: All residents of the North American continent can travel freely within Europe. (don't know)
All DT |
residents NNS |
the DT |
North_American NN |
continent NN |
can MD |
travel VB |
freely RB |
Europe NNP |
|
All:DT | 2.42 | 1.00 | 1.31 | 1.25 | 1.00 | 1.31 | 1.23 | 1.45 | 1.25 |
Canadian:JJ | 1.02 | 1.12 | 1.02 | 0.86 | 1.07 | 1.00 | 1.22 | 1.27 | 1.05 |
residents:NNS | 0.71 | 0.28 | 0.71 | 0.91 | 0.67 | 0.71 | 0.69 | 1.27 | 1.09 |
can:MD | 1.31 | 1.00 | 1.31 | 1.25 | 1.00 | 2.42 | 1.51 | 1.45 | 1.25 |
travel:VB | 0.73 | 0.78 | 0.73 | 1.16 | 0.67 | 1.02 | 1.32 | 0.90 | 1.13 |
freely:RB | 1.09 | 1.18 | 1.09 | 1.47 | 0.95 | 1.09 | 0.70 | 1.23 | 1.47 |
Europe:NNP | 0.96 | 1.09 | 0.96 | 0.99 | 0.84 | 0.96 | 1.04 | 1.56 | 0.28 |
Every:DT | 1.31 | 1.00 | 1.31 | 1.25 | 1.00 | 1.31 | 1.23 | 1.39 | 1.23 |
Canadian:JJ | 1.02 | 1.12 | 1.02 | 0.86 | 1.07 | 1.00 | 1.22 | 1.27 | 1.05 |
resident:NN | 0.71 | 1.32 | 0.71 | 0.91 | 0.65 | 0.71 | 0.49 | 1.27 | 1.09 |
is:VBZ | 0.76 | 0.91 | 0.76 | 1.16 | 0.91 | 1.06 | 1.00 | 1.11 | 1.16 |
a:DT | 1.31 | 1.00 | 1.31 | 1.25 | 1.00 | 1.31 | 1.25 | 1.45 | 1.25 |
resident:NN | 0.71 | 1.32 | 0.71 | 0.91 | 0.65 | 0.71 | 0.49 | 1.27 | 1.09 |
the:DT | 1.31 | 1.00 | 2.42 | 1.25 | 1.00 | 1.31 | 1.23 | 1.45 | 1.25 |
North_American:NNP | 0.96 | 0.91 | 0.96 | 2.22 | 1.08 | 0.96 | 1.07 | 1.56 | 0.99 |
continent:NNP | 0.71 | 0.67 | 0.71 | 1.08 | 3.06 | 0.71 | 0.58 | 1.04 | 0.93 |
NO_WORD | 0.82 | 0.28 | 0.82 | 0.05 | 0.04 | 1.55 | 0.17 | 0.04 | 0.33 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 1.27 Alignment.score
1.00 0.42 Alignment.isGood
-1.00 0.55 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.22 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
2.00 1.00 Modal.yes : actual -> possible
-1.00 1.00 NullPunisher.other : freely
-1.00 1.00 NullPunisher.other : All
-0.05 1.00 NullPunisher.aux : can
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : travel
-7.00 1.00 Quant.expand : Invalid quantifier strengthening: replacing the quantifier "a" by a stronger quantifier "all" does NOT preserve truth.
-2.00 1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
2.00 1.00 Location.match : Locations match between txt and hyp; both have lemma "Europe"
4.00 1.00 Location.match&OK.Root.poorlyOrUnAligned :
Hand-tuned score (dot product of above): -4.2912
Threshold: -9.4738
Txt: Each Canadian resident can travel freely within Europe. Every Canadian resident is a resident of the North American continent.
Hyp: Each resident of the North American continent can travel freely within Europe. (don't know)
Each DT |
resident NN |
the DT |
North_American NN |
continent NN |
can MD |
travel VB |
freely RB |
Europe NNP |
|
Each:DT | 2.42 | 1.00 | 1.31 | 1.25 | 1.00 | 1.28 | 1.25 | 1.45 | 1.25 |
Canadian:JJ | 1.02 | 1.11 | 1.02 | 0.86 | 1.07 | 1.00 | 1.22 | 1.27 | 1.05 |
resident:NN | 0.71 | 0.28 | 0.71 | 0.91 | 0.65 | 0.71 | 0.49 | 1.27 | 1.09 |
can:MD | 1.28 | 1.00 | 1.31 | 1.25 | 1.00 | 2.42 | 1.51 | 1.45 | 1.25 |
travel:VB | 0.76 | 0.58 | 0.73 | 1.16 | 0.67 | 1.02 | 1.32 | 0.90 | 1.13 |
freely:RB | 1.09 | 1.18 | 1.09 | 1.47 | 0.95 | 1.09 | 0.70 | 1.23 | 1.47 |
Europe:NNP | 0.96 | 1.09 | 0.96 | 0.99 | 0.84 | 0.96 | 1.04 | 1.56 | 0.28 |
Every:DT | 1.28 | 0.98 | 1.31 | 1.25 | 1.00 | 1.31 | 1.23 | 1.39 | 1.23 |
Canadian:JJ | 1.02 | 1.11 | 1.02 | 0.86 | 1.07 | 1.00 | 1.22 | 1.27 | 1.05 |
resident:NN | 0.71 | 0.28 | 0.71 | 0.91 | 0.65 | 0.71 | 0.49 | 1.27 | 1.09 |
is:VBZ | 0.76 | 0.91 | 0.76 | 1.16 | 0.91 | 1.06 | 1.00 | 1.11 | 1.16 |
a:DT | 1.31 | 1.00 | 1.31 | 1.25 | 1.00 | 1.31 | 1.25 | 1.45 | 1.25 |
resident:NN | 0.71 | 0.28 | 0.71 | 0.91 | 0.65 | 0.71 | 0.49 | 1.27 | 1.09 |
the:DT | 1.31 | 1.00 | 2.42 | 1.25 | 1.00 | 1.31 | 1.23 | 1.45 | 1.25 |
North_American:NNP | 0.96 | 0.91 | 0.96 | 2.22 | 1.08 | 0.96 | 1.07 | 1.56 | 0.99 |
continent:NNP | 0.71 | 0.65 | 0.71 | 1.08 | 3.06 | 0.71 | 0.58 | 1.04 | 0.93 |
NO_WORD | 0.82 | 0.28 | 0.82 | 0.05 | 0.04 | 1.55 | 0.17 | 0.04 | 0.33 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 1.01 Alignment.score
1.00 0.36 Alignment.isGood
-1.00 0.61 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.33 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
2.00 1.00 Modal.yes : actual -> possible
-1.00 1.00 NullPunisher.other : freely
-0.05 1.00 NullPunisher.aux : can
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : resident
-1.00 1.00 NullPunisher.other : Each
-1.00 1.00 NullPunisher.other : travel
-2.00 1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
2.00 1.00 Location.match : Locations match between txt and hyp; both have lemma "Europe"
4.00 1.00 Location.match&OK.Root.poorlyOrUnAligned :
Hand-tuned score (dot product of above): 1.2367
Threshold: -9.4738
Txt: The residents of major western countries can travel freely within Europe. All residents of major western countries are residents of western countries.
Hyp: The residents of western countries have the right to live in Europe. (don't know)
The DT |
residents NNS |
western JJ |
countries NNS |
have VBP |
the DT |
right NN |
to TO |
live VB |
Europe NNP |
|
The:DT | 2.42 | 1.00 | 1.18 | 1.00 | 1.22 | 0.01 | 1.00 | 1.26 | 1.22 | 1.25 |
residents:NNS | 0.71 | 0.28 | 0.66 | 0.86 | 0.82 | 0.71 | 0.87 | 0.71 | 0.49 | 1.09 |
major:JJ | 0.76 | 0.88 | 0.76 | 0.70 | 0.93 | 0.76 | 0.88 | 0.76 | 0.96 | 1.11 |
western:JJ | 0.76 | 0.55 | 0.74 | 0.48 | 0.96 | 0.76 | 0.88 | 0.76 | 0.80 | 1.12 |
countries:NNS | 0.71 | 0.86 | 0.59 | 0.28 | 0.82 | 0.71 | 0.95 | 0.71 | 0.73 | 1.17 |
can:MD | 1.31 | 1.00 | 1.18 | 1.00 | 1.47 | 1.31 | 1.00 | 1.31 | 1.37 | 1.25 |
travel:VB | 0.73 | 0.78 | 0.94 | 0.69 | 0.27 | 0.73 | 0.89 | 0.76 | 0.59 | 1.13 |
freely:RB | 1.09 | 1.18 | 0.77 | 1.01 | 0.91 | 1.09 | 1.10 | 1.09 | 0.76 | 1.47 |
Europe:NNP | 0.96 | 1.09 | 1.23 | 1.17 | 1.07 | 0.96 | 1.17 | 0.96 | 1.07 | 0.28 |
All:DT | 1.31 | 1.00 | 1.18 | 1.00 | 1.22 | 1.31 | 1.00 | 1.31 | 1.25 | 1.25 |
residents:NNS | 0.71 | 0.28 | 0.66 | 0.86 | 0.82 | 0.71 | 0.87 | 0.71 | 0.49 | 1.09 |
major:JJ | 0.76 | 0.88 | 0.76 | 0.70 | 0.93 | 0.76 | 0.88 | 0.76 | 0.96 | 1.11 |
western:JJ | 0.76 | 0.55 | 0.74 | 0.48 | 0.96 | 0.76 | 0.88 | 0.76 | 0.80 | 1.12 |
countries:NNS | 0.71 | 0.86 | 0.59 | 0.28 | 0.82 | 0.71 | 0.95 | 0.71 | 0.73 | 1.17 |
are:VBP | 0.68 | 0.91 | 1.02 | 0.91 | 0.89 | 0.68 | 0.91 | 0.76 | 0.36 | 1.14 |
residents:NNS | 0.71 | 0.28 | 0.66 | 0.86 | 0.82 | 0.71 | 0.87 | 0.71 | 0.49 | 1.09 |
western:JJ | 0.76 | 0.55 | 0.74 | 0.48 | 0.96 | 0.76 | 0.88 | 0.76 | 0.80 | 1.12 |
countries:NNS | 0.71 | 0.86 | 0.59 | 0.28 | 0.82 | 0.71 | 0.95 | 0.71 | 0.73 | 1.17 |
NO_WORD | 0.82 | 0.28 | 0.11 | 0.04 | 0.17 | 0.82 | 0.09 | 1.55 | 0.12 | 0.07 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.46 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.20 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Europe" modifying "live"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "Europe" modifying "travel" is dropped on aligned hypothesis word "have"
-2.00 1.00 Modal.dontKnow : possible -> actual
-0.10 1.00 NullPunisher.article : The
-0.10 1.00 NullPunisher.article : the
-3.00 1.00 NullPunisher.entity : Europe
-1.00 1.00 NullPunisher.other : live
-1.00 1.00 NullPunisher.other : western
-1.00 1.00 NullPunisher.other : right
-0.10 1.00 NullPunisher.functionWord : to
Hand-tuned score (dot product of above): -9.3122
Threshold: -9.4738
Txt: No Scandinavian delegate finished the report on time.
Hyp: Some delegate finished the report on time. (don't know)
Some DT |
delegate NN |
finished VBD |
the DT |
report NN |
time NN |
|
No:DT | 1.31 | 1.00 | 1.25 | 1.31 | 1.00 | 1.00 |
Scandinavian:JJ | 1.02 | 1.13 | 1.22 | 1.02 | 1.13 | 1.13 |
delegate:NN | 0.71 | 0.28 | 0.74 | 0.71 | 0.74 | 0.86 |
finished:VBD | 0.76 | 0.83 | 1.32 | 0.76 | 0.86 | 0.77 |
the:DT | 1.28 | 1.00 | 1.25 | 2.42 | 1.00 | 0.90 |
report:NN | 0.71 | 0.74 | 0.77 | 0.71 | 0.28 | 0.91 |
time:NN | 0.60 | 0.86 | 0.68 | 0.61 | 0.91 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.09 | 0.02 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.27 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "time" modifying "finished"
-1.00 1.00 NullPunisher.other : Some
-1.00 1.00 NullPunisher.other : finished
-1.00 1.00 NullPunisher.other : time
-1.00 1.00 NullPunisher.other : delegate
-1.00 1.00 NullPunisher.other : report
-0.10 1.00 NullPunisher.article : the
-2.00 1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -8.9897
Threshold: -9.4738
Txt: Some Irish delegates finished the survey on time.
Hyp: Some delegates finished the survey on time. (yes)
Some DT |
delegates NNS |
finished VBD |
the DT |
survey NN |
time NN |
|
Some:DT | 2.42 | 1.00 | 1.25 | 1.28 | 0.95 | 0.89 |
Irish:JJ | 1.02 | 1.13 | 1.13 | 1.02 | 1.11 | 1.11 |
delegates:NNS | 0.71 | 0.28 | 0.78 | 0.71 | 0.53 | 0.86 |
finished:VBD | 0.76 | 0.87 | 1.32 | 0.76 | 0.91 | 0.77 |
the:DT | 1.28 | 1.00 | 1.25 | 2.42 | 1.00 | 0.90 |
survey:NN | 0.66 | 0.53 | 0.82 | 0.71 | 0.28 | 0.84 |
time:NN | 0.60 | 0.86 | 0.68 | 0.61 | 0.84 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.09 | 0.02 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.27 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "time" modifying "finished"
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : time
-1.00 1.00 NullPunisher.other : finished
-1.00 1.00 NullPunisher.other : delegates
-1.00 1.00 NullPunisher.other : Some
-1.00 1.00 NullPunisher.other : survey
-2.00 1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -8.9897
Threshold: -9.4738
Txt: Many British delegates obtained interesting results from the survey.
Hyp: Many delegates obtained interesting results from the survey. (don't know)
Many JJ |
delegates NNS |
obtained VBD |
interesting JJ |
results NNS |
the DT |
survey NN |
|
Many:JJ | 0.74 | 0.88 | 0.96 | 0.93 | 0.88 | 0.76 | 0.88 |
British:JJ | 1.19 | 1.13 | 1.20 | 1.17 | 1.13 | 1.02 | 1.13 |
delegates:NNS | 0.99 | 0.28 | 0.80 | 0.88 | 0.63 | 0.71 | 0.53 |
obtained:VBD | 1.02 | 0.89 | 1.32 | 0.96 | 0.85 | 0.76 | 0.61 |
interesting:JJ | 0.93 | 0.77 | 0.90 | 0.74 | 0.71 | 0.76 | 0.74 |
results:NNS | 0.99 | 0.63 | 0.76 | 0.82 | 0.28 | 0.71 | 0.63 |
the:DT | 1.18 | 1.00 | 1.25 | 1.18 | 1.00 | 2.42 | 1.00 |
survey:NN | 0.99 | 0.53 | 0.52 | 0.85 | 0.63 | 0.71 | 0.28 |
NO_WORD | 0.11 | 0.28 | 0.17 | 0.11 | 0.09 | 0.82 | 0.15 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.13 Alignment.score
1.00 0.19 Alignment.isGood
-1.00 0.79 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "survey" modifying "obtained"
-1.00 1.00 NullPunisher.other : survey
-1.00 1.00 NullPunisher.other : delegates
-1.00 1.00 NullPunisher.other : interesting
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : Many
-1.00 1.00 NullPunisher.other : results
-1.00 1.00 NullPunisher.other : obtained
-2.00 1.00 RootEntailment.unalignedRoot : "obtained" not aligned to anything
Hand-tuned score (dot product of above): -10.2775
Threshold: -9.4738
Txt: Several Portuguese delegates got the results published in major national newspapers.
Hyp: Several delegates got the results published in major national newspapers. (yes)
Several JJ |
delegates NNS |
got VBD |
the DT |
results NNS |
published VBN |
major JJ |
national JJ |
newspapers NNS |
|
Several:JJ | 0.74 | 0.82 | 0.96 | 0.76 | 0.84 | 0.96 | 0.93 | 0.89 | 0.86 |
Portuguese:NNP | 1.24 | 1.04 | 1.07 | 0.96 | 1.06 | 1.04 | 1.24 | 1.24 | 1.05 |
delegates:NNS | 0.93 | 0.28 | 0.82 | 0.71 | 0.63 | 0.71 | 0.99 | 0.86 | 0.56 |
got:VBD | 1.02 | 0.91 | 1.32 | 0.76 | 0.91 | 0.74 | 1.02 | 1.02 | 0.86 |
the:DT | 1.18 | 1.00 | 1.25 | 2.42 | 1.00 | 1.25 | 1.18 | 1.18 | 1.00 |
results:NNS | 0.95 | 0.63 | 0.82 | 0.71 | 0.28 | 0.64 | 0.99 | 0.99 | 0.82 |
published:VBN | 1.02 | 0.80 | 0.74 | 0.76 | 0.73 | 1.32 | 1.01 | 0.89 | 0.65 |
major:JJ | 0.93 | 0.88 | 0.96 | 0.76 | 0.88 | 0.95 | 0.74 | 0.63 | 0.73 |
national:JJ | 0.89 | 0.75 | 0.96 | 0.76 | 0.88 | 0.83 | 0.63 | 0.74 | 0.62 |
newspapers:NNS | 0.97 | 0.56 | 0.77 | 0.71 | 0.82 | 0.56 | 0.84 | 0.73 | 0.28 |
NO_WORD | 0.11 | 0.28 | 0.17 | 0.82 | 0.09 | 0.16 | 0.11 | 0.11 | 0.07 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.14 Alignment.score
1.00 0.19 Alignment.isGood
-1.00 0.79 Alignment.isBad
-0.10 9.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "newspapers" modifying "published"
-1.00 1.00 NullPunisher.other : results
-1.00 1.00 NullPunisher.other : Several
-1.00 1.00 NullPunisher.other : delegates
-1.00 1.00 NullPunisher.other : national
-1.00 1.00 NullPunisher.other : newspapers
-1.00 1.00 NullPunisher.other : major
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : got
-1.00 1.00 NullPunisher.other : published
-2.00 1.00 RootEntailment.unalignedRoot : "got" not aligned to anything
Hand-tuned score (dot product of above): -12.4619
Threshold: -9.4738
Txt: Most Europeans who are resident in Europe can travel freely within Europe.
Hyp: Most Europeans can travel freely within Europe. (don't know)
Most JJS |
Europeans NNPS |
can MD |
travel VB |
freely RB |
Europe NNP |
|
Most:JJS | 0.74 | 1.13 | 0.76 | 0.96 | 1.01 | 1.13 |
Europeans:NNS | 1.24 | 2.22 | 0.96 | 1.06 | 1.55 | 0.46 |
who:WP | 1.39 | 1.55 | 1.18 | 0.97 | 1.05 | 1.55 |
are:VBP | 1.02 | 1.16 | 1.06 | 0.98 | 1.09 | 1.14 |
resident:NN | 0.99 | 0.89 | 0.71 | 0.49 | 1.27 | 1.09 |
Europe:NNP | 1.24 | 0.46 | 0.96 | 1.04 | 1.56 | 0.28 |
can:MD | 1.18 | 1.25 | 2.42 | 1.51 | 1.45 | 1.25 |
travel:VB | 1.02 | 1.15 | 1.02 | 1.32 | 0.90 | 1.13 |
freely:RB | 1.05 | 1.46 | 1.09 | 0.70 | 1.23 | 1.47 |
Europe:NNP | 1.24 | 0.46 | 0.96 | 1.04 | 1.56 | 0.28 |
NO_WORD | 0.11 | 0.28 | 1.55 | 0.17 | 0.04 | 0.33 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.47 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "resident" modifying "Europeans" is dropped on aligned hypothesis word "Europe"
2.00 1.00 Modal.yes : actual -> possible
0.00 1.00 NegPolarity.hypNegWord : "Most": tag "JJS" is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : travel
-1.00 1.00 NullPunisher.other : freely
-0.05 1.00 NullPunisher.aux : can
-1.00 1.00 NullPunisher.other : Most
-2.00 1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
Hand-tuned score (dot product of above): -3.8498
Threshold: -9.4738
Txt: A few female committee members are from Scandinavia.
Hyp: At least a few committee members are from Scandinavia. (yes)
At IN |
least JJS |
a DT |
few JJ |
committee_members NNS |
are VBP |
Scandinavia NNP |
|
A:DT | 1.00 | 1.18 | 0.13 | 1.18 | 1.00 | 1.25 | 1.25 |
few:JJ | 1.05 | 0.93 | 0.76 | 0.74 | 0.88 | 0.96 | 1.13 |
female:JJ | 1.05 | 0.87 | 0.76 | 0.91 | 0.59 | 0.93 | 1.13 |
committee_members:NNS | 0.50 | 0.99 | 0.71 | 0.99 | 0.28 | 0.82 | 1.06 |
are:VBP | 1.11 | 1.02 | 0.76 | 1.02 | 0.91 | 1.32 | 1.16 |
Scandinavia:NNP | 0.75 | 1.24 | 0.96 | 1.24 | 1.06 | 1.07 | 0.28 |
NO_WORD | 0.73 | 0.29 | 0.82 | 0.11 | 0.28 | 0.17 | 0.15 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.24 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Scandinavia" modifying "are"
0.00 1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
0.00 1.00 NegPolarity.hypNegWord : "few": is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : committee_members
-3.00 1.00 NullPunisher.entity : Scandinavia
-0.05 1.00 NullPunisher.aux : are
-1.00 1.00 NullPunisher.other : At
-1.00 1.00 NullPunisher.other : few
-1.00 1.00 NullPunisher.other : least
-0.10 1.00 NullPunisher.article : a
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -11.1707
Threshold: -9.4738
Txt: Few female committee members are from southern Europe.
Hyp: Few committee members are from southern Europe. (don't know)
Few JJ |
committee_members NNS |
are VBP |
southern JJ |
Europe NNP |
|
Few:JJ | 0.74 | 0.88 | 0.96 | 0.93 | 1.13 |
female:JJ | 0.91 | 0.59 | 0.93 | 0.93 | 1.10 |
committee_members:NNS | 0.99 | 0.28 | 0.82 | 0.99 | 1.11 |
are:VBP | 1.02 | 0.91 | 1.32 | 1.02 | 1.14 |
southern:JJ | 0.93 | 0.88 | 0.96 | 0.74 | 1.13 |
Europe:NNP | 1.24 | 1.11 | 1.05 | 1.24 | 0.28 |
NO_WORD | 0.11 | 0.28 | 0.17 | 0.11 | 0.15 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 -0.00 Alignment.score
1.00 0.17 Alignment.isGood
-1.00 0.81 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Europe" modifying "are"
-3.00 1.00 NullPunisher.entity : Europe
-1.00 1.00 NullPunisher.other : southern
-1.00 1.00 NullPunisher.other : Few
-0.05 1.00 NullPunisher.aux : are
-1.00 1.00 NullPunisher.other : committee_members
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -10.1982
Threshold: -9.4738
Txt: At least three female commissioners spend time at home.
Hyp: At least three commissioners spend time at home. (yes)
At IN |
least JJS |
three CD |
commissioners NNS |
spend VBP |
time NN |
at_home IN |
|
At:IN | 1.59 | 0.74 | 1.30 | 1.31 | 1.24 | 1.31 | 1.05 |
least:JJS | 1.01 | 0.74 | 1.13 | 0.88 | 0.96 | 0.88 | 1.05 |
three:CD | 1.30 | 1.00 | 0.85 | 1.18 | 1.22 | 1.01 | 1.23 |
female:JJ | 1.05 | 0.87 | 1.02 | 0.87 | 0.96 | 0.83 | 1.03 |
commissioners:NNS | 0.50 | 0.99 | 1.15 | 0.28 | 0.82 | 0.87 | 0.50 |
spend:VBP | 1.15 | 1.02 | 1.31 | 0.91 | 1.32 | 0.51 | 1.07 |
time:NN | 0.50 | 0.99 | 0.98 | 0.87 | 0.42 | 0.28 | 0.30 |
at_home:IN | 1.05 | 0.77 | 1.23 | 1.31 | 1.15 | 1.11 | 1.59 |
NO_WORD | 0.84 | 0.29 | 0.52 | 0.28 | 0.17 | 0.09 | 1.16 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.30 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.29 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "female" modifying "commissioners" is dropped on aligned hypothesis word "commissioners"
0.00 1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : least
-1.00 1.00 NullPunisher.other : at_home
-1.00 1.00 NullPunisher.other : At
-1.00 1.00 NullPunisher.other : spend
-1.00 1.00 NullPunisher.other : time
-2.00 1.00 RootEntailment.unalignedRoot : "spend" not aligned to anything
Hand-tuned score (dot product of above): -7.0125
Threshold: -9.4738
Txt: At most ten female commissioners spend time at home.
Hyp: At most ten commissioners spend time at home. (don't know)
At IN |
most JJS |
ten NN |
commissioners NNS |
spend VBP |
time NN |
at_home IN |
|
most:JJS | 1.05 | 0.74 | 0.88 | 0.88 | 0.96 | 0.88 | 1.05 |
ten:NN | 0.50 | 0.99 | 0.28 | 0.71 | 0.75 | 0.71 | 0.50 |
female:JJ | 1.05 | 0.93 | 0.88 | 0.87 | 0.96 | 0.83 | 1.03 |
commissioners:NNS | 0.50 | 0.99 | 0.71 | 0.28 | 0.82 | 0.87 | 0.50 |
spend:VBP | 1.15 | 1.02 | 0.84 | 0.91 | 1.32 | 0.51 | 1.07 |
time:NN | 0.50 | 0.99 | 0.71 | 0.87 | 0.42 | 0.28 | 0.30 |
at_home:IN | 1.05 | 0.77 | 1.31 | 1.31 | 1.15 | 1.11 | 1.59 |
NO_WORD | 1.16 | 0.29 | 0.05 | 0.28 | 0.17 | 0.09 | 1.16 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.36 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "ten" modifying "commissioners"
0.00 1.00 NegPolarity.hypNegWord : "most": tag "JJS" is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : At
-1.00 1.00 NullPunisher.other : spend
-1.00 1.00 NullPunisher.other : at_home
-1.00 1.00 NullPunisher.other : ten
-1.00 1.00 NullPunisher.other : commissioners
-1.00 1.00 NullPunisher.other : time
-1.00 1.00 NullPunisher.other : most
-2.00 1.00 RootEntailment.unalignedRoot : "spend" not aligned to anything
Hand-tuned score (dot product of above): -10.8594
Threshold: -9.4738
Txt: A Scandinavian won a Nobel prize. Every Swede is a Scandinavian.
Hyp: A Swede won a Nobel prize. (don't know)
A DT |
Swede NN |
won VBD |
a DT |
Nobel_prize NN |
|
A:DT | 2.42 | 1.25 | 1.25 | 0.13 | 1.00 |
Scandinavian:NNP | 0.96 | 0.56 | 1.07 | 0.96 | 1.06 |
won:VBD | 0.76 | 1.16 | 1.32 | 0.76 | 0.57 |
a:DT | 0.13 | 1.25 | 1.25 | 2.42 | 1.00 |
Nobel_prize:NN | 0.71 | 1.07 | 0.49 | 0.71 | 0.28 |
Every:DT | 1.31 | 1.21 | 1.25 | 1.31 | 1.00 |
Swede:NN | 0.96 | 0.28 | 1.07 | 0.96 | 1.07 |
is:VBZ | 0.76 | 1.16 | 0.88 | 0.76 | 0.91 |
a:DT | 0.13 | 1.25 | 1.25 | 2.42 | 1.00 |
Scandinavian:NNP | 0.96 | 0.56 | 1.07 | 0.96 | 1.06 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.25 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.36 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : Nobel_prize
-3.00 1.00 NullPunisher.entity : Swede
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : won
-0.10 1.00 NullPunisher.article : A
-2.00 1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -7.8694
Threshold: -9.4738
Txt: Every resident of the North American continent can travel freely within Europe. Every Canadian resident is a resident of the North American continent.
Hyp: Every Canadian resident can travel freely within Europe. (yes)
Every DT |
Canadian JJ |
resident NN |
can MD |
travel VB |
freely RB |
Europe NNP |
|
Every:DT | 2.42 | 1.44 | 0.98 | 1.31 | 1.23 | 1.39 | 1.23 |
resident:NN | 0.69 | 1.22 | 0.28 | 0.71 | 0.49 | 1.27 | 1.09 |
the:DT | 1.31 | 1.44 | 1.00 | 1.31 | 1.23 | 1.45 | 1.25 |
North_American:NN | 0.96 | 0.97 | 0.91 | 0.96 | 1.07 | 1.56 | 0.99 |
continent:NN | 0.71 | 1.18 | 0.65 | 0.71 | 0.58 | 1.04 | 0.93 |
can:MD | 1.31 | 1.42 | 1.00 | 2.42 | 1.51 | 1.45 | 1.25 |
travel:VB | 0.74 | 1.28 | 0.58 | 1.02 | 1.32 | 0.90 | 1.13 |
freely:RB | 1.03 | 1.30 | 1.18 | 1.09 | 0.70 | 1.23 | 1.47 |
Europe:NNP | 0.94 | 1.15 | 1.09 | 0.96 | 1.04 | 1.56 | 0.28 |
Every:DT | 2.42 | 1.44 | 0.98 | 1.31 | 1.23 | 1.39 | 1.23 |
Canadian:JJ | 1.02 | 0.74 | 1.11 | 1.00 | 1.22 | 1.27 | 1.05 |
resident:NN | 0.69 | 1.22 | 0.28 | 0.71 | 0.49 | 1.27 | 1.09 |
is:VBZ | 0.76 | 1.28 | 0.91 | 1.06 | 1.00 | 1.11 | 1.16 |
a:DT | 1.31 | 1.44 | 1.00 | 1.31 | 1.25 | 1.45 | 1.25 |
resident:NN | 0.69 | 1.22 | 0.28 | 0.71 | 0.49 | 1.27 | 1.09 |
the:DT | 1.31 | 1.44 | 1.00 | 1.31 | 1.23 | 1.45 | 1.25 |
North_American:NNP | 0.96 | 0.97 | 0.91 | 0.96 | 1.07 | 1.56 | 0.99 |
continent:NNP | 0.71 | 1.18 | 0.65 | 0.71 | 0.58 | 1.04 | 0.93 |
NO_WORD | 0.82 | 0.11 | 0.28 | 1.55 | 0.17 | 0.04 | 0.33 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.31 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.14 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
2.00 1.00 Modal.yes : actual -> possible
-0.05 1.00 NullPunisher.aux : can
-1.00 1.00 NullPunisher.other : freely
-3.00 1.00 NullPunisher.entity : Canadian
-1.00 1.00 NullPunisher.other : resident
-1.00 1.00 NullPunisher.other : travel
-1.00 1.00 NullPunisher.other : Every
-2.00 1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
2.00 1.00 Location.match : Locations match between txt and hyp; both have lemma "Europe"
4.00 1.00 Location.match&OK.Root.poorlyOrUnAligned :
Hand-tuned score (dot product of above): -2.7729
Threshold: -9.4738
Txt: All residents of the North American continent can travel freely within Europe. Every Canadian resident is a resident of the North American continent.
Hyp: All Canadian residents can travel freely within Europe. (yes)
All DT |
Canadian JJ |
residents NNS |
can MD |
travel VB |
freely RB |
Europe NNP |
|
All:DT | 2.42 | 1.44 | 1.00 | 1.31 | 1.23 | 1.45 | 1.25 |
residents:NNS | 0.71 | 1.23 | 0.28 | 0.71 | 0.69 | 1.27 | 1.09 |
the:DT | 1.31 | 1.44 | 1.00 | 1.31 | 1.23 | 1.45 | 1.25 |
North_American:NN | 0.96 | 0.97 | 0.91 | 0.96 | 1.07 | 1.56 | 0.99 |
continent:NN | 0.71 | 1.18 | 0.67 | 0.71 | 0.58 | 1.04 | 0.93 |
can:MD | 1.31 | 1.42 | 1.00 | 2.42 | 1.51 | 1.45 | 1.25 |
travel:VB | 0.73 | 1.28 | 0.78 | 1.02 | 1.32 | 0.90 | 1.13 |
freely:RB | 1.09 | 1.30 | 1.18 | 1.09 | 0.70 | 1.23 | 1.47 |
Europe:NNP | 0.96 | 1.15 | 1.09 | 0.96 | 1.04 | 1.56 | 0.28 |
Every:DT | 1.31 | 1.44 | 1.00 | 1.31 | 1.23 | 1.39 | 1.23 |
Canadian:JJ | 1.02 | 0.74 | 1.12 | 1.00 | 1.22 | 1.27 | 1.05 |
resident:NN | 0.71 | 1.22 | 1.32 | 0.71 | 0.49 | 1.27 | 1.09 |
is:VBZ | 0.76 | 1.28 | 0.91 | 1.06 | 1.00 | 1.11 | 1.16 |
a:DT | 1.31 | 1.44 | 1.00 | 1.31 | 1.25 | 1.45 | 1.25 |
resident:NN | 0.71 | 1.22 | 1.32 | 0.71 | 0.49 | 1.27 | 1.09 |
the:DT | 1.31 | 1.44 | 1.00 | 1.31 | 1.23 | 1.45 | 1.25 |
North_American:NNP | 0.96 | 0.97 | 0.91 | 0.96 | 1.07 | 1.56 | 0.99 |
continent:NNP | 0.71 | 1.18 | 0.67 | 0.71 | 0.58 | 1.04 | 0.93 |
NO_WORD | 0.82 | 0.11 | 0.28 | 1.55 | 0.17 | 0.04 | 0.33 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.53 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.29 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "Canadian" modifying "resident" is dropped on aligned hypothesis word "residents"
2.00 1.00 Modal.yes : actual -> possible
-1.00 1.00 NullPunisher.other : travel
-0.05 1.00 NullPunisher.aux : can
-1.00 1.00 NullPunisher.other : freely
-1.00 1.00 NullPunisher.other : All
-3.00 1.00 NullPunisher.entity : Canadian
1.00 1.00 Quant.equivalent : Replacing the quantifier "every" by an equivalent quantifier "all" preserves truth.
-2.00 1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
2.00 1.00 Location.match : Locations match between txt and hyp; both have lemma "Europe"
4.00 1.00 Location.match&OK.Root.poorlyOrUnAligned :
Hand-tuned score (dot product of above): 0.1542
Threshold: -9.4738
Txt: Each resident of the North American continent can travel freely within Europe. Every Canadian resident is a resident of the North American continent.
Hyp: Each Canadian resident can travel freely within Europe. (yes)
Each DT |
Canadian JJ |
resident NN |
can MD |
travel VB |
freely RB |
Europe NNP |
|
Each:DT | 2.42 | 1.44 | 1.00 | 1.28 | 1.25 | 1.45 | 1.25 |
resident:NN | 0.71 | 1.22 | 0.28 | 0.71 | 0.49 | 1.27 | 1.09 |
the:DT | 1.31 | 1.44 | 1.00 | 1.31 | 1.23 | 1.45 | 1.25 |
North_American:NN | 0.96 | 0.97 | 0.91 | 0.96 | 1.07 | 1.56 | 0.99 |
continent:NN | 0.71 | 1.18 | 0.65 | 0.71 | 0.58 | 1.04 | 0.93 |
can:MD | 1.28 | 1.42 | 1.00 | 2.42 | 1.51 | 1.45 | 1.25 |
travel:VB | 0.76 | 1.28 | 0.58 | 1.02 | 1.32 | 0.90 | 1.13 |
freely:RB | 1.09 | 1.30 | 1.18 | 1.09 | 0.70 | 1.23 | 1.47 |
Europe:NNP | 0.96 | 1.15 | 1.09 | 0.96 | 1.04 | 1.56 | 0.28 |
Every:DT | 1.28 | 1.44 | 0.98 | 1.31 | 1.23 | 1.39 | 1.23 |
Canadian:JJ | 1.02 | 0.74 | 1.11 | 1.00 | 1.22 | 1.27 | 1.05 |
resident:NN | 0.71 | 1.22 | 0.28 | 0.71 | 0.49 | 1.27 | 1.09 |
is:VBZ | 0.76 | 1.28 | 0.91 | 1.06 | 1.00 | 1.11 | 1.16 |
a:DT | 1.31 | 1.44 | 1.00 | 1.31 | 1.25 | 1.45 | 1.25 |
resident:NN | 0.71 | 1.22 | 0.28 | 0.71 | 0.49 | 1.27 | 1.09 |
the:DT | 1.31 | 1.44 | 1.00 | 1.31 | 1.23 | 1.45 | 1.25 |
North_American:NNP | 0.96 | 0.97 | 0.91 | 0.96 | 1.07 | 1.56 | 0.99 |
continent:NNP | 0.71 | 1.18 | 0.65 | 0.71 | 0.58 | 1.04 | 0.93 |
NO_WORD | 0.82 | 0.11 | 0.28 | 1.55 | 0.17 | 0.04 | 0.33 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.31 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.14 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
2.00 1.00 Modal.yes : actual -> possible
-0.05 1.00 NullPunisher.aux : can
-1.00 1.00 NullPunisher.other : travel
-1.00 1.00 NullPunisher.other : resident
-1.00 1.00 NullPunisher.other : freely
-1.00 1.00 NullPunisher.other : Each
-3.00 1.00 NullPunisher.entity : Canadian
-2.00 1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
2.00 1.00 Location.match : Locations match between txt and hyp; both have lemma "Europe"
4.00 1.00 Location.match&OK.Root.poorlyOrUnAligned :
Hand-tuned score (dot product of above): -2.7729
Threshold: -9.4738
Txt: The residents of western countries can travel freely within Europe. All residents of major western countries are residents of western countries.
Hyp: The residents of major western countries have the right to live in Europe. (yes)
The DT |
residents NNS |
major JJ |
western JJ |
countries NNS |
have VBP |
the DT |
right NN |
to TO |
live VB |
Europe NNP |
|
The:DT | 2.42 | 1.00 | 1.18 | 1.18 | 1.00 | 1.22 | 0.01 | 1.00 | 1.26 | 1.22 | 1.25 |
residents:NNS | 0.71 | 0.28 | 0.99 | 0.66 | 0.86 | 0.82 | 0.71 | 0.87 | 0.71 | 0.49 | 1.09 |
western:JJ | 0.76 | 0.55 | 0.76 | 0.74 | 0.48 | 0.96 | 0.76 | 0.88 | 0.76 | 0.80 | 1.12 |
countries:NNS | 0.71 | 0.86 | 0.81 | 0.59 | 0.28 | 0.82 | 0.71 | 0.95 | 0.71 | 0.73 | 1.17 |
can:MD | 1.31 | 1.00 | 1.18 | 1.18 | 1.00 | 1.47 | 1.31 | 1.00 | 1.31 | 1.37 | 1.25 |
travel:VB | 0.73 | 0.78 | 0.91 | 0.94 | 0.69 | 0.27 | 0.73 | 0.89 | 0.76 | 0.59 | 1.13 |
freely:RB | 1.09 | 1.18 | 0.99 | 0.77 | 1.01 | 0.91 | 1.09 | 1.10 | 1.09 | 0.76 | 1.47 |
Europe:NNP | 0.96 | 1.09 | 1.22 | 1.23 | 1.17 | 1.07 | 0.96 | 1.17 | 0.96 | 1.07 | 0.28 |
All:DT | 1.31 | 1.00 | 1.18 | 1.18 | 1.00 | 1.22 | 1.31 | 1.00 | 1.31 | 1.25 | 1.25 |
residents:NNS | 0.71 | 0.28 | 0.99 | 0.66 | 0.86 | 0.82 | 0.71 | 0.87 | 0.71 | 0.49 | 1.09 |
major:JJ | 0.76 | 0.88 | 0.74 | 0.76 | 0.70 | 0.93 | 0.76 | 0.88 | 0.76 | 0.96 | 1.11 |
western:JJ | 0.76 | 0.55 | 0.76 | 0.74 | 0.48 | 0.96 | 0.76 | 0.88 | 0.76 | 0.80 | 1.12 |
countries:NNS | 0.71 | 0.86 | 0.81 | 0.59 | 0.28 | 0.82 | 0.71 | 0.95 | 0.71 | 0.73 | 1.17 |
are:VBP | 0.68 | 0.91 | 1.02 | 1.02 | 0.91 | 0.89 | 0.68 | 0.91 | 0.76 | 0.36 | 1.14 |
residents:NNS | 0.71 | 0.28 | 0.99 | 0.66 | 0.86 | 0.82 | 0.71 | 0.87 | 0.71 | 0.49 | 1.09 |
western:JJ | 0.76 | 0.55 | 0.76 | 0.74 | 0.48 | 0.96 | 0.76 | 0.88 | 0.76 | 0.80 | 1.12 |
countries:NNS | 0.71 | 0.86 | 0.81 | 0.59 | 0.28 | 0.82 | 0.71 | 0.95 | 0.71 | 0.73 | 1.17 |
NO_WORD | 0.82 | 0.28 | 0.11 | 0.11 | 0.04 | 0.17 | 0.82 | 0.09 | 1.55 | 0.12 | 0.07 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.42 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 8.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.18 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Europe" modifying "live"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "Europe" modifying "travel" is dropped on aligned hypothesis word "have"
-2.00 1.00 Modal.dontKnow : possible -> actual
-0.10 1.00 NullPunisher.article : The
-1.00 1.00 NullPunisher.other : western
-1.00 1.00 NullPunisher.other : live
-1.00 1.00 NullPunisher.other : right
-0.10 1.00 NullPunisher.functionWord : to
-3.00 1.00 NullPunisher.entity : Europe
-1.00 1.00 NullPunisher.other : major
-0.10 1.00 NullPunisher.article : the
Hand-tuned score (dot product of above): -10.4568
Threshold: -9.4738
Txt: No delegate finished the report on time.
Hyp: Some Scandinavian delegate finished the report on time. (don't know)
Some DT |
Scandinavian JJ |
delegate NN |
finished VBD |
the DT |
report NN |
time NN |
|
No:DT | 1.31 | 1.44 | 1.00 | 1.25 | 1.31 | 1.00 | 1.00 |
delegate:NN | 0.71 | 1.24 | 0.28 | 0.74 | 0.71 | 0.74 | 0.86 |
finished:VBD | 0.76 | 1.28 | 0.83 | 1.32 | 0.76 | 0.86 | 0.77 |
the:DT | 1.28 | 1.44 | 1.00 | 1.25 | 2.42 | 1.00 | 0.90 |
report:NN | 0.71 | 1.24 | 0.74 | 0.77 | 0.71 | 0.28 | 0.91 |
time:NN | 0.60 | 1.24 | 0.86 | 0.68 | 0.61 | 0.91 | 0.28 |
NO_WORD | 0.82 | 0.11 | 0.28 | 0.17 | 0.82 | 0.09 | 0.02 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.25 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "time" modifying "finished"
-1.00 1.00 NullPunisher.other : finished
-1.00 1.00 NullPunisher.other : report
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : delegate
-1.00 1.00 NullPunisher.other : Some
-1.00 1.00 NullPunisher.other : time
-3.00 1.00 NullPunisher.entity : Scandinavian
-2.00 1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -12.1191
Threshold: -9.4738
Txt: Some delegates finished the survey on time.
Hyp: Some Irish delegates finished the survey on time. (don't know)
Some DT |
Irish JJ |
delegates NNS |
finished VBD |
the DT |
survey NN |
time NN |
|
Some:DT | 2.42 | 1.44 | 1.00 | 1.25 | 1.28 | 0.95 | 0.89 |
delegates:NNS | 0.71 | 1.24 | 0.28 | 0.78 | 0.71 | 0.53 | 0.86 |
finished:VBD | 0.76 | 1.19 | 0.87 | 1.32 | 0.76 | 0.91 | 0.77 |
the:DT | 1.28 | 1.44 | 1.00 | 1.25 | 2.42 | 1.00 | 0.90 |
survey:NN | 0.66 | 1.22 | 0.53 | 0.82 | 0.71 | 0.28 | 0.84 |
time:NN | 0.60 | 1.22 | 0.86 | 0.68 | 0.61 | 0.84 | 0.28 |
NO_WORD | 0.82 | 0.11 | 0.28 | 0.17 | 0.82 | 0.09 | 0.02 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.25 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "time" modifying "finished"
-1.00 1.00 NullPunisher.other : finished
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : time
-3.00 1.00 NullPunisher.entity : Irish
-1.00 1.00 NullPunisher.other : delegates
-1.00 1.00 NullPunisher.other : Some
-1.00 1.00 NullPunisher.other : survey
-2.00 1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -12.1191
Threshold: -9.4738
Txt: Many delegates obtained interesting results from the survey.
Hyp: Many British delegates obtained interesting results from the survey. (don't know)
Many JJ |
British JJ |
delegates NNS |
obtained VBD |
interesting JJ |
results NNS |
the DT |
survey NN |
|
Many:JJ | 0.74 | 1.19 | 0.88 | 0.96 | 0.93 | 0.88 | 0.76 | 0.88 |
delegates:NNS | 0.99 | 1.24 | 0.28 | 0.80 | 0.88 | 0.63 | 0.71 | 0.53 |
obtained:VBD | 1.02 | 1.26 | 0.89 | 1.32 | 0.96 | 0.85 | 0.76 | 0.61 |
interesting:JJ | 0.93 | 1.17 | 0.77 | 0.90 | 0.74 | 0.71 | 0.76 | 0.74 |
results:NNS | 0.99 | 1.24 | 0.63 | 0.76 | 0.82 | 0.28 | 0.71 | 0.63 |
the:DT | 1.18 | 1.44 | 1.00 | 1.25 | 1.18 | 1.00 | 2.42 | 1.00 |
survey:NN | 0.99 | 1.24 | 0.53 | 0.52 | 0.85 | 0.63 | 0.71 | 0.28 |
NO_WORD | 0.11 | 0.11 | 0.28 | 0.17 | 0.11 | 0.09 | 0.82 | 0.15 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.12 Alignment.score
1.00 0.19 Alignment.isGood
-1.00 0.79 Alignment.isBad
-0.10 8.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "survey" modifying "obtained"
-1.00 1.00 NullPunisher.other : obtained
-3.00 1.00 NullPunisher.entity : British
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : Many
-1.00 1.00 NullPunisher.other : delegates
-1.00 1.00 NullPunisher.other : interesting
-1.00 1.00 NullPunisher.other : survey
-1.00 1.00 NullPunisher.other : results
-2.00 1.00 RootEntailment.unalignedRoot : "obtained" not aligned to anything
Hand-tuned score (dot product of above): -13.3794
Threshold: -9.4738
Txt: Several delegates got the results published in major national newspapers.
Hyp: Several Portuguese delegates got the results published in major national newspapers. (don't know)
Several JJ |
Portuguese NNP |
delegates NNS |
got VBD |
the DT |
results NNS |
published VBN |
major JJ |
national JJ |
newspapers NNS |
|
Several:JJ | 0.74 | 1.13 | 0.82 | 0.96 | 0.76 | 0.84 | 0.96 | 0.93 | 0.89 | 0.86 |
delegates:NNS | 0.93 | 1.04 | 0.28 | 0.82 | 0.71 | 0.63 | 0.71 | 0.99 | 0.86 | 0.56 |
got:VBD | 1.02 | 1.16 | 0.91 | 1.32 | 0.76 | 0.91 | 0.74 | 1.02 | 1.02 | 0.86 |
the:DT | 1.18 | 1.25 | 1.00 | 1.25 | 2.42 | 1.00 | 1.25 | 1.18 | 1.18 | 1.00 |
results:NNS | 0.95 | 1.06 | 0.63 | 0.82 | 0.71 | 0.28 | 0.64 | 0.99 | 0.99 | 0.82 |
published:VBN | 1.02 | 1.13 | 0.80 | 0.74 | 0.76 | 0.73 | 1.32 | 1.01 | 0.89 | 0.65 |
major:JJ | 0.93 | 1.13 | 0.88 | 0.96 | 0.76 | 0.88 | 0.95 | 0.74 | 0.63 | 0.73 |
national:JJ | 0.89 | 1.13 | 0.75 | 0.96 | 0.76 | 0.88 | 0.83 | 0.63 | 0.74 | 0.62 |
newspapers:NNS | 0.97 | 1.05 | 0.56 | 0.77 | 0.71 | 0.82 | 0.56 | 0.84 | 0.73 | 0.28 |
NO_WORD | 0.11 | 0.05 | 0.28 | 0.17 | 0.82 | 0.09 | 0.16 | 0.11 | 0.11 | 0.07 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.12 Alignment.score
1.00 0.19 Alignment.isGood
-1.00 0.79 Alignment.isBad
-0.10 10.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "newspapers" modifying "published"
-1.00 1.00 NullPunisher.other : national
-1.00 1.00 NullPunisher.other : got
-1.00 1.00 NullPunisher.other : results
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : newspapers
-1.00 1.00 NullPunisher.other : major
-3.00 1.00 NullPunisher.entity : Portuguese
-1.00 1.00 NullPunisher.other : published
-1.00 1.00 NullPunisher.other : delegates
-1.00 1.00 NullPunisher.other : Several
-2.00 1.00 RootEntailment.unalignedRoot : "got" not aligned to anything
Hand-tuned score (dot product of above): -15.5872
Threshold: -9.4738
Txt: Most Europeans can travel freely within Europe.
Hyp: Most Europeans who are resident outside Europe can travel freely within Europe. (don't know)
Most JJS |
Europeans NNS |
who WP |
are VBP |
resident NN |
Europe NNP |
can MD |
travel VB |
freely RB |
Europe NNP |
|
Most:JJS | 0.74 | 1.13 | 1.05 | 0.96 | 0.88 | 1.13 | 0.76 | 0.96 | 1.01 | 1.13 |
Europeans:NNPS | 1.24 | 2.22 | 1.84 | 1.07 | 0.89 | 0.46 | 0.96 | 1.06 | 1.55 | 0.46 |
can:MD | 1.18 | 1.25 | 1.24 | 1.55 | 1.00 | 1.25 | 2.42 | 1.51 | 1.45 | 1.25 |
travel:VB | 1.02 | 1.15 | 0.54 | 0.98 | 0.58 | 1.13 | 1.02 | 1.32 | 0.90 | 1.13 |
freely:RB | 1.05 | 1.46 | 1.05 | 0.89 | 1.18 | 1.47 | 1.09 | 0.70 | 1.23 | 1.47 |
Europe:NNP | 1.24 | 0.46 | 1.84 | 1.05 | 1.09 | 0.28 | 0.96 | 1.04 | 1.56 | 0.28 |
NO_WORD | 0.11 | 0.28 | 0.92 | 1.43 | 0.19 | 0.10 | 1.55 | 0.17 | 0.04 | 0.33 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.65 Alignment.score
1.00 0.28 Alignment.isGood
-1.00 0.69 Alignment.isBad
-0.10 8.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.10 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "Most" modifying "Europeans" is dropped on aligned hypothesis word "Europeans"
2.00 1.00 Modal.yes : actual -> possible
0.00 1.00 NegPolarity.hypNegWord : "Most": tag "JJS" is in NegPolarityMarkers list
-0.05 1.00 NullPunisher.aux : can
-3.00 1.00 NullPunisher.entity : Europe
-1.00 1.00 NullPunisher.other : Most
-0.10 1.00 NullPunisher.functionWord : who
-1.00 1.00 NullPunisher.other : resident
-1.00 1.00 NullPunisher.other : travel
-0.05 1.00 NullPunisher.aux : are
-1.00 1.00 NullPunisher.other : freely
-2.00 1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
2.00 1.00 Location.match : Locations match between txt and hyp; both have lemma "Europe"
4.00 1.00 Location.match&OK.Root.poorlyOrUnAligned :
Hand-tuned score (dot product of above): -2.1558
Threshold: -9.4738
Txt: A few committee members are from Scandinavia.
Hyp: At least a few female committee members are from Scandinavia. (don't know)
At IN |
least JJS |
a DT |
few JJ |
female JJ |
committee_members NNS |
are VBP |
Scandinavia NNP |
|
A:DT | 1.00 | 1.18 | 0.13 | 1.18 | 1.18 | 1.00 | 1.25 | 1.25 |
few:JJ | 1.05 | 0.93 | 0.76 | 0.74 | 0.91 | 0.88 | 0.96 | 1.13 |
committee_members:NNS | 0.50 | 0.99 | 0.71 | 0.99 | 0.70 | 0.28 | 0.82 | 1.06 |
are:VBP | 1.11 | 1.02 | 0.76 | 1.02 | 0.99 | 0.91 | 1.32 | 1.16 |
Scandinavia:NNP | 0.75 | 1.24 | 0.96 | 1.24 | 1.24 | 1.06 | 1.07 | 0.28 |
NO_WORD | 0.73 | 0.29 | 0.82 | 0.11 | 0.11 | 0.28 | 0.17 | 0.15 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.23 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 8.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Scandinavia" modifying "are"
0.00 1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
0.00 1.00 NegPolarity.hypNegWord : "few": is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : female
-1.00 1.00 NullPunisher.other : At
-0.05 1.00 NullPunisher.aux : are
-3.00 1.00 NullPunisher.entity : Scandinavia
-1.00 1.00 NullPunisher.other : committee_members
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : few
-1.00 1.00 NullPunisher.other : least
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -12.2924
Threshold: -9.4738
Txt: Few committee members are from southern Europe.
Hyp: Few female committee members are from southern Europe. (yes)
Few JJ |
female JJ |
committee_members NNS |
are VBP |
southern JJ |
Europe NNP |
|
Few:JJ | 0.74 | 0.91 | 0.88 | 0.96 | 0.93 | 1.13 |
committee_members:NNS | 0.99 | 0.70 | 0.28 | 0.82 | 0.99 | 1.11 |
are:VBP | 1.02 | 0.99 | 0.91 | 1.32 | 1.02 | 1.14 |
southern:JJ | 0.93 | 0.93 | 0.88 | 0.96 | 0.74 | 1.13 |
Europe:NNP | 1.24 | 1.21 | 1.11 | 1.05 | 1.24 | 0.28 |
NO_WORD | 0.11 | 0.11 | 0.28 | 0.17 | 0.11 | 0.15 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.01 Alignment.score
1.00 0.17 Alignment.isGood
-1.00 0.81 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Europe" modifying "are"
-1.00 1.00 NullPunisher.other : Few
-0.05 1.00 NullPunisher.aux : are
-1.00 1.00 NullPunisher.other : committee_members
-1.00 1.00 NullPunisher.other : female
-3.00 1.00 NullPunisher.entity : Europe
-1.00 1.00 NullPunisher.other : southern
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -11.2726
Threshold: -9.4738
Txt: At least three commissioners spend time at home.
Hyp: At least three male commissioners spend time at home. (don't know)
At IN |
least JJS |
three CD |
male JJ |
commissioners NNS |
spend VBP |
time NN |
at_home IN |
|
At:IN | 1.59 | 0.74 | 1.30 | 0.77 | 1.31 | 1.24 | 1.31 | 1.05 |
least:JJS | 1.01 | 0.74 | 1.13 | 0.91 | 0.88 | 0.96 | 0.88 | 1.05 |
three:CD | 1.30 | 1.00 | 0.85 | 0.91 | 1.18 | 1.22 | 1.01 | 1.23 |
commissioners:NNS | 0.50 | 0.99 | 1.15 | 0.99 | 0.28 | 0.82 | 0.87 | 0.50 |
spend:VBP | 1.15 | 1.02 | 1.31 | 1.02 | 0.91 | 1.32 | 0.51 | 1.07 |
time:NN | 0.50 | 0.99 | 0.98 | 0.93 | 0.87 | 0.42 | 0.28 | 0.30 |
at_home:IN | 1.05 | 0.77 | 1.23 | 0.77 | 1.31 | 1.15 | 1.11 | 1.59 |
NO_WORD | 0.84 | 0.29 | 0.52 | 0.11 | 0.28 | 0.17 | 0.09 | 1.16 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.28 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "male" modifying "commissioners"
0.00 1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : male
-1.00 1.00 NullPunisher.other : least
-1.00 1.00 NullPunisher.other : At
-1.00 1.00 NullPunisher.other : spend
-1.00 1.00 NullPunisher.other : time
-1.00 1.00 NullPunisher.other : at_home
-2.00 1.00 RootEntailment.unalignedRoot : "spend" not aligned to anything
Hand-tuned score (dot product of above): -9.7480
Threshold: -9.4738
Txt: At most ten commissioners spend time at home.
Hyp: At most ten female commissioners spend time at home. (yes)
most JJS |
ten NN |
female JJ |
commissioners NNS |
spend VBP |
time NN |
at_home IN |
|
At:IN | 0.77 | 1.31 | 0.77 | 1.31 | 1.24 | 1.31 | 1.05 |
most:JJS | 0.74 | 0.88 | 0.93 | 0.88 | 0.96 | 0.88 | 1.05 |
ten:NN | 0.99 | 0.28 | 0.98 | 0.71 | 0.75 | 0.71 | 0.50 |
commissioners:NNS | 0.99 | 0.71 | 0.98 | 0.28 | 0.82 | 0.87 | 0.50 |
spend:VBP | 1.02 | 0.84 | 1.02 | 0.91 | 1.32 | 0.51 | 1.07 |
time:NN | 0.99 | 0.71 | 0.94 | 0.87 | 0.42 | 0.28 | 0.30 |
at_home:IN | 0.77 | 1.31 | 0.75 | 1.31 | 1.15 | 1.11 | 1.59 |
NO_WORD | 0.11 | 0.20 | 0.11 | 0.28 | 0.17 | 0.09 | 1.16 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.22 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "female" modifying "commissioners"
0.00 1.00 NegPolarity.hypNegWord : "most": tag "JJS" is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : commissioners
-1.00 1.00 NullPunisher.other : time
-1.00 1.00 NullPunisher.other : ten
-1.00 1.00 NullPunisher.other : most
-1.00 1.00 NullPunisher.other : at_home
-1.00 1.00 NullPunisher.other : female
-1.00 1.00 NullPunisher.other : spend
-2.00 1.00 RootEntailment.unalignedRoot : "spend" not aligned to anything
Hand-tuned score (dot product of above): -11.0461
Threshold: -9.4738
Txt: Smith, Jones and Anderson signed the contract.
Hyp: Jones signed the contract. (yes)
Jones NNP |
signed VBD |
the DT |
contract NN |
|
Smith:NNP | 0.70 | 1.05 | 0.96 | 1.16 |
Jones:NNP | 0.28 | 1.01 | 0.96 | 1.16 |
Anderson:NNP | 0.75 | 1.07 | 0.96 | 1.10 |
signed:VBD | 1.10 | 1.32 | 0.76 | 0.45 |
the:DT | 1.25 | 1.25 | 2.42 | 1.00 |
contract:NN | 1.16 | 0.36 | 0.71 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.20 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : Jones
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : contract
-1.00 1.00 NullPunisher.other : signed
-2.00 1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738
Txt: Smith, Jones and several lawyers signed the contract.
Hyp: Jones signed the contract. (yes)
Jones NNP |
signed VBD |
the DT |
contract NN |
|
Smith:NNP | 0.70 | 1.05 | 0.96 | 1.16 |
Jones:NNP | 0.28 | 1.01 | 0.96 | 1.16 |
several:JJ | 1.13 | 0.94 | 0.76 | 0.83 |
lawyers:NNS | 1.03 | 0.62 | 0.71 | 0.84 |
signed:VBD | 1.10 | 1.32 | 0.76 | 0.45 |
the:DT | 1.25 | 1.25 | 2.42 | 1.00 |
contract:NN | 1.16 | 0.36 | 0.71 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.20 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : Jones
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : signed
-1.00 1.00 NullPunisher.other : contract
-2.00 1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738
Txt: Either Smith, Jones or Anderson signed the contract.
Hyp: Jones signed the contract. (don't know)
Jones NNP |
signed VBD |
the DT |
contract NN |
|
Either:CC | 1.23 | 1.18 | 1.23 | 1.00 |
Smith:NNP | 0.70 | 1.05 | 0.96 | 1.16 |
Jones:NNP | 0.28 | 1.01 | 0.96 | 1.16 |
Anderson:NNP | 0.75 | 1.07 | 0.96 | 1.10 |
signed:VBD | 1.10 | 1.32 | 0.76 | 0.45 |
the:DT | 1.25 | 1.25 | 2.42 | 1.00 |
contract:NN | 1.16 | 0.36 | 0.71 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.20 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : signed
-1.00 1.00 NullPunisher.other : contract
-3.00 1.00 NullPunisher.entity : Jones
-0.10 1.00 NullPunisher.article : the
-2.00 1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738
Txt: Either Smith, Jones or Anderson signed the contract.
Hyp: If Smith and Anderson did not sign the contract, Jones signed the contract. (yes)
If IN |
Smith NNP |
Anderson NNP |
did VBD |
not RB |
sign VB |
the DT |
contract NN |
Jones NNP |
signed VBD |
the DT |
contract NN |
|
Either:CC | 1.05 | 1.19 | 1.25 | 1.25 | 1.45 | 1.25 | 1.23 | 1.00 | 1.23 | 1.18 | 1.23 | 1.00 |
Smith:NNP | 0.75 | 0.28 | 0.76 | 1.07 | 1.56 | 1.00 | 0.96 | 1.16 | 0.70 | 1.05 | 0.96 | 1.16 |
Jones:NNP | 0.75 | 0.70 | 0.75 | 1.07 | 1.56 | 1.07 | 0.96 | 1.16 | 0.28 | 1.01 | 0.96 | 1.16 |
Anderson:NNP | 0.75 | 0.76 | 0.28 | 1.07 | 1.56 | 1.07 | 0.96 | 1.10 | 0.75 | 1.07 | 0.96 | 1.10 |
signed:VBD | 1.15 | 1.14 | 1.16 | 0.63 | 1.11 | 0.93 | 0.76 | 0.45 | 1.10 | 1.32 | 0.76 | 0.45 |
the:DT | 1.05 | 1.25 | 1.25 | 1.25 | 1.45 | 1.25 | 2.42 | 1.00 | 1.25 | 1.25 | 2.42 | 1.00 |
contract:NN | 0.50 | 1.16 | 1.10 | 0.71 | 1.31 | 0.57 | 0.71 | 0.28 | 1.16 | 0.36 | 0.71 | 0.28 |
NO_WORD | 0.92 | 0.28 | 0.01 | 1.30 | 0.24 | 0.42 | 0.82 | 0.09 | 0.28 | 0.17 | 0.82 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.50 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 9.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
0.00 1.00 NegPolarity.hypNegWord : "sign": has child with relation "neg"
-1.00 1.00 NullPunisher.other : signed
-0.10 1.00 NullPunisher.article : the
-0.05 1.00 NullPunisher.aux : did
-1.00 1.00 NullPunisher.other : contract
-3.00 1.00 NullPunisher.entity : Anderson
-0.10 1.00 NullPunisher.article : the
-3.00 1.00 NullPunisher.entity : Jones
-1.00 1.00 NullPunisher.other : not
-1.00 1.00 NullPunisher.other : If
1.00 1.00 Quant.contract : Valid quantifier weakening: replacing the quantifier "the" by a weaker quantifier "the" preserves truth.
-2.00 1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -12.0113
Threshold: -9.4738
Txt: Exactly two lawyers and three accountants signed the contract.
Hyp: Six lawyers signed the contract. (don't know)
Six CD |
lawyers NNS |
signed VBD |
the DT |
contract NN |
|
Exactly:RB | 1.36 | 1.21 | 0.91 | 1.09 | 1.20 |
two:CD | 0.35 | 1.32 | 1.17 | 0.82 | 1.25 |
lawyers:NNS | 1.29 | 0.28 | 0.62 | 0.71 | 0.84 |
three:CD | 0.35 | 1.33 | 1.26 | 0.78 | 1.32 |
accountants:NNS | 1.29 | 0.37 | 0.82 | 0.71 | 0.81 |
signed:VBD | 1.37 | 0.70 | 1.32 | 0.76 | 0.45 |
the:DT | 1.18 | 1.00 | 1.25 | 2.42 | 1.00 |
contract:NN | 1.29 | 0.84 | 0.36 | 0.71 | 0.28 |
NO_WORD | 0.52 | 0.28 | 0.17 | 0.82 | 0.09 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.23 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.40 Alignment.txtSpan
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : contract
-1.00 1.00 NullPunisher.other : signed
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '6.0' vs '2.0'
-2.00 1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -10.5021
Threshold: -9.4738
Txt: Exactly two lawyers and three accountants signed the contract.
Hyp: Six accountants signed the contract. (don't know)
Six CD |
accountants NNS |
signed VBD |
the DT |
contract NN |
|
Exactly:RB | 1.36 | 1.21 | 0.91 | 1.09 | 1.20 |
two:CD | 0.35 | 1.21 | 1.17 | 0.82 | 1.25 |
lawyers:NNS | 1.29 | 0.37 | 0.62 | 0.71 | 0.84 |
three:CD | 0.35 | 1.24 | 1.26 | 0.78 | 1.32 |
accountants:NNS | 1.29 | 0.28 | 0.82 | 0.71 | 0.81 |
signed:VBD | 1.37 | 0.91 | 1.32 | 0.76 | 0.45 |
the:DT | 1.18 | 1.00 | 1.25 | 2.42 | 1.00 |
contract:NN | 1.29 | 0.81 | 0.36 | 0.71 | 0.28 |
NO_WORD | 0.52 | 0.28 | 0.17 | 0.82 | 0.09 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.23 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.40 Alignment.txtSpan
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : signed
-1.00 1.00 NullPunisher.other : contract
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '6.0' vs '3.0'
-2.00 1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -10.5021
Threshold: -9.4738
Txt: Every representative and client was at the meeting.
Hyp: Every representative was at the meeting. (yes)
Every DT |
representative NN |
was VBD |
the DT |
meeting NN |
|
Every:DT | 2.42 | 1.00 | 1.25 | 1.31 | 1.00 |
representative:NN | 0.71 | 0.28 | 0.82 | 0.71 | 0.82 |
client:NN | 0.69 | 0.84 | 0.82 | 0.71 | 0.89 |
was:VBD | 0.76 | 0.91 | 1.32 | 0.76 | 0.91 |
the:DT | 1.31 | 1.00 | 1.25 | 2.42 | 1.00 |
meeting:NN | 0.71 | 0.82 | 0.82 | 0.71 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.20 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.35 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "meeting" modifying "was"
-1.00 1.00 NullPunisher.other : meeting
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : Every
-0.05 1.00 NullPunisher.aux : was
-1.00 1.00 NullPunisher.other : representative
-2.00 1.00 RootEntailment.unalignedRoot : "was" not aligned to anything
Hand-tuned score (dot product of above): -6.8325
Threshold: -9.4738
Txt: Every representative and client was at the meeting.
Hyp: Every representative was at the meeting. (don't know)
Every DT |
representative NN |
was VBD |
the DT |
meeting NN |
|
Every:DT | 2.42 | 1.00 | 1.25 | 1.31 | 1.00 |
representative:NN | 0.71 | 0.28 | 0.82 | 0.71 | 0.82 |
client:NN | 0.69 | 0.84 | 0.82 | 0.71 | 0.89 |
was:VBD | 0.76 | 0.91 | 1.32 | 0.76 | 0.91 |
the:DT | 1.31 | 1.00 | 1.25 | 2.42 | 1.00 |
meeting:NN | 0.71 | 0.82 | 0.82 | 0.71 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.20 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.35 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "meeting" modifying "was"
-1.00 1.00 NullPunisher.other : representative
-1.00 1.00 NullPunisher.other : meeting
-1.00 1.00 NullPunisher.other : Every
-0.05 1.00 NullPunisher.aux : was
-0.10 1.00 NullPunisher.article : the
-2.00 1.00 RootEntailment.unalignedRoot : "was" not aligned to anything
Hand-tuned score (dot product of above): -6.8325
Threshold: -9.4738
Txt: Every representative or client was at the meeting.
Hyp: Every representative and every client was at the meeting. (yes)
Every DT |
representative NN |
every DT |
client NN |
was VBD |
the DT |
meeting NN |
|
Every:DT | 2.42 | 1.00 | 0.22 | 0.98 | 1.25 | 1.31 | 1.00 |
representative:NN | 0.71 | 0.28 | 0.71 | 0.84 | 0.82 | 0.71 | 0.82 |
client:NN | 0.69 | 0.84 | 0.69 | 0.28 | 0.82 | 0.71 | 0.89 |
was:VBD | 0.76 | 0.91 | 0.76 | 0.91 | 1.32 | 0.76 | 0.91 |
the:DT | 1.31 | 1.00 | 1.31 | 1.00 | 1.25 | 2.42 | 1.00 |
meeting:NN | 0.71 | 0.82 | 0.71 | 0.89 | 0.82 | 0.71 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.82 | 0.01 | 0.17 | 0.82 | 0.20 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.36 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "meeting" modifying "was"
-1.00 1.00 NullPunisher.other : client
-1.00 1.00 NullPunisher.other : meeting
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : Every
-0.05 1.00 NullPunisher.aux : was
-1.00 1.00 NullPunisher.other : every
-1.00 1.00 NullPunisher.other : representative
-2.00 1.00 RootEntailment.unalignedRoot : "was" not aligned to anything
Hand-tuned score (dot product of above): -9.0074
Threshold: -9.4738
Txt: The chairman read out the items on the agenda.
Hyp: The chairman read out every item on the agenda. (yes)
The DT |
chairman NN |
read VB |
every DT |
item NN |
the DT |
agenda NN |
|
The:DT | 2.42 | 1.00 | 1.25 | 1.31 | 0.96 | 0.01 | 1.00 |
chairman:NN | 0.71 | 0.28 | 0.82 | 0.71 | 0.89 | 0.71 | 0.75 |
read:VB | 0.76 | 0.91 | 1.32 | 0.73 | 0.66 | 0.76 | 0.81 |
the:DT | 0.01 | 1.00 | 1.25 | 1.31 | 0.96 | 2.42 | 1.00 |
items:NNS | 0.71 | 0.96 | 0.72 | 0.66 | 1.06 | 0.71 | 0.84 |
the:DT | 0.01 | 1.00 | 1.25 | 1.31 | 0.96 | 2.42 | 1.00 |
agenda:NN | 0.71 | 0.75 | 0.72 | 0.69 | 0.78 | 0.71 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.62 | 0.82 | 0.02 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.55 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.71 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.14 Alignment.txtSpan
-1.00 1.00 NullPunisher.other : chairman
-1.00 1.00 NullPunisher.other : every
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : read
-0.10 1.00 NullPunisher.article : The
-7.00 1.00 Quant.expand : Invalid quantifier strengthening: replacing the quantifier "the" by a stronger quantifier "every" does NOT preserve truth.
-2.00 1.00 RootEntailment.unalignedRoot : "read" not aligned to anything
Hand-tuned score (dot product of above): -12.4918
Threshold: -9.4738
Txt: The people who were at the meeting voted for a new chairman.
Hyp: Everyone at the meeting voted for a new chairman. (don't know)
Everyone NN |
the DT |
meeting NN |
voted VBD |
a DT |
new JJ |
chairman NN |
|
The:DT | 1.00 | 0.01 | 1.00 | 1.25 | 1.31 | 1.18 | 1.00 |
people:NNS | 0.79 | 0.71 | 0.86 | 0.80 | 0.71 | 0.99 | 1.02 |
who:WP | 1.30 | 1.04 | 1.30 | 0.97 | 1.11 | 1.39 | 1.30 |
were:VBD | 0.87 | 0.73 | 0.91 | 0.76 | 0.76 | 0.99 | 0.91 |
the:DT | 1.00 | 2.42 | 1.00 | 1.25 | 1.31 | 1.18 | 1.00 |
meeting:NN | 0.77 | 0.71 | 0.28 | 0.54 | 0.71 | 0.99 | 0.67 |
voted:VBN | 0.89 | 0.76 | 0.63 | 2.10 | 0.76 | 0.99 | 0.55 |
a:DT | 1.00 | 1.31 | 1.00 | 1.25 | 2.42 | 1.18 | 1.00 |
new:JJ | 0.88 | 0.76 | 0.88 | 0.93 | 0.76 | 0.74 | 0.87 |
chairman:NN | 0.79 | 0.71 | 0.67 | 0.46 | 0.71 | 0.98 | 0.28 |
NO_WORD | 0.28 | 0.82 | 0.20 | 0.17 | 0.82 | 0.11 | 0.05 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.60 Alignment.score
1.00 0.27 Alignment.isGood
-1.00 0.70 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.14 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "new" modifying "chairman"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "new" modifying "chairman" is dropped on aligned hypothesis word "chairman"
-0.10 1.00 NullPunisher.article : the
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : meeting
-1.00 1.00 NullPunisher.other : Everyone
-1.00 1.00 NullPunisher.other : new
Hand-tuned score (dot product of above): -3.9137
Threshold: -9.4738
Txt: All the people who were at the meeting voted for a new chairman.
Hyp: Everyone at the meeting voted for a new chairman. (yes)
Everyone NN |
the DT |
meeting NN |
voted VBD |
a DT |
new JJ |
chairman NN |
|
All:PDT | 1.00 | 1.31 | 1.00 | 1.25 | 1.31 | 1.18 | 1.00 |
the:DT | 1.00 | 2.42 | 1.00 | 1.25 | 1.31 | 1.18 | 1.00 |
people:NNS | 0.79 | 0.71 | 0.86 | 0.80 | 0.71 | 0.99 | 1.02 |
who:WP | 1.30 | 1.04 | 1.30 | 0.97 | 1.11 | 1.39 | 1.30 |
were:VBD | 0.87 | 0.73 | 0.91 | 0.76 | 0.76 | 0.99 | 0.91 |
the:DT | 1.00 | 2.42 | 1.00 | 1.25 | 1.31 | 1.18 | 1.00 |
meeting:NN | 0.77 | 0.71 | 0.28 | 0.54 | 0.71 | 0.99 | 0.67 |
voted:VBN | 0.89 | 0.76 | 0.63 | 2.10 | 0.76 | 0.99 | 0.55 |
a:DT | 1.00 | 1.31 | 1.00 | 1.25 | 2.42 | 1.18 | 1.00 |
new:JJ | 0.88 | 0.76 | 0.88 | 0.93 | 0.76 | 0.74 | 0.87 |
chairman:NN | 0.79 | 0.71 | 0.67 | 0.46 | 0.71 | 0.98 | 0.28 |
NO_WORD | 0.28 | 0.82 | 0.20 | 0.17 | 0.82 | 0.11 | 0.05 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.60 Alignment.score
1.00 0.27 Alignment.isGood
-1.00 0.70 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.14 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "new" modifying "chairman"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "new" modifying "chairman" is dropped on aligned hypothesis word "chairman"
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : Everyone
-1.00 1.00 NullPunisher.other : new
-1.00 1.00 NullPunisher.other : meeting
-0.10 1.00 NullPunisher.article : a
Hand-tuned score (dot product of above): -3.9137
Threshold: -9.4738
Txt: The people who were at the meeting all voted for a new chairman.
Hyp: Everyone at the meeting voted for a new chairman. (yes)
Everyone NN |
the DT |
meeting NN |
voted VBD |
a DT |
new JJ |
chairman NN |
|
The:DT | 1.00 | 0.01 | 1.00 | 1.25 | 1.31 | 1.18 | 1.00 |
people:NNS | 0.79 | 0.71 | 0.86 | 0.80 | 0.71 | 0.99 | 1.02 |
who:WP | 1.30 | 1.04 | 1.30 | 0.97 | 1.11 | 1.39 | 1.30 |
were:VBD | 0.87 | 0.73 | 0.91 | 0.76 | 0.76 | 0.99 | 0.91 |
the:DT | 1.00 | 2.42 | 1.00 | 1.25 | 1.31 | 1.18 | 1.00 |
meeting:NN | 0.77 | 0.71 | 0.28 | 0.54 | 0.71 | 0.99 | 0.67 |
all:DT | 1.00 | 1.31 | 1.00 | 1.25 | 1.31 | 1.18 | 1.00 |
voted:VBN | 0.89 | 0.76 | 0.63 | 2.10 | 0.76 | 0.99 | 0.55 |
a:DT | 1.00 | 1.31 | 1.00 | 1.25 | 2.42 | 1.18 | 1.00 |
new:JJ | 0.88 | 0.76 | 0.88 | 0.93 | 0.76 | 0.74 | 0.87 |
chairman:NN | 0.79 | 0.71 | 0.67 | 0.46 | 0.71 | 0.98 | 0.28 |
NO_WORD | 0.28 | 0.82 | 0.20 | 0.17 | 0.82 | 0.11 | 0.05 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.60 Alignment.score
1.00 0.27 Alignment.isGood
-1.00 0.70 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.14 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "new" modifying "chairman"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "new" modifying "chairman" is dropped on aligned hypothesis word "chairman"
-1.00 1.00 NullPunisher.other : new
-1.00 1.00 NullPunisher.other : meeting
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : Everyone
-0.10 1.00 NullPunisher.article : the
Hand-tuned score (dot product of above): -3.9137
Threshold: -9.4738
Txt: The inhabitants of Cambridge voted for a Labour MP.
Hyp: Every inhabitant of Cambridge voted for a Labour MP. (don't know)
Every DT |
inhabitant NN |
Cambridge NNP |
voted VBD |
a DT |
Labour_MP NNP |
|
The:DT | 1.31 | 1.00 | 1.25 | 1.25 | 1.31 | 1.25 |
inhabitants:NNS | 0.71 | 1.17 | 1.08 | 0.82 | 0.71 | 1.15 |
Cambridge:NNP | 0.96 | 1.07 | 0.28 | 1.07 | 0.96 | 0.99 |
voted:VBD | 0.76 | 0.91 | 1.16 | 1.32 | 0.76 | 1.16 |
a:DT | 1.31 | 1.00 | 1.25 | 1.25 | 2.42 | 1.25 |
Labour_MP:NNP | 0.96 | 1.16 | 0.99 | 1.07 | 0.96 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.04 | 0.17 | 0.82 | 0.05 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.58 Alignment.score
1.00 0.27 Alignment.isGood
-1.00 0.71 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.33 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Labour_MP" modifying "voted"
-1.00 1.00 NullPunisher.other : voted
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : Every
-3.00 1.00 NullPunisher.entity : Labour_MP
-7.00 1.00 Quant.expand : Invalid quantifier strengthening: replacing the quantifier "the" by a stronger quantifier "every" does NOT preserve truth.
-2.00 1.00 RootEntailment.unalignedRoot : "voted" not aligned to anything
2.00 1.00 Location.match : Locations match between txt and hyp; both have lemma "Cambridge"
4.00 1.00 Location.match&OK.Root.poorlyOrUnAligned :
Hand-tuned score (dot product of above): -9.1256
Threshold: -9.4738
Txt: The Ancient Greeks were noted philosophers.
Hyp: Every Ancient Greek was a noted philosopher. (don't know)
Every DT |
Ancient_Greek NNP |
was VBD |
a DT |
noted JJ |
philosopher NN |
|
The:DT | 1.31 | 1.25 | 1.25 | 1.31 | 1.18 | 1.00 |
Ancient:NNP | 0.96 | 0.39 | 1.07 | 0.96 | 1.21 | 1.00 |
Greeks:NNP | 0.94 | 0.49 | 1.07 | 0.96 | 1.22 | 1.10 |
were:VBD | 0.68 | 1.16 | 0.13 | 0.76 | 0.99 | 0.91 |
noted:VBN | 0.76 | 1.16 | 0.86 | 0.76 | 2.37 | 0.88 |
philosophers:NNS | 0.71 | 1.06 | 0.82 | 0.71 | 0.93 | 1.46 |
NO_WORD | 0.82 | 0.28 | 1.43 | 0.82 | 0.11 | 0.12 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 1.32 Alignment.score
1.00 0.43 Alignment.isGood
-1.00 0.54 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.50 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "Ancient" modifying "Greeks" is dropped on aligned hypothesis word "Ancient_Greek"
-1.00 1.00 Apposition.mismatch : no apposition in text between philosopher and Ancient_Greek
-1.00 1.00 Hypernym.posNarrow : Narrowing a term (from "Greeks" to "Ancient_Greek") does NOT preserve truth in a positive context
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : Every
-0.05 1.00 NullPunisher.aux : was
-7.00 1.00 Quant.expand : Invalid quantifier strengthening: replacing the quantifier "the" by a stronger quantifier "every" does NOT preserve truth.
Hand-tuned score (dot product of above): -8.4827
Threshold: -9.4738
Txt: The Ancient Greeks were all noted philosophers.
Hyp: Every Ancient Greek was a noted philosopher. (yes)
Every DT |
Ancient_Greek NNP |
was VBD |
a DT |
noted JJ |
philosopher NN |
|
The:DT | 1.31 | 1.25 | 1.25 | 1.31 | 1.18 | 1.00 |
Ancient:NNP | 0.96 | 0.39 | 1.07 | 0.96 | 1.21 | 1.00 |
Greeks:NNP | 0.94 | 0.49 | 1.07 | 0.96 | 1.22 | 1.10 |
were:VBD | 0.68 | 1.16 | 0.13 | 0.76 | 0.99 | 0.91 |
all:RB | 1.09 | 1.47 | 0.91 | 1.09 | 1.05 | 1.21 |
noted:VBN | 0.76 | 1.16 | 0.86 | 0.76 | 2.37 | 0.88 |
philosophers:NNS | 0.71 | 1.06 | 0.82 | 0.71 | 0.93 | 1.46 |
NO_WORD | 0.82 | 0.28 | 1.43 | 0.82 | 0.11 | 0.12 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 1.32 Alignment.score
1.00 0.43 Alignment.isGood
-1.00 0.54 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.33 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "all" modifying "noted" is dropped on aligned hypothesis word "noted"
-1.00 1.00 Apposition.mismatch : no apposition in text between philosopher and Ancient_Greek
-1.00 1.00 Hypernym.posNarrow : Narrowing a term (from "Greeks" to "Ancient_Greek") does NOT preserve truth in a positive context
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : Every
-0.05 1.00 NullPunisher.aux : was
-7.00 1.00 Quant.expand : Invalid quantifier strengthening: replacing the quantifier "the" by a stronger quantifier "every" does NOT preserve truth.
Hand-tuned score (dot product of above): -8.4993
Threshold: -9.4738
Txt: Software faults were blamed for the system failure.
Hyp: The system failure was blamed on one or more software faults. (yes)
The DT |
system NN |
failure NN |
was VBD |
blamed VBN |
one CD |
or CC |
more JJR |
software NN |
faults NNS |
|
Software:NNP | 0.71 | 0.94 | 0.89 | 0.82 | 0.82 | 1.29 | 0.71 | 0.95 | 1.67 | 0.92 |
faults:NNS | 0.71 | 0.72 | 0.27 | 0.79 | 0.82 | 1.29 | 0.71 | 0.99 | 0.82 | 0.28 |
were:VBD | 0.73 | 0.91 | 0.89 | 0.13 | 0.76 | 1.36 | 0.76 | 0.91 | 0.87 | 0.91 |
blamed:VBN | 0.76 | 0.76 | 0.50 | 0.76 | 1.32 | 1.39 | 0.76 | 1.02 | 0.91 | 0.91 |
the:DT | 0.01 | 1.00 | 1.00 | 1.25 | 1.25 | 1.11 | 1.31 | 1.15 | 1.00 | 1.00 |
system:NN | 0.71 | 0.28 | 0.52 | 0.82 | 0.67 | 1.29 | 0.71 | 0.99 | 0.49 | 0.72 |
failure:NN | 0.71 | 0.52 | 0.28 | 0.82 | 0.41 | 1.29 | 0.71 | 0.97 | 0.86 | 0.27 |
NO_WORD | 0.82 | 0.05 | 0.15 | 1.57 | 0.17 | 0.52 | 0.56 | 0.15 | 0.05 | 0.02 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.42 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 9.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.10 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "faults" modifying "blamed"
-1.00 1.00 NullPunisher.other : faults
-1.00 1.00 NullPunisher.other : blamed
-0.10 1.00 NullPunisher.article : The
-3.00 1.00 NullPunisher.entity : one
-1.00 1.00 NullPunisher.other : or
-0.05 1.00 NullPunisher.aux : was
-1.00 1.00 NullPunisher.other : failure
-1.00 1.00 NullPunisher.other : system
-1.00 1.00 NullPunisher.other : more
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '>=1.0' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "blamed" not aligned to anything
Hand-tuned score (dot product of above): -19.0219
Threshold: -9.4738
Txt: Software faults were blamed for the system failure. Bug # 32-985 is a known software fault.
Hyp: Bug # 32-985 was blamed for the system failure. (don't know)
Bug NN |
# # |
32-985 CD |
was VBD |
blamed VBN |
the DT |
system NN |
failure NN |
|
Software:NNP | 0.88 | 0.96 | 1.29 | 0.82 | 0.82 | 0.71 | 0.94 | 0.89 |
faults:NNS | 0.85 | 0.96 | 1.29 | 0.79 | 0.82 | 0.71 | 0.72 | 0.27 |
were:VBD | 0.91 | 1.01 | 1.39 | 0.13 | 0.76 | 0.73 | 0.91 | 0.89 |
blamed:VBN | 0.91 | 0.96 | 1.39 | 0.76 | 1.32 | 0.76 | 0.76 | 0.50 |
the:DT | 1.00 | 1.56 | 1.18 | 1.25 | 1.25 | 2.42 | 1.00 | 1.00 |
system:NN | 0.83 | 0.96 | 1.29 | 0.82 | 0.67 | 0.71 | 0.28 | 0.52 |
failure:NN | 0.87 | 0.96 | 1.29 | 0.82 | 0.41 | 0.71 | 0.52 | 0.28 |
Bug:NN | 0.28 | 0.96 | 1.29 | 0.82 | 0.82 | 0.71 | 0.83 | 0.87 |
#:# | 1.25 | 2.42 | 0.75 | 1.51 | 1.46 | 1.56 | 1.25 | 1.25 |
32-985:CD | 1.33 | 0.46 | 0.85 | 1.30 | 1.30 | 0.89 | 1.32 | 1.33 |
is:VBZ | 0.91 | 1.01 | 1.39 | 0.05 | 0.76 | 0.76 | 0.91 | 0.91 |
a:DT | 1.00 | 1.56 | 1.18 | 1.25 | 1.25 | 1.31 | 1.00 | 1.00 |
known:JJ | 0.88 | 1.02 | 1.13 | 0.96 | 0.86 | 0.76 | 0.67 | 0.62 |
software:NN | 0.88 | 0.96 | 1.29 | 0.82 | 0.82 | 0.71 | 0.49 | 0.86 |
fault:NN | 0.85 | 0.96 | 1.25 | 0.82 | 0.53 | 0.71 | 0.73 | 0.06 |
NO_WORD | 0.15 | 0.56 | 0.26 | 1.57 | 0.17 | 0.82 | 0.05 | 0.05 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.34 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 8.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "failure" modifying "blamed"
-3.00 1.00 NullPunisher.entity : #
-1.00 1.00 NullPunisher.other : failure
-1.00 1.00 NullPunisher.other : Bug
-3.00 1.00 NullPunisher.entity : 32-985
-0.05 1.00 NullPunisher.aux : was
-1.00 1.00 NullPunisher.other : system
-1.00 1.00 NullPunisher.other : blamed
-0.10 1.00 NullPunisher.article : the
-2.00 1.00 RootEntailment.unalignedRoot : "blamed" not aligned to anything
Hand-tuned score (dot product of above): -14.1423
Threshold: -9.4738
Txt: Clients at the demonstration were all impressed by the system's performance. Smith was a client at the demonstration.
Hyp: Smith was impressed by the system's performance. (yes)
Smith NNP |
was VBD |
impressed VBN |
the DT |
system NN |
performance NN |
|
Clients:NNS | 1.01 | 0.82 | 0.82 | 0.71 | 0.87 | 0.91 |
the:DT | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 | 1.00 |
demonstration:NN | 1.17 | 0.82 | 0.72 | 0.71 | 0.84 | 0.85 |
were:VBD | 1.16 | 0.13 | 0.89 | 0.73 | 0.91 | 0.91 |
all:RB | 1.47 | 0.91 | 0.91 | 1.09 | 1.21 | 1.21 |
impressed:VBN | 1.16 | 0.89 | 1.32 | 0.76 | 0.89 | 0.76 |
the:DT | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 | 1.00 |
system:NN | 1.05 | 0.82 | 0.80 | 0.71 | 0.28 | 0.96 |
performance:NN | 1.15 | 0.82 | 0.67 | 0.71 | 0.96 | 0.28 |
Smith:NNP | 0.28 | 1.07 | 1.07 | 0.96 | 1.05 | 1.15 |
was:VBD | 1.16 | 1.32 | 0.89 | 0.76 | 0.91 | 0.91 |
a:DT | 1.25 | 1.25 | 1.25 | 1.31 | 1.00 | 1.00 |
client:NN | 1.02 | 0.82 | 0.70 | 0.71 | 0.87 | 0.88 |
the:DT | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 | 1.00 |
demonstration:NN | 1.17 | 0.82 | 0.72 | 0.71 | 0.84 | 0.85 |
NO_WORD | 0.15 | 1.57 | 0.17 | 0.82 | 0.12 | 0.21 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.39 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : impressed
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : performance
-1.00 1.00 NullPunisher.other : system
-0.05 1.00 NullPunisher.aux : was
-3.00 1.00 NullPunisher.entity : Smith
-2.00 1.00 RootEntailment.unalignedRoot : "impressed" not aligned to anything
Hand-tuned score (dot product of above): -8.8786
Threshold: -9.4738
Txt: Clients at the demonstration were impressed by the system's performance.
Hyp: Most clients at the demonstration were impressed by the system's performance. (yes)
Most JJS |
clients NNS |
the DT |
demonstration NN |
were VBD |
impressed VBN |
the DT |
system NN |
performance NN |
|
Clients:NNS | 0.99 | 1.53 | 0.71 | 0.92 | 0.82 | 0.82 | 0.71 | 0.87 | 0.91 |
the:DT | 1.18 | 1.00 | 2.42 | 1.00 | 1.22 | 1.25 | 2.42 | 1.00 | 1.00 |
demonstration:NN | 0.99 | 0.92 | 0.71 | 0.28 | 0.82 | 0.72 | 0.71 | 0.84 | 0.85 |
were:VBD | 1.02 | 0.91 | 0.73 | 0.91 | 1.32 | 0.89 | 0.73 | 0.91 | 0.91 |
impressed:VBN | 1.02 | 0.89 | 0.76 | 0.81 | 0.89 | 1.32 | 0.76 | 0.89 | 0.76 |
the:DT | 1.18 | 1.00 | 2.42 | 1.00 | 1.22 | 1.25 | 2.42 | 1.00 | 1.00 |
system:NN | 0.94 | 0.87 | 0.71 | 0.84 | 0.82 | 0.80 | 0.71 | 0.28 | 0.96 |
performance:NN | 0.99 | 0.82 | 0.71 | 0.77 | 0.82 | 0.67 | 0.71 | 0.96 | 0.28 |
NO_WORD | 0.11 | 0.15 | 0.82 | 0.20 | 1.57 | 0.17 | 0.82 | 0.12 | 0.21 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.57 Alignment.score
1.00 0.27 Alignment.isGood
-1.00 0.71 Alignment.isBad
-0.10 8.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.11 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "demonstration" modifying "clients"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "demonstration" modifying "Clients" is dropped on aligned hypothesis word "clients"
0.00 1.00 NegPolarity.hypNegWord : "Most": tag "JJS" is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : Most
-1.00 1.00 NullPunisher.other : demonstration
-0.05 1.00 NullPunisher.aux : were
-1.00 1.00 NullPunisher.other : performance
-0.10 1.00 NullPunisher.article : the
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : system
-1.00 1.00 NullPunisher.other : impressed
-2.00 1.00 RootEntailment.unalignedRoot : "impressed" not aligned to anything
Hand-tuned score (dot product of above): -8.3133
Threshold: -9.4738
Txt: University graduates make poor stock-market traders. Smith is a university graduate.
Hyp: Smith is likely to make a poor stock market trader. (yes)
Smith NNP |
is VBZ |
likely JJ |
to TO |
make VB |
a DT |
poor JJ |
stock-market NN |
trader NN |
|
University:NNP | 1.13 | 0.82 | 0.96 | 0.71 | 0.82 | 0.71 | 0.99 | 0.87 | 0.90 |
graduates:NNS | 1.02 | 0.82 | 0.90 | 0.71 | 0.79 | 0.71 | 0.92 | 0.79 | 0.71 |
make:VBP | 1.14 | 0.84 | 0.78 | 0.76 | 2.10 | 0.76 | 0.88 | 0.83 | 0.86 |
poor:JJ | 1.13 | 0.96 | 0.93 | 0.76 | 0.82 | 0.76 | 0.74 | 0.78 | 0.79 |
stock-market:NN | 1.08 | 0.82 | 0.86 | 0.71 | 0.74 | 0.71 | 0.88 | 0.28 | 0.60 |
traders:NNS | 1.06 | 0.82 | 0.61 | 0.71 | 0.80 | 0.71 | 0.99 | 0.53 | 1.22 |
Smith:NNP | 0.28 | 1.07 | 1.24 | 0.96 | 1.05 | 0.96 | 1.24 | 1.08 | 1.06 |
is:VBZ | 1.16 | 1.32 | 1.02 | 0.76 | 0.92 | 0.76 | 1.02 | 0.91 | 0.91 |
a:DT | 1.25 | 1.25 | 1.18 | 1.31 | 1.25 | 2.42 | 1.18 | 1.00 | 1.00 |
university:NN | 1.13 | 0.82 | 0.96 | 0.71 | 0.82 | 0.71 | 0.99 | 0.87 | 0.90 |
graduate:NN | 1.02 | 0.82 | 0.99 | 0.71 | 0.82 | 0.71 | 0.98 | 0.80 | 0.72 |
NO_WORD | 0.28 | 1.43 | 0.16 | 1.55 | 0.34 | 0.82 | 0.11 | 0.05 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.95 Alignment.score
1.00 0.35 Alignment.isGood
-1.00 0.62 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.33 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "poor" modifying "trader"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "poor" modifying "traders" is dropped on aligned hypothesis word "trader"
-0.10 1.00 NullPunisher.functionWord : to
-0.05 1.00 NullPunisher.aux : is
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : likely
-1.00 1.00 NullPunisher.other : poor
-3.00 1.00 NullPunisher.entity : Smith
-2.00 1.00 RootEntailment.unalignedRoot : "likely" not aligned to anything
Hand-tuned score (dot product of above): -7.4405
Threshold: -9.4738
Txt: University graduates make poor stock-market traders. Smith is a university graduate.
Hyp: Smith will make a poor stock market trader. (don't know)
Smith NNP |
will MD |
make VB |
a DT |
poor JJ |
stock-market NN |
trader NN |
|
University:NNP | 1.13 | 0.71 | 0.82 | 0.71 | 0.99 | 0.87 | 0.90 |
graduates:NNS | 1.02 | 0.71 | 0.79 | 0.71 | 0.92 | 0.79 | 0.71 |
make:VBP | 1.14 | 1.00 | 2.10 | 0.76 | 0.88 | 0.83 | 0.86 |
poor:JJ | 1.13 | 0.76 | 0.82 | 0.76 | 0.74 | 0.78 | 0.79 |
stock-market:NN | 1.08 | 0.71 | 0.74 | 0.71 | 0.88 | 0.28 | 0.60 |
traders:NNS | 1.06 | 0.71 | 0.80 | 0.71 | 0.99 | 0.53 | 1.22 |
Smith:NNP | 0.28 | 0.94 | 1.05 | 0.96 | 1.24 | 1.08 | 1.06 |
is:VBZ | 1.16 | 1.11 | 0.92 | 0.76 | 1.02 | 0.91 | 0.91 |
a:DT | 1.25 | 1.31 | 1.25 | 2.42 | 1.18 | 1.00 | 1.00 |
university:NN | 1.13 | 0.71 | 0.82 | 0.71 | 0.99 | 0.87 | 0.90 |
graduate:NN | 1.02 | 0.71 | 0.82 | 0.71 | 0.98 | 0.80 | 0.72 |
NO_WORD | 0.28 | 1.55 | 0.17 | 0.82 | 0.11 | 0.05 | 0.09 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 1.00 Alignment.score
1.00 0.36 Alignment.isGood
-1.00 0.61 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.43 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "poor" modifying "trader"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "poor" modifying "traders" is dropped on aligned hypothesis word "trader"
-0.10 1.00 NullPunisher.article : a
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : poor
-0.05 1.00 NullPunisher.aux : will
Hand-tuned score (dot product of above): -4.0655
Threshold: -9.4738
Txt: All APCOM managers have company cars. Jones is an APCOM manager.
Hyp: Jones has a company car. (yes)
Jones NNP |
has VBZ |
a DT |
company NN |
car NN |
|
All:DT | 1.25 | 1.25 | 1.31 | 1.00 | 1.00 |
APCOM:NNP | 0.95 | 1.07 | 0.96 | 1.07 | 1.07 |
managers:NNS | 0.93 | 0.82 | 0.71 | 0.85 | 0.86 |
have:VBP | 1.14 | 0.19 | 0.76 | 0.91 | 0.87 |
company:NN | 1.20 | 0.82 | 0.71 | 0.28 | 0.80 |
cars:NNS | 1.09 | 0.72 | 0.71 | 0.86 | 1.20 |
Jones:NNP | 0.28 | 1.07 | 0.96 | 1.20 | 1.12 |
is:VBZ | 1.16 | 1.01 | 0.76 | 0.91 | 0.91 |
an:DT | 1.25 | 1.21 | 1.21 | 1.00 | 0.95 |
APCOM:NNP | 0.95 | 1.07 | 0.96 | 1.07 | 1.07 |
manager:NN | 0.95 | 0.82 | 0.71 | 0.96 | 0.86 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.05 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.68 Alignment.score
1.00 0.29 Alignment.isGood
-1.00 0.69 Alignment.isBad
-0.10 1.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.80 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "APCOM" modifying "managers" is dropped on aligned hypothesis word "Jones"
-0.10 1.00 NullPunisher.article : a
Hand-tuned score (dot product of above): 0.8655
Threshold: -9.4738
Txt: All APCOM managers have company cars. Jones is an APCOM manager.
Hyp: Jones has more than one company car. (don't know)
Jones NNP |
has VBZ |
more_than IN |
one CD |
company NN |
car NN |
|
All:DT | 1.25 | 1.25 | 1.05 | 1.18 | 1.00 | 1.00 |
APCOM:NNP | 0.95 | 1.07 | 0.75 | 1.20 | 1.07 | 1.07 |
managers:NNS | 0.93 | 0.82 | 0.48 | 1.29 | 0.85 | 0.86 |
have:VBP | 1.14 | 0.19 | 1.15 | 1.36 | 0.91 | 0.87 |
company:NN | 1.20 | 0.82 | 0.47 | 1.29 | 0.28 | 0.80 |
cars:NNS | 1.09 | 0.72 | 0.50 | 1.29 | 0.86 | 1.20 |
Jones:NNP | 0.28 | 1.07 | 0.75 | 1.09 | 1.20 | 1.12 |
is:VBZ | 1.16 | 1.01 | 1.15 | 1.39 | 0.91 | 0.91 |
an:DT | 1.25 | 1.21 | 1.05 | 1.14 | 1.00 | 0.95 |
APCOM:NNP | 0.95 | 1.07 | 0.75 | 1.20 | 1.07 | 1.07 |
manager:NN | 0.95 | 0.82 | 0.50 | 1.29 | 0.96 | 0.86 |
NO_WORD | 0.28 | 0.17 | 0.84 | 0.52 | 0.05 | 0.09 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.49 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.67 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "one" modifying "car"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "APCOM" modifying "managers" is dropped on aligned hypothesis word "Jones"
-1.00 1.00 NullPunisher.other : more_than
-3.00 1.00 NullPunisher.entity : one
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '>1.0' vs ''
Hand-tuned score (dot product of above): -10.4225
Threshold: -9.4738
Txt: Just one accountant attended the meeting.
Hyp: No accountants attended the meeting. (don't know)
No DT |
accountants NNS |
attended VBD |
the DT |
meeting NN |
|
Just:RB | 1.09 | 1.21 | 0.91 | 1.09 | 1.21 |
one:CD | 0.85 | 1.33 | 1.30 | 0.82 | 1.33 |
accountant:NN | 0.71 | 1.32 | 0.77 | 0.71 | 0.87 |
attended:VBD | 0.76 | 0.82 | 1.32 | 0.76 | 0.57 |
the:DT | 1.31 | 1.00 | 1.25 | 2.42 | 1.00 |
meeting:NN | 0.71 | 0.89 | 0.48 | 0.71 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.09 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.64 Alignment.score
1.00 0.28 Alignment.isGood
-1.00 0.69 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.20 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "one" modifying "accountant" is dropped on aligned hypothesis word "accountants"
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : meeting
-1.00 1.00 NullPunisher.other : No
-1.00 1.00 NullPunisher.other : attended
-6.00 1.00 Quant.oneNo : [one,no]
-2.00 1.00 RootEntailment.unalignedRoot : "attended" not aligned to anything
Hand-tuned score (dot product of above): -10.6506
Threshold: -9.4738
Txt: Just one accountant attended the meeting.
Hyp: No accountant attended the meeting. (don't know)
No DT |
accountant NN |
attended VBD |
the DT |
meeting NN |
|
Just:RB | 1.09 | 1.21 | 0.91 | 1.09 | 1.21 |
one:CD | 0.85 | 1.33 | 1.30 | 0.82 | 1.33 |
accountant:NN | 0.71 | 0.28 | 0.77 | 0.71 | 0.87 |
attended:VBD | 0.76 | 0.86 | 1.32 | 0.76 | 0.57 |
the:DT | 1.31 | 1.00 | 1.25 | 2.42 | 1.00 |
meeting:NN | 0.71 | 0.87 | 0.48 | 0.71 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.32 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : accountant
-1.00 1.00 NullPunisher.other : attended
-1.00 1.00 NullPunisher.other : meeting
-1.00 1.00 NullPunisher.other : No
-2.00 1.00 RootEntailment.unalignedRoot : "attended" not aligned to anything
Hand-tuned score (dot product of above): -6.8134
Threshold: -9.4738
Txt: Just one accountant attended the meeting.
Hyp: Some accountants attended the meeting. (yes)
Some DT |
accountants NNS |
attended VBD |
the DT |
meeting NN |
|
Just:RB | 1.09 | 1.21 | 0.91 | 1.09 | 1.21 |
one:CD | 0.79 | 1.33 | 1.30 | 0.82 | 1.33 |
accountant:NN | 0.71 | 1.32 | 0.77 | 0.71 | 0.87 |
attended:VBD | 0.76 | 0.82 | 1.32 | 0.76 | 0.57 |
the:DT | 1.28 | 1.00 | 1.25 | 2.42 | 1.00 |
meeting:NN | 0.71 | 0.89 | 0.48 | 0.71 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.64 Alignment.score
1.00 0.28 Alignment.isGood
-1.00 0.69 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.20 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "one" modifying "accountant" is dropped on aligned hypothesis word "accountants"
-1.00 1.00 NullPunisher.other : attended
-1.00 1.00 NullPunisher.other : meeting
-1.00 1.00 NullPunisher.other : Some
-0.10 1.00 NullPunisher.article : the
1.00 1.00 Quant.equivalent : Replacing the quantifier "one" by an equivalent quantifier "some" preserves truth.
-2.00 1.00 RootEntailment.unalignedRoot : "attended" not aligned to anything
Hand-tuned score (dot product of above): -3.6506
Threshold: -9.4738
Txt: Just one accountant attended the meeting.
Hyp: Some accountant attended the meeting. (yes)
Some DT |
accountant NN |
attended VBD |
the DT |
meeting NN |
|
Just:RB | 1.09 | 1.21 | 0.91 | 1.09 | 1.21 |
one:CD | 0.79 | 1.33 | 1.30 | 0.82 | 1.33 |
accountant:NN | 0.71 | 0.28 | 0.77 | 0.71 | 0.87 |
attended:VBD | 0.76 | 0.86 | 1.32 | 0.76 | 0.57 |
the:DT | 1.28 | 1.00 | 1.25 | 2.42 | 1.00 |
meeting:NN | 0.71 | 0.87 | 0.48 | 0.71 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.32 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : accountant
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : meeting
-1.00 1.00 NullPunisher.other : attended
-1.00 1.00 NullPunisher.other : Some
-2.00 1.00 RootEntailment.unalignedRoot : "attended" not aligned to anything
Hand-tuned score (dot product of above): -6.8134
Threshold: -9.4738
Txt: Just one accountant attended the meeting.
Hyp: Some accountants attended the meeting. (don't know)
Some DT |
accountants NNS |
attended VBD |
the DT |
meeting NN |
|
Just:RB | 1.09 | 1.21 | 0.91 | 1.09 | 1.21 |
one:CD | 0.79 | 1.33 | 1.30 | 0.82 | 1.33 |
accountant:NN | 0.71 | 1.32 | 0.77 | 0.71 | 0.87 |
attended:VBD | 0.76 | 0.82 | 1.32 | 0.76 | 0.57 |
the:DT | 1.28 | 1.00 | 1.25 | 2.42 | 1.00 |
meeting:NN | 0.71 | 0.89 | 0.48 | 0.71 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.64 Alignment.score
1.00 0.28 Alignment.isGood
-1.00 0.69 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.20 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "one" modifying "accountant" is dropped on aligned hypothesis word "accountants"
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : attended
-1.00 1.00 NullPunisher.other : meeting
-1.00 1.00 NullPunisher.other : Some
1.00 1.00 Quant.equivalent : Replacing the quantifier "one" by an equivalent quantifier "some" preserves truth.
-2.00 1.00 RootEntailment.unalignedRoot : "attended" not aligned to anything
Hand-tuned score (dot product of above): -3.6506
Threshold: -9.4738
Txt: Just one accountant attended the meeting.
Hyp: Some accountant attended the meeting. (yes)
Some DT |
accountant NN |
attended VBD |
the DT |
meeting NN |
|
Just:RB | 1.09 | 1.21 | 0.91 | 1.09 | 1.21 |
one:CD | 0.79 | 1.33 | 1.30 | 0.82 | 1.33 |
accountant:NN | 0.71 | 0.28 | 0.77 | 0.71 | 0.87 |
attended:VBD | 0.76 | 0.86 | 1.32 | 0.76 | 0.57 |
the:DT | 1.28 | 1.00 | 1.25 | 2.42 | 1.00 |
meeting:NN | 0.71 | 0.87 | 0.48 | 0.71 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.32 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : Some
-1.00 1.00 NullPunisher.other : attended
-1.00 1.00 NullPunisher.other : accountant
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : meeting
-2.00 1.00 RootEntailment.unalignedRoot : "attended" not aligned to anything
Hand-tuned score (dot product of above): -6.8134
Threshold: -9.4738
Txt: Smith signed one contract. Jones signed another contract.
Hyp: Smith and Jones signed two contracts. (yes)
Smith NNP |
Jones NNP |
signed VBD |
two CD |
contracts NNS |
|
Smith:NNP | 0.28 | 0.70 | 1.05 | 1.20 | 1.16 |
signed:VBD | 1.14 | 1.10 | 1.32 | 1.26 | 0.57 |
one:CD | 1.24 | 1.13 | 1.28 | 0.48 | 1.33 |
contract:NN | 1.16 | 1.16 | 0.36 | 1.21 | 1.44 |
Jones:NNP | 0.70 | 0.28 | 1.01 | 1.20 | 1.15 |
signed:VBD | 1.14 | 1.10 | 1.32 | 1.26 | 0.57 |
another:DT | 1.22 | 1.22 | 1.24 | 1.18 | 1.00 |
contract:NN | 1.16 | 1.16 | 0.36 | 1.21 | 1.44 |
NO_WORD | 0.28 | 0.01 | 0.17 | 0.52 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.31 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.40 Alignment.txtSpan
-3.00 1.00 NullPunisher.entity : Jones
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : signed
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs '1.0'
-2.00 1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -15.2891
Threshold: -9.4738
Txt: Smith signed two contracts. Jones signed two contracts.
Hyp: Smith and Jones signed two contracts. (yes)
Smith NNP |
Jones NNP |
signed VBD |
two CD |
contracts NNS |
|
Smith:NNP | 0.28 | 0.70 | 1.05 | 1.20 | 1.16 |
signed:VBD | 1.14 | 1.10 | 1.32 | 1.26 | 0.57 |
two:CD | 1.24 | 1.24 | 1.17 | 0.85 | 1.15 |
contracts:NNS | 1.16 | 1.15 | 0.48 | 1.11 | 0.28 |
Jones:NNP | 0.70 | 0.28 | 1.01 | 1.20 | 1.15 |
signed:VBD | 1.14 | 1.10 | 1.32 | 1.26 | 0.57 |
two:CD | 1.24 | 1.24 | 1.17 | 0.85 | 1.15 |
contracts:NNS | 1.16 | 1.15 | 0.48 | 1.11 | 0.28 |
NO_WORD | 0.28 | 0.01 | 0.17 | 0.52 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 -0.11 Alignment.score
1.00 0.16 Alignment.isGood
-1.00 0.83 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "two" modifying "contracts"
-3.00 1.00 NullPunisher.entity : Smith
-3.00 1.00 NullPunisher.entity : two
-1.00 1.00 NullPunisher.other : signed
-3.00 1.00 NullPunisher.entity : Jones
-1.00 1.00 NullPunisher.other : contracts
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -21.2778
Threshold: -9.4738
Txt: Smith signed two contracts. Jones also signed them.
Hyp: Smith and Jones signed two contracts. (yes)
Smith NNP |
Jones NNP |
signed VBD |
two CD |
contracts NNS |
|
Smith:NNP | 0.28 | 0.70 | 1.05 | 1.20 | 1.16 |
signed:VBD | 1.14 | 1.10 | 1.32 | 1.26 | 0.57 |
two:CD | 1.24 | 1.24 | 1.17 | 0.85 | 1.15 |
contracts:NNS | 1.16 | 1.15 | 0.48 | 1.11 | 0.28 |
Jones:NNP | 0.70 | 0.28 | 1.01 | 1.20 | 1.15 |
also:RB | 1.47 | 1.47 | 0.91 | 1.33 | 1.21 |
signed:VBD | 1.14 | 1.10 | 1.32 | 1.26 | 0.57 |
them:PRP | 1.55 | 2.74 | 1.21 | 1.43 | 1.30 |
NO_WORD | 0.28 | 0.01 | 0.17 | 0.52 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 -0.11 Alignment.score
1.00 0.16 Alignment.isGood
-1.00 0.83 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "two" modifying "contracts"
-1.00 1.00 NullPunisher.other : signed
-1.00 1.00 NullPunisher.other : contracts
-3.00 1.00 NullPunisher.entity : Smith
-3.00 1.00 NullPunisher.entity : Jones
-3.00 1.00 NullPunisher.entity : two
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -21.2778
Threshold: -9.4738
Txt: Mary used her workstation.
Hyp: Mary's workstation was used. (yes)
Mary NNP |
workstation NN |
was VBD |
used VBN |
|
Mary:NNP | 0.28 | 1.04 | 1.04 | 1.07 |
used:VBD | 1.16 | 0.69 | 0.93 | 1.57 |
her:PRP$ | 2.73 | 1.30 | 1.27 | 0.94 |
workstation:NN | 1.04 | 0.28 | 0.82 | 0.60 |
NO_WORD | 0.12 | 0.15 | 1.57 | 0.17 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.89 Alignment.score
1.00 0.33 Alignment.isGood
-1.00 0.64 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.50 Alignment.txtSpan
-0.05 1.00 NullPunisher.aux : was
-3.00 1.00 NullPunisher.entity : Mary
Hand-tuned score (dot product of above): -2.5091
Threshold: -9.4738
Txt: Mary used her workstation.
Hyp: Mary has a workstation. (yes)
Mary NNP |
has VBZ |
a DT |
workstation NN |
|
Mary:NNP | 0.28 | 1.04 | 0.96 | 1.04 |
used:VBD | 1.16 | 0.79 | 0.76 | 0.69 |
her:PRP$ | 2.73 | 1.20 | 1.41 | 1.30 |
workstation:NN | 1.04 | 0.82 | 0.71 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.25 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.24 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-0.05 1.00 NullPunisher.aux : has
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : workstation
-3.00 1.00 NullPunisher.entity : Mary
-2.00 1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -6.8749
Threshold: -9.4738
Txt: Mary used her workstation.
Hyp: Mary is female. (yes)
Mary NNP |
is VBZ |
female JJ |
|
Mary:NNP | 0.28 | 1.07 | 1.20 |
used:VBD | 1.16 | 0.93 | 1.02 |
her:PRP$ | 2.73 | 0.97 | 1.39 |
workstation:NN | 1.04 | 0.82 | 0.99 |
NO_WORD | 0.28 | 1.43 | 0.16 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.44 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : Mary
-1.00 1.00 NullPunisher.other : female
-0.05 1.00 NullPunisher.aux : is
-2.00 1.00 RootEntailment.unalignedRoot : "female" not aligned to anything
Hand-tuned score (dot product of above): -6.4057
Threshold: -9.4738
Txt: Every student used her workstation. Mary is a student.
Hyp: Mary used her workstation. (yes)
Mary NNP |
used VBD |
her PRP$ |
workstation NN |
|
Every:DT | 1.18 | 1.23 | 1.11 | 1.00 |
student:NN | 1.03 | 0.63 | 1.58 | 0.79 |
used:VBD | 1.16 | 1.32 | 0.51 | 0.69 |
her:PRP$ | 1.52 | 1.28 | 1.05 | 1.30 |
workstation:NN | 1.04 | 0.60 | 1.58 | 0.28 |
Mary:NNP | 0.28 | 1.07 | 1.80 | 1.04 |
is:VBZ | 1.16 | 0.93 | 0.54 | 0.91 |
a:DT | 1.25 | 1.25 | 1.16 | 1.00 |
student:NN | 1.03 | 0.63 | 1.58 | 0.79 |
NO_WORD | 0.28 | 0.17 | 0.93 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.23 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : Mary
-0.10 1.00 NullPunisher.functionWord : her
-1.00 1.00 NullPunisher.other : used
-1.00 1.00 NullPunisher.other : workstation
-2.00 1.00 RootEntailment.unalignedRoot : "used" not aligned to anything
Hand-tuned score (dot product of above): -7.8399
Threshold: -9.4738
Txt: Every student used her workstation. Mary is a student.
Hyp: Mary has a workstation. (yes)
Mary NNP |
has VBZ |
a DT |
workstation NN |
|
Every:DT | 1.18 | 1.25 | 1.31 | 1.00 |
student:NN | 1.03 | 0.82 | 0.71 | 0.79 |
used:VBD | 1.16 | 0.79 | 0.76 | 0.69 |
her:PRP$ | 1.52 | 0.90 | 1.11 | 1.30 |
workstation:NN | 1.04 | 0.82 | 0.71 | 0.28 |
Mary:NNP | 0.28 | 1.04 | 0.96 | 1.04 |
is:VBZ | 1.16 | 1.01 | 0.76 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 |
student:NN | 1.03 | 0.82 | 0.71 | 0.79 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.25 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.24 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : Mary
-0.05 1.00 NullPunisher.aux : has
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : workstation
-2.00 1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -6.8749
Threshold: -9.4738
Txt: No student used her workstation. Mary is a student.
Hyp: Mary used a workstation. (don't know)
Mary NNP |
used VBD |
a DT |
workstation NN |
|
No:DT | 1.25 | 1.25 | 1.32 | 1.00 |
student:NN | 1.03 | 0.63 | 0.71 | 0.79 |
used:VBD | 1.16 | 1.32 | 0.76 | 0.69 |
her:PRP$ | 1.52 | 1.28 | 1.11 | 1.30 |
workstation:NN | 1.04 | 0.60 | 0.71 | 0.28 |
Mary:NNP | 0.28 | 1.07 | 0.96 | 1.04 |
is:VBZ | 1.16 | 0.93 | 0.76 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 |
student:NN | 1.03 | 0.63 | 0.71 | 0.79 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.20 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-0.10 1.00 NullPunisher.article : a
-3.00 1.00 NullPunisher.entity : Mary
-1.00 1.00 NullPunisher.other : workstation
-1.00 1.00 NullPunisher.other : used
-2.00 1.00 RootEntailment.unalignedRoot : "used" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738
Txt: Smith attended a meeting. She chaired it.
Hyp: Smith chaired a meeting. (yes)
Smith NNP |
chaired VBD |
a DT |
meeting NN |
|
Smith:NNP | 0.28 | 1.07 | 0.96 | 1.15 |
attended:VBD | 1.16 | 0.43 | 0.76 | 0.57 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 |
meeting:NN | 1.15 | 0.52 | 0.71 | 0.28 |
She:PRP | 2.76 | 0.97 | 1.11 | 1.30 |
chaired:VBD | 1.16 | 1.32 | 0.76 | 0.61 |
it:PRP | 1.52 | 0.97 | 1.10 | 1.30 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.20 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : meeting
-3.00 1.00 NullPunisher.entity : Smith
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : chaired
-2.00 1.00 RootEntailment.unalignedRoot : "chaired" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738
Txt: Smith delivered a report to ITEL. She also delivered them an invoice. And she delivered them a project proposal.
Hyp: Smith delivered a report, an invoice and a project proposal to ITEL. (yes)
Smith NNP |
delivered VBD |
a DT |
report NN |
an DT |
invoice NN |
a DT |
project NN |
proposal NN |
to TO |
ITEL NNP |
|
Smith:NNP | 0.28 | 1.07 | 0.96 | 1.14 | 0.96 | 1.15 | 0.96 | 1.18 | 1.16 | 0.96 | 0.93 |
delivered:VBD | 1.16 | 1.32 | 0.76 | 0.82 | 0.76 | 0.65 | 0.76 | 0.91 | 0.89 | 0.76 | 1.16 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.21 | 1.00 | 2.42 | 1.00 | 1.00 | 1.31 | 1.25 |
report:NN | 1.14 | 0.73 | 0.71 | 0.28 | 0.71 | 0.54 | 0.71 | 0.87 | 0.76 | 0.71 | 1.07 |
to:TO | 1.25 | 1.25 | 1.31 | 1.00 | 1.40 | 1.00 | 1.31 | 1.00 | 1.00 | 2.42 | 1.25 |
ITEL:NNP | 0.93 | 1.07 | 0.96 | 1.07 | 0.96 | 1.07 | 0.96 | 1.07 | 1.07 | 0.96 | 0.28 |
She:PRP | 2.76 | 0.97 | 1.11 | 1.30 | 1.11 | 1.30 | 1.11 | 1.30 | 1.30 | 1.35 | 1.52 |
also:RB | 1.47 | 0.91 | 1.09 | 1.21 | 1.09 | 1.21 | 1.09 | 1.21 | 1.21 | 1.09 | 1.47 |
delivered:VBD | 1.16 | 1.32 | 0.76 | 0.82 | 0.76 | 0.65 | 0.76 | 0.91 | 0.89 | 0.76 | 1.16 |
them:PRP | 2.76 | 0.97 | 1.11 | 1.30 | 1.11 | 1.30 | 1.11 | 1.30 | 1.30 | 1.35 | 1.50 |
an:DT | 1.25 | 1.25 | 1.21 | 1.00 | 2.42 | 1.00 | 1.21 | 1.00 | 1.00 | 1.40 | 1.25 |
invoice:NN | 1.15 | 0.56 | 0.71 | 0.54 | 0.71 | 0.28 | 0.71 | 0.93 | 0.73 | 0.71 | 1.07 |
she:PRP | 2.76 | 0.97 | 1.11 | 1.30 | 1.11 | 1.30 | 1.11 | 1.30 | 1.30 | 1.35 | 1.52 |
delivered:VBD | 1.16 | 1.32 | 0.76 | 0.82 | 0.76 | 0.65 | 0.76 | 0.91 | 0.89 | 0.76 | 1.16 |
them:PRP | 2.76 | 0.97 | 1.11 | 1.30 | 1.11 | 1.30 | 1.11 | 1.30 | 1.30 | 1.35 | 1.50 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.21 | 1.00 | 2.42 | 1.00 | 1.00 | 1.31 | 1.25 |
project:NN | 1.18 | 0.82 | 0.71 | 0.87 | 0.71 | 0.93 | 0.71 | 0.28 | 0.77 | 0.71 | 1.07 |
proposal:NN | 1.16 | 0.80 | 0.71 | 0.76 | 0.71 | 0.73 | 0.71 | 0.77 | 0.28 | 0.71 | 1.07 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.82 | 0.01 | 0.82 | 0.05 | 0.01 | 0.56 | 0.25 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.29 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 11.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "project" modifying "proposal"
-3.00 1.00 NullPunisher.entity : Smith
-0.10 1.00 NullPunisher.functionWord : to
-0.10 1.00 NullPunisher.article : an
-3.00 1.00 NullPunisher.entity : ITEL
-1.00 1.00 NullPunisher.other : report
-1.00 1.00 NullPunisher.other : invoice
-1.00 1.00 NullPunisher.other : proposal
-1.00 1.00 NullPunisher.other : delivered
-0.10 1.00 NullPunisher.article : a
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : project
-2.00 1.00 RootEntailment.unalignedRoot : "delivered" not aligned to anything
Hand-tuned score (dot product of above): -15.7571
Threshold: -9.4738
Txt: Every committee has a chairman. He is appointed its members.
Hyp: Every committee has a chairman appointed by members of the committee. (yes)
Every DT |
committee NN |
has VBZ |
a DT |
chairman NN |
appointed VBN |
members NNS |
the DT |
committee NN |
|
Every:DT | 2.42 | 1.00 | 1.25 | 1.31 | 1.00 | 1.25 | 0.92 | 1.31 | 1.00 |
committee:NN | 0.71 | 0.28 | 0.82 | 0.71 | 0.39 | 0.71 | 0.39 | 0.71 | 0.28 |
has:VBZ | 0.76 | 0.91 | 1.32 | 0.76 | 0.91 | 0.77 | 0.91 | 0.76 | 0.91 |
a:DT | 1.31 | 1.00 | 1.25 | 2.42 | 1.00 | 1.25 | 1.06 | 1.31 | 1.00 |
chairman:NN | 0.71 | 0.39 | 0.82 | 0.71 | 0.28 | 0.31 | 0.52 | 0.71 | 0.39 |
He:PRP | 1.11 | 2.50 | 0.93 | 1.11 | 1.56 | 1.37 | 1.43 | 1.25 | 2.50 |
is:VBZ | 0.76 | 0.91 | 1.01 | 0.76 | 0.91 | 0.81 | 0.91 | 0.76 | 0.91 |
appointed:VBN | 0.76 | 0.80 | 0.77 | 0.76 | 0.40 | 1.32 | 0.81 | 0.76 | 0.80 |
its:PRP$ | 1.26 | 1.45 | 1.20 | 1.25 | 2.50 | 0.97 | 1.30 | 1.26 | 1.45 |
members:NNS | 0.63 | 0.39 | 0.82 | 0.77 | 0.52 | 0.72 | 0.28 | 0.77 | 0.39 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.09 | 0.16 | 0.30 | 0.82 | 0.04 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.26 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 8.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.11 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "committee" modifying "members"
-1.00 1.00 NullPunisher.other : committee
-1.00 1.00 NullPunisher.other : Every
-0.05 1.00 NullPunisher.aux : has
-1.00 1.00 NullPunisher.other : chairman
-0.10 1.00 NullPunisher.article : a
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : appointed
-1.00 1.00 NullPunisher.other : committee
-2.00 1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -9.2357
Threshold: -9.4738
Txt: ITEL has sent most of the reports Smith needs. They are on her desk.
Hyp: There are some reports from ITEL on Smith's desk. (yes)
There EX |
are VBP |
some DT |
reports NNS |
ITEL NNP |
Smith NNP |
desk NN |
|
ITEL:NNP | 0.94 | 1.04 | 0.96 | 1.07 | 0.28 | 0.93 | 1.07 |
has:VBZ | 0.76 | 1.05 | 0.76 | 0.91 | 1.16 | 1.16 | 0.87 |
sent:VBN | 0.73 | 0.88 | 0.70 | 0.55 | 1.16 | 1.09 | 0.66 |
most:JJS | 0.76 | 0.96 | 0.70 | 0.86 | 1.13 | 1.11 | 0.82 |
of:IN | 1.05 | 1.24 | 1.05 | 1.31 | 1.57 | 1.57 | 1.31 |
the:DT | 1.20 | 1.18 | 1.28 | 1.00 | 1.22 | 1.25 | 1.00 |
reports:NNS | 0.71 | 0.82 | 0.71 | 0.28 | 1.07 | 1.14 | 0.73 |
Smith:NNP | 0.96 | 1.07 | 0.94 | 1.14 | 0.93 | 0.28 | 1.08 |
needs:VBZ | 0.71 | 0.82 | 0.76 | 0.87 | 1.14 | 1.16 | 0.83 |
They:PRP | 0.13 | 0.94 | 1.11 | 1.30 | 1.50 | 1.55 | 1.30 |
are:VBP | 0.70 | 1.32 | 0.73 | 0.91 | 1.13 | 1.16 | 0.91 |
her:PRP$ | 1.00 | 0.97 | 1.11 | 1.30 | 2.73 | 1.55 | 1.26 |
desk:NN | 0.68 | 0.82 | 0.71 | 0.73 | 1.07 | 1.08 | 0.28 |
NO_WORD | 0.29 | 0.17 | 0.82 | 0.28 | 0.15 | 0.12 | 0.02 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.14 Alignment.score
1.00 0.19 Alignment.isGood
-1.00 0.79 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "desk" modifying "ITEL"
-0.05 1.00 NullPunisher.aux : are
-1.00 1.00 NullPunisher.other : reports
-0.10 1.00 NullPunisher.functionWord : There
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : desk
-1.00 1.00 NullPunisher.other : some
-3.00 1.00 NullPunisher.entity : ITEL
-2.00 1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -13.3141
Threshold: -9.4738
Txt: Two out of ten machines are missing. They have been removed.
Hyp: Two machines have been removed. (yes)
Two CD |
machines NNS |
have VBP |
been VBN |
removed VBN |
|
Two:CD | 0.85 | 1.33 | 1.30 | 1.30 | 1.30 |
ten:NN | 1.22 | 0.88 | 0.82 | 0.72 | 0.74 |
machines:NNS | 1.29 | 0.28 | 0.82 | 0.82 | 0.80 |
are:VBP | 1.39 | 0.91 | 0.89 | 0.13 | 0.91 |
missing:VBG | 1.39 | 0.83 | 1.83 | 0.82 | 0.42 |
They:PRP | 1.43 | 2.50 | 1.27 | 1.07 | 1.12 |
have:VBP | 1.39 | 0.91 | 1.32 | 0.98 | 0.83 |
been:VBN | 1.39 | 0.91 | 0.98 | 1.32 | 0.89 |
removed:VBN | 1.39 | 0.89 | 0.83 | 0.89 | 1.32 |
NO_WORD | 0.52 | 0.15 | 1.30 | 1.57 | 0.17 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.47 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Two" modifying "machines"
-0.05 1.00 NullPunisher.aux : have
-3.00 1.00 NullPunisher.entity : Two
-1.00 1.00 NullPunisher.other : removed
-0.05 1.00 NullPunisher.aux : been
-1.00 1.00 NullPunisher.other : machines
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "removed" not aligned to anything
Hand-tuned score (dot product of above): -14.6058
Threshold: -9.4738
Txt: Two out of ten machines are missing. They have been removed.
Hyp: Eight machines have been removed. (don't know)
Eight CD |
machines NNS |
have VBP |
been VBN |
removed VBN |
|
Two:CD | 0.35 | 1.33 | 1.30 | 1.30 | 1.30 |
ten:NN | 1.29 | 0.88 | 0.82 | 0.72 | 0.74 |
machines:NNS | 1.29 | 0.28 | 0.82 | 0.82 | 0.80 |
are:VBP | 1.39 | 0.91 | 0.89 | 0.13 | 0.91 |
missing:VBG | 1.39 | 0.83 | 1.83 | 0.82 | 0.42 |
They:PRP | 1.61 | 2.50 | 1.27 | 1.07 | 1.12 |
have:VBP | 1.39 | 0.91 | 1.32 | 0.98 | 0.83 |
been:VBN | 1.39 | 0.91 | 0.98 | 1.32 | 0.89 |
removed:VBN | 1.39 | 0.89 | 0.83 | 0.89 | 1.32 |
NO_WORD | 0.52 | 0.15 | 1.30 | 1.57 | 0.17 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.51 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.20 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "machines" modifying "Two" is dropped on aligned hypothesis word "Eight"
-0.05 1.00 NullPunisher.aux : been
-1.00 1.00 NullPunisher.other : machines
-1.00 1.00 NullPunisher.other : removed
-0.05 1.00 NullPunisher.aux : have
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '8.0' vs '2.0'
-2.00 1.00 RootEntailment.unalignedRoot : "removed" not aligned to anything
Hand-tuned score (dot product of above): -9.8394
Threshold: -9.4738
Txt: Two out of ten machines are missing. They were all here yesterday.
Hyp: Ten machines were here yesterday. (yes)
Ten CD |
machines NNS |
were VBD |
here RB |
yesterday NN |
|
Two:CD | 0.28 | 1.33 | 1.30 | 1.30 | 1.33 |
ten:NN | 0.00 | 0.88 | 0.78 | 1.27 | 0.78 |
machines:NNS | 1.29 | 0.28 | 0.82 | 1.31 | 0.96 |
are:VBP | 1.39 | 0.91 | 0.11 | 1.02 | 0.91 |
missing:VBG | 1.39 | 0.83 | 0.82 | 1.11 | 0.91 |
They:PRP | 1.52 | 2.50 | 1.12 | 1.29 | 1.30 |
were:VBD | 1.36 | 0.91 | 1.32 | 0.17 | 0.91 |
all:DT | 1.18 | 1.00 | 1.25 | 1.45 | 1.00 |
here:RB | 1.33 | 1.21 | 0.37 | 1.23 | 1.21 |
yesterday:NN | 1.29 | 0.96 | 0.82 | 1.31 | 0.28 |
NO_WORD | 0.52 | 0.28 | 0.31 | 0.04 | 0.12 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.08 Alignment.score
1.00 0.18 Alignment.isGood
-1.00 0.80 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.60 Alignment.txtSpan
-1.00 1.00 NullPunisher.other : yesterday
-1.00 1.00 NullPunisher.other : machines
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '10.0' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "yesterday" not aligned to anything
Hand-tuned score (dot product of above): -10.4837
Threshold: -9.4738
Txt: Smith took a machine on Tuesday, and Jones took a machine on Wednesday. They put them in the lobby.
Hyp: Smith and Jones put two machines in the lobby. (yes)
Smith NNP |
Jones NNP |
put VBD |
two CD |
machines NNS |
the DT |
lobby NN |
|
Smith:NNP | 0.28 | 0.70 | 1.07 | 1.20 | 1.11 | 0.96 | 1.06 |
took:VBD | 1.16 | 1.14 | 0.24 | 0.93 | 0.91 | 0.73 | 0.78 |
a:DT | 1.25 | 1.25 | 1.25 | 1.18 | 1.00 | 1.31 | 1.00 |
machine:NN | 1.11 | 1.13 | 0.54 | 1.29 | 1.40 | 0.71 | 0.81 |
Tuesday:NNP | 1.04 | 1.05 | 1.07 | 1.20 | 1.18 | 0.96 | 1.12 |
Jones:NNP | 0.70 | 0.28 | 1.07 | 1.20 | 1.08 | 0.96 | 1.03 |
took:VBD | 1.16 | 1.14 | 0.24 | 0.93 | 0.91 | 0.73 | 0.78 |
a:DT | 1.25 | 1.25 | 1.25 | 1.18 | 1.00 | 1.31 | 1.00 |
machine:NN | 1.11 | 1.13 | 0.54 | 1.29 | 1.40 | 0.71 | 0.81 |
Wednesday:NNP | 1.03 | 1.01 | 1.07 | 1.20 | 1.16 | 0.96 | 1.11 |
They:PRP | 1.73 | 1.70 | 0.97 | 1.43 | 2.33 | 1.30 | 1.27 |
put:VBD | 1.16 | 1.16 | 1.32 | 1.35 | 0.85 | 0.76 | 0.74 |
them:PRP | 1.73 | 1.70 | 0.97 | 1.43 | 2.33 | 0.80 | 1.30 |
the:DT | 1.25 | 1.25 | 1.25 | 1.11 | 1.00 | 2.42 | 1.00 |
lobby:NN | 1.06 | 1.03 | 0.65 | 1.29 | 0.81 | 0.71 | 0.28 |
NO_WORD | 0.28 | 0.01 | 0.17 | 0.52 | 0.09 | 0.82 | 0.07 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.22 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.14 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "lobby" modifying "put"
-3.00 1.00 NullPunisher.entity : Smith
-3.00 1.00 NullPunisher.entity : two
-1.00 1.00 NullPunisher.other : lobby
-1.00 1.00 NullPunisher.other : put
-0.10 1.00 NullPunisher.article : the
-3.00 1.00 NullPunisher.entity : Jones
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "put" not aligned to anything
Hand-tuned score (dot product of above): -20.9395
Threshold: -9.4738
Txt: John and his colleagues went to a meeting. They hated it.
Hyp: John's colleagues hated the meeting. (yes)
John NNP |
colleagues NNS |
hated VBZ |
the DT |
meeting NN |
|
John:NNP | 0.28 | 1.08 | 1.07 | 0.93 | 1.18 |
his:PRP$ | 2.76 | 1.30 | 0.97 | 1.41 | 1.30 |
colleagues:NNS | 1.08 | 0.28 | 0.80 | 0.71 | 0.60 |
went:VBD | 1.16 | 0.81 | 0.19 | 0.76 | 0.86 |
a:DT | 1.25 | 1.00 | 1.25 | 1.31 | 1.00 |
meeting:NN | 1.18 | 0.60 | 0.82 | 0.71 | 0.28 |
They:PRP | 1.67 | 2.50 | 1.19 | 1.07 | 1.42 |
hated:VBZ | 1.16 | 0.89 | 1.32 | 0.76 | 0.91 |
it:PRP | 1.55 | 1.30 | 0.97 | 1.11 | 1.30 |
NO_WORD | 0.12 | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.18 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : John
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : colleagues
-1.00 1.00 NullPunisher.other : meeting
-1.00 1.00 NullPunisher.other : hated
-2.00 1.00 RootEntailment.unalignedRoot : "hated" not aligned to anything
Hand-tuned score (dot product of above): -9.0010
Threshold: -9.4738
Txt: John and his colleagues went to a meeting. They hated it.
Hyp: John hated the meeting. (yes)
John NNP |
hated NNP |
the DT |
meeting NN |
|
John:NNP | 0.28 | 1.07 | 0.93 | 1.18 |
his:PRP$ | 2.76 | 1.30 | 1.41 | 1.30 |
colleagues:NNS | 1.08 | 0.80 | 0.71 | 0.60 |
went:VBD | 1.16 | 0.40 | 0.76 | 0.86 |
a:DT | 1.25 | 1.00 | 1.31 | 1.00 |
meeting:NN | 1.18 | 0.82 | 0.71 | 0.28 |
They:PRP | 1.67 | 1.51 | 1.07 | 1.42 |
hated:VBZ | 1.16 | 1.75 | 0.76 | 0.91 |
it:PRP | 1.55 | 1.30 | 1.11 | 1.30 |
NO_WORD | 0.05 | 0.12 | 0.82 | 0.25 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.69 Alignment.score
1.00 0.29 Alignment.isGood
-1.00 0.68 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "John" modifying "hated"
-1.00 1.00 NullPunisher.other : meeting
-3.00 1.00 NullPunisher.entity : John
-0.10 1.00 NullPunisher.article : the
Hand-tuned score (dot product of above): -4.9782
Threshold: -9.4738
Txt: John and his colleagues went to a meeting. They hated it.
Hyp: John hated the meeting. (don't know)
John NNP |
hated NNP |
the DT |
meeting NN |
|
John:NNP | 0.28 | 1.07 | 0.93 | 1.18 |
his:PRP$ | 2.76 | 1.30 | 1.41 | 1.30 |
colleagues:NNS | 1.08 | 0.80 | 0.71 | 0.60 |
went:VBD | 1.16 | 0.40 | 0.76 | 0.86 |
a:DT | 1.25 | 1.00 | 1.31 | 1.00 |
meeting:NN | 1.18 | 0.82 | 0.71 | 0.28 |
They:PRP | 1.67 | 1.51 | 1.07 | 1.42 |
hated:VBZ | 1.16 | 1.75 | 0.76 | 0.91 |
it:PRP | 1.55 | 1.30 | 1.11 | 1.30 |
NO_WORD | 0.05 | 0.12 | 0.82 | 0.25 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.69 Alignment.score
1.00 0.29 Alignment.isGood
-1.00 0.68 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "John" modifying "hated"
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : meeting
-3.00 1.00 NullPunisher.entity : John
Hand-tuned score (dot product of above): -4.9782
Threshold: -9.4738
Txt: Each department has a dedicated line. They rent them from BT.
Hyp: Every department rents a line from BT. (yes)
Every DT |
department NN |
rents VBZ |
a DT |
line NN |
BT NNP |
|
Each:DT | 1.28 | 1.00 | 1.23 | 1.31 | 1.00 | 1.25 |
department:NN | 0.71 | 0.28 | 0.80 | 0.71 | 0.79 | 1.07 |
has:VBZ | 0.76 | 0.91 | 0.70 | 0.76 | 0.91 | 1.16 |
a:DT | 1.31 | 1.00 | 1.25 | 2.42 | 1.00 | 1.25 |
dedicated:JJ | 0.76 | 0.82 | 0.93 | 0.76 | 0.83 | 1.13 |
line:NN | 0.71 | 0.79 | 0.71 | 0.71 | 0.28 | 1.07 |
They:PRP | 1.04 | 1.30 | 0.97 | 1.11 | 2.50 | 1.55 |
rent:VB | 0.73 | 0.87 | 1.51 | 0.76 | 0.75 | 1.16 |
them:PRP | 1.08 | 1.30 | 0.97 | 1.11 | 2.50 | 1.55 |
BT:NNP | 0.96 | 1.07 | 1.07 | 0.96 | 1.07 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.09 | 0.15 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.47 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "BT" modifying "rents"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "them" modifying "rent" is dropped on aligned hypothesis word "rents"
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : line
-1.00 1.00 NullPunisher.other : department
-3.00 1.00 NullPunisher.entity : BT
-1.00 1.00 NullPunisher.other : Every
Hand-tuned score (dot product of above): -6.9986
Threshold: -9.4738
Txt: Each department has a dedicated line. The sales department rents it from BT.
Hyp: The sales department rents a line from BT. (yes)
The DT |
sales_department NN |
rents VBZ |
a DT |
line NN |
BT NNP |
|
Each:DT | 1.31 | 1.00 | 1.23 | 1.31 | 1.00 | 1.25 |
department:NN | 0.71 | 1.41 | 0.80 | 0.71 | 0.79 | 1.07 |
has:VBZ | 0.76 | 0.91 | 0.70 | 0.76 | 0.91 | 1.16 |
a:DT | 1.31 | 1.00 | 1.25 | 2.42 | 1.00 | 1.25 |
dedicated:JJ | 0.76 | 0.85 | 0.93 | 0.76 | 0.83 | 1.13 |
line:NN | 0.67 | 0.76 | 0.71 | 0.71 | 0.28 | 1.07 |
The:DT | 2.42 | 1.00 | 1.25 | 1.31 | 0.96 | 1.25 |
sales_department:NN | 0.71 | 0.28 | 0.82 | 0.71 | 0.76 | 1.07 |
rents:VBZ | 0.76 | 0.91 | 1.32 | 0.76 | 0.80 | 1.16 |
it:PRP | 1.18 | 2.50 | 1.20 | 1.18 | 1.45 | 1.44 |
BT:NNP | 0.96 | 1.07 | 1.07 | 0.96 | 1.07 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.82 | 0.09 | 0.15 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.53 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "BT" modifying "rents"
-1.00 1.00 Hypernym.posNarrow : Narrowing a term (from "department" to "sales_department") does NOT preserve truth in a positive context
-3.00 1.00 NullPunisher.entity : BT
-1.00 1.00 NullPunisher.other : rents
-1.00 1.00 NullPunisher.other : line
-0.10 1.00 NullPunisher.article : a
-0.10 1.00 NullPunisher.article : The
-2.00 1.00 RootEntailment.unalignedRoot : "rents" not aligned to anything
Hand-tuned score (dot product of above): -9.5190
Threshold: -9.4738
Txt: GFI owns several computers. ITEL maintains them.
Hyp: ITEL maintains all the computers that GFI owns. (yes)
ITEL NNP |
maintains VBZ |
all PDT |
the DT |
computers NNS |
that IN |
GFI NNP |
owns VBZ |
|
GFI:NNP | 0.82 | 1.07 | 0.96 | 0.96 | 1.07 | 0.75 | 0.28 | 1.07 |
owns:VBZ | 1.16 | 0.46 | 0.76 | 0.76 | 0.91 | 1.15 | 1.16 | 1.32 |
several:JJ | 1.11 | 0.96 | 0.76 | 0.76 | 0.88 | 1.05 | 1.13 | 0.96 |
computers:NNS | 1.07 | 0.77 | 0.71 | 0.71 | 0.28 | 0.50 | 1.07 | 0.82 |
ITEL:NNP | 0.28 | 1.07 | 0.93 | 0.93 | 1.07 | 0.75 | 0.82 | 1.07 |
maintains:VBZ | 1.16 | 1.32 | 0.76 | 0.76 | 0.86 | 1.15 | 1.16 | 0.46 |
them:PRP | 1.50 | 1.10 | 1.11 | 0.59 | 2.50 | 1.07 | 1.55 | 1.24 |
NO_WORD | 0.28 | 0.17 | 0.41 | 0.82 | 0.09 | 0.60 | 0.28 | 0.36 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.24 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 8.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "owns" modifying "computers"
-1.00 1.00 NullPunisher.other : computers
-1.00 1.00 NullPunisher.other : all
-3.00 1.00 NullPunisher.entity : ITEL
-3.00 1.00 NullPunisher.entity : GFI
-1.00 1.00 NullPunisher.other : owns
-0.10 1.00 NullPunisher.functionWord : that
-1.00 1.00 NullPunisher.other : maintains
-0.10 1.00 NullPunisher.article : the
-2.00 1.00 RootEntailment.unalignedRoot : "maintains" not aligned to anything
Hand-tuned score (dot product of above): -14.3284
Threshold: -9.4738
Txt: Every customer who owns a computer has a service contract for it. MFI is a customer that owns exactly one computer.
Hyp: MFI has a service contract for all its computers. (yes)
MFI NNP |
has VBZ |
a DT |
service_contract NN |
all PDT |
its PRP$ |
computers NNS |
|
Every:DT | 1.25 | 1.25 | 1.31 | 1.00 | 1.31 | 1.16 | 1.00 |
customer:NN | 1.07 | 0.82 | 0.71 | 0.77 | 0.71 | 1.58 | 0.53 |
who:WP | 1.55 | 0.97 | 1.11 | 1.30 | 1.11 | 0.71 | 1.30 |
owns:VBZ | 1.16 | 0.46 | 0.76 | 0.91 | 0.76 | 0.51 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.31 | 1.15 | 1.00 |
computer:NN | 1.07 | 0.82 | 0.71 | 0.82 | 0.71 | 1.58 | 1.65 |
has:VBZ | 1.16 | 1.32 | 0.76 | 0.91 | 0.76 | 0.47 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.31 | 1.15 | 1.00 |
service_contract:NN | 1.07 | 0.82 | 0.71 | 0.28 | 0.71 | 1.58 | 0.82 |
it:PRP | 1.55 | 0.97 | 1.10 | 1.30 | 1.11 | 0.35 | 1.30 |
MFI:NNP | 0.28 | 1.07 | 0.96 | 1.07 | 0.96 | 1.84 | 1.07 |
is:VBZ | 1.16 | 1.01 | 0.76 | 0.91 | 0.76 | 0.18 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.31 | 1.15 | 1.00 |
customer:NN | 1.07 | 0.82 | 0.71 | 0.77 | 0.71 | 1.58 | 0.53 |
that:WDT | 1.25 | 1.16 | 1.31 | 1.00 | 1.31 | 1.16 | 1.00 |
owns:VBZ | 1.16 | 0.46 | 0.76 | 0.91 | 0.76 | 0.51 | 0.91 |
exactly:RB | 1.47 | 0.91 | 1.09 | 1.20 | 1.09 | 1.05 | 1.21 |
one:CD | 1.24 | 1.30 | 0.89 | 1.33 | 0.89 | 1.32 | 1.33 |
computer:NN | 1.07 | 0.82 | 0.71 | 0.82 | 0.71 | 1.58 | 1.65 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.41 | 0.93 | 0.05 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.54 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.14 Alignment.txtSpan
-1.00 1.00 NullPunisher.other : service_contract
-0.05 1.00 NullPunisher.aux : has
-1.00 1.00 NullPunisher.other : all
-3.00 1.00 NullPunisher.entity : MFI
-0.10 1.00 NullPunisher.article : a
-0.10 1.00 NullPunisher.functionWord : its
-7.00 1.00 Quant.expand : Invalid quantifier strengthening: replacing the quantifier "a" by a stronger quantifier "all" does NOT preserve truth.
-2.00 1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -14.6474
Threshold: -9.4738
Txt: Every customer who owns a computer has a service contract for it. MFI is a customer that owns several computers.
Hyp: MFI has a service contract for all its computers. (yes)
MFI NNP |
has VBZ |
a DT |
service_contract NN |
all PDT |
its PRP$ |
computers NNS |
|
Every:DT | 1.25 | 1.25 | 1.31 | 1.00 | 1.31 | 1.16 | 1.00 |
customer:NN | 1.07 | 0.82 | 0.71 | 0.77 | 0.71 | 1.58 | 0.53 |
who:WP | 1.55 | 0.97 | 1.11 | 1.30 | 1.11 | 0.71 | 1.30 |
owns:VBZ | 1.16 | 0.46 | 0.76 | 0.91 | 0.76 | 0.51 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.31 | 1.15 | 1.00 |
computer:NN | 1.07 | 0.82 | 0.71 | 0.82 | 0.71 | 1.58 | 1.65 |
has:VBZ | 1.16 | 1.32 | 0.76 | 0.91 | 0.76 | 0.47 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.31 | 1.15 | 1.00 |
service_contract:NN | 1.07 | 0.82 | 0.71 | 0.28 | 0.71 | 1.58 | 0.82 |
it:PRP | 1.55 | 0.97 | 1.10 | 1.30 | 1.11 | 0.35 | 1.30 |
MFI:NNP | 0.28 | 1.07 | 0.96 | 1.07 | 0.96 | 1.84 | 1.07 |
is:VBZ | 1.16 | 1.01 | 0.76 | 0.91 | 0.76 | 0.18 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.31 | 1.15 | 1.00 |
customer:NN | 1.07 | 0.82 | 0.71 | 0.77 | 0.71 | 1.58 | 0.53 |
that:WDT | 1.25 | 1.16 | 1.31 | 1.00 | 1.31 | 1.16 | 1.00 |
owns:VBZ | 1.16 | 0.46 | 0.76 | 0.91 | 0.76 | 0.51 | 0.91 |
several:JJ | 1.13 | 0.96 | 0.76 | 0.85 | 0.76 | 1.05 | 0.88 |
computers:NNS | 1.07 | 0.82 | 0.71 | 0.82 | 0.71 | 1.58 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.41 | 0.93 | 0.05 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.54 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.14 Alignment.txtSpan
-0.05 1.00 NullPunisher.aux : has
-1.00 1.00 NullPunisher.other : service_contract
-0.10 1.00 NullPunisher.article : a
-3.00 1.00 NullPunisher.entity : MFI
-0.10 1.00 NullPunisher.functionWord : its
-1.00 1.00 NullPunisher.other : all
-7.00 1.00 Quant.expand : Invalid quantifier strengthening: replacing the quantifier "a" by a stronger quantifier "all" does NOT preserve truth.
-2.00 1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -14.6474
Threshold: -9.4738
Txt: Every executive who had a laptop computer brought it to take notes at the meeting. Smith is a executive who owns five different laptop computers.
Hyp: Smith took five laptop computers to the meeting. (don't know)
Smith NNP |
took VBD |
five CD |
laptop_computers NNS |
the DT |
meeting NN |
|
Every:DT | 1.25 | 1.25 | 1.16 | 1.00 | 1.31 | 1.00 |
executive:NN | 0.98 | 0.79 | 1.25 | 0.83 | 0.71 | 0.80 |
who:WP | 1.55 | 0.94 | 1.46 | 1.30 | 1.04 | 1.30 |
had:VBD | 1.16 | 0.41 | 1.39 | 0.91 | 0.76 | 0.91 |
a:DT | 1.25 | 1.25 | 1.18 | 1.00 | 1.31 | 1.00 |
laptop_computer:NN | 1.05 | 0.82 | 1.28 | 2.31 | 0.71 | 0.86 |
brought:VBD | 1.16 | 0.18 | 1.38 | 0.91 | 0.76 | 0.91 |
it:PRP | 1.52 | 0.97 | 1.46 | 1.30 | 1.11 | 1.30 |
to:TO | 1.25 | 1.18 | 1.18 | 1.00 | 1.26 | 1.00 |
take_notes:VB | 1.16 | 0.40 | 1.26 | 0.84 | 0.76 | 0.81 |
the:DT | 1.25 | 1.22 | 1.15 | 1.00 | 2.42 | 1.00 |
meeting:NN | 1.15 | 0.82 | 1.29 | 0.86 | 0.71 | 0.28 |
Smith:NNP | 0.28 | 1.07 | 1.18 | 1.05 | 0.96 | 1.15 |
is:VBZ | 1.16 | 0.78 | 1.39 | 0.91 | 0.76 | 0.91 |
a:DT | 1.25 | 1.25 | 1.18 | 1.00 | 1.31 | 1.00 |
executive:NN | 0.98 | 0.79 | 1.25 | 0.83 | 0.71 | 0.80 |
who:WP | 1.55 | 0.94 | 1.46 | 1.30 | 1.04 | 1.30 |
owns:VBZ | 1.16 | 0.79 | 1.39 | 0.91 | 0.76 | 0.91 |
five:CD | 1.21 | 1.04 | 0.85 | 1.33 | 0.86 | 1.33 |
different:JJ | 1.13 | 0.96 | 0.90 | 0.73 | 0.76 | 0.82 |
laptop_computers:NNS | 1.05 | 0.82 | 1.29 | 0.28 | 0.71 | 0.86 |
NO_WORD | 0.28 | 0.17 | 0.52 | 0.09 | 0.82 | 0.74 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.54 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.33 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "meeting" modifying "took"
-1.00 1.00 NullPunisher.other : meeting
-3.00 1.00 NullPunisher.entity : Smith
-3.00 1.00 NullPunisher.entity : five
-0.10 1.00 NullPunisher.article : the
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '5.0' vs ''
-3.00 1.00 Structure.argsMismatch : args have different parents but same relations: text "laptop_computer" <-dobj-- "had" vs. hyp "laptop_computers" <-dobj-- "took", which aligned to text "brought"
Hand-tuned score (dot product of above): -17.2796
Threshold: -9.4738
Txt: There are 100 companies. ICM is one of the companies and owns 150 computers. It does not have service contracts for any of its computers. Each of the other 99 companies owns one computer. They have service contracts for them.
Hyp: Most companies that own a computer have a service contract for it. (yes)
Most JJS |
companies NNS |
that WDT |
own VBP |
a DT |
computer NN |
have VBP |
a DT |
service_contract NN |
it PRP |
|
There:EX | 1.18 | 1.00 | 1.23 | 1.25 | 1.31 | 1.00 | 1.18 | 1.31 | 1.00 | 1.16 |
are:VBP | 1.02 | 0.91 | 0.76 | 0.96 | 0.76 | 0.91 | 0.89 | 0.76 | 0.91 | 0.54 |
100:CD | 1.00 | 1.28 | 0.89 | 1.30 | 0.89 | 1.33 | 1.30 | 0.89 | 1.18 | 1.32 |
companies:NNS | 0.99 | 0.28 | 0.71 | 0.82 | 0.71 | 0.68 | 0.82 | 0.71 | 0.82 | 1.58 |
ICM:NNP | 1.24 | 1.07 | 0.96 | 1.07 | 0.96 | 1.07 | 1.07 | 0.96 | 1.07 | 1.79 |
is:VBZ | 1.02 | 0.91 | 0.76 | 0.96 | 0.76 | 0.91 | 0.98 | 0.76 | 0.91 | 0.43 |
one:CD | 0.97 | 1.33 | 0.89 | 1.23 | 0.89 | 1.33 | 1.27 | 0.89 | 1.33 | 1.32 |
the:DT | 1.18 | 1.00 | 1.21 | 1.25 | 1.31 | 1.00 | 1.22 | 1.31 | 1.00 | 1.16 |
companies:NNS | 0.99 | 0.28 | 0.71 | 0.82 | 0.71 | 0.68 | 0.82 | 0.71 | 0.82 | 1.58 |
owns:VBZ | 1.02 | 0.88 | 0.76 | 0.67 | 0.76 | 0.91 | 0.50 | 0.76 | 0.91 | 0.54 |
150:CD | 1.00 | 1.33 | 0.89 | 1.30 | 0.89 | 1.33 | 1.30 | 0.89 | 1.23 | 1.32 |
computers:NNS | 0.99 | 0.78 | 0.71 | 0.82 | 0.71 | 1.65 | 0.82 | 0.71 | 0.82 | 1.58 |
It:PRP | 1.39 | 1.30 | 1.11 | 0.97 | 1.10 | 1.30 | 0.97 | 1.10 | 1.30 | 0.73 |
does:VBZ | 0.96 | 0.89 | 0.76 | 0.81 | 0.76 | 0.91 | 0.86 | 0.76 | 0.84 | 0.54 |
not:RB | 0.95 | 1.21 | 1.06 | 0.91 | 1.09 | 1.21 | 0.91 | 1.09 | 1.21 | 1.00 |
have:VB | 1.02 | 0.91 | 0.70 | 0.50 | 0.76 | 0.91 | 0.68 | 0.76 | 0.91 | 0.54 |
service_contracts:NNS | 0.99 | 0.62 | 0.71 | 0.82 | 0.71 | 0.82 | 0.82 | 0.71 | 2.11 | 1.58 |
any:DT | 1.18 | 1.00 | 1.31 | 1.25 | 1.31 | 1.00 | 1.22 | 1.31 | 1.00 | 1.16 |
its:PRP$ | 1.36 | 1.30 | 1.11 | 0.97 | 1.10 | 1.30 | 0.97 | 1.10 | 1.30 | 0.35 |
computers:NNS | 0.99 | 0.78 | 0.71 | 0.82 | 0.71 | 1.65 | 0.82 | 0.71 | 0.82 | 1.58 |
Each:DT | 1.18 | 1.00 | 1.31 | 1.25 | 1.31 | 1.00 | 1.20 | 1.31 | 1.00 | 1.16 |
the:DT | 1.18 | 1.00 | 1.21 | 1.25 | 1.31 | 1.00 | 1.22 | 1.31 | 1.00 | 1.16 |
other:JJ | 0.93 | 0.88 | 0.69 | 0.96 | 0.76 | 0.83 | 0.93 | 0.76 | 0.88 | 1.05 |
99:CD | 1.00 | 1.33 | 0.89 | 1.30 | 0.89 | 1.23 | 1.30 | 0.89 | 1.13 | 1.32 |
companies:NNS | 0.99 | 0.28 | 0.71 | 0.82 | 0.71 | 0.68 | 0.82 | 0.71 | 0.82 | 1.58 |
owns:VBZ | 1.02 | 0.88 | 0.76 | 0.67 | 0.76 | 0.91 | 0.50 | 0.76 | 0.91 | 0.54 |
one:CD | 0.97 | 1.33 | 0.89 | 1.23 | 0.89 | 1.33 | 1.27 | 0.89 | 1.33 | 1.32 |
computer:NN | 0.99 | 0.68 | 0.71 | 0.82 | 0.71 | 0.28 | 0.82 | 0.71 | 0.82 | 1.58 |
They:PRP | 1.52 | 2.50 | 1.13 | 1.24 | 1.24 | 1.97 | 1.24 | 1.24 | 1.30 | 0.84 |
have:VBP | 1.02 | 0.91 | 0.70 | 0.50 | 0.76 | 0.91 | 1.32 | 0.76 | 0.91 | 0.54 |
service_contracts:NNS | 0.99 | 0.62 | 0.71 | 0.82 | 0.71 | 0.82 | 0.82 | 0.71 | 2.11 | 1.58 |
them:PRP | 1.52 | 2.50 | 1.13 | 1.24 | 1.24 | 1.97 | 1.24 | 1.24 | 1.30 | 0.98 |
NO_WORD | 0.11 | 0.28 | 0.41 | 0.25 | 0.82 | 0.09 | 0.17 | 0.82 | 0.09 | 0.87 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.95 Alignment.score
1.00 0.35 Alignment.isGood
-1.00 0.63 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.40 Alignment.txtSpan
0.00 1.00 NegPolarity.hypNegWord : "Most": tag "JJS" is in NegPolarityMarkers list
0.00 1.00 NegPolarity.txtNegWord : "have": has child with relation "neg"
0.00 1.00 NegPolarity.txtNegRoot : "have": has child with relation "neg"
-1.00 1.00 NullPunisher.other : it
-0.10 1.00 NullPunisher.functionWord : that
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : Most
-1.00 1.00 NullPunisher.other : companies
-0.10 1.00 NullPunisher.article : a
-6.00 1.00 Quant.oneNo : [any,a]
0.00 1.00 NegPolarity.txtNegWord&NegPolarity.hypNegWord :
Hand-tuned score (dot product of above): -8.9901
Threshold: -9.4738
Txt: Every report has a cover page. R-95-103 is a report. Smith signed the cover page.
Hyp: Smith signed the cover page of R-95-103. (yes)
Smith NNP |
signed VBD |
the DT |
cover NN |
page NN |
R-95-103 NN |
|
Every:DT | 1.25 | 1.25 | 1.31 | 0.91 | 1.00 | 1.00 |
report:NN | 1.14 | 0.82 | 0.71 | 0.76 | 0.71 | 0.87 |
has:VBZ | 1.16 | 0.74 | 0.76 | 0.91 | 0.87 | 0.91 |
a:DT | 1.25 | 1.25 | 1.31 | 1.00 | 1.00 | 1.00 |
cover:NN | 1.06 | 0.80 | 0.71 | 0.28 | 0.71 | 0.77 |
page:NN | 1.09 | 0.77 | 0.67 | 0.71 | 0.28 | 0.85 |
R-95-103:NNP | 1.12 | 0.66 | 0.71 | 0.77 | 0.85 | 3.06 |
is:VBZ | 1.16 | 0.78 | 0.76 | 0.91 | 0.91 | 0.91 |
a:DT | 1.25 | 1.25 | 1.31 | 1.00 | 1.00 | 1.00 |
report:NN | 1.14 | 0.82 | 0.71 | 0.76 | 0.71 | 0.87 |
Smith:NNP | 0.28 | 1.05 | 0.96 | 1.06 | 1.09 | 1.12 |
signed:VBD | 1.14 | 1.32 | 0.76 | 0.89 | 0.86 | 0.75 |
the:DT | 1.25 | 1.25 | 2.42 | 1.00 | 0.96 | 1.00 |
cover:NN | 1.06 | 0.80 | 0.71 | 0.28 | 0.71 | 0.77 |
page:NN | 1.09 | 0.77 | 0.67 | 0.71 | 0.28 | 0.85 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.05 | 0.09 | 0.04 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.63 Alignment.score
1.00 0.28 Alignment.isGood
-1.00 0.70 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "cover" modifying "page"
-1.00 1.00 NullPunisher.other : signed
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : page
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : cover
-2.00 1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -9.2654
Threshold: -9.4738
Txt: A company director awarded himself a large payrise.
Hyp: A company director has awarded and been awarded a payrise. (yes)
A DT |
company NN |
director NN |
has VBZ |
awarded VBN |
been VBN |
awarded VBN |
a DT |
payrise NN |
|
A:DT | 2.42 | 1.00 | 1.00 | 1.25 | 1.25 | 1.25 | 1.25 | 0.13 | 1.00 |
company:NN | 0.71 | 0.28 | 0.92 | 0.82 | 0.82 | 0.82 | 0.82 | 0.71 | 0.82 |
director:NN | 0.71 | 0.92 | 0.28 | 0.82 | 0.79 | 0.82 | 0.79 | 0.71 | 0.82 |
awarded:VBD | 0.76 | 0.91 | 0.88 | 0.73 | 2.02 | 0.78 | 2.02 | 0.76 | 0.87 |
himself:PRP | 1.11 | 1.26 | 1.28 | 0.97 | 0.97 | 0.97 | 0.97 | 1.11 | 1.30 |
a:DT | 0.13 | 1.00 | 1.00 | 1.25 | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 |
large:JJ | 0.76 | 0.82 | 0.88 | 0.96 | 0.89 | 0.96 | 0.89 | 0.76 | 0.80 |
payrise:NN | 0.71 | 0.82 | 0.82 | 0.82 | 0.78 | 0.82 | 0.78 | 0.71 | 0.28 |
NO_WORD | 0.82 | 0.05 | 0.15 | 1.30 | 0.17 | 1.57 | 0.06 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.80 Alignment.score
1.00 0.31 Alignment.isGood
-1.00 0.66 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.33 Alignment.txtSpan
-1.00 1.00 NullPunisher.other : payrise
-0.05 1.00 NullPunisher.aux : has
-0.10 1.00 NullPunisher.article : A
-1.00 1.00 NullPunisher.other : awarded
-0.10 1.00 NullPunisher.article : a
-0.05 1.00 NullPunisher.aux : been
-3.00 1.00 Structure.clearBadness : for predicate awarded, text actor set [director] overlaps hyp undergoer set [director]
-3.00 1.00 Structure.argsMismatch : awarded (verbal compl of the root) not aligned
Hand-tuned score (dot product of above): -8.2148
Threshold: -9.4738
Txt: John said Bill had hurt himself.
Hyp: John said Bill had been hurt. (yes)
John NNP |
said VBD |
Bill NNP |
had VBD |
been VBN |
hurt VBN |
|
John:NNP | 0.28 | 1.07 | 0.92 | 1.07 | 1.02 | 1.07 |
said:VBD | 1.16 | 1.32 | 1.16 | 0.84 | 1.00 | 0.72 |
Bill:NNP | 0.92 | 1.07 | 0.28 | 1.07 | 1.02 | 1.07 |
had:VBD | 1.16 | 0.84 | 1.16 | 1.32 | 0.98 | 0.64 |
hurt:VBN | 1.16 | 0.72 | 1.16 | 0.72 | 0.72 | 1.32 |
himself:PRP | 2.76 | 0.97 | 1.53 | 0.97 | 1.27 | 0.97 |
NO_WORD | 0.28 | 0.17 | 0.15 | 1.30 | 1.57 | 0.36 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.50 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-0.05 1.00 NullPunisher.aux : had
-1.00 1.00 NullPunisher.other : hurt
-0.05 1.00 NullPunisher.aux : been
-3.00 1.00 NullPunisher.entity : John
-3.00 1.00 NullPunisher.entity : Bill
-1.00 1.00 NullPunisher.other : said
-2.00 1.00 RootEntailment.unalignedRoot : "said" not aligned to anything
Hand-tuned score (dot product of above): -10.6755
Threshold: -9.4738
Txt: John said Bill had hurt himself.
Hyp: Someone said John had been hurt. (don't know)
Someone NN |
said VBD |
John NNP |
had VBD |
been VBN |
hurt VBN |
|
John:NNP | 1.14 | 1.07 | 0.28 | 1.07 | 1.02 | 1.07 |
said:VBD | 0.91 | 1.32 | 1.16 | 0.84 | 1.00 | 0.72 |
Bill:NNP | 1.31 | 1.07 | 0.92 | 1.07 | 1.02 | 1.07 |
had:VBD | 0.91 | 0.84 | 1.16 | 1.32 | 0.98 | 0.64 |
hurt:VBN | 0.91 | 0.72 | 1.16 | 0.72 | 0.72 | 1.32 |
himself:PRP | 1.26 | 0.97 | 2.76 | 0.97 | 1.27 | 0.97 |
NO_WORD | 0.28 | 0.17 | 0.15 | 1.30 | 1.57 | 0.36 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.50 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-0.05 1.00 NullPunisher.aux : been
-1.00 1.00 NullPunisher.other : Someone
-1.00 1.00 NullPunisher.other : hurt
-1.00 1.00 NullPunisher.other : said
-0.05 1.00 NullPunisher.aux : had
-3.00 1.00 NullPunisher.entity : John
-2.00 1.00 RootEntailment.unalignedRoot : "said" not aligned to anything
Hand-tuned score (dot product of above): -8.6755
Threshold: -9.4738
Txt: John spoke to Mary. So did Bill.
Hyp: Bill spoke to Mary. (yes)
Bill NNP |
spoke VBD |
Mary NNP |
|
John:NNP | 0.92 | 1.05 | 0.82 |
spoke:VBD | 1.16 | 1.32 | 1.16 |
Mary:NNP | 0.87 | 1.07 | 0.28 |
did:VBD | 1.13 | 0.60 | 1.16 |
Bill:NNP | 0.28 | 1.07 | 0.87 |
NO_WORD | 0.28 | 0.17 | 0.74 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.21 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Mary" modifying "spoke"
-3.00 1.00 NullPunisher.entity : Bill
-1.00 1.00 NullPunisher.other : spoke
-3.00 1.00 NullPunisher.entity : Mary
-2.00 1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -10.6630
Threshold: -9.4738
Txt: John spoke to Mary. So did Bill. John spoke to Mary at four o'clock.
Hyp: Bill spoke to Mary at four o'clock. (don't know)
Bill NNP |
spoke VBD |
Mary NNP |
four_o'clock CD |
|
John:NNP | 0.92 | 1.05 | 0.82 | 1.20 |
spoke:VBD | 1.16 | 1.32 | 1.16 | 1.22 |
Mary:NNP | 0.87 | 1.07 | 0.28 | 1.20 |
did:VBD | 1.13 | 0.60 | 1.16 | 1.39 |
Bill:NNP | 0.28 | 1.07 | 0.87 | 1.20 |
John:NNP | 0.92 | 1.05 | 0.82 | 1.20 |
spoke:VBD | 1.16 | 1.32 | 1.16 | 1.22 |
Mary:NNP | 0.87 | 1.07 | 0.28 | 1.20 |
four_o'clock:CD | 1.24 | 1.13 | 1.24 | 0.85 |
NO_WORD | 0.28 | 0.17 | 0.74 | 0.30 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.08 Alignment.score
1.00 0.18 Alignment.isGood
-1.00 0.80 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "four_o'clock" modifying "spoke"
-1.00 1.00 NullPunisher.other : spoke
-3.00 1.00 NullPunisher.entity : Bill
-3.00 1.00 NullPunisher.entity : Mary
-3.00 1.00 NullPunisher.entity : four_o'clock
-2.00 1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -13.9336
Threshold: -9.4738
Txt: John spoke to Mary at four o'clock. So did Bill.
Hyp: Bill spoke to Mary at four o'clock. (yes)
Bill NNP |
spoke VBD |
Mary NNP |
four_o'clock CD |
|
John:NNP | 0.92 | 1.05 | 0.82 | 1.20 |
spoke:VBD | 1.16 | 1.32 | 1.16 | 1.22 |
Mary:NNP | 0.87 | 1.07 | 0.28 | 1.20 |
four_o'clock:CD | 1.24 | 1.13 | 1.24 | 0.85 |
did:VBD | 1.13 | 0.60 | 1.16 | 1.39 |
Bill:NNP | 0.28 | 1.07 | 0.87 | 1.20 |
NO_WORD | 0.28 | 0.17 | 0.74 | 0.30 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.08 Alignment.score
1.00 0.18 Alignment.isGood
-1.00 0.80 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "four_o'clock" modifying "spoke"
-3.00 1.00 NullPunisher.entity : Mary
-3.00 1.00 NullPunisher.entity : Bill
-1.00 1.00 NullPunisher.other : spoke
-3.00 1.00 NullPunisher.entity : four_o'clock
-2.00 1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -13.9336
Threshold: -9.4738
Txt: John spoke to Mary at four o'clock. And Bill did at five o'clock.
Hyp: Bill spoke to Mary at five o'clock. (yes)
Bill NNP |
spoke VBD |
Mary NNP |
five CD |
o'clock RB |
|
John:NNP | 0.92 | 1.05 | 0.82 | 1.20 | 1.56 |
spoke:VBD | 1.16 | 1.32 | 1.16 | 1.37 | 0.94 |
Mary:NNP | 0.87 | 1.07 | 0.28 | 1.20 | 1.56 |
four_o'clock:CD | 1.24 | 1.13 | 1.24 | 0.28 | 0.82 |
Bill:NNP | 0.28 | 1.07 | 0.87 | 1.15 | 1.56 |
did:VBD | 1.13 | 0.60 | 1.16 | 1.28 | 1.11 |
five:CD | 1.18 | 1.28 | 1.24 | 0.85 | 1.30 |
o'clock:RB | 1.47 | 0.74 | 1.47 | 1.36 | 1.23 |
NO_WORD | 0.28 | 0.17 | 0.74 | 0.30 | 0.04 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.08 Alignment.score
1.00 0.18 Alignment.isGood
-1.00 0.80 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.20 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "o'clock" modifying "five"
-1.00 1.00 NullPunisher.other : spoke
-1.00 1.00 NullPunisher.other : o'clock
-3.00 1.00 NullPunisher.entity : Mary
-3.00 1.00 NullPunisher.entity : Bill
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '5.0' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -17.8173
Threshold: -9.4738
Txt: John has spoken to Mary. Bill is going to.
Hyp: Bill will speak to Mary. (yes)
Bill NNP |
will MD |
speak VB |
Mary NNP |
|
John:NNP | 1.18 | 0.96 | 1.07 | 0.82 |
has:VBZ | 0.91 | 1.05 | 0.86 | 1.13 |
spoken:VBN | 0.91 | 1.00 | 0.44 | 1.16 |
Mary:NNP | 1.13 | 0.96 | 1.07 | 0.28 |
Bill:NNP | 0.28 | 0.57 | 0.82 | 1.13 |
is:VBZ | 0.91 | 1.11 | 0.92 | 1.16 |
going_to:VBG | 0.91 | 0.93 | 0.74 | 1.16 |
NO_WORD | 0.28 | 1.55 | 0.17 | 0.74 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.61 Alignment.score
1.00 0.27 Alignment.isGood
-1.00 0.70 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Mary" modifying "speak"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "Mary" modifying "spoken" is dropped on aligned hypothesis word "speak"
-0.05 1.00 NullPunisher.aux : will
-3.00 1.00 NullPunisher.entity : Mary
-1.00 1.00 NullPunisher.other : Bill
Hand-tuned score (dot product of above): -4.5368
Threshold: -9.4738
Txt: John spoke to Mary on Monday. Bill didn't.
Hyp: Bill spoke to Mary on Monday. (don't know)
Bill NNP |
spoke VBD |
Mary NNP |
on IN |
Monday NNP |
|
John:NNP | 1.18 | 1.05 | 0.82 | 0.68 | 1.07 |
spoke:VBD | 0.91 | 1.32 | 1.16 | 1.15 | 1.16 |
Mary:NNP | 1.13 | 1.07 | 0.28 | 0.75 | 0.98 |
on:IN | 1.31 | 1.24 | 1.57 | 1.59 | 1.57 |
Monday:NNP | 1.15 | 1.07 | 0.98 | 0.75 | 0.28 |
Bill:NNP | 0.28 | 0.82 | 1.13 | 0.50 | 1.15 |
did:VBD | 0.87 | 0.60 | 1.16 | 1.15 | 1.16 |
n't:RB | 1.21 | 0.91 | 1.47 | 1.05 | 1.47 |
NO_WORD | 0.28 | 0.17 | 0.74 | 1.16 | 0.09 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.34 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Monday" modifying "spoke"
-2.00 1.00 Date.hypDateIns : hypothesis date insertion: Monday
-1.00 1.00 NullPunisher.other : Bill
-1.00 1.00 NullPunisher.other : spoke
-3.00 1.00 NullPunisher.entity : Monday
-3.00 1.00 NullPunisher.entity : Mary
-1.00 1.00 NullPunisher.other : on
-6.00 1.00 Numeric.mismatch : DATE mismatch: '01/01/1000' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -20.6883
Threshold: -9.4738
Txt: Has John spoken to Mary? Bill has.
Hyp: Bill has spoken to Mary. (yes)
Bill NNP |
has VBZ |
spoken VBN |
Mary NNP |
|
Has:NNP | 1.07 | 0.54 | 0.82 | 1.04 |
John:NNP | 1.09 | 1.07 | 1.03 | 0.82 |
spoken:VBN | 1.16 | 0.86 | 1.32 | 1.16 |
Mary:NNP | 1.04 | 1.04 | 1.07 | 0.28 |
Bill:NNP | 0.28 | 1.07 | 1.07 | 1.04 |
has:VBZ | 1.16 | 1.32 | 0.86 | 1.13 |
NO_WORD | 0.28 | 1.30 | 0.17 | 0.74 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.48 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Mary" modifying "spoken"
-3.00 1.00 NullPunisher.entity : Mary
-1.00 1.00 NullPunisher.other : spoken
-3.00 1.00 NullPunisher.entity : Bill
-0.05 1.00 NullPunisher.aux : has
-2.00 1.00 RootEntailment.unalignedRoot : "spoken" not aligned to anything
Hand-tuned score (dot product of above): -10.4431
Threshold: -9.4738
Txt: John has spoken to Mary. The students have too.
Hyp: The students have spoken to Mary. (yes)
The DT |
students NNS |
have VBP |
spoken VBN |
Mary NNP |
|
John:NNP | 0.93 | 1.10 | 1.07 | 1.03 | 0.82 |
has:VBZ | 0.76 | 0.91 | 0.19 | 0.86 | 1.13 |
spoken:VBN | 0.76 | 0.81 | 0.86 | 1.32 | 1.16 |
Mary:NNP | 0.96 | 1.03 | 1.02 | 1.07 | 0.28 |
The:DT | 2.42 | 1.00 | 1.22 | 1.25 | 1.25 |
students:NNS | 0.71 | 0.28 | 0.82 | 0.72 | 1.03 |
have:VBP | 0.73 | 0.91 | 1.32 | 0.86 | 1.11 |
too:RB | 1.02 | 1.21 | 0.91 | 0.91 | 1.47 |
NO_WORD | 0.82 | 0.28 | 1.30 | 0.17 | 0.74 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.55 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.71 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Mary" modifying "spoken"
-0.10 1.00 NullPunisher.article : The
-1.00 1.00 NullPunisher.other : spoken
-1.00 1.00 NullPunisher.other : students
-3.00 1.00 NullPunisher.entity : Mary
-0.05 1.00 NullPunisher.aux : have
-2.00 1.00 RootEntailment.unalignedRoot : "spoken" not aligned to anything
Hand-tuned score (dot product of above): -8.5507
Threshold: -9.4738
Txt: John went to Paris by car, and Bill by train.
Hyp: Bill went to Paris by train. (yes)
Bill NNP |
went VBD |
Paris NNP |
train NN |
|
John:NNP | 0.92 | 1.07 | 1.01 | 1.02 |
went:VBD | 1.16 | 1.32 | 1.16 | 0.69 |
Paris:NNP | 1.05 | 1.07 | 0.28 | 1.02 |
car:NN | 1.20 | 0.63 | 1.05 | 0.29 |
Bill:NNP | 0.28 | 1.07 | 1.05 | 1.14 |
train:NN | 1.14 | 0.60 | 1.02 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.74 | 0.30 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.09 Alignment.score
1.00 0.18 Alignment.isGood
-1.00 0.80 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Paris" modifying "went"
-3.00 1.00 NullPunisher.entity : Bill
-3.00 1.00 NullPunisher.entity : Paris
-1.00 1.00 NullPunisher.other : went
-2.00 1.00 RootEntailment.unalignedRoot : "went" not aligned to anything
Hand-tuned score (dot product of above): -10.7005
Threshold: -9.4738
Txt: John went to Paris by car, and Bill by train to Berlin.
Hyp: Bill went to Berlin by train. (yes)
Bill NNP |
went VBD |
Berlin NNP |
train NN |
|
John:NNP | 0.92 | 1.07 | 0.99 | 1.02 |
went:VBD | 1.16 | 1.32 | 1.16 | 0.69 |
Paris:NNP | 1.05 | 1.07 | 0.57 | 1.02 |
car:NN | 1.20 | 0.63 | 1.09 | 0.29 |
Bill:NNP | 0.28 | 1.07 | 1.00 | 1.14 |
train:NN | 1.14 | 0.60 | 0.96 | 0.28 |
Berlin:NNP | 1.00 | 1.07 | 0.28 | 0.96 |
NO_WORD | 0.28 | 0.17 | 0.74 | 0.30 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.09 Alignment.score
1.00 0.18 Alignment.isGood
-1.00 0.80 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Berlin" modifying "went"
-1.00 1.00 NullPunisher.other : went
-3.00 1.00 NullPunisher.entity : Bill
-3.00 1.00 NullPunisher.entity : Berlin
-2.00 1.00 RootEntailment.unalignedRoot : "went" not aligned to anything
Hand-tuned score (dot product of above): -10.7005
Threshold: -9.4738
Txt: John went to Paris by car, and Bill to Berlin.
Hyp: Bill went to Berlin by car. (yes)
Bill NNP |
went VBD |
Berlin NNP |
car NN |
|
John:NNP | 0.92 | 1.07 | 0.99 | 1.09 |
went:VBD | 1.16 | 1.32 | 1.16 | 0.72 |
Paris:NNP | 1.05 | 1.07 | 0.57 | 1.05 |
car:NN | 1.20 | 0.63 | 1.09 | 0.28 |
Bill:NNP | 0.28 | 1.07 | 1.00 | 1.20 |
to:TO | 1.25 | 1.25 | 1.25 | 1.00 |
Berlin:NNP | 1.00 | 1.07 | 0.28 | 1.02 |
NO_WORD | 0.28 | 0.17 | 0.74 | 0.30 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.09 Alignment.score
1.00 0.18 Alignment.isGood
-1.00 0.80 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Berlin" modifying "went"
-3.00 1.00 NullPunisher.entity : Bill
-1.00 1.00 NullPunisher.other : went
-3.00 1.00 NullPunisher.entity : Berlin
-2.00 1.00 RootEntailment.unalignedRoot : "went" not aligned to anything
Hand-tuned score (dot product of above): -10.7005
Threshold: -9.4738
Txt: John is going to Paris by car, and the students by train.
Hyp: The students are going to Paris by train. (yes)
The DT |
students NNS |
are VBP |
going VBG |
Paris NNP |
train NN |
|
John:NNP | 0.93 | 1.10 | 1.07 | 1.00 | 1.01 | 1.02 |
is:VBZ | 0.76 | 0.91 | 0.02 | 1.00 | 1.13 | 0.91 |
going:VBG | 0.76 | 0.89 | 0.92 | 1.32 | 1.16 | 0.70 |
Paris:NNP | 0.96 | 1.08 | 1.02 | 1.07 | 0.28 | 1.02 |
car:NN | 0.71 | 0.86 | 0.74 | 0.81 | 1.05 | 0.29 |
the:DT | 0.01 | 1.00 | 1.18 | 1.25 | 1.25 | 1.00 |
students:NNS | 0.71 | 0.28 | 0.82 | 0.80 | 1.08 | 0.63 |
train:NN | 0.71 | 0.63 | 0.82 | 0.61 | 1.02 | 0.28 |
NO_WORD | 0.82 | 0.28 | 1.30 | 0.17 | 0.74 | 0.30 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.41 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Paris" modifying "going"
-1.00 1.00 NullPunisher.other : students
-0.10 1.00 NullPunisher.article : The
-1.00 1.00 NullPunisher.other : going
-0.05 1.00 NullPunisher.aux : are
-3.00 1.00 NullPunisher.entity : Paris
-2.00 1.00 RootEntailment.unalignedRoot : "going" not aligned to anything
Hand-tuned score (dot product of above): -8.6256
Threshold: -9.4738
Txt: John went to Paris by car. Bill by train.
Hyp: Bill went to Paris by train. (yes)
Bill NNP |
went VBD |
Paris NNP |
train NN |
|
John:NNP | 1.18 | 1.07 | 1.01 | 1.02 |
went:VBD | 0.91 | 1.32 | 1.16 | 0.69 |
Paris:NNP | 1.14 | 1.07 | 0.28 | 1.02 |
car:NN | 0.94 | 0.63 | 1.05 | 0.29 |
Bill:VB | 0.85 | 0.78 | 1.16 | 0.91 |
train:NN | 0.88 | 0.60 | 1.02 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.74 | 0.30 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.37 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.50 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Paris" modifying "went"
-3.00 1.00 NullPunisher.entity : Paris
-1.00 1.00 NullPunisher.other : went
-2.00 1.00 RootEntailment.unalignedRoot : "went" not aligned to anything
Hand-tuned score (dot product of above): -7.1988
Threshold: -9.4738
Txt: John owns a car. Bill owns one too.
Hyp: Bill owns a car. (yes)
Bill NNP |
owns VBZ |
a DT |
car NN |
|
John:NNP | 1.06 | 1.02 | 0.96 | 1.09 |
owns:VBZ | 1.16 | 1.32 | 0.76 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 |
car:NN | 1.20 | 0.82 | 0.71 | 0.28 |
Bill:NNP | 0.28 | 1.07 | 0.96 | 1.20 |
owns:VBZ | 1.16 | 1.32 | 0.76 | 0.91 |
one:CD | 1.24 | 1.21 | 0.89 | 1.33 |
too:RB | 1.47 | 0.91 | 1.09 | 1.21 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.20 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-0.10 1.00 NullPunisher.article : a
-3.00 1.00 NullPunisher.entity : Bill
-1.00 1.00 NullPunisher.other : owns
-1.00 1.00 NullPunisher.other : car
-2.00 1.00 RootEntailment.unalignedRoot : "owns" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738
Txt: John owns a car. Bill owns one too.
Hyp: There is a car that John and Bill own. (don't know)
There EX |
is VBZ |
a DT |
car NN |
that IN |
John NNP |
Bill NNP |
own VBP |
|
John:NNP | 0.96 | 1.07 | 0.96 | 1.09 | 0.75 | 0.28 | 1.06 | 0.98 |
owns:VBZ | 0.76 | 0.96 | 0.76 | 0.91 | 1.15 | 1.11 | 1.16 | 0.67 |
a:DT | 1.31 | 1.25 | 2.42 | 1.00 | 1.05 | 1.25 | 1.25 | 1.25 |
car:NN | 0.71 | 0.82 | 0.71 | 0.28 | 0.47 | 1.09 | 1.20 | 0.82 |
Bill:NNP | 0.96 | 1.07 | 0.96 | 1.20 | 0.75 | 1.06 | 0.28 | 1.07 |
owns:VBZ | 0.76 | 0.96 | 0.76 | 0.91 | 1.15 | 1.11 | 1.16 | 0.67 |
one:CD | 0.89 | 1.30 | 0.89 | 1.33 | 1.30 | 1.21 | 1.24 | 1.23 |
too:RB | 1.09 | 0.91 | 1.09 | 1.21 | 1.01 | 1.44 | 1.47 | 0.91 |
NO_WORD | 0.29 | 0.17 | 0.82 | 0.28 | 0.99 | 0.28 | 0.01 | 0.31 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.38 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-3.00 1.00 NullPunisher.entity : Bill
-0.10 1.00 NullPunisher.article : a
-0.10 1.00 NullPunisher.functionWord : There
-0.05 1.00 NullPunisher.aux : is
-1.00 1.00 NullPunisher.other : car
-0.10 1.00 NullPunisher.functionWord : that
-2.00 1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -6.9581
Threshold: -9.4738
Txt: John owns a red car. Bill owns a blue one.
Hyp: Bill owns a blue car. (yes)
Bill NNP |
owns VBZ |
a DT |
blue JJ |
car NN |
|
John:NNP | 0.92 | 1.02 | 0.96 | 1.24 | 1.09 |
owns:VBZ | 1.16 | 1.32 | 0.76 | 1.02 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.18 | 1.00 |
red:JJ | 1.13 | 0.96 | 0.76 | 0.46 | 0.76 |
car:NN | 1.20 | 0.82 | 0.71 | 0.78 | 0.28 |
Bill:NNP | 0.28 | 1.07 | 0.96 | 1.19 | 1.20 |
owns:VBZ | 1.16 | 1.32 | 0.76 | 1.02 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.18 | 1.00 |
blue:JJ | 1.08 | 0.96 | 0.76 | 0.74 | 0.67 |
one:NN | 1.15 | 0.72 | 0.71 | 0.95 | 0.92 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.11 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.18 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "blue" modifying "car"
-1.00 1.00 NullPunisher.other : owns
-1.00 1.00 NullPunisher.other : car
-1.00 1.00 NullPunisher.other : blue
-0.10 1.00 NullPunisher.article : a
-3.00 1.00 NullPunisher.entity : Bill
-2.00 1.00 RootEntailment.unalignedRoot : "owns" not aligned to anything
Hand-tuned score (dot product of above): -10.0020
Threshold: -9.4738
Txt: John owns a red car. Bill owns a blue one.
Hyp: Bill owns a red car. (don't know)
Bill NNP |
owns VBZ |
a DT |
red JJ |
car NN |
|
John:NNP | 0.92 | 1.02 | 0.96 | 1.24 | 1.09 |
owns:VBZ | 1.16 | 1.32 | 0.76 | 1.02 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.18 | 1.00 |
red:JJ | 1.13 | 0.96 | 0.76 | 0.74 | 0.76 |
car:NN | 1.20 | 0.82 | 0.71 | 0.87 | 0.28 |
Bill:NNP | 0.28 | 1.07 | 0.96 | 1.24 | 1.20 |
owns:VBZ | 1.16 | 1.32 | 0.76 | 1.02 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.18 | 1.00 |
blue:JJ | 1.08 | 0.96 | 0.76 | 0.46 | 0.67 |
one:NN | 1.15 | 0.72 | 0.71 | 0.99 | 0.92 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.11 | 0.09 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.18 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "red" modifying "car"
-3.00 1.00 NullPunisher.entity : Bill
-1.00 1.00 NullPunisher.other : owns
-1.00 1.00 NullPunisher.other : red
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : car
-2.00 1.00 RootEntailment.unalignedRoot : "owns" not aligned to anything
Hand-tuned score (dot product of above): -10.0020
Threshold: -9.4738
Txt: John owns a red car. Bill owns a fast one.
Hyp: Bill owns a fast car. (yes)
Bill NNP |
owns VBZ |
a DT |
fast JJ |
car NN |
|
John:NNP | 0.92 | 1.02 | 0.96 | 1.24 | 1.09 |
owns:VBZ | 1.16 | 1.32 | 0.76 | 1.02 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.18 | 1.00 |
red:JJ | 1.13 | 0.96 | 0.76 | 0.93 | 0.76 |
car:NN | 1.20 | 0.82 | 0.71 | 0.82 | 0.28 |
Bill:NNP | 0.28 | 1.07 | 0.96 | 1.24 | 1.20 |
owns:VBZ | 1.16 | 1.32 | 0.76 | 1.02 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.18 | 1.00 |
fast_one:. | 1.25 | 1.25 | 1.31 | 0.01 | 1.00 |
.:. | 1.25 | 1.25 | 1.31 | 1.02 | 0.97 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.11 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.18 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "fast" modifying "car"
-1.00 1.00 NullPunisher.other : owns
-1.00 1.00 NullPunisher.other : fast
-3.00 1.00 NullPunisher.entity : Bill
-1.00 1.00 NullPunisher.other : car
-0.10 1.00 NullPunisher.article : a
-2.00 1.00 RootEntailment.unalignedRoot : "owns" not aligned to anything
Hand-tuned score (dot product of above): -10.0020
Threshold: -9.4738
Txt: John owns a red car. Bill owns a fast one.
Hyp: Bill owns a fast red car. (yes)
Bill NNP |
owns VBZ |
a DT |
fast JJ |
red JJ |
car NN |
|
John:NNP | 0.92 | 1.02 | 0.96 | 1.24 | 1.24 | 1.09 |
owns:VBZ | 1.16 | 1.32 | 0.76 | 1.02 | 1.02 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.18 | 1.18 | 1.00 |
red:JJ | 1.13 | 0.96 | 0.76 | 0.93 | 0.74 | 0.76 |
car:NN | 1.20 | 0.82 | 0.71 | 0.82 | 0.87 | 0.28 |
Bill:NNP | 0.28 | 1.07 | 0.96 | 1.24 | 1.24 | 1.20 |
owns:VBZ | 1.16 | 1.32 | 0.76 | 1.02 | 1.02 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.18 | 1.18 | 1.00 |
fast_one:. | 1.25 | 1.25 | 1.31 | 0.01 | 1.18 | 1.00 |
.:. | 1.25 | 1.25 | 1.31 | 1.02 | 1.13 | 0.97 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.11 | 0.11 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.17 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "red" modifying "car"
-1.00 1.00 NullPunisher.other : red
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : fast
-1.00 1.00 NullPunisher.other : car
-3.00 1.00 NullPunisher.entity : Bill
-1.00 1.00 NullPunisher.other : owns
-2.00 1.00 RootEntailment.unalignedRoot : "owns" not aligned to anything
Hand-tuned score (dot product of above): -11.1172
Threshold: -9.4738
Txt: John owns a red car. Bill owns a fast one.
Hyp: Bill owns a fast red car. (don't know)
Bill NNP |
owns VBZ |
a DT |
fast JJ |
red JJ |
car NN |
|
John:NNP | 0.92 | 1.02 | 0.96 | 1.24 | 1.24 | 1.09 |
owns:VBZ | 1.16 | 1.32 | 0.76 | 1.02 | 1.02 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.18 | 1.18 | 1.00 |
red:JJ | 1.13 | 0.96 | 0.76 | 0.93 | 0.74 | 0.76 |
car:NN | 1.20 | 0.82 | 0.71 | 0.82 | 0.87 | 0.28 |
Bill:NNP | 0.28 | 1.07 | 0.96 | 1.24 | 1.24 | 1.20 |
owns:VBZ | 1.16 | 1.32 | 0.76 | 1.02 | 1.02 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.18 | 1.18 | 1.00 |
fast_one:. | 1.25 | 1.25 | 1.31 | 0.01 | 1.18 | 1.00 |
.:. | 1.25 | 1.25 | 1.31 | 1.02 | 1.13 | 0.97 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.11 | 0.11 | 0.09 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.17 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "red" modifying "car"
-1.00 1.00 NullPunisher.other : car
-1.00 1.00 NullPunisher.other : fast
-3.00 1.00 NullPunisher.entity : Bill
-1.00 1.00 NullPunisher.other : red
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : owns
-2.00 1.00 RootEntailment.unalignedRoot : "owns" not aligned to anything
Hand-tuned score (dot product of above): -11.1172
Threshold: -9.4738
Txt: John owns a fast red car. Bill owns a slow one.
Hyp: Bill owns a slow red car. (yes)
Bill NNP |
owns VBZ |
a DT |
slow JJ |
red JJ |
car NN |
|
John:NNP | 0.92 | 1.02 | 0.96 | 1.24 | 1.24 | 1.09 |
owns:VBZ | 1.16 | 1.32 | 0.76 | 1.02 | 1.02 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.18 | 1.18 | 1.00 |
fast:JJ | 1.13 | 0.96 | 0.76 | 1.36 | 0.93 | 0.72 |
red:JJ | 1.13 | 0.96 | 0.76 | 0.93 | 0.74 | 0.76 |
car:NN | 1.20 | 0.82 | 0.71 | 0.95 | 0.87 | 0.28 |
Bill:NNP | 0.28 | 1.07 | 0.96 | 1.24 | 1.24 | 1.20 |
owns:VBZ | 1.16 | 1.32 | 0.76 | 1.02 | 1.02 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.18 | 1.18 | 1.00 |
slow:JJ | 1.13 | 0.96 | 0.76 | 0.74 | 0.93 | 0.84 |
one:NN | 1.15 | 0.72 | 0.71 | 0.99 | 0.99 | 0.92 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.11 | 0.11 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.17 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "red" modifying "car"
-0.10 1.00 NullPunisher.article : a
-3.00 1.00 NullPunisher.entity : Bill
-1.00 1.00 NullPunisher.other : slow
-1.00 1.00 NullPunisher.other : owns
-1.00 1.00 NullPunisher.other : car
-1.00 1.00 NullPunisher.other : red
-2.00 1.00 RootEntailment.unalignedRoot : "owns" not aligned to anything
Hand-tuned score (dot product of above): -11.1172
Threshold: -9.4738
Txt: John had his paper accepted. Bill doesn't know why.
Hyp: Bill knows why John had his paper accepted. (don't know)
Bill NNP |
knows VBZ |
why WRB |
John NNP |
had VBD |
his PRP$ |
paper NN |
accepted VBN |
|
John:NNP | 1.18 | 1.05 | 0.93 | 0.28 | 1.07 | 1.84 | 1.17 | 1.07 |
had:VBD | 0.91 | 0.71 | 0.76 | 1.16 | 1.32 | 0.47 | 0.91 | 0.45 |
his:PRP$ | 1.26 | 0.97 | 1.41 | 2.76 | 0.90 | 1.05 | 1.30 | 0.97 |
paper:NN | 0.94 | 0.82 | 0.71 | 1.17 | 0.82 | 1.58 | 0.28 | 0.61 |
accepted:VBN | 0.91 | 0.74 | 0.76 | 1.16 | 0.37 | 0.54 | 0.70 | 1.32 |
Bill:NNP | 0.28 | 0.82 | 0.71 | 1.18 | 0.82 | 1.55 | 0.94 | 0.82 |
does:VBZ | 0.91 | 0.03 | 0.76 | 1.11 | 0.86 | 0.51 | 0.88 | 0.81 |
n't:RB | 1.21 | 0.79 | 1.09 | 1.47 | 0.91 | 1.05 | 1.21 | 0.91 |
know:VB | 0.91 | 1.10 | 0.76 | 1.16 | 0.79 | 0.54 | 0.91 | 0.66 |
why:WRB | 1.00 | 1.25 | 2.42 | 1.22 | 1.25 | 1.16 | 1.00 | 1.25 |
NO_WORD | 0.28 | 0.17 | 0.30 | 0.28 | 0.36 | 0.93 | 0.09 | 0.16 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.38 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.12 Alignment.txtSpan
0.00 1.00 NegPolarity.txtNegWord : "know": has child with relation "neg"
0.00 1.00 NegPolarity.txtNegRoot : "know": has child with relation "neg"
-1.00 1.00 NullPunisher.other : accepted
-3.00 1.00 NullPunisher.entity : John
-0.10 1.00 NullPunisher.functionWord : his
-1.00 1.00 NullPunisher.other : why
-1.00 1.00 NullPunisher.other : paper
-0.05 1.00 NullPunisher.aux : had
-3.00 1.00 Structure.argsMismatch : had (verbal compl of the root) not aligned
Hand-tuned score (dot product of above): -9.6688
Threshold: -9.4738
Txt: John spoke to Mary. And to Sue.
Hyp: John spoke to Sue. (yes)
John NNP |
spoke VBD |
Sue NNP |
|
John:NNP | 0.28 | 1.05 | 1.07 |
spoke:VBD | 1.14 | 1.32 | 0.85 |
Mary:NNP | 0.82 | 1.07 | 1.07 |
to:TO | 1.25 | 1.25 | 1.00 |
Sue:VBZ | 1.16 | 0.64 | 0.85 |
NO_WORD | 0.28 | 0.17 | 0.74 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.25 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.33 Alignment.txtSpan
-1.00 1.00 NullPunisher.other : spoke
-3.00 1.00 NullPunisher.entity : John
-2.00 1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -6.3821
Threshold: -9.4738
Txt: John spoke to Mary. On Friday.
Hyp: John spoke to Mary on Friday. (yes)
John NNP |
spoke VBD |
Mary NNP |
on IN |
Friday NNP |
|
John:NNP | 0.28 | 1.05 | 0.82 | 0.68 | 1.09 |
spoke:VBD | 1.14 | 1.32 | 1.16 | 1.15 | 1.16 |
Mary:NNP | 0.82 | 1.07 | 0.28 | 0.75 | 1.04 |
On:IN | 1.50 | 1.24 | 1.57 | 0.25 | 1.57 |
Friday:NNP | 1.09 | 1.07 | 1.04 | 0.75 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.74 | 1.16 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.34 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Friday" modifying "spoke"
-2.00 1.00 Date.hypDateIns : hypothesis date insertion: Friday
-1.00 1.00 NullPunisher.other : on
-1.00 1.00 NullPunisher.other : spoke
-3.00 1.00 NullPunisher.entity : John
-3.00 1.00 NullPunisher.entity : Mary
-3.00 1.00 NullPunisher.entity : Friday
-6.00 1.00 Numeric.mismatch : DATE mismatch: '01/01/1000' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -22.6883
Threshold: -9.4738
Txt: John spoke to Mary on Thursday. And on Friday.
Hyp: John spoke to Mary on Friday. (yes)
John NNP |
spoke VBD |
Mary NNP |
on IN |
Friday NNP |
|
John:NNP | 0.28 | 1.05 | 0.82 | 0.68 | 1.09 |
spoke:VBD | 1.14 | 1.32 | 1.16 | 1.15 | 1.16 |
Mary:NNP | 0.82 | 1.07 | 0.28 | 0.75 | 1.04 |
on:IN | 1.50 | 1.24 | 1.57 | 1.59 | 1.57 |
Thursday:NNP | 1.06 | 1.07 | 1.02 | 0.75 | 0.58 |
And:CC | 1.25 | 1.25 | 1.22 | 1.00 | 1.25 |
on:IN | 1.50 | 1.24 | 1.57 | 1.59 | 1.57 |
Friday:NNP | 1.09 | 1.07 | 1.04 | 0.75 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.74 | 1.16 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.34 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Friday" modifying "spoke"
-2.00 1.00 Date.hypDateIns : hypothesis date insertion: Friday
-3.00 1.00 NullPunisher.entity : Mary
-3.00 1.00 NullPunisher.entity : John
-1.00 1.00 NullPunisher.other : spoke
-3.00 1.00 NullPunisher.entity : Friday
-1.00 1.00 NullPunisher.other : on
-6.00 1.00 Numeric.mismatch : DATE mismatch: '01/01/1000' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -22.6883
Threshold: -9.4738
Txt: Twenty men work in the Sales Department. But only one woman.
Hyp: Two women work in the Sales Department. (don't know)
Two CD |
women NNS |
work VBP |
the DT |
Sales_Department NNP |
|
Twenty:CD | 0.33 | 1.31 | 1.30 | 0.86 | 1.24 |
men:NNS | 1.29 | 1.08 | 0.72 | 0.71 | 1.07 |
work:VBP | 1.36 | 0.70 | 1.32 | 0.76 | 1.16 |
the:DT | 1.11 | 1.00 | 1.25 | 2.42 | 1.25 |
Sales_Department:NNP | 1.20 | 1.07 | 1.07 | 0.96 | 0.28 |
But:CC | 1.18 | 1.00 | 1.25 | 1.31 | 1.25 |
only:RB | 1.36 | 1.19 | 0.91 | 1.09 | 1.47 |
one:CD | 0.35 | 1.27 | 1.27 | 0.82 | 1.24 |
woman:VBZ | 1.39 | 0.01 | 0.44 | 0.76 | 1.16 |
NO_WORD | 0.52 | 0.28 | 0.17 | 0.82 | 0.07 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.11 Alignment.score
1.00 0.19 Alignment.isGood
-1.00 0.79 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.40 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Sales_Department" modifying "work"
-0.10 1.00 NullPunisher.article : the
-3.00 1.00 NullPunisher.entity : Sales_Department
-1.00 1.00 NullPunisher.other : work
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs '1.0'
-2.00 1.00 RootEntailment.unalignedRoot : "work" not aligned to anything
Hand-tuned score (dot product of above): -13.6544
Threshold: -9.4738
Txt: Five men work part time. And forty five women.
Hyp: Forty five women work part time. (yes)
Forty CD |
five CD |
women NNS |
work VBP |
part NN |
time NN |
|
Five:CD | 0.33 | 1.35 | 1.30 | 1.30 | 1.33 | 1.22 |
men:NNS | 1.29 | 1.29 | 1.08 | 0.72 | 1.05 | 0.92 |
work:VBP | 1.32 | 1.39 | 0.70 | 1.32 | 0.68 | 0.69 |
part:NN | 1.22 | 1.28 | 1.01 | 0.59 | 0.28 | 1.00 |
time:NN | 1.29 | 0.93 | 0.86 | 0.60 | 1.00 | 0.28 |
And:CC | 1.18 | 1.18 | 1.00 | 1.25 | 0.96 | 1.00 |
forty:RB | 0.17 | 0.85 | 1.16 | 0.84 | 1.14 | 0.91 |
five:CD | 0.33 | 0.85 | 1.30 | 1.30 | 1.31 | 0.96 |
women:VBZ | 1.35 | 1.37 | 1.75 | 0.35 | 0.89 | 0.83 |
NO_WORD | 0.42 | 0.52 | 0.28 | 0.17 | 0.05 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.46 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 3.00 Alignment.hypSpan
0.10 0.50 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "part" modifying "time"
-1.00 1.00 NullPunisher.other : work
-1.00 1.00 NullPunisher.other : part
-1.00 1.00 NullPunisher.other : time
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '0.0' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "work" not aligned to anything
Hand-tuned score (dot product of above): -11.9807
Threshold: -9.4738
Txt: John found Mary before Bill.
Hyp: John found Mary before Bill found Mary. (yes)
John NNP |
found VBD |
Mary NNP |
Bill NNP |
found VBD |
Mary NNP |
|
John:NNP | 0.28 | 1.00 | 0.82 | 0.92 | 1.00 | 0.82 |
found:VBD | 1.09 | 1.32 | 1.16 | 1.16 | 1.32 | 1.16 |
Mary:NNP | 0.82 | 1.07 | 0.28 | 0.87 | 1.07 | 0.28 |
Bill:NNP | 0.92 | 1.07 | 0.87 | 0.28 | 1.07 | 0.87 |
NO_WORD | 0.28 | 0.17 | 0.28 | 0.10 | 0.36 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.03 Alignment.score
1.00 0.17 Alignment.isGood
-1.00 0.81 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Bill" modifying "Mary"
-3.00 1.00 NullPunisher.entity : Mary
-1.00 1.00 NullPunisher.other : found
-1.00 1.00 NullPunisher.other : found
-3.00 1.00 NullPunisher.entity : Mary
-3.00 1.00 NullPunisher.entity : Bill
-3.00 1.00 NullPunisher.entity : John
-2.00 1.00 RootEntailment.unalignedRoot : "found" not aligned to anything
Hand-tuned score (dot product of above): -18.2039
Threshold: -9.4738
Txt: John found Mary before Bill.
Hyp: John found Mary before John found Bill. (yes)
John NNP |
found VBD |
Mary NNP |
John NNP |
found VBD |
Bill NNP |
|
John:NNP | 0.28 | 1.00 | 0.82 | 0.28 | 1.00 | 0.92 |
found:VBD | 1.09 | 1.32 | 1.16 | 1.09 | 1.32 | 1.16 |
Mary:NNP | 0.82 | 1.07 | 0.28 | 0.82 | 1.07 | 0.87 |
Bill:NNP | 0.92 | 1.07 | 0.87 | 0.92 | 1.07 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.28 | 0.10 | 0.36 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.03 Alignment.score
1.00 0.17 Alignment.isGood
-1.00 0.81 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "John" modifying "Mary"
-3.00 1.00 NullPunisher.entity : John
-3.00 1.00 NullPunisher.entity : Bill
-3.00 1.00 NullPunisher.entity : Mary
-3.00 1.00 NullPunisher.entity : John
-1.00 1.00 NullPunisher.other : found
-1.00 1.00 NullPunisher.other : found
-2.00 1.00 RootEntailment.unalignedRoot : "found" not aligned to anything
Hand-tuned score (dot product of above): -18.2039
Threshold: -9.4738
Txt: John wants to know how many men work part time. And women.
Hyp: John wants to know how many women work part time. (yes)
John NNP |
wants VBZ |
to TO |
know VB |
how WRB |
many JJ |
women NNS |
work VBP |
part NN |
time NN |
|
John:NNP | 0.28 | 1.07 | 0.96 | 1.07 | 0.93 | 1.24 | 1.05 | 1.02 | 1.27 | 1.18 |
wants:VBZ | 1.16 | 1.32 | 0.76 | 0.60 | 0.76 | 0.95 | 0.76 | 0.60 | 0.74 | 0.81 |
to:TO | 1.25 | 1.25 | 2.42 | 1.25 | 1.26 | 1.18 | 1.06 | 1.25 | 1.00 | 1.00 |
know:VB | 1.16 | 0.60 | 0.76 | 1.32 | 0.66 | 1.02 | 0.91 | 0.47 | 0.91 | 0.86 |
how:WRB | 1.22 | 1.25 | 1.26 | 1.16 | 2.42 | 1.18 | 1.00 | 1.22 | 1.00 | 1.00 |
many:JJ | 1.13 | 0.89 | 0.76 | 0.96 | 0.76 | 0.74 | 0.85 | 0.96 | 0.82 | 0.88 |
men:NNS | 1.19 | 0.73 | 0.71 | 0.82 | 0.71 | 0.89 | 1.08 | 0.72 | 1.05 | 0.92 |
work:VBP | 1.11 | 0.60 | 0.76 | 0.47 | 0.73 | 1.02 | 0.70 | 1.32 | 0.68 | 0.69 |
part:NN | 1.27 | 0.65 | 0.71 | 0.82 | 0.71 | 0.93 | 1.01 | 0.59 | 0.28 | 1.00 |
time:NN | 1.18 | 0.72 | 0.71 | 0.77 | 0.71 | 0.99 | 0.86 | 0.60 | 1.00 | 0.28 |
And:CC | 1.25 | 1.20 | 1.31 | 1.22 | 1.31 | 1.09 | 1.00 | 1.25 | 0.96 | 1.00 |
women:RB | 1.40 | 0.77 | 1.09 | 0.91 | 1.09 | 1.02 | 2.56 | 0.70 | 1.20 | 1.14 |
NO_WORD | 0.28 | 0.17 | 1.55 | 0.34 | 0.30 | 0.29 | 0.28 | 0.36 | 0.05 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.53 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 9.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.10 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "part" modifying "time"
-1.00 1.00 NullPunisher.other : many
-1.00 1.00 NullPunisher.other : how
-3.00 1.00 NullPunisher.entity : John
-1.00 1.00 NullPunisher.other : work
-0.10 1.00 NullPunisher.functionWord : to
-1.00 1.00 NullPunisher.other : wants
-1.00 1.00 NullPunisher.other : part
-1.00 1.00 NullPunisher.other : time
-1.00 1.00 NullPunisher.other : know
-2.00 1.00 RootEntailment.unalignedRoot : "wants" not aligned to anything
Hand-tuned score (dot product of above): -13.8132
Threshold: -9.4738
Txt: John wants to know how many men work part time, and which.
Hyp: John wants to know which men work part time. (yes)
John NNP |
wants VBZ |
to TO |
know VB |
which WDT |
men NNS |
work VBP |
part NN |
time NN |
|
John:NNP | 0.28 | 1.07 | 0.96 | 1.07 | 0.96 | 1.19 | 1.02 | 1.27 | 1.18 |
wants:VBZ | 1.16 | 1.32 | 0.76 | 0.60 | 0.71 | 0.82 | 0.60 | 0.74 | 0.81 |
to:TO | 1.25 | 1.25 | 2.42 | 1.25 | 1.31 | 1.00 | 1.25 | 1.00 | 1.00 |
know:VB | 1.16 | 0.60 | 0.76 | 1.32 | 0.76 | 0.91 | 0.47 | 0.91 | 0.86 |
how:WRB | 1.22 | 1.25 | 1.26 | 1.16 | 1.31 | 1.00 | 1.22 | 1.00 | 1.00 |
many:JJ | 1.13 | 0.89 | 0.76 | 0.96 | 0.76 | 0.78 | 0.96 | 0.82 | 0.88 |
men:NNS | 1.19 | 0.73 | 0.71 | 0.82 | 0.71 | 0.28 | 0.72 | 1.05 | 0.92 |
work:VBP | 1.11 | 0.60 | 0.76 | 0.47 | 0.73 | 0.81 | 1.32 | 0.68 | 0.69 |
part:NN | 1.27 | 0.65 | 0.71 | 0.82 | 0.71 | 1.05 | 0.59 | 0.28 | 1.00 |
time:NN | 1.18 | 0.72 | 0.71 | 0.77 | 0.68 | 0.92 | 0.60 | 1.00 | 0.28 |
which:WDT | 1.25 | 1.21 | 1.31 | 1.25 | 2.42 | 1.00 | 1.23 | 1.00 | 0.97 |
NO_WORD | 0.28 | 0.17 | 1.55 | 0.34 | 0.40 | 0.28 | 0.36 | 0.05 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.26 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 9.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "part" modifying "time"
-1.00 1.00 NullPunisher.other : part
-1.00 1.00 NullPunisher.other : wants
-1.00 1.00 NullPunisher.other : time
-1.00 1.00 NullPunisher.other : men
-3.00 1.00 NullPunisher.entity : John
-1.00 1.00 NullPunisher.other : know
-1.00 1.00 NullPunisher.other : work
-0.10 1.00 NullPunisher.functionWord : to
-0.10 1.00 NullPunisher.functionWord : which
-2.00 1.00 RootEntailment.unalignedRoot : "wants" not aligned to anything
Hand-tuned score (dot product of above): -13.4042
Threshold: -9.4738
Txt: Bill spoke to everyone that John did. John spoke to Mary.
Hyp: Bill spoke to Mary. (yes)
Bill NNP |
spoke VBD |
Mary NNP |
|
Bill:NNP | 0.28 | 0.82 | 1.13 |
spoke:VBD | 0.91 | 1.32 | 1.16 |
everyone:NN | 0.82 | 0.80 | 1.07 |
that:IN | 1.31 | 1.24 | 1.57 |
John:NNP | 1.18 | 1.05 | 0.82 |
did:VBD | 0.87 | 0.60 | 1.16 |
John:NNP | 1.18 | 1.05 | 0.82 |
spoke:VBD | 0.91 | 1.32 | 1.16 |
Mary:NNP | 1.13 | 1.07 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.74 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.21 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Mary" modifying "spoke"
-3.00 1.00 NullPunisher.entity : Mary
-1.00 1.00 NullPunisher.other : spoke
-1.00 1.00 NullPunisher.other : Bill
-2.00 1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -8.6630
Threshold: -9.4738
Txt: Bill spoke to everyone that John did. Bill spoke to Mary.
Hyp: John spoke to Mary. (don't know)
John NNP |
spoke VBD |
Mary NNP |
|
Bill:NNP | 1.18 | 0.82 | 1.13 |
spoke:VBD | 1.14 | 1.32 | 1.16 |
everyone:NN | 1.07 | 0.80 | 1.07 |
that:IN | 1.57 | 1.24 | 1.57 |
John:NNP | 0.28 | 1.05 | 0.82 |
did:VBD | 1.16 | 0.60 | 1.16 |
Bill:NNP | 1.18 | 0.82 | 1.13 |
spoke:VBD | 1.14 | 1.32 | 1.16 |
Mary:NNP | 0.82 | 1.07 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.74 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.21 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Mary" modifying "spoke"
-1.00 1.00 NullPunisher.other : spoke
-3.00 1.00 NullPunisher.entity : John
-3.00 1.00 NullPunisher.entity : Mary
-2.00 1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -10.6630
Threshold: -9.4738
Txt: John said Mary wrote a report, and Bill did too.
Hyp: Bill said Mary wrote a report. (yes)
Bill NNP |
said VBD |
Mary NNP |
wrote VBD |
a DT |
report NN |
|
John:NNP | 0.92 | 1.07 | 0.82 | 1.05 | 0.96 | 1.17 |
said:VBD | 1.16 | 1.32 | 1.11 | 0.61 | 0.76 | 0.91 |
Mary:NNP | 0.87 | 1.02 | 0.28 | 1.07 | 0.96 | 1.12 |
wrote:VBD | 1.16 | 0.61 | 1.16 | 1.32 | 0.76 | 0.75 |
a:DT | 1.25 | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 |
report:NN | 1.01 | 0.82 | 1.12 | 0.66 | 0.71 | 0.28 |
Bill:NNP | 0.28 | 1.07 | 0.87 | 1.07 | 0.96 | 1.01 |
did:VBD | 1.13 | 0.65 | 1.16 | 0.56 | 0.76 | 0.85 |
too:RB | 1.47 | 0.91 | 1.47 | 0.91 | 1.09 | 1.21 |
NO_WORD | 0.28 | 0.17 | 0.28 | 0.36 | 0.82 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.15 Alignment.score
1.00 0.19 Alignment.isGood
-1.00 0.79 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : report
-3.00 1.00 NullPunisher.entity : Bill
-1.00 1.00 NullPunisher.other : wrote
-3.00 1.00 NullPunisher.entity : Mary
-1.00 1.00 NullPunisher.other : said
-2.00 1.00 RootEntailment.unalignedRoot : "said" not aligned to anything
Hand-tuned score (dot product of above): -12.1485
Threshold: -9.4738
Txt: John said Mary wrote a report, and Bill did too.
Hyp: John said Bill wrote a report. (yes)
John NNP |
said VBD |
Bill NNP |
wrote VBD |
a DT |
report NN |
|
John:NNP | 0.28 | 1.07 | 0.92 | 1.05 | 0.96 | 1.17 |
said:VBD | 1.16 | 1.32 | 1.16 | 0.61 | 0.76 | 0.91 |
Mary:NNP | 0.82 | 1.02 | 0.87 | 1.07 | 0.96 | 1.12 |
wrote:VBD | 1.14 | 0.61 | 1.16 | 1.32 | 0.76 | 0.75 |
a:DT | 1.25 | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 |
report:NN | 1.17 | 0.82 | 1.01 | 0.66 | 0.71 | 0.28 |
Bill:NNP | 0.92 | 1.07 | 0.28 | 1.07 | 0.96 | 1.01 |
did:VBD | 1.16 | 0.65 | 1.13 | 0.56 | 0.76 | 0.85 |
too:RB | 1.44 | 0.91 | 1.47 | 0.91 | 1.09 | 1.21 |
NO_WORD | 0.28 | 0.17 | 0.28 | 0.36 | 0.82 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.15 Alignment.score
1.00 0.19 Alignment.isGood
-1.00 0.79 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : Bill
-3.00 1.00 NullPunisher.entity : John
-1.00 1.00 NullPunisher.other : said
-1.00 1.00 NullPunisher.other : report
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : wrote
-2.00 1.00 RootEntailment.unalignedRoot : "said" not aligned to anything
Hand-tuned score (dot product of above): -12.1485
Threshold: -9.4738
Txt: John said that Mary wrote a report, and that Bill did too.
Hyp: Bill said Mary wrote a report. (don't know)
Bill NNP |
said VBD |
Mary NNP |
wrote VBD |
a DT |
report NN |
|
John:NNP | 0.92 | 1.07 | 0.82 | 1.05 | 0.96 | 1.17 |
said:VBD | 1.16 | 1.32 | 1.11 | 0.61 | 0.76 | 0.91 |
that:IN | 1.57 | 1.24 | 1.57 | 1.21 | 1.05 | 1.31 |
Mary:NNP | 0.87 | 1.02 | 0.28 | 1.07 | 0.96 | 1.12 |
wrote:VBD | 1.16 | 0.61 | 1.16 | 1.32 | 0.76 | 0.75 |
a:DT | 1.25 | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 |
report:NN | 1.01 | 0.82 | 1.12 | 0.66 | 0.71 | 0.28 |
that:IN | 1.57 | 1.24 | 1.57 | 1.21 | 1.05 | 1.31 |
Bill:NNP | 0.28 | 1.07 | 0.87 | 1.07 | 0.96 | 1.01 |
did:VBD | 1.13 | 0.65 | 1.16 | 0.56 | 0.76 | 0.85 |
too:RB | 1.47 | 0.91 | 1.47 | 0.91 | 1.09 | 1.21 |
NO_WORD | 0.28 | 0.17 | 0.28 | 0.36 | 0.82 | 0.09 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.15 Alignment.score
1.00 0.19 Alignment.isGood
-1.00 0.79 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : said
-1.00 1.00 NullPunisher.other : wrote
-3.00 1.00 NullPunisher.entity : Bill
-3.00 1.00 NullPunisher.entity : Mary
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : report
-2.00 1.00 RootEntailment.unalignedRoot : "said" not aligned to anything
Hand-tuned score (dot product of above): -12.1485
Threshold: -9.4738
Txt: John wrote a report, and Bill said Peter did too.
Hyp: Bill said Peter wrote a report. (yes)
Bill NNP |
said VBD |
Peter NNP |
wrote VBD |
a DT |
report NN |
|
John:NNP | 0.92 | 1.07 | 0.84 | 1.05 | 0.96 | 1.17 |
wrote:VBD | 1.16 | 0.61 | 1.12 | 1.32 | 0.76 | 0.75 |
a:DT | 1.25 | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 |
report:NN | 1.01 | 0.82 | 1.08 | 0.66 | 0.71 | 0.28 |
Bill:NNP | 0.28 | 1.07 | 0.89 | 1.07 | 0.96 | 1.01 |
said:VBD | 1.16 | 1.32 | 1.16 | 0.61 | 0.76 | 0.91 |
Peter:NNP | 0.89 | 1.07 | 0.28 | 1.03 | 0.96 | 1.08 |
did:VBD | 1.13 | 0.65 | 1.16 | 0.56 | 0.76 | 0.85 |
too:RB | 1.47 | 0.91 | 1.47 | 0.91 | 1.09 | 1.21 |
NO_WORD | 0.28 | 0.17 | 0.28 | 0.36 | 0.82 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.15 Alignment.score
1.00 0.19 Alignment.isGood
-1.00 0.79 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : said
-3.00 1.00 NullPunisher.entity : Peter
-1.00 1.00 NullPunisher.other : wrote
-3.00 1.00 NullPunisher.entity : Bill
-1.00 1.00 NullPunisher.other : report
-2.00 1.00 RootEntailment.unalignedRoot : "said" not aligned to anything
Hand-tuned score (dot product of above): -12.1485
Threshold: -9.4738
Txt: If John wrote a report, then Bill did too. John wrote a report.
Hyp: Bill wrote a report. (yes)
Bill NNP |
wrote VBD |
a DT |
report NN |
|
If:IN | 1.57 | 1.24 | 1.05 | 1.31 |
John:NNP | 0.92 | 1.05 | 0.96 | 1.17 |
wrote:VBD | 1.16 | 1.32 | 0.76 | 0.75 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 |
report:NN | 1.01 | 0.66 | 0.71 | 0.28 |
then:RB | 1.47 | 0.91 | 1.09 | 1.21 |
Bill:NNP | 0.28 | 1.07 | 0.96 | 1.01 |
did:VBD | 1.13 | 0.56 | 0.76 | 0.85 |
too:RB | 1.47 | 0.91 | 1.09 | 1.21 |
John:NNP | 0.92 | 1.05 | 0.96 | 1.17 |
wrote:VBD | 1.16 | 1.32 | 0.76 | 0.75 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 |
report:NN | 1.01 | 0.66 | 0.71 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.20 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : report
-1.00 1.00 NullPunisher.other : wrote
-0.10 1.00 NullPunisher.article : a
-3.00 1.00 NullPunisher.entity : Bill
-2.00 1.00 RootEntailment.unalignedRoot : "wrote" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738
Txt: John wanted to buy a car, and he did.
Hyp: John bought a car. (yes)
John NNP |
bought VBD |
a DT |
car NN |
|
John:NNP | 0.28 | 1.03 | 0.96 | 1.09 |
wanted:VBD | 1.16 | 0.53 | 0.76 | 0.73 |
to:TO | 1.25 | 1.25 | 1.31 | 1.00 |
buy:VB | 1.16 | 0.47 | 0.76 | 0.83 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 |
car:NN | 1.09 | 0.73 | 0.71 | 0.28 |
he:PRP | 2.76 | 0.97 | 1.11 | 1.30 |
did:VBD | 1.16 | 0.72 | 0.76 | 0.72 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.39 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.50 Alignment.txtSpan
1.00 1.00 Factive.inPositiveEmbedding : embedded positive text
-0.10 1.00 NullPunisher.article : a
-3.00 1.00 NullPunisher.entity : John
Hand-tuned score (dot product of above): -2.2764
Threshold: -9.4738
Txt: John needed to buy a car, and Bill did.
Hyp: Bill bought a car. (don't know)
Bill NNP |
bought VBD |
a DT |
car NN |
|
John:NNP | 0.92 | 1.03 | 0.96 | 1.09 |
needed:VBD | 1.16 | 0.75 | 0.76 | 0.91 |
to:TO | 1.25 | 1.25 | 1.31 | 1.00 |
buy:VB | 1.13 | 0.47 | 0.76 | 0.83 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 |
car:NN | 1.20 | 0.73 | 0.71 | 0.28 |
Bill:NNP | 0.28 | 1.07 | 0.96 | 1.20 |
did:VBD | 1.13 | 0.72 | 0.76 | 0.72 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.39 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.50 Alignment.txtSpan
1.00 1.00 Factive.inPositiveEmbedding : embedded positive text
-3.00 1.00 NullPunisher.entity : Bill
-0.10 1.00 NullPunisher.article : a
Hand-tuned score (dot product of above): -2.2764
Threshold: -9.4738
Txt: Smith represents his company and so does Jones.
Hyp: Jones represents Jones' company. (yes)
Jones NNP |
represents VBZ |
Jones NNP |
company NN |
|
Smith:NNP | 0.70 | 1.07 | 0.70 | 1.18 |
represents:VBZ | 1.16 | 1.32 | 1.16 | 0.84 |
his:PRP$ | 1.55 | 0.97 | 1.55 | 1.30 |
company:NN | 1.20 | 0.75 | 1.20 | 0.28 |
so:RB | 1.47 | 0.91 | 1.47 | 1.21 |
does:VBZ | 1.04 | 0.70 | 1.04 | 0.83 |
Jones:NNP | 0.28 | 1.07 | 0.28 | 1.20 |
NO_WORD | 0.28 | 0.17 | 0.12 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.03 Alignment.score
1.00 0.17 Alignment.isGood
-1.00 0.81 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : Jones
-1.00 1.00 NullPunisher.other : company
-3.00 1.00 NullPunisher.entity : Jones
-1.00 1.00 NullPunisher.other : represents
-2.00 1.00 RootEntailment.unalignedRoot : "represents" not aligned to anything
Hand-tuned score (dot product of above): -11.0087
Threshold: -9.4738
Txt: Smith represents his company and so does Jones.
Hyp: Jones represents Smith's company. (yes)
Jones NNP |
represents VBZ |
Smith NNP |
company NN |
|
Smith:NNP | 0.70 | 1.07 | 0.28 | 1.18 |
represents:VBZ | 1.16 | 1.32 | 1.16 | 0.84 |
his:PRP$ | 1.55 | 0.97 | 2.76 | 1.30 |
company:NN | 1.20 | 0.75 | 1.18 | 0.28 |
so:RB | 1.47 | 0.91 | 1.47 | 1.21 |
does:VBZ | 1.04 | 0.70 | 1.16 | 0.83 |
Jones:NNP | 0.28 | 1.07 | 0.70 | 1.20 |
NO_WORD | 0.28 | 0.17 | 0.12 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.03 Alignment.score
1.00 0.17 Alignment.isGood
-1.00 0.81 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : company
-3.00 1.00 NullPunisher.entity : Jones
-1.00 1.00 NullPunisher.other : represents
-3.00 1.00 NullPunisher.entity : Smith
-2.00 1.00 RootEntailment.unalignedRoot : "represents" not aligned to anything
Hand-tuned score (dot product of above): -11.0087
Threshold: -9.4738
Txt: Smith represents his company and so does Jones.
Hyp: Smith represents Jones' company. (don't know)
Smith NNP |
represents VBZ |
Jones NNP |
company NN |
|
Smith:NNP | 0.28 | 1.07 | 0.70 | 1.18 |
represents:VBZ | 1.16 | 1.32 | 1.16 | 0.84 |
his:PRP$ | 2.76 | 0.97 | 1.55 | 1.30 |
company:NN | 1.18 | 0.75 | 1.20 | 0.28 |
so:RB | 1.47 | 0.91 | 1.47 | 1.21 |
does:VBZ | 1.16 | 0.70 | 1.04 | 0.83 |
Jones:NNP | 0.70 | 1.07 | 0.28 | 1.20 |
NO_WORD | 0.28 | 0.17 | 0.12 | 0.09 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.03 Alignment.score
1.00 0.17 Alignment.isGood
-1.00 0.81 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : represents
-3.00 1.00 NullPunisher.entity : Smith
-3.00 1.00 NullPunisher.entity : Jones
-1.00 1.00 NullPunisher.other : company
-2.00 1.00 RootEntailment.unalignedRoot : "represents" not aligned to anything
Hand-tuned score (dot product of above): -11.0087
Threshold: -9.4738
Txt: Smith claimed he had costed his proposal and so did Jones.
Hyp: Jones claimed he had costed his own proposal. (yes)
Jones NNP |
claimed VBD |
he PRP |
had VBD |
costed VBN |
his PRP$ |
own JJ |
proposal NN |
|
Smith:NNP | 0.70 | 1.07 | 1.84 | 1.07 | 1.05 | 1.84 | 1.24 | 1.16 |
claimed:VBD | 1.16 | 1.32 | 0.54 | 0.78 | 0.71 | 0.54 | 1.02 | 0.91 |
he:PRP | 1.55 | 0.97 | 1.05 | 0.93 | 0.97 | 0.82 | 1.39 | 1.30 |
had:VBD | 1.16 | 0.78 | 0.50 | 1.32 | 0.94 | 0.47 | 1.02 | 0.91 |
costed:VBN | 1.10 | 0.71 | 0.54 | 0.94 | 1.32 | 0.54 | 1.02 | 0.91 |
his:PRP$ | 1.55 | 0.97 | 0.58 | 0.90 | 0.97 | 1.05 | 1.39 | 1.30 |
proposal:NN | 1.16 | 0.82 | 1.58 | 0.82 | 0.82 | 1.58 | 0.99 | 0.28 |
so:RB | 1.47 | 0.91 | 1.05 | 0.91 | 0.91 | 1.05 | 1.05 | 1.21 |
did:VBD | 1.16 | 0.43 | 0.54 | 0.79 | 0.88 | 0.47 | 1.02 | 0.87 |
Jones:NNP | 0.28 | 1.07 | 1.84 | 1.07 | 1.01 | 1.84 | 1.24 | 1.16 |
NO_WORD | 0.28 | 0.17 | 0.54 | 1.30 | 0.36 | 0.93 | 0.11 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.40 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 8.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "own" modifying "proposal"
-1.00 1.00 NullPunisher.other : own
-1.00 1.00 NullPunisher.other : costed
-3.00 1.00 NullPunisher.entity : Jones
-0.05 1.00 NullPunisher.aux : had
-1.00 1.00 NullPunisher.other : claimed
-0.10 1.00 NullPunisher.functionWord : his
-1.00 1.00 NullPunisher.other : proposal
-1.00 1.00 NullPunisher.other : he
-2.00 1.00 RootEntailment.unalignedRoot : "claimed" not aligned to anything
Hand-tuned score (dot product of above): -12.0533
Threshold: -9.4738
Txt: Smith claimed he had costed his proposal and so did Jones.
Hyp: Jones claimed he had costed Smith's proposal. (yes)
Jones NNP |
claimed VBD |
he PRP |
had VBD |
costed VBN |
Smith NNP |
proposal NN |
|
Smith:NNP | 0.70 | 1.07 | 1.84 | 1.07 | 1.05 | 0.28 | 1.16 |
claimed:VBD | 1.16 | 1.32 | 0.54 | 0.78 | 0.71 | 1.16 | 0.91 |
he:PRP | 1.55 | 0.97 | 1.05 | 0.93 | 0.97 | 2.76 | 1.30 |
had:VBD | 1.16 | 0.78 | 0.50 | 1.32 | 0.94 | 1.16 | 0.91 |
costed:VBN | 1.10 | 0.71 | 0.54 | 0.94 | 1.32 | 1.14 | 0.91 |
his:PRP$ | 1.55 | 0.97 | 0.58 | 0.90 | 0.97 | 2.76 | 1.30 |
proposal:NN | 1.16 | 0.82 | 1.58 | 0.82 | 0.82 | 1.16 | 0.28 |
so:RB | 1.47 | 0.91 | 1.05 | 0.91 | 0.91 | 1.47 | 1.21 |
did:VBD | 1.16 | 0.43 | 0.54 | 0.79 | 0.88 | 1.16 | 0.87 |
Jones:NNP | 0.28 | 1.07 | 1.84 | 1.07 | 1.01 | 0.70 | 1.16 |
NO_WORD | 0.28 | 0.17 | 0.54 | 1.30 | 0.36 | 0.12 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.33 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : proposal
-1.00 1.00 NullPunisher.other : costed
-1.00 1.00 NullPunisher.other : he
-3.00 1.00 NullPunisher.entity : Jones
-3.00 1.00 NullPunisher.entity : Smith
-0.05 1.00 NullPunisher.aux : had
-1.00 1.00 NullPunisher.other : claimed
-2.00 1.00 RootEntailment.unalignedRoot : "claimed" not aligned to anything
Hand-tuned score (dot product of above): -12.9557
Threshold: -9.4738
Txt: Smith claimed he had costed his proposal and so did Jones.
Hyp: Jones claimed Smith had costed Smith's proposal. (yes)
Jones NNP |
claimed VBD |
Smith NNP |
had VBD |
costed VBN |
Smith NNP |
proposal NN |
|
Smith:NNP | 0.70 | 1.07 | 0.28 | 1.07 | 1.05 | 0.28 | 1.16 |
claimed:VBD | 1.16 | 1.32 | 1.16 | 0.78 | 0.71 | 1.16 | 0.91 |
he:PRP | 1.55 | 0.97 | 2.76 | 0.93 | 0.97 | 2.76 | 1.30 |
had:VBD | 1.16 | 0.78 | 1.16 | 1.32 | 0.94 | 1.16 | 0.91 |
costed:VBN | 1.10 | 0.71 | 1.14 | 0.94 | 1.32 | 1.14 | 0.91 |
his:PRP$ | 1.55 | 0.97 | 2.76 | 0.90 | 0.97 | 2.76 | 1.30 |
proposal:NN | 1.16 | 0.82 | 1.16 | 0.82 | 0.82 | 1.16 | 0.28 |
so:RB | 1.47 | 0.91 | 1.47 | 0.91 | 0.91 | 1.47 | 1.21 |
did:VBD | 1.16 | 0.43 | 1.16 | 0.79 | 0.88 | 1.16 | 0.87 |
Jones:NNP | 0.28 | 1.07 | 0.70 | 1.07 | 1.01 | 0.70 | 1.16 |
NO_WORD | 0.28 | 0.17 | 0.28 | 1.30 | 0.36 | 0.12 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.21 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : proposal
-3.00 1.00 NullPunisher.entity : Smith
-0.05 1.00 NullPunisher.aux : had
-3.00 1.00 NullPunisher.entity : Jones
-1.00 1.00 NullPunisher.other : claimed
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : costed
-2.00 1.00 RootEntailment.unalignedRoot : "claimed" not aligned to anything
Hand-tuned score (dot product of above): -15.1125
Threshold: -9.4738
Txt: Smith claimed he had costed his proposal and so did Jones.
Hyp: Jones claimed Smith had costed Jones' proposal. (don't know)
Jones NNP |
claimed VBD |
Smith NNP |
had VBD |
costed VBN |
Jones NNP |
proposal NN |
|
Smith:NNP | 0.70 | 1.07 | 0.28 | 1.07 | 1.05 | 0.70 | 1.16 |
claimed:VBD | 1.16 | 1.32 | 1.16 | 0.78 | 0.71 | 1.16 | 0.91 |
he:PRP | 1.55 | 0.97 | 2.76 | 0.93 | 0.97 | 1.55 | 1.30 |
had:VBD | 1.16 | 0.78 | 1.16 | 1.32 | 0.94 | 1.16 | 0.91 |
costed:VBN | 1.10 | 0.71 | 1.14 | 0.94 | 1.32 | 1.10 | 0.91 |
his:PRP$ | 1.55 | 0.97 | 2.76 | 0.90 | 0.97 | 1.55 | 1.30 |
proposal:NN | 1.16 | 0.82 | 1.16 | 0.82 | 0.82 | 1.16 | 0.28 |
so:RB | 1.47 | 0.91 | 1.47 | 0.91 | 0.91 | 1.47 | 1.21 |
did:VBD | 1.16 | 0.43 | 1.16 | 0.79 | 0.88 | 1.16 | 0.87 |
Jones:NNP | 0.28 | 1.07 | 0.70 | 1.07 | 1.01 | 0.28 | 1.16 |
NO_WORD | 0.28 | 0.17 | 0.28 | 1.30 | 0.36 | 0.12 | 0.09 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.21 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : Jones
-1.00 1.00 NullPunisher.other : proposal
-1.00 1.00 NullPunisher.other : costed
-0.05 1.00 NullPunisher.aux : had
-3.00 1.00 NullPunisher.entity : Jones
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : claimed
-2.00 1.00 RootEntailment.unalignedRoot : "claimed" not aligned to anything
Hand-tuned score (dot product of above): -15.1125
Threshold: -9.4738
Txt: John is a man and Mary is a woman. John represents his company and so does Mary.
Hyp: Mary represents her own company. (yes)
Mary NNP |
represents VBZ |
her PRP$ |
own JJ |
company NN |
|
John:NNP | 0.82 | 1.07 | 1.84 | 1.15 | 1.20 |
is:VBZ | 1.16 | 0.31 | 0.54 | 1.02 | 0.91 |
a:DT | 1.25 | 1.25 | 1.16 | 1.18 | 1.00 |
man:NN | 0.94 | 0.82 | 1.58 | 0.91 | 0.85 |
Mary:NNP | 0.28 | 1.07 | 1.80 | 1.24 | 1.10 |
is:VBZ | 1.16 | 0.31 | 0.54 | 1.02 | 0.91 |
a:DT | 1.25 | 1.25 | 1.16 | 1.18 | 1.00 |
woman:NN | 1.02 | 0.80 | 1.58 | 0.93 | 0.78 |
John:NNP | 0.82 | 1.07 | 1.84 | 1.15 | 1.20 |
represents:VBZ | 1.16 | 1.32 | 0.54 | 1.02 | 0.84 |
his:PRP$ | 1.55 | 0.97 | 0.90 | 1.99 | 1.30 |
company:NN | 1.10 | 0.75 | 1.58 | 0.99 | 0.28 |
so:RB | 1.47 | 0.91 | 1.05 | 1.05 | 1.21 |
does:VBZ | 1.16 | 0.70 | 0.51 | 0.99 | 0.83 |
Mary:NNP | 0.28 | 1.07 | 1.80 | 1.24 | 1.10 |
NO_WORD | 0.28 | 0.17 | 0.93 | 0.11 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.21 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "own" modifying "company"
-3.00 1.00 NullPunisher.entity : Mary
-0.10 1.00 NullPunisher.functionWord : her
-1.00 1.00 NullPunisher.other : company
-1.00 1.00 NullPunisher.other : own
-1.00 1.00 NullPunisher.other : represents
-2.00 1.00 RootEntailment.unalignedRoot : "represents" not aligned to anything
Hand-tuned score (dot product of above): -9.9708
Threshold: -9.4738
Txt: John is a man and Mary is a woman. John represents his company and so does Mary.
Hyp: Mary represents John's company. (yes)
Mary NNP |
represents VBZ |
John NNP |
company NN |
|
John:NNP | 0.82 | 1.07 | 0.28 | 1.20 |
is:VBZ | 1.16 | 0.31 | 1.16 | 0.91 |
a:DT | 1.25 | 1.25 | 1.25 | 1.00 |
man:NN | 0.94 | 0.82 | 1.08 | 0.85 |
Mary:NNP | 0.28 | 1.07 | 0.82 | 1.10 |
is:VBZ | 1.16 | 0.31 | 1.16 | 0.91 |
a:DT | 1.25 | 1.25 | 1.25 | 1.00 |
woman:NN | 1.02 | 0.80 | 1.05 | 0.78 |
John:NNP | 0.82 | 1.07 | 0.28 | 1.20 |
represents:VBZ | 1.16 | 1.32 | 1.16 | 0.84 |
his:PRP$ | 1.55 | 0.97 | 2.76 | 1.30 |
company:NN | 1.10 | 0.75 | 1.20 | 0.28 |
so:RB | 1.47 | 0.91 | 1.47 | 1.21 |
does:VBZ | 1.16 | 0.70 | 1.11 | 0.83 |
Mary:NNP | 0.28 | 1.07 | 0.82 | 1.10 |
NO_WORD | 0.28 | 0.17 | 0.12 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.03 Alignment.score
1.00 0.17 Alignment.isGood
-1.00 0.81 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : John
-1.00 1.00 NullPunisher.other : company
-3.00 1.00 NullPunisher.entity : Mary
-1.00 1.00 NullPunisher.other : represents
-2.00 1.00 RootEntailment.unalignedRoot : "represents" not aligned to anything
Hand-tuned score (dot product of above): -11.0087
Threshold: -9.4738
Txt: Bill suggested to Frank's boss that they should go to the meeting together, and Carl to Alan's wife.
Hyp: If it was suggested that Bill and Frank should go together, it was suggested that Carl and Alan should go together. (yes)
If IN |
it PRP |
was VBD |
suggested VBN |
that IN |
Bill NNP |
Frank NNP |
should MD |
go VB |
together RB |
it PRP |
was VBD |
suggested VBN |
that IN |
Carl NNP |
Alan NNP |
should MD |
go VB |
together RB |
|
Bill:NNP | 0.75 | 1.84 | 1.07 | 1.07 | 0.75 | 0.28 | 0.88 | 0.96 | 1.07 | 1.56 | 1.84 | 1.07 | 1.07 | 0.75 | 0.76 | 0.82 | 0.96 | 1.07 | 1.56 |
suggested:VBD | 1.15 | 0.54 | 0.86 | 2.02 | 1.15 | 1.16 | 1.16 | 0.74 | 0.82 | 1.05 | 0.54 | 0.86 | 2.02 | 1.15 | 1.16 | 1.16 | 0.74 | 0.82 | 1.05 |
Frank:NNP | 0.75 | 1.84 | 1.07 | 1.07 | 0.73 | 0.88 | 0.28 | 0.96 | 1.07 | 1.56 | 1.84 | 1.07 | 1.07 | 0.73 | 0.79 | 0.74 | 0.96 | 1.07 | 1.56 |
boss:NN | 0.50 | 1.58 | 0.78 | 0.75 | 0.50 | 1.11 | 1.04 | 0.71 | 0.76 | 1.31 | 1.58 | 0.78 | 0.75 | 0.50 | 1.07 | 1.07 | 0.71 | 0.76 | 1.31 |
that:IN | 1.05 | 1.57 | 1.21 | 1.24 | 1.59 | 1.57 | 1.54 | 1.05 | 1.24 | 1.23 | 1.57 | 1.21 | 1.24 | 1.59 | 1.57 | 1.51 | 1.05 | 1.24 | 1.23 |
they:PRP | 1.05 | 0.85 | 1.21 | 0.97 | 1.18 | 1.55 | 2.76 | 1.11 | 0.97 | 1.01 | 0.85 | 1.21 | 0.97 | 1.18 | 1.79 | 2.04 | 1.11 | 0.97 | 1.01 |
should:MD | 1.05 | 1.16 | 1.25 | 1.24 | 1.05 | 1.25 | 1.25 | 2.42 | 1.25 | 1.45 | 1.16 | 1.25 | 1.24 | 1.05 | 1.25 | 1.25 | 2.42 | 1.25 | 1.45 |
go:VB | 1.15 | 0.54 | 0.92 | 0.82 | 1.15 | 1.16 | 1.16 | 0.76 | 1.32 | 1.11 | 0.54 | 0.92 | 0.82 | 1.15 | 1.16 | 1.16 | 0.76 | 1.32 | 1.11 |
the:DT | 1.05 | 1.16 | 1.25 | 1.25 | 0.95 | 1.25 | 1.25 | 1.31 | 1.25 | 1.43 | 1.16 | 1.25 | 1.25 | 0.95 | 1.25 | 1.25 | 1.31 | 1.25 | 1.43 |
meeting:NN | 0.50 | 1.58 | 0.82 | 0.41 | 0.50 | 1.17 | 1.14 | 0.71 | 0.75 | 1.26 | 1.58 | 0.82 | 0.41 | 0.50 | 1.07 | 1.07 | 0.71 | 0.75 | 1.26 |
together:RB | 1.05 | 1.05 | 0.91 | 0.85 | 1.05 | 1.47 | 1.47 | 1.09 | 0.91 | 1.23 | 1.05 | 0.91 | 0.85 | 1.05 | 1.47 | 1.47 | 1.09 | 0.91 | 1.23 |
Carl:NNP | 0.75 | 1.84 | 1.04 | 1.07 | 0.75 | 0.76 | 0.79 | 0.96 | 1.07 | 1.56 | 1.84 | 1.04 | 1.07 | 0.75 | 0.28 | 0.82 | 0.96 | 1.07 | 1.56 |
Alan:NNP | 0.75 | 1.84 | 1.04 | 1.07 | 0.70 | 0.82 | 0.74 | 0.96 | 1.07 | 1.56 | 1.84 | 1.04 | 1.07 | 0.70 | 0.82 | 0.28 | 0.96 | 1.07 | 1.56 |
wife:NN | 0.42 | 1.58 | 0.78 | 0.75 | 0.50 | 1.09 | 1.03 | 0.71 | 0.69 | 1.31 | 1.58 | 0.78 | 0.75 | 0.50 | 1.07 | 1.07 | 0.71 | 0.69 | 1.31 |
NO_WORD | 0.92 | 0.67 | 1.57 | 0.42 | 0.60 | 0.28 | 0.01 | 1.55 | 0.36 | 0.04 | 0.67 | 1.57 | 0.17 | 0.60 | 0.28 | 0.01 | 1.55 | 0.36 | 0.04 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.65 Alignment.score
1.00 0.28 Alignment.isGood
-1.00 0.69 Alignment.isBad
-0.10 18.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.05 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "together" modifying "go"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "boss" modifying "suggested" is dropped on aligned hypothesis word "suggested"
-3.00 1.00 NullPunisher.entity : Bill
-1.00 1.00 NullPunisher.other : go
-1.00 1.00 NullPunisher.other : together
-0.05 1.00 NullPunisher.aux : was
-0.10 1.00 NullPunisher.functionWord : that
-0.05 1.00 NullPunisher.aux : should
-1.00 1.00 NullPunisher.other : together
-0.10 1.00 NullPunisher.functionWord : that
-3.00 1.00 NullPunisher.entity : Carl
-0.05 1.00 NullPunisher.aux : should
-1.00 1.00 NullPunisher.other : it
-0.05 1.00 NullPunisher.aux : was
-1.00 1.00 NullPunisher.other : it
-1.00 1.00 NullPunisher.other : suggested
-1.00 1.00 NullPunisher.other : If
-3.00 1.00 NullPunisher.entity : Alan
-3.00 1.00 NullPunisher.entity : Frank
-1.00 1.00 NullPunisher.other : go
-3.00 1.00 Structure.argsMismatch : go (verbal compl of the root) not aligned
Hand-tuned score (dot product of above): -25.3504
Threshold: -9.4738
Txt: Bill suggested to Frank's boss that they should go to the meeting together, and Carl to Alan's wife.
Hyp: If it was suggested that Bill and Frank should go together, it was suggested that Carl and Alan's wife should go together. (don't know)
If IN |
it PRP |
was VBD |
suggested VBN |
that IN |
Bill NNP |
Frank NNP |
should MD |
go VB |
together RB |
it PRP |
was VBD |
suggested VBN |
that IN |
Carl NNP |
Alan NNP |
wife NN |
should MD |
go VB |
together RB |
|
Bill:NNP | 0.75 | 1.84 | 1.07 | 1.07 | 0.75 | 0.28 | 0.88 | 0.96 | 1.07 | 1.56 | 1.84 | 1.07 | 1.07 | 0.75 | 0.76 | 0.82 | 1.09 | 0.96 | 1.07 | 1.56 |
suggested:VBD | 1.15 | 0.54 | 0.86 | 2.02 | 1.15 | 1.16 | 1.16 | 0.74 | 0.82 | 1.05 | 0.54 | 0.86 | 2.02 | 1.15 | 1.16 | 1.16 | 0.84 | 0.74 | 0.82 | 1.05 |
Frank:NNP | 0.75 | 1.84 | 1.07 | 1.07 | 0.73 | 0.88 | 0.28 | 0.96 | 1.07 | 1.56 | 1.84 | 1.07 | 1.07 | 0.73 | 0.79 | 0.74 | 1.03 | 0.96 | 1.07 | 1.56 |
boss:NN | 0.50 | 1.58 | 0.78 | 0.75 | 0.50 | 1.11 | 1.04 | 0.71 | 0.76 | 1.31 | 1.58 | 0.78 | 0.75 | 0.50 | 1.07 | 1.07 | 0.53 | 0.71 | 0.76 | 1.31 |
that:IN | 1.05 | 1.57 | 1.21 | 1.24 | 1.59 | 1.57 | 1.54 | 1.05 | 1.24 | 1.23 | 1.57 | 1.21 | 1.24 | 1.59 | 1.57 | 1.51 | 1.31 | 1.05 | 1.24 | 1.23 |
they:PRP | 1.05 | 0.85 | 1.21 | 0.97 | 1.18 | 1.55 | 2.76 | 1.11 | 0.97 | 1.01 | 0.85 | 1.21 | 0.97 | 1.18 | 1.79 | 2.04 | 1.30 | 1.11 | 0.97 | 1.01 |
should:MD | 1.05 | 1.16 | 1.25 | 1.24 | 1.05 | 1.25 | 1.25 | 2.42 | 1.25 | 1.45 | 1.16 | 1.25 | 1.24 | 1.05 | 1.25 | 1.25 | 1.00 | 2.42 | 1.25 | 1.45 |
go:VB | 1.15 | 0.54 | 0.92 | 0.82 | 1.15 | 1.16 | 1.16 | 0.76 | 1.32 | 1.11 | 0.54 | 0.92 | 0.82 | 1.15 | 1.16 | 1.16 | 0.78 | 0.76 | 1.32 | 1.11 |
the:DT | 1.05 | 1.16 | 1.25 | 1.25 | 0.95 | 1.25 | 1.25 | 1.31 | 1.25 | 1.43 | 1.16 | 1.25 | 1.25 | 0.95 | 1.25 | 1.25 | 0.96 | 1.31 | 1.25 | 1.43 |
meeting:NN | 0.50 | 1.58 | 0.82 | 0.41 | 0.50 | 1.17 | 1.14 | 0.71 | 0.75 | 1.26 | 1.58 | 0.82 | 0.41 | 0.50 | 1.07 | 1.07 | 0.89 | 0.71 | 0.75 | 1.26 |
together:RB | 1.05 | 1.05 | 0.91 | 0.85 | 1.05 | 1.47 | 1.47 | 1.09 | 0.91 | 1.23 | 1.05 | 0.91 | 0.85 | 1.05 | 1.47 | 1.47 | 1.21 | 1.09 | 0.91 | 1.23 |
Carl:NNP | 0.75 | 1.84 | 1.04 | 1.07 | 0.75 | 0.76 | 0.79 | 0.96 | 1.07 | 1.56 | 1.84 | 1.04 | 1.07 | 0.75 | 0.28 | 0.82 | 1.07 | 0.96 | 1.07 | 1.56 |
Alan:NNP | 0.75 | 1.84 | 1.04 | 1.07 | 0.70 | 0.82 | 0.74 | 0.96 | 1.07 | 1.56 | 1.84 | 1.04 | 1.07 | 0.70 | 0.82 | 0.28 | 1.07 | 0.96 | 1.07 | 1.56 |
wife:NN | 0.42 | 1.58 | 0.78 | 0.75 | 0.50 | 1.09 | 1.03 | 0.71 | 0.69 | 1.31 | 1.58 | 0.78 | 0.75 | 0.50 | 1.07 | 1.07 | 0.28 | 0.71 | 0.69 | 1.31 |
NO_WORD | 0.92 | 0.67 | 1.57 | 0.42 | 0.60 | 0.28 | 0.01 | 1.55 | 0.36 | 0.04 | 0.67 | 1.57 | 0.17 | 0.60 | 0.28 | 0.12 | 0.01 | 1.55 | 0.36 | 0.04 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.63 Alignment.score
1.00 0.28 Alignment.isGood
-1.00 0.70 Alignment.isBad
-0.10 19.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.05 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "together" modifying "go"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "boss" modifying "suggested" is dropped on aligned hypothesis word "suggested"
-1.00 1.00 NullPunisher.other : go
-0.10 1.00 NullPunisher.functionWord : that
-1.00 1.00 NullPunisher.other : If
-0.05 1.00 NullPunisher.aux : should
-1.00 1.00 NullPunisher.other : together
-1.00 1.00 NullPunisher.other : it
-3.00 1.00 NullPunisher.entity : Carl
-1.00 1.00 NullPunisher.other : it
-3.00 1.00 NullPunisher.entity : Frank
-1.00 1.00 NullPunisher.other : wife
-3.00 1.00 NullPunisher.entity : Bill
-0.05 1.00 NullPunisher.aux : was
-1.00 1.00 NullPunisher.other : go
-1.00 1.00 NullPunisher.other : together
-0.05 1.00 NullPunisher.aux : was
-0.10 1.00 NullPunisher.functionWord : that
-1.00 1.00 NullPunisher.other : suggested
-3.00 1.00 NullPunisher.entity : Alan
-0.05 1.00 NullPunisher.aux : should
-3.00 1.00 Structure.argsMismatch : go (verbal compl of the root) not aligned
Hand-tuned score (dot product of above): -26.4886
Threshold: -9.4738
Txt: Bill suggested to Frank's boss that they should go to the meeting together, and Carl to Alan's wife.
Hyp: If it was suggested that Bill and Frank's boss should go together, it was suggested that Carl and Alan's wife should go together. (yes)
If IN |
it PRP |
was VBD |
suggested VBN |
that IN |
Bill NNP |
Frank NNP |
boss NN |
should MD |
go VB |
together RB |
it PRP |
was VBD |
suggested VBN |
that IN |
Carl NNP |
Alan NNP |
wife NN |
should MD |
go VB |
together RB |
|
Bill:NNP | 0.75 | 1.84 | 1.07 | 1.07 | 0.75 | 0.28 | 0.88 | 1.11 | 0.96 | 1.07 | 1.56 | 1.84 | 1.07 | 1.07 | 0.75 | 0.76 | 0.82 | 1.09 | 0.96 | 1.07 | 1.56 |
suggested:VBD | 1.15 | 0.54 | 0.86 | 2.02 | 1.15 | 1.16 | 1.16 | 0.83 | 0.74 | 0.82 | 1.05 | 0.54 | 0.86 | 2.02 | 1.15 | 1.16 | 1.16 | 0.84 | 0.74 | 0.82 | 1.05 |
Frank:NNP | 0.75 | 1.84 | 1.07 | 1.07 | 0.73 | 0.88 | 0.28 | 1.04 | 0.96 | 1.07 | 1.56 | 1.84 | 1.07 | 1.07 | 0.73 | 0.79 | 0.74 | 1.03 | 0.96 | 1.07 | 1.56 |
boss:NN | 0.50 | 1.58 | 0.78 | 0.75 | 0.50 | 1.11 | 1.04 | 0.28 | 0.71 | 0.76 | 1.31 | 1.58 | 0.78 | 0.75 | 0.50 | 1.07 | 1.07 | 0.53 | 0.71 | 0.76 | 1.31 |
that:IN | 1.05 | 1.57 | 1.21 | 1.24 | 1.59 | 1.57 | 1.54 | 1.31 | 1.05 | 1.24 | 1.23 | 1.57 | 1.21 | 1.24 | 1.59 | 1.57 | 1.51 | 1.31 | 1.05 | 1.24 | 1.23 |
they:PRP | 1.05 | 0.85 | 1.21 | 0.97 | 1.18 | 1.55 | 2.76 | 1.30 | 1.11 | 0.97 | 1.01 | 0.85 | 1.21 | 0.97 | 1.18 | 1.79 | 2.04 | 1.30 | 1.11 | 0.97 | 1.01 |
should:MD | 1.05 | 1.16 | 1.25 | 1.24 | 1.05 | 1.25 | 1.25 | 1.00 | 2.42 | 1.25 | 1.45 | 1.16 | 1.25 | 1.24 | 1.05 | 1.25 | 1.25 | 1.00 | 2.42 | 1.25 | 1.45 |
go:VB | 1.15 | 0.54 | 0.92 | 0.82 | 1.15 | 1.16 | 1.16 | 0.85 | 0.76 | 1.32 | 1.11 | 0.54 | 0.92 | 0.82 | 1.15 | 1.16 | 1.16 | 0.78 | 0.76 | 1.32 | 1.11 |
the:DT | 1.05 | 1.16 | 1.25 | 1.25 | 0.95 | 1.25 | 1.25 | 1.00 | 1.31 | 1.25 | 1.43 | 1.16 | 1.25 | 1.25 | 0.95 | 1.25 | 1.25 | 0.96 | 1.31 | 1.25 | 1.43 |
meeting:NN | 0.50 | 1.58 | 0.82 | 0.41 | 0.50 | 1.17 | 1.14 | 0.91 | 0.71 | 0.75 | 1.26 | 1.58 | 0.82 | 0.41 | 0.50 | 1.07 | 1.07 | 0.89 | 0.71 | 0.75 | 1.26 |
together:RB | 1.05 | 1.05 | 0.91 | 0.85 | 1.05 | 1.47 | 1.47 | 1.21 | 1.09 | 0.91 | 1.23 | 1.05 | 0.91 | 0.85 | 1.05 | 1.47 | 1.47 | 1.21 | 1.09 | 0.91 | 1.23 |
Carl:NNP | 0.75 | 1.84 | 1.04 | 1.07 | 0.75 | 0.76 | 0.79 | 1.07 | 0.96 | 1.07 | 1.56 | 1.84 | 1.04 | 1.07 | 0.75 | 0.28 | 0.82 | 1.07 | 0.96 | 1.07 | 1.56 |
Alan:NNP | 0.75 | 1.84 | 1.04 | 1.07 | 0.70 | 0.82 | 0.74 | 1.07 | 0.96 | 1.07 | 1.56 | 1.84 | 1.04 | 1.07 | 0.70 | 0.82 | 0.28 | 1.07 | 0.96 | 1.07 | 1.56 |
wife:NN | 0.42 | 1.58 | 0.78 | 0.75 | 0.50 | 1.09 | 1.03 | 0.53 | 0.71 | 0.69 | 1.31 | 1.58 | 0.78 | 0.75 | 0.50 | 1.07 | 1.07 | 0.28 | 0.71 | 0.69 | 1.31 |
NO_WORD | 0.92 | 0.67 | 1.57 | 0.42 | 0.60 | 0.28 | 0.12 | 0.01 | 1.55 | 0.36 | 0.04 | 0.67 | 1.57 | 0.17 | 0.60 | 0.28 | 0.12 | 0.01 | 1.55 | 0.36 | 0.04 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.60 Alignment.score
1.00 0.27 Alignment.isGood
-1.00 0.70 Alignment.isBad
-0.10 20.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.05 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "together" modifying "go"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "boss" modifying "suggested" is dropped on aligned hypothesis word "suggested"
-1.00 1.00 NullPunisher.other : go
-1.00 1.00 NullPunisher.other : suggested
-3.00 1.00 NullPunisher.entity : Alan
-1.00 1.00 NullPunisher.other : together
-0.05 1.00 NullPunisher.aux : was
-1.00 1.00 NullPunisher.other : it
-0.05 1.00 NullPunisher.aux : was
-0.05 1.00 NullPunisher.aux : should
-0.10 1.00 NullPunisher.functionWord : that
-0.05 1.00 NullPunisher.aux : should
-1.00 1.00 NullPunisher.other : If
-3.00 1.00 NullPunisher.entity : Carl
-1.00 1.00 NullPunisher.other : boss
-1.00 1.00 NullPunisher.other : wife
-1.00 1.00 NullPunisher.other : it
-3.00 1.00 NullPunisher.entity : Bill
-1.00 1.00 NullPunisher.other : together
-0.10 1.00 NullPunisher.functionWord : that
-1.00 1.00 NullPunisher.other : go
-3.00 1.00 NullPunisher.entity : Frank
-3.00 1.00 Structure.argsMismatch : go (verbal compl of the root) not aligned
Hand-tuned score (dot product of above): -27.6230
Threshold: -9.4738
Txt: Bill suggested to Frank's boss that they should go to the meeting together, and Carl to Alan's wife.
Hyp: If it was suggested that Bill and Frank's boss should go together, it was suggested that Carl and Alan should go together. (don't know)
If IN |
it PRP |
was VBD |
suggested VBN |
that IN |
Bill NNP |
Frank NNP |
boss NN |
should MD |
go VB |
together RB |
it PRP |
was VBD |
suggested VBN |
that IN |
Carl NNP |
Alan NNP |
should MD |
go VB |
together RB |
|
Bill:NNP | 0.75 | 1.84 | 1.07 | 1.07 | 0.75 | 0.28 | 0.88 | 1.11 | 0.96 | 1.07 | 1.56 | 1.84 | 1.07 | 1.07 | 0.75 | 0.76 | 0.82 | 0.96 | 1.07 | 1.56 |
suggested:VBD | 1.15 | 0.54 | 0.86 | 2.02 | 1.15 | 1.16 | 1.16 | 0.83 | 0.74 | 0.82 | 1.05 | 0.54 | 0.86 | 2.02 | 1.15 | 1.16 | 1.16 | 0.74 | 0.82 | 1.05 |
Frank:NNP | 0.75 | 1.84 | 1.07 | 1.07 | 0.73 | 0.88 | 0.28 | 1.04 | 0.96 | 1.07 | 1.56 | 1.84 | 1.07 | 1.07 | 0.73 | 0.79 | 0.74 | 0.96 | 1.07 | 1.56 |
boss:NN | 0.50 | 1.58 | 0.78 | 0.75 | 0.50 | 1.11 | 1.04 | 0.28 | 0.71 | 0.76 | 1.31 | 1.58 | 0.78 | 0.75 | 0.50 | 1.07 | 1.07 | 0.71 | 0.76 | 1.31 |
that:IN | 1.05 | 1.57 | 1.21 | 1.24 | 1.59 | 1.57 | 1.54 | 1.31 | 1.05 | 1.24 | 1.23 | 1.57 | 1.21 | 1.24 | 1.59 | 1.57 | 1.51 | 1.05 | 1.24 | 1.23 |
they:PRP | 1.05 | 0.85 | 1.21 | 0.97 | 1.18 | 1.55 | 2.76 | 1.30 | 1.11 | 0.97 | 1.01 | 0.85 | 1.21 | 0.97 | 1.18 | 1.79 | 2.04 | 1.11 | 0.97 | 1.01 |
should:MD | 1.05 | 1.16 | 1.25 | 1.24 | 1.05 | 1.25 | 1.25 | 1.00 | 2.42 | 1.25 | 1.45 | 1.16 | 1.25 | 1.24 | 1.05 | 1.25 | 1.25 | 2.42 | 1.25 | 1.45 |
go:VB | 1.15 | 0.54 | 0.92 | 0.82 | 1.15 | 1.16 | 1.16 | 0.85 | 0.76 | 1.32 | 1.11 | 0.54 | 0.92 | 0.82 | 1.15 | 1.16 | 1.16 | 0.76 | 1.32 | 1.11 |
the:DT | 1.05 | 1.16 | 1.25 | 1.25 | 0.95 | 1.25 | 1.25 | 1.00 | 1.31 | 1.25 | 1.43 | 1.16 | 1.25 | 1.25 | 0.95 | 1.25 | 1.25 | 1.31 | 1.25 | 1.43 |
meeting:NN | 0.50 | 1.58 | 0.82 | 0.41 | 0.50 | 1.17 | 1.14 | 0.91 | 0.71 | 0.75 | 1.26 | 1.58 | 0.82 | 0.41 | 0.50 | 1.07 | 1.07 | 0.71 | 0.75 | 1.26 |
together:RB | 1.05 | 1.05 | 0.91 | 0.85 | 1.05 | 1.47 | 1.47 | 1.21 | 1.09 | 0.91 | 1.23 | 1.05 | 0.91 | 0.85 | 1.05 | 1.47 | 1.47 | 1.09 | 0.91 | 1.23 |
Carl:NNP | 0.75 | 1.84 | 1.04 | 1.07 | 0.75 | 0.76 | 0.79 | 1.07 | 0.96 | 1.07 | 1.56 | 1.84 | 1.04 | 1.07 | 0.75 | 0.28 | 0.82 | 0.96 | 1.07 | 1.56 |
Alan:NNP | 0.75 | 1.84 | 1.04 | 1.07 | 0.70 | 0.82 | 0.74 | 1.07 | 0.96 | 1.07 | 1.56 | 1.84 | 1.04 | 1.07 | 0.70 | 0.82 | 0.28 | 0.96 | 1.07 | 1.56 |
wife:NN | 0.42 | 1.58 | 0.78 | 0.75 | 0.50 | 1.09 | 1.03 | 0.53 | 0.71 | 0.69 | 1.31 | 1.58 | 0.78 | 0.75 | 0.50 | 1.07 | 1.07 | 0.71 | 0.69 | 1.31 |
NO_WORD | 0.92 | 0.67 | 1.57 | 0.42 | 0.60 | 0.28 | 0.12 | 0.01 | 1.55 | 0.36 | 0.04 | 0.67 | 1.57 | 0.17 | 0.60 | 0.28 | 0.01 | 1.55 | 0.36 | 0.04 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.63 Alignment.score
1.00 0.28 Alignment.isGood
-1.00 0.70 Alignment.isBad
-0.10 19.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.05 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "together" modifying "go"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "boss" modifying "suggested" is dropped on aligned hypothesis word "suggested"
-1.00 1.00 NullPunisher.other : go
-0.05 1.00 NullPunisher.aux : was
-1.00 1.00 NullPunisher.other : suggested
-0.05 1.00 NullPunisher.aux : should
-3.00 1.00 NullPunisher.entity : Frank
-0.10 1.00 NullPunisher.functionWord : that
-0.05 1.00 NullPunisher.aux : was
-1.00 1.00 NullPunisher.other : it
-3.00 1.00 NullPunisher.entity : Alan
-3.00 1.00 NullPunisher.entity : Carl
-1.00 1.00 NullPunisher.other : together
-1.00 1.00 NullPunisher.other : together
-1.00 1.00 NullPunisher.other : go
-0.05 1.00 NullPunisher.aux : should
-1.00 1.00 NullPunisher.other : boss
-1.00 1.00 NullPunisher.other : If
-0.10 1.00 NullPunisher.functionWord : that
-3.00 1.00 NullPunisher.entity : Bill
-1.00 1.00 NullPunisher.other : it
-3.00 1.00 Structure.argsMismatch : go (verbal compl of the root) not aligned
Hand-tuned score (dot product of above): -26.4886
Threshold: -9.4738
Txt: Bill suggested to Frank's boss that they should go to the meeting together, and Carl to Alan's wife.
Hyp: If it was suggested that Bill, Frank and Frank's boss should go together, it was suggested that Carl, Alan and Alan's wife should go together. (yes)
If IN |
it PRP |
was VBD |
suggested VBN |
that IN |
Bill NNP |
Frank NNP |
Frank NNP |
boss NN |
should MD |
go VB |
together RB |
it PRP |
was VBD |
suggested VBN |
that IN |
Carl NNP |
Alan NNP |
Alan NNP |
wife NN |
should MD |
go VB |
together RB |
|
Bill:NNP | 0.75 | 1.84 | 1.07 | 1.07 | 0.75 | 0.28 | 0.88 | 0.88 | 1.11 | 0.96 | 1.07 | 1.56 | 1.84 | 1.07 | 1.07 | 0.75 | 0.76 | 0.82 | 0.82 | 1.09 | 0.96 | 1.07 | 1.56 |
suggested:VBD | 1.15 | 0.54 | 0.86 | 2.02 | 1.15 | 1.16 | 1.16 | 1.16 | 0.83 | 0.74 | 0.82 | 1.05 | 0.54 | 0.86 | 2.02 | 1.15 | 1.16 | 1.16 | 1.16 | 0.84 | 0.74 | 0.82 | 1.05 |
Frank:NNP | 0.75 | 1.84 | 1.07 | 1.07 | 0.73 | 0.88 | 0.28 | 0.28 | 1.04 | 0.96 | 1.07 | 1.56 | 1.84 | 1.07 | 1.07 | 0.73 | 0.79 | 0.74 | 0.74 | 1.03 | 0.96 | 1.07 | 1.56 |
boss:NN | 0.50 | 1.58 | 0.78 | 0.75 | 0.50 | 1.11 | 1.04 | 1.04 | 0.28 | 0.71 | 0.76 | 1.31 | 1.58 | 0.78 | 0.75 | 0.50 | 1.07 | 1.07 | 1.07 | 0.53 | 0.71 | 0.76 | 1.31 |
that:IN | 1.05 | 1.57 | 1.21 | 1.24 | 1.59 | 1.57 | 1.54 | 1.54 | 1.31 | 1.05 | 1.24 | 1.23 | 1.57 | 1.21 | 1.24 | 1.59 | 1.57 | 1.51 | 1.51 | 1.31 | 1.05 | 1.24 | 1.23 |
they:PRP | 1.05 | 0.85 | 1.21 | 0.97 | 1.18 | 1.55 | 2.76 | 2.76 | 1.30 | 1.11 | 0.97 | 1.01 | 0.85 | 1.21 | 0.97 | 1.18 | 1.79 | 2.04 | 2.04 | 1.30 | 1.11 | 0.97 | 1.01 |
should:MD | 1.05 | 1.16 | 1.25 | 1.24 | 1.05 | 1.25 | 1.25 | 1.25 | 1.00 | 2.42 | 1.25 | 1.45 | 1.16 | 1.25 | 1.24 | 1.05 | 1.25 | 1.25 | 1.25 | 1.00 | 2.42 | 1.25 | 1.45 |
go:VB | 1.15 | 0.54 | 0.92 | 0.82 | 1.15 | 1.16 | 1.16 | 1.16 | 0.85 | 0.76 | 1.32 | 1.11 | 0.54 | 0.92 | 0.82 | 1.15 | 1.16 | 1.16 | 1.16 | 0.78 | 0.76 | 1.32 | 1.11 |
the:DT | 1.05 | 1.16 | 1.25 | 1.25 | 0.95 | 1.25 | 1.25 | 1.25 | 1.00 | 1.31 | 1.25 | 1.43 | 1.16 | 1.25 | 1.25 | 0.95 | 1.25 | 1.25 | 1.25 | 0.96 | 1.31 | 1.25 | 1.43 |
meeting:NN | 0.50 | 1.58 | 0.82 | 0.41 | 0.50 | 1.17 | 1.14 | 1.14 | 0.91 | 0.71 | 0.75 | 1.26 | 1.58 | 0.82 | 0.41 | 0.50 | 1.07 | 1.07 | 1.07 | 0.89 | 0.71 | 0.75 | 1.26 |
together:RB | 1.05 | 1.05 | 0.91 | 0.85 | 1.05 | 1.47 | 1.47 | 1.47 | 1.21 | 1.09 | 0.91 | 1.23 | 1.05 | 0.91 | 0.85 | 1.05 | 1.47 | 1.47 | 1.47 | 1.21 | 1.09 | 0.91 | 1.23 |
Carl:NNP | 0.75 | 1.84 | 1.04 | 1.07 | 0.75 | 0.76 | 0.79 | 0.79 | 1.07 | 0.96 | 1.07 | 1.56 | 1.84 | 1.04 | 1.07 | 0.75 | 0.28 | 0.82 | 0.82 | 1.07 | 0.96 | 1.07 | 1.56 |
Alan:NNP | 0.75 | 1.84 | 1.04 | 1.07 | 0.70 | 0.82 | 0.74 | 0.74 | 1.07 | 0.96 | 1.07 | 1.56 | 1.84 | 1.04 | 1.07 | 0.70 | 0.82 | 0.28 | 0.28 | 1.07 | 0.96 | 1.07 | 1.56 |
wife:NN | 0.42 | 1.58 | 0.78 | 0.75 | 0.50 | 1.09 | 1.03 | 1.03 | 0.53 | 0.71 | 0.69 | 1.31 | 1.58 | 0.78 | 0.75 | 0.50 | 1.07 | 1.07 | 1.07 | 0.28 | 0.71 | 0.69 | 1.31 |
NO_WORD | 0.92 | 0.67 | 1.57 | 0.42 | 0.60 | 0.05 | 0.28 | 0.12 | 0.01 | 1.55 | 0.36 | 0.04 | 0.67 | 1.57 | 0.17 | 0.60 | 0.05 | 0.28 | 0.12 | 0.01 | 1.55 | 0.36 | 0.04 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.55 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.71 Alignment.isBad
-0.10 22.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.04 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "together" modifying "go"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "boss" modifying "suggested" is dropped on aligned hypothesis word "suggested"
-3.00 1.00 NullPunisher.entity : Bill
-3.00 1.00 NullPunisher.entity : Alan
-1.00 1.00 NullPunisher.other : together
-1.00 1.00 NullPunisher.other : together
-1.00 1.00 NullPunisher.other : it
-0.10 1.00 NullPunisher.functionWord : that
-0.05 1.00 NullPunisher.aux : was
-0.10 1.00 NullPunisher.functionWord : that
-1.00 1.00 NullPunisher.other : boss
-3.00 1.00 NullPunisher.entity : Alan
-1.00 1.00 NullPunisher.other : wife
-0.05 1.00 NullPunisher.aux : should
-0.05 1.00 NullPunisher.aux : should
-3.00 1.00 NullPunisher.entity : Frank
-1.00 1.00 NullPunisher.other : it
-1.00 1.00 NullPunisher.other : suggested
-3.00 1.00 NullPunisher.entity : Carl
-0.05 1.00 NullPunisher.aux : was
-3.00 1.00 NullPunisher.entity : Frank
-1.00 1.00 NullPunisher.other : go
-1.00 1.00 NullPunisher.other : go
-1.00 1.00 NullPunisher.other : If
-3.00 1.00 Structure.argsMismatch : go (verbal compl of the root) not aligned
Hand-tuned score (dot product of above): -33.9034
Threshold: -9.4738
Txt: A lawyer signed every report, and so did an auditor. That is, there was one lawyer who signed all the reports.
Hyp: There was one auditor who signed all the reports. (yes)
There EX |
was VBD |
one CD |
auditor NN |
who WP |
signed VBD |
all PDT |
the DT |
reports NNS |
|
A:DT | 1.31 | 1.25 | 1.18 | 1.00 | 1.16 | 1.25 | 1.31 | 1.31 | 1.00 |
lawyer:NN | 0.69 | 0.82 | 1.29 | 0.65 | 1.58 | 0.60 | 0.71 | 0.71 | 0.92 |
signed:VBD | 0.74 | 0.78 | 1.37 | 0.91 | 0.54 | 1.32 | 0.76 | 0.76 | 0.91 |
every:DT | 1.22 | 1.25 | 1.18 | 1.00 | 1.16 | 1.25 | 1.31 | 1.31 | 0.96 |
report:NN | 0.69 | 0.82 | 1.29 | 0.57 | 1.58 | 0.82 | 0.71 | 0.71 | 1.47 |
so:RB | 1.09 | 0.91 | 1.36 | 1.21 | 1.00 | 0.91 | 1.09 | 1.09 | 1.21 |
did:VBD | 0.76 | 0.87 | 1.39 | 0.70 | 0.54 | 0.63 | 0.76 | 0.76 | 0.85 |
an:DT | 1.31 | 1.21 | 1.14 | 1.00 | 1.16 | 1.25 | 1.26 | 1.31 | 1.00 |
auditor:NN | 0.71 | 0.82 | 1.29 | 0.28 | 1.58 | 0.82 | 0.71 | 0.71 | 0.67 |
That:DT | 1.23 | 1.22 | 1.18 | 1.00 | 1.13 | 1.25 | 1.31 | 1.21 | 1.00 |
is:VBZ | 0.76 | 0.05 | 1.39 | 0.91 | 0.54 | 0.78 | 0.76 | 0.76 | 0.91 |
there:EX | 0.37 | 1.25 | 1.18 | 1.00 | 1.16 | 1.23 | 1.31 | 1.20 | 1.00 |
was:VBD | 0.76 | 1.32 | 1.39 | 0.91 | 0.47 | 0.78 | 0.76 | 0.76 | 0.91 |
one:CD | 0.89 | 1.30 | 0.85 | 1.33 | 1.32 | 1.28 | 0.89 | 0.82 | 1.33 |
lawyer:NN | 0.69 | 0.82 | 1.29 | 0.65 | 1.58 | 0.60 | 0.71 | 0.71 | 0.92 |
who:WP | 1.11 | 0.90 | 1.46 | 1.30 | 1.05 | 0.97 | 1.11 | 1.04 | 1.30 |
signed:VBD | 0.74 | 0.78 | 1.37 | 0.91 | 0.54 | 1.32 | 0.76 | 0.76 | 0.91 |
all:PDT | 1.31 | 1.25 | 1.18 | 1.00 | 1.16 | 1.25 | 2.42 | 1.31 | 1.00 |
the:DT | 1.20 | 1.25 | 1.11 | 1.00 | 1.09 | 1.25 | 1.31 | 2.42 | 1.00 |
reports:NNS | 0.71 | 0.82 | 1.29 | 0.67 | 1.58 | 0.82 | 0.71 | 0.71 | 0.28 |
NO_WORD | 0.29 | 0.17 | 0.52 | 0.28 | 0.92 | 0.25 | 0.41 | 0.82 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.40 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.22 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "signed" modifying "auditor"
-0.05 1.00 NullPunisher.aux : was
-0.10 1.00 NullPunisher.functionWord : who
-3.00 1.00 NullPunisher.entity : one
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : all
-1.00 1.00 NullPunisher.other : signed
-1.00 1.00 NullPunisher.other : auditor
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '1.0' vs ''
1.00 1.00 Quant.equivalent : Replacing the quantifier "every" by an equivalent quantifier "all" preserves truth.
-2.00 1.00 RootEntailment.unalignedRoot : "was" not aligned to anything
Hand-tuned score (dot product of above): -14.9360
Threshold: -9.4738
Txt: John has a genuine diamond.
Hyp: John has a diamond. (yes)
John NNP |
has VBZ |
a DT |
diamond NN |
|
John:NNP | 0.28 | 1.07 | 0.96 | 1.05 |
has:VBZ | 1.16 | 1.32 | 0.76 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 |
genuine:JJ | 1.13 | 0.96 | 0.76 | 0.84 |
diamond:NN | 1.05 | 0.82 | 0.71 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.25 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.24 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : John
-1.00 1.00 NullPunisher.other : diamond
-0.05 1.00 NullPunisher.aux : has
-0.10 1.00 NullPunisher.article : a
-2.00 1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -6.8749
Threshold: -9.4738
Txt: John is a former university student.
Hyp: John is a university student. (don't know)
John NNP |
is VBZ |
a DT |
university_student NN |
|
John:NNP | 0.28 | 1.07 | 0.96 | 1.07 |
is:VBZ | 1.16 | 1.32 | 0.76 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 |
former:JJ | 1.13 | 0.96 | 0.76 | 0.88 |
university_student:NN | 1.07 | 0.82 | 0.71 | 0.28 |
NO_WORD | 0.28 | 1.43 | 0.82 | 0.12 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.52 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : university_student
-3.00 1.00 NullPunisher.entity : John
-0.05 1.00 NullPunisher.aux : is
-0.10 1.00 NullPunisher.article : a
-2.00 1.00 RootEntailment.unalignedRoot : "university_student" not aligned to anything
Hand-tuned score (dot product of above): -6.4901
Threshold: -9.4738
Txt: John is a successful former university student.
Hyp: John is successful. (yes)
John NNP |
is VBZ |
successful JJ |
|
John:NNP | 0.28 | 1.07 | 1.24 |
is:VBZ | 1.16 | 1.32 | 1.02 |
a:DT | 1.25 | 1.25 | 1.18 |
successful:JJ | 1.13 | 0.96 | 0.74 |
former:JJ | 1.13 | 0.96 | 0.93 |
university_student:NN | 1.07 | 0.82 | 0.89 |
NO_WORD | 0.28 | 1.43 | 0.16 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.44 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-0.05 1.00 NullPunisher.aux : is
-1.00 1.00 NullPunisher.other : successful
-3.00 1.00 NullPunisher.entity : John
-2.00 1.00 RootEntailment.unalignedRoot : "successful" not aligned to anything
Hand-tuned score (dot product of above): -6.4057
Threshold: -9.4738
Txt: John is a former successful university student.
Hyp: John is successful. (don't know)
John NNP |
is VBZ |
successful JJ |
|
John:NNP | 0.28 | 1.07 | 1.24 |
is:VBZ | 1.16 | 1.32 | 1.02 |
a:DT | 1.25 | 1.25 | 1.18 |
former:JJ | 1.13 | 0.96 | 0.93 |
successful:JJ | 1.13 | 0.96 | 0.74 |
university_student:NN | 1.07 | 0.82 | 0.89 |
NO_WORD | 0.28 | 1.43 | 0.16 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.44 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : John
-1.00 1.00 NullPunisher.other : successful
-0.05 1.00 NullPunisher.aux : is
-2.00 1.00 RootEntailment.unalignedRoot : "successful" not aligned to anything
Hand-tuned score (dot product of above): -6.4057
Threshold: -9.4738
Txt: John is a former successful university student.
Hyp: John is a university student. (don't know)
John NNP |
is VBZ |
a DT |
university_student NN |
|
John:NNP | 0.28 | 1.07 | 0.96 | 1.07 |
is:VBZ | 1.16 | 1.32 | 0.76 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 |
former:JJ | 1.13 | 0.96 | 0.76 | 0.88 |
successful:JJ | 1.13 | 0.96 | 0.76 | 0.78 |
university_student:NN | 1.07 | 0.82 | 0.71 | 0.28 |
NO_WORD | 0.28 | 1.43 | 0.82 | 0.12 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.52 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : university_student
-0.05 1.00 NullPunisher.aux : is
-3.00 1.00 NullPunisher.entity : John
-0.10 1.00 NullPunisher.article : a
-2.00 1.00 RootEntailment.unalignedRoot : "university_student" not aligned to anything
Hand-tuned score (dot product of above): -6.4901
Threshold: -9.4738
Txt: Every mammal is an animal.
Hyp: Every four-legged mammal is a four-legged animal. (yes)
Every DT |
four-legged JJ |
mammal NN |
is VBZ |
a DT |
four-legged JJ |
animal NN |
|
Every:DT | 2.42 | 1.18 | 1.00 | 1.25 | 1.31 | 1.18 | 1.00 |
mammal:NN | 0.71 | 0.99 | 0.28 | 0.82 | 0.71 | 0.99 | 0.13 |
is:VBZ | 0.76 | 1.02 | 0.91 | 1.32 | 0.76 | 1.02 | 0.91 |
an:DT | 1.31 | 1.18 | 1.00 | 1.25 | 1.21 | 1.18 | 1.00 |
animal:NN | 0.71 | 0.99 | 0.19 | 0.82 | 0.71 | 0.99 | 0.28 |
NO_WORD | 0.82 | 0.11 | 0.28 | 1.43 | 0.82 | 0.11 | 0.12 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.46 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.14 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "four-legged" modifying "animal"
-1.00 1.00 Hypernym.posNarrow : Narrowing a term (from "animal" to "mammal") does NOT preserve truth in a positive context
-1.00 1.00 NullPunisher.other : four-legged
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : Every
-1.00 1.00 NullPunisher.other : four-legged
-0.05 1.00 NullPunisher.aux : is
-1.00 1.00 NullPunisher.other : animal
-7.00 1.00 Quant.expand : Invalid quantifier strengthening: replacing the quantifier "a" by a stronger quantifier "every" does NOT preserve truth.
-2.00 1.00 RootEntailment.unalignedRoot : "animal" not aligned to anything
Hand-tuned score (dot product of above): -15.6622
Threshold: -9.4738
Txt: Dumbo is a four-legged animal.
Hyp: Dumbo is four-legged. (yes)
Dumbo NNP |
is VBZ |
four-legged JJ |
|
Dumbo:NNP | 0.28 | 1.07 | 1.24 |
is:VBZ | 1.16 | 1.32 | 1.02 |
a:DT | 1.25 | 1.25 | 1.18 |
four-legged:JJ | 1.13 | 0.96 | 0.74 |
animal:NN | 1.05 | 0.82 | 0.99 |
NO_WORD | 0.28 | 1.43 | 0.16 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.44 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : four-legged
-0.05 1.00 NullPunisher.aux : is
-3.00 1.00 NullPunisher.entity : Dumbo
-2.00 1.00 RootEntailment.unalignedRoot : "four-legged" not aligned to anything
Hand-tuned score (dot product of above): -6.4057
Threshold: -9.4738
Txt: Mickey is a small animal.
Hyp: Mickey is a large animal. (don't know)
Mickey NNP |
is VBZ |
a DT |
large JJ |
animal NN |
|
Mickey:NNP | 0.28 | 1.07 | 0.96 | 1.22 | 1.05 |
is:VBZ | 1.16 | 1.32 | 0.76 | 1.02 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.18 | 1.00 |
small:JJ | 1.13 | 0.96 | 0.76 | 1.28 | 0.72 |
animal:NN | 1.05 | 0.82 | 0.71 | 0.94 | 0.28 |
NO_WORD | 0.28 | 1.43 | 0.82 | 0.11 | 0.12 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.44 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "large" modifying "animal"
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : animal
-0.05 1.00 NullPunisher.aux : is
-1.00 1.00 NullPunisher.other : large
-3.00 1.00 NullPunisher.entity : Mickey
-2.00 1.00 RootEntailment.unalignedRoot : "animal" not aligned to anything
Hand-tuned score (dot product of above): -8.7032
Threshold: -9.4738
Txt: Dumbo is a large animal.
Hyp: Dumbo is a small animal. (don't know)
Dumbo NNP |
is VBZ |
a DT |
small JJ |
animal NN |
|
Dumbo:NNP | 0.28 | 1.07 | 0.96 | 1.24 | 1.05 |
is:VBZ | 1.16 | 1.32 | 0.76 | 1.02 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.18 | 1.00 |
large:JJ | 1.13 | 0.96 | 0.76 | 1.28 | 0.83 |
animal:NN | 1.05 | 0.82 | 0.71 | 0.83 | 0.28 |
NO_WORD | 0.28 | 1.43 | 0.82 | 0.11 | 0.12 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.44 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "small" modifying "animal"
-1.00 1.00 NullPunisher.other : small
-3.00 1.00 NullPunisher.entity : Dumbo
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : animal
-0.05 1.00 NullPunisher.aux : is
-2.00 1.00 RootEntailment.unalignedRoot : "animal" not aligned to anything
Hand-tuned score (dot product of above): -8.7032
Threshold: -9.4738
Txt: Fido is not a small animal.
Hyp: Fido is a large animal. (don't know)
Fido NNP |
is VBZ |
a DT |
large JJ |
animal NN |
|
Fido:NNP | 0.28 | 0.82 | 0.71 | 0.99 | 0.82 |
is:VBZ | 0.91 | 1.32 | 0.76 | 1.02 | 0.91 |
not:RB | 1.21 | 0.91 | 1.09 | 1.05 | 1.21 |
a:DT | 1.00 | 1.25 | 2.42 | 1.18 | 1.00 |
small:JJ | 0.88 | 0.96 | 0.76 | 1.28 | 0.72 |
animal:NN | 0.82 | 0.82 | 0.71 | 0.94 | 0.28 |
NO_WORD | 0.28 | 1.43 | 0.82 | 0.11 | 0.12 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.44 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "large" modifying "animal"
-1.00 1.00 NullPunisher.other : animal
-1.00 1.00 NullPunisher.other : large
-1.00 1.00 NullPunisher.other : Fido
-0.05 1.00 NullPunisher.aux : is
-0.10 1.00 NullPunisher.article : a
-2.00 1.00 RootEntailment.unalignedRoot : "animal" not aligned to anything
Hand-tuned score (dot product of above): -6.7032
Threshold: -9.4738
Txt: Fido is not a large animal.
Hyp: Fido is a small animal. (don't know)
Fido NNP |
is VBZ |
a DT |
small JJ |
animal NN |
|
Fido:NNP | 0.28 | 0.82 | 0.71 | 0.99 | 0.82 |
is:VBZ | 0.91 | 1.32 | 0.76 | 1.02 | 0.91 |
not:RB | 1.21 | 0.91 | 1.09 | 1.05 | 1.21 |
a:DT | 1.00 | 1.25 | 2.42 | 1.18 | 1.00 |
large:JJ | 0.88 | 0.96 | 0.76 | 1.28 | 0.83 |
animal:NN | 0.82 | 0.82 | 0.71 | 0.83 | 0.28 |
NO_WORD | 0.28 | 1.43 | 0.82 | 0.11 | 0.12 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.44 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "small" modifying "animal"
-0.05 1.00 NullPunisher.aux : is
-1.00 1.00 NullPunisher.other : small
-1.00 1.00 NullPunisher.other : animal
-1.00 1.00 NullPunisher.other : Fido
-0.10 1.00 NullPunisher.article : a
-2.00 1.00 RootEntailment.unalignedRoot : "animal" not aligned to anything
Hand-tuned score (dot product of above): -6.7032
Threshold: -9.4738
Txt: Mickey is a small animal. Dumbo is a large animal.
Hyp: Mickey is smaller than Dumbo. (yes)
Mickey NNP |
is VBZ |
smaller JJR |
Dumbo NNP |
|
Mickey:NNP | 0.28 | 1.07 | 1.19 | 0.95 |
is:VBZ | 1.16 | 1.32 | 1.02 | 1.16 |
a:DT | 1.25 | 1.25 | 1.18 | 1.25 |
small:JJ | 1.13 | 0.96 | 1.81 | 1.13 |
animal:NN | 1.05 | 0.82 | 0.96 | 1.05 |
Dumbo:NNP | 0.95 | 1.07 | 1.24 | 0.28 |
is:VBZ | 1.16 | 1.32 | 1.02 | 1.16 |
a:DT | 1.25 | 1.25 | 1.18 | 1.25 |
large:JJ | 1.11 | 0.96 | 0.29 | 1.13 |
animal:NN | 1.05 | 0.82 | 0.96 | 1.05 |
NO_WORD | 0.28 | 1.43 | 0.16 | 0.07 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.81 Alignment.score
1.00 0.32 Alignment.isGood
-1.00 0.66 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Dumbo" modifying "smaller"
-0.05 1.00 NullPunisher.aux : is
-3.00 1.00 NullPunisher.entity : Dumbo
Hand-tuned score (dot product of above): -3.6529
Threshold: -9.4738
Txt: Mickey is a small animal. Dumbo is a large animal.
Hyp: Mickey is larger than Dumbo. (don't know)
Mickey NNP |
is VBZ |
larger JJR |
Dumbo NNP |
|
Mickey:NNP | 0.28 | 1.07 | 1.21 | 0.95 |
is:VBZ | 1.16 | 1.32 | 1.02 | 1.16 |
a:DT | 1.25 | 1.25 | 1.18 | 1.25 |
small:JJ | 1.13 | 0.96 | 0.41 | 1.13 |
animal:NN | 1.05 | 0.82 | 0.84 | 1.05 |
Dumbo:NNP | 0.95 | 1.07 | 1.24 | 0.28 |
is:VBZ | 1.16 | 1.32 | 1.02 | 1.16 |
a:DT | 1.25 | 1.25 | 1.18 | 1.25 |
large:JJ | 1.11 | 0.96 | 2.04 | 1.13 |
animal:NN | 1.05 | 0.82 | 0.84 | 1.05 |
NO_WORD | 0.28 | 1.43 | 0.16 | 0.07 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.78 Alignment.score
1.00 0.31 Alignment.isGood
-1.00 0.66 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Dumbo" modifying "larger"
-3.00 1.00 NullPunisher.entity : Mickey
-3.00 1.00 NullPunisher.entity : Dumbo
-0.05 1.00 NullPunisher.aux : is
Hand-tuned score (dot product of above): -6.8002
Threshold: -9.4738
Txt: All mice are small animals. Mickey is a large mouse.
Hyp: Mickey is a large animal. (don't know)
Mickey NNP |
is VBZ |
a DT |
large JJ |
animal NN |
|
All:DT | 1.25 | 1.25 | 1.31 | 1.18 | 0.97 |
mice:NNS | 0.86 | 0.82 | 0.71 | 0.94 | 0.13 |
are:VBP | 1.16 | 0.02 | 0.76 | 0.91 | 0.91 |
small:JJ | 1.13 | 0.96 | 0.76 | 1.28 | 0.72 |
animals:NNS | 1.03 | 0.82 | 0.71 | 0.90 | 1.51 |
Mickey:NNP | 0.28 | 1.07 | 0.96 | 1.22 | 1.05 |
is:VBZ | 1.16 | 1.32 | 0.76 | 1.02 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.18 | 1.00 |
large:JJ | 1.11 | 0.96 | 0.76 | 0.74 | 0.83 |
mouse:NN | 0.94 | 0.82 | 0.71 | 0.94 | 0.46 |
NO_WORD | 0.28 | 1.43 | 0.82 | 0.11 | 0.12 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.74 Alignment.score
1.00 0.30 Alignment.isGood
-1.00 0.67 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.20 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "large" modifying "animal"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "small" modifying "animals" is dropped on aligned hypothesis word "animal"
-1.00 1.00 NullPunisher.other : large
-0.05 1.00 NullPunisher.aux : is
-0.10 1.00 NullPunisher.article : a
Hand-tuned score (dot product of above): -1.4690
Threshold: -9.4738
Txt: All elephants are large animals. Dumbo is a small elephant.
Hyp: Dumbo is a small animal. (don't know)
Dumbo NNP |
is VBZ |
a DT |
small JJ |
animal NN |
|
All:DT | 1.25 | 1.25 | 1.31 | 1.07 | 0.97 |
elephants:NNS | 1.07 | 0.82 | 0.71 | 0.70 | 0.32 |
are:VBP | 1.16 | 0.02 | 0.76 | 1.02 | 0.91 |
large:JJ | 1.13 | 0.96 | 0.76 | 1.28 | 0.83 |
animals:NNS | 1.07 | 0.82 | 0.71 | 0.86 | 1.51 |
Dumbo:NNP | 0.28 | 1.07 | 0.96 | 1.24 | 1.05 |
is:VBZ | 1.16 | 1.32 | 0.76 | 1.02 | 0.91 |
a:DT | 1.25 | 1.25 | 2.42 | 1.18 | 1.00 |
small:JJ | 1.13 | 0.96 | 0.76 | 0.74 | 0.72 |
elephant:NN | 1.07 | 0.82 | 0.71 | 0.86 | 0.39 |
NO_WORD | 0.28 | 1.43 | 0.82 | 0.11 | 0.12 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.72 Alignment.score
1.00 0.30 Alignment.isGood
-1.00 0.68 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.20 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "small" modifying "animal"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "large" modifying "animals" is dropped on aligned hypothesis word "animal"
-0.05 1.00 NullPunisher.aux : is
-1.00 1.00 NullPunisher.other : small
-0.10 1.00 NullPunisher.article : a
-3.00 1.00 NullPunisher.entity : Dumbo
Hand-tuned score (dot product of above): -4.5939
Threshold: -9.4738
Txt: All mice are small animals. All elephants are large animals. Mickey is a large mouse. Dumbo is a small elephant.
Hyp: Dumbo is larger than Mickey. (yes)
Dumbo NNP |
is VBZ |
larger JJR |
Mickey NNP |
|
All:DT | 1.25 | 1.25 | 1.18 | 1.25 |
mice:NNS | 1.07 | 0.82 | 0.95 | 0.86 |
are:VBP | 1.16 | 0.02 | 0.95 | 1.16 |
small:JJ | 1.13 | 0.96 | 0.41 | 1.13 |
animals:NNS | 1.07 | 0.82 | 0.78 | 1.03 |
All:DT | 1.25 | 1.25 | 1.18 | 1.25 |
elephants:NNS | 1.07 | 0.82 | 0.93 | 0.99 |
are:VBP | 1.16 | 0.02 | 0.95 | 1.16 |
large:JJ | 1.13 | 0.96 | 2.04 | 1.11 |
animals:NNS | 1.07 | 0.82 | 0.78 | 1.03 |
Mickey:NNP | 0.95 | 1.07 | 1.21 | 0.28 |
is:VBZ | 1.16 | 1.32 | 1.02 | 1.16 |
a:DT | 1.25 | 1.25 | 1.18 | 1.25 |
large:JJ | 1.13 | 0.96 | 2.04 | 1.11 |
mouse:NN | 1.07 | 0.82 | 0.97 | 0.94 |
Dumbo:NNP | 0.28 | 1.07 | 1.24 | 0.95 |
is:VBZ | 1.16 | 1.32 | 1.02 | 1.16 |
a:DT | 1.25 | 1.25 | 1.18 | 1.25 |
small:JJ | 1.13 | 0.96 | 0.41 | 1.13 |
elephant:NN | 1.07 | 0.82 | 0.93 | 0.99 |
NO_WORD | 0.28 | 1.43 | 0.16 | 0.07 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.78 Alignment.score
1.00 0.31 Alignment.isGood
-1.00 0.66 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Mickey" modifying "larger"
-0.05 1.00 NullPunisher.aux : is
-3.00 1.00 NullPunisher.entity : Mickey
-3.00 1.00 NullPunisher.entity : Dumbo
Hand-tuned score (dot product of above): -6.8002
Threshold: -9.4738
Txt: All legal authorities are law lecturers. All law lecturers are legal authorities.
Hyp: All fat legal authorities are fat law lecturers. (yes)
All DT |
fat JJ |
legal JJ |
authorities NNS |
are VBP |
fat JJ |
law NN |
lecturers NNS |
|
All:DT | 2.42 | 1.18 | 1.18 | 1.00 | 1.18 | 1.18 | 1.00 | 1.00 |
legal:JJ | 0.76 | 0.93 | 0.74 | 0.78 | 0.96 | 0.93 | 0.23 | 0.88 |
authorities:NNS | 0.71 | 0.99 | 0.89 | 0.28 | 0.82 | 0.99 | 0.69 | 0.89 |
are:VBP | 0.68 | 1.02 | 1.02 | 0.91 | 1.32 | 1.02 | 0.91 | 0.91 |
law:NN | 0.71 | 0.88 | 0.34 | 0.63 | 0.82 | 0.88 | 0.28 | 0.75 |
lecturers:NNS | 0.71 | 0.96 | 0.99 | 0.89 | 0.82 | 0.96 | 0.75 | 0.28 |
All:DT | 2.42 | 1.18 | 1.18 | 1.00 | 1.18 | 1.18 | 1.00 | 1.00 |
law:NN | 0.71 | 0.88 | 0.34 | 0.63 | 0.82 | 0.88 | 0.28 | 0.75 |
lecturers:NNS | 0.71 | 0.96 | 0.99 | 0.89 | 0.82 | 0.96 | 0.75 | 0.28 |
are:VBP | 0.68 | 1.02 | 1.02 | 0.91 | 1.32 | 1.02 | 0.91 | 0.91 |
legal:JJ | 0.76 | 0.93 | 0.74 | 0.78 | 0.96 | 0.93 | 0.23 | 0.88 |
authorities:NNS | 0.71 | 0.99 | 0.89 | 0.28 | 0.82 | 0.99 | 0.69 | 0.89 |
NO_WORD | 0.82 | 0.11 | 0.11 | 0.28 | 1.43 | 0.11 | 0.05 | 0.12 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.30 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 8.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "law" modifying "lecturers"
-0.05 1.00 NullPunisher.aux : are
-1.00 1.00 NullPunisher.other : fat
-1.00 1.00 NullPunisher.other : All
-1.00 1.00 NullPunisher.other : lecturers
-1.00 1.00 NullPunisher.other : authorities
-1.00 1.00 NullPunisher.other : fat
-1.00 1.00 NullPunisher.other : law
-1.00 1.00 NullPunisher.other : legal
-2.00 1.00 RootEntailment.unalignedRoot : "lecturers" not aligned to anything
Hand-tuned score (dot product of above): -11.0990
Threshold: -9.4738
Txt: All legal authorities are law lecturers. All law lecturers are legal authorities.
Hyp: All competent legal authorities are competent law lecturers. (don't know)
All DT |
competent JJ |
legal JJ |
authorities NNS |
are VBP |
competent JJ |
law NN |
lecturers NNS |
|
All:DT | 2.42 | 1.18 | 1.18 | 1.00 | 1.18 | 1.18 | 1.00 | 1.00 |
legal:JJ | 0.76 | 0.45 | 0.74 | 0.78 | 0.96 | 0.45 | 0.23 | 0.88 |
authorities:NNS | 0.71 | 0.98 | 0.89 | 0.28 | 0.82 | 0.98 | 0.69 | 0.89 |
are:VBP | 0.68 | 1.02 | 1.02 | 0.91 | 1.32 | 1.02 | 0.91 | 0.91 |
law:NN | 0.71 | 0.76 | 0.34 | 0.63 | 0.82 | 0.76 | 0.28 | 0.75 |
lecturers:NNS | 0.71 | 0.96 | 0.99 | 0.89 | 0.82 | 0.96 | 0.75 | 0.28 |
All:DT | 2.42 | 1.18 | 1.18 | 1.00 | 1.18 | 1.18 | 1.00 | 1.00 |
law:NN | 0.71 | 0.76 | 0.34 | 0.63 | 0.82 | 0.76 | 0.28 | 0.75 |
lecturers:NNS | 0.71 | 0.96 | 0.99 | 0.89 | 0.82 | 0.96 | 0.75 | 0.28 |
are:VBP | 0.68 | 1.02 | 1.02 | 0.91 | 1.32 | 1.02 | 0.91 | 0.91 |
legal:JJ | 0.76 | 0.45 | 0.74 | 0.78 | 0.96 | 0.45 | 0.23 | 0.88 |
authorities:NNS | 0.71 | 0.98 | 0.89 | 0.28 | 0.82 | 0.98 | 0.69 | 0.89 |
NO_WORD | 0.82 | 0.11 | 0.11 | 0.28 | 1.43 | 0.11 | 0.05 | 0.12 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.30 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 8.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "law" modifying "lecturers"
-1.00 1.00 NullPunisher.other : law
-1.00 1.00 NullPunisher.other : All
-1.00 1.00 NullPunisher.other : competent
-1.00 1.00 NullPunisher.other : competent
-1.00 1.00 NullPunisher.other : legal
-1.00 1.00 NullPunisher.other : authorities
-0.05 1.00 NullPunisher.aux : are
-1.00 1.00 NullPunisher.other : lecturers
-2.00 1.00 RootEntailment.unalignedRoot : "lecturers" not aligned to anything
Hand-tuned score (dot product of above): -11.0990
Threshold: -9.4738
Txt: John is a fatter politician than Bill.
Hyp: John is fatter than Bill. (yes)
John NNP |
is VBZ |
fatter VBN |
Bill NNP |
|
John:NNP | 0.28 | 1.07 | 1.07 | 1.18 |
is:VBZ | 1.16 | 1.32 | 0.56 | 0.91 |
a:DT | 1.25 | 1.25 | 1.25 | 1.00 |
fatter:JJR | 1.13 | 0.96 | 2.00 | 0.88 |
politician:NN | 1.12 | 0.82 | 0.82 | 0.93 |
Bill:NNP | 1.18 | 0.82 | 0.82 | 0.28 |
NO_WORD | 0.15 | 1.57 | 0.17 | 0.07 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.84 Alignment.score
1.00 0.32 Alignment.isGood
-1.00 0.65 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Bill" modifying "fatter"
-0.05 1.00 NullPunisher.aux : is
-3.00 1.00 NullPunisher.entity : John
-1.00 1.00 NullPunisher.other : Bill
Hand-tuned score (dot product of above): -4.7183
Threshold: -9.4738
Txt: John is a cleverer politician than Bill.
Hyp: John is cleverer than Bill. (don't know)
John NNP |
is VBZ |
cleverer VB |
Bill NNP |
|
John:NNP | 0.28 | 1.07 | 1.07 | 1.18 |
is:VBZ | 1.16 | 1.32 | 0.56 | 0.91 |
a:DT | 1.25 | 1.25 | 1.25 | 1.00 |
cleverer:NN | 1.07 | 0.82 | 2.71 | 0.82 |
politician:NN | 1.12 | 0.82 | 0.76 | 0.93 |
Bill:NNP | 1.18 | 0.82 | 0.82 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.36 | 0.07 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.63 Alignment.score
1.00 0.28 Alignment.isGood
-1.00 0.70 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Bill" modifying "cleverer"
-3.00 1.00 NullPunisher.entity : John
-1.00 1.00 NullPunisher.other : Bill
-0.05 1.00 NullPunisher.aux : is
-2.00 1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -7.0099
Threshold: -9.4738
Txt: Kim is a clever person.
Hyp: Kim is clever. (yes)
Kim NNP |
is VBZ |
clever JJ |
|
Kim:NNP | 0.28 | 1.03 | 1.24 |
is:VBZ | 1.12 | 1.32 | 1.02 |
a:DT | 1.25 | 1.25 | 1.18 |
clever:JJ | 1.13 | 0.96 | 0.74 |
person:NN | 1.07 | 0.82 | 0.87 |
NO_WORD | 0.28 | 1.43 | 0.16 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.44 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-0.05 1.00 NullPunisher.aux : is
-3.00 1.00 NullPunisher.entity : Kim
-1.00 1.00 NullPunisher.other : clever
-2.00 1.00 RootEntailment.unalignedRoot : "clever" not aligned to anything
Hand-tuned score (dot product of above): -6.4057
Threshold: -9.4738
Txt: Kim is a clever politician.
Hyp: Kim is clever. (don't know)
Kim NNP |
is VBZ |
clever JJ |
|
Kim:NNP | 0.28 | 1.03 | 1.24 |
is:VBZ | 1.12 | 1.32 | 1.02 |
a:DT | 1.25 | 1.25 | 1.18 |
clever:JJ | 1.13 | 0.96 | 0.74 |
politician:NN | 1.07 | 0.82 | 0.73 |
NO_WORD | 0.28 | 1.43 | 0.16 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.44 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : Kim
-1.00 1.00 NullPunisher.other : clever
-0.05 1.00 NullPunisher.aux : is
-2.00 1.00 RootEntailment.unalignedRoot : "clever" not aligned to anything
Hand-tuned score (dot product of above): -6.4057
Threshold: -9.4738
Txt: The PC-6082 is faster than the ITEL-XZ. The ITEL-XZ is fast.
Hyp: The PC-6082 is fast. (yes)
The DT |
PC-6082 NN |
is VBZ |
fast JJ |
|
The:DT | 2.42 | 1.00 | 1.25 | 1.18 |
PC-6082:NN | 0.71 | 0.28 | 0.82 | 0.99 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.02 |
faster:RBR | 1.09 | 1.21 | 0.91 | 1.14 |
the:DT | 0.01 | 1.00 | 1.25 | 1.18 |
ITEL-XZ:NNP | 0.71 | 0.82 | 0.82 | 0.99 |
The:DT | 2.42 | 1.00 | 1.25 | 1.18 |
ITEL-XZ:NN | 0.71 | 0.82 | 0.82 | 0.99 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.02 |
fast:JJ | 0.76 | 0.88 | 0.96 | 0.74 |
NO_WORD | 0.82 | 0.28 | 1.43 | 0.16 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.91 Alignment.score
1.00 0.34 Alignment.isGood
-1.00 0.64 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.50 Alignment.txtSpan
-0.05 1.00 NullPunisher.aux : is
-0.10 1.00 NullPunisher.article : The
Hand-tuned score (dot product of above): 0.4092
Threshold: -9.4738
Txt: The PC-6082 is faster than the ITEL-XZ.
Hyp: The PC-6082 is fast. (don't know)
The DT |
PC-6082 NN |
is VBZ |
fast JJ |
|
The:DT | 2.42 | 1.00 | 1.25 | 1.18 |
PC-6082:NN | 0.71 | 0.28 | 0.82 | 0.99 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.02 |
faster:RBR | 1.09 | 1.21 | 0.91 | 1.14 |
the:DT | 0.01 | 1.00 | 1.25 | 1.18 |
ITEL-XZ:NNP | 0.71 | 0.82 | 0.82 | 0.99 |
NO_WORD | 0.82 | 0.28 | 1.43 | 0.16 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.91 Alignment.score
1.00 0.34 Alignment.isGood
-1.00 0.64 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.50 Alignment.txtSpan
-0.10 1.00 NullPunisher.article : The
-0.05 1.00 NullPunisher.aux : is
Hand-tuned score (dot product of above): 0.4092
Threshold: -9.4738
Txt: The PC-6082 is faster than the ITEL-XZ. The PC-6082 is fast.
Hyp: The ITEL-XZ is fast. (don't know)
The DT |
ITEL-XZ NN |
is VBZ |
fast JJ |
|
The:DT | 2.42 | 1.00 | 1.25 | 1.18 |
PC-6082:NN | 0.71 | 0.82 | 0.82 | 0.99 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.02 |
faster:RBR | 1.09 | 1.21 | 0.91 | 1.14 |
the:DT | 0.01 | 1.00 | 1.25 | 1.18 |
ITEL-XZ:NNP | 0.71 | 1.82 | 0.82 | 0.99 |
The:DT | 2.42 | 1.00 | 1.25 | 1.18 |
PC-6082:NN | 0.71 | 0.82 | 0.82 | 0.99 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.02 |
fast:JJ | 0.76 | 0.88 | 0.96 | 0.74 |
NO_WORD | 0.82 | 0.28 | 1.43 | 0.16 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 1.45 Alignment.score
1.00 0.47 Alignment.isGood
-1.00 0.50 Alignment.isBad
-0.10 1.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.75 Alignment.txtSpan
-0.05 1.00 NullPunisher.aux : is
-2.00 1.00 Structure.parentsMismatch : args have different parents, different relations: text "ITEL-XZ" <-prep_than-- "is" vs. hyp "ITEL-XZ" <-nsubj-- "fast", which aligned to text "faster"
Hand-tuned score (dot product of above): -0.4689
Threshold: -9.4738
Txt: The PC-6082 is faster than the ITEL-XZ. The PC-6082 is slow.
Hyp: The ITEL-XZ is fast. (don't know)
The DT |
ITEL-XZ NN |
is VBZ |
fast JJ |
|
The:DT | 2.42 | 1.00 | 1.25 | 1.18 |
PC-6082:NN | 0.71 | 0.82 | 0.82 | 0.99 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.02 |
faster:RBR | 1.09 | 1.21 | 0.91 | 1.14 |
the:DT | 0.01 | 1.00 | 1.25 | 1.18 |
ITEL-XZ:NNP | 0.71 | 1.82 | 0.82 | 0.99 |
The:DT | 2.42 | 1.00 | 1.25 | 1.18 |
PC-6082:NN | 0.71 | 0.82 | 0.82 | 0.99 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.02 |
slow:JJ | 0.76 | 0.88 | 0.96 | 1.36 |
NO_WORD | 0.82 | 0.28 | 1.43 | 0.16 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 1.45 Alignment.score
1.00 0.47 Alignment.isGood
-1.00 0.50 Alignment.isBad
-0.10 1.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.75 Alignment.txtSpan
-0.05 1.00 NullPunisher.aux : is
-2.00 1.00 Structure.parentsMismatch : args have different parents, different relations: text "ITEL-XZ" <-prep_than-- "is" vs. hyp "ITEL-XZ" <-nsubj-- "fast", which aligned to text "faster"
Hand-tuned score (dot product of above): -0.4689
Threshold: -9.4738
Txt: The PC-6082 is as fast as the ITEL-XZ. The ITEL-XZ is fast.
Hyp: The PC-6082 is fast. (yes)
The DT |
PC-6082 NN |
is VBZ |
fast JJ |
|
The:DT | 2.42 | 1.00 | 1.25 | 1.18 |
PC-6082:NN | 0.71 | 0.28 | 0.82 | 0.99 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.02 |
as:RB | 1.09 | 1.21 | 0.80 | 0.97 |
fast:RB | 1.09 | 1.21 | 0.91 | 2.87 |
the:DT | 0.01 | 1.00 | 1.25 | 1.18 |
ITEL-XZ:NNP | 0.71 | 0.82 | 0.82 | 0.99 |
The:DT | 2.42 | 1.00 | 1.25 | 1.18 |
ITEL-XZ:NN | 0.71 | 0.82 | 0.82 | 0.99 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.02 |
fast:JJ | 0.76 | 0.88 | 0.96 | 0.74 |
NO_WORD | 0.82 | 0.28 | 1.43 | 0.16 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 1.34 Alignment.score
1.00 0.44 Alignment.isGood
-1.00 0.53 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "as" modifying "fast" is dropped on aligned hypothesis word "fast"
-0.10 1.00 NullPunisher.article : The
-0.05 1.00 NullPunisher.aux : is
Hand-tuned score (dot product of above): 1.5225
Threshold: -9.4738
Txt: The PC-6082 is as fast as the ITEL-XZ.
Hyp: The PC-6082 is fast. (don't know)
The DT |
PC-6082 NN |
is VBZ |
fast JJ |
|
The:DT | 2.42 | 1.00 | 1.25 | 1.18 |
PC-6082:NN | 0.71 | 0.28 | 0.82 | 0.99 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.02 |
as:RB | 1.09 | 1.21 | 0.80 | 0.97 |
fast:RB | 1.09 | 1.21 | 0.91 | 2.87 |
the:DT | 0.01 | 1.00 | 1.25 | 1.18 |
ITEL-XZ:NNP | 0.71 | 0.82 | 0.82 | 0.99 |
NO_WORD | 0.82 | 0.28 | 1.43 | 0.16 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 1.34 Alignment.score
1.00 0.44 Alignment.isGood
-1.00 0.53 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "as" modifying "fast" is dropped on aligned hypothesis word "fast"
-0.10 1.00 NullPunisher.article : The
-0.05 1.00 NullPunisher.aux : is
Hand-tuned score (dot product of above): 1.5225
Threshold: -9.4738
Txt: The PC-6082 is as fast as the ITEL-XZ. The PC-6082 is fast.
Hyp: The ITEL-XZ is fast. (don't know)
The DT |
ITEL-XZ NN |
is VBZ |
fast JJ |
|
The:DT | 2.42 | 1.00 | 1.25 | 1.18 |
PC-6082:NN | 0.71 | 0.82 | 0.82 | 0.99 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.02 |
as:RB | 1.09 | 1.21 | 0.80 | 0.97 |
fast:RB | 1.09 | 1.21 | 0.91 | 2.87 |
the:DT | 0.01 | 1.00 | 1.25 | 1.18 |
ITEL-XZ:NNP | 0.71 | 1.82 | 0.82 | 0.99 |
The:DT | 2.42 | 1.00 | 1.25 | 1.18 |
PC-6082:NN | 0.71 | 0.82 | 0.82 | 0.99 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.02 |
fast:JJ | 0.76 | 0.88 | 0.96 | 0.74 |
NO_WORD | 0.82 | 0.28 | 1.43 | 0.16 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 1.88 Alignment.score
1.00 0.57 Alignment.isGood
-1.00 0.40 Alignment.isBad
-0.10 1.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.75 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "as" modifying "fast" is dropped on aligned hypothesis word "fast"
-0.05 1.00 NullPunisher.aux : is
-2.00 1.00 Structure.parentsMismatch : args have different parents, different relations: text "ITEL-XZ" <-prep_as-- "is" vs. hyp "ITEL-XZ" <-nsubj-- "fast", which aligned to text "fast"
-3.00 1.00 Structure.parentsMismatch&Align.veryGood :
0.50 1.00 Adjunct.dropPosCxt&Align.veryGood :
Hand-tuned score (dot product of above): -1.8234
Threshold: -9.4738
Txt: The PC-6082 is as fast as the ITEL-XZ. The PC-6082 is slow.
Hyp: The ITEL-XZ is fast. (don't know)
The DT |
ITEL-XZ NN |
is VBZ |
fast JJ |
|
The:DT | 2.42 | 1.00 | 1.25 | 1.18 |
PC-6082:NN | 0.71 | 0.82 | 0.82 | 0.99 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.02 |
as:RB | 1.09 | 1.21 | 0.80 | 0.97 |
fast:RB | 1.09 | 1.21 | 0.91 | 2.87 |
the:DT | 0.01 | 1.00 | 1.25 | 1.18 |
ITEL-XZ:NNP | 0.71 | 1.82 | 0.82 | 0.99 |
The:DT | 2.42 | 1.00 | 1.25 | 1.18 |
PC-6082:NN | 0.71 | 0.82 | 0.82 | 0.99 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.02 |
slow:JJ | 0.76 | 0.88 | 0.96 | 1.36 |
NO_WORD | 0.82 | 0.28 | 1.43 | 0.16 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 1.88 Alignment.score
1.00 0.57 Alignment.isGood
-1.00 0.40 Alignment.isBad
-0.10 1.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.75 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "as" modifying "fast" is dropped on aligned hypothesis word "fast"
-0.05 1.00 NullPunisher.aux : is
-2.00 1.00 Structure.parentsMismatch : args have different parents, different relations: text "ITEL-XZ" <-prep_as-- "is" vs. hyp "ITEL-XZ" <-nsubj-- "fast", which aligned to text "fast"
-3.00 1.00 Structure.parentsMismatch&Align.veryGood :
0.50 1.00 Adjunct.dropPosCxt&Align.veryGood :
Hand-tuned score (dot product of above): -1.8234
Threshold: -9.4738
Txt: The PC-6082 is as fast as the ITEL-XZ.
Hyp: The PC-6082 is faster than the ITEL-XZ. (don't know)
The DT |
PC-6082 NN |
is VBZ |
faster RBR |
the DT |
ITEL-XZ NNP |
|
The:DT | 2.42 | 1.00 | 1.25 | 1.45 | 0.01 | 1.00 |
PC-6082:NN | 0.71 | 0.28 | 0.82 | 1.31 | 0.71 | 0.82 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.11 | 0.76 | 0.91 |
as:RB | 1.09 | 1.21 | 0.80 | 0.86 | 1.09 | 1.21 |
fast:RB | 1.09 | 1.21 | 0.91 | 1.33 | 1.09 | 1.21 |
the:DT | 0.01 | 1.00 | 1.25 | 1.45 | 2.42 | 1.00 |
ITEL-XZ:NNP | 0.71 | 0.82 | 0.82 | 1.31 | 0.71 | 0.28 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.04 | 0.82 | 0.07 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.46 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "ITEL-XZ" modifying "is"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "as" modifying "fast" is dropped on aligned hypothesis word "faster"
-0.05 1.00 NullPunisher.aux : is
-0.10 1.00 NullPunisher.article : The
-1.00 1.00 NullPunisher.other : ITEL-XZ
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : PC-6082
-2.00 1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -5.1535
Threshold: -9.4738
Txt: The PC-6082 is as fast as the ITEL-XZ.
Hyp: The PC-6082 is slower than the ITEL-XZ. (don't know)
The DT |
PC-6082 NN |
is VBZ |
slower JJR |
the DT |
ITEL-XZ NNP |
|
The:DT | 2.42 | 1.00 | 1.25 | 1.18 | 0.01 | 1.00 |
PC-6082:NN | 0.71 | 0.28 | 0.82 | 0.99 | 0.71 | 0.82 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.02 | 0.76 | 0.91 |
as:RB | 1.09 | 1.21 | 0.80 | 1.05 | 1.09 | 1.21 |
fast:RB | 1.09 | 1.21 | 0.91 | 0.45 | 1.09 | 1.21 |
the:DT | 0.01 | 1.00 | 1.25 | 1.18 | 2.42 | 1.00 |
ITEL-XZ:NNP | 0.71 | 0.82 | 0.82 | 0.99 | 0.71 | 0.28 |
NO_WORD | 0.82 | 0.28 | 1.43 | 0.16 | 0.82 | 0.07 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.48 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "ITEL-XZ" modifying "slower"
-1.00 1.00 NullPunisher.other : slower
-1.00 1.00 NullPunisher.other : ITEL-XZ
-1.00 1.00 NullPunisher.other : PC-6082
-0.10 1.00 NullPunisher.article : The
-0.05 1.00 NullPunisher.aux : is
-0.10 1.00 NullPunisher.article : the
-2.00 1.00 RootEntailment.unalignedRoot : "slower" not aligned to anything
Hand-tuned score (dot product of above): -6.8496
Threshold: -9.4738
Txt: ITEL won more orders than APCOM did.
Hyp: ITEL won some orders. (yes)
ITEL NNP |
won VBD |
some DT |
orders NNS |
|
ITEL:NNP | 0.28 | 1.07 | 0.96 | 1.07 |
won:VBD | 1.16 | 1.32 | 0.73 | 0.91 |
more:JJR | 1.13 | 0.93 | 0.65 | 0.83 |
orders:NNS | 1.07 | 0.82 | 0.71 | 0.28 |
APCOM:NNP | 1.07 | 0.82 | 0.68 | 0.82 |
did:VBD | 1.16 | 0.71 | 0.76 | 0.85 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.20 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : won
-1.00 1.00 NullPunisher.other : some
-3.00 1.00 NullPunisher.entity : ITEL
-1.00 1.00 NullPunisher.other : orders
-2.00 1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -8.7790
Threshold: -9.4738
Txt: ITEL won more orders than APCOM did.
Hyp: APCOM won some orders. (don't know)
APCOM NNP |
won VBD |
some DT |
orders NNS |
|
ITEL:NNP | 1.07 | 1.07 | 0.96 | 1.07 |
won:VBD | 0.91 | 1.32 | 0.73 | 0.91 |
more:JJR | 0.88 | 0.93 | 0.65 | 0.83 |
orders:NNS | 0.82 | 0.82 | 0.71 | 0.28 |
APCOM:NNP | 0.28 | 0.82 | 0.68 | 0.82 |
did:VBD | 0.91 | 0.71 | 0.76 | 0.85 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.20 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : some
-1.00 1.00 NullPunisher.other : APCOM
-1.00 1.00 NullPunisher.other : won
-1.00 1.00 NullPunisher.other : orders
-2.00 1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -6.7790
Threshold: -9.4738
Txt: ITEL won more orders than APCOM did. APCOM won ten orders.
Hyp: ITEL won at least eleven orders. (yes)
ITEL NNP |
won VBD |
at IN |
least JJS |
eleven NN |
orders NNS |
|
ITEL:NNP | 0.28 | 1.07 | 0.75 | 1.24 | 1.07 | 1.07 |
won:VBD | 1.16 | 1.32 | 1.15 | 1.02 | 0.68 | 0.91 |
more:JJR | 1.13 | 0.93 | 1.05 | 0.93 | 0.88 | 0.83 |
orders:NNS | 1.07 | 0.82 | 0.50 | 0.97 | 0.71 | 0.28 |
APCOM:NNP | 1.07 | 0.82 | 0.50 | 0.99 | 0.82 | 0.82 |
did:VBD | 1.16 | 0.71 | 1.15 | 1.02 | 0.84 | 0.85 |
APCOM:NNP | 1.07 | 0.82 | 0.50 | 0.99 | 0.82 | 0.82 |
won:VBD | 1.16 | 1.32 | 1.15 | 1.02 | 0.68 | 0.91 |
ten:NN | 0.98 | 0.57 | 0.50 | 0.99 | 0.00 | 0.87 |
orders:NNS | 1.07 | 0.82 | 0.50 | 0.97 | 0.71 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.99 | 0.29 | 0.05 | 0.25 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.24 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
0.00 1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : won
-1.00 1.00 NullPunisher.other : orders
-1.00 1.00 NullPunisher.other : at
-1.00 1.00 NullPunisher.other : least
-3.00 1.00 NullPunisher.entity : ITEL
-2.00 1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -9.7116
Threshold: -9.4738
Txt: ITEL won more orders than APCOM.
Hyp: ITEL won some orders. (yes)
ITEL NNP |
won VBD |
some DT |
orders NNS |
|
ITEL:NNP | 0.28 | 1.07 | 0.96 | 1.07 |
won:VBD | 1.16 | 1.32 | 0.73 | 0.91 |
more:JJR | 1.13 | 0.93 | 0.65 | 0.83 |
orders:NNS | 1.07 | 0.82 | 0.71 | 0.28 |
APCOM:NNP | 1.07 | 0.82 | 0.68 | 0.82 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.20 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : won
-1.00 1.00 NullPunisher.other : orders
-1.00 1.00 NullPunisher.other : some
-3.00 1.00 NullPunisher.entity : ITEL
-2.00 1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -8.7790
Threshold: -9.4738
Txt: ITEL won more orders than APCOM.
Hyp: APCOM won some orders. (don't know)
APCOM NNP |
won VBD |
some DT |
orders NNS |
|
ITEL:NNP | 1.07 | 1.07 | 0.96 | 1.07 |
won:VBD | 0.91 | 1.32 | 0.73 | 0.91 |
more:JJR | 0.88 | 0.93 | 0.65 | 0.83 |
orders:NNS | 0.82 | 0.82 | 0.71 | 0.28 |
APCOM:NNP | 0.28 | 0.82 | 0.68 | 0.82 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.20 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : orders
-1.00 1.00 NullPunisher.other : APCOM
-1.00 1.00 NullPunisher.other : won
-1.00 1.00 NullPunisher.other : some
-2.00 1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -6.7790
Threshold: -9.4738
Txt: ITEL won more orders than APCOM. APCOM won ten orders.
Hyp: ITEL won at least eleven orders. (yes)
ITEL NNP |
won VBD |
at IN |
least JJS |
eleven NN |
orders NNS |
|
ITEL:NNP | 0.28 | 1.07 | 0.75 | 1.24 | 1.07 | 1.07 |
won:VBD | 1.16 | 1.32 | 1.15 | 1.02 | 0.68 | 0.91 |
more:JJR | 1.13 | 0.93 | 1.05 | 0.93 | 0.88 | 0.83 |
orders:NNS | 1.07 | 0.82 | 0.50 | 0.97 | 0.71 | 0.28 |
APCOM:NNP | 1.07 | 0.82 | 0.50 | 0.99 | 0.82 | 0.82 |
APCOM:NNP | 1.07 | 0.82 | 0.50 | 0.99 | 0.82 | 0.82 |
won:VBD | 1.16 | 1.32 | 1.15 | 1.02 | 0.68 | 0.91 |
ten:NN | 0.98 | 0.57 | 0.50 | 0.99 | 0.00 | 0.87 |
orders:NNS | 1.07 | 0.82 | 0.50 | 0.97 | 0.71 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.99 | 0.29 | 0.05 | 0.25 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.24 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
0.00 1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : orders
-1.00 1.00 NullPunisher.other : won
-3.00 1.00 NullPunisher.entity : ITEL
-1.00 1.00 NullPunisher.other : at
-1.00 1.00 NullPunisher.other : least
-2.00 1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -9.7116
Threshold: -9.4738
Txt: ITEL won more orders than the APCOM contract.
Hyp: ITEL won the APCOM contract. (yes)
ITEL NNP |
won VBD |
the DT |
APCOM NNP |
contract NN |
|
ITEL:NNP | 0.28 | 1.07 | 0.93 | 1.07 | 1.07 |
won:VBD | 1.16 | 1.32 | 0.76 | 0.91 | 0.65 |
more:JJR | 1.13 | 0.93 | 0.73 | 0.88 | 0.88 |
orders:NNS | 1.07 | 0.82 | 0.71 | 0.82 | 0.92 |
the:DT | 1.22 | 1.25 | 2.42 | 1.00 | 1.00 |
APCOM:NNP | 1.07 | 0.82 | 0.71 | 0.28 | 0.82 |
contract:NN | 1.07 | 0.56 | 0.71 | 0.82 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.05 | 0.25 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.18 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "APCOM" modifying "contract"
-1.00 1.00 NullPunisher.other : won
-1.00 1.00 NullPunisher.other : contract
-3.00 1.00 NullPunisher.entity : ITEL
-1.00 1.00 NullPunisher.other : APCOM
-0.10 1.00 NullPunisher.article : the
-2.00 1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -10.0034
Threshold: -9.4738
Txt: ITEL won more orders than the APCOM contract.
Hyp: ITEL won more than one order. (yes)
ITEL NNP |
won VBD |
more_than IN |
one CD |
order NN |
|
ITEL:NNP | 0.28 | 1.07 | 0.75 | 1.17 | 1.05 |
won:VBD | 1.16 | 1.32 | 1.15 | 1.32 | 0.91 |
more:JJR | 1.13 | 0.93 | 0.12 | 1.03 | 0.80 |
orders:NNS | 1.07 | 0.82 | 0.48 | 1.27 | 0.95 |
the:DT | 1.22 | 1.25 | 1.05 | 1.11 | 1.00 |
APCOM:NNP | 1.07 | 0.82 | 0.50 | 1.29 | 0.82 |
contract:NN | 1.07 | 0.56 | 0.46 | 1.29 | 0.90 |
NO_WORD | 0.28 | 0.17 | 0.84 | 0.52 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.23 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.20 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "one" modifying "order"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "more" modifying "orders" is dropped on aligned hypothesis word "order"
-3.00 1.00 NullPunisher.entity : one
-1.00 1.00 NullPunisher.other : more_than
-3.00 1.00 NullPunisher.entity : ITEL
-1.00 1.00 NullPunisher.other : won
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '>1.0' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -17.1146
Threshold: -9.4738
Txt: ITEL won twice as many orders than APCOM. APCOM won ten orders.
Hyp: ITEL won twenty orders. (yes)
ITEL NNP |
won VBD |
twenty CD |
orders NNS |
|
ITEL:NNP | 0.28 | 1.07 | 1.20 | 1.07 |
won:VBD | 1.16 | 1.32 | 1.36 | 0.91 |
twice:RB | 1.45 | 0.54 | 1.14 | 1.20 |
as:RB | 1.47 | 0.91 | 1.36 | 1.21 |
many:JJ | 1.13 | 0.93 | 1.08 | 0.88 |
orders:NNS | 1.07 | 0.82 | 1.29 | 0.28 |
APCOM:NNP | 1.07 | 0.82 | 1.29 | 0.82 |
APCOM:NNP | 1.07 | 0.82 | 1.29 | 0.82 |
won:VBD | 1.16 | 1.32 | 1.36 | 0.91 |
ten:NN | 0.98 | 0.57 | 0.50 | 0.87 |
orders:NNS | 1.07 | 0.82 | 1.29 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.52 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 -0.13 Alignment.score
1.00 0.15 Alignment.isGood
-1.00 0.83 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 NullPunisher.other : won
-3.00 1.00 NullPunisher.entity : ITEL
-1.00 1.00 NullPunisher.other : orders
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '20.0' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -13.9813
Threshold: -9.4738
Txt: ITEL won more orders than APCOM lost.
Hyp: ITEL won some orders. (yes)
ITEL NNP |
won VBD |
some DT |
orders NNS |
|
ITEL:NNP | 0.28 | 1.07 | 0.96 | 1.07 |
won:VBD | 1.16 | 1.32 | 0.73 | 0.91 |
more:JJR | 1.13 | 0.93 | 0.65 | 0.83 |
orders:NNS | 1.07 | 0.82 | 0.71 | 0.28 |
APCOM:NNP | 1.07 | 0.82 | 0.68 | 0.82 |
lost:VBD | 1.16 | 1.33 | 0.70 | 0.91 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.20 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : some
-3.00 1.00 NullPunisher.entity : ITEL
-1.00 1.00 NullPunisher.other : orders
-1.00 1.00 NullPunisher.other : won
-2.00 1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -8.7790
Threshold: -9.4738
Txt: ITEL won more orders than APCOM lost.
Hyp: APCOM lost some orders. (don't know)
APCOM NNP |
lost VBD |
some DT |
orders NNS |
|
ITEL:NNP | 1.07 | 1.07 | 0.96 | 1.07 |
won:VBD | 0.91 | 1.33 | 0.73 | 0.91 |
more:JJR | 0.88 | 0.90 | 0.65 | 0.83 |
orders:NNS | 0.82 | 0.82 | 0.71 | 0.28 |
APCOM:NNP | 0.28 | 0.82 | 0.68 | 0.82 |
lost:VBD | 0.91 | 1.32 | 0.70 | 0.91 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.20 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : APCOM
-1.00 1.00 NullPunisher.other : orders
-1.00 1.00 NullPunisher.other : lost
-1.00 1.00 NullPunisher.other : some
-2.00 1.00 RootEntailment.unalignedRoot : "lost" not aligned to anything
Hand-tuned score (dot product of above): -6.7790
Threshold: -9.4738
Txt: ITEL won more orders than APCOM lost. APCOM lost ten orders.
Hyp: ITEL won at least eleven orders. (yes)
ITEL NNP |
won VBD |
at IN |
least JJS |
eleven NN |
orders NNS |
|
ITEL:NNP | 0.28 | 1.07 | 0.75 | 1.24 | 1.07 | 1.07 |
won:VBD | 1.16 | 1.32 | 1.15 | 1.02 | 0.68 | 0.91 |
more:JJR | 1.13 | 0.93 | 1.05 | 0.93 | 0.88 | 0.83 |
orders:NNS | 1.07 | 0.82 | 0.50 | 0.97 | 0.71 | 0.28 |
APCOM:NNP | 1.07 | 0.82 | 0.50 | 0.99 | 0.82 | 0.82 |
lost:VBD | 1.16 | 1.33 | 1.15 | 0.90 | 0.83 | 0.91 |
APCOM:NNP | 1.07 | 0.82 | 0.50 | 0.99 | 0.82 | 0.82 |
lost:VBD | 1.16 | 1.33 | 1.15 | 0.90 | 0.83 | 0.91 |
ten:NN | 0.98 | 0.57 | 0.50 | 0.99 | 0.00 | 0.87 |
orders:NNS | 1.07 | 0.82 | 0.50 | 0.97 | 0.71 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.99 | 0.29 | 0.05 | 0.25 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.24 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
0.00 1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : orders
-1.00 1.00 NullPunisher.other : won
-1.00 1.00 NullPunisher.other : least
-1.00 1.00 NullPunisher.other : at
-3.00 1.00 NullPunisher.entity : ITEL
-2.00 1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -9.7116
Threshold: -9.4738
Txt: The PC-6082 is faster than 500 MIPS. The ITEL-ZX is slower than 500 MIPS.
Hyp: The PC-6082 is faster than the ITEL-ZX. (yes)
The DT |
PC-6082 NN |
is VBZ |
faster RBR |
the DT |
ITEL-ZX NNP |
|
The:DT | 2.42 | 1.00 | 1.25 | 1.45 | 0.01 | 1.00 |
PC-6082:NN | 0.71 | 0.28 | 0.82 | 1.31 | 0.71 | 0.82 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.11 | 0.76 | 0.91 |
faster:RBR | 1.09 | 1.21 | 0.91 | 1.23 | 1.09 | 1.21 |
500:CD | 0.89 | 1.33 | 1.30 | 1.30 | 0.89 | 1.33 |
MIPS:NNS | 0.71 | 0.85 | 0.74 | 1.31 | 0.71 | 0.82 |
The:DT | 2.42 | 1.00 | 1.25 | 1.45 | 0.01 | 1.00 |
ITEL-ZX:NN | 0.71 | 0.82 | 0.82 | 1.31 | 0.71 | 1.82 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.11 | 0.76 | 0.91 |
slower:JJR | 0.76 | 0.88 | 0.96 | 0.23 | 0.76 | 0.88 |
500:CD | 0.89 | 1.33 | 1.30 | 1.30 | 0.89 | 1.33 |
MIPS:NNS | 0.71 | 0.85 | 0.74 | 1.31 | 0.71 | 0.82 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.04 | 0.82 | 0.07 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.66 Alignment.score
1.00 0.28 Alignment.isGood
-1.00 0.69 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.33 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "faster" modifying "is"
-0.10 1.00 NullPunisher.article : The
-1.00 1.00 NullPunisher.other : PC-6082
-1.00 1.00 NullPunisher.other : faster
-0.05 1.00 NullPunisher.aux : is
-2.00 1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -5.0640
Threshold: -9.4738
Txt: ITEL sold 3000 more computers than APCOM. APCOM sold exactly 2500 computers.
Hyp: ITEL sold 5500 computers. (yes)
ITEL NNP |
sold VBD |
5500 CD |
computers NNS |
|
ITEL:NNP | 0.28 | 1.07 | 1.20 | 1.07 |
sold:VBD | 1.16 | 1.32 | 1.39 | 0.80 |
3000:CD | 1.24 | 1.30 | 0.02 | 1.24 |
more:JJR | 1.13 | 0.90 | 1.13 | 0.88 |
computers:NNS | 1.07 | 0.71 | 1.19 | 0.28 |
APCOM:NNP | 1.07 | 0.82 | 1.29 | 0.82 |
APCOM:NNP | 1.07 | 0.82 | 1.29 | 0.82 |
sold:VBD | 1.16 | 1.32 | 1.39 | 0.80 |
exactly:RB | 1.47 | 0.91 | 1.32 | 1.21 |
2500:CD | 1.24 | 1.30 | 1.22 | 1.18 |
computers:NNS | 1.07 | 0.71 | 1.19 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.52 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.30 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 NullPunisher.other : computers
-1.00 1.00 NullPunisher.other : sold
-3.00 1.00 NullPunisher.entity : ITEL
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '5500.0' vs '2500.0'
-2.00 1.00 RootEntailment.unalignedRoot : "sold" not aligned to anything
Hand-tuned score (dot product of above): -13.4173
Threshold: -9.4738
Txt: APCOM has a more important customer than ITEL.
Hyp: APCOM has a more important customer than ITEL is. (yes)
APCOM NNP |
has VBZ |
a DT |
more RBR |
important JJ |
customer NN |
than IN |
ITEL NNP |
is VBZ |
|
APCOM:NNP | 0.28 | 0.82 | 0.71 | 1.31 | 0.99 | 0.80 | 0.50 | 1.07 | 0.82 |
has:VBZ | 0.91 | 1.32 | 0.76 | 1.11 | 1.02 | 0.91 | 1.06 | 1.16 | 1.01 |
a:DT | 1.00 | 1.25 | 2.42 | 1.45 | 1.18 | 1.00 | 1.05 | 1.25 | 1.25 |
more:RBR | 1.21 | 0.91 | 1.09 | 1.23 | 1.03 | 1.21 | 1.05 | 1.47 | 0.91 |
important:JJ | 0.88 | 0.96 | 0.76 | 1.00 | 0.74 | 0.85 | 1.05 | 1.13 | 0.96 |
customer:NN | 0.80 | 0.82 | 0.71 | 1.31 | 0.96 | 0.28 | 0.50 | 1.07 | 0.82 |
ITEL:NNP | 1.07 | 1.07 | 0.96 | 1.56 | 1.24 | 1.07 | 0.75 | 0.28 | 1.07 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.04 | 0.11 | 0.09 | 0.92 | 0.28 | 0.31 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.21 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 9.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "important" modifying "customer"
-1.00 1.00 NullPunisher.other : than
-1.00 1.00 NullPunisher.other : customer
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : APCOM
-3.00 1.00 NullPunisher.entity : ITEL
-0.05 1.00 NullPunisher.aux : is
-0.05 1.00 NullPunisher.aux : has
-1.00 1.00 NullPunisher.other : more
-1.00 1.00 NullPunisher.other : important
-2.00 1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -12.4637
Threshold: -9.4738
Txt: APCOM has a more important customer than ITEL.
Hyp: APCOM has a more important customer than ITEL has. (yes)
APCOM NNP |
has VBZ |
a DT |
more RBR |
important JJ |
customer NN |
than IN |
ITEL NNP |
has VBZ |
|
APCOM:NNP | 0.28 | 0.82 | 0.71 | 1.31 | 0.99 | 0.80 | 0.50 | 1.07 | 0.82 |
has:VBZ | 0.91 | 1.32 | 0.76 | 1.11 | 1.02 | 0.91 | 1.06 | 1.16 | 1.32 |
a:DT | 1.00 | 1.25 | 2.42 | 1.45 | 1.18 | 1.00 | 1.05 | 1.25 | 1.25 |
more:RBR | 1.21 | 0.91 | 1.09 | 1.23 | 1.03 | 1.21 | 1.05 | 1.47 | 0.91 |
important:JJ | 0.88 | 0.96 | 0.76 | 1.00 | 0.74 | 0.85 | 1.05 | 1.13 | 0.96 |
customer:NN | 0.80 | 0.82 | 0.71 | 1.31 | 0.96 | 0.28 | 0.50 | 1.07 | 0.82 |
ITEL:NNP | 1.07 | 1.07 | 0.96 | 1.56 | 1.24 | 1.07 | 0.75 | 0.28 | 1.07 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.04 | 0.11 | 0.09 | 0.92 | 0.28 | 0.31 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.21 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 9.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "important" modifying "customer"
-1.00 1.00 NullPunisher.other : than
-1.00 1.00 NullPunisher.other : important
-1.00 1.00 NullPunisher.other : APCOM
-0.05 1.00 NullPunisher.aux : has
-1.00 1.00 NullPunisher.other : customer
-0.05 1.00 NullPunisher.aux : has
-1.00 1.00 NullPunisher.other : more
-3.00 1.00 NullPunisher.entity : ITEL
-0.10 1.00 NullPunisher.article : a
-2.00 1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -12.4637
Threshold: -9.4738
Txt: The PC-6082 is faster than every ITEL computer. The ITEL-ZX is an ITEL computer.
Hyp: The PC-6082 is faster than the ITEL-ZX. (yes)
The DT |
PC-6082 NN |
is VBZ |
faster RBR |
the DT |
ITEL-ZX NNP |
|
The:DT | 2.42 | 1.00 | 1.25 | 1.45 | 0.01 | 1.00 |
PC-6082:NN | 0.71 | 0.28 | 0.82 | 1.31 | 0.71 | 0.82 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.11 | 0.76 | 0.91 |
faster:RBR | 1.09 | 1.21 | 0.91 | 1.23 | 1.09 | 1.21 |
every:DT | 1.31 | 1.00 | 1.25 | 1.43 | 1.31 | 1.00 |
ITEL:NNP | 0.93 | 1.07 | 1.07 | 1.52 | 0.93 | 0.14 |
computer:NN | 0.71 | 0.67 | 0.82 | 0.94 | 0.71 | 0.82 |
The:DT | 2.42 | 1.00 | 1.25 | 1.45 | 0.01 | 1.00 |
ITEL-ZX:NN | 0.71 | 0.82 | 0.82 | 1.31 | 0.71 | 1.82 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.11 | 0.76 | 0.91 |
an:DT | 1.31 | 1.00 | 1.25 | 1.45 | 1.31 | 1.00 |
ITEL:NNP | 0.93 | 1.07 | 1.07 | 1.52 | 0.93 | 0.14 |
computer:NN | 0.71 | 0.67 | 0.82 | 0.94 | 0.71 | 0.82 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.04 | 0.82 | 0.07 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.66 Alignment.score
1.00 0.28 Alignment.isGood
-1.00 0.69 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.33 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "faster" modifying "is"
-1.00 1.00 NullPunisher.other : faster
-0.10 1.00 NullPunisher.article : The
-0.05 1.00 NullPunisher.aux : is
-1.00 1.00 NullPunisher.other : PC-6082
-2.00 1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -5.0640
Threshold: -9.4738
Txt: The PC-6082 is faster than some ITEL computer. The ITEL-ZX is an ITEL computer.
Hyp: The PC-6082 is faster than the ITEL-ZX. (don't know)
The DT |
PC-6082 NN |
is VBZ |
faster RBR |
the DT |
ITEL-ZX NNP |
|
The:DT | 2.42 | 1.00 | 1.25 | 1.45 | 0.01 | 1.00 |
PC-6082:NN | 0.71 | 0.28 | 0.82 | 1.31 | 0.71 | 0.82 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.11 | 0.76 | 0.91 |
faster:RBR | 1.09 | 1.21 | 0.91 | 1.23 | 1.09 | 1.21 |
some:DT | 1.28 | 1.00 | 1.25 | 1.45 | 1.28 | 1.00 |
ITEL:NNP | 0.93 | 1.07 | 1.07 | 1.52 | 0.93 | 0.14 |
computer:NN | 0.71 | 0.67 | 0.82 | 0.94 | 0.71 | 0.82 |
The:DT | 2.42 | 1.00 | 1.25 | 1.45 | 0.01 | 1.00 |
ITEL-ZX:NN | 0.71 | 0.82 | 0.82 | 1.31 | 0.71 | 1.82 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.11 | 0.76 | 0.91 |
an:DT | 1.31 | 1.00 | 1.25 | 1.45 | 1.31 | 1.00 |
ITEL:NNP | 0.93 | 1.07 | 1.07 | 1.52 | 0.93 | 0.14 |
computer:NN | 0.71 | 0.67 | 0.82 | 0.94 | 0.71 | 0.82 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.04 | 0.82 | 0.07 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.66 Alignment.score
1.00 0.28 Alignment.isGood
-1.00 0.69 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.33 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "faster" modifying "is"
-1.00 1.00 NullPunisher.other : faster
-1.00 1.00 NullPunisher.other : PC-6082
-0.05 1.00 NullPunisher.aux : is
-0.10 1.00 NullPunisher.article : The
-2.00 1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -5.0640
Threshold: -9.4738
Txt: The PC-6082 is faster than any ITEL computer. The ITEL-ZX is an ITEL computer.
Hyp: The PC-6082 is faster than the ITEL-ZX. (yes)
The DT |
PC-6082 NN |
is VBZ |
faster RBR |
the DT |
ITEL-ZX NNP |
|
The:DT | 2.42 | 1.00 | 1.25 | 1.45 | 0.01 | 1.00 |
PC-6082:NN | 0.71 | 0.28 | 0.82 | 1.31 | 0.71 | 0.82 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.11 | 0.76 | 0.91 |
faster:RBR | 1.09 | 1.21 | 0.91 | 1.23 | 1.09 | 1.21 |
any:DT | 1.31 | 1.00 | 1.25 | 1.45 | 1.31 | 1.00 |
ITEL:NNP | 0.93 | 1.07 | 1.07 | 1.52 | 0.93 | 0.14 |
computer:NN | 0.71 | 0.67 | 0.82 | 0.94 | 0.71 | 0.82 |
The:DT | 2.42 | 1.00 | 1.25 | 1.45 | 0.01 | 1.00 |
ITEL-ZX:NN | 0.71 | 0.82 | 0.82 | 1.31 | 0.71 | 1.82 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.11 | 0.76 | 0.91 |
an:DT | 1.31 | 1.00 | 1.25 | 1.45 | 1.31 | 1.00 |
ITEL:NNP | 0.93 | 1.07 | 1.07 | 1.52 | 0.93 | 0.14 |
computer:NN | 0.71 | 0.67 | 0.82 | 0.94 | 0.71 | 0.82 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.04 | 0.82 | 0.07 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.66 Alignment.score
1.00 0.28 Alignment.isGood
-1.00 0.69 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.33 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "faster" modifying "is"
-0.05 1.00 NullPunisher.aux : is
-1.00 1.00 NullPunisher.other : PC-6082
-1.00 1.00 NullPunisher.other : faster
-0.10 1.00 NullPunisher.article : The
-2.00 1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -5.0640
Threshold: -9.4738
Txt: The PC-6082 is faster than the ITEL-ZX and the ITEL-ZY.
Hyp: The PC-6082 is faster than the ITEL-ZX. (yes)
The DT |
PC-6082 NN |
is VBZ |
faster RBR |
the DT |
ITEL-ZX NNP |
|
The:DT | 2.42 | 1.00 | 1.25 | 1.45 | 0.01 | 1.00 |
PC-6082:NN | 0.71 | 0.28 | 0.82 | 1.31 | 0.71 | 0.82 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.11 | 0.76 | 0.91 |
faster:RBR | 1.09 | 1.21 | 0.91 | 1.23 | 1.09 | 1.21 |
the:DT | 0.01 | 1.00 | 1.25 | 1.45 | 2.42 | 1.00 |
ITEL-ZX:NN | 0.71 | 0.82 | 0.82 | 1.31 | 0.71 | 1.82 |
the:DT | 0.01 | 1.00 | 1.25 | 1.45 | 2.42 | 1.00 |
ITEL-ZY:NN | 0.71 | 0.82 | 0.82 | 1.31 | 0.71 | 0.98 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.04 | 0.82 | 0.07 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.56 Alignment.score
1.00 0.27 Alignment.isGood
-1.00 0.71 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "faster" modifying "is"
-0.10 1.00 NullPunisher.article : The
-0.05 1.00 NullPunisher.aux : is
-1.00 1.00 NullPunisher.other : PC-6082
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : faster
-2.00 1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -5.5142
Threshold: -9.4738
Txt: The PC-6082 is faster than the ITEL-ZX or the ITEL-ZY.
Hyp: The PC-6082 is faster than the ITEL-ZX. (yes)
The DT |
PC-6082 NN |
is VBZ |
faster RBR |
the DT |
ITEL-ZX NNP |
|
The:DT | 2.42 | 1.00 | 1.25 | 1.45 | 0.01 | 1.00 |
PC-6082:NN | 0.71 | 0.28 | 0.82 | 1.31 | 0.71 | 0.82 |
is:VBZ | 0.76 | 0.91 | 1.32 | 1.11 | 0.76 | 0.91 |
faster:RBR | 1.09 | 1.21 | 0.91 | 1.23 | 1.09 | 1.21 |
the:DT | 0.01 | 1.00 | 1.25 | 1.45 | 2.42 | 1.00 |
ITEL-ZX:NN | 0.71 | 0.82 | 0.82 | 1.31 | 0.71 | 1.82 |
the:DT | 0.01 | 1.00 | 1.25 | 1.45 | 2.42 | 1.00 |
ITEL-ZY:NN | 0.71 | 0.82 | 0.82 | 1.31 | 0.71 | 0.98 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.04 | 0.82 | 0.07 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.56 Alignment.score
1.00 0.27 Alignment.isGood
-1.00 0.71 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "faster" modifying "is"
-1.00 1.00 NullPunisher.other : PC-6082
-1.00 1.00 NullPunisher.other : faster
-0.10 1.00 NullPunisher.article : The
-0.10 1.00 NullPunisher.article : the
-0.05 1.00 NullPunisher.aux : is
-2.00 1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -5.5142
Threshold: -9.4738
Txt: ITEL has a factory in Birmingham.
Hyp: ITEL currently has a factory in Birmingham. (yes)
ITEL NNP |
currently RB |
has VBZ |
a DT |
factory NN |
Birmingham NNP |
|
ITEL:NNP | 0.28 | 1.56 | 1.07 | 0.96 | 1.07 | 0.99 |
has:VBZ | 1.16 | 1.11 | 1.32 | 0.76 | 0.91 | 1.16 |
a:DT | 1.25 | 1.45 | 1.25 | 2.42 | 1.00 | 1.25 |
factory:NN | 1.07 | 1.31 | 0.82 | 0.71 | 0.28 | 1.06 |
Birmingham:NNP | 0.99 | 1.53 | 1.07 | 0.96 | 1.06 | 0.28 |
NO_WORD | 0.28 | 0.04 | 0.17 | 0.82 | 0.09 | 0.07 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.13 Alignment.score
1.00 0.19 Alignment.isGood
-1.00 0.79 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Birmingham" modifying "factory"
-1.00 1.00 NullPunisher.other : currently
-0.10 1.00 NullPunisher.article : a
-0.05 1.00 NullPunisher.aux : has
-3.00 1.00 NullPunisher.entity : ITEL
-3.00 1.00 NullPunisher.entity : Birmingham
-1.00 1.00 NullPunisher.other : factory
-2.00 1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -12.2229
Threshold: -9.4738
Txt: Since 1992 ITEL has been in Birmingham. It is now 1996.
Hyp: Itel was in Birmingham in 1993. (yes)
Itel NNP |
was VBD |
Birmingham NNP |
1993 CD |
|
1992:CD | 1.24 | 1.30 | 1.24 | 1.60 |
ITEL:NNP | 0.54 | 1.07 | 0.99 | 1.20 |
has:VBZ | 1.16 | 0.84 | 1.16 | 1.39 |
been:VBN | 1.11 | 0.10 | 1.16 | 1.39 |
Birmingham:NNP | 0.99 | 1.07 | 0.28 | 1.20 |
It:PRP | 1.48 | 0.97 | 1.55 | 1.46 |
is:VBZ | 1.16 | 0.05 | 1.16 | 1.39 |
now:RB | 1.47 | 0.91 | 1.47 | 1.36 |
1996:CD | 1.24 | 1.30 | 1.24 | 0.95 |
NO_WORD | 0.28 | 0.17 | 0.07 | 0.57 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.56 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.71 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.50 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Birmingham" modifying "was"
-3.00 1.00 Date.dateHeadMismatch : 1993 vs. 1992
-3.00 1.00 NullPunisher.entity : Birmingham
-0.05 1.00 NullPunisher.aux : was
-2.00 1.00 RootEntailment.unalignedRoot : "was" not aligned to anything
Hand-tuned score (dot product of above): -8.9872
Threshold: -9.4738
Txt: ITEL has developed a new editor since 1992. It is now 1996.
Hyp: ITEL developed a new editor in 1993. (don't know)
ITEL NNP |
developed VBD |
a DT |
new JJ |
editor NN |
1993 CD |
|
ITEL:NNP | 0.28 | 1.07 | 0.96 | 1.21 | 1.03 | 1.20 |
has:VBZ | 1.16 | 0.79 | 0.76 | 1.02 | 0.91 | 1.39 |
developed:VBN | 1.16 | 1.50 | 0.76 | 0.95 | 0.88 | 1.27 |
a:DT | 1.25 | 1.25 | 2.42 | 1.18 | 1.00 | 1.18 |
new:JJ | 1.10 | 0.89 | 0.76 | 0.74 | 0.84 | 1.13 |
editor:NN | 1.03 | 0.79 | 0.71 | 0.94 | 0.28 | 1.19 |
1992:CD | 1.24 | 1.13 | 0.89 | 1.00 | 1.32 | 1.60 |
It:PRP | 2.69 | 0.97 | 1.10 | 1.39 | 1.30 | 1.46 |
is:VBZ | 1.16 | 0.84 | 0.76 | 1.02 | 0.91 | 1.39 |
now:RB | 1.47 | 0.91 | 1.09 | 0.90 | 1.21 | 1.36 |
1996:CD | 1.24 | 1.30 | 0.89 | 0.79 | 1.26 | 0.95 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.11 | 0.09 | 0.57 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.84 Alignment.score
1.00 0.32 Alignment.isGood
-1.00 0.65 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.33 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "new" modifying "editor"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "new" modifying "editor" is dropped on aligned hypothesis word "editor"
-3.00 1.00 Date.dateHeadMismatch : 1993 vs. 1992
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : new
-1.00 1.00 Structure.relMismatch : text "1992" is prep_since of "developed" while hyp "1993" is prep_in of "developed" which aligned to text "developed"
Hand-tuned score (dot product of above): -5.0548
Threshold: -9.4738
Txt: ITEL has expanded since 1992. It is now 1996.
Hyp: ITEL expanded in 1993. (don't know)
ITEL NNP |
expanded VBD |
1993 CD |
|
ITEL:NNP | 0.28 | 1.07 | 1.20 |
has:VBZ | 1.16 | 0.74 | 1.39 |
expanded:VBN | 1.16 | 2.02 | 1.11 |
1992:CD | 1.24 | 1.06 | 1.60 |
It:PRP | 2.69 | 0.97 | 1.46 |
is:VBZ | 1.16 | 0.79 | 1.39 |
now:RB | 1.47 | 0.91 | 1.36 |
1996:CD | 1.24 | 1.22 | 0.95 |
NO_WORD | 0.28 | 0.17 | 0.57 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 1.37 Alignment.score
1.00 0.45 Alignment.isGood
-1.00 0.52 Alignment.isBad
-0.10 0.00 Alignment.nbWordNotAligned
0.10 3.00 Alignment.hypSpan
0.10 1.00 Alignment.txtSpan
-3.00 1.00 Date.dateHeadMismatch : 1993 vs. 1992
-1.00 1.00 Structure.relMismatch : text "1992" is prep_since of "expanded" while hyp "1993" is prep_in of "expanded" which aligned to text "expanded"
Hand-tuned score (dot product of above): -2.3128
Threshold: -9.4738
Txt: Since 1992 ITEL has made a loss. It is now 1996.
Hyp: ITEL made a loss in 1993. (yes)
ITEL NNP |
made VBD |
a DT |
loss NN |
1993 CD |
|
1992:CD | 1.24 | 0.91 | 0.89 | 1.33 | 1.60 |
ITEL:NNP | 0.28 | 1.07 | 0.96 | 1.07 | 1.20 |
has:VBZ | 1.16 | 0.35 | 0.76 | 0.87 | 1.39 |
made:VBN | 1.16 | 2.10 | 0.76 | 0.76 | 1.06 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.18 |
loss:NN | 1.07 | 0.67 | 0.71 | 0.28 | 1.22 |
It:PRP | 1.48 | 0.97 | 1.10 | 1.30 | 1.46 |
is:VBZ | 1.16 | 0.92 | 0.76 | 0.91 | 1.39 |
now:RB | 1.47 | 0.91 | 1.09 | 1.18 | 1.36 |
1996:CD | 1.24 | 1.30 | 0.89 | 1.25 | 0.95 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.57 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 1.11 Alignment.score
1.00 0.38 Alignment.isGood
-1.00 0.59 Alignment.isBad
-0.10 1.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.80 Alignment.txtSpan
-3.00 1.00 Date.dateHeadMismatch : 1993 vs. 1992
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 Structure.relMismatch : text "1992" is prep_since of "made" while hyp "1993" is prep_in of "made" which aligned to text "made"
Hand-tuned score (dot product of above): -3.0194
Threshold: -9.4738
Txt: ITEL has made a loss since 1992. It is now 1996.
Hyp: ITEL made a loss in 1993. (don't know)
ITEL NNP |
made VBD |
a DT |
loss NN |
1993 CD |
|
ITEL:NNP | 0.28 | 1.07 | 0.96 | 1.07 | 1.20 |
has:VBZ | 1.16 | 0.35 | 0.76 | 0.87 | 1.39 |
made:VBN | 1.16 | 2.10 | 0.76 | 0.76 | 1.06 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.18 |
loss:NN | 1.07 | 0.67 | 0.71 | 0.28 | 1.22 |
1992:CD | 1.24 | 0.91 | 0.89 | 1.33 | 1.60 |
It:PRP | 2.69 | 0.97 | 1.10 | 1.30 | 1.46 |
is:VBZ | 1.16 | 0.92 | 0.76 | 0.91 | 1.39 |
now:RB | 1.47 | 0.91 | 1.09 | 1.18 | 1.36 |
1996:CD | 1.24 | 1.30 | 0.89 | 1.25 | 0.95 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.57 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 1.11 Alignment.score
1.00 0.38 Alignment.isGood
-1.00 0.59 Alignment.isBad
-0.10 1.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.80 Alignment.txtSpan
-3.00 1.00 Date.dateHeadMismatch : 1993 vs. 1992
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 Structure.relMismatch : text "1992" is prep_since of "made" while hyp "1993" is prep_in of "made" which aligned to text "made"
Hand-tuned score (dot product of above): -3.0194
Threshold: -9.4738
Txt: ITEL has made a loss since 1992. It is now 1996.
Hyp: ITEL made a loss in 1993. (yes)
ITEL NNP |
made VBD |
a DT |
loss NN |
1993 CD |
|
ITEL:NNP | 0.28 | 1.07 | 0.96 | 1.07 | 1.20 |
has:VBZ | 1.16 | 0.35 | 0.76 | 0.87 | 1.39 |
made:VBN | 1.16 | 2.10 | 0.76 | 0.76 | 1.06 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.18 |
loss:NN | 1.07 | 0.67 | 0.71 | 0.28 | 1.22 |
1992:CD | 1.24 | 0.91 | 0.89 | 1.33 | 1.60 |
It:PRP | 2.69 | 0.97 | 1.10 | 1.30 | 1.46 |
is:VBZ | 1.16 | 0.92 | 0.76 | 0.91 | 1.39 |
now:RB | 1.47 | 0.91 | 1.09 | 1.18 | 1.36 |
1996:CD | 1.24 | 1.30 | 0.89 | 1.25 | 0.95 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.57 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 1.11 Alignment.score
1.00 0.38 Alignment.isGood
-1.00 0.59 Alignment.isBad
-0.10 1.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.80 Alignment.txtSpan
-3.00 1.00 Date.dateHeadMismatch : 1993 vs. 1992
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 Structure.relMismatch : text "1992" is prep_since of "made" while hyp "1993" is prep_in of "made" which aligned to text "made"
Hand-tuned score (dot product of above): -3.0194
Threshold: -9.4738
Txt: In March 1993 APCOM founded ITEL.
Hyp: ITEL existed in 1992. (don't know)
ITEL NNP |
existed VBD |
1992 CD |
|
March:NNP | 0.99 | 1.07 | 1.03 |
1993:CD | 1.24 | 1.10 | 1.60 |
APCOM:NNP | 1.07 | 0.82 | 1.29 |
founded:VBD | 1.16 | 0.47 | 1.10 |
ITEL:NNP | 0.28 | 1.01 | 1.20 |
NO_WORD | 0.28 | 0.17 | 0.57 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.50 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.33 Alignment.txtSpan
-3.00 1.00 Date.dateHeadMismatch : 1992 vs. 1993
-3.00 1.00 NullPunisher.entity : ITEL
-1.00 1.00 NullPunisher.other : existed
-2.00 1.00 RootEntailment.unalignedRoot : "existed" not aligned to anything
Hand-tuned score (dot product of above): -9.0419
Threshold: -9.4738
Txt: The conference started on July 4th, 1994. It lasted 2 days.
Hyp: The conference was over on July 8th, 1994. (yes)
The DT |
conference NN |
was VBD |
over IN |
on IN |
July NNP |
8th CD |
1994 CD |
|
The:DT | 2.42 | 1.00 | 1.25 | 1.01 | 1.05 | 1.25 | 1.11 | 1.18 |
conference:NN | 0.71 | 0.28 | 0.82 | 0.50 | 0.50 | 1.12 | 1.29 | 1.29 |
started:VBD | 0.76 | 0.91 | 0.90 | 1.15 | 1.15 | 1.16 | 1.39 | 1.24 |
July:NNP | 0.96 | 1.12 | 1.07 | 0.75 | 0.75 | 0.28 | 1.03 | 1.03 |
4th:CD | 0.82 | 1.33 | 1.30 | 1.30 | 1.30 | 1.07 | 0.49 | 0.34 |
1994:CD | 0.89 | 1.33 | 1.30 | 1.30 | 1.30 | 1.07 | 0.35 | 0.85 |
It:PRP | 1.23 | 2.50 | 0.97 | 1.28 | 1.29 | 1.55 | 1.42 | 1.46 |
lasted:VBD | 0.76 | 0.91 | 0.60 | 1.15 | 1.15 | 1.16 | 1.39 | 1.39 |
2:CD | 0.89 | 1.33 | 1.30 | 1.30 | 1.30 | 1.24 | 0.40 | 0.38 |
days:NNS | 0.71 | 0.82 | 0.72 | 0.50 | 0.50 | 1.01 | 1.29 | 1.29 |
NO_WORD | 0.82 | 0.28 | 0.17 | 1.16 | 0.99 | 0.25 | 0.52 | 0.26 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.42 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.12 Alignment.txtSpan
-2.00 1.00 Date.hypDateIns : hypothesis date insertion: July
-1.00 1.00 NullPunisher.other : on
-1.00 1.00 NullPunisher.other : over
-3.00 1.00 NullPunisher.entity : 1994
-3.00 1.00 NullPunisher.entity : July
-0.05 1.00 NullPunisher.aux : was
-1.00 1.00 NullPunisher.other : conference
-0.10 1.00 NullPunisher.article : The
-6.00 1.00 Numeric.mismatch : DATE mismatch: '07/08/1994' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "was" not aligned to anything
Hand-tuned score (dot product of above): -19.8221
Threshold: -9.4738
Txt: Yesterday APCOM signed the contract. Today is Saturday, July 14th.
Hyp: APCOM signed the contract Friday, 13th. (yes)
APCOM NNP |
signed VBD |
the DT |
contract NN |
Friday NNP |
13th NN |
|
Yesterday:NNP | 0.82 | 0.80 | 0.71 | 0.89 | 1.02 | 0.78 |
APCOM:NNP | 0.28 | 0.82 | 0.71 | 0.82 | 1.07 | 0.82 |
signed:VBD | 0.91 | 1.32 | 0.76 | 0.45 | 1.16 | 0.91 |
the:DT | 1.00 | 1.25 | 2.42 | 1.00 | 1.25 | 0.96 |
contract:NN | 0.82 | 0.36 | 0.71 | 0.28 | 1.18 | 0.89 |
Today:NNP | 0.82 | 0.82 | 0.71 | 0.90 | 1.05 | 0.87 |
is:VBZ | 0.91 | 0.78 | 0.76 | 0.91 | 1.16 | 0.91 |
Saturday:NNP | 1.07 | 1.04 | 0.96 | 1.11 | 0.60 | 0.90 |
July:NNP | 1.07 | 1.07 | 0.96 | 1.16 | 0.74 | 0.98 |
14th:NNP | 1.07 | 1.07 | 0.93 | 1.15 | 0.67 | 0.07 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.09 | 0.25 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.16 Alignment.score
1.00 0.19 Alignment.isGood
-1.00 0.79 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Friday" modifying "signed"
-2.00 1.00 Date.hypDateIns : hypothesis date insertion: Friday
-1.00 1.00 NullPunisher.other : contract
-1.00 1.00 NullPunisher.other : 13th
-1.00 1.00 NullPunisher.other : APCOM
-1.00 1.00 NullPunisher.other : signed
-3.00 1.00 NullPunisher.entity : Friday
-0.10 1.00 NullPunisher.article : the
-6.00 1.00 Numeric.mismatch : DATE mismatch: '01/01/1000' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -19.1308
Threshold: -9.4738
Txt: Smith left before Jones left. Jones left before Anderson left.
Hyp: Smith left before Anderson left. (yes)
Smith NNP |
left VBD |
before IN |
Anderson NNP |
left VBD |
|
Smith:NNP | 0.28 | 1.05 | 0.75 | 0.76 | 1.05 |
left:VBD | 1.14 | 1.32 | 1.11 | 1.16 | 1.32 |
before:IN | 1.57 | 1.20 | 1.59 | 1.57 | 1.20 |
Jones:NNP | 0.70 | 1.07 | 0.73 | 0.75 | 1.07 |
left:VBD | 1.14 | 1.32 | 1.11 | 1.16 | 1.32 |
Jones:NNP | 0.70 | 1.07 | 0.73 | 0.75 | 1.07 |
left:VBD | 1.14 | 1.32 | 1.11 | 1.16 | 1.32 |
before:IN | 1.57 | 1.20 | 1.59 | 1.57 | 1.20 |
Anderson:NNP | 0.76 | 1.07 | 0.75 | 0.28 | 1.07 |
left:VBD | 1.14 | 1.32 | 1.11 | 1.16 | 1.32 |
NO_WORD | 0.28 | 0.17 | 0.92 | 0.28 | 0.42 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.19 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : Anderson
-1.00 1.00 NullPunisher.other : before
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : left
-1.00 1.00 NullPunisher.other : left
-2.00 1.00 RootEntailment.unalignedRoot : "left" not aligned to anything
Hand-tuned score (dot product of above): -11.8902
Threshold: -9.4738
Txt: Smith left after Jones left. Jones left after Anderson left.
Hyp: Smith left after Anderson left. (yes)
Smith NNP |
left VBD |
after IN |
Anderson NNP |
left VBD |
|
Smith:NNP | 0.28 | 1.05 | 0.75 | 0.76 | 1.05 |
left:VBD | 1.14 | 1.32 | 1.13 | 1.16 | 1.32 |
after:IN | 1.57 | 1.21 | 1.59 | 1.52 | 1.21 |
Jones:NNP | 0.70 | 1.07 | 0.71 | 0.75 | 1.07 |
left:VBD | 1.14 | 1.32 | 1.13 | 1.16 | 1.32 |
Jones:NNP | 0.70 | 1.07 | 0.71 | 0.75 | 1.07 |
left:VBD | 1.14 | 1.32 | 1.13 | 1.16 | 1.32 |
after:IN | 1.57 | 1.21 | 1.59 | 1.52 | 1.21 |
Anderson:NNP | 0.76 | 1.07 | 0.70 | 0.28 | 1.07 |
left:VBD | 1.14 | 1.32 | 1.13 | 1.16 | 1.32 |
NO_WORD | 0.28 | 0.17 | 0.92 | 0.28 | 0.42 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.19 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : left
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : after
-3.00 1.00 NullPunisher.entity : Anderson
-1.00 1.00 NullPunisher.other : left
-2.00 1.00 RootEntailment.unalignedRoot : "left" not aligned to anything
Hand-tuned score (dot product of above): -11.8902
Threshold: -9.4738
Txt: Smith was present after Jones left. Jones left after Anderson was present.
Hyp: Smith was present after Anderson was present. (don't know)
Smith NNP |
was VBD |
present JJ |
after IN |
Anderson NNP |
was VBD |
present JJ |
|
Smith:NNP | 0.28 | 1.07 | 1.24 | 0.75 | 0.76 | 1.07 | 1.24 |
was:VBD | 1.16 | 1.32 | 1.02 | 1.15 | 1.16 | 1.32 | 1.02 |
present:JJ | 1.13 | 0.96 | 0.74 | 1.05 | 1.09 | 0.96 | 0.74 |
after:IN | 1.57 | 1.24 | 0.77 | 1.59 | 1.52 | 1.24 | 0.77 |
Jones:NNP | 0.70 | 1.07 | 1.24 | 0.71 | 0.75 | 1.07 | 1.24 |
left:VBD | 1.14 | 0.90 | 1.00 | 1.13 | 1.16 | 0.90 | 1.00 |
Jones:NNP | 0.70 | 1.07 | 1.24 | 0.71 | 0.75 | 1.07 | 1.24 |
left:VBD | 1.14 | 0.90 | 1.00 | 1.13 | 1.16 | 0.90 | 1.00 |
after:IN | 1.57 | 1.24 | 0.77 | 1.59 | 1.52 | 1.24 | 0.77 |
Anderson:NNP | 0.76 | 1.07 | 1.20 | 0.70 | 0.28 | 1.07 | 1.20 |
was:VBD | 1.16 | 1.32 | 1.02 | 1.15 | 1.16 | 1.32 | 1.02 |
present:JJ | 1.13 | 0.96 | 0.74 | 1.05 | 1.09 | 0.96 | 0.74 |
NO_WORD | 0.28 | 1.43 | 0.16 | 0.92 | 0.28 | 1.43 | 0.35 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.53 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : Anderson
-0.05 1.00 NullPunisher.aux : was
-1.00 1.00 NullPunisher.other : after
-0.05 1.00 NullPunisher.aux : was
-1.00 1.00 NullPunisher.other : present
-1.00 1.00 NullPunisher.other : present
-3.00 1.00 NullPunisher.entity : Smith
-2.00 1.00 RootEntailment.unalignedRoot : "present" not aligned to anything
Hand-tuned score (dot product of above): -11.7224
Threshold: -9.4738
Txt: Smith left. Jones left. Smith left before Jones left.
Hyp: Jones left after Smith left. (yes)
Jones NNP |
left VBD |
after IN |
Smith NNP |
left VBD |
|
Smith:NNP | 0.70 | 1.05 | 0.75 | 0.28 | 1.05 |
left:VBD | 1.16 | 1.32 | 1.13 | 1.14 | 1.32 |
Jones:NNP | 0.28 | 1.07 | 0.71 | 0.70 | 1.07 |
left:VBD | 1.16 | 1.32 | 1.13 | 1.14 | 1.32 |
Smith:NNP | 0.70 | 1.05 | 0.75 | 0.28 | 1.05 |
left:VBD | 1.16 | 1.32 | 1.13 | 1.14 | 1.32 |
before:IN | 1.55 | 1.20 | 2.47 | 1.57 | 1.20 |
Jones:NNP | 0.28 | 1.07 | 0.71 | 0.70 | 1.07 |
left:VBD | 1.16 | 1.32 | 1.13 | 1.14 | 1.32 |
NO_WORD | 0.28 | 0.17 | 0.92 | 0.28 | 0.42 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.19 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : Jones
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : left
-1.00 1.00 NullPunisher.other : left
-1.00 1.00 NullPunisher.other : after
-2.00 1.00 RootEntailment.unalignedRoot : "left" not aligned to anything
Hand-tuned score (dot product of above): -11.8902
Threshold: -9.4738
Txt: Smith left. Jones left. Smith left after Jones left.
Hyp: Jones left before Smith left. (yes)
Jones NNP |
left VBD |
before IN |
Smith NNP |
left VBD |
|
Smith:NNP | 0.70 | 1.05 | 0.75 | 0.28 | 1.05 |
left:VBD | 1.16 | 1.32 | 1.11 | 1.14 | 1.32 |
Jones:NNP | 0.28 | 1.07 | 0.73 | 0.70 | 1.07 |
left:VBD | 1.16 | 1.32 | 1.11 | 1.14 | 1.32 |
Smith:NNP | 0.70 | 1.05 | 0.75 | 0.28 | 1.05 |
left:VBD | 1.16 | 1.32 | 1.11 | 1.14 | 1.32 |
after:IN | 1.52 | 1.21 | 2.47 | 1.57 | 1.21 |
Jones:NNP | 0.28 | 1.07 | 0.73 | 0.70 | 1.07 |
left:VBD | 1.16 | 1.32 | 1.11 | 1.14 | 1.32 |
NO_WORD | 0.28 | 0.17 | 0.92 | 0.28 | 0.42 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.19 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : left
-1.00 1.00 NullPunisher.other : before
-3.00 1.00 NullPunisher.entity : Jones
-1.00 1.00 NullPunisher.other : left
-2.00 1.00 RootEntailment.unalignedRoot : "left" not aligned to anything
Hand-tuned score (dot product of above): -11.8902
Threshold: -9.4738
Txt: Smith left. Jones left. Jones left before Smith left.
Hyp: Smith left after Jones left. (yes)
Smith NNP |
left VBD |
after IN |
Jones NNP |
left VBD |
|
Smith:NNP | 0.28 | 1.05 | 0.75 | 0.70 | 1.05 |
left:VBD | 1.14 | 1.32 | 1.13 | 1.16 | 1.32 |
Jones:NNP | 0.70 | 1.07 | 0.71 | 0.28 | 1.07 |
left:VBD | 1.14 | 1.32 | 1.13 | 1.16 | 1.32 |
Jones:NNP | 0.70 | 1.07 | 0.71 | 0.28 | 1.07 |
left:VBD | 1.14 | 1.32 | 1.13 | 1.16 | 1.32 |
before:IN | 1.57 | 1.20 | 2.47 | 1.55 | 1.20 |
Smith:NNP | 0.28 | 1.05 | 0.75 | 0.70 | 1.05 |
left:VBD | 1.14 | 1.32 | 1.13 | 1.16 | 1.32 |
NO_WORD | 0.28 | 0.17 | 0.92 | 0.28 | 0.42 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.19 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : after
-3.00 1.00 NullPunisher.entity : Jones
-1.00 1.00 NullPunisher.other : left
-1.00 1.00 NullPunisher.other : left
-3.00 1.00 NullPunisher.entity : Smith
-2.00 1.00 RootEntailment.unalignedRoot : "left" not aligned to anything
Hand-tuned score (dot product of above): -11.8902
Threshold: -9.4738
Txt: Jones revised the contract. Smith revised the contract. Jones revised the contract before Smith did.
Hyp: Smith revised the contract after Jones did. (yes)
Smith NNP |
revised VBD |
the DT |
contract NN |
after IN |
Jones NNP |
did VBD |
|
Jones:NNP | 0.70 | 1.07 | 0.96 | 1.16 | 0.71 | 0.28 | 1.07 |
revised:VBD | 1.16 | 1.32 | 0.76 | 0.79 | 1.15 | 1.16 | 0.64 |
the:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.16 | 1.25 | 1.25 |
contract:NN | 1.16 | 0.70 | 0.71 | 0.28 | 0.50 | 1.16 | 0.71 |
Smith:NNP | 0.28 | 1.07 | 0.96 | 1.16 | 0.75 | 0.70 | 1.07 |
revised:VBD | 1.16 | 1.32 | 0.76 | 0.79 | 1.15 | 1.16 | 0.64 |
the:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.16 | 1.25 | 1.25 |
contract:NN | 1.16 | 0.70 | 0.71 | 0.28 | 0.50 | 1.16 | 0.71 |
Jones:NNP | 0.70 | 1.07 | 0.96 | 1.16 | 0.71 | 0.28 | 1.07 |
revised:VBD | 1.16 | 1.32 | 0.76 | 0.79 | 1.15 | 1.16 | 0.64 |
the:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.16 | 1.25 | 1.25 |
contract:NN | 1.16 | 0.70 | 0.71 | 0.28 | 0.50 | 1.16 | 0.71 |
Smith:NNP | 0.28 | 1.07 | 0.96 | 1.16 | 0.75 | 0.70 | 1.07 |
did:VBD | 1.16 | 0.64 | 0.76 | 0.80 | 1.15 | 1.16 | 1.32 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.92 | 0.28 | 0.42 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.27 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-0.10 1.00 NullPunisher.article : the
-3.00 1.00 NullPunisher.entity : Jones
-1.00 1.00 NullPunisher.other : contract
-0.05 1.00 NullPunisher.aux : did
-1.00 1.00 NullPunisher.other : revised
-1.00 1.00 NullPunisher.other : after
-3.00 1.00 NullPunisher.entity : Smith
-2.00 1.00 RootEntailment.unalignedRoot : "revised" not aligned to anything
Hand-tuned score (dot product of above): -12.1404
Threshold: -9.4738
Txt: Jones revised the contract. Smith revised the contract. Jones revised the contract after Smith did.
Hyp: Smith revised the contract before Jones did. (yes)
Smith NNP |
revised VBD |
the DT |
contract NN |
Jones NNP |
did VBD |
|
Jones:NNP | 0.70 | 1.07 | 0.96 | 1.16 | 0.28 | 1.07 |
revised:VBD | 1.16 | 1.32 | 0.76 | 0.79 | 1.16 | 0.64 |
the:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.25 | 1.25 |
contract:NN | 1.16 | 0.70 | 0.71 | 0.28 | 1.16 | 0.71 |
Smith:NNP | 0.28 | 1.07 | 0.96 | 1.16 | 0.70 | 1.07 |
revised:VBD | 1.16 | 1.32 | 0.76 | 0.79 | 1.16 | 0.64 |
the:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.25 | 1.25 |
contract:NN | 1.16 | 0.70 | 0.71 | 0.28 | 1.16 | 0.71 |
Jones:NNP | 0.70 | 1.07 | 0.96 | 1.16 | 0.28 | 1.07 |
revised:VBD | 1.16 | 1.32 | 0.76 | 0.79 | 1.16 | 0.64 |
the:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.25 | 1.25 |
contract:NN | 1.16 | 0.70 | 0.71 | 0.28 | 1.16 | 0.71 |
after:IN | 1.57 | 1.24 | 1.16 | 1.31 | 1.52 | 1.24 |
Smith:NNP | 0.28 | 1.07 | 0.96 | 1.16 | 0.70 | 1.07 |
did:VBD | 1.16 | 0.64 | 0.76 | 0.80 | 1.16 | 1.32 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.28 | 0.10 | 0.36 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.15 Alignment.score
1.00 0.19 Alignment.isGood
-1.00 0.79 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Jones" modifying "contract"
-0.10 1.00 NullPunisher.article : the
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : revised
-3.00 1.00 NullPunisher.entity : Jones
-0.05 1.00 NullPunisher.aux : did
-1.00 1.00 NullPunisher.other : contract
-2.00 1.00 RootEntailment.unalignedRoot : "revised" not aligned to anything
Hand-tuned score (dot product of above): -12.1952
Threshold: -9.4738
Txt: Smith swam. Jones swam. Smith swam before Jones swam.
Hyp: Jones swam after Smith swam. (don't know)
Jones NNP |
swam VB |
Smith NNP |
swam NN |
|
Smith:NNP | 0.70 | 1.05 | 0.28 | 1.05 |
swam:VBP | 1.16 | 2.02 | 1.14 | 1.75 |
Jones:NNP | 0.28 | 1.07 | 0.70 | 1.07 |
swam:VBP | 1.16 | 2.02 | 1.14 | 1.75 |
Smith:NNP | 0.70 | 1.05 | 0.28 | 1.05 |
swam:VB | 1.16 | 1.32 | 1.14 | 1.75 |
Jones:NNP | 0.28 | 1.07 | 0.70 | 1.07 |
swam:NN | 1.07 | 2.77 | 1.05 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.05 | 0.23 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 1.24 Alignment.score
1.00 0.41 Alignment.isGood
-1.00 0.56 Alignment.isBad
-0.10 0.00 Alignment.nbWordNotAligned
0.10 4.00 Alignment.hypSpan
0.10 1.00 Alignment.txtSpan
Hand-tuned score (dot product of above): 1.5946
Threshold: -9.4738
Txt: Smith swam to the shore. Jones swam to the shore. Smith swam to the shore before Jones swam to the shore.
Hyp: Jones swam to the shore after Smith swam to the shore. (yes)
Jones NNP |
swam VBP |
the DT |
shore NN |
after IN |
Smith NNP |
swam VB |
the DT |
shore NN |
|
Smith:NNP | 0.70 | 1.05 | 0.96 | 1.03 | 0.75 | 0.28 | 1.05 | 0.96 | 1.03 |
swam:VB | 1.16 | 2.02 | 0.76 | 0.77 | 1.15 | 1.14 | 1.32 | 0.76 | 0.77 |
the:DT | 1.25 | 1.25 | 2.42 | 0.94 | 1.16 | 1.25 | 1.25 | 2.42 | 0.94 |
shore:NN | 1.04 | 0.69 | 0.65 | 0.28 | 0.50 | 1.03 | 0.69 | 0.65 | 0.28 |
Jones:NNP | 0.28 | 1.07 | 0.96 | 1.04 | 0.71 | 0.70 | 1.07 | 0.96 | 1.04 |
swam:VB | 1.16 | 2.02 | 0.76 | 0.77 | 1.15 | 1.14 | 1.32 | 0.76 | 0.77 |
the:DT | 1.25 | 1.25 | 2.42 | 0.94 | 1.16 | 1.25 | 1.25 | 2.42 | 0.94 |
shore:NN | 1.04 | 0.69 | 0.65 | 0.28 | 0.50 | 1.03 | 0.69 | 0.65 | 0.28 |
Smith:NNP | 0.70 | 1.05 | 0.96 | 1.03 | 0.75 | 0.28 | 1.05 | 0.96 | 1.03 |
swam:VBP | 1.16 | 1.32 | 0.76 | 0.77 | 1.15 | 1.14 | 2.02 | 0.76 | 0.77 |
the:DT | 1.25 | 1.25 | 2.42 | 0.94 | 1.16 | 1.25 | 1.25 | 2.42 | 0.94 |
shore:NN | 1.04 | 0.69 | 0.65 | 0.28 | 0.50 | 1.03 | 0.69 | 0.65 | 0.28 |
before:IN | 1.55 | 1.24 | 1.05 | 1.21 | 2.47 | 1.57 | 1.24 | 1.05 | 1.21 |
Jones:NNP | 0.28 | 1.07 | 0.96 | 1.04 | 0.71 | 0.70 | 1.07 | 0.96 | 1.04 |
swam:VB | 1.16 | 2.02 | 0.76 | 0.77 | 1.15 | 1.14 | 1.32 | 0.76 | 0.77 |
the:DT | 1.25 | 1.25 | 2.42 | 0.94 | 1.16 | 1.25 | 1.25 | 2.42 | 0.94 |
shore:NN | 1.04 | 0.69 | 0.65 | 0.28 | 0.50 | 1.03 | 0.69 | 0.65 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.74 | 0.92 | 0.28 | 0.42 | 0.82 | 0.74 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.96 Alignment.score
1.00 0.35 Alignment.isGood
-1.00 0.62 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.44 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "shore" modifying "swam"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "shore" modifying "swam" is dropped on aligned hypothesis word "swam"
-1.00 1.00 NullPunisher.other : shore
-1.00 1.00 NullPunisher.other : after
-1.00 1.00 NullPunisher.other : shore
-0.10 1.00 NullPunisher.article : the
-0.10 1.00 NullPunisher.article : the
Hand-tuned score (dot product of above): -3.2654
Threshold: -9.4738
Txt: Smith was present. Jones was present. Smith was present after Jones was present.
Hyp: Jones was present before Smith was present. (don't know)
Jones NNP |
was VBD |
present JJ |
before IN |
Smith NNP |
was VBD |
present JJ |
|
Smith:NNP | 0.70 | 1.07 | 1.24 | 0.75 | 0.28 | 1.07 | 1.24 |
was:VBD | 1.16 | 1.32 | 1.02 | 1.15 | 1.16 | 1.32 | 1.02 |
present:JJ | 1.13 | 0.96 | 0.74 | 1.03 | 1.13 | 0.96 | 0.74 |
Jones:NNP | 0.28 | 1.07 | 1.24 | 0.73 | 0.70 | 1.07 | 1.24 |
was:VBD | 1.16 | 1.32 | 1.02 | 1.15 | 1.16 | 1.32 | 1.02 |
present:JJ | 1.13 | 0.96 | 0.74 | 1.03 | 1.13 | 0.96 | 0.74 |
Smith:NNP | 0.70 | 1.07 | 1.24 | 0.75 | 0.28 | 1.07 | 1.24 |
was:VBD | 1.16 | 1.32 | 1.02 | 1.15 | 1.16 | 1.32 | 1.02 |
present:JJ | 1.13 | 0.96 | 0.74 | 1.03 | 1.13 | 0.96 | 0.74 |
after:IN | 1.52 | 1.24 | 0.77 | 2.47 | 1.57 | 1.24 | 0.77 |
Jones:NNP | 0.28 | 1.07 | 1.24 | 0.73 | 0.70 | 1.07 | 1.24 |
was:VBD | 1.16 | 1.32 | 1.02 | 1.15 | 1.16 | 1.32 | 1.02 |
present:JJ | 1.13 | 0.96 | 0.74 | 1.03 | 1.13 | 0.96 | 0.74 |
NO_WORD | 0.28 | 1.43 | 0.16 | 0.92 | 0.28 | 1.43 | 0.35 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.53 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-0.05 1.00 NullPunisher.aux : was
-1.00 1.00 NullPunisher.other : present
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : present
-1.00 1.00 NullPunisher.other : before
-3.00 1.00 NullPunisher.entity : Jones
-0.05 1.00 NullPunisher.aux : was
-2.00 1.00 RootEntailment.unalignedRoot : "present" not aligned to anything
Hand-tuned score (dot product of above): -11.7224
Threshold: -9.4738
Txt: Smith was present. Jones was present. Smith was present before Jones was present.
Hyp: Jones was present after Smith was present. (don't know)
Jones NNP |
was VBD |
present JJ |
after IN |
Smith NNP |
was VBD |
present JJ |
|
Smith:NNP | 0.70 | 1.07 | 1.24 | 0.75 | 0.28 | 1.07 | 1.24 |
was:VBD | 1.16 | 1.32 | 1.02 | 1.15 | 1.16 | 1.32 | 1.02 |
present:JJ | 1.13 | 0.96 | 0.74 | 1.05 | 1.13 | 0.96 | 0.74 |
Jones:NNP | 0.28 | 1.07 | 1.24 | 0.71 | 0.70 | 1.07 | 1.24 |
was:VBD | 1.16 | 1.32 | 1.02 | 1.15 | 1.16 | 1.32 | 1.02 |
present:JJ | 1.13 | 0.96 | 0.74 | 1.05 | 1.13 | 0.96 | 0.74 |
Smith:NNP | 0.70 | 1.07 | 1.24 | 0.75 | 0.28 | 1.07 | 1.24 |
was:VBD | 1.16 | 1.32 | 1.02 | 1.15 | 1.16 | 1.32 | 1.02 |
present:JJ | 1.13 | 0.96 | 0.74 | 1.05 | 1.13 | 0.96 | 0.74 |
before:IN | 1.55 | 1.24 | 0.75 | 2.47 | 1.57 | 1.24 | 0.75 |
Jones:NNP | 0.28 | 1.07 | 1.24 | 0.71 | 0.70 | 1.07 | 1.24 |
was:VBD | 1.16 | 1.32 | 1.02 | 1.15 | 1.16 | 1.32 | 1.02 |
present:JJ | 1.13 | 0.96 | 0.74 | 1.05 | 1.13 | 0.96 | 0.74 |
NO_WORD | 0.28 | 1.43 | 0.16 | 0.92 | 0.28 | 1.43 | 0.35 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.53 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : after
-1.00 1.00 NullPunisher.other : present
-1.00 1.00 NullPunisher.other : present
-0.05 1.00 NullPunisher.aux : was
-3.00 1.00 NullPunisher.entity : Jones
-0.05 1.00 NullPunisher.aux : was
-2.00 1.00 RootEntailment.unalignedRoot : "present" not aligned to anything
Hand-tuned score (dot product of above): -11.7224
Threshold: -9.4738
Txt: Smith was writing a report. Jones was writing a report. Smith was writing a report before Jones was writing a report.
Hyp: Jones was writing a report after Smith was writing a report. (don't know)
Jones NNP |
was VBD |
writing VBG |
a DT |
report NN |
after IN |
Smith NNP |
was VBD |
writing VBG |
a DT |
report NN |
|
Smith:NNP | 0.70 | 1.07 | 1.04 | 0.96 | 1.14 | 0.75 | 0.28 | 1.07 | 1.04 | 0.96 | 1.14 |
was:VBD | 1.16 | 1.32 | 0.84 | 0.76 | 0.91 | 1.15 | 1.16 | 1.32 | 0.84 | 0.76 | 0.91 |
writing:VBG | 1.16 | 0.84 | 1.32 | 0.76 | 0.80 | 1.15 | 1.13 | 0.84 | 1.32 | 0.76 | 0.80 |
a:DT | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 | 1.12 | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 |
report:NN | 1.16 | 0.82 | 0.71 | 0.71 | 0.28 | 0.48 | 1.14 | 0.82 | 0.71 | 0.71 | 0.28 |
Jones:NNP | 0.28 | 1.07 | 1.07 | 0.96 | 1.16 | 0.71 | 0.70 | 1.07 | 1.07 | 0.96 | 1.16 |
was:VBD | 1.16 | 1.32 | 0.84 | 0.76 | 0.91 | 1.15 | 1.16 | 1.32 | 0.84 | 0.76 | 0.91 |
writing:VBG | 1.16 | 0.84 | 1.32 | 0.76 | 0.80 | 1.15 | 1.13 | 0.84 | 1.32 | 0.76 | 0.80 |
a:DT | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 | 1.12 | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 |
report:NN | 1.16 | 0.82 | 0.71 | 0.71 | 0.28 | 0.48 | 1.14 | 0.82 | 0.71 | 0.71 | 0.28 |
Smith:NNP | 0.70 | 1.07 | 1.04 | 0.96 | 1.14 | 0.75 | 0.28 | 1.07 | 1.04 | 0.96 | 1.14 |
was:VBD | 1.16 | 1.32 | 0.84 | 0.76 | 0.91 | 1.15 | 1.16 | 1.32 | 0.84 | 0.76 | 0.91 |
writing:VBG | 1.16 | 0.84 | 1.32 | 0.76 | 0.80 | 1.15 | 1.13 | 0.84 | 1.32 | 0.76 | 0.80 |
a:DT | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 | 1.12 | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 |
report:NN | 1.16 | 0.82 | 0.71 | 0.71 | 0.28 | 0.48 | 1.14 | 0.82 | 0.71 | 0.71 | 0.28 |
before:IN | 1.55 | 1.24 | 1.24 | 1.11 | 1.20 | 2.47 | 1.57 | 1.24 | 1.24 | 1.11 | 1.20 |
Jones:NNP | 0.28 | 1.07 | 1.07 | 0.96 | 1.16 | 0.71 | 0.70 | 1.07 | 1.07 | 0.96 | 1.16 |
was:VBD | 1.16 | 1.32 | 0.84 | 0.76 | 0.91 | 1.15 | 1.16 | 1.32 | 0.84 | 0.76 | 0.91 |
writing:VBG | 1.16 | 0.84 | 1.32 | 0.76 | 0.80 | 1.15 | 1.13 | 0.84 | 1.32 | 0.76 | 0.80 |
a:DT | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 | 1.12 | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 |
report:NN | 1.16 | 0.82 | 0.71 | 0.71 | 0.28 | 0.48 | 1.14 | 0.82 | 0.71 | 0.71 | 0.28 |
NO_WORD | 0.28 | 1.30 | 0.17 | 0.82 | 0.09 | 0.92 | 0.28 | 1.30 | 0.42 | 0.82 | 0.09 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.49 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 11.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-0.10 1.00 NullPunisher.article : a
-0.05 1.00 NullPunisher.aux : was
-1.00 1.00 NullPunisher.other : report
-1.00 1.00 NullPunisher.other : report
-0.05 1.00 NullPunisher.aux : was
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : writing
-3.00 1.00 NullPunisher.entity : Jones
-1.00 1.00 NullPunisher.other : after
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : writing
-2.00 1.00 RootEntailment.unalignedRoot : "writing" not aligned to anything
Hand-tuned score (dot product of above): -14.3875
Threshold: -9.4738
Txt: Smith was writing a report. Jones was writing a report. Smith was writing a report after Jones was writing a report.
Hyp: Jones was writing a report before Smith was writing a report. (don't know)
Jones NNP |
was VBD |
writing VBG |
a DT |
report NN |
before IN |
Smith NNP |
was VBD |
writing VBG |
a DT |
report NN |
|
Smith:NNP | 0.70 | 1.07 | 1.04 | 0.96 | 1.14 | 0.75 | 0.28 | 1.07 | 1.04 | 0.96 | 1.14 |
was:VBD | 1.16 | 1.32 | 0.84 | 0.76 | 0.91 | 1.15 | 1.16 | 1.32 | 0.84 | 0.76 | 0.91 |
writing:VBG | 1.16 | 0.84 | 1.32 | 0.76 | 0.80 | 1.15 | 1.13 | 0.84 | 1.32 | 0.76 | 0.80 |
a:DT | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 | 1.11 | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 |
report:NN | 1.16 | 0.82 | 0.71 | 0.71 | 0.28 | 0.39 | 1.14 | 0.82 | 0.71 | 0.71 | 0.28 |
Jones:NNP | 0.28 | 1.07 | 1.07 | 0.96 | 1.16 | 0.73 | 0.70 | 1.07 | 1.07 | 0.96 | 1.16 |
was:VBD | 1.16 | 1.32 | 0.84 | 0.76 | 0.91 | 1.15 | 1.16 | 1.32 | 0.84 | 0.76 | 0.91 |
writing:VBG | 1.16 | 0.84 | 1.32 | 0.76 | 0.80 | 1.15 | 1.13 | 0.84 | 1.32 | 0.76 | 0.80 |
a:DT | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 | 1.11 | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 |
report:NN | 1.16 | 0.82 | 0.71 | 0.71 | 0.28 | 0.39 | 1.14 | 0.82 | 0.71 | 0.71 | 0.28 |
Smith:NNP | 0.70 | 1.07 | 1.04 | 0.96 | 1.14 | 0.75 | 0.28 | 1.07 | 1.04 | 0.96 | 1.14 |
was:VBD | 1.16 | 1.32 | 0.84 | 0.76 | 0.91 | 1.15 | 1.16 | 1.32 | 0.84 | 0.76 | 0.91 |
writing:VBG | 1.16 | 0.84 | 1.32 | 0.76 | 0.80 | 1.15 | 1.13 | 0.84 | 1.32 | 0.76 | 0.80 |
a:DT | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 | 1.11 | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 |
report:NN | 1.16 | 0.82 | 0.71 | 0.71 | 0.28 | 0.39 | 1.14 | 0.82 | 0.71 | 0.71 | 0.28 |
after:IN | 1.52 | 1.24 | 1.24 | 1.12 | 1.29 | 2.47 | 1.57 | 1.24 | 1.24 | 1.12 | 1.29 |
Jones:NNP | 0.28 | 1.07 | 1.07 | 0.96 | 1.16 | 0.73 | 0.70 | 1.07 | 1.07 | 0.96 | 1.16 |
was:VBD | 1.16 | 1.32 | 0.84 | 0.76 | 0.91 | 1.15 | 1.16 | 1.32 | 0.84 | 0.76 | 0.91 |
writing:VBG | 1.16 | 0.84 | 1.32 | 0.76 | 0.80 | 1.15 | 1.13 | 0.84 | 1.32 | 0.76 | 0.80 |
a:DT | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 | 1.11 | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 |
report:NN | 1.16 | 0.82 | 0.71 | 0.71 | 0.28 | 0.39 | 1.14 | 0.82 | 0.71 | 0.71 | 0.28 |
NO_WORD | 0.28 | 1.30 | 0.17 | 0.82 | 0.09 | 0.92 | 0.28 | 1.30 | 0.42 | 0.82 | 0.09 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.49 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 11.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : writing
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : before
-1.00 1.00 NullPunisher.other : report
-1.00 1.00 NullPunisher.other : report
-3.00 1.00 NullPunisher.entity : Jones
-1.00 1.00 NullPunisher.other : writing
-3.00 1.00 NullPunisher.entity : Smith
-0.05 1.00 NullPunisher.aux : was
-0.05 1.00 NullPunisher.aux : was
-2.00 1.00 RootEntailment.unalignedRoot : "writing" not aligned to anything
Hand-tuned score (dot product of above): -14.3875
Threshold: -9.4738
Txt: Smith left the meeting before he lost his temper.
Hyp: Smith lost his temper. (don't know)
Smith NNP |
lost VBD |
his PRP$ |
temper NN |
|
Smith:NNP | 0.28 | 1.05 | 1.84 | 1.06 |
left:VBD | 1.14 | 0.41 | 0.54 | 0.77 |
the:DT | 1.25 | 1.25 | 1.16 | 0.97 |
meeting:NN | 1.15 | 0.82 | 1.58 | 0.69 |
before:IN | 1.57 | 1.24 | 1.57 | 1.27 |
he:PRP | 2.76 | 1.21 | 0.82 | 1.30 |
lost:VBD | 1.14 | 1.32 | 0.51 | 0.87 |
his:PRP$ | 2.76 | 1.18 | 1.05 | 1.30 |
temper:NN | 1.06 | 0.78 | 1.58 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.93 | 0.09 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.23 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-0.10 1.00 NullPunisher.functionWord : his
-1.00 1.00 NullPunisher.other : temper
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : lost
-2.00 1.00 RootEntailment.unalignedRoot : "lost" not aligned to anything
Hand-tuned score (dot product of above): -7.8399
Threshold: -9.4738
Txt: Smith lived in Birmingham in 1991.
Hyp: Smith lived in Birmingham in 1992. (don't know)
Smith NNP |
lived VBD |
Birmingham NNP |
1992 CD |
|
Smith:NNP | 0.28 | 1.07 | 0.95 | 1.20 |
lived:VBD | 1.16 | 1.32 | 1.16 | 1.39 |
Birmingham:NNP | 0.95 | 1.07 | 0.28 | 1.20 |
1991:CD | 1.24 | 1.17 | 1.24 | 1.56 |
NO_WORD | 0.28 | 0.17 | 0.07 | 0.57 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.35 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Birmingham" modifying "lived"
-3.00 1.00 Date.dateHeadMismatch : 1992 vs. 1991
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : lived
-3.00 1.00 NullPunisher.entity : Birmingham
-2.00 1.00 RootEntailment.unalignedRoot : "lived" not aligned to anything
Hand-tuned score (dot product of above): -13.3578
Threshold: -9.4738
Txt: Smith wrote his first novel in 1991.
Hyp: Smith wrote his first novel in 1992. (don't know)
Smith NNP |
wrote VBD |
his PRP$ |
first JJ |
novel NN |
1992 CD |
|
Smith:NNP | 0.28 | 1.03 | 1.84 | 1.15 | 1.11 | 1.20 |
wrote:VBD | 1.12 | 1.32 | 0.54 | 1.23 | 0.33 | 1.21 |
his:PRP$ | 2.76 | 1.21 | 1.05 | 1.59 | 1.30 | 1.46 |
first:JJ | 1.05 | 1.17 | 1.25 | 0.74 | 1.13 | 1.04 |
novel:NN | 1.11 | 0.24 | 1.58 | 1.24 | 0.28 | 1.22 |
1991:CD | 1.24 | 0.97 | 1.32 | 0.91 | 1.12 | 1.56 |
NO_WORD | 0.28 | 0.17 | 0.93 | 0.11 | 0.09 | 0.57 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.43 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "first" modifying "novel"
-3.00 1.00 Date.dateHeadMismatch : 1992 vs. 1991
-3.00 1.00 NullPunisher.entity : first
-1.00 1.00 NullPunisher.other : wrote
-1.00 1.00 NullPunisher.other : novel
-3.00 1.00 NullPunisher.entity : Smith
-0.10 1.00 NullPunisher.functionWord : his
-2.00 1.00 RootEntailment.unalignedRoot : "wrote" not aligned to anything
Hand-tuned score (dot product of above): -14.5496
Threshold: -9.4738
Txt: Smith wrote a novel in 1991.
Hyp: Smith wrote it in 1992. (don't know)
Smith NNP |
wrote VBD |
it PRP |
1992 CD |
|
Smith:NNP | 0.28 | 1.03 | 1.80 | 1.20 |
wrote:VBD | 1.12 | 1.32 | 0.54 | 1.21 |
a:DT | 1.25 | 1.25 | 1.15 | 1.18 |
novel:NN | 1.11 | 0.24 | 1.58 | 1.22 |
1991:CD | 1.24 | 0.97 | 1.32 | 1.56 |
NO_WORD | 0.28 | 0.17 | 0.91 | 0.57 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.59 Alignment.score
1.00 0.27 Alignment.isGood
-1.00 0.71 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-3.00 1.00 Date.dateHeadMismatch : 1992 vs. 1991
-1.00 1.00 NullPunisher.other : wrote
-1.00 1.00 NullPunisher.other : it
-3.00 1.00 NullPunisher.entity : Smith
-2.00 1.00 RootEntailment.unalignedRoot : "wrote" not aligned to anything
Hand-tuned score (dot product of above): -10.0219
Threshold: -9.4738
Txt: Smith wrote a novel in 1991.
Hyp: Smith wrote a novel in 1992. (don't know)
Smith NNP |
wrote VBD |
a DT |
novel NN |
1992 CD |
|
Smith:NNP | 0.28 | 1.03 | 0.96 | 1.11 | 1.20 |
wrote:VBD | 1.12 | 1.32 | 0.76 | 0.33 | 1.21 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.18 |
novel:NN | 1.11 | 0.24 | 0.71 | 0.28 | 1.22 |
1991:CD | 1.24 | 0.97 | 0.89 | 1.12 | 1.56 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.57 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.47 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.20 Alignment.txtSpan
-3.00 1.00 Date.dateHeadMismatch : 1992 vs. 1991
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : wrote
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : novel
-2.00 1.00 RootEntailment.unalignedRoot : "wrote" not aligned to anything
Hand-tuned score (dot product of above): -10.3910
Threshold: -9.4738
Txt: Smith was running a business in 1991.
Hyp: Smith was running it in 1992. (don't know)
Smith NNP |
was VBD |
running VBG |
it PRP |
1992 CD |
|
Smith:NNP | 0.28 | 1.07 | 1.07 | 1.80 | 1.20 |
was:VBD | 1.16 | 1.32 | 0.82 | 0.54 | 1.39 |
running:VBG | 1.16 | 0.74 | 1.32 | 0.54 | 1.02 |
a:DT | 1.25 | 1.25 | 1.25 | 1.15 | 1.18 |
business:NN | 1.20 | 0.82 | 0.77 | 1.58 | 1.29 |
1991:CD | 1.24 | 1.30 | 1.09 | 1.32 | 1.56 |
NO_WORD | 0.28 | 1.30 | 0.17 | 0.91 | 0.57 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.73 Alignment.score
1.00 0.30 Alignment.isGood
-1.00 0.68 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.20 Alignment.txtSpan
-3.00 1.00 Date.dateHeadMismatch : 1992 vs. 1991
-1.00 1.00 NullPunisher.other : it
-1.00 1.00 NullPunisher.other : running
-3.00 1.00 NullPunisher.entity : Smith
-0.05 1.00 NullPunisher.aux : was
-2.00 1.00 RootEntailment.unalignedRoot : "running" not aligned to anything
Hand-tuned score (dot product of above): -9.9754
Threshold: -9.4738
Txt: Smith discovered a new species in 1991.
Hyp: Smith discovered it in 1992. (don't know)
Smith NNP |
discovered VBD |
it PRP |
1992 CD |
|
Smith:NNP | 0.28 | 1.07 | 1.80 | 1.20 |
discovered:VBD | 1.16 | 1.32 | 0.54 | 1.12 |
a:DT | 1.25 | 1.25 | 1.15 | 1.18 |
new:JJ | 1.13 | 0.96 | 1.05 | 1.13 |
species:NN | 1.05 | 0.67 | 1.58 | 1.29 |
1991:CD | 1.24 | 1.00 | 1.32 | 1.56 |
NO_WORD | 0.28 | 0.17 | 0.91 | 0.57 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.59 Alignment.score
1.00 0.27 Alignment.isGood
-1.00 0.71 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-3.00 1.00 Date.dateHeadMismatch : 1992 vs. 1991
-1.00 1.00 NullPunisher.other : discovered
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : it
-2.00 1.00 RootEntailment.unalignedRoot : "discovered" not aligned to anything
Hand-tuned score (dot product of above): -10.0219
Threshold: -9.4738
Txt: Smith discovered a new species in 1991.
Hyp: Smith discovered a new species in 1992. (don't know)
Smith NNP |
discovered VBD |
a DT |
new JJ |
species NN |
1992 CD |
|
Smith:NNP | 0.28 | 1.07 | 0.96 | 1.24 | 1.05 | 1.20 |
discovered:VBD | 1.16 | 1.32 | 0.76 | 1.02 | 0.76 | 1.12 |
a:DT | 1.25 | 1.25 | 2.42 | 1.18 | 1.00 | 1.18 |
new:JJ | 1.13 | 0.96 | 0.76 | 0.74 | 0.88 | 1.13 |
species:NN | 1.05 | 0.67 | 0.71 | 0.99 | 0.28 | 1.29 |
1991:CD | 1.24 | 1.00 | 0.89 | 1.00 | 1.33 | 1.56 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.11 | 0.09 | 0.57 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.41 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "new" modifying "species"
-3.00 1.00 Date.dateHeadMismatch : 1992 vs. 1991
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : species
-1.00 1.00 NullPunisher.other : new
-1.00 1.00 NullPunisher.other : discovered
-3.00 1.00 NullPunisher.entity : Smith
-2.00 1.00 RootEntailment.unalignedRoot : "discovered" not aligned to anything
Hand-tuned score (dot product of above): -12.5764
Threshold: -9.4738
Txt: Smith wrote a report in two hours. Smith started writing the report at 8 am.
Hyp: Smith had finished writing the report by 11 am. (yes)
Smith NNP |
had VBD |
finished VBN |
writing VBG |
the DT |
report NN |
11 CD |
am RB |
|
Smith:NNP | 0.28 | 1.07 | 1.06 | 1.04 | 0.96 | 1.14 | 1.20 | 1.56 |
wrote:VBD | 1.12 | 0.75 | 0.71 | 0.68 | 0.76 | 0.75 | 1.39 | 0.91 |
a:DT | 1.25 | 1.25 | 1.25 | 1.25 | 1.31 | 1.00 | 1.18 | 1.38 |
report:NN | 1.14 | 0.82 | 0.77 | 0.71 | 0.71 | 0.28 | 1.22 | 1.31 |
two:CD | 1.24 | 1.30 | 1.21 | 1.12 | 0.82 | 1.33 | 0.04 | 1.27 |
hours:NNS | 1.13 | 0.82 | 0.65 | 0.75 | 0.71 | 0.72 | 1.16 | 1.31 |
Smith:NNP | 0.28 | 1.07 | 1.06 | 1.04 | 0.96 | 1.14 | 1.20 | 1.56 |
started:VBD | 1.13 | 0.84 | 1.55 | 0.44 | 0.76 | 0.89 | 1.33 | 1.11 |
writing:VBG | 1.13 | 0.75 | 0.71 | 1.32 | 0.76 | 0.80 | 1.33 | 0.97 |
the:DT | 1.25 | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 | 1.18 | 1.45 |
report:NN | 1.14 | 0.82 | 0.77 | 0.71 | 0.71 | 0.28 | 1.22 | 1.31 |
8:CD | 1.24 | 1.30 | 1.29 | 1.27 | 0.89 | 1.29 | 0.39 | 1.24 |
am:RB | 1.47 | 0.87 | 0.91 | 0.78 | 1.09 | 1.21 | 1.27 | 1.23 |
NO_WORD | 0.28 | 1.30 | 0.17 | 0.16 | 0.82 | 0.09 | 0.81 | 0.04 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.45 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "am" modifying "11"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "am" modifying "8" is dropped on aligned hypothesis word "11"
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : finished
-0.05 1.00 NullPunisher.aux : had
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : am
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '11.0' vs '8.0'
1.00 1.00 Quant.equivalent : Replacing the quantifier "a" by an equivalent quantifier "the" preserves truth.
-2.00 1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -12.9719
Threshold: -9.4738
Txt: Smith wrote a report in two hours.
Hyp: Smith spent two hours writing the report. (don't know)
Smith NNP |
spent VBD |
two CD |
hours NNS |
writing VBG |
the DT |
report NN |
|
Smith:NNP | 0.28 | 1.03 | 1.20 | 1.13 | 1.04 | 0.96 | 1.14 |
wrote:VBD | 1.12 | 0.61 | 1.30 | 0.91 | 0.68 | 0.76 | 0.75 |
a:DT | 1.25 | 1.25 | 1.18 | 1.00 | 1.25 | 1.31 | 1.00 |
report:NN | 1.14 | 0.76 | 1.29 | 0.72 | 0.71 | 0.71 | 0.28 |
two:CD | 1.24 | 0.93 | 0.85 | 1.16 | 1.12 | 0.82 | 1.33 |
hours:NNS | 1.13 | 0.54 | 1.13 | 0.28 | 0.75 | 0.71 | 0.72 |
NO_WORD | 0.28 | 0.17 | 0.52 | 0.09 | 0.16 | 0.82 | 0.09 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.21 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.29 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "two" modifying "hours"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "hours" modifying "wrote" is dropped on aligned hypothesis word "writing"
-1.00 1.00 NullPunisher.other : hours
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : spent
-3.00 1.00 NullPunisher.entity : two
-0.10 1.00 NullPunisher.article : the
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs ''
1.00 1.00 Quant.equivalent : Replacing the quantifier "a" by an equivalent quantifier "the" preserves truth.
-2.00 1.00 RootEntailment.unalignedRoot : "spent" not aligned to anything
Hand-tuned score (dot product of above): -16.3330
Threshold: -9.4738
Txt: Smith wrote a report in two hours.
Hyp: Smith spent more than two hours writing the report. (don't know)
Smith NNP |
spent VBD |
more_than IN |
two CD |
hours NNS |
writing VBG |
the DT |
report NN |
|
Smith:NNP | 0.28 | 1.03 | 0.75 | 1.20 | 1.13 | 1.04 | 0.96 | 1.14 |
wrote:VBD | 1.12 | 0.61 | 1.15 | 1.30 | 0.91 | 0.68 | 0.76 | 0.75 |
a:DT | 1.25 | 1.25 | 1.05 | 1.18 | 1.00 | 1.25 | 1.31 | 1.00 |
report:NN | 1.14 | 0.76 | 0.48 | 1.29 | 0.72 | 0.71 | 0.71 | 0.28 |
two:CD | 1.24 | 0.93 | 1.30 | 0.85 | 1.16 | 1.12 | 0.82 | 1.33 |
hours:NNS | 1.13 | 0.54 | 0.50 | 1.13 | 0.28 | 0.75 | 0.71 | 0.72 |
NO_WORD | 0.28 | 0.17 | 0.84 | 0.52 | 0.09 | 0.16 | 0.82 | 0.09 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.29 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "two" modifying "hours"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "hours" modifying "wrote" is dropped on aligned hypothesis word "writing"
-1.00 1.00 NullPunisher.other : more_than
-1.00 1.00 NullPunisher.other : hours
-3.00 1.00 NullPunisher.entity : Smith
-3.00 1.00 NullPunisher.entity : two
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : spent
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '>2.0' vs ''
1.00 1.00 Quant.equivalent : Replacing the quantifier "a" by an equivalent quantifier "the" preserves truth.
-2.00 1.00 RootEntailment.unalignedRoot : "spent" not aligned to anything
Hand-tuned score (dot product of above): -17.3306
Threshold: -9.4738
Txt: Smith wrote a report in two hours.
Hyp: Smith wrote a report in one hour. (don't know)
Smith NNP |
wrote VBD |
a DT |
report NN |
one CD |
hour NN |
|
Smith:NNP | 0.28 | 1.03 | 0.96 | 1.14 | 1.20 | 1.12 |
wrote:VBD | 1.12 | 1.32 | 0.76 | 0.75 | 1.34 | 0.88 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.18 | 1.00 |
report:NN | 1.14 | 0.66 | 0.71 | 0.28 | 1.29 | 0.77 |
two:CD | 1.24 | 1.22 | 0.89 | 1.33 | 0.48 | 1.26 |
hours:NNS | 1.13 | 0.82 | 0.71 | 0.72 | 1.29 | 0.99 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.52 | 0.07 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.33 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.33 Alignment.txtSpan
-1.00 1.00 NullPunisher.other : wrote
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : report
-0.10 1.00 NullPunisher.article : a
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '1.0' vs '2.0'
-2.00 1.00 RootEntailment.unalignedRoot : "wrote" not aligned to anything
Hand-tuned score (dot product of above): -13.4655
Threshold: -9.4738
Txt: Smith wrote a report in two hours.
Hyp: Smith wrote a report. (yes)
Smith NNP |
wrote VBD |
a DT |
report NN |
|
Smith:NNP | 0.28 | 1.03 | 0.96 | 1.14 |
wrote:VBD | 1.12 | 1.32 | 0.76 | 0.75 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 |
report:NN | 1.14 | 0.66 | 0.71 | 0.28 |
two:CD | 1.24 | 1.22 | 0.89 | 1.33 |
hours:NNS | 1.13 | 0.82 | 0.71 | 0.72 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.20 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : Smith
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : report
-1.00 1.00 NullPunisher.other : wrote
-2.00 1.00 RootEntailment.unalignedRoot : "wrote" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738
Txt: Smith discovered a new species in two hours.
Hyp: Smith spent two hours discovering the new species. (don't know)
Smith NNP |
spent VBD |
two CD |
hours NNS |
discovering VBG |
the DT |
new JJ |
species NN |
|
Smith:NNP | 0.28 | 1.03 | 1.20 | 1.13 | 1.07 | 0.96 | 1.24 | 1.05 |
discovered:VBD | 1.16 | 0.46 | 1.10 | 0.88 | 0.87 | 0.76 | 1.02 | 0.76 |
a:DT | 1.25 | 1.25 | 1.18 | 1.00 | 1.25 | 1.31 | 1.18 | 1.00 |
new:JJ | 1.13 | 0.89 | 1.11 | 0.86 | 0.91 | 0.76 | 0.74 | 0.88 |
species:NN | 1.05 | 0.74 | 1.29 | 0.83 | 0.79 | 0.71 | 0.99 | 0.28 |
two:CD | 1.24 | 0.93 | 0.85 | 1.16 | 1.12 | 0.82 | 0.98 | 1.33 |
hours:NNS | 1.13 | 0.54 | 1.13 | 0.28 | 0.79 | 0.71 | 0.97 | 0.83 |
NO_WORD | 0.28 | 0.17 | 0.52 | 0.09 | 0.16 | 0.82 | 0.11 | 0.09 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.23 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.12 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "new" modifying "species"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "new" modifying "species" is dropped on aligned hypothesis word "species"
-1.00 1.00 NullPunisher.other : new
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : spent
-3.00 1.00 NullPunisher.entity : two
-1.00 1.00 NullPunisher.other : hours
-0.10 1.00 NullPunisher.article : the
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs ''
1.00 1.00 Quant.equivalent : Replacing the quantifier "a" by an equivalent quantifier "the" preserves truth.
-2.00 1.00 RootEntailment.unalignedRoot : "spent" not aligned to anything
Hand-tuned score (dot product of above): -17.4325
Threshold: -9.4738
Txt: Smith discovered a new species in two hours.
Hyp: Smith discovered a new species. (yes)
Smith NNP |
discovered VBD |
a DT |
new JJ |
species NN |
|
Smith:NNP | 0.28 | 1.07 | 0.96 | 1.24 | 1.05 |
discovered:VBD | 1.16 | 1.32 | 0.76 | 1.02 | 0.76 |
a:DT | 1.25 | 1.25 | 2.42 | 1.18 | 1.00 |
new:JJ | 1.13 | 0.96 | 0.76 | 0.74 | 0.88 |
species:NN | 1.05 | 0.67 | 0.71 | 0.99 | 0.28 |
two:CD | 1.24 | 1.01 | 0.89 | 0.98 | 1.33 |
hours:NNS | 1.13 | 0.79 | 0.71 | 0.97 | 0.83 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.11 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.18 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "new" modifying "species"
-3.00 1.00 NullPunisher.entity : Smith
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : discovered
-1.00 1.00 NullPunisher.other : new
-1.00 1.00 NullPunisher.other : species
-2.00 1.00 RootEntailment.unalignedRoot : "discovered" not aligned to anything
Hand-tuned score (dot product of above): -10.0020
Threshold: -9.4738
Txt: Smith discovered many new species in two hours.
Hyp: Smith spent two hours discovering new species. (yes)
Smith NNP |
spent VBD |
two CD |
hours NNS |
discovering VBG |
new JJ |
species NNS |
|
Smith:NNP | 0.28 | 1.03 | 1.20 | 1.13 | 1.07 | 1.24 | 1.05 |
discovered:VBD | 1.16 | 0.46 | 1.10 | 0.88 | 0.87 | 1.02 | 0.76 |
many:JJ | 1.11 | 0.93 | 1.21 | 0.88 | 0.96 | 0.93 | 0.88 |
new:JJ | 1.13 | 0.89 | 1.11 | 0.86 | 0.91 | 0.74 | 0.88 |
species:NNS | 1.05 | 0.74 | 1.29 | 0.83 | 0.79 | 0.99 | 0.28 |
two:CD | 1.24 | 0.93 | 0.85 | 1.16 | 1.12 | 0.98 | 1.33 |
hours:NNS | 1.13 | 0.54 | 1.13 | 0.28 | 0.79 | 0.97 | 0.83 |
NO_WORD | 0.28 | 0.17 | 0.52 | 0.09 | 0.16 | 0.11 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.14 Alignment.score
1.00 0.19 Alignment.isGood
-1.00 0.79 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.14 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "new" modifying "species"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "new" modifying "species" is dropped on aligned hypothesis word "species"
-3.00 1.00 NullPunisher.entity : Smith
-3.00 1.00 NullPunisher.entity : two
-1.00 1.00 NullPunisher.other : new
-1.00 1.00 NullPunisher.other : spent
-1.00 1.00 NullPunisher.other : hours
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "spent" not aligned to anything
Hand-tuned score (dot product of above): -18.3427
Threshold: -9.4738
Txt: Smith was running his own business in two years.
Hyp: Smith spent two years running his own business. (don't know)
Smith NNP |
spent VBD |
two CD |
years NNS |
running VBG |
his PRP$ |
own JJ |
business NN |
|
Smith:NNP | 0.28 | 1.03 | 1.20 | 1.16 | 1.07 | 1.84 | 1.24 | 1.20 |
was:VBD | 1.16 | 0.91 | 1.39 | 0.85 | 0.82 | 0.47 | 1.02 | 0.91 |
running:VBG | 1.16 | 0.46 | 1.37 | 0.91 | 1.32 | 0.54 | 1.02 | 0.86 |
his:PRP$ | 2.76 | 0.97 | 1.57 | 1.30 | 0.97 | 1.05 | 1.39 | 1.30 |
own:JJ | 1.13 | 0.96 | 1.06 | 0.88 | 0.96 | 1.05 | 0.74 | 0.88 |
business:NN | 1.20 | 0.72 | 1.25 | 0.95 | 0.77 | 1.58 | 0.99 | 0.28 |
two:CD | 1.24 | 0.93 | 0.85 | 1.07 | 1.28 | 1.43 | 0.93 | 1.28 |
years:NNS | 1.16 | 0.58 | 1.04 | 0.28 | 0.82 | 1.58 | 0.99 | 0.95 |
NO_WORD | 0.28 | 0.17 | 0.52 | 0.09 | 0.16 | 0.93 | 0.11 | 0.09 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.10 Alignment.score
1.00 0.18 Alignment.isGood
-1.00 0.80 Alignment.isBad
-0.10 8.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "own" modifying "business"
-1.00 1.00 NullPunisher.other : years
-1.00 1.00 NullPunisher.other : own
-3.00 1.00 NullPunisher.entity : Smith
-3.00 1.00 NullPunisher.entity : two
-1.00 1.00 NullPunisher.other : running
-1.00 1.00 NullPunisher.other : spent
-1.00 1.00 NullPunisher.other : business
-0.10 1.00 NullPunisher.functionWord : his
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "spent" not aligned to anything
Hand-tuned score (dot product of above): -21.4173
Threshold: -9.4738
Txt: Smith was running his own business in two years.
Hyp: Smith spent more than two years running his own business. (don't know)
Smith NNP |
spent VBD |
more_than IN |
two CD |
years NNS |
running VBG |
his PRP$ |
own JJ |
business NN |
|
Smith:NNP | 0.28 | 1.03 | 0.75 | 1.20 | 1.16 | 1.07 | 1.84 | 1.24 | 1.20 |
was:VBD | 1.16 | 0.91 | 1.15 | 1.39 | 0.85 | 0.82 | 0.47 | 1.02 | 0.91 |
running:VBG | 1.16 | 0.46 | 1.15 | 1.37 | 0.91 | 1.32 | 0.54 | 1.02 | 0.86 |
his:PRP$ | 2.76 | 0.97 | 1.05 | 1.57 | 1.30 | 0.97 | 1.05 | 1.39 | 1.30 |
own:JJ | 1.13 | 0.96 | 1.05 | 1.06 | 0.88 | 0.96 | 1.05 | 0.74 | 0.88 |
business:NN | 1.20 | 0.72 | 0.50 | 1.25 | 0.95 | 0.77 | 1.58 | 0.99 | 0.28 |
two:CD | 1.24 | 0.93 | 1.30 | 0.85 | 1.07 | 1.28 | 1.43 | 0.93 | 1.28 |
years:NNS | 1.16 | 0.58 | 0.50 | 1.04 | 0.28 | 0.82 | 1.58 | 0.99 | 0.95 |
NO_WORD | 0.28 | 0.17 | 0.84 | 0.52 | 0.09 | 0.16 | 0.93 | 0.11 | 0.09 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.18 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 9.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "own" modifying "business"
-0.10 1.00 NullPunisher.functionWord : his
-1.00 1.00 NullPunisher.other : business
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : more_than
-1.00 1.00 NullPunisher.other : running
-1.00 1.00 NullPunisher.other : spent
-1.00 1.00 NullPunisher.other : own
-3.00 1.00 NullPunisher.entity : two
-1.00 1.00 NullPunisher.other : years
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '>2.0' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "spent" not aligned to anything
Hand-tuned score (dot product of above): -22.4075
Threshold: -9.4738
Txt: Smith was running his own business in two years.
Hyp: Smith ran his own business. (yes)
Smith NNP |
ran VBD |
his PRP$ |
own JJ |
business NN |
|
Smith:NNP | 0.28 | 1.07 | 1.84 | 1.24 | 1.20 |
was:VBD | 1.16 | 0.75 | 0.47 | 1.02 | 0.91 |
running:VBG | 1.16 | 0.65 | 0.54 | 1.02 | 0.86 |
his:PRP$ | 2.76 | 0.97 | 1.05 | 1.39 | 1.30 |
own:JJ | 1.13 | 0.89 | 1.05 | 0.74 | 0.88 |
business:NN | 1.20 | 0.77 | 1.58 | 0.99 | 0.28 |
two:CD | 1.24 | 1.16 | 1.43 | 0.93 | 1.28 |
years:NNS | 1.16 | 0.68 | 1.58 | 0.99 | 0.95 |
NO_WORD | 0.28 | 0.17 | 0.93 | 0.11 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.52 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.40 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "own" modifying "business"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "own" modifying "business" is dropped on aligned hypothesis word "business"
-0.10 1.00 NullPunisher.functionWord : his
-1.00 1.00 NullPunisher.other : own
Hand-tuned score (dot product of above): -1.4967
Threshold: -9.4738
Txt: In two years Smith owned a chain of businesses.
Hyp: Smith owned a chain of business for two years. (don't know)
Smith NNP |
owned VBD |
a DT |
chain NN |
business NN |
two CD |
years NNS |
|
two:CD | 1.24 | 1.30 | 0.89 | 1.29 | 1.28 | 0.85 | 1.07 |
years:NNS | 1.16 | 0.71 | 0.71 | 0.82 | 0.95 | 1.04 | 0.28 |
Smith:NNP | 0.28 | 1.07 | 0.96 | 1.12 | 1.20 | 1.20 | 1.16 |
owned:VBD | 1.16 | 1.32 | 0.76 | 0.80 | 0.71 | 1.39 | 0.80 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.00 | 1.18 | 1.00 |
chain:NN | 1.12 | 0.71 | 0.71 | 0.28 | 0.58 | 1.26 | 0.82 |
businesses:NNS | 1.20 | 0.75 | 0.71 | 0.63 | 1.10 | 1.25 | 0.96 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.04 | 0.52 | 0.05 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.21 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 3.00 Alignment.hypSpan
0.10 0.29 Alignment.txtSpan
-1.00 1.00 NullPunisher.other : owned
-1.00 1.00 NullPunisher.other : chain
-3.00 1.00 NullPunisher.entity : Smith
-0.10 1.00 NullPunisher.article : a
-2.00 1.00 RootEntailment.unalignedRoot : "owned" not aligned to anything
Hand-tuned score (dot product of above): -7.5397
Threshold: -9.4738
Txt: In two years Smith owned a chain of businesses.
Hyp: Smith owned a chain of business for more than two years. (don't know)
Smith NNP |
owned VBD |
a DT |
chain NN |
business NN |
more_than IN |
two CD |
years NNS |
|
two:CD | 1.24 | 1.30 | 0.89 | 1.29 | 1.28 | 1.30 | 0.85 | 1.07 |
years:NNS | 1.16 | 0.71 | 0.71 | 0.82 | 0.95 | 0.50 | 1.04 | 0.28 |
Smith:NNP | 0.28 | 1.07 | 0.96 | 1.12 | 1.20 | 0.75 | 1.20 | 1.16 |
owned:VBD | 1.16 | 1.32 | 0.76 | 0.80 | 0.71 | 1.15 | 1.39 | 0.80 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.00 | 1.05 | 1.18 | 1.00 |
chain:NN | 1.12 | 0.71 | 0.71 | 0.28 | 0.58 | 0.50 | 1.26 | 0.82 |
businesses:NNS | 1.20 | 0.75 | 0.71 | 0.63 | 1.10 | 0.50 | 1.25 | 0.96 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.04 | 0.84 | 0.52 | 0.05 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.29 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 NullPunisher.other : more_than
-1.00 1.00 NullPunisher.other : owned
-3.00 1.00 NullPunisher.entity : Smith
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : chain
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '>2.0' vs '2.0'
-2.00 1.00 RootEntailment.unalignedRoot : "owned" not aligned to anything
Hand-tuned score (dot product of above): -14.6366
Threshold: -9.4738
Txt: In two years Smith owned a chain of businesses.
Hyp: Smith owned a chain of business. (yes)
Smith NNP |
owned VBD |
a DT |
chain NN |
business NN |
|
two:CD | 1.24 | 1.30 | 0.89 | 1.29 | 1.28 |
years:NNS | 1.16 | 0.71 | 0.71 | 0.82 | 0.95 |
Smith:NNP | 0.28 | 1.07 | 0.96 | 1.12 | 1.20 |
owned:VBD | 1.16 | 1.32 | 0.76 | 0.80 | 0.71 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.00 |
chain:NN | 1.12 | 0.71 | 0.71 | 0.28 | 0.58 |
businesses:NNS | 1.20 | 0.75 | 0.71 | 0.63 | 1.10 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.04 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.38 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.20 Alignment.txtSpan
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : chain
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : owned
-2.00 1.00 RootEntailment.unalignedRoot : "owned" not aligned to anything
Hand-tuned score (dot product of above): -7.5171
Threshold: -9.4738
Txt: Smith lived in Birmingham for two years.
Hyp: Smith lived in Birmingham for a year. (yes)
Smith NNP |
lived VBD |
Birmingham NNP |
a DT |
year NN |
|
Smith:NNP | 0.28 | 1.07 | 0.95 | 0.96 | 1.17 |
lived:VBD | 1.16 | 1.32 | 1.16 | 0.76 | 0.84 |
Birmingham:NNP | 0.95 | 1.07 | 0.28 | 0.96 | 1.13 |
two:CD | 1.24 | 1.00 | 1.24 | 0.89 | 1.31 |
years:NNS | 1.16 | 0.56 | 1.12 | 0.71 | 0.62 |
NO_WORD | 0.28 | 0.17 | 0.07 | 0.82 | 0.05 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.25 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.20 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Birmingham" modifying "lived"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "two" modifying "years" is dropped on aligned hypothesis word "year"
-0.10 1.00 NullPunisher.article : a
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : lived
-3.00 1.00 NullPunisher.entity : Birmingham
-2.00 1.00 RootEntailment.unalignedRoot : "lived" not aligned to anything
Hand-tuned score (dot product of above): -10.1892
Threshold: -9.4738
Txt: Smith lived in Birmingham for two years.
Hyp: Smith lived in Birmingham for exactly a year. (don't know)
Smith NNP |
lived VBD |
Birmingham NNP |
for IN |
exactly RB |
a DT |
year NN |
|
Smith:NNP | 0.28 | 1.07 | 0.95 | 0.75 | 1.56 | 0.96 | 1.17 |
lived:VBD | 1.16 | 1.32 | 1.16 | 1.15 | 1.03 | 0.76 | 0.84 |
Birmingham:NNP | 0.95 | 1.07 | 0.28 | 0.75 | 1.56 | 0.96 | 1.13 |
two:CD | 1.24 | 1.00 | 1.24 | 1.30 | 1.14 | 0.89 | 1.31 |
years:NNS | 1.16 | 0.56 | 1.12 | 0.50 | 1.27 | 0.71 | 0.62 |
NO_WORD | 0.28 | 0.17 | 0.07 | 1.16 | 0.30 | 0.82 | 0.25 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.39 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.14 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Birmingham" modifying "lived"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "two" modifying "years" is dropped on aligned hypothesis word "year"
-1.00 1.00 NullPunisher.other : lived
-1.00 1.00 NullPunisher.other : exactly
-3.00 1.00 NullPunisher.entity : Birmingham
-1.00 1.00 NullPunisher.other : for
-0.10 1.00 NullPunisher.article : a
-3.00 1.00 NullPunisher.entity : Smith
-2.00 1.00 RootEntailment.unalignedRoot : "lived" not aligned to anything
Hand-tuned score (dot product of above): -12.2106
Threshold: -9.4738
Txt: Smith lived in Birmingham for two years.
Hyp: Smith lived in Birmingham. (yes)
Smith NNP |
lived VBD |
Birmingham NNP |
|
Smith:NNP | 0.28 | 1.07 | 0.95 |
lived:VBD | 1.16 | 1.32 | 1.16 |
Birmingham:NNP | 0.95 | 1.07 | 0.28 |
two:CD | 1.24 | 1.00 | 1.24 |
years:NNS | 1.16 | 0.56 | 1.12 |
NO_WORD | 0.28 | 0.17 | 0.07 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 -0.06 Alignment.score
1.00 0.16 Alignment.isGood
-1.00 0.82 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Birmingham" modifying "lived"
-3.00 1.00 NullPunisher.entity : Smith
-3.00 1.00 NullPunisher.entity : Birmingham
-1.00 1.00 NullPunisher.other : lived
-2.00 1.00 RootEntailment.unalignedRoot : "lived" not aligned to anything
Hand-tuned score (dot product of above): -11.0154
Threshold: -9.4738
Txt: Smith ran his own business for two years.
Hyp: Smith ran his own business for a year. (yes)
Smith NNP |
ran VBD |
his PRP$ |
own JJ |
business NN |
a DT |
year NN |
|
Smith:NNP | 0.28 | 1.07 | 1.84 | 1.24 | 1.20 | 0.96 | 1.17 |
ran:VBD | 1.16 | 1.32 | 0.54 | 0.95 | 0.86 | 0.76 | 0.73 |
his:PRP$ | 2.76 | 0.97 | 1.05 | 1.39 | 1.30 | 1.11 | 1.30 |
own:JJ | 1.13 | 0.89 | 1.05 | 0.74 | 0.88 | 0.76 | 0.88 |
business:NN | 1.20 | 0.77 | 1.58 | 0.99 | 0.28 | 0.71 | 0.84 |
two:CD | 1.24 | 1.16 | 1.43 | 0.93 | 1.28 | 0.89 | 1.31 |
years:NNS | 1.16 | 0.68 | 1.58 | 0.99 | 0.95 | 0.71 | 0.62 |
NO_WORD | 0.28 | 0.17 | 0.93 | 0.11 | 0.09 | 0.82 | 0.05 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.35 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.14 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "own" modifying "business"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "two" modifying "years" is dropped on aligned hypothesis word "year"
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : business
-1.00 1.00 NullPunisher.other : own
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : ran
-0.10 1.00 NullPunisher.functionWord : his
-2.00 1.00 RootEntailment.unalignedRoot : "ran" not aligned to anything
Hand-tuned score (dot product of above): -9.3601
Threshold: -9.4738
Txt: Smith ran his own business for two years.
Hyp: Smith ran his own business. (yes)
Smith NNP |
ran VBD |
his PRP$ |
own JJ |
business NN |
|
Smith:NNP | 0.28 | 1.07 | 1.84 | 1.24 | 1.20 |
ran:VBD | 1.16 | 1.32 | 0.54 | 0.95 | 0.86 |
his:PRP$ | 2.76 | 0.97 | 1.05 | 1.39 | 1.30 |
own:JJ | 1.13 | 0.89 | 1.05 | 0.74 | 0.88 |
business:NN | 1.20 | 0.77 | 1.58 | 0.99 | 0.28 |
two:CD | 1.24 | 1.16 | 1.43 | 0.93 | 1.28 |
years:NNS | 1.16 | 0.68 | 1.58 | 0.99 | 0.95 |
NO_WORD | 0.28 | 0.17 | 0.93 | 0.11 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.21 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "own" modifying "business"
-0.10 1.00 NullPunisher.functionWord : his
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : own
-1.00 1.00 NullPunisher.other : ran
-1.00 1.00 NullPunisher.other : business
-2.00 1.00 RootEntailment.unalignedRoot : "ran" not aligned to anything
Hand-tuned score (dot product of above): -9.9708
Threshold: -9.4738
Txt: Smith wrote a report for two hours.
Hyp: Smith wrote a report for an hour. (yes)
Smith NNP |
wrote VBD |
a DT |
report NN |
an DT |
hour NN |
|
Smith:NNP | 0.28 | 1.03 | 0.96 | 1.14 | 0.96 | 1.12 |
wrote:VBD | 1.12 | 1.32 | 0.76 | 0.75 | 0.76 | 0.88 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.21 | 1.00 |
report:NN | 1.14 | 0.66 | 0.71 | 0.28 | 0.71 | 0.77 |
two:CD | 1.24 | 1.22 | 0.89 | 1.33 | 0.89 | 1.26 |
hours:NNS | 1.13 | 0.82 | 0.71 | 0.72 | 0.71 | 0.99 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.82 | 0.05 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.43 Alignment.score
1.00 0.24 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "two" modifying "hours" is dropped on aligned hypothesis word "hour"
-0.10 1.00 NullPunisher.article : an
-1.00 1.00 NullPunisher.other : report
-1.00 1.00 NullPunisher.other : wrote
-0.10 1.00 NullPunisher.article : a
-3.00 1.00 NullPunisher.entity : Smith
-2.00 1.00 RootEntailment.unalignedRoot : "wrote" not aligned to anything
Hand-tuned score (dot product of above): -7.1453
Threshold: -9.4738
Txt: Smith wrote a report for two hours.
Hyp: Smith wrote a report. (don't know)
Smith NNP |
wrote VBD |
a DT |
report NN |
|
Smith:NNP | 0.28 | 1.03 | 0.96 | 1.14 |
wrote:VBD | 1.12 | 1.32 | 0.76 | 0.75 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 |
report:NN | 1.14 | 0.66 | 0.71 | 0.28 |
two:CD | 1.24 | 1.22 | 0.89 | 1.33 |
hours:NNS | 1.13 | 0.82 | 0.71 | 0.72 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.20 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : wrote
-0.10 1.00 NullPunisher.article : a
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : report
-2.00 1.00 RootEntailment.unalignedRoot : "wrote" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738
Txt: Smith discovered new species for two years.
Hyp: Smith discovered new species. (yes)
Smith NNP |
discovered VBD |
new JJ |
species NNS |
|
Smith:NNP | 0.28 | 1.07 | 1.24 | 1.05 |
discovered:VBD | 1.16 | 1.32 | 1.02 | 0.76 |
new:JJ | 1.13 | 0.96 | 0.74 | 0.88 |
species:NNS | 1.05 | 0.67 | 0.99 | 0.28 |
two:CD | 1.24 | 1.01 | 0.98 | 1.33 |
years:NNS | 1.16 | 0.67 | 0.99 | 0.68 |
NO_WORD | 0.28 | 0.17 | 0.11 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.02 Alignment.score
1.00 0.17 Alignment.isGood
-1.00 0.81 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "new" modifying "species"
-1.00 1.00 NullPunisher.other : discovered
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : new
-1.00 1.00 NullPunisher.other : species
-2.00 1.00 RootEntailment.unalignedRoot : "discovered" not aligned to anything
Hand-tuned score (dot product of above): -10.0099
Threshold: -9.4738
Txt: In 1994 ITEL sent a progress report every month.
Hyp: ITEL sent a progress report in July 1994. (yes)
ITEL NNP |
sent VBD |
a DT |
progress NN |
report VB |
July NNP |
1994 CD |
|
1994:CD | 1.24 | 1.24 | 0.89 | 1.33 | 1.28 | 1.07 | 0.85 |
ITEL:NNP | 0.28 | 1.07 | 0.96 | 1.07 | 1.07 | 0.99 | 1.20 |
sent:VBD | 1.16 | 1.32 | 0.76 | 0.91 | 0.52 | 1.16 | 1.33 |
a:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.25 | 1.25 | 1.18 |
progress_report:NN | 1.07 | 0.65 | 0.71 | 0.54 | 1.23 | 1.07 | 1.27 |
every:DT | 1.23 | 1.23 | 1.31 | 1.00 | 1.19 | 1.23 | 1.18 |
month:NN | 1.07 | 0.62 | 0.71 | 0.90 | 0.50 | 0.96 | 1.14 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.28 | 0.36 | 0.07 | 0.52 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.15 Alignment.score
1.00 0.19 Alignment.isGood
-1.00 0.79 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.14 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1994" modifying "July"
-2.00 1.00 Date.hypDateIns : hypothesis date insertion: July
-3.00 1.00 NullPunisher.entity : ITEL
-3.00 1.00 NullPunisher.entity : July
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : progress
-3.00 1.00 NullPunisher.entity : 1994
-1.00 1.00 NullPunisher.other : sent
-6.00 1.00 Numeric.mismatch : DATE mismatch: '07/01/1994' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "sent" not aligned to anything
Hand-tuned score (dot product of above): -23.0262
Threshold: -9.4738
Txt: Smith had left the house at a quarter past five. Then she took a taxi to the station.
Hyp: Smith left the house before she took a taxi to the station. (yes)
Smith NNP |
left VBD |
the DT |
house NN |
before IN |
she PRP |
took VBD |
a DT |
taxi NN |
the DT |
station NN |
|
Smith:NNP | 0.28 | 1.05 | 0.96 | 1.10 | 0.75 | 1.84 | 1.07 | 0.96 | 1.05 | 0.96 | 1.03 |
had:VBD | 1.16 | 0.84 | 0.76 | 0.91 | 1.15 | 0.54 | 0.41 | 0.76 | 0.87 | 0.76 | 0.91 |
left:VBN | 1.14 | 2.02 | 0.76 | 0.91 | 1.11 | 0.54 | 0.14 | 0.76 | 0.82 | 0.76 | 0.88 |
the:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.05 | 1.01 | 1.22 | 1.31 | 0.96 | 2.42 | 1.00 |
house:NN | 1.10 | 0.82 | 0.71 | 0.28 | 0.48 | 1.58 | 0.72 | 0.71 | 0.78 | 0.71 | 0.79 |
a:DT | 1.25 | 1.25 | 1.31 | 1.00 | 1.11 | 1.16 | 1.25 | 2.42 | 1.00 | 1.31 | 1.00 |
quarter:NN | 1.15 | 0.55 | 0.71 | 0.95 | 0.48 | 1.58 | 0.65 | 0.71 | 0.83 | 0.71 | 0.87 |
five:CD | 1.21 | 1.26 | 0.86 | 1.30 | 1.26 | 1.29 | 1.04 | 0.89 | 1.28 | 0.86 | 1.33 |
Then:RB | 1.47 | 0.91 | 0.94 | 1.21 | 1.05 | 0.95 | 0.86 | 1.09 | 1.16 | 0.94 | 1.19 |
she:PRP | 2.76 | 1.21 | 1.20 | 1.30 | 1.05 | 1.05 | 0.97 | 1.11 | 1.30 | 1.20 | 1.30 |
took:VBD | 1.16 | 0.14 | 0.73 | 0.81 | 1.15 | 0.54 | 1.32 | 0.76 | 0.83 | 0.73 | 0.81 |
a:DT | 1.25 | 1.25 | 1.31 | 1.00 | 1.11 | 1.16 | 1.25 | 2.42 | 1.00 | 1.31 | 1.00 |
taxi:NN | 1.05 | 0.73 | 0.67 | 0.78 | 0.50 | 1.58 | 0.74 | 0.71 | 0.28 | 0.67 | 0.21 |
the:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.05 | 1.01 | 1.22 | 1.31 | 0.96 | 2.42 | 1.00 |
station:NN | 1.03 | 0.79 | 0.71 | 0.79 | 0.50 | 1.58 | 0.72 | 0.71 | 0.21 | 0.71 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.92 | 0.54 | 0.42 | 0.82 | 0.09 | 0.82 | 0.74 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.70 Alignment.score
1.00 0.29 Alignment.isGood
-1.00 0.68 Alignment.isBad
-0.10 9.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.18 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "station" modifying "took"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "five" modifying "left" is dropped on aligned hypothesis word "left"
-1.00 1.00 NullPunisher.other : taxi
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : house
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : station
-1.00 1.00 NullPunisher.other : before
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : she
-1.00 1.00 NullPunisher.other : took
Hand-tuned score (dot product of above): -7.1765
Threshold: -9.4738
Txt: ITEL always delivers reports late. In 1993 ITEL delivered reports.
Hyp: ITEL delivered reports late in 1993. (yes)
ITEL NNP |
delivered VBD |
reports NNS |
late RB |
1993 CD |
|
ITEL:NNP | 0.28 | 1.07 | 1.07 | 1.51 | 1.20 |
always:RB | 1.47 | 0.91 | 1.20 | 0.81 | 1.36 |
delivers:VBZ | 1.16 | 0.85 | 0.86 | 1.08 | 1.39 |
reports:NNS | 1.07 | 0.74 | 0.28 | 1.31 | 1.29 |
late:RB | 1.42 | 0.84 | 1.21 | 1.23 | 1.22 |
1993:CD | 1.24 | 1.09 | 1.33 | 1.16 | 0.85 |
ITEL:NNP | 0.28 | 1.07 | 1.07 | 1.51 | 1.20 |
delivered:VBD | 1.16 | 1.32 | 0.83 | 1.04 | 1.18 |
reports:NNS | 1.07 | 0.74 | 0.28 | 1.31 | 1.29 |
NO_WORD | 0.28 | 0.17 | 0.09 | 0.04 | 0.57 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.25 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 3.00 Alignment.hypSpan
0.10 0.60 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1993" modifying "delivered"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "always" modifying "delivers" is dropped on aligned hypothesis word "delivered"
-2.00 1.00 Date.hypDateIns : hypothesis date insertion: 1993
-1.00 1.00 NullPunisher.other : late
-3.00 1.00 NullPunisher.entity : 1993
-6.00 1.00 Numeric.mismatch : DATE mismatch: '01/01/1993' vs ''
Hand-tuned score (dot product of above): -12.6526
Threshold: -9.4738
Txt: ITEL never delivers reports late. In 1993 ITEL delivered reports.
Hyp: ITEL delivered reports late in 1993. (don't know)
ITEL NNP |
delivered VBD |
reports NNS |
late RB |
1993 CD |
|
ITEL:NNP | 0.28 | 1.07 | 1.07 | 1.51 | 1.20 |
never:RB | 1.45 | 0.85 | 1.18 | 0.83 | 1.36 |
delivers:VBZ | 1.16 | 0.85 | 0.86 | 1.08 | 1.39 |
reports:NNS | 1.07 | 0.74 | 0.28 | 1.31 | 1.29 |
late:RB | 1.42 | 0.84 | 1.21 | 1.23 | 1.22 |
1993:CD | 1.24 | 1.09 | 1.33 | 1.16 | 0.85 |
ITEL:NNP | 0.28 | 1.07 | 1.07 | 1.51 | 1.20 |
delivered:VBD | 1.16 | 1.32 | 0.83 | 1.04 | 1.18 |
reports:NNS | 1.07 | 0.74 | 0.28 | 1.31 | 1.29 |
NO_WORD | 0.28 | 0.17 | 0.09 | 0.04 | 0.57 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.25 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 3.00 Alignment.hypSpan
0.10 0.60 Alignment.txtSpan
-2.00 1.00 Date.hypDateIns : hypothesis date insertion: 1993
0.00 1.00 NegPolarity.txtNegWord : "delivers": has child with relation "neg"
0.00 1.00 NegPolarity.txtNegRoot : "delivers": has child with relation "neg"
-3.00 1.00 NullPunisher.entity : 1993
-1.00 1.00 NullPunisher.other : late
-6.00 1.00 Numeric.mismatch : DATE mismatch: '01/01/1993' vs ''
Hand-tuned score (dot product of above): -12.1526
Threshold: -9.4738
Txt: Smith arrived in Paris on the 5th of May, 1995. Today is the 15th of May, 1995. She is still in Paris.
Hyp: Smith was in Paris on the 7th of May, 1995. (yes)
Smith NNP |
was VBD |
Paris NNP |
the DT |
7th NN |
of IN |
May NNP |
1995 CD |
|
Smith:NNP | 0.28 | 1.07 | 0.98 | 0.96 | 0.98 | 0.67 | 1.03 | 1.20 |
arrived:VBD | 1.16 | 0.89 | 1.13 | 0.76 | 1.14 | 1.41 | 1.16 | 1.39 |
Paris:NNP | 0.98 | 1.02 | 0.28 | 0.96 | 1.03 | 0.67 | 1.02 | 1.20 |
the:DT | 1.25 | 1.25 | 1.25 | 2.42 | 1.18 | 1.37 | 1.25 | 1.18 |
5th:JJ | 0.99 | 1.22 | 1.05 | 0.94 | 0.17 | 1.05 | 0.88 | 0.87 |
of:IN | 1.48 | 1.50 | 1.48 | 1.37 | 1.31 | 1.59 | 1.31 | 1.05 |
May:NNP | 1.03 | 1.00 | 1.02 | 0.96 | 0.72 | 0.50 | 0.28 | 1.03 |
1995:CD | 1.24 | 1.30 | 1.24 | 0.89 | 1.07 | 1.05 | 1.07 | 0.85 |
Today:NNP | 1.14 | 0.82 | 1.13 | 0.71 | 1.13 | 0.75 | 1.07 | 1.29 |
is:VBZ | 1.16 | 0.05 | 1.13 | 0.76 | 1.16 | 1.41 | 1.16 | 1.39 |
the:DT | 1.25 | 1.25 | 1.25 | 2.42 | 1.18 | 1.37 | 1.25 | 1.18 |
15th:NN | 0.96 | 1.07 | 1.03 | 0.93 | 0.03 | 0.50 | 0.72 | 0.95 |
of:IN | 1.48 | 1.50 | 1.48 | 1.37 | 1.31 | 1.59 | 1.31 | 1.05 |
May:NNP | 1.03 | 1.00 | 1.02 | 0.96 | 0.72 | 0.50 | 0.28 | 1.03 |
1995:CD | 1.24 | 1.30 | 1.24 | 0.89 | 1.07 | 1.05 | 1.07 | 0.85 |
She:PRP | 1.55 | 0.97 | 1.55 | 0.96 | 1.55 | 1.30 | 1.55 | 1.46 |
is:VBZ | 1.16 | 0.05 | 1.13 | 0.76 | 1.16 | 1.41 | 1.16 | 1.39 |
still:RB | 1.38 | 0.91 | 1.47 | 1.09 | 1.47 | 1.30 | 1.47 | 1.36 |
Paris:NNP | 0.98 | 1.02 | 0.28 | 0.96 | 1.03 | 0.67 | 1.02 | 1.20 |
NO_WORD | 0.28 | 0.17 | 0.07 | 0.82 | 0.02 | 1.16 | 0.25 | 0.26 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.23 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.12 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Paris" modifying "was"
-3.00 1.00 Date.dateHeadMismatch : 7th vs. 15th
-3.00 1.00 NullPunisher.entity : 1995
-3.00 1.00 NullPunisher.entity : Smith
-3.00 1.00 NullPunisher.entity : May
-3.00 1.00 NullPunisher.entity : Paris
-0.10 1.00 NullPunisher.article : the
-3.00 1.00 NullPunisher.entity : of
-0.05 1.00 NullPunisher.aux : was
-2.00 1.00 RootEntailment.unalignedRoot : "was" not aligned to anything
Hand-tuned score (dot product of above): -22.0787
Threshold: -9.4738
Txt: When Smith arrived in Katmandu she had been travelling for three days.
Hyp: Smith had been travelling the day before she arrived in Katmandu. (yes)
Smith NNP |
had VBD |
been VBN |
traveling VBG |
the DT |
day NN |
before IN |
she PRP |
arrived VBD |
Katmandu NNP |
|
When:WRB | 1.25 | 1.22 | 1.14 | 1.25 | 1.21 | 1.25 | 1.05 | 1.07 | 1.25 | 1.25 |
Smith:NNP | 0.28 | 1.07 | 1.07 | 1.07 | 0.96 | 1.08 | 0.75 | 1.84 | 1.07 | 0.99 |
arrived:VBD | 1.16 | 0.84 | 0.89 | 0.25 | 0.76 | 0.79 | 1.14 | 0.54 | 1.32 | 1.15 |
Katmandu:NNP | 0.99 | 1.07 | 1.07 | 1.06 | 0.96 | 0.99 | 0.75 | 1.84 | 1.06 | 0.28 |
she:PRP | 2.76 | 0.97 | 0.94 | 0.97 | 1.20 | 1.55 | 1.05 | 1.05 | 0.97 | 1.55 |
had:VBD | 1.16 | 1.32 | 0.98 | 0.94 | 0.76 | 1.09 | 1.15 | 0.54 | 0.84 | 1.16 |
been:VBN | 1.16 | 0.98 | 1.32 | 1.00 | 0.73 | 1.16 | 1.11 | 0.51 | 0.89 | 1.16 |
traveling:VBG | 1.16 | 0.94 | 1.00 | 1.32 | 0.76 | 0.98 | 1.15 | 0.54 | 0.25 | 1.15 |
three:CD | 1.24 | 1.30 | 1.28 | 1.30 | 0.78 | 1.15 | 1.28 | 1.26 | 1.14 | 1.24 |
days:NNS | 1.16 | 0.78 | 0.82 | 0.66 | 0.71 | 0.18 | 0.50 | 1.58 | 0.66 | 1.07 |
NO_WORD | 0.28 | 1.30 | 1.30 | 0.17 | 0.82 | 0.09 | 0.92 | 0.54 | 0.42 | 0.07 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.53 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 9.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.10 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Katmandu" modifying "arrived"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "three" modifying "days" is dropped on aligned hypothesis word "day"
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : before
-1.00 1.00 NullPunisher.other : arrived
-0.10 1.00 NullPunisher.article : the
-0.05 1.00 NullPunisher.aux : been
-1.00 1.00 NullPunisher.other : traveling
-0.05 1.00 NullPunisher.aux : had
-1.00 1.00 NullPunisher.other : she
-3.00 1.00 NullPunisher.entity : Katmandu
-6.00 1.00 Numeric.mismatch : TIME mismatch: 'D' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "traveling" not aligned to anything
Hand-tuned score (dot product of above): -19.4185
Threshold: -9.4738
Txt: Jones graduated in March and has been employed ever since. Jones has been unemployed in the past.
Hyp: Jones was unemployed at some time before he graduated. (yes)
Jones NNP |
was VBD |
unemployed VBN |
some DT |
time NN |
before IN |
he PRP |
graduated VBD |
|
Jones:NNP | 0.28 | 1.07 | 1.07 | 0.89 | 1.14 | 0.73 | 1.84 | 1.07 |
graduated:VBD | 1.16 | 0.68 | 0.50 | 0.76 | 0.85 | 1.15 | 0.54 | 1.32 |
March:NNP | 1.05 | 1.07 | 1.07 | 0.96 | 1.15 | 0.75 | 1.84 | 1.07 |
has:VBZ | 1.16 | 0.84 | 0.56 | 0.76 | 0.91 | 1.15 | 0.50 | 0.65 |
been:VBN | 1.14 | 0.10 | 0.56 | 0.76 | 0.91 | 1.11 | 0.54 | 0.68 |
employed:VBN | 1.15 | 0.93 | 2.04 | 0.76 | 0.91 | 1.12 | 0.54 | 0.59 |
ever:RB | 1.45 | 0.91 | 0.91 | 1.09 | 1.21 | 1.00 | 1.05 | 0.91 |
since:RB | 1.43 | 0.91 | 0.91 | 1.02 | 1.14 | 1.03 | 1.05 | 0.91 |
Jones:NNP | 0.28 | 1.07 | 1.07 | 0.89 | 1.14 | 0.73 | 1.84 | 1.07 |
has:VBZ | 1.16 | 0.84 | 0.56 | 0.76 | 0.91 | 1.15 | 0.50 | 0.65 |
been:VBN | 1.14 | 0.10 | 0.56 | 0.76 | 0.91 | 1.11 | 0.54 | 0.68 |
unemployed:VBN | 1.16 | 0.56 | 1.32 | 0.76 | 0.89 | 1.13 | 0.54 | 0.50 |
the:DT | 1.25 | 1.25 | 1.25 | 1.28 | 0.90 | 1.05 | 0.80 | 1.25 |
past:NN | 1.17 | 0.72 | 0.82 | 0.71 | 0.61 | 0.50 | 1.58 | 0.77 |
NO_WORD | 0.15 | 1.57 | 0.17 | 0.82 | 0.20 | 0.92 | 0.54 | 0.42 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.81 Alignment.score
1.00 0.32 Alignment.isGood
-1.00 0.66 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 3.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "time" modifying "unemployed"
-8.00 1.00 Antonym.samePol : Antonyms found: "unemployed" & "employed", text and hypothesis sentences are of the same polarity.
-1.00 1.00 NullPunisher.other : some
-1.00 1.00 NullPunisher.other : before
-1.00 1.00 NullPunisher.other : graduated
-1.00 1.00 NullPunisher.other : time
-1.00 1.00 NullPunisher.other : he
Hand-tuned score (dot product of above): -13.7044
Threshold: -9.4738
Txt: Every representative has read this report. No two representatives have read it at the same time. No representative took less than half a day to read the report. There are sixteen representatives.
Hyp: It took the representatives more than a week to read the report. (yes)
It PRP |
took VBD |
the DT |
representatives NNS |
more_than RBR |
a DT |
week NN |
to TO |
read VB |
the DT |
report NN |
|
Every:DT | 1.16 | 1.25 | 1.31 | 1.00 | 1.45 | 1.31 | 0.97 | 1.31 | 1.23 | 1.31 | 0.94 |
representative:NN | 1.58 | 0.82 | 0.71 | 1.36 | 1.30 | 0.71 | 0.89 | 0.71 | 0.82 | 0.71 | 0.75 |
has:VBZ | 0.54 | 0.41 | 0.76 | 0.91 | 1.11 | 0.76 | 0.91 | 0.76 | 0.70 | 0.76 | 0.91 |
read:VBN | 0.54 | 0.23 | 0.76 | 0.91 | 1.10 | 0.76 | 0.69 | 0.76 | 2.02 | 0.76 | 0.66 |
this:DT | 1.16 | 1.20 | 1.21 | 1.00 | 1.45 | 1.31 | 1.00 | 1.31 | 1.25 | 1.21 | 1.00 |
report:NN | 1.58 | 0.82 | 0.71 | 0.78 | 1.29 | 0.71 | 0.53 | 0.71 | 0.57 | 0.71 | 0.28 |
No:DT | 1.11 | 1.25 | 1.31 | 1.00 | 1.45 | 1.32 | 1.00 | 1.20 | 1.25 | 1.31 | 1.00 |
two:CD | 1.32 | 0.84 | 0.82 | 1.21 | 1.30 | 0.89 | 1.15 | 0.52 | 1.27 | 0.82 | 1.33 |
representatives:NNS | 1.58 | 0.82 | 0.71 | 0.28 | 1.31 | 0.71 | 0.71 | 0.71 | 0.82 | 0.71 | 0.78 |
have:VBP | 0.54 | 0.41 | 0.73 | 0.91 | 1.11 | 0.76 | 0.91 | 0.76 | 0.73 | 0.73 | 0.91 |
read:VBN | 0.54 | 0.23 | 0.76 | 0.91 | 1.10 | 0.76 | 0.69 | 0.76 | 2.02 | 0.76 | 0.66 |
it:PRP | 0.73 | 0.97 | 1.11 | 1.30 | 1.05 | 1.10 | 1.30 | 1.11 | 0.97 | 1.11 | 1.30 |
the:DT | 1.16 | 1.22 | 2.42 | 1.00 | 1.45 | 1.31 | 0.96 | 1.26 | 1.25 | 2.42 | 1.00 |
same:JJ | 1.05 | 0.96 | 0.73 | 0.88 | 1.01 | 0.76 | 0.88 | 0.76 | 0.96 | 0.73 | 0.88 |
time:NN | 1.58 | 0.35 | 0.61 | 0.98 | 1.31 | 0.71 | 0.82 | 0.71 | 0.66 | 0.61 | 0.91 |
No:DT | 1.11 | 1.25 | 1.31 | 1.00 | 1.45 | 1.32 | 1.00 | 1.20 | 1.25 | 1.31 | 1.00 |
representative:NN | 1.58 | 0.82 | 0.71 | 1.36 | 1.30 | 0.71 | 0.89 | 0.71 | 0.82 | 0.71 | 0.75 |
took:VBD | 0.54 | 1.32 | 0.73 | 0.91 | 1.11 | 0.76 | 0.76 | 0.68 | 0.23 | 0.73 | 0.91 |
less:JJR | 1.05 | 0.96 | 0.76 | 0.88 | 1.01 | 0.76 | 0.82 | 0.76 | 0.90 | 0.76 | 0.88 |
than:IN | 1.57 | 1.18 | 0.95 | 1.31 | 0.06 | 1.05 | 1.31 | 1.05 | 1.18 | 0.95 | 1.31 |
half:PDT | 1.15 | 0.87 | 1.31 | 1.00 | 1.45 | 1.21 | 1.00 | 1.31 | 1.25 | 1.31 | 1.00 |
a:DT | 1.15 | 1.25 | 1.31 | 1.00 | 1.45 | 2.42 | 1.00 | 1.31 | 1.25 | 1.31 | 1.00 |
day:NN | 1.84 | 0.78 | 0.96 | 1.25 | 1.56 | 0.96 | 0.69 | 0.96 | 0.76 | 0.96 | 1.05 |
to:TO | 1.16 | 1.18 | 1.26 | 1.00 | 1.45 | 1.31 | 1.00 | 2.42 | 1.25 | 1.26 | 1.00 |
read:VB | 0.54 | 0.23 | 0.76 | 0.91 | 1.10 | 0.76 | 0.69 | 0.76 | 1.32 | 0.76 | 0.66 |
the:DT | 1.16 | 1.22 | 2.42 | 1.00 | 1.45 | 1.31 | 0.96 | 1.26 | 1.25 | 2.42 | 1.00 |
report:NN | 1.58 | 0.82 | 0.71 | 0.78 | 1.29 | 0.71 | 0.53 | 0.71 | 0.57 | 0.71 | 0.28 |
There:EX | 1.16 | 1.23 | 1.20 | 1.00 | 1.45 | 1.31 | 0.97 | 1.31 | 1.23 | 1.20 | 0.98 |
are:VBP | 0.54 | 0.78 | 0.68 | 0.91 | 1.11 | 0.76 | 0.87 | 0.76 | 0.74 | 0.68 | 0.91 |
sixteen:NN | 1.58 | 0.67 | 0.71 | 0.73 | 1.28 | 0.71 | 0.67 | 0.71 | 0.75 | 0.71 | 0.82 |
representatives:NNS | 1.58 | 0.82 | 0.71 | 0.28 | 1.31 | 0.71 | 0.71 | 0.71 | 0.82 | 0.71 | 0.78 |
NO_WORD | 0.54 | 0.17 | 0.82 | 0.09 | 0.04 | 0.82 | 0.09 | 1.55 | 0.34 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.76 Alignment.score
1.00 0.30 Alignment.isGood
-1.00 0.67 Alignment.isBad
-0.10 8.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.18 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "week" modifying "took"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "time" modifying "read" is dropped on aligned hypothesis word "read"
-1.00 1.00 NullPunisher.other : more_than
-0.10 1.00 NullPunisher.article : the
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : report
-1.00 1.00 NullPunisher.other : week
-1.00 1.00 NullPunisher.other : took
-0.10 1.00 NullPunisher.functionWord : to
-0.10 1.00 NullPunisher.article : the
-2.00 1.00 Person.mismatch : person mismatch between It and it
1.00 1.00 Quant.contract : Valid quantifier weakening: replacing the quantifier "every" by a weaker quantifier "the" preserves truth.
-2.00 1.00 RootEntailment.unalignedRoot : "took" not aligned to anything
Hand-tuned score (dot product of above): -8.1890
Threshold: -9.4738
Txt: While Jones was updating the program, Mary came in and told him about the board meeting. She finished before he did.
Hyp: Mary's story lasted as long as Jones's updating the program. (don't know)
Mary NNP |
story NN |
lasted VBD |
as RB |
long RB |
as IN |
Jones NNP |
's VBZ |
updating VBG |
the DT |
program NN |
|
While:IN | 1.57 | 1.31 | 1.22 | 1.23 | 1.23 | 1.05 | 1.57 | 1.24 | 1.24 | 0.99 | 1.31 |
Jones:NNP | 0.77 | 1.13 | 1.05 | 1.56 | 1.49 | 0.75 | 0.28 | 1.07 | 1.07 | 0.96 | 1.20 |
was:VBD | 1.13 | 0.91 | 0.67 | 0.75 | 1.11 | 0.79 | 1.16 | 0.01 | 0.71 | 0.76 | 0.91 |
updating:VBG | 1.16 | 0.70 | 0.50 | 1.11 | 0.98 | 1.15 | 1.16 | 0.56 | 1.32 | 0.76 | 0.65 |
the:DT | 1.25 | 1.00 | 1.25 | 1.45 | 1.45 | 1.05 | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 |
program:NN | 1.16 | 0.88 | 0.82 | 1.31 | 1.31 | 0.50 | 1.20 | 0.76 | 0.56 | 0.71 | 0.28 |
Mary:NNP | 0.28 | 1.03 | 1.07 | 1.56 | 1.56 | 0.75 | 0.77 | 1.07 | 1.07 | 0.96 | 1.16 |
came_in:VBD | 1.16 | 0.86 | 0.39 | 1.11 | 0.81 | 1.15 | 1.16 | 0.55 | 0.48 | 0.76 | 0.91 |
told:VBD | 1.16 | 0.81 | 0.72 | 1.11 | 1.06 | 1.15 | 1.14 | 0.44 | 0.63 | 0.73 | 0.90 |
him:PRP | 1.55 | 1.30 | 0.97 | 1.05 | 1.05 | 1.05 | 1.55 | 0.97 | 0.97 | 1.11 | 1.30 |
the:DT | 1.25 | 1.00 | 1.25 | 1.45 | 1.45 | 1.05 | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 |
board_meeting:NN | 1.07 | 0.82 | 0.76 | 1.31 | 1.31 | 0.50 | 1.07 | 0.70 | 0.76 | 0.71 | 0.82 |
She:PRP | 2.76 | 1.60 | 0.95 | 1.35 | 1.05 | 1.35 | 1.55 | 0.97 | 0.97 | 0.96 | 1.30 |
finished:VBD | 1.16 | 0.91 | 0.42 | 1.11 | 1.11 | 1.15 | 1.15 | 0.42 | 0.58 | 0.76 | 0.73 |
before:IN | 1.57 | 1.25 | 1.24 | 1.23 | 1.23 | 1.05 | 1.55 | 1.24 | 1.24 | 1.05 | 1.29 |
he:PRP | 1.55 | 1.30 | 0.97 | 1.29 | 1.53 | 1.29 | 2.76 | 1.21 | 0.97 | 0.99 | 1.30 |
did:VBD | 1.16 | 0.91 | 0.64 | 1.11 | 0.98 | 1.15 | 1.16 | 0.48 | 0.65 | 0.76 | 0.91 |
NO_WORD | 0.12 | 0.28 | 0.17 | 0.04 | 0.04 | 0.92 | 0.28 | 1.30 | 0.31 | 0.82 | 0.09 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.30 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 11.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "long" modifying "lasted"
-0.05 1.00 NullPunisher.aux : 's
-0.10 1.00 NullPunisher.article : the
-0.10 1.00 NullPunisher.functionWord : as
-1.00 1.00 NullPunisher.other : updating
-1.00 1.00 NullPunisher.other : lasted
-1.00 1.00 NullPunisher.other : story
-1.00 1.00 NullPunisher.other : program
-3.00 1.00 NullPunisher.entity : Jones
-0.10 1.00 NullPunisher.functionWord : as
-1.00 1.00 NullPunisher.other : long
-3.00 1.00 NullPunisher.entity : Mary
-2.00 1.00 RootEntailment.unalignedRoot : "lasted" not aligned to anything
Hand-tuned score (dot product of above): -15.7013
Threshold: -9.4738
Txt: Before APCOM bought its present office building, it had been paying mortgage interest on the previous one for 8 years. Since APCOM bought its present office building it has been paying mortgage interest on it for more than 10 years.
Hyp: APCOM has been paying mortgage interest for a total of 15 years or more. (yes)
APCOM NNP |
has VBZ |
been VBN |
paying VBG |
mortgage NN |
interest NN |
a DT |
total NN |
15 CD |
years NNS |
or CC |
more JJR |
|
Before_APCOM:NNP | 0.34 | 1.07 | 1.07 | 1.07 | 1.05 | 1.07 | 0.96 | 1.07 | 1.20 | 1.07 | 0.96 | 1.24 |
bought:VBD | 1.16 | 0.85 | 0.91 | 0.56 | 0.76 | 0.87 | 0.76 | 0.76 | 1.35 | 0.81 | 0.76 | 1.02 |
its:PRP$ | 2.06 | 0.90 | 1.17 | 0.97 | 1.50 | 1.48 | 1.10 | 1.40 | 1.46 | 1.50 | 1.30 | 1.69 |
present:JJ | 1.13 | 0.96 | 0.90 | 0.91 | 0.86 | 0.77 | 0.76 | 0.83 | 1.11 | 0.75 | 0.76 | 0.93 |
office_building:NN | 1.07 | 0.82 | 0.82 | 0.82 | 0.61 | 0.73 | 0.71 | 0.82 | 1.26 | 0.70 | 0.71 | 0.99 |
it:PRP | 2.06 | 0.97 | 1.17 | 0.97 | 1.50 | 1.50 | 1.10 | 1.40 | 1.46 | 1.50 | 1.30 | 1.69 |
had:VBD | 1.16 | 0.17 | 0.98 | 0.87 | 0.91 | 0.91 | 0.76 | 0.91 | 1.39 | 0.91 | 0.76 | 1.02 |
been:VBN | 1.16 | 0.98 | 1.32 | 0.93 | 0.91 | 0.91 | 0.76 | 0.91 | 1.39 | 0.88 | 0.76 | 1.02 |
paying:VBG | 1.14 | 0.87 | 0.93 | 1.32 | 0.72 | 0.70 | 0.76 | 0.79 | 1.29 | 0.91 | 0.76 | 1.02 |
mortgage:NN | 1.07 | 0.82 | 0.82 | 0.63 | 0.28 | 0.46 | 0.71 | 0.83 | 1.29 | 0.87 | 0.71 | 0.91 |
interest:NN | 1.07 | 0.82 | 0.82 | 0.62 | 0.53 | 0.28 | 0.71 | 0.92 | 1.29 | 0.94 | 0.71 | 0.99 |
the:DT | 1.25 | 1.25 | 1.22 | 1.25 | 1.00 | 1.00 | 1.31 | 1.00 | 1.18 | 1.00 | 1.31 | 1.15 |
previous:JJ | 1.13 | 0.96 | 0.96 | 0.89 | 0.88 | 0.88 | 0.76 | 0.47 | 1.01 | 0.72 | 0.76 | 0.93 |
one:NN | 1.07 | 0.82 | 0.78 | 0.82 | 0.88 | 0.92 | 0.71 | 0.88 | 1.29 | 0.80 | 0.66 | 0.89 |
8:CD | 1.24 | 1.30 | 1.30 | 1.30 | 1.33 | 1.33 | 0.89 | 1.08 | 0.28 | 1.33 | 0.89 | 1.00 |
years:NNS | 1.07 | 0.76 | 0.79 | 0.82 | 0.87 | 0.94 | 0.71 | 0.80 | 1.29 | 0.28 | 0.71 | 0.96 |
Since:IN | 1.57 | 1.24 | 1.24 | 1.22 | 1.31 | 1.29 | 1.05 | 1.31 | 1.30 | 1.31 | 1.05 | 0.75 |
APCOM:NNP | 0.28 | 1.07 | 1.07 | 1.05 | 1.07 | 1.07 | 0.96 | 1.07 | 1.20 | 1.07 | 0.96 | 1.24 |
bought:VBD | 1.16 | 0.85 | 0.91 | 0.56 | 0.76 | 0.87 | 0.76 | 0.76 | 1.35 | 0.81 | 0.76 | 1.02 |
its:PRP$ | 2.76 | 0.90 | 0.97 | 0.97 | 1.30 | 1.28 | 1.10 | 1.30 | 1.46 | 1.30 | 1.10 | 1.39 |
present:JJ | 1.13 | 0.96 | 0.90 | 0.91 | 0.86 | 0.77 | 0.76 | 0.83 | 1.11 | 0.75 | 0.76 | 0.93 |
office_building:NN | 1.07 | 0.82 | 0.82 | 0.82 | 0.61 | 0.73 | 0.71 | 0.82 | 1.26 | 0.70 | 0.71 | 0.99 |
it:PRP | 2.76 | 0.97 | 0.97 | 0.97 | 1.30 | 1.30 | 1.10 | 1.30 | 1.46 | 1.30 | 1.10 | 1.39 |
has:VBZ | 1.16 | 1.32 | 0.98 | 0.87 | 0.91 | 0.91 | 0.76 | 0.91 | 1.39 | 0.85 | 0.76 | 1.02 |
been:VBN | 1.16 | 0.98 | 1.32 | 0.93 | 0.91 | 0.91 | 0.76 | 0.91 | 1.39 | 0.88 | 0.76 | 1.02 |
paying:VBG | 1.14 | 0.87 | 0.93 | 1.32 | 0.72 | 0.70 | 0.76 | 0.79 | 1.29 | 0.91 | 0.76 | 1.02 |
mortgage:NN | 1.07 | 0.82 | 0.82 | 0.63 | 0.28 | 0.46 | 0.71 | 0.83 | 1.29 | 0.87 | 0.71 | 0.91 |
interest:NN | 1.07 | 0.82 | 0.82 | 0.62 | 0.53 | 0.28 | 0.71 | 0.92 | 1.29 | 0.94 | 0.71 | 0.99 |
on:IN | 1.57 | 1.24 | 1.24 | 1.24 | 1.31 | 1.31 | 1.05 | 1.31 | 1.30 | 1.31 | 0.94 | 0.77 |
it:PRP | 2.76 | 0.97 | 0.97 | 0.97 | 1.30 | 1.30 | 1.10 | 1.30 | 1.46 | 1.30 | 1.10 | 1.39 |
more_than:IN | 1.57 | 1.24 | 1.24 | 1.24 | 1.22 | 1.30 | 1.05 | 1.28 | 1.30 | 1.31 | 1.05 | 0.40 |
10:CD | 1.24 | 1.30 | 1.30 | 1.22 | 1.33 | 1.33 | 0.89 | 1.06 | 0.58 | 1.29 | 0.89 | 1.00 |
years:NNS | 1.07 | 0.76 | 0.79 | 0.82 | 0.87 | 0.94 | 0.71 | 0.80 | 1.29 | 0.28 | 0.71 | 0.96 |
NO_WORD | 0.28 | 1.30 | 1.30 | 0.17 | 0.05 | 0.09 | 0.82 | 0.05 | 0.52 | 0.04 | 0.56 | 0.03 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.47 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.73 Alignment.isBad
-0.10 9.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.17 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "years" modifying "total"
-0.05 1.00 NullPunisher.aux : has
-1.00 1.00 NullPunisher.other : interest
-1.00 1.00 NullPunisher.other : years
-0.05 1.00 NullPunisher.aux : been
-1.00 1.00 NullPunisher.other : total
-1.00 1.00 NullPunisher.other : or
-1.00 1.00 NullPunisher.other : mortgage
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : paying
-2.00 1.00 RootEntailment.unalignedRoot : "paying" not aligned to anything
Hand-tuned score (dot product of above): -10.0022
Threshold: -9.4738
Txt: When Jones got his job at the CIA, he knew that he would never be allowed to write his memoirs.
Hyp: It is the case that Jones is not and will never be allowed to write his memoirs. (yes)
It PRP |
is VBZ |
the DT |
case NN |
that IN |
Jones NNP |
is VBZ |
not RB |
will MD |
never RB |
be VB |
allowed VBN |
to TO |
write VB |
his PRP$ |
memoirs NNS |
|
When:WRB | 1.16 | 1.25 | 1.21 | 1.00 | 0.99 | 1.23 | 1.25 | 1.45 | 1.25 | 1.43 | 1.25 | 1.25 | 1.31 | 1.23 | 1.13 | 1.00 |
Jones:NNP | 1.84 | 1.07 | 0.96 | 1.16 | 0.75 | 0.28 | 1.07 | 1.56 | 0.96 | 1.52 | 1.07 | 1.04 | 0.96 | 1.07 | 1.84 | 1.05 |
got:VBD | 0.50 | 0.92 | 0.76 | 0.91 | 1.12 | 1.16 | 0.92 | 0.97 | 1.06 | 1.11 | 0.93 | 0.77 | 0.71 | 0.69 | 0.54 | 0.91 |
his:PRP$ | 0.67 | 0.85 | 1.35 | 1.51 | 1.01 | 2.76 | 0.85 | 1.29 | 1.08 | 1.29 | 1.21 | 0.97 | 1.35 | 0.97 | 1.05 | 1.30 |
job:NN | 1.58 | 0.82 | 0.71 | 0.91 | 0.50 | 1.16 | 0.82 | 1.23 | 0.71 | 1.31 | 0.82 | 0.82 | 0.66 | 0.75 | 1.58 | 0.83 |
the:DT | 1.16 | 1.25 | 2.42 | 0.96 | 0.95 | 1.25 | 1.25 | 1.45 | 1.31 | 1.45 | 1.21 | 1.25 | 1.26 | 1.25 | 1.16 | 1.00 |
CIA:NNP | 1.79 | 1.03 | 0.96 | 1.09 | 0.72 | 0.99 | 1.03 | 1.56 | 0.93 | 1.56 | 1.07 | 1.07 | 0.96 | 1.07 | 1.76 | 1.06 |
he:PRP | 0.71 | 1.21 | 0.99 | 1.54 | 1.05 | 2.76 | 1.21 | 1.29 | 1.11 | 1.29 | 1.10 | 0.97 | 1.35 | 0.97 | 0.82 | 1.30 |
knew:VBD | 0.54 | 0.84 | 0.73 | 0.78 | 1.15 | 1.09 | 0.84 | 1.08 | 0.94 | 1.09 | 0.84 | 0.69 | 0.76 | 0.43 | 0.54 | 0.74 |
that:IN | 1.57 | 1.24 | 0.95 | 1.31 | 1.59 | 1.57 | 1.24 | 1.20 | 1.05 | 1.23 | 1.24 | 1.24 | 1.05 | 1.21 | 1.54 | 1.31 |
he:PRP | 0.71 | 1.21 | 0.99 | 1.54 | 1.05 | 2.76 | 1.21 | 1.29 | 1.11 | 1.29 | 1.10 | 0.97 | 1.35 | 0.97 | 0.82 | 1.30 |
would:MD | 1.16 | 1.25 | 1.31 | 1.00 | 1.05 | 1.21 | 1.25 | 1.45 | 1.23 | 1.45 | 1.32 | 1.22 | 1.31 | 1.21 | 1.16 | 1.00 |
never:RB | 1.05 | 0.91 | 1.09 | 1.19 | 1.05 | 1.43 | 0.91 | 0.86 | 1.09 | 1.23 | 0.91 | 0.91 | 1.20 | 0.91 | 1.05 | 1.18 |
be:VB | 0.54 | 0.10 | 0.71 | 0.91 | 1.15 | 1.16 | 0.10 | 1.11 | 1.11 | 1.11 | 1.32 | 0.84 | 0.76 | 0.84 | 0.54 | 0.91 |
allowed:VBN | 0.54 | 0.84 | 0.76 | 0.77 | 1.15 | 1.13 | 0.84 | 1.11 | 0.94 | 1.11 | 0.84 | 1.32 | 0.76 | 0.69 | 0.54 | 0.87 |
to:TO | 1.16 | 1.25 | 1.26 | 1.00 | 1.05 | 1.25 | 1.25 | 1.41 | 1.31 | 1.56 | 1.25 | 1.25 | 2.42 | 1.25 | 1.16 | 1.00 |
write:VB | 0.51 | 0.84 | 0.76 | 0.88 | 1.13 | 1.16 | 0.84 | 1.11 | 0.86 | 1.11 | 0.84 | 0.69 | 0.76 | 1.32 | 0.54 | 0.78 |
his:PRP$ | 0.67 | 0.85 | 1.35 | 1.51 | 1.01 | 2.76 | 0.85 | 1.29 | 1.08 | 1.29 | 1.21 | 0.97 | 1.35 | 0.97 | 1.05 | 1.30 |
memoirs:NNS | 1.58 | 0.82 | 0.71 | 0.83 | 0.50 | 1.05 | 0.82 | 1.31 | 0.71 | 1.27 | 0.82 | 0.78 | 0.71 | 0.69 | 1.58 | 0.28 |
NO_WORD | 0.54 | 1.43 | 0.82 | 0.12 | 0.99 | 0.15 | 0.31 | 0.24 | 1.55 | 0.24 | 1.57 | 0.06 | 1.55 | 0.09 | 0.93 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.65 Alignment.score
1.00 0.28 Alignment.isGood
-1.00 0.69 Alignment.isBad
-0.10 16.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
0.00 1.00 NegPolarity.hypNegWord : "is": has child with relation "neg"
0.00 1.00 NegPolarity.hypNegWord : "never": is in NegPolarityMarkers list
0.00 1.00 NegPolarity.hypNegWord : "allowed": has child with relation "neg"
-1.00 1.00 NullPunisher.other : It
-1.00 1.00 NullPunisher.other : not
-0.10 1.00 NullPunisher.functionWord : to
-3.00 1.00 NullPunisher.entity : Jones
-0.05 1.00 NullPunisher.aux : will
-1.00 1.00 NullPunisher.other : memoirs
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : case
-1.00 1.00 NullPunisher.other : never
-0.05 1.00 NullPunisher.aux : is
-0.05 1.00 NullPunisher.aux : is
-0.10 1.00 NullPunisher.functionWord : his
-0.05 1.00 NullPunisher.aux : be
-0.10 1.00 NullPunisher.functionWord : that
-1.00 1.00 NullPunisher.other : write
-1.00 1.00 NullPunisher.other : allowed
-2.00 1.00 RootEntailment.unalignedRoot : "case" not aligned to anything
Hand-tuned score (dot product of above): -13.9624
Threshold: -9.4738
Txt: Smith has been to Florence twice in the past. Smith will go to Florence twice in the coming year.
Hyp: Two years from now Smith will have been to Florence at least four times. (yes)
Two CD |
years NNS |
now JJ |
Smith NNP |
will MD |
have VB |
been VBN |
Florence NNP |
at IN |
least JJS |
four CD |
times NNS |
|
Smith:NNP | 1.20 | 1.16 | 1.24 | 0.28 | 0.94 | 1.07 | 1.07 | 0.96 | 0.75 | 1.24 | 1.20 | 1.13 |
has:VBZ | 1.39 | 0.85 | 1.02 | 1.16 | 1.05 | 0.19 | 0.98 | 1.16 | 1.11 | 0.96 | 1.39 | 0.91 |
been:VBN | 1.39 | 0.88 | 1.02 | 1.16 | 1.11 | 0.98 | 1.32 | 1.16 | 1.15 | 0.99 | 1.39 | 0.88 |
Florence:NNP | 1.20 | 1.11 | 1.24 | 0.96 | 0.96 | 1.07 | 1.07 | 0.28 | 0.75 | 1.23 | 1.20 | 1.09 |
twice:RB | 1.30 | 1.08 | 1.05 | 1.43 | 1.02 | 0.89 | 0.91 | 1.45 | 1.05 | 1.05 | 0.97 | 0.84 |
the:DT | 1.11 | 1.00 | 1.18 | 1.25 | 1.31 | 1.22 | 1.22 | 1.25 | 1.38 | 1.18 | 1.18 | 0.94 |
past:NN | 1.29 | 0.46 | 0.99 | 1.13 | 0.71 | 0.76 | 0.82 | 1.11 | 0.42 | 0.25 | 0.55 | 0.70 |
Smith:NNP | 1.20 | 1.16 | 1.24 | 0.28 | 0.94 | 1.07 | 1.07 | 0.96 | 0.75 | 1.24 | 1.20 | 1.13 |
will:MD | 1.18 | 1.00 | 1.18 | 1.23 | 2.42 | 1.55 | 1.60 | 1.25 | 1.05 | 1.18 | 1.18 | 0.97 |
go:VB | 1.35 | 0.91 | 0.97 | 1.16 | 1.07 | 0.94 | 0.92 | 1.16 | 1.15 | 1.02 | 1.39 | 0.91 |
Florence:NNP | 1.20 | 1.11 | 1.24 | 0.96 | 0.96 | 1.07 | 1.07 | 0.28 | 0.75 | 1.23 | 1.20 | 1.09 |
twice:RB | 1.30 | 1.08 | 1.05 | 1.43 | 1.02 | 0.89 | 0.91 | 1.45 | 1.05 | 1.05 | 0.97 | 0.84 |
in:IN | 1.30 | 1.31 | 0.77 | 1.57 | 1.05 | 1.24 | 1.24 | 1.57 | 2.26 | 0.77 | 1.30 | 1.31 |
the:DT | 1.11 | 1.00 | 1.18 | 1.25 | 1.31 | 1.22 | 1.22 | 1.25 | 1.38 | 1.18 | 1.18 | 0.94 |
coming:JJ | 1.13 | 0.88 | 0.93 | 1.07 | 0.76 | 0.96 | 0.96 | 1.10 | 1.05 | 0.93 | 0.88 | 0.83 |
year:NN | 1.29 | 0.62 | 0.99 | 1.17 | 0.71 | 0.82 | 0.76 | 1.12 | 0.50 | 0.91 | 1.15 | 0.74 |
NO_WORD | 0.52 | 0.28 | 0.11 | 0.15 | 1.55 | 1.30 | 0.17 | 0.74 | 0.84 | 0.29 | 0.52 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.38 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 11.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.08 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "four" modifying "times"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "coming" modifying "year" is dropped on aligned hypothesis word "years"
0.00 1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
-0.05 1.00 NullPunisher.aux : have
-1.00 1.00 NullPunisher.other : times
-1.00 1.00 NullPunisher.other : least
-0.05 1.00 NullPunisher.aux : been
-1.00 1.00 NullPunisher.other : at
-1.00 1.00 NullPunisher.other : now
-3.00 1.00 NullPunisher.entity : Two
-0.05 1.00 NullPunisher.aux : will
-3.00 1.00 NullPunisher.entity : four
-3.00 1.00 NullPunisher.entity : Smith
-3.00 1.00 NullPunisher.entity : Florence
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs ''
-6.00 1.00 Numeric.mismatch : NUMBER mismatch: '>=4.0' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "been" not aligned to anything
Hand-tuned score (dot product of above): -31.7802
Threshold: -9.4738
Txt: Last week I already knew that when, in a month's time, Smith would discover that she had been duped she would be furious.
Hyp: It will be the case that in a few weeks Smith will discover that she has been duped; and she will be furious. (yes)
It PRP |
will MD |
be VB |
the DT |
case NN |
that IN |
a DT |
few JJ |
weeks NNS |
Smith NNP |
will MD |
discover VB |
that IN |
she PRP |
has VBZ |
been VBN |
duped VBN |
she PRP |
will MD |
be VB |
furious JJ |
|
Last:JJ | 1.05 | 0.76 | 0.96 | 0.76 | 0.77 | 0.99 | 0.76 | 0.93 | 0.88 | 1.11 | 0.76 | 0.96 | 0.99 | 1.05 | 0.86 | 0.96 | 0.96 | 1.05 | 0.76 | 0.96 | 0.93 |
week:NN | 1.58 | 0.65 | 0.82 | 0.67 | 0.94 | 0.50 | 0.71 | 0.95 | 1.07 | 1.15 | 0.65 | 0.82 | 0.50 | 1.55 | 0.82 | 0.71 | 0.76 | 1.55 | 0.65 | 0.82 | 0.92 |
I:PRP | 0.62 | 1.11 | 0.97 | 1.11 | 1.30 | 1.05 | 1.14 | 1.39 | 1.30 | 1.55 | 1.11 | 0.97 | 1.05 | 0.71 | 0.97 | 0.97 | 0.97 | 0.71 | 1.11 | 0.97 | 1.39 |
already:RB | 1.05 | 1.09 | 0.91 | 1.09 | 1.21 | 1.05 | 1.09 | 1.05 | 1.21 | 1.47 | 1.09 | 0.91 | 1.05 | 1.05 | 0.91 | 0.91 | 0.88 | 1.05 | 1.09 | 0.91 | 1.01 |
knew:VBD | 0.54 | 0.94 | 0.84 | 0.73 | 0.78 | 1.15 | 0.76 | 0.92 | 0.88 | 1.16 | 0.94 | 0.36 | 1.15 | 0.51 | 0.79 | 0.78 | 0.36 | 0.51 | 0.94 | 0.84 | 0.75 |
that:IN | 1.57 | 1.05 | 1.24 | 0.95 | 1.31 | 1.59 | 1.05 | 0.77 | 1.31 | 1.54 | 1.05 | 1.24 | 1.59 | 1.62 | 1.15 | 1.24 | 1.24 | 1.62 | 1.05 | 1.24 | 0.77 |
when:WRB | 1.16 | 1.25 | 1.25 | 1.21 | 1.00 | 0.99 | 1.31 | 1.15 | 0.92 | 1.25 | 1.25 | 1.25 | 0.99 | 1.07 | 1.22 | 1.14 | 1.23 | 1.07 | 1.25 | 1.25 | 1.18 |
a:DT | 1.15 | 1.31 | 1.25 | 1.31 | 1.00 | 1.05 | 2.42 | 1.18 | 1.00 | 1.25 | 1.31 | 1.25 | 1.05 | 1.16 | 1.25 | 1.25 | 1.25 | 1.16 | 1.31 | 1.25 | 1.18 |
month:NN | 1.58 | 0.71 | 0.82 | 0.71 | 0.91 | 0.47 | 0.71 | 0.99 | 0.47 | 1.10 | 0.71 | 0.82 | 0.47 | 1.58 | 0.82 | 0.82 | 0.82 | 1.58 | 0.71 | 0.82 | 0.89 |
time:NN | 1.58 | 0.65 | 0.82 | 0.61 | 0.50 | 0.44 | 0.71 | 0.99 | 0.90 | 1.13 | 0.65 | 0.61 | 0.44 | 1.55 | 0.82 | 0.82 | 0.78 | 1.55 | 0.65 | 0.82 | 0.89 |
Smith:NNP | 1.80 | 0.94 | 1.07 | 0.96 | 1.17 | 0.73 | 0.96 | 1.24 | 1.15 | 0.28 | 0.94 | 1.07 | 0.73 | 1.84 | 1.07 | 1.07 | 1.07 | 1.84 | 0.94 | 1.07 | 1.24 |
would:MD | 1.16 | 1.23 | 1.32 | 1.31 | 1.00 | 1.05 | 1.31 | 1.18 | 0.95 | 1.25 | 1.23 | 1.25 | 1.05 | 1.16 | 1.34 | 1.25 | 1.21 | 1.16 | 1.23 | 1.32 | 1.18 |
discover:VB | 0.54 | 0.95 | 0.85 | 0.76 | 0.84 | 1.15 | 0.76 | 1.02 | 0.91 | 1.16 | 0.95 | 1.32 | 1.15 | 0.54 | 0.80 | 0.85 | 0.52 | 0.54 | 0.95 | 0.85 | 1.00 |
that:IN | 1.57 | 1.05 | 1.24 | 0.95 | 1.31 | 1.59 | 1.05 | 0.77 | 1.31 | 1.54 | 1.05 | 1.24 | 1.59 | 1.62 | 1.15 | 1.24 | 1.24 | 1.62 | 1.05 | 1.24 | 0.77 |
she:PRP | 0.71 | 1.41 | 1.23 | 1.26 | 1.26 | 1.10 | 1.11 | 1.69 | 2.20 | 1.55 | 1.41 | 0.97 | 1.10 | 1.05 | 0.97 | 1.54 | 0.97 | 1.05 | 1.41 | 1.23 | 1.39 |
had:VBD | 0.54 | 1.05 | 0.98 | 0.76 | 0.87 | 1.06 | 0.76 | 1.02 | 0.91 | 1.16 | 1.05 | 0.80 | 1.06 | 0.54 | 0.17 | 0.98 | 0.66 | 0.54 | 1.05 | 0.98 | 1.02 |
been:VBN | 0.54 | 1.11 | 0.17 | 0.73 | 0.91 | 1.15 | 0.76 | 0.99 | 0.83 | 1.16 | 1.11 | 0.85 | 1.15 | 0.51 | 0.98 | 1.32 | 0.67 | 0.51 | 1.11 | 0.17 | 1.02 |
duped:VBN | 0.54 | 0.83 | 0.69 | 0.76 | 0.73 | 1.15 | 0.76 | 1.02 | 0.91 | 1.16 | 0.83 | 0.52 | 1.15 | 0.54 | 0.66 | 0.67 | 1.32 | 0.54 | 0.83 | 0.69 | 0.84 |
she:PRP | 0.71 | 1.41 | 1.23 | 1.26 | 1.26 | 1.10 | 1.11 | 1.69 | 2.20 | 1.55 | 1.41 | 0.97 | 1.10 | 1.05 | 0.97 | 1.54 | 0.97 | 1.05 | 1.41 | 1.23 | 1.39 |
would:MD | 1.16 | 1.23 | 1.32 | 1.31 | 1.00 | 1.05 | 1.31 | 1.18 | 0.95 | 1.25 | 1.23 | 1.25 | 1.05 | 1.16 | 1.34 | 1.25 | 1.21 | 1.16 | 1.23 | 1.32 | 1.18 |
be:VB | 0.54 | 1.11 | 1.32 | 0.71 | 0.91 | 1.15 | 0.76 | 0.97 | 0.91 | 1.16 | 1.11 | 0.85 | 1.15 | 0.50 | 0.98 | 0.17 | 0.69 | 0.50 | 1.11 | 1.32 | 1.02 |
furious:JJ | 1.05 | 0.76 | 0.96 | 0.76 | 0.86 | 1.05 | 0.76 | 0.93 | 0.88 | 1.13 | 0.76 | 0.94 | 1.05 | 1.05 | 0.96 | 0.96 | 0.78 | 1.05 | 0.76 | 0.96 | 0.74 |
NO_WORD | 0.54 | 1.55 | 1.43 | 0.82 | 0.12 | 0.60 | 0.82 | 0.11 | 0.07 | 0.28 | 1.55 | 0.36 | 0.60 | 0.67 | 1.30 | 1.57 | 0.36 | 0.54 | 1.55 | 1.43 | 0.35 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.81 Alignment.score
1.00 0.32 Alignment.isGood
-1.00 0.66 Alignment.isBad
-0.10 20.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.05 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "discover" modifying "case"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "Last" modifying "week" is dropped on aligned hypothesis word "weeks"
0.00 1.00 NegPolarity.hypNegWord : "few": is in NegPolarityMarkers list
-1.00 1.00 NullPunisher.other : discover
-0.05 1.00 NullPunisher.aux : been
-1.00 1.00 NullPunisher.other : case
-0.10 1.00 NullPunisher.article : a
-0.05 1.00 NullPunisher.aux : has
-0.05 1.00 NullPunisher.aux : be
-0.05 1.00 NullPunisher.aux : will
-0.05 1.00 NullPunisher.aux : be
-1.00 1.00 NullPunisher.other : she
-1.00 1.00 NullPunisher.other : few
-1.00 1.00 NullPunisher.other : furious
-0.05 1.00 NullPunisher.aux : will
-0.10 1.00 NullPunisher.functionWord : that
-0.10 1.00 NullPunisher.functionWord : that
-1.00 1.00 NullPunisher.other : It
-1.00 1.00 NullPunisher.other : duped
-1.00 1.00 NullPunisher.other : she
-0.05 1.00 NullPunisher.aux : will
-3.00 1.00 NullPunisher.entity : Smith
-0.10 1.00 NullPunisher.article : the
-2.00 1.00 RootEntailment.unalignedRoot : "case" not aligned to anything
Hand-tuned score (dot product of above): -15.6721
Threshold: -9.4738
Txt: No one gambling seriously stops until he is broke. No one can gamble when he is broke.
Hyp: Everyone who starts gambling seriously stops the moment he is broke. (yes)
Everyone NN |
who WP |
starts VBZ |
gambling NN |
seriously RB |
stops VBZ |
the DT |
moment NN |
he PRP |
is VBZ |
broke RB |
|
No:DT | 1.00 | 1.12 | 1.25 | 1.00 | 1.45 | 1.25 | 1.31 | 1.00 | 1.16 | 1.25 | 1.45 |
one:CD | 1.31 | 1.32 | 1.30 | 1.33 | 1.30 | 1.30 | 0.82 | 1.30 | 1.27 | 1.30 | 1.25 |
gambling:NN | 0.79 | 1.58 | 0.82 | 0.28 | 1.30 | 0.76 | 0.71 | 0.86 | 1.58 | 0.82 | 1.31 |
seriously:RB | 1.18 | 1.05 | 0.88 | 1.21 | 1.23 | 0.82 | 1.09 | 0.99 | 1.05 | 0.91 | 0.77 |
stops:VBZ | 0.91 | 0.54 | 1.56 | 0.85 | 1.02 | 1.32 | 0.76 | 0.72 | 0.54 | 0.82 | 1.07 |
until:IN | 1.31 | 1.57 | 1.24 | 1.31 | 1.23 | 1.24 | 1.05 | 1.31 | 1.57 | 1.24 | 1.23 |
he:PRP | 1.45 | 0.83 | 0.97 | 2.50 | 1.05 | 0.97 | 0.74 | 1.60 | 1.05 | 1.12 | 1.20 |
is:VBZ | 0.91 | 0.54 | 0.90 | 0.91 | 1.11 | 0.82 | 0.76 | 0.91 | 0.54 | 1.32 | 1.11 |
broke:RB | 1.16 | 1.05 | 0.89 | 1.21 | 0.77 | 0.87 | 1.09 | 0.96 | 1.05 | 0.91 | 1.23 |
No:DT | 1.00 | 1.12 | 1.25 | 1.00 | 1.45 | 1.25 | 1.31 | 1.00 | 1.16 | 1.25 | 1.45 |
one:NN | 0.80 | 1.58 | 0.82 | 0.88 | 1.31 | 0.82 | 0.63 | 0.78 | 1.53 | 0.82 | 1.25 |
can:MD | 1.00 | 1.24 | 1.44 | 1.00 | 1.45 | 1.39 | 1.31 | 1.00 | 1.16 | 1.55 | 1.45 |
gamble:VB | 0.91 | 0.54 | 0.66 | 0.10 | 0.97 | 0.62 | 0.76 | 0.79 | 0.54 | 0.74 | 1.09 |
when:WRB | 1.00 | 1.07 | 1.25 | 1.00 | 1.45 | 1.25 | 1.21 | 0.95 | 1.09 | 1.25 | 1.45 |
he:PRP | 1.45 | 0.83 | 0.97 | 2.50 | 1.05 | 0.97 | 0.74 | 1.60 | 1.05 | 1.12 | 1.20 |
is:VBZ | 0.91 | 0.54 | 0.90 | 0.91 | 1.11 | 0.82 | 0.76 | 0.91 | 0.54 | 1.32 | 1.11 |
broke:RB | 1.16 | 1.05 | 0.89 | 1.21 | 0.77 | 0.87 | 1.09 | 0.96 | 1.05 | 0.91 | 1.23 |
NO_WORD | 0.28 | 0.92 | 0.25 | 0.09 | 0.04 | 0.17 | 0.82 | 0.09 | 0.54 | 0.25 | 0.04 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.27 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 11.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "broke" modifying "is"
-1.00 1.00 NullPunisher.other : he
-1.00 1.00 NullPunisher.other : starts
-0.10 1.00 NullPunisher.functionWord : who
-1.00 1.00 NullPunisher.other : broke
-1.00 1.00 NullPunisher.other : Everyone
-1.00 1.00 NullPunisher.other : stops
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : gambling
-1.00 1.00 NullPunisher.other : moment
-1.00 1.00 NullPunisher.other : seriously
-0.05 1.00 NullPunisher.aux : is
-2.00 1.00 RootEntailment.unalignedRoot : "stops" not aligned to anything
Hand-tuned score (dot product of above): -12.6413
Threshold: -9.4738
Txt: No one who starts gambling seriously stops until he is broke.
Hyp: Everyone who starts gambling seriously continues until he is broke. (yes)
Everyone NN |
who WP |
starts VBZ |
gambling NN |
seriously RB |
continues VBZ |
until IN |
he PRP |
is VBZ |
broke RB |
|
No:DT | 1.00 | 1.12 | 1.25 | 1.00 | 1.45 | 1.25 | 1.05 | 1.16 | 1.25 | 1.45 |
one:NN | 0.80 | 1.58 | 0.82 | 0.88 | 1.31 | 0.82 | 0.50 | 1.53 | 0.82 | 1.25 |
who:WP | 1.30 | 1.05 | 0.97 | 1.30 | 1.05 | 0.97 | 1.05 | 0.83 | 0.97 | 1.05 |
starts:VBZ | 0.91 | 0.54 | 1.32 | 0.91 | 1.08 | 0.57 | 1.15 | 0.54 | 0.90 | 1.09 |
gambling:NN | 0.79 | 1.58 | 0.82 | 0.28 | 1.30 | 0.69 | 0.50 | 1.58 | 0.82 | 1.31 |
seriously:RB | 1.18 | 1.05 | 0.88 | 1.21 | 1.23 | 0.76 | 1.05 | 1.05 | 0.91 | 0.77 |
stops:VBZ | 0.91 | 0.54 | 1.56 | 0.85 | 1.02 | 1.71 | 1.15 | 0.54 | 0.82 | 1.07 |
until:IN | 1.31 | 1.57 | 1.24 | 1.31 | 1.23 | 1.21 | 1.59 | 1.57 | 1.24 | 1.23 |
he:PRP | 1.30 | 0.83 | 0.97 | 1.30 | 1.05 | 0.97 | 1.05 | 1.05 | 0.97 | 1.05 |
is:VBZ | 0.91 | 0.54 | 0.90 | 0.91 | 1.11 | 0.88 | 1.15 | 0.54 | 1.32 | 1.11 |
broke:RB | 1.16 | 1.05 | 0.89 | 1.21 | 0.77 | 0.91 | 1.05 | 1.05 | 0.91 | 1.23 |
NO_WORD | 0.28 | 0.92 | 0.25 | 0.09 | 0.04 | 0.17 | 0.92 | 0.54 | 0.42 | 0.04 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.31 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.76 Alignment.isBad
-0.10 10.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "broke" modifying "is"
-1.00 1.00 NullPunisher.other : gambling
-1.00 1.00 NullPunisher.other : seriously
-1.00 1.00 NullPunisher.other : starts
-1.00 1.00 NullPunisher.other : until
-0.05 1.00 NullPunisher.aux : is
-1.00 1.00 NullPunisher.other : he
-1.00 1.00 NullPunisher.other : Everyone
-1.00 1.00 NullPunisher.other : broke
-1.00 1.00 NullPunisher.other : continues
-0.10 1.00 NullPunisher.functionWord : who
-2.00 1.00 RootEntailment.unalignedRoot : "continues" not aligned to anything
Hand-tuned score (dot product of above): -12.3798
Threshold: -9.4738
Txt: Nobody who is asleep ever knows that he is asleep. But some people know that they have been asleep after they have been asleep.
Hyp: Some people discover that they have been asleep. (yes)
Some DT |
people NNS |
discover VBP |
that IN |
they PRP |
have VBP |
been VBN |
asleep RB |
|
Nobody:NN | 0.71 | 0.92 | 0.82 | 0.50 | 1.58 | 0.82 | 0.82 | 1.31 |
who:WP | 1.11 | 1.30 | 0.97 | 1.01 | 0.78 | 0.97 | 0.97 | 1.05 |
is:VBZ | 0.76 | 0.91 | 0.85 | 1.15 | 0.54 | 0.98 | 0.10 | 1.11 |
asleep:RB | 1.05 | 1.13 | 0.72 | 1.05 | 1.05 | 0.91 | 0.87 | 1.23 |
ever:RB | 1.09 | 1.21 | 0.88 | 1.05 | 0.99 | 0.86 | 0.86 | 0.86 |
knows:VBZ | 0.73 | 0.77 | 0.64 | 1.15 | 0.54 | 0.79 | 0.84 | 1.09 |
that:IN | 1.05 | 1.31 | 1.24 | 1.59 | 1.46 | 1.18 | 1.24 | 1.23 |
he:PRP | 1.31 | 1.30 | 0.97 | 1.05 | 1.06 | 0.90 | 1.17 | 1.05 |
is:VBZ | 0.76 | 0.91 | 0.85 | 1.15 | 0.54 | 0.98 | 0.10 | 1.11 |
asleep:RB | 1.05 | 1.13 | 0.72 | 1.05 | 1.05 | 0.91 | 0.87 | 1.23 |
some:DT | 0.01 | 0.95 | 1.22 | 1.05 | 1.16 | 1.20 | 1.25 | 1.41 |
people:NNS | 0.66 | 0.28 | 0.69 | 0.50 | 1.66 | 0.82 | 0.82 | 1.22 |
know:VBP | 0.76 | 0.51 | 0.47 | 1.15 | 0.54 | 0.79 | 0.84 | 1.03 |
that:IN | 1.05 | 1.31 | 1.24 | 1.59 | 1.46 | 1.18 | 1.24 | 1.23 |
they:PRP | 1.51 | 2.58 | 0.97 | 0.94 | 1.05 | 1.17 | 1.12 | 1.05 |
have:VBP | 0.70 | 0.91 | 0.80 | 1.10 | 0.54 | 1.32 | 0.98 | 1.11 |
been:VBN | 0.76 | 0.91 | 0.85 | 1.15 | 0.49 | 0.98 | 1.32 | 1.07 |
asleep:JJ | 0.72 | 0.79 | 0.76 | 1.05 | 1.05 | 0.96 | 0.91 | 2.55 |
after:IN | 1.02 | 1.31 | 1.22 | 1.05 | 1.54 | 1.21 | 1.21 | 1.17 |
they:PRP | 1.51 | 2.58 | 0.97 | 0.94 | 1.05 | 1.17 | 1.12 | 1.05 |
have:VBP | 0.70 | 0.91 | 0.80 | 1.10 | 0.54 | 1.32 | 0.98 | 1.11 |
been:VBN | 0.76 | 0.91 | 0.85 | 1.15 | 0.49 | 0.98 | 1.32 | 1.07 |
asleep:RB | 1.05 | 1.13 | 0.72 | 1.05 | 1.05 | 0.91 | 0.87 | 1.23 |
NO_WORD | 0.82 | 0.28 | 0.17 | 0.60 | 0.54 | 1.30 | 0.31 | 0.36 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.75 Alignment.score
1.00 0.30 Alignment.isGood
-1.00 0.67 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.12 Alignment.txtSpan
-1.00 1.00 NullPunisher.other : discover
-0.05 1.00 NullPunisher.aux : have
-0.10 1.00 NullPunisher.functionWord : that
-1.00 1.00 NullPunisher.other : people
-0.05 1.00 NullPunisher.aux : been
-1.00 1.00 NullPunisher.other : Some
-1.00 1.00 NullPunisher.other : they
-2.00 1.00 RootEntailment.unalignedRoot : "discover" not aligned to anything
Hand-tuned score (dot product of above): -6.4037
Threshold: -9.4738
Txt: ITEL built MTALK in 1993.
Hyp: ITEL finished MTALK in 1993. (yes)
ITEL NNP |
finished VBD |
MTALK NNP |
1993 CD |
|
ITEL:NNP | 0.28 | 1.07 | 1.00 | 1.20 |
built:VBD | 1.14 | 0.68 | 0.86 | 1.27 |
MTALK:NNP | 1.00 | 0.82 | 0.28 | 1.29 |
1993:CD | 1.24 | 1.16 | 1.33 | 0.85 |
NO_WORD | 0.28 | 0.17 | 0.09 | 0.57 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 -0.15 Alignment.score
1.00 0.15 Alignment.isGood
-1.00 0.83 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1993" modifying "MTALK"
-2.00 1.00 Date.hypDateIns : hypothesis date insertion: 1993
-3.00 1.00 NullPunisher.entity : 1993
-1.00 1.00 NullPunisher.other : MTALK
-1.00 1.00 NullPunisher.other : finished
-3.00 1.00 NullPunisher.entity : ITEL
-6.00 1.00 Numeric.mismatch : DATE mismatch: '01/01/1993' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -20.2304
Threshold: -9.4738
Txt: ITEL was building MTALK in 1993.
Hyp: ITEL finished MTALK in 1993. (don't know)
ITEL NNP |
finished VBD |
MTALK NNP |
1993 CD |
|
ITEL:NNP | 0.28 | 1.07 | 1.00 | 1.20 |
was:VBD | 1.16 | 0.92 | 0.91 | 1.39 |
building:VBG | 1.16 | 0.79 | 0.91 | 1.38 |
MTALK:NNP | 1.00 | 0.82 | 0.28 | 1.29 |
1993:CD | 1.24 | 1.16 | 1.33 | 0.85 |
NO_WORD | 0.28 | 0.17 | 0.09 | 0.57 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 -0.15 Alignment.score
1.00 0.15 Alignment.isGood
-1.00 0.83 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1993" modifying "MTALK"
-2.00 1.00 Date.hypDateIns : hypothesis date insertion: 1993
-3.00 1.00 NullPunisher.entity : ITEL
-1.00 1.00 NullPunisher.other : finished
-1.00 1.00 NullPunisher.other : MTALK
-3.00 1.00 NullPunisher.entity : 1993
-6.00 1.00 Numeric.mismatch : DATE mismatch: '01/01/1993' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -20.2304
Threshold: -9.4738
Txt: ITEL won the contract from APCOM in 1993.
Hyp: ITEL won a contract in 1993. (yes)
ITEL NNP |
won VBD |
a DT |
contract NN |
1993 CD |
|
ITEL:NNP | 0.28 | 1.07 | 0.96 | 1.07 | 1.20 |
won:VBD | 1.16 | 1.32 | 0.76 | 0.65 | 1.18 |
the:DT | 1.22 | 1.25 | 1.31 | 1.00 | 1.18 |
contract:NN | 1.07 | 0.56 | 0.71 | 0.28 | 1.19 |
APCOM:NNP | 1.07 | 0.82 | 0.71 | 0.82 | 1.29 |
1993:CD | 1.24 | 1.09 | 0.89 | 1.23 | 0.85 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.57 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.05 Alignment.score
1.00 0.18 Alignment.isGood
-1.00 0.80 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1993" modifying "won"
-2.00 1.00 Date.hypDateIns : hypothesis date insertion: 1993
-1.00 1.00 NullPunisher.other : contract
-3.00 1.00 NullPunisher.entity : 1993
-1.00 1.00 NullPunisher.other : won
-3.00 1.00 NullPunisher.entity : ITEL
-0.10 1.00 NullPunisher.article : a
-6.00 1.00 Numeric.mismatch : DATE mismatch: '01/01/1993' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -20.1829
Threshold: -9.4738
Txt: ITEL was winning the contract from APCOM in 1993.
Hyp: ITEL won a contract in 1993. (don't know)
ITEL NNP |
won VBD |
a DT |
contract NN |
1993 CD |
|
ITEL:NNP | 0.28 | 1.07 | 0.96 | 1.07 | 1.20 |
was:VBD | 1.16 | 0.80 | 0.76 | 0.91 | 1.39 |
winning:VBG | 1.16 | 0.70 | 0.76 | 0.80 | 1.30 |
the:DT | 1.22 | 1.25 | 1.31 | 1.00 | 1.18 |
contract:NN | 1.07 | 0.56 | 0.71 | 0.28 | 1.19 |
APCOM:NNP | 1.07 | 0.82 | 0.71 | 0.82 | 1.29 |
1993:CD | 1.24 | 1.09 | 0.89 | 1.23 | 0.85 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.57 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.37 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.60 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1993" modifying "won"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "APCOM" modifying "winning" is dropped on aligned hypothesis word "won"
-2.00 1.00 Date.hypDateIns : hypothesis date insertion: 1993
-3.00 1.00 NullPunisher.entity : 1993
-0.10 1.00 NullPunisher.article : a
-6.00 1.00 Numeric.mismatch : DATE mismatch: '01/01/1993' vs ''
1.00 1.00 Quant.equivalent : Replacing the quantifier "the" by an equivalent quantifier "a" preserves truth.
Hand-tuned score (dot product of above): -10.6849
Threshold: -9.4738
Txt: ITEL owned APCOM from 1988 to 1992.
Hyp: ITEL owned APCOM in 1990. (yes)
ITEL NNP |
owned VBD |
APCOM NNP |
1990 CD |
|
ITEL:NNP | 0.28 | 1.05 | 1.07 | 1.20 |
owned:VBD | 1.14 | 1.32 | 0.91 | 1.15 |
APCOM:NNP | 1.07 | 0.82 | 0.28 | 1.29 |
1988:CD | 1.24 | 1.15 | 1.33 | 0.49 |
1992:CD | 1.24 | 1.12 | 1.33 | 1.61 |
NO_WORD | 0.28 | 0.17 | 0.09 | 0.57 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.40 Alignment.score
1.00 0.23 Alignment.isGood
-1.00 0.74 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.25 Alignment.txtSpan
-3.00 1.00 Date.dateHeadMismatch : 1990 vs. 1992
-3.00 1.00 NullPunisher.entity : ITEL
-1.00 1.00 NullPunisher.other : owned
-1.00 1.00 NullPunisher.other : APCOM
-2.00 1.00 RootEntailment.unalignedRoot : "owned" not aligned to anything
Hand-tuned score (dot product of above): -10.2847
Threshold: -9.4738
Txt: Smith and Jones left the meeting.
Hyp: Smith left the meeting. (yes)
Smith NNP |
left VBD |
the DT |
meeting NN |
|
Smith:NNP | 0.28 | 1.05 | 0.96 | 1.15 |
Jones:NNP | 0.70 | 1.07 | 0.96 | 1.17 |
left:VBD | 1.14 | 1.32 | 0.76 | 0.77 |
the:DT | 1.25 | 1.25 | 2.42 | 1.00 |
meeting:NN | 1.15 | 0.68 | 0.71 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.20 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 NullPunisher.other : left
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : meeting
-0.10 1.00 NullPunisher.article : the
-2.00 1.00 RootEntailment.unalignedRoot : "left" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738
Txt: Smith and Jones left the meeting.
Hyp: Jones left the meeting. (yes)
Jones NNP |
left VBD |
the DT |
meeting NN |
|
Smith:NNP | 0.70 | 1.05 | 0.96 | 1.15 |
Jones:NNP | 0.28 | 1.07 | 0.96 | 1.17 |
left:VBD | 1.16 | 1.32 | 0.76 | 0.77 |
the:DT | 1.25 | 1.25 | 2.42 | 1.00 |
meeting:NN | 1.17 | 0.68 | 0.71 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.20 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 4.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-0.10 1.00 NullPunisher.article : the
-3.00 1.00 NullPunisher.entity : Jones
-1.00 1.00 NullPunisher.other : left
-1.00 1.00 NullPunisher.other : meeting
-2.00 1.00 RootEntailment.unalignedRoot : "left" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738
Txt: Smith, Anderson and Jones met.
Hyp: There was a group of people that met. (yes)
There EX |
was VBD |
a DT |
group NN |
people NNS |
that WDT |
met VBD |
|
Smith:NNP | 0.96 | 1.07 | 0.96 | 1.25 | 1.20 | 0.94 | 1.02 |
Anderson:NNP | 0.95 | 1.07 | 0.96 | 1.20 | 1.16 | 0.96 | 1.07 |
Jones:NNP | 0.96 | 1.07 | 0.96 | 1.28 | 1.20 | 0.96 | 1.07 |
met:VBD | 0.76 | 0.78 | 0.76 | 0.81 | 0.91 | 0.73 | 1.32 |
NO_WORD | 0.29 | 0.17 | 0.82 | 0.28 | 0.04 | 0.41 | 0.25 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.24 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 7.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "met" modifying "people"
-0.10 1.00 NullPunisher.article : a
-1.00 1.00 NullPunisher.other : group
-0.10 1.00 NullPunisher.functionWord : that
-1.00 1.00 NullPunisher.other : people
-0.10 1.00 NullPunisher.functionWord : There
-1.00 1.00 NullPunisher.other : met
-0.05 1.00 NullPunisher.aux : was
-2.00 1.00 RootEntailment.unalignedRoot : "was" not aligned to anything
Hand-tuned score (dot product of above): -7.3731
Threshold: -9.4738
Txt: Smith knew that ITEL had won the contract in 1992.
Hyp: ITEL won the contract in 1992. (yes)
ITEL NNP |
won VBD |
the DT |
contract NN |
1992 CD |
|
Smith:NNP | 0.93 | 1.07 | 0.96 | 1.16 | 1.20 |
knew:VBD | 1.11 | 0.69 | 0.73 | 0.86 | 1.16 |
that:IN | 1.57 | 1.24 | 0.95 | 1.27 | 1.30 |
ITEL:NNP | 0.28 | 1.07 | 0.93 | 1.07 | 1.20 |
had:VBD | 1.16 | 0.82 | 0.76 | 0.91 | 1.39 |
won:VBN | 1.16 | 1.50 | 0.76 | 0.65 | 1.11 |
the:DT | 1.22 | 1.25 | 2.42 | 1.00 | 1.18 |
contract:NN | 1.07 | 0.56 | 0.71 | 0.28 | 1.18 |
1992:CD | 1.24 | 1.02 | 0.89 | 1.22 | 0.85 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.57 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.53 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.60 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1992" modifying "won"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "1992" modifying "contract" is dropped on aligned hypothesis word "contract"
-2.00 1.00 Date.hypDateIns : hypothesis date insertion: 1992
-0.10 1.00 NullPunisher.article : the
-3.00 1.00 NullPunisher.entity : 1992
-6.00 1.00 Numeric.mismatch : DATE mismatch: '01/01/1992' vs ''
Hand-tuned score (dot product of above): -11.4634
Threshold: -9.4738
Txt: Smith believed that ITEL had won the contract in 1992.
Hyp: ITEL won the contract in 1992. (don't know)
ITEL NNP |
won VBD |
the DT |
contract NN |
1992 CD |
|
Smith:NNP | 0.93 | 1.07 | 0.96 | 1.16 | 1.20 |
believed:VBD | 1.16 | 0.67 | 0.76 | 0.90 | 1.10 |
that:IN | 1.57 | 1.24 | 0.95 | 1.27 | 1.30 |
ITEL:NNP | 0.28 | 1.07 | 0.93 | 1.07 | 1.20 |
had:VBD | 1.16 | 0.82 | 0.76 | 0.91 | 1.39 |
won:VBN | 1.16 | 1.50 | 0.76 | 0.65 | 1.11 |
the:DT | 1.22 | 1.25 | 2.42 | 1.00 | 1.18 |
contract:NN | 1.07 | 0.56 | 0.71 | 0.28 | 1.18 |
1992:CD | 1.24 | 1.02 | 0.89 | 1.22 | 0.85 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.57 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.53 Alignment.score
1.00 0.26 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 2.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.60 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1992" modifying "won"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "1992" modifying "contract" is dropped on aligned hypothesis word "contract"
-2.00 1.00 Date.hypDateIns : hypothesis date insertion: 1992
0.50 1.00 Factive.unknownPassage : Hyp aligned with txt under non-factive verb (unknown): believed
-0.10 1.00 NullPunisher.article : the
-3.00 1.00 NullPunisher.entity : 1992
-6.00 1.00 Numeric.mismatch : DATE mismatch: '01/01/1992' vs ''
Hand-tuned score (dot product of above): -10.9634
Threshold: -9.4738
Txt: ITEL managed to win the contract in 1992.
Hyp: ITEL won the contract in 1992. (yes)
ITEL NNP |
won VBD |
the DT |
contract NN |
1992 CD |
|
ITEL:NNP | 0.28 | 1.07 | 0.93 | 1.07 | 1.20 |
managed:VBD | 1.16 | 0.63 | 0.76 | 0.82 | 1.25 |
to:TO | 1.25 | 1.21 | 1.26 | 1.00 | 1.18 |
win:VB | 1.16 | 0.74 | 0.76 | 0.84 | 1.22 |
the:DT | 1.22 | 1.25 | 2.42 | 1.00 | 1.18 |
contract:NN | 1.07 | 0.56 | 0.71 | 0.28 | 1.18 |
1992:CD | 1.24 | 1.02 | 0.89 | 1.22 | 0.85 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.57 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.25 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.40 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1992" modifying "won"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "1992" modifying "contract" is dropped on aligned hypothesis word "contract"
-2.00 1.00 Date.hypDateIns : hypothesis date insertion: 1992
1.00 1.00 Factive.positiveStatement : Valid pattern: "managed" X entails X
-3.00 1.00 NullPunisher.entity : ITEL
-0.10 1.00 NullPunisher.article : the
-3.00 1.00 NullPunisher.entity : 1992
-6.00 1.00 Numeric.mismatch : DATE mismatch: '01/01/1992' vs ''
Hand-tuned score (dot product of above): -14.0736
Threshold: -9.4738
Txt: ITEL tried to win the contract in 1992.
Hyp: ITEL won the contract in 1992. (don't know)
ITEL NNP |
won VBD |
the DT |
contract NN |
1992 CD |
|
ITEL:NNP | 0.28 | 1.07 | 0.93 | 1.07 | 1.20 |
tried:VBD | 1.14 | 0.63 | 0.70 | 0.84 | 1.03 |
to:TO | 1.25 | 1.21 | 1.26 | 1.00 | 1.18 |
win:VB | 1.16 | 0.74 | 0.76 | 0.84 | 1.22 |
the:DT | 1.22 | 1.25 | 2.42 | 1.00 | 1.18 |
contract:NN | 1.07 | 0.56 | 0.71 | 0.28 | 1.18 |
1992:CD | 1.24 | 1.02 | 0.89 | 1.22 | 0.85 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.57 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.25 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.40 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1992" modifying "won"
0.50 1.00 Adjunct.dropPosCxt : It is okay that text word "1992" modifying "contract" is dropped on aligned hypothesis word "contract"
-2.00 1.00 Date.hypDateIns : hypothesis date insertion: 1992
0.50 1.00 Factive.unknownPassage : Hyp aligned with txt under non-factive verb (unknown): tried
-0.10 1.00 NullPunisher.article : the
-3.00 1.00 NullPunisher.entity : ITEL
-3.00 1.00 NullPunisher.entity : 1992
-6.00 1.00 Numeric.mismatch : DATE mismatch: '01/01/1992' vs ''
Hand-tuned score (dot product of above): -14.5736
Threshold: -9.4738
Txt: It is true that ITEL won the contract in 1992.
Hyp: ITEL won the contract in 1992. (yes)
ITEL NNP |
won VBD |
the DT |
contract NN |
1992 CD |
|
It:PRP | 1.48 | 0.97 | 1.11 | 1.30 | 1.46 |
is:VBZ | 1.16 | 0.88 | 0.76 | 0.91 | 1.39 |
true:JJ | 1.13 | 0.92 | 0.67 | 0.84 | 1.01 |
that:IN | 1.57 | 1.24 | 0.95 | 1.27 | 1.30 |
ITEL:NNP | 0.28 | 1.07 | 0.93 | 1.07 | 1.20 |
won:VBD | 1.16 | 1.32 | 0.76 | 0.65 | 1.11 |
the:DT | 1.22 | 1.25 | 2.42 | 1.00 | 1.18 |
contract:NN | 1.07 | 0.56 | 0.71 | 0.28 | 1.18 |
1992:CD | 1.24 | 1.02 | 0.89 | 1.22 | 0.85 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.57 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.05 Alignment.score
1.00 0.18 Alignment.isGood
-1.00 0.80 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1992" modifying "won"
-2.00 1.00 Date.hypDateIns : hypothesis date insertion: 1992
-0.10 1.00 NullPunisher.article : the
-3.00 1.00 NullPunisher.entity : ITEL
-1.00 1.00 NullPunisher.other : won
-1.00 1.00 NullPunisher.other : contract
-3.00 1.00 NullPunisher.entity : 1992
-6.00 1.00 Numeric.mismatch : DATE mismatch: '01/01/1992' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -20.1829
Threshold: -9.4738
Txt: It is false that ITEL won the contract in 1992.
Hyp: ITEL won the contract in 1992. (don't know)
ITEL NNP |
won VBD |
the DT |
contract NN |
1992 CD |
|
It:PRP | 1.48 | 0.97 | 1.11 | 1.30 | 1.46 |
is:VBZ | 1.16 | 0.88 | 0.76 | 0.91 | 1.39 |
false:JJ | 1.13 | 0.96 | 0.76 | 0.88 | 0.92 |
that:IN | 1.57 | 1.24 | 0.95 | 1.27 | 1.30 |
ITEL:NNP | 0.28 | 1.07 | 0.93 | 1.07 | 1.20 |
won:VBD | 1.16 | 1.32 | 0.76 | 0.65 | 1.11 |
the:DT | 1.22 | 1.25 | 2.42 | 1.00 | 1.18 |
contract:NN | 1.07 | 0.56 | 0.71 | 0.28 | 1.18 |
1992:CD | 1.24 | 1.02 | 0.89 | 1.22 | 0.85 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 | 0.57 |
Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.05 Alignment.score
1.00 0.18 Alignment.isGood
-1.00 0.80 Alignment.isBad
-0.10 5.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1992" modifying "won"
-2.00 1.00 Date.hypDateIns : hypothesis date insertion: 1992
-1.00 1.00 NullPunisher.other : won
-1.00 1.00 NullPunisher.other : contract
-3.00 1.00 NullPunisher.entity : ITEL
-0.10 1.00 NullPunisher.article : the
-3.00 1.00 NullPunisher.entity : 1992
-6.00 1.00 Numeric.mismatch : DATE mismatch: '01/01/1992' vs ''
-2.00 1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -20.1829
Threshold: -9.4738
Txt: Smith saw Jones sign the contract. If Jones signed the contract, his heart was beating.
Hyp: Smith saw Jones' heart beat. (don't know)
Smith NNP |
saw VBD |
Jones NNP |
heart NN |
beat VBD |
|
Smith:NNP | 0.28 | 1.07 | 0.70 | 1.11 | 1.05 |
saw:VBD | 1.16 | 1.32 | 1.16 | 0.78 | 0.50 |
Jones:NNP | 0.70 | 1.07 | 0.28 | 1.12 | 1.07 |
sign:VB | 1.09 | 0.55 | 1.16 | 0.91 | 0.65 |
the:DT | 1.25 | 1.25 | 1.25 | 1.00 | 1.25 |
contract:NN | 1.16 | 0.82 | 1.16 | 0.86 | 0.82 |
If:IN | 1.57 | 1.24 | 1.57 | 1.31 | 1.24 |
Jones:NNP | 0.70 | 1.07 | 0.28 | 1.12 | 1.07 |
signed:VBD | 1.14 | 0.62 | 1.10 | 0.91 | 0.65 |
the:DT | 1.25 | 1.25 | 1.25 | 1.00 | 1.25 |
contract:NN | 1.16 | 0.82 | 1.16 | 0.86 | 0.82 |
his:PRP$ | 1.55 | 0.97 | 2.76 | 1.30 | 0.97 |
heart:NN | 1.11 | 0.69 | 1.12 | 0.28 | 0.69 |
was:VBD | 1.16 | 0.69 | 1.16 | 0.91 | 0.80 |
beating:VBG | 1.16 | 0.48 | 1.16 | 0.81 | 0.92 |
NO_WORD | 0.28 | 0.17 | 0.12 | 0.28 | 0.36 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.27 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.40 Alignment.txtSpan
-3.00 1.00 NullPunisher.entity : Smith
-3.00 1.00 NullPunisher.entity : Jones
-1.00 1.00 NullPunisher.other : saw
-2.00 1.00 RootEntailment.unalignedRoot : "saw" not aligned to anything
Hand-tuned score (dot product of above): -9.3519
Threshold: -9.4738
Txt: Smith saw Jones sign the contract. When Jones signed the contract, his heart was beating.
Hyp: Smith saw Jones' heart beat. (don't know)
Smith NNP |
saw VBD |
Jones NNP |
heart NN |
beat VBD |
|
Smith:NNP | 0.28 | 1.07 | 0.70 | 1.11 | 1.05 |
saw:VBD | 1.16 | 1.32 | 1.16 | 0.78 | 0.50 |
Jones:NNP | 0.70 | 1.07 | 0.28 | 1.12 | 1.07 |
sign:VB | 1.09 | 0.55 | 1.16 | 0.91 | 0.65 |
the:DT | 1.25 | 1.25 | 1.25 | 1.00 | 1.25 |
contract:NN | 1.16 | 0.82 | 1.16 | 0.86 | 0.82 |
When:WRB | 1.25 | 1.25 | 1.23 | 0.97 | 1.25 |
Jones:NNP | 0.70 | 1.07 | 0.28 | 1.12 | 1.07 |
signed:VBD | 1.14 | 0.62 | 1.10 | 0.91 | 0.65 |
the:DT | 1.25 | 1.25 | 1.25 | 1.00 | 1.25 |
contract:NN | 1.16 | 0.82 | 1.16 | 0.86 | 0.82 |
his:PRP$ | 1.55 | 0.97 | 2.76 | 1.30 | 0.97 |
heart:NN | 1.11 | 0.69 | 1.12 | 0.28 | 0.69 |
was:VBD | 1.16 | 0.69 | 1.16 | 0.91 | 0.80 |
beating:VBG | 1.16 | 0.48 | 1.16 | 0.81 | 0.92 |
NO_WORD | 0.28 | 0.17 | 0.12 | 0.28 | 0.36 |
Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.27 Alignment.score
1.00 0.21 Alignment.isGood
-1.00 0.77 Alignment.isBad
-0.10 3.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.40 Alignment.txtSpan
-1.00 1.00 NullPunisher.other : saw
-3.00 1.00 NullPunisher.entity : Jones
-3.00 1.00 NullPunisher.entity : Smith
-2.00 1.00 RootEntailment.unalignedRoot : "saw" not aligned to anything
Hand-tuned score (dot product of above): -9.3519
Threshold: -9.4738
Txt: Smith saw Jones sign the contract.
Hyp: Jones signed the contract. (yes)
Jones NNP |
signed VBD |
the DT |
contract NN |
|
Smith:NNP | 0.70 | 1.05 | 0.96 | 1.16 |
saw:VBD | 1.16 | 0.62 | 0.76 | 0.91 |
Jones:NNP | 0.28 | 1.01 | 0.96 | 1.16 |
sign:VB | 1.16 | 0.93 | 0.76 | 0.66 |
the:DT | 1.25 | 1.25 | 2.42 | 1.00 |
contract:NN | 1.16 | 0.36 | 0.71 | 0.28 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.67 Alignment.score
1.00 0.29 Alignment.isGood
-1.00 0.69 Alignment.isBad
-0.10 1.00 Alignment.nbWordNotAligned
0.10 2.00 Alignment.hypSpan
0.10 0.75 Alignment.txtSpan
-0.10 1.00 NullPunisher.article : the
Hand-tuned score (dot product of above): 0.3416
Threshold: -9.4738
Txt: Smith saw Jones sign the contract. Jones is the chairman of ITEL.
Hyp: Smith saw the chairman of ITEL sign the contract. (yes)
Smith NNP |
saw VBD |
the DT |
chairman NN |
ITEL NNP |
sign VB |
the DT |
contract NN |
|
Smith:NNP | 0.28 | 1.07 | 0.96 | 0.98 | 0.93 | 1.00 | 0.96 | 1.16 |
saw:VBD | 1.16 | 1.32 | 0.76 | 0.87 | 1.16 | 0.55 | 0.76 | 0.91 |
Jones:NNP | 0.70 | 1.07 | 0.96 | 0.99 | 0.93 | 1.07 | 0.96 | 1.16 |
sign:VB | 1.09 | 0.55 | 0.76 | 0.91 | 1.16 | 1.32 | 0.76 | 0.66 |
the:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.22 | 1.25 | 2.42 | 1.00 |
contract:NN | 1.16 | 0.82 | 0.71 | 0.87 | 1.07 | 0.57 | 0.71 | 0.28 |
Jones:NNP | 0.70 | 1.07 | 0.96 | 0.99 | 0.93 | 1.07 | 0.96 | 1.16 |
is:VBZ | 1.16 | 0.76 | 0.76 | 0.91 | 1.16 | 0.78 | 0.76 | 0.91 |
the:DT | 1.25 | 1.25 | 2.42 | 1.00 | 1.22 | 1.25 | 2.42 | 1.00 |
chairman:NN | 0.98 | 0.78 | 0.71 | 0.28 | 1.07 | 0.82 | 0.71 | 0.87 |
ITEL:NNP | 0.93 | 1.07 | 0.93 | 1.07 | 0.28 | 1.07 | 0.93 | 1.07 |
NO_WORD | 0.28 | 0.17 | 0.82 | 0.28 | 0.04 | 0.36 | 0.82 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.22 Alignment.score
1.00 0.20 Alignment.isGood
-1.00 0.78 Alignment.isBad
-0.10 8.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "ITEL" modifying "chairman"
-1.00 1.00 NullPunisher.other : saw
-1.00 1.00 NullPunisher.other : chairman
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : contract
-3.00 1.00 NullPunisher.entity : ITEL
-1.00 1.00 NullPunisher.other : sign
-0.10 1.00 NullPunisher.article : the
-3.00 1.00 NullPunisher.entity : Smith
-2.00 1.00 RootEntailment.unalignedRoot : "saw" not aligned to anything
Hand-tuned score (dot product of above): -14.3558
Threshold: -9.4738
Txt: Helen saw the chairman of the department answer the phone. The chairman of the department is a person.
Hyp: There is someone whom Helen saw answer the phone. (yes)
There EX |
is VBZ |
someone NN |
whom WP |
Helen NNP |
saw VBD |
answer VB |
the DT |
phone NN |
|
Helen:NNP | 0.88 | 1.07 | 1.13 | 1.84 | 0.28 | 1.07 | 1.05 | 0.96 | 1.05 |
saw:VBD | 0.76 | 0.76 | 0.91 | 0.54 | 1.16 | 1.32 | 0.64 | 0.76 | 0.78 |
the:DT | 1.20 | 1.25 | 1.00 | 1.13 | 1.25 | 1.25 | 1.25 | 2.42 | 0.94 |
chairman:NN | 0.69 | 0.82 | 0.83 | 1.58 | 1.10 | 0.78 | 0.82 | 0.71 | 0.88 |
the:DT | 1.20 | 1.25 | 1.00 | 1.13 | 1.25 | 1.25 | 1.25 | 2.42 | 0.94 |
department:NN | 0.71 | 0.82 | 1.10 | 1.58 | 1.09 | 0.78 | 0.77 | 0.71 | 0.95 |
answer:VBP | 0.74 | 0.77 | 0.91 | 0.54 | 1.14 | 0.64 | 2.02 | 0.76 | 0.75 |
the:DT | 1.20 | 1.25 | 1.00 | 1.13 | 1.25 | 1.25 | 1.25 | 2.42 | 0.94 |
phone:NN | 0.62 | 0.82 | 0.87 | 1.51 | 1.05 | 0.69 | 0.66 | 0.65 | 0.28 |
The:DT | 1.20 | 1.25 | 1.00 | 1.13 | 1.25 | 1.25 | 1.25 | 0.01 | 0.94 |
chairman:NN | 0.69 | 0.82 | 0.83 | 1.58 | 1.10 | 0.78 | 0.82 | 0.71 | 0.88 |
the:DT | 1.20 | 1.25 | 1.00 | 1.13 | 1.25 | 1.25 | 1.25 | 2.42 | 0.94 |
department:NN | 0.71 | 0.82 | 1.10 | 1.58 | 1.09 | 0.78 | 0.77 | 0.71 | 0.95 |
is:VBZ | 0.76 | 1.32 | 0.91 | 0.54 | 1.16 | 0.76 | 0.82 | 0.76 | 0.91 |
a:DT | 1.31 | 1.25 | 1.00 | 1.16 | 1.25 | 1.25 | 1.25 | 1.31 | 1.00 |
person:NN | 0.69 | 0.82 | 0.22 | 1.58 | 1.11 | 0.82 | 0.61 | 0.71 | 0.76 |
NO_WORD | 0.29 | 0.17 | 0.28 | 0.92 | 0.28 | 0.25 | 0.34 | 0.82 | 0.09 |
Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
1.00 0.50 Alignment.score
1.00 0.25 Alignment.isGood
-1.00 0.72 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.33 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "saw" modifying "someone"
-1.00 1.00 NullPunisher.other : whom
-0.10 1.00 NullPunisher.functionWord : There
-3.00 1.00 NullPunisher.entity : Helen
-0.05 1.00 NullPunisher.aux : is
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : saw
-2.00 1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -8.6877
Threshold: -9.4738
Txt: Smith saw Jones sign the contract and his secretary make a copy.
Hyp: Smith saw Jones sign the contract. (yes)
Smith NNP |
saw VBD |
Jones NNP |
sign VB |
the DT |
contract NN |
|
Smith:NNP | 0.28 | 1.07 | 0.70 | 1.00 | 0.96 | 1.16 |
saw:VBD | 1.16 | 1.32 | 1.16 | 0.55 | 0.76 | 0.91 |
Jones:NNP | 0.70 | 1.07 | 0.28 | 1.07 | 0.96 | 1.16 |
sign:VB | 1.09 | 0.55 | 1.16 | 1.32 | 0.76 | 0.66 |
the:DT | 1.25 | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 |
contract:NN | 1.16 | 0.82 | 1.16 | 0.57 | 0.71 | 0.28 |
his:PRP$ | 2.76 | 0.97 | 1.55 | 1.18 | 1.35 | 1.30 |
secretary:NN | 0.96 | 0.82 | 0.97 | 0.71 | 0.71 | 0.88 |
make:VBP | 1.14 | 0.58 | 1.14 | 0.57 | 0.73 | 0.90 |
a:DT | 1.25 | 1.25 | 1.25 | 1.25 | 1.31 | 1.00 |
copy:NN | 1.11 | 0.82 | 1.10 | 0.78 | 0.71 | 0.71 |
NO_WORD | 0.28 | 0.17 | 0.28 | 0.36 | 0.82 | 0.09 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.15 Alignment.score
1.00 0.19 Alignment.isGood
-1.00 0.79 Alignment.isBad
-0.10 6.00 Alignment.nbWordNotAligned
0.10 0.00 Alignment.hypSpan
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : sign
-1.00 1.00 NullPunisher.other : contract
-0.10 1.00 NullPunisher.article : the
-1.00 1.00 NullPunisher.other : saw
-3.00 1.00 NullPunisher.entity : Jones
-2.00 1.00 RootEntailment.unalignedRoot : "saw" not aligned to anything
Hand-tuned score (dot product of above): -12.1485
Threshold: -9.4738
Txt: Smith saw Jones sign the contract or cross out the crucial clause.
Hyp: Smith either saw Jones sign the contract or saw Jones cross out the crucial clause. (yes)
Smith NNP |
either RB |
saw VBD |
Jones NNP |
sign VB |
the DT |
contract NN |
saw VBD |
Jones NNP |
cross VB |
the DT |
crucial JJ |
clause NN |
|
Smith:NNP | 0.28 | 1.50 | 1.07 | 0.70 | 1.00 | 0.96 | 1.16 | 1.07 | 0.70 | 1.07 | 0.96 | 1.24 | 1.10 |
saw:VBD | 1.16 | 1.11 | 1.32 | 1.16 | 0.55 | 0.76 | 0.91 | 1.32 | 1.16 | 0.60 | 0.76 | 1.01 | 0.91 |
Jones:NNP | 0.70 | 1.54 | 1.07 | 0.28 | 1.07 | 0.96 | 1.16 | 1.07 | 0.28 | 1.03 | 0.96 | 1.24 | 1.09 |
sign:VB | 1.09 | 1.11 | 0.55 | 1.16 | 1.32 | 0.76 | 0.66 | 0.55 | 1.16 | 0.68 | 0.76 | 0.76 | 0.76 |
the:DT | 1.25 | 1.38 | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 | 1.25 | 1.25 | 1.25 | 2.42 | 1.18 | 1.00 |
contract:NN | 1.16 | 1.31 | 0.82 | 1.16 | 0.57 | 0.71 | 0.28 | 0.82 | 1.16 | 0.76 | 0.71 | 0.92 | 0.55 |
cross_out:VB | 1.16 | 1.11 | 0.54 | 1.16 | 0.62 | 0.76 | 0.80 | 0.54 | 1.16 | 1.45 | 0.76 | 0.99 | 0.89 |
the:DT | 1.25 | 1.38 | 1.25 | 1.25 | 1.25 | 2.42 | 1.00 | 1.25 | 1.25 | 1.25 | 2.42 | 1.18 | 1.00 |
crucial:JJ | 1.13 | 1.01 | 0.95 | 1.13 | 0.70 | 0.76 | 0.81 | 0.95 | 1.13 | 0.92 | 0.76 | 0.74 | 0.71 |
clause:NN | 1.10 | 1.31 | 0.82 | 1.09 | 0.67 | 0.71 | 0.55 | 0.82 | 1.09 | 0.75 | 0.71 | 0.82 | 0.28 |
NO_WORD | 0.28 | 0.04 | 0.17 | 0.28 | 0.36 | 0.82 | 0.09 | 0.80 | 0.28 | 0.36 | 0.82 | 0.11 | 0.62 |
Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
1.00 0.34 Alignment.score
1.00 0.22 Alignment.isGood
-1.00 0.75 Alignment.isBad
-0.10 12.00 Alignment.nbWordNotAligned
0.10 1.00 Alignment.hypSpan
0.10 0.08 Alignment.txtSpan
-1.00 1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "clause" modifying "cross"
-1.00 1.00 NullPunisher.other : contract
-3.00 1.00 NullPunisher.entity : Jones
-1.00 1.00 NullPunisher.other : saw
-0.10 1.00 NullPunisher.article : the
-3.00 1.00 NullPunisher.entity : Smith
-1.00 1.00 NullPunisher.other : either
-1.00 1.00 NullPunisher.other : crucial
-1.00 1.00 NullPunisher.other : saw
-1.00 1.00 NullPunisher.other : sign
-1.00 1.00 NullPunisher.other : clause
-3.00 1.00 NullPunisher.entity : Jones
-0.10 1.00 NullPunisher.article : the
-2.00 1.00 RootEntailment.unalignedRoot : "saw" not aligned to anything
Hand-tuned score (dot product of above): -20.4803
Threshold: -9.4738
Hand-set weights Accuracy: 161/334 = 0.4820
Word similarity table built on Sun Jul 29 19:17:00 PDT 2007 using command:
java edu.stanford.nlp.rte.WordSimilarityGenerator -info /u/nlp/rte/data/byformat/align/simple/fracas-joined-premises.rte.align.xml -output /u/nlp/rte/data/byformat/wordsim/simple/fracas-joined-premises.rte.wordsim.html