Txt/Hyp term similarities

The rows are the txt words. The columns are hyp words.

Resource summary:
Acronym: AcronymLexicalResource
BasicWN: null
Country: CountryLexicalResource
Coref: CorefLexicalResource
Cyc: null
DekangLin: null
EditDistance: EditDistanceLexicalResource
Google: null
InfoMap: InfoMapLexicalResource
Morpho: MorphoLexicalResource
NomBank: NomBankLexicalResource
Number: NumberLexicalResource
Ordinal: OrdinalLexicalResource
Preposition: PrepositionLexicalResource
Ravichandran: RavichandranLexicalResource
LeacockChodorowWN: null
JiangConrathWN: JiangConrathWNLexicalResource
ResnikWN: null
LinWN: LinWNLexicalResource
HirstWN: null
AdaptedLeskWN: null
EdgeWN: null
StringSim: StringSimLexicalResource
WebLexicalEntailment: null
CorpusLexicalEntailment: null
WordNet: null
WNAntonymy: WNAntonymyLexicalResource
WNHypernymy: WNHypernymyLexicalResource
WNSynonymy: WNSynonymyLexicalResource



Inference ID: 001

Txt: An Italian became the world's greatest tenor.

Hyp: There was an Italian who became the world's greatest tenor. (yes)

There
EX
was
VBD
an
DT
Italian
JJ
who
WP
became
VBD
the
DT
world
NN
greatest
JJS
tenor
NN
An:DT   1.31   1.21   0.13   1.44   1.16   1.25   1.31   1.00   1.18   1.00
Italian:JJ   1.02   1.22   1.02   0.74   1.30   1.22   1.02   1.13   1.18   1.13
became:VBD   0.74   0.93   0.76   1.28   0.54   1.32   0.76   0.91   0.99   0.89
the:DT   1.20   1.25   1.31   1.44   1.09   1.25   2.42   1.00   1.18   1.00
world:NN   0.71   0.82   0.71   1.24   1.58   0.82   0.71   0.28   0.38   0.84
greatest:JJS   0.74   0.96   0.76   1.18   1.05   0.93   0.76   0.27   0.74   0.72
tenor:NN   0.66   0.82   0.71   1.24   1.58   0.80   0.71   0.84   0.83   0.28
NO_WORD   0.29   0.17   0.82   0.23   0.92   0.25   0.82   0.12   0.11   0.10

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.34 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  10.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "greatest" modifying "tenor"
 0.00  1.00 NegPolarity.hypNegWord : "greatest": tag "JJS" is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : became
-1.00  1.00 NullPunisher.other : tenor
-0.10  1.00 NullPunisher.functionWord : who
-0.10  1.00 NullPunisher.functionWord : There
-1.00  1.00 NullPunisher.other : greatest
-0.10  1.00 NullPunisher.article : an
-0.10  1.00 NullPunisher.article : the
-3.00  1.00 NullPunisher.entity : Italian
-0.05  1.00 NullPunisher.aux : was
-1.00  1.00 NullPunisher.other : world
-2.00  1.00 RootEntailment.unalignedRoot : "was" not aligned to anything
Hand-tuned score (dot product of above): -11.6465
Threshold: -9.4738


Inference ID: 002

Txt: Every Italian man wants to be a great tenor. Some Italian men are great tenors.

Hyp: There are Italian men who want to be a great tenor. (yes)

There
EX
are
VBP
Italian
JJ
men
NNS
who
WP
want
VBP
to
TO
be
VB
a
DT
great
JJ
tenor
NN
Every:DT   1.22   1.25   1.44   1.00   1.16   1.25   1.31   1.25   1.31   1.14   0.95
Italian:JJ   1.02   1.22   0.74   1.13   1.30   1.22   1.02   1.22   1.02   1.19   1.13
man:NN   0.71   0.82   1.24   0.01   1.58   0.72   0.71   0.82   0.71   0.85   0.80
wants:VBZ   0.76   0.80   1.28   0.82   0.54   1.10   0.76   0.80   0.76   0.93   0.86
to:TO   1.31   1.25   1.44   1.00   1.12   1.25   2.42   1.25   1.31   1.18   0.96
be:VB   0.76   0.14   1.28   0.86   0.54   0.88   0.76   1.32   0.76   1.02   0.91
a:DT   1.31   1.25   1.44   1.00   1.16   1.25   1.31   1.25   2.42   1.18   1.00
great:JJ   0.72   0.90   1.19   0.88   1.05   0.93   0.76   0.96   0.76   0.74   0.60
tenor:NN   0.66   0.82   1.24   0.79   1.58   0.79   0.67   0.82   0.71   0.70   0.28
Some:DT   1.28   1.22   1.44   0.96   1.16   1.25   1.31   1.25   1.31   1.18   1.00
Italian:JJ   1.02   1.22   0.74   1.13   1.30   1.22   1.02   1.22   1.02   1.19   1.13
men:NNS   0.71   0.82   1.24   0.28   1.58   0.72   0.71   0.77   0.71   0.99   0.79
are:VBP   0.70   1.32   1.28   0.91   0.54   0.85   0.76   0.14   0.76   0.96   0.91
great:JJ   0.72   0.90   1.19   0.88   1.05   0.93   0.76   0.96   0.76   0.74   0.60
tenors:NNS   0.65   0.82   1.23   0.66   1.58   0.69   0.71   0.82   0.71   0.99   0.99
NO_WORD   0.29   0.17   0.11   0.28   0.92   0.25   1.55   0.34   0.82   0.11   0.10

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.58 Alignment.score
 1.00  0.27 Alignment.isGood
-1.00  0.71 Alignment.isBad
-0.10  8.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.27 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "great" modifying "tenor"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "great" modifying "tenors" is dropped on aligned hypothesis word "tenor"
-0.05  1.00 NullPunisher.aux : are
-3.00  1.00 NullPunisher.entity : Italian
-0.10  1.00 NullPunisher.functionWord : who
-0.10  1.00 NullPunisher.article : a
-0.10  1.00 NullPunisher.functionWord : to
-0.10  1.00 NullPunisher.functionWord : There
-1.00  1.00 NullPunisher.other : great
-0.05  1.00 NullPunisher.aux : be
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -7.5273
Threshold: -9.4738


Inference ID: 003

Txt: All Italian men want to be a great tenor. Some Italian men are great tenors.

Hyp: There are Italian men who want to be a great tenor. (yes)

There
EX
are
VBP
Italian
JJ
men
NNS
who
WP
want
VBP
to
TO
be
VB
a
DT
great
JJ
tenor
NN
All:DT   1.31   1.18   1.44   1.00   1.16   1.22   1.31   1.25   1.31   1.18   1.00
Italian:JJ   1.02   1.22   0.74   1.13   1.30   1.22   1.02   1.22   1.02   1.19   1.13
men:NNS   0.71   0.82   1.24   0.28   1.58   0.72   0.71   0.77   0.71   0.99   0.79
want:VBP   0.76   0.77   1.28   0.81   0.51   1.32   0.76   0.80   0.76   0.99   0.88
to:TO   1.31   1.25   1.44   1.00   1.12   1.25   2.42   1.25   1.31   1.18   0.96
be:VB   0.76   0.14   1.28   0.86   0.54   0.88   0.76   1.32   0.76   1.02   0.91
a:DT   1.31   1.25   1.44   1.00   1.16   1.25   1.31   1.25   2.42   1.18   1.00
great:JJ   0.72   0.90   1.19   0.88   1.05   0.93   0.76   0.96   0.76   0.74   0.60
tenor:NN   0.66   0.82   1.24   0.79   1.58   0.79   0.67   0.82   0.71   0.70   0.28
Some:DT   1.28   1.22   1.44   0.96   1.16   1.25   1.31   1.25   1.31   1.18   1.00
Italian:JJ   1.02   1.22   0.74   1.13   1.30   1.22   1.02   1.22   1.02   1.19   1.13
men:NNS   0.71   0.82   1.24   0.28   1.58   0.72   0.71   0.77   0.71   0.99   0.79
are:VBP   0.70   1.32   1.28   0.91   0.54   0.85   0.76   0.14   0.76   0.96   0.91
great:JJ   0.72   0.90   1.19   0.88   1.05   0.93   0.76   0.96   0.76   0.74   0.60
tenors:NNS   0.65   0.82   1.23   0.66   1.58   0.69   0.71   0.82   0.71   0.99   0.99
NO_WORD   0.29   0.17   0.11   0.28   0.92   0.25   1.55   0.34   0.82   0.11   0.10

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.48 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  10.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.09 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "great" modifying "tenor"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "great" modifying "tenors" is dropped on aligned hypothesis word "tenor"
-0.10  1.00 NullPunisher.functionWord : There
-1.00  1.00 NullPunisher.other : great
-1.00  1.00 NullPunisher.other : men
-0.10  1.00 NullPunisher.article : a
-3.00  1.00 NullPunisher.entity : Italian
-0.05  1.00 NullPunisher.aux : are
-1.00  1.00 NullPunisher.other : want
-0.10  1.00 NullPunisher.functionWord : who
-0.05  1.00 NullPunisher.aux : be
-0.10  1.00 NullPunisher.functionWord : to
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -9.8908
Threshold: -9.4738


Inference ID: 004

Txt: Each Italian tenor wants to be great. Some Italian tenors are great.

Hyp: There are Italian tenors who want to be great. (yes)

There
EX
are
VBP
Italian
JJ
tenors
RB
who
WP
want
VBP
to
TO
be
VB
great
JJ
Each:DT   1.31   1.22   1.44   1.45   1.16   1.20   1.31   1.25   1.16
Italian:JJ   1.02   1.22   0.74   1.25   1.30   1.22   1.02   1.22   1.19
tenor:NN   0.66   0.82   1.24   0.48   1.58   0.79   0.67   0.82   0.70
wants:VBZ   0.76   0.80   1.28   1.05   0.54   1.10   0.76   0.80   0.93
to:TO   1.31   1.25   1.44   1.45   1.12   1.25   2.42   1.25   1.18
be:VB   0.76   0.14   1.28   1.11   0.54   0.88   0.76   1.32   1.02
great:JJ   0.72   0.90   1.19   1.01   1.05   0.93   0.76   0.96   0.74
Some:DT   1.28   1.22   1.44   1.45   1.16   1.25   1.31   1.25   1.18
Italian:JJ   1.02   1.22   0.74   1.25   1.30   1.22   1.02   1.22   1.19
tenors:NNS   0.65   0.82   1.23   2.58   1.58   0.69   0.71   0.82   0.99
are:VBP   0.70   1.32   1.28   1.11   0.54   0.85   0.76   0.14   0.96
great:JJ   0.72   0.90   1.19   1.01   1.05   0.93   0.76   0.96   0.74
NO_WORD   0.29   0.17   0.09   0.04   1.07   0.31   1.55   1.43   0.32

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.91 Alignment.score
 1.00  0.34 Alignment.isGood
-1.00  0.64 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.22 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "Italian" modifying "tenors" is dropped on aligned hypothesis word "tenors"
-0.10  1.00 NullPunisher.functionWord : who
-0.10  1.00 NullPunisher.functionWord : There
-0.05  1.00 NullPunisher.aux : be
-1.00  1.00 NullPunisher.other : great
-0.05  1.00 NullPunisher.aux : are
-3.00  1.00 NullPunisher.entity : Italian
-0.10  1.00 NullPunisher.functionWord : to
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -5.8723
Threshold: -9.4738


Inference ID: 005

Txt: The really ambitious tenors are Italian.

Hyp: There are really ambitious tenors who are Italian. (yes)

There
EX
are
VBP
really
RB
ambitious
JJ
tenors
RB
who
WP
are
VBP
Italian
JJ
The:DT   1.20   1.18   1.45   1.18   1.45   1.09   1.18   1.44
really:RB   1.09   0.91   1.23   1.05   0.70   1.05   0.91   1.25
ambitious:JJ   0.76   0.96   1.01   0.74   0.82   1.05   0.96   1.19
tenors:NNS   0.65   0.82   1.15   0.79   2.58   1.58   0.82   1.23
are:VBP   0.70   1.32   1.11   1.02   1.11   0.54   1.32   1.28
Italian:JJ   1.02   1.22   1.22   1.19   1.25   1.30   1.22   0.74
NO_WORD   0.29   0.17   0.04   0.09   0.04   1.07   0.31   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.53 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.12 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "really" modifying "ambitious"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "ambitious" modifying "tenors" is dropped on aligned hypothesis word "tenors"
-1.00  1.00 NullPunisher.other : ambitious
-1.00  1.00 NullPunisher.other : really
-0.05  1.00 NullPunisher.aux : are
-0.05  1.00 NullPunisher.aux : are
-0.10  1.00 NullPunisher.functionWord : There
-3.00  1.00 NullPunisher.entity : Italian
-0.10  1.00 NullPunisher.functionWord : who
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -8.3104
Threshold: -9.4738


Inference ID: 006

Txt: No really great tenors are modest.

Hyp: There are really great tenors who are modest. (don't know)

There
EX
are
VBP
really
RB
great
JJ
tenors
RB
who
WP
are
VBP
modest
JJ
No:DT   1.31   1.25   1.45   1.18   1.45   1.12   1.25   1.18
really:RB   1.09   0.91   1.23   0.58   0.70   1.05   0.91   1.05
great:JJ   0.72   0.90   0.54   0.74   1.01   1.05   0.90   0.70
tenors:NNS   0.65   0.82   1.15   0.99   2.58   1.58   0.82   0.89
are:VBP   0.70   1.32   1.11   0.96   1.11   0.54   1.32   1.02
modest:JJ   0.74   0.96   1.01   0.70   0.92   1.05   0.96   0.74
NO_WORD   0.29   0.17   0.04   0.09   0.04   1.07   0.31   0.09

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.53 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.12 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "really" modifying "great"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "great" modifying "tenors" is dropped on aligned hypothesis word "tenors"
-0.05  1.00 NullPunisher.aux : are
-0.10  1.00 NullPunisher.functionWord : There
-0.10  1.00 NullPunisher.functionWord : who
-1.00  1.00 NullPunisher.other : really
-1.00  1.00 NullPunisher.other : modest
-1.00  1.00 NullPunisher.other : great
-0.05  1.00 NullPunisher.aux : are
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -6.3104
Threshold: -9.4738


Inference ID: 007

Txt: Some great tenors are Swedish.

Hyp: There are great tenors who are Swedish. (yes)

There
EX
are
VBP
great
JJ
tenors
JJ
who
WP
are
VBP
Swedish
NNP
Some:DT   1.28   1.22   1.18   1.18   1.16   1.22   1.25
great:JJ   0.72   0.90   0.74   0.93   1.05   0.90   1.13
tenors:NNS   0.65   0.82   0.99   1.93   1.58   0.82   0.97
are:VBP   0.70   1.32   0.96   1.02   0.54   1.32   1.16
Swedish:NNP   0.96   1.07   1.24   1.23   1.84   1.07   0.28
NO_WORD   0.29   0.17   0.09   0.29   0.54   0.36   0.10

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.47 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.14 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "great" modifying "tenors" is dropped on aligned hypothesis word "tenors"
-0.05  1.00 NullPunisher.aux : are
-0.05  1.00 NullPunisher.aux : are
-3.00  1.00 NullPunisher.entity : Swedish
-0.10  1.00 NullPunisher.functionWord : who
-0.10  1.00 NullPunisher.functionWord : There
-1.00  1.00 NullPunisher.other : great
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -6.2944
Threshold: -9.4738


Inference ID: 008

Txt: Many great tenors are German.

Hyp: There are great tenors who are German. (yes)

There
EX
are
VBP
great
JJ
tenors
RB
who
WP
are
VBP
German
JJ
Many:JJ   0.76   0.93   0.93   1.01   1.05   0.93   1.15
great:JJ   0.72   0.90   0.74   1.01   1.05   0.90   1.09
tenors:NNS   0.65   0.82   0.99   2.58   1.58   0.82   1.21
are:VBP   0.70   1.32   0.96   1.11   0.54   1.32   1.28
German:JJ   1.00   1.22   1.09   1.23   1.30   1.22   0.74
NO_WORD   0.29   0.17   0.09   0.04   1.07   0.31   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.61 Alignment.score
 1.00  0.27 Alignment.isGood
-1.00  0.70 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.14 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "great" modifying "tenors" is dropped on aligned hypothesis word "tenors"
-3.00  1.00 NullPunisher.entity : German
-0.05  1.00 NullPunisher.aux : are
-0.10  1.00 NullPunisher.functionWord : There
-0.10  1.00 NullPunisher.functionWord : who
-1.00  1.00 NullPunisher.other : great
-0.05  1.00 NullPunisher.aux : are
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -6.1099
Threshold: -9.4738


Inference ID: 009

Txt: Several great tenors are British.

Hyp: There are great tenors who are British. (yes)

There
EX
are
VBP
great
JJ
tenors
RB
who
WP
are
VBP
British
JJ
Several:JJ   0.72   0.96   0.90   0.96   1.05   0.96   1.19
great:JJ   0.72   0.90   0.74   1.01   1.05   0.90   1.19
tenors:NNS   0.65   0.82   0.99   2.58   1.58   0.82   1.23
are:VBP   0.70   1.32   0.96   1.11   0.54   1.32   1.28
British:JJ   1.02   1.22   1.19   1.25   1.30   1.22   0.74
NO_WORD   0.29   0.17   0.09   0.04   1.07   0.31   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.61 Alignment.score
 1.00  0.27 Alignment.isGood
-1.00  0.70 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.14 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "great" modifying "tenors" is dropped on aligned hypothesis word "tenors"
-0.10  1.00 NullPunisher.functionWord : There
-0.05  1.00 NullPunisher.aux : are
-0.05  1.00 NullPunisher.aux : are
-1.00  1.00 NullPunisher.other : great
-0.10  1.00 NullPunisher.functionWord : who
-3.00  1.00 NullPunisher.entity : British
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -6.1099
Threshold: -9.4738


Inference ID: 010

Txt: Most great tenors are Italian.

Hyp: There are great tenors who are Italian. (yes)

There
EX
are
VBP
great
JJ
tenors
RB
who
WP
are
VBP
Italian
JJ
Most:JJS   0.76   0.96   0.91   1.01   1.05   0.96   1.19
great:JJ   0.72   0.90   0.74   1.01   1.05   0.90   1.19
tenors:NNS   0.65   0.82   0.99   2.58   1.58   0.82   1.23
are:VBP   0.70   1.32   0.96   1.11   0.54   1.32   1.28
Italian:JJ   1.02   1.22   1.19   1.25   1.30   1.22   0.74
NO_WORD   0.29   0.17   0.09   0.04   1.07   0.31   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.61 Alignment.score
 1.00  0.27 Alignment.isGood
-1.00  0.70 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.14 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "great" modifying "tenors" is dropped on aligned hypothesis word "tenors"
-3.00  1.00 NullPunisher.entity : Italian
-0.10  1.00 NullPunisher.functionWord : who
-0.10  1.00 NullPunisher.functionWord : There
-0.05  1.00 NullPunisher.aux : are
-1.00  1.00 NullPunisher.other : great
-0.05  1.00 NullPunisher.aux : are
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -6.1099
Threshold: -9.4738


Inference ID: 011

Txt: A few great tenors sing popular music. Some great tenors like popular music.

Hyp: There are great tenors who sing popular music. (yes)

There
EX
are
VBP
great
JJ
tenors
RB
who
WP
sing
VBP
popular_music
NN
A:DT   1.31   1.25   1.18   1.45   1.16   1.25   1.00
few:JJ   0.76   0.96   0.93   1.01   1.05   0.96   0.88
great:JJ   0.72   0.90   0.74   1.01   1.05   0.87   0.62
tenors:NNS   0.65   0.82   0.99   2.58   1.58   0.47   0.55
sing:VBP   0.76   0.72   0.93   0.77   0.54   1.32   0.44
popular_music:NN   0.71   0.82   0.73   1.11   1.58   0.35   0.28
Some:DT   1.28   1.22   1.18   1.45   1.16   1.20   1.00
great:JJ   0.72   0.90   0.74   1.01   1.05   0.87   0.62
tenors:NNS   0.65   0.82   0.99   2.58   1.58   0.47   0.55
like:VBP   0.73   0.76   0.61   1.11   0.54   0.13   0.66
popular_music:NN   0.71   0.82   0.73   1.11   1.58   0.35   0.28
NO_WORD   0.29   0.17   0.09   0.04   1.07   0.31   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.63 Alignment.score
 1.00  0.28 Alignment.isGood
-1.00  0.70 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.14 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "great" modifying "tenors" is dropped on aligned hypothesis word "tenors"
-1.00  1.00 NullPunisher.other : great
-0.10  1.00 NullPunisher.functionWord : who
-1.00  1.00 NullPunisher.other : popular_music
-0.05  1.00 NullPunisher.aux : are
-0.10  1.00 NullPunisher.functionWord : There
-1.00  1.00 NullPunisher.other : sing
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -5.0239
Threshold: -9.4738


Inference ID: 013

Txt: Both leading tenors are excellent. Leading tenors who are excellent are indispensable.

Hyp: Both leading tenors are indispensable. (yes)

Both
DT
leading
JJ
tenors
NNS
are
VBP
indispensable
JJ
Both:DT   2.42   1.18   1.00   1.25   1.18
leading:JJ   0.76   0.74   0.86   0.96   0.75
tenors:NNS   0.71   0.97   0.28   0.82   0.83
are:VBP   0.76   1.02   0.91   1.32   1.02
excellent:JJ   0.76   0.91   0.88   0.96   0.66
Leading:VBG   0.76   0.84   0.89   0.81   1.02
tenors:NNS   0.71   0.97   0.28   0.82   0.83
who:WP   1.11   1.39   1.30   0.97   1.39
are:VBP   0.76   1.02   0.91   1.32   1.02
excellent:JJ   0.76   0.91   0.88   0.96   0.66
are:VBP   0.76   1.02   0.91   1.32   1.02
indispensable:JJ   0.76   0.75   0.72   0.96   0.74
NO_WORD   0.82   0.11   0.28   1.43   0.16

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.59 Alignment.score
 1.00  0.27 Alignment.isGood
-1.00  0.70 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.20 Alignment.txtSpan
-0.05  1.00 NullPunisher.aux : are
-1.00  1.00 NullPunisher.other : Both
-1.00  1.00 NullPunisher.other : indispensable
-1.00  1.00 NullPunisher.other : tenors
-2.00  1.00 RootEntailment.unalignedRoot : "indispensable" not aligned to anything
-1.00  1.00 Structure.relMismatch : hypothesis noun modifier "leading" and aligned text noun modifier "tenors" bear the same relation ("amod") but different parents ("tenors" vs. "tenors")
Hand-tuned score (dot product of above): -6.1702
Threshold: -9.4738


Inference ID: 014

Txt: Neither leading tenor comes cheap. One of the leading tenors is Pavarotti.

Hyp: Pavarotti is a leading tenor who comes cheap. (don't know)

Pavarotti
NNP
is
VBZ
a
DT
leading
VBG
tenor
NN
who
WP
comes
VBZ
cheap
JJ
Neither:DT   1.25   1.25   1.31   1.22   0.96   1.16   1.25   1.18
leading:JJ   1.13   0.96   0.76   1.16   0.88   1.05   0.95   0.93
tenor:NN   1.07   0.82   0.71   0.82   0.28   1.58   0.82   0.99
comes:VBZ   1.16   0.81   0.76   0.67   0.91   0.54   1.32   0.81
cheap:JJ   1.13   0.96   0.76   0.96   0.88   1.05   0.75   0.74
One:CD   1.24   1.30   0.89   1.30   1.33   1.32   1.25   1.00
the:DT   1.25   1.25   1.31   1.25   1.00   1.09   1.25   1.13
leading:JJ   1.13   0.96   0.76   1.16   0.88   1.05   0.95   0.93
tenors:NNS   1.07   0.82   0.71   0.80   0.99   1.58   0.80   0.99
is:VBZ   1.16   1.32   0.76   0.89   0.91   0.54   0.90   1.02
Pavarotti:NNP   0.28   1.07   0.96   1.07   0.99   1.84   1.07   1.24
NO_WORD   0.28   1.43   0.82   0.12   0.28   0.92   0.25   0.09

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.75 Alignment.score
 1.00  0.30 Alignment.isGood
-1.00  0.67 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "comes" modifying "tenor"
-1.00  1.00 NullPunisher.other : cheap
-0.10  1.00 NullPunisher.article : a
-3.00  1.00 NullPunisher.entity : Pavarotti
-1.00  1.00 NullPunisher.other : comes
-0.10  1.00 NullPunisher.functionWord : who
-0.05  1.00 NullPunisher.aux : is
 1.00  1.00 Quant.equivalent : Replacing the quantifier "the" by an equivalent quantifier "a" preserves truth.
Hand-tuned score (dot product of above): -5.2429
Threshold: -9.4738


Inference ID: 015

Txt: At least three tenors will take part in the concert.

Hyp: There are tenors who will take part in the concert. (yes)

There
EX
are
VBP
tenors
JJ
who
WP
will
MD
take_part
VB
the
DT
concert
NN
At:IN   1.05   1.20   0.77   1.57   1.05   1.24   1.05   1.31
least:JJS   0.76   0.96   0.91   1.05   0.76   0.93   0.76   0.88
three:CD   0.76   1.25   0.83   1.32   0.89   1.16   0.78   1.33
tenors:NNS   0.65   0.82   1.93   1.58   0.71   0.77   0.71   0.55
will:MD   1.31   1.60   1.18   1.13   2.42   1.37   1.31   1.00
take_part:VB   0.73   0.75   0.97   0.54   0.87   1.32   0.76   0.88
the:DT   1.20   1.18   1.18   1.09   1.31   1.25   2.42   1.00
concert:NN   0.67   0.82   0.73   1.58   0.71   0.79   0.71   0.28
NO_WORD   0.29   0.17   0.09   0.54   1.55   0.36   0.82   0.07

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.70 Alignment.score
 1.00  0.29 Alignment.isGood
-1.00  0.68 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.12 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "concert" modifying "take_part"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "three" modifying "tenors" is dropped on aligned hypothesis word "tenors"
-1.00  1.00 NullPunisher.other : take_part
-0.10  1.00 NullPunisher.functionWord : There
-0.10  1.00 NullPunisher.article : the
-0.10  1.00 NullPunisher.functionWord : who
-0.05  1.00 NullPunisher.aux : will
-1.00  1.00 NullPunisher.other : concert
-0.05  1.00 NullPunisher.aux : are
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -5.1777
Threshold: -9.4738


Inference ID: 017

Txt: An Irishman won the Nobel prize for literature.

Hyp: An Irishman won a Nobel prize. (yes)

An
DT
Irishman
NNP
won
VBD
a
DT
Nobel_prize
NN
An:DT   2.42   1.25   1.21   1.14   1.00
Irishman:NNP   0.96   0.28   1.07   0.96   1.07
won:VBD   0.71   1.16   1.32   0.76   0.57
the:DT   1.31   1.25   1.25   1.31   1.00
Nobel_prize:NN   0.71   1.07   0.49   0.71   0.28
literature:NN   0.71   1.07   0.82   0.71   0.38
NO_WORD   0.82   0.28   0.17   0.82   0.25

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.36 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : Nobel_prize
-0.10  1.00 NullPunisher.article : An
-1.00  1.00 NullPunisher.other : won
-0.10  1.00 NullPunisher.article : a
-3.00  1.00 NullPunisher.entity : Irishman
-2.00  1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -7.8694
Threshold: -9.4738


Inference ID: 018

Txt: Every European has the right to live in Europe. Every European is a person. Every person who has the right to live in Europe can travel freely within Europe.

Hyp: Every European can travel freely within Europe. (yes)

Every
DT
European
NNP
can
MD
travel
VB
freely
RB
Europe
NNP
Every:DT   2.42   1.24   1.31   1.23   1.39   1.23
European:NNP   0.95   0.28   0.96   1.04   1.53   0.28
has:VBZ   0.76   1.16   0.93   0.94   1.11   1.16
the:DT   1.31   1.25   1.31   1.23   1.45   1.25
right:NN   0.71   1.18   0.71   0.80   1.19   1.17
to:TO   1.31   1.25   1.31   1.25   1.45   1.25
live:VB   0.73   1.16   0.87   0.59   0.96   1.16
Europe:NNP   0.94   0.28   0.96   1.04   1.56   0.28
Every:DT   2.42   1.24   1.31   1.23   1.39   1.23
European:NNP   0.95   0.28   0.96   1.04   1.53   0.28
is:VBZ   0.76   1.16   1.06   1.00   1.11   1.16
a:DT   1.31   1.25   1.31   1.25   1.45   1.25
person:NN   0.69   1.11   0.71   0.59   1.22   1.17
Every:DT   2.42   1.24   1.31   1.23   1.39   1.23
person:NN   0.69   1.11   0.71   0.59   1.22   1.17
who:WP   1.11   1.55   1.18   0.97   1.05   1.55
has:VBZ   0.76   1.16   0.93   0.94   1.11   1.16
the:DT   1.31   1.25   1.31   1.23   1.45   1.25
right:NN   0.71   1.18   0.71   0.80   1.19   1.17
to:TO   1.31   1.25   1.31   1.25   1.45   1.25
live:VB   0.73   1.16   0.87   0.59   0.96   1.16
Europe:NNP   0.94   0.28   0.96   1.04   1.56   0.28
can:MD   1.31   1.25   2.42   1.51   1.45   1.25
travel:VB   0.74   1.13   1.02   1.32   0.90   1.13
freely:RB   1.03   1.44   1.09   0.70   1.23   1.47
Europe:NNP   0.94   0.28   0.96   1.04   1.56   0.28
NO_WORD   0.82   0.28   1.55   0.17   0.04   0.33

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.53 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
 2.00  1.00 Modal.yes : actual -> possible
-1.00  1.00 NullPunisher.other : travel
-0.05  1.00 NullPunisher.aux : can
-1.00  1.00 NullPunisher.other : Every
-1.00  1.00 NullPunisher.other : freely
-2.00  1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
Hand-tuned score (dot product of above): -4.2688
Threshold: -9.4738


Inference ID: 019

Txt: All Europeans have the right to live in Europe. Every European is a person. Every person who has the right to live in Europe can travel freely within Europe.

Hyp: All Europeans can travel freely within Europe. (yes)

All
DT
Europeans
NNPS
can
MD
travel
VB
freely
RB
Europe
NNP
All:DT   2.42   1.25   1.31   1.23   1.45   1.25
Europeans:NNPS   0.96   0.28   0.96   1.06   1.55   0.46
have:VBP   0.73   1.16   0.97   0.27   1.11   1.16
the:DT   1.31   1.25   1.31   1.23   1.45   1.25
right:NN   0.71   1.18   0.71   0.80   1.19   1.17
to:TO   1.31   1.25   1.31   1.25   1.45   1.25
live:VB   0.76   1.16   0.87   0.59   0.96   1.16
Europe:NNP   0.96   0.46   0.96   1.04   1.56   0.28
Every:DT   1.31   1.25   1.31   1.23   1.39   1.23
European:NNP   0.96   1.37   0.96   1.04   1.53   0.28
is:VBZ   0.76   1.16   1.06   1.00   1.11   1.16
a:DT   1.31   1.25   1.31   1.25   1.45   1.25
person:NN   0.71   1.12   0.71   0.59   1.22   1.17
Every:DT   1.31   1.25   1.31   1.23   1.39   1.23
person:NN   0.71   1.12   0.71   0.59   1.22   1.17
who:WP   1.11   1.55   1.18   0.97   1.05   1.55
has:VBZ   0.76   1.16   0.93   0.94   1.11   1.16
the:DT   1.31   1.25   1.31   1.23   1.45   1.25
right:NN   0.71   1.18   0.71   0.80   1.19   1.17
to:TO   1.31   1.25   1.31   1.25   1.45   1.25
live:VB   0.76   1.16   0.87   0.59   0.96   1.16
Europe:NNP   0.96   0.46   0.96   1.04   1.56   0.28
can:MD   1.31   1.25   2.42   1.51   1.45   1.25
travel:VB   0.73   1.15   1.02   1.32   0.90   1.13
freely:RB   1.09   1.46   1.09   0.70   1.23   1.47
Europe:NNP   0.96   0.46   0.96   1.04   1.56   0.28
NO_WORD   0.82   0.28   1.55   0.17   0.04   0.33

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.69 Alignment.score
 1.00  0.29 Alignment.isGood
-1.00  0.68 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.50 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
 2.00  1.00 Modal.yes : actual -> possible
-1.00  1.00 NullPunisher.other : freely
-0.05  1.00 NullPunisher.aux : can
-1.00  1.00 NullPunisher.other : All
 1.00  1.00 Quant.equivalent : Replacing the quantifier "every" by an equivalent quantifier "all" preserves truth.
-3.00  1.00 Structure.argsMismatch : args have different parents but same relations: text "European" <-nsubj-- "person" vs. hyp "Europeans" <-nsubj-- "travel", which aligned to text "have"
Hand-tuned score (dot product of above): -2.9090
Threshold: -9.4738


Inference ID: 020

Txt: Each European has the right to live in Europe. Every European is a person. Every person who has the right to live in Europe can travel freely within Europe.

Hyp: Each European can travel freely within Europe. (yes)

Each
DT
European
NNP
can
MD
travel
VB
freely
RB
Europe
NNP
Each:DT   2.42   1.00   1.53   1.51   1.71   1.17
European:NNP   0.71   0.28   0.96   1.04   1.53   0.28
has:VBZ   0.98   1.16   0.93   0.94   1.11   1.16
the:DT   1.56   1.25   1.31   1.23   1.45   1.25
right:NN   0.94   1.18   0.71   0.80   1.19   1.17
to:TO   1.56   1.25   1.31   1.25   1.45   1.25
live:VB   1.01   1.16   0.87   0.59   0.96   1.16
Europe:NNP   0.87   0.28   0.96   1.04   1.56   0.28
Every:DT   1.54   1.24   1.31   1.23   1.39   1.23
European:NNP   0.71   0.28   0.96   1.04   1.53   0.28
is:VBZ   1.01   1.16   1.06   1.00   1.11   1.16
a:DT   1.56   1.25   1.31   1.25   1.45   1.25
person:NN   0.96   1.11   0.71   0.59   1.22   1.17
Every:DT   1.54   1.24   1.31   1.23   1.39   1.23
person:NN   0.96   1.11   0.71   0.59   1.22   1.17
who:WP   1.37   1.55   1.18   0.97   1.05   1.55
has:VBZ   0.98   1.16   0.93   0.94   1.11   1.16
the:DT   1.56   1.25   1.31   1.23   1.45   1.25
right:NN   0.94   1.18   0.71   0.80   1.19   1.17
to:TO   1.56   1.25   1.31   1.25   1.45   1.25
live:VB   1.01   1.16   0.87   0.59   0.96   1.16
Europe:NNP   0.87   0.28   0.96   1.04   1.56   0.28
can:MD   1.53   1.25   2.42   1.51   1.45   1.25
travel:VB   1.01   1.13   1.02   1.32   0.90   1.13
freely:RB   1.35   1.44   1.09   0.70   1.23   1.47
Europe:NNP   0.87   0.28   0.96   1.04   1.56   0.28
NO_WORD   0.82   0.28   1.55   0.17   0.04   0.33

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.53 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
 2.00  1.00 Modal.yes : actual -> possible
-1.00  1.00 NullPunisher.other : travel
-0.05  1.00 NullPunisher.aux : can
-1.00  1.00 NullPunisher.other : freely
-3.00  1.00 NullPunisher.entity : Each
-2.00  1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
Hand-tuned score (dot product of above): -6.2688
Threshold: -9.4738


Inference ID: 021

Txt: The residents of member states have the right to live in Europe. All residents of member states are individuals. Every individual who has the right to live in Europe can travel freely within Europe.

Hyp: The residents of member states can travel freely within Europe. (yes)

The
DT
residents
NNS
member
NN
states
NNS
can
MD
travel
VB
freely
RB
Europe
NNP
The:DT   2.42   1.00   1.00   0.97   1.31   1.23   1.45   1.25
residents:NNS   0.71   0.28   0.79   0.87   0.71   0.69   1.27   1.09
member:NN   0.71   0.79   0.28   0.80   0.71   0.75   1.27   1.13
states:NNS   0.68   0.87   0.80   0.28   0.71   0.60   1.20   1.15
have:VBP   0.73   0.91   0.91   0.86   0.97   0.27   1.11   1.16
the:DT   0.01   1.00   1.00   0.97   1.31   1.23   1.45   1.25
right:NN   0.71   0.87   0.93   0.99   0.71   0.80   1.19   1.17
to:TO   1.26   1.00   1.00   1.00   1.31   1.25   1.45   1.25
live:VB   0.73   0.58   0.91   0.80   0.87   0.59   0.96   1.16
Europe:NNP   0.96   1.09   1.13   1.15   0.96   1.04   1.56   0.28
All:DT   1.31   1.00   1.00   1.00   1.31   1.23   1.45   1.25
residents:NNS   0.71   0.28   0.79   0.87   0.71   0.69   1.27   1.09
member:NN   0.71   0.79   0.28   0.80   0.71   0.75   1.27   1.13
states:NNS   0.68   0.87   0.80   0.28   0.71   0.60   1.20   1.15
are:VBP   0.68   0.91   0.91   0.88   1.06   0.98   1.09   1.14
individuals:NNS   0.71   0.74   0.85   0.94   0.71   0.66   0.97   1.21
Every:DT   1.31   1.00   0.94   1.00   1.31   1.23   1.39   1.23
individual:NN   0.71   0.86   0.88   0.99   0.71   0.82   1.07   1.21
who:WP   1.04   1.30   1.30   1.30   1.18   0.97   1.05   1.55
has:VBZ   0.76   0.91   0.91   0.88   0.93   0.94   1.11   1.16
the:DT   0.01   1.00   1.00   0.97   1.31   1.23   1.45   1.25
right:NN   0.71   0.87   0.93   0.99   0.71   0.80   1.19   1.17
to:TO   1.26   1.00   1.00   1.00   1.31   1.25   1.45   1.25
live:VB   0.73   0.58   0.91   0.80   0.87   0.59   0.96   1.16
Europe:NNP   0.96   1.09   1.13   1.15   0.96   1.04   1.56   0.28
can:MD   1.31   1.00   1.00   1.00   2.42   1.51   1.45   1.25
travel:VB   0.73   0.78   0.84   0.69   1.02   1.32   0.90   1.13
freely:RB   1.09   1.18   1.18   1.11   1.09   0.70   1.23   1.47
Europe:NNP   0.96   1.09   1.13   1.15   0.96   1.04   1.56   0.28
NO_WORD   0.82   0.28   0.05   0.04   1.55   0.17   0.04   0.33

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.49 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  3.00 Alignment.hypSpan
 0.10  0.50 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
 2.00  1.00 Modal.yes : actual -> possible
-0.10  1.00 NullPunisher.article : The
-0.05  1.00 NullPunisher.aux : can
-1.00  1.00 NullPunisher.other : freely
 2.00  1.00 Location.match : Locations match between txt and hyp; both have lemma "Europe"
Hand-tuned score (dot product of above): 1.9116
Threshold: -9.4738


Inference ID: 022

Txt: No delegate finished the report on time.

Hyp: No delegate finished the report. (don't know)

No
DT
delegate
NN
finished
VBD
the
DT
report
NN
No:DT   2.42   1.00   1.25   1.31   1.00
delegate:NN   0.71   0.28   0.74   0.71   0.74
finished:VBD   0.76   0.83   1.32   0.76   0.86
the:DT   1.31   1.00   1.25   2.42   1.00
report:NN   0.71   0.74   0.77   0.71   0.28
time:NN   0.71   0.86   0.68   0.61   0.91
NO_WORD   0.82   0.28   0.17   0.82   0.09

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.32 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : finished
-1.00  1.00 NullPunisher.other : report
-1.00  1.00 NullPunisher.other : No
-1.00  1.00 NullPunisher.other : delegate
-0.10  1.00 NullPunisher.article : the
-2.00  1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -6.8134
Threshold: -9.4738


Inference ID: 023

Txt: Some delegates finished the survey on time.

Hyp: Some delegates finished the survey. (yes)

Some
DT
delegates
NNS
finished
VBD
the
DT
survey
NN
Some:DT   2.42   1.00   1.25   1.28   0.95
delegates:NNS   0.71   0.28   0.78   0.71   0.53
finished:VBD   0.76   0.87   1.32   0.76   0.91
the:DT   1.28   1.00   1.25   2.42   1.00
survey:NN   0.66   0.53   0.82   0.71   0.28
time:NN   0.60   0.86   0.68   0.61   0.84
NO_WORD   0.82   0.28   0.17   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.32 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : survey
-1.00  1.00 NullPunisher.other : Some
-1.00  1.00 NullPunisher.other : delegates
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : finished
-2.00  1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -6.8134
Threshold: -9.4738


Inference ID: 024

Txt: Many delegates obtained interesting results from the survey.

Hyp: Many delegates obtained results from the survey. (yes)

Many
JJ
delegates
NNS
obtained
VBD
results
NNS
the
DT
survey
NN
Many:JJ   0.74   0.88   0.96   0.88   0.76   0.88
delegates:NNS   0.99   0.28   0.80   0.63   0.71   0.53
obtained:VBD   1.02   0.89   1.32   0.85   0.76   0.61
interesting:JJ   0.93   0.77   0.90   0.71   0.76   0.74
results:NNS   0.99   0.63   0.76   0.28   0.71   0.63
the:DT   1.18   1.00   1.25   1.00   2.42   1.00
survey:NN   0.99   0.53   0.52   0.63   0.71   0.28
NO_WORD   0.11   0.28   0.17   0.09   0.82   0.15

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.13 Alignment.score
 1.00  0.19 Alignment.isGood
-1.00  0.79 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "survey" modifying "obtained"
-1.00  1.00 NullPunisher.other : results
-1.00  1.00 NullPunisher.other : delegates
-1.00  1.00 NullPunisher.other : obtained
-1.00  1.00 NullPunisher.other : Many
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : survey
-2.00  1.00 RootEntailment.unalignedRoot : "obtained" not aligned to anything
Hand-tuned score (dot product of above): -9.1748
Threshold: -9.4738


Inference ID: 025

Txt: Several delegates got the results published in major national newspapers.

Hyp: Several delegates got the results published. (yes)

Several
JJ
delegates
NNS
got
VBD
the
DT
results
NNS
published
VBD
Several:JJ   0.74   0.82   0.96   0.76   0.84   0.96
delegates:NNS   0.93   0.28   0.82   0.71   0.63   0.71
got:VBD   1.02   0.91   1.32   0.76   0.91   0.74
the:DT   1.18   1.00   1.25   2.42   1.00   1.25
results:NNS   0.95   0.63   0.82   0.71   0.28   0.64
published:VBN   1.02   0.80   0.74   0.76   0.73   2.02
major:JJ   0.93   0.88   0.96   0.76   0.88   0.95
national:JJ   0.89   0.75   0.96   0.76   0.88   0.83
newspapers:NNS   0.97   0.56   0.77   0.71   0.82   0.56
NO_WORD   0.11   0.28   0.17   0.82   0.28   0.36

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.49 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.33 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Several" modifying "delegates"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "newspapers" modifying "published" is dropped on aligned hypothesis word "published"
-1.00  1.00 NullPunisher.other : delegates
-1.00  1.00 NullPunisher.other : Several
-1.00  1.00 NullPunisher.other : got
-0.10  1.00 NullPunisher.article : the
-2.00  1.00 RootEntailment.unalignedRoot : "got" not aligned to anything
Hand-tuned score (dot product of above): -5.7572
Threshold: -9.4738


Inference ID: 026

Txt: Most Europeans are resident in Europe. All Europeans are people. All people who are resident in Europe can travel freely within Europe.

Hyp: Most Europeans can travel freely within Europe. (yes)

Most
JJS
Europeans
NNPS
can
MD
travel
VB
freely
RB
Europe
NNP
Most:JJS   0.74   1.13   0.76   0.96   1.01   1.13
Europeans:NNPS   1.24   0.28   0.96   1.06   1.55   0.46
are:VBP   1.02   1.16   1.06   0.98   1.09   1.14
resident:VBN   1.02   1.12   0.76   0.23   1.08   1.16
Europe:NNP   1.24   0.46   0.96   1.04   1.56   0.28
All:DT   1.18   1.25   1.31   1.23   1.45   1.25
Europeans:NNPS   1.24   0.28   0.96   1.06   1.55   0.46
are:VBP   1.02   1.16   1.06   0.98   1.09   1.14
people:NNS   0.99   1.21   0.71   0.66   1.22   1.14
All:DT   1.18   1.25   1.31   1.23   1.45   1.25
people:NNS   0.99   1.21   0.71   0.66   1.22   1.14
who:WP   1.39   1.55   1.18   0.97   1.05   1.55
are:VBP   1.02   1.16   1.06   0.98   1.09   1.14
resident:NN   0.99   0.89   0.71   0.49   1.27   1.09
Europe:NNP   1.24   0.46   0.96   1.04   1.56   0.28
can:MD   1.18   1.25   2.42   1.51   1.45   1.25
travel:VB   1.02   1.15   1.02   1.32   0.90   1.13
freely:RB   1.05   1.46   1.09   0.70   1.23   1.47
Europe:NNP   1.24   0.46   0.96   1.04   1.56   0.28
NO_WORD   0.11   0.28   1.55   0.17   0.04   0.33

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.47 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "Most" modifying "Europeans" is dropped on aligned hypothesis word "Europe"
 2.00  1.00 Modal.yes : actual -> possible
 0.00  1.00 NegPolarity.hypNegWord : "Most": tag "JJS" is in NegPolarityMarkers list
-0.05  1.00 NullPunisher.aux : can
-1.00  1.00 NullPunisher.other : freely
-1.00  1.00 NullPunisher.other : travel
-1.00  1.00 NullPunisher.other : Most
-2.00  1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
Hand-tuned score (dot product of above): -3.8498
Threshold: -9.4738


Inference ID: 027

Txt: A few committee members are from Sweden. All committee members are people. All people who are from Sweden are from Scandinavia.

Hyp: At least a few committee members are from Scandinavia. (yes)

At
IN
least
JJS
a
DT
few
JJ
committee_members
NNS
are
VBP
Scandinavia
NNP
A:DT   1.00   1.18   0.13   1.18   1.00   1.25   1.25
few:JJ   1.05   0.93   0.76   0.74   0.88   0.96   1.13
committee_members:NNS   0.50   0.99   0.71   0.99   0.28   0.82   1.06
are:VBP   1.11   1.02   0.76   1.02   0.91   1.32   1.16
Sweden:NNP   0.75   1.22   0.96   1.24   1.07   1.07   0.76
All:DT   1.00   1.18   1.31   1.18   1.00   1.18   1.25
committee_members:NNS   0.50   0.99   0.71   0.99   0.28   0.82   1.06
are:VBP   1.11   1.02   0.76   1.02   0.91   1.32   1.16
people:NNS   0.50   0.97   0.71   0.99   0.84   0.82   1.13
All:DT   1.00   1.18   1.31   1.18   1.00   1.18   1.25
people:NNS   0.50   0.97   0.71   0.99   0.84   0.82   1.13
who:WP   1.05   1.39   1.11   1.39   1.30   0.97   1.55
are:VBP   1.11   1.02   0.76   1.02   0.91   1.32   1.16
Sweden:NNP   0.75   1.22   0.96   1.24   1.07   1.07   0.76
are:VBP   1.11   1.02   0.76   1.02   0.91   1.32   1.16
Scandinavia:NNP   0.75   1.24   0.96   1.24   1.06   1.07   0.28
NO_WORD   0.73   0.29   0.82   0.11   0.28   0.17   0.15

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.24 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Scandinavia" modifying "are"
 0.00  1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
 0.00  1.00 NegPolarity.hypNegWord : "few": is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : least
-0.05  1.00 NullPunisher.aux : are
-3.00  1.00 NullPunisher.entity : Scandinavia
-1.00  1.00 NullPunisher.other : At
-1.00  1.00 NullPunisher.other : committee_members
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : few
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -11.1707
Threshold: -9.4738


Inference ID: 028

Txt: Few committee members are from Portugal. All committee members are people. All people who are from Portugal are from southern Europe.

Hyp: There are few committee members from southern Europe. (don't know)

There
EX
are
VBP
few
JJ
committee_members
NNS
southern
JJ
Europe
NNP
Few:JJ   0.76   0.96   0.76   0.88   0.93   1.13
committee_members:NNS   0.71   0.82   0.99   0.28   0.99   1.11
are:VBP   0.70   1.32   1.02   0.91   1.02   1.14
Portugal:NNP   0.96   1.07   1.24   1.07   1.19   0.80
All:DT   1.31   1.18   1.18   1.00   1.18   1.25
committee_members:NNS   0.71   0.82   0.99   0.28   0.99   1.11
are:VBP   0.70   1.32   1.02   0.91   1.02   1.14
people:NNS   0.69   0.82   0.99   0.84   0.99   1.14
All:DT   1.31   1.18   1.18   1.00   1.18   1.25
people:NNS   0.69   0.82   0.99   0.84   0.99   1.14
who:WP   1.11   0.97   1.39   1.30   1.39   1.55
are:VBP   0.70   1.32   1.02   0.91   1.02   1.14
Portugal:NNP   0.96   1.07   1.24   1.07   1.19   0.80
are:VBP   0.70   1.32   1.02   0.91   1.02   1.14
southern:NNP   0.62   0.82   0.99   0.82   1.87   1.07
Europe:NNP   0.94   1.05   1.24   1.11   1.24   0.28
NO_WORD   0.29   0.17   0.11   0.28   0.11   0.15

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.45 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Europe" modifying "committee_members"
 0.00  1.00 NegPolarity.hypNegWord : "few": is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : committee_members
-0.05  1.00 NullPunisher.aux : are
-0.10  1.00 NullPunisher.functionWord : There
-3.00  1.00 NullPunisher.entity : Europe
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
-1.00  1.00 Structure.relMismatch : hypothesis noun modifier "southern" and aligned text noun modifier "Europe" bear the same relation ("amod") but different parents ("Europe" vs. "Europe") hypothesis noun modifier "few" and aligned text noun modifier "committee_members" bear the same relation ("amod") but different parents ("committee_members" vs. "committee_members")
Hand-tuned score (dot product of above): -8.4799
Threshold: -9.4738


Inference ID: 029

Txt: Both commissioners used to be leading businessmen.

Hyp: Both commissioners used to be businessmen. (yes)

Both
DT
commissioners
NNS
used
VBD
to
TO
be
VB
businessmen
NNS
Both:DT   2.42   1.00   1.25   1.31   1.25   1.00
commissioners:NNS   0.71   0.28   0.82   0.71   0.82   0.76
used:VBD   0.76   0.91   1.32   0.76   0.93   0.91
to:TO   1.31   1.00   1.25   2.42   1.25   1.00
be:VB   0.76   0.91   0.93   0.76   1.32   0.91
leading:VBG   0.76   0.91   0.63   0.76   0.81   0.80
businessmen:NNS   0.71   0.76   0.82   0.71   0.82   0.28
NO_WORD   0.82   0.28   0.17   1.55   1.43   0.28

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.66 Alignment.score
 1.00  0.28 Alignment.isGood
-1.00  0.69 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : businessmen
-0.10  1.00 NullPunisher.functionWord : to
-1.00  1.00 NullPunisher.other : Both
-1.00  1.00 NullPunisher.other : used
-1.00  1.00 NullPunisher.other : commissioners
-0.05  1.00 NullPunisher.aux : be
-2.00  1.00 RootEntailment.unalignedRoot : "used" not aligned to anything
Hand-tuned score (dot product of above): -6.4921
Threshold: -9.4738


Inference ID: 030

Txt: Neither commissioner spends a lot of time at home.

Hyp: Neither commissioner spends time at home. (don't know)

Neither
DT
commissioner
NN
spends
VBZ
time
NN
at_home
IN
Neither:DT   2.42   0.98   1.25   0.98   1.05
commissioner:NN   0.69   0.28   0.77   0.87   0.50
spends:VBZ   0.76   0.86   1.32   0.59   1.07
a:DT   1.31   1.00   1.25   1.00   1.05
lot:NN   0.71   0.87   0.70   0.62   0.45
of:IN   1.05   1.31   1.24   1.31   1.05
time:NN   0.69   0.87   0.50   0.28   0.30
at_home:IN   1.05   1.31   1.15   1.11   1.59
NO_WORD   0.82   0.28   0.17   0.09   0.99

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.36 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : at_home
-1.00  1.00 NullPunisher.other : spends
-1.00  1.00 NullPunisher.other : Neither
-1.00  1.00 NullPunisher.other : commissioner
-1.00  1.00 NullPunisher.other : time
-2.00  1.00 RootEntailment.unalignedRoot : "spends" not aligned to anything
Hand-tuned score (dot product of above): -7.6657
Threshold: -9.4738


Inference ID: 031

Txt: At least three commissioners spend a lot of time at home.

Hyp: At least three commissioners spend time at home. (yes)

At
IN
least
JJS
three
CD
commissioners
NNS
spend
VBP
time
NN
at_home
IN
At:IN   1.59   0.74   1.30   1.31   1.24   1.31   1.05
least:JJS   1.01   0.74   1.13   0.88   0.96   0.88   1.05
three:CD   1.30   1.00   0.85   1.18   1.22   1.01   1.23
commissioners:NNS   0.50   0.99   1.15   0.28   0.82   0.87   0.50
spend:VBP   1.15   1.02   1.31   0.91   1.32   0.51   1.07
a:DT   1.00   1.18   1.18   1.00   1.25   1.00   1.05
lot:NN   0.45   0.93   1.27   0.87   0.53   0.62   0.45
time:NN   0.50   0.99   0.98   0.87   0.42   0.28   0.30
at_home:IN   1.05   0.77   1.23   1.31   1.15   1.11   1.59
NO_WORD   0.84   0.29   0.52   0.28   0.17   0.09   1.16

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.30 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.29 Alignment.txtSpan
 0.00  1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : time
-1.00  1.00 NullPunisher.other : least
-1.00  1.00 NullPunisher.other : spend
-1.00  1.00 NullPunisher.other : At
-1.00  1.00 NullPunisher.other : at_home
-2.00  1.00 RootEntailment.unalignedRoot : "spend" not aligned to anything
Hand-tuned score (dot product of above): -7.5125
Threshold: -9.4738


Inference ID: 032

Txt: At most ten commissioners spend a lot of time at home.

Hyp: At most ten commissioners spend time at home. (don't know)

At
IN
most
JJS
ten
NN
commissioners
NNS
spend
VBP
time
NN
at_home
IN
At:IN   1.59   0.77   1.31   1.31   1.24   1.31   1.05
most:RBS   1.05   1.44   1.21   1.21   0.91   1.21   1.05
ten:NN   0.50   0.99   0.28   0.71   0.75   0.71   0.50
commissioners:NNS   0.50   0.99   0.71   0.28   0.82   0.87   0.50
spend:VBP   1.15   1.02   0.84   0.91   1.32   0.51   1.07
a:DT   1.00   1.18   1.00   1.00   1.25   1.00   1.05
lot:NN   0.45   0.89   0.76   0.87   0.53   0.62   0.45
time:NN   0.50   0.99   0.71   0.87   0.42   0.28   0.30
at_home:IN   1.05   0.77   1.31   1.31   1.15   1.11   1.59
NO_WORD   1.16   0.29   0.05   0.28   0.17   0.09   1.16

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.63 Alignment.score
 1.00  0.28 Alignment.isGood
-1.00  0.70 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  3.00 Alignment.hypSpan
 0.10  0.43 Alignment.txtSpan
 0.00  1.00 NegPolarity.hypNegWord : "most": tag "JJS" is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : time
-1.00  1.00 NullPunisher.other : At
-1.00  1.00 NullPunisher.other : at_home
-1.00  1.00 NullPunisher.other : spend
-2.00  1.00 RootEntailment.unalignedRoot : "spend" not aligned to anything
Hand-tuned score (dot product of above): -5.8497
Threshold: -9.4738


Inference ID: 033

Txt: An Irishman won a Nobel prize.

Hyp: An Irishman won the Nobel prize for literature. (don't know)

An
DT
Irishman
NNP
won
VBD
the
DT
Nobel_prize
NN
literature
NN
An:DT   2.42   1.25   1.21   1.31   1.00   1.00
Irishman:NNP   0.96   0.28   1.07   0.96   1.07   1.07
won:VBD   0.71   1.16   1.32   0.76   0.57   0.91
a:DT   1.14   1.25   1.25   1.31   1.00   1.00
Nobel_prize:NN   0.71   1.07   0.49   0.71   0.28   0.38
NO_WORD   0.82   0.28   0.17   0.82   0.09   0.05

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.28 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "literature" modifying "won"
-1.00  1.00 NullPunisher.other : Nobel_prize
-1.00  1.00 NullPunisher.other : won
-1.00  1.00 NullPunisher.other : literature
-0.10  1.00 NullPunisher.article : An
-3.00  1.00 NullPunisher.entity : Irishman
-0.10  1.00 NullPunisher.article : the
-2.00  1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -10.0749
Threshold: -9.4738


Inference ID: 034

Txt: Every European can travel freely within Europe. Every European is a person. Every person who has the right to live in Europe can travel freely within Europe.

Hyp: Every European has the right to live in Europe. (don't know)

Every
DT
European
NNP
has
VBZ
the
DT
right
NN
to
TO
live
VB
Europe
NNP
Every:DT   2.42   1.24   1.25   1.31   1.00   1.31   1.23   1.23
European:NNP   0.95   0.28   1.07   0.96   1.18   0.96   1.07   0.28
can:MD   1.31   1.25   1.43   1.31   1.00   1.31   1.37   1.25
travel:VB   0.74   1.13   0.94   0.73   0.89   0.76   0.59   1.13
freely:RB   1.03   1.44   0.91   1.09   1.10   1.09   0.76   1.47
Europe:NNP   0.94   0.28   1.07   0.96   1.17   0.96   1.07   0.28
Every:DT   2.42   1.24   1.25   1.31   1.00   1.31   1.23   1.23
European:NNP   0.95   0.28   1.07   0.96   1.18   0.96   1.07   0.28
is:VBZ   0.76   1.16   1.01   0.76   0.91   0.76   0.39   1.16
a:DT   1.31   1.25   1.25   1.31   1.00   1.31   1.25   1.25
person:NN   0.69   1.11   0.82   0.71   0.84   0.71   0.65   1.17
Every:DT   2.42   1.24   1.25   1.31   1.00   1.31   1.23   1.23
person:NN   0.69   1.11   0.82   0.71   0.84   0.71   0.65   1.17
who:WP   1.11   1.55   0.97   1.04   1.30   1.06   0.97   1.55
has:VBZ   0.76   1.16   1.32   0.76   0.91   0.76   0.75   1.16
the:DT   1.31   1.25   1.25   2.42   1.00   1.26   1.22   1.25
right:NN   0.71   1.18   0.82   0.71   0.28   0.71   0.79   1.17
to:TO   1.31   1.25   1.25   1.26   1.00   2.42   1.25   1.25
live:VB   0.73   1.16   0.75   0.73   0.88   0.76   1.32   1.16
Europe:NNP   0.94   0.28   1.07   0.96   1.17   0.96   1.07   0.28
can:MD   1.31   1.25   1.43   1.31   1.00   1.31   1.37   1.25
travel:VB   0.74   1.13   0.94   0.73   0.89   0.76   0.59   1.13
freely:RB   1.03   1.44   0.91   1.09   1.10   1.09   0.76   1.47
Europe:NNP   0.94   0.28   1.07   0.96   1.17   0.96   1.07   0.28
NO_WORD   0.82   0.28   0.17   0.82   0.09   1.55   0.12   0.07

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.52 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.12 Alignment.txtSpan
-0.10  1.00 NullPunisher.functionWord : to
-1.00  1.00 NullPunisher.other : Every
-0.05  1.00 NullPunisher.aux : has
-1.00  1.00 NullPunisher.other : right
-1.00  1.00 NullPunisher.other : live
-0.10  1.00 NullPunisher.article : the
-2.00  1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -5.6849
Threshold: -9.4738


Inference ID: 035

Txt: All Europeans can travel freely within Europe. Every European is a person. Every person who has the right to live in Europe can travel freely within Europe.

Hyp: All Europeans have the right to live in Europe. (don't know)

All
DT
Europeans
NNPS
have
VBP
the
DT
right
NN
to
TO
live
VB
Europe
NNP
All:DT   2.42   1.25   1.22   1.31   1.00   1.31   1.25   1.25
Europeans:NNPS   0.96   0.28   1.07   0.96   1.18   0.96   1.07   0.46
can:MD   1.31   1.25   1.47   1.31   1.00   1.31   1.37   1.25
travel:VB   0.73   1.15   0.27   0.73   0.89   0.76   0.59   1.13
freely:RB   1.09   1.46   0.91   1.09   1.10   1.09   0.76   1.47
Europe:NNP   0.96   0.46   1.07   0.96   1.17   0.96   1.07   0.28
Every:DT   1.31   1.25   1.23   1.31   1.00   1.31   1.23   1.23
European:NNP   0.96   1.37   1.07   0.96   1.18   0.96   1.07   0.28
is:VBZ   0.76   1.16   0.98   0.76   0.91   0.76   0.39   1.16
a:DT   1.31   1.25   1.25   1.31   1.00   1.31   1.25   1.25
person:NN   0.71   1.12   0.82   0.71   0.84   0.71   0.65   1.17
Every:DT   1.31   1.25   1.23   1.31   1.00   1.31   1.23   1.23
person:NN   0.71   1.12   0.82   0.71   0.84   0.71   0.65   1.17
who:WP   1.11   1.55   0.97   1.04   1.30   1.06   0.97   1.55
has:VBZ   0.76   1.16   0.19   0.76   0.91   0.76   0.75   1.16
the:DT   1.31   1.25   1.22   2.42   1.00   1.26   1.22   1.25
right:NN   0.71   1.18   0.82   0.71   0.28   0.71   0.79   1.17
to:TO   1.31   1.25   1.25   1.26   1.00   2.42   1.25   1.25
live:VB   0.76   1.16   0.64   0.73   0.88   0.76   1.32   1.16
Europe:NNP   0.96   0.46   1.07   0.96   1.17   0.96   1.07   0.28
can:MD   1.31   1.25   1.47   1.31   1.00   1.31   1.37   1.25
travel:VB   0.73   1.15   0.27   0.73   0.89   0.76   0.59   1.13
freely:RB   1.09   1.46   0.91   1.09   1.10   1.09   0.76   1.47
Europe:NNP   0.96   0.46   1.07   0.96   1.17   0.96   1.07   0.28
NO_WORD   0.82   0.28   0.17   0.82   0.09   1.55   0.12   0.07

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.64 Alignment.score
 1.00  0.28 Alignment.isGood
-1.00  0.70 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.38 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "Europe" modifying "travel" is dropped on aligned hypothesis word "have"
-2.00  1.00 Modal.dontKnow : possible -> actual
-1.00  1.00 NullPunisher.other : All
-1.00  1.00 NullPunisher.other : live
-1.00  1.00 NullPunisher.other : right
-0.10  1.00 NullPunisher.article : the
-0.10  1.00 NullPunisher.functionWord : to
 1.00  1.00 Quant.equivalent : Replacing the quantifier "every" by an equivalent quantifier "all" preserves truth.
-3.00  1.00 Structure.argsMismatch : args have different parents but same relations: text "European" <-nsubj-- "person" vs. hyp "Europeans" <-nsubj-- "have", which aligned to text "travel"
Hand-tuned score (dot product of above): -6.7407
Threshold: -9.4738


Inference ID: 036

Txt: Each European can travel freely within Europe. Every European is a person. Every person who has the right to live in Europe can travel freely within Europe.

Hyp: Each European has the right to live in Europe. (don't know)

Each
DT
European
NNP
has
VBZ
the
DT
right
NN
to
TO
live
VB
Europe
NNP
Each:DT   2.42   1.00   1.48   1.56   1.23   1.56   1.51   1.17
European:NNP   0.71   0.28   1.07   0.96   1.18   0.96   1.07   0.28
can:MD   1.53   1.25   1.43   1.31   1.00   1.31   1.37   1.25
travel:VB   1.01   1.13   0.94   0.73   0.89   0.76   0.59   1.13
freely:RB   1.35   1.44   0.91   1.09   1.10   1.09   0.76   1.47
Europe:NNP   0.87   0.28   1.07   0.96   1.17   0.96   1.07   0.28
Every:DT   1.54   1.24   1.25   1.31   1.00   1.31   1.23   1.23
European:NNP   0.71   0.28   1.07   0.96   1.18   0.96   1.07   0.28
is:VBZ   1.01   1.16   1.01   0.76   0.91   0.76   0.39   1.16
a:DT   1.56   1.25   1.25   1.31   1.00   1.31   1.25   1.25
person:NN   0.96   1.11   0.82   0.71   0.84   0.71   0.65   1.17
Every:DT   1.54   1.24   1.25   1.31   1.00   1.31   1.23   1.23
person:NN   0.96   1.11   0.82   0.71   0.84   0.71   0.65   1.17
who:WP   1.37   1.55   0.97   1.04   1.30   1.06   0.97   1.55
has:VBZ   0.98   1.16   1.32   0.76   0.91   0.76   0.75   1.16
the:DT   1.56   1.25   1.25   2.42   1.00   1.26   1.22   1.25
right:NN   0.94   1.18   0.82   0.71   0.28   0.71   0.79   1.17
to:TO   1.56   1.25   1.25   1.26   1.00   2.42   1.25   1.25
live:VB   1.01   1.16   0.75   0.73   0.88   0.76   1.32   1.16
Europe:NNP   0.87   0.28   1.07   0.96   1.17   0.96   1.07   0.28
can:MD   1.53   1.25   1.43   1.31   1.00   1.31   1.37   1.25
travel:VB   1.01   1.13   0.94   0.73   0.89   0.76   0.59   1.13
freely:RB   1.35   1.44   0.91   1.09   1.10   1.09   0.76   1.47
Europe:NNP   0.87   0.28   1.07   0.96   1.17   0.96   1.07   0.28
NO_WORD   0.82   0.28   0.17   0.82   0.09   1.55   0.12   0.07

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.52 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.12 Alignment.txtSpan
-0.10  1.00 NullPunisher.functionWord : to
-1.00  1.00 NullPunisher.other : live
-1.00  1.00 NullPunisher.other : right
-3.00  1.00 NullPunisher.entity : Each
-0.10  1.00 NullPunisher.article : the
-0.05  1.00 NullPunisher.aux : has
-2.00  1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -7.6849
Threshold: -9.4738


Inference ID: 037

Txt: The residents of member states can travel freely within Europe. All residents of member states are individuals. Every individual who has the right to live anywhere in Europe can travel freely within Europe.

Hyp: The residents of member states have the right to live anywhere in Europe. (don't know)

The
DT
residents
NNS
member
NN
states
NNS
have
VBP
the
DT
right
NN
to
TO
live
VB
anywhere
RB
Europe
NNP
The:DT   2.42   1.00   1.00   0.97   1.22   0.01   1.00   1.26   1.22   1.45   1.25
residents:NNS   0.71   0.28   0.79   0.87   0.82   0.71   0.87   0.71   0.49   1.29   1.09
member:NN   0.71   0.79   0.28   0.80   0.82   0.71   0.93   0.71   0.82   1.27   1.13
states:NNS   0.68   0.87   0.80   0.28   0.77   0.68   0.99   0.71   0.71   1.31   1.15
can:MD   1.31   1.00   1.00   1.00   1.47   1.31   1.00   1.31   1.37   1.45   1.25
travel:VB   0.73   0.78   0.84   0.69   0.27   0.73   0.89   0.76   0.59   1.11   1.13
freely:RB   1.09   1.18   1.18   1.11   0.91   1.09   1.10   1.09   0.76   0.86   1.47
Europe:NNP   0.96   1.09   1.13   1.15   1.07   0.96   1.17   0.96   1.07   1.56   0.28
All:DT   1.31   1.00   1.00   1.00   1.22   1.31   1.00   1.31   1.25   1.45   1.25
residents:NNS   0.71   0.28   0.79   0.87   0.82   0.71   0.87   0.71   0.49   1.29   1.09
member:NN   0.71   0.79   0.28   0.80   0.82   0.71   0.93   0.71   0.82   1.27   1.13
states:NNS   0.68   0.87   0.80   0.28   0.77   0.68   0.99   0.71   0.71   1.31   1.15
are:VBP   0.68   0.91   0.91   0.88   0.89   0.68   0.91   0.76   0.36   1.09   1.14
individuals:NNS   0.71   0.74   0.85   0.94   0.82   0.71   1.02   0.71   0.80   1.31   1.21
Every:DT   1.31   1.00   0.94   1.00   1.23   1.31   1.00   1.31   1.23   1.44   1.23
individual:NN   0.71   0.86   0.88   0.99   0.82   0.71   0.95   0.71   0.82   1.31   1.21
who:WP   1.04   1.30   1.30   1.30   0.97   1.04   1.30   1.06   0.97   1.05   1.55
has:VBZ   0.76   0.91   0.91   0.88   0.19   0.76   0.91   0.76   0.75   1.11   1.16
the:DT   0.01   1.00   1.00   0.97   1.22   2.42   1.00   1.26   1.22   1.45   1.25
right:NN   0.71   0.87   0.93   0.99   0.82   0.71   0.28   0.71   0.79   1.31   1.17
to:TO   1.26   1.00   1.00   1.00   1.25   1.26   1.00   2.42   1.25   1.45   1.25
live:VB   0.73   0.58   0.91   0.80   0.64   0.73   0.88   0.76   1.32   1.11   1.16
anywhere:RB   1.09   1.20   1.18   1.21   0.91   1.09   1.21   1.09   0.91   1.23   1.47
Europe:NNP   0.96   1.09   1.13   1.15   1.07   0.96   1.17   0.96   1.07   1.56   0.28
can:MD   1.31   1.00   1.00   1.00   1.47   1.31   1.00   1.31   1.37   1.45   1.25
travel:VB   0.73   0.78   0.84   0.69   0.27   0.73   0.89   0.76   0.59   1.11   1.13
freely:RB   1.09   1.18   1.18   1.11   0.91   1.09   1.10   1.09   0.76   0.86   1.47
Europe:NNP   0.96   1.09   1.13   1.15   1.07   0.96   1.17   0.96   1.07   1.56   0.28
NO_WORD   0.82   0.28   0.05   0.04   0.17   0.82   0.09   1.55   0.12   0.04   0.07

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.47 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  4.00 Alignment.hypSpan
 0.10  0.36 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Europe" modifying "live"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "Europe" modifying "travel" is dropped on aligned hypothesis word "have"
-2.00  1.00 Modal.dontKnow : possible -> actual
-0.10  1.00 NullPunisher.functionWord : to
-1.00  1.00 NullPunisher.other : right
-0.10  1.00 NullPunisher.article : The
-1.00  1.00 NullPunisher.other : anywhere
-0.10  1.00 NullPunisher.article : the
-3.00  1.00 NullPunisher.entity : Europe
-1.00  1.00 NullPunisher.other : live
Hand-tuned score (dot product of above): -9.0804
Threshold: -9.4738


Inference ID: 038

Txt: No delegate finished the report.

Hyp: Some delegate finished the report on time. (don't know)

Some
DT
delegate
NN
finished
VBD
the
DT
report
NN
time
NN
No:DT   1.31   1.00   1.25   1.31   1.00   1.00
delegate:NN   0.71   0.28   0.74   0.71   0.74   0.86
finished:VBD   0.76   0.83   1.32   0.76   0.86   0.77
the:DT   1.28   1.00   1.25   2.42   1.00   0.90
report:NN   0.71   0.74   0.77   0.71   0.28   0.91
NO_WORD   0.82   0.28   0.17   0.82   0.09   0.02

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.27 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "time" modifying "finished"
-1.00  1.00 NullPunisher.other : delegate
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : time
-1.00  1.00 NullPunisher.other : report
-1.00  1.00 NullPunisher.other : finished
-1.00  1.00 NullPunisher.other : Some
-2.00  1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -8.9897
Threshold: -9.4738


Inference ID: 039

Txt: Some delegates finished the survey.

Hyp: Some delegates finished the survey on time. (don't know)

Some
DT
delegates
NNS
finished
VBD
the
DT
survey
NN
time
NN
Some:DT   2.42   1.00   1.25   1.28   0.95   0.89
delegates:NNS   0.71   0.28   0.78   0.71   0.53   0.86
finished:VBD   0.76   0.87   1.32   0.76   0.91   0.77
the:DT   1.28   1.00   1.25   2.42   1.00   0.90
survey:NN   0.66   0.53   0.82   0.71   0.28   0.84
NO_WORD   0.82   0.28   0.17   0.82   0.09   0.02

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.27 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "time" modifying "finished"
-1.00  1.00 NullPunisher.other : time
-1.00  1.00 NullPunisher.other : Some
-1.00  1.00 NullPunisher.other : delegates
-1.00  1.00 NullPunisher.other : finished
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : survey
-2.00  1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -8.9897
Threshold: -9.4738


Inference ID: 040

Txt: Many delegates obtained results from the survey.

Hyp: Many delegates obtained interesting results from the survey. (don't know)

Many
JJ
delegates
NNS
obtained
VBD
interesting
JJ
results
NNS
the
DT
survey
NN
Many:JJ   0.74   0.88   0.96   0.93   0.88   0.76   0.88
delegates:NNS   0.99   0.28   0.80   0.88   0.63   0.71   0.53
obtained:VBD   1.02   0.89   1.32   0.96   0.85   0.76   0.61
results:NNS   0.99   0.63   0.76   0.82   0.28   0.71   0.63
the:DT   1.18   1.00   1.25   1.18   1.00   2.42   1.00
survey:NN   0.99   0.53   0.52   0.85   0.63   0.71   0.28
NO_WORD   0.11   0.28   0.17   0.11   0.09   0.82   0.15

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.13 Alignment.score
 1.00  0.19 Alignment.isGood
-1.00  0.79 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "survey" modifying "obtained"
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : results
-1.00  1.00 NullPunisher.other : survey
-1.00  1.00 NullPunisher.other : delegates
-1.00  1.00 NullPunisher.other : Many
-1.00  1.00 NullPunisher.other : obtained
-1.00  1.00 NullPunisher.other : interesting
-2.00  1.00 RootEntailment.unalignedRoot : "obtained" not aligned to anything
Hand-tuned score (dot product of above): -10.2775
Threshold: -9.4738


Inference ID: 041

Txt: Several delegates got the results published.

Hyp: Several delegates got the results published in major national newspapers. (don't know)

Several
JJ
delegates
NNS
got
VBD
the
DT
results
NNS
published
VBN
major
JJ
national
JJ
newspapers
NNS
Several:JJ   0.74   0.82   0.96   0.76   0.84   0.96   0.93   0.89   0.86
delegates:NNS   0.93   0.28   0.82   0.71   0.63   0.71   0.99   0.86   0.56
got:VBD   1.02   0.91   1.32   0.76   0.91   0.74   1.02   1.02   0.86
the:DT   1.18   1.00   1.25   2.42   1.00   1.25   1.18   1.18   1.00
results:NNS   0.95   0.63   0.82   0.71   0.28   0.64   0.99   0.99   0.82
published:VBD   1.02   0.80   0.74   0.76   0.73   2.02   1.01   0.89   0.65
NO_WORD   0.11   0.28   0.17   0.82   0.09   0.16   0.11   0.11   0.07

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.34 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  8.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.11 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "newspapers" modifying "published"
-1.00  1.00 NullPunisher.other : Several
-1.00  1.00 NullPunisher.other : delegates
-1.00  1.00 NullPunisher.other : national
-1.00  1.00 NullPunisher.other : got
-1.00  1.00 NullPunisher.other : major
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : results
-1.00  1.00 NullPunisher.other : newspapers
-2.00  1.00 RootEntailment.unalignedRoot : "got" not aligned to anything
Hand-tuned score (dot product of above): -10.9745
Threshold: -9.4738


Inference ID: 042

Txt: Most Europeans can travel freely within Europe. All Europeans are people. All people who are resident in Europe can travel freely within Europe.

Hyp: Most Europeans are resident in Europe. (don't know)

Most
JJS
Europeans
NNPS
are
VBP
resident
VBN
Europe
NNP
Most:JJS   0.74   1.13   0.96   0.96   1.13
Europeans:NNPS   1.24   0.28   1.07   1.03   0.46
can:MD   1.18   1.25   1.55   1.25   1.25
travel:VB   1.02   1.15   0.98   0.23   1.13
freely:RB   1.05   1.46   0.89   0.88   1.47
Europe:NNP   1.24   0.46   1.05   1.07   0.28
All:DT   1.18   1.25   1.18   1.25   1.25
Europeans:NNPS   1.24   0.28   1.07   1.03   0.46
are:VBP   1.02   1.16   1.32   0.56   1.14
people:NNS   0.99   1.21   0.82   0.60   1.14
All:DT   1.18   1.25   1.18   1.25   1.25
people:NNS   0.99   1.21   0.82   0.60   1.14
who:WP   1.39   1.55   0.97   0.97   1.55
are:VBP   1.02   1.16   1.32   0.56   1.14
resident:NN   0.99   0.89   0.82   2.71   1.09
Europe:NNP   1.24   0.46   1.05   1.07   0.28
can:MD   1.18   1.25   1.55   1.25   1.25
travel:VB   1.02   1.15   0.98   0.23   1.13
freely:RB   1.05   1.46   0.89   0.88   1.47
Europe:NNP   1.24   0.46   1.05   1.07   0.28
NO_WORD   0.11   0.15   1.57   0.17   0.07

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.99 Alignment.score
 1.00  0.35 Alignment.isGood
-1.00  0.62 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.40 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Most" modifying "Europeans"
 0.00  1.00 NegPolarity.hypNegWord : "Most": tag "JJS" is in NegPolarityMarkers list
-0.05  1.00 NullPunisher.aux : are
-1.00  1.00 NullPunisher.other : Most
 2.00  1.00 Location.match : Locations match between txt and hyp; both have lemma "Europe"
Hand-tuned score (dot product of above): 0.7143
Threshold: -9.4738


Inference ID: 043

Txt: A few committee members are from Scandinavia. All committee members are people. All people who are from Sweden are from Scandinavia.

Hyp: At least a few committee members are from Sweden. (don't know)

At
IN
least
JJS
a
DT
few
JJ
committee_members
NNS
are
VBP
Sweden
NNP
A:DT   1.00   1.18   0.13   1.18   1.00   1.25   1.25
few:JJ   1.05   0.93   0.76   0.74   0.88   0.96   1.13
committee_members:NNS   0.50   0.99   0.71   0.99   0.28   0.82   1.07
are:VBP   1.11   1.02   0.76   1.02   0.91   1.32   1.16
Scandinavia:NNP   0.75   1.24   0.96   1.24   1.06   1.07   0.76
All:DT   1.00   1.18   1.31   1.18   1.00   1.18   1.25
committee_members:NNS   0.50   0.99   0.71   0.99   0.28   0.82   1.07
are:VBP   1.11   1.02   0.76   1.02   0.91   1.32   1.16
people:NNS   0.50   0.97   0.71   0.99   0.84   0.82   1.16
All:DT   1.00   1.18   1.31   1.18   1.00   1.18   1.25
people:NNS   0.50   0.97   0.71   0.99   0.84   0.82   1.16
who:WP   1.05   1.39   1.11   1.39   1.30   0.97   1.55
are:VBP   1.11   1.02   0.76   1.02   0.91   1.32   1.16
Sweden:NNP   0.75   1.22   0.96   1.24   1.07   1.07   0.28
are:VBP   1.11   1.02   0.76   1.02   0.91   1.32   1.16
Scandinavia:NNP   0.75   1.24   0.96   1.24   1.06   1.07   0.76
NO_WORD   0.73   0.29   0.82   0.11   0.28   0.17   0.15

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.24 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Sweden" modifying "are"
 0.00  1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
 0.00  1.00 NegPolarity.hypNegWord : "few": is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : least
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : At
-1.00  1.00 NullPunisher.other : few
-0.05  1.00 NullPunisher.aux : are
-1.00  1.00 NullPunisher.other : committee_members
-3.00  1.00 NullPunisher.entity : Sweden
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -11.1707
Threshold: -9.4738


Inference ID: 044

Txt: Few committee members are from southern Europe. All committee members are people. All people who are from Portugal are from southern Europe.

Hyp: There are few committee members from Portugal. (yes)

There
EX
are
VBP
few
JJ
committee_members
NNS
Portugal
NNP
Few:JJ   0.76   0.96   0.76   0.88   1.13
committee_members:NNS   0.71   0.82   0.99   0.28   1.07
are:VBP   0.70   1.32   1.02   0.91   1.16
southern:JJ   0.68   0.96   0.93   0.88   1.08
Europe:NNP   0.94   1.05   1.24   1.11   0.80
All:DT   1.31   1.18   1.18   1.00   1.25
committee_members:NNS   0.71   0.82   0.99   0.28   1.07
are:VBP   0.70   1.32   1.02   0.91   1.16
people:NNS   0.69   0.82   0.99   0.84   1.14
All:DT   1.31   1.18   1.18   1.00   1.25
people:NNS   0.69   0.82   0.99   0.84   1.14
who:WP   1.11   0.97   1.39   1.30   1.55
are:VBP   0.70   1.32   1.02   0.91   1.16
Portugal:NNP   0.96   1.07   1.24   1.07   0.28
are:VBP   0.70   1.32   1.02   0.91   1.16
southern:NNP   0.62   0.82   0.99   0.82   1.02
Europe:NNP   0.94   1.05   1.24   1.11   0.80
NO_WORD   0.29   0.17   0.11   0.28   0.15

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.16 Alignment.score
 1.00  0.19 Alignment.isGood
-1.00  0.79 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.20 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Portugal" modifying "committee_members"
 0.00  1.00 NegPolarity.hypNegWord : "few": is in NegPolarityMarkers list
-0.05  1.00 NullPunisher.aux : are
-1.00  1.00 NullPunisher.other : committee_members
-3.00  1.00 NullPunisher.entity : Portugal
-0.10  1.00 NullPunisher.functionWord : There
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
-1.00  1.00 Structure.relMismatch : hypothesis noun modifier "few" and aligned text noun modifier "committee_members" bear the same relation ("amod") but different parents ("committee_members" vs. "committee_members")
Hand-tuned score (dot product of above): -8.8627
Threshold: -9.4738


Inference ID: 045

Txt: Both commissioners used to be businessmen.

Hyp: Both commissioners used to be leading businessmen. (don't know)

Both
DT
commissioners
NNS
used
VBD
to
TO
be
VB
leading
VBG
businessmen
NNS
Both:DT   2.42   1.00   1.25   1.31   1.25   1.25   1.00
commissioners:NNS   0.71   0.28   0.82   0.71   0.82   0.82   0.76
used:VBD   0.76   0.91   1.32   0.76   0.93   0.63   0.91
to:TO   1.31   1.00   1.25   2.42   1.25   1.25   1.00
be:VB   0.76   0.91   0.93   0.76   1.32   0.89   0.91
businessmen:NNS   0.71   0.76   0.82   0.71   0.82   0.71   0.28
NO_WORD   0.82   0.28   0.17   1.55   1.43   0.12   0.28

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.59 Alignment.score
 1.00  0.27 Alignment.isGood
-1.00  0.71 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "leading" modifying "businessmen"
-1.00  1.00 NullPunisher.other : commissioners
-1.00  1.00 NullPunisher.other : leading
-0.05  1.00 NullPunisher.aux : be
-1.00  1.00 NullPunisher.other : used
-0.10  1.00 NullPunisher.functionWord : to
-1.00  1.00 NullPunisher.other : Both
-1.00  1.00 NullPunisher.other : businessmen
-2.00  1.00 RootEntailment.unalignedRoot : "used" not aligned to anything
Hand-tuned score (dot product of above): -8.7006
Threshold: -9.4738


Inference ID: 046

Txt: Neither commissioner spends time at home.

Hyp: One of the commissioners spends a lot of time at home. (don't know)

One
CD
the
DT
commissioners
NNS
spends
VBZ
a
DT
lot
NN
of
IN
time
NN
at_home
IN
Neither:DT   1.18   1.26   1.00   1.25   1.31   1.00   1.05   0.98   1.05
commissioner:NN   1.29   0.71   1.47   0.77   0.71   0.87   0.50   0.87   0.50
spends:VBZ   1.39   0.76   0.91   1.32   0.76   0.79   1.15   0.59   1.07
time:NN   1.26   0.61   0.87   0.50   0.71   0.62   0.50   0.28   0.30
at_home:IN   1.30   1.00   1.31   1.15   1.05   1.26   1.05   1.11   1.59
NO_WORD   0.79   0.82   0.04   0.17   0.82   0.09   1.16   0.25   0.99

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.55 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.71 Alignment.isBad
-0.10  8.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.11 Alignment.txtSpan
-3.00  1.00 NullPunisher.entity : One
-1.00  1.00 NullPunisher.other : lot
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : of
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : at_home
-1.00  1.00 NullPunisher.other : time
-1.00  1.00 NullPunisher.other : spends
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '1.0' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "spends" not aligned to anything
Hand-tuned score (dot product of above): -16.7851
Threshold: -9.4738


Inference ID: 047

Txt: At least three commissioners spend time at home.

Hyp: At least three commissioners spend a lot of time at home. (don't know)

At
IN
least
JJS
three
CD
commissioners
NNS
spend
VBP
a
DT
lot
NN
time
NN
at_home
IN
At:IN   1.59   0.74   1.30   1.31   1.24   1.00   1.27   1.31   1.05
least:JJS   1.01   0.74   1.13   0.88   0.96   0.76   0.82   0.88   1.05
three:CD   1.30   1.00   0.85   1.18   1.22   0.89   1.31   1.01   1.23
commissioners:NNS   0.50   0.99   1.15   0.28   0.82   0.71   0.87   0.87   0.50
spend:VBP   1.15   1.02   1.31   0.91   1.32   0.76   0.62   0.51   1.07
time:NN   0.50   0.99   0.98   0.87   0.42   0.71   0.62   0.28   0.30
at_home:IN   1.05   0.77   1.23   1.31   1.15   1.05   1.26   1.11   1.59
NO_WORD   0.84   0.29   0.52   0.28   0.17   0.82   0.09   0.04   0.99

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.31 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.22 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "time" modifying "lot"
 0.00  1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : at_home
-1.00  1.00 NullPunisher.other : time
-1.00  1.00 NullPunisher.other : At
-1.00  1.00 NullPunisher.other : spend
-1.00  1.00 NullPunisher.other : lot
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : least
-2.00  1.00 RootEntailment.unalignedRoot : "spend" not aligned to anything
Hand-tuned score (dot product of above): -9.8066
Threshold: -9.4738


Inference ID: 048

Txt: At most ten commissioners spend time at home.

Hyp: At most ten commissioners spend a lot of time at home. (yes)

At
IN
most
RBS
ten
NN
commissioners
NNS
spend
VBP
a
DT
lot
NN
time
NN
at_home
IN
At:IN   1.59   1.23   1.31   1.31   1.24   1.00   1.27   1.31   1.05
most:JJS   1.05   1.65   0.88   0.88   0.96   0.76   0.78   0.88   1.05
ten:NN   0.50   1.31   0.28   0.71   0.75   0.71   0.76   0.71   0.50
commissioners:NNS   0.50   1.31   0.71   0.28   0.82   0.71   0.87   0.87   0.50
spend:VBP   1.15   1.11   0.84   0.91   1.32   0.76   0.62   0.51   1.07
time:NN   0.50   1.31   0.71   0.87   0.42   0.71   0.62   0.28   0.30
at_home:IN   1.05   1.23   1.31   1.31   1.15   1.05   1.26   1.11   1.59
NO_WORD   0.99   0.04   0.05   0.28   0.17   0.82   0.09   0.04   0.99

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.57 Alignment.score
 1.00  0.27 Alignment.isGood
-1.00  0.71 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  3.00 Alignment.hypSpan
 0.10  0.33 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "time" modifying "lot"
 0.00  1.00 NegPolarity.txtNegWord : "most": tag "JJS" is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : at_home
-1.00  1.00 NullPunisher.other : time
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : lot
-1.00  1.00 NullPunisher.other : spend
-1.00  1.00 NullPunisher.other : At
-2.00  1.00 RootEntailment.unalignedRoot : "spend" not aligned to anything
Hand-tuned score (dot product of above): -8.2409
Threshold: -9.4738


Inference ID: 049

Txt: A Swede won a Nobel prize. Every Swede is a Scandinavian.

Hyp: A Scandinavian won a Nobel prize. (yes)

A
DT
Scandinavian
NNP
won
VBD
a
DT
Nobel_prize
NN
A:DT   2.42   1.25   1.25   0.13   1.00
Swede:NN   0.96   0.56   1.07   0.96   1.07
won:VBD   0.76   1.16   1.32   0.76   0.57
a:DT   0.13   1.25   1.25   2.42   1.00
Nobel_prize:NN   0.71   1.06   0.49   0.71   0.28
Every:DT   1.31   1.25   1.25   1.31   1.00
Swede:NN   0.96   0.56   1.07   0.96   1.07
is:VBZ   0.76   1.16   0.88   0.76   0.91
a:DT   0.13   1.25   1.25   2.42   1.00
Scandinavian:NNP   0.96   0.28   1.07   0.96   1.06
NO_WORD   0.82   0.28   0.17   0.82   0.25

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.36 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : Scandinavian
-1.00  1.00 NullPunisher.other : won
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : Nobel_prize
-0.10  1.00 NullPunisher.article : A
-2.00  1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -7.8694
Threshold: -9.4738


Inference ID: 050

Txt: Every Canadian resident can travel freely within Europe. Every Canadian resident is a resident of the North American continent.

Hyp: Every resident of the North American continent can travel freely within Europe. (don't know)

Every
DT
resident
NN
the
DT
North_American
NN
continent
NN
can
MD
travel
VB
freely
RB
Europe
NNP
Every:DT   2.42   0.98   1.31   1.25   1.00   1.31   1.23   1.39   1.23
Canadian:JJ   1.02   1.11   1.02   0.86   1.07   1.00   1.22   1.27   1.05
resident:NN   0.69   0.28   0.71   0.91   0.65   0.71   0.49   1.27   1.09
can:MD   1.31   1.00   1.31   1.25   1.00   2.42   1.51   1.45   1.25
travel:VB   0.74   0.58   0.73   1.16   0.67   1.02   1.32   0.90   1.13
freely:RB   1.03   1.18   1.09   1.47   0.95   1.09   0.70   1.23   1.47
Europe:NNP   0.94   1.09   0.96   0.99   0.84   0.96   1.04   1.56   0.28
Every:DT   2.42   0.98   1.31   1.25   1.00   1.31   1.23   1.39   1.23
Canadian:JJ   1.02   1.11   1.02   0.86   1.07   1.00   1.22   1.27   1.05
resident:NN   0.69   0.28   0.71   0.91   0.65   0.71   0.49   1.27   1.09
is:VBZ   0.76   0.91   0.76   1.16   0.91   1.06   1.00   1.11   1.16
a:DT   1.31   1.00   1.31   1.25   1.00   1.31   1.25   1.45   1.25
resident:NN   0.69   0.28   0.71   0.91   0.65   0.71   0.49   1.27   1.09
the:DT   1.31   1.00   2.42   1.25   1.00   1.31   1.23   1.45   1.25
North_American:NNP   0.96   0.91   0.96   2.22   1.08   0.96   1.07   1.56   0.99
continent:NNP   0.71   0.65   0.71   1.08   3.06   0.71   0.58   1.04   0.93
NO_WORD   0.82   0.28   0.82   0.05   0.04   1.55   0.17   0.04   0.33

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  1.01 Alignment.score
 1.00  0.36 Alignment.isGood
-1.00  0.61 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.33 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
 2.00  1.00 Modal.yes : actual -> possible
-1.00  1.00 NullPunisher.other : travel
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : freely
-1.00  1.00 NullPunisher.other : Every
-1.00  1.00 NullPunisher.other : resident
-0.05  1.00 NullPunisher.aux : can
-2.00  1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
 2.00  1.00 Location.match : Locations match between txt and hyp; both have lemma "Europe"
 4.00  1.00 Location.match&OK.Root.poorlyOrUnAligned :
Hand-tuned score (dot product of above): 1.2367
Threshold: -9.4738


Inference ID: 051

Txt: All Canadian residents can travel freely within Europe. Every Canadian resident is a resident of the North American continent.

Hyp: All residents of the North American continent can travel freely within Europe. (don't know)

All
DT
residents
NNS
the
DT
North_American
NN
continent
NN
can
MD
travel
VB
freely
RB
Europe
NNP
All:DT   2.42   1.00   1.31   1.25   1.00   1.31   1.23   1.45   1.25
Canadian:JJ   1.02   1.12   1.02   0.86   1.07   1.00   1.22   1.27   1.05
residents:NNS   0.71   0.28   0.71   0.91   0.67   0.71   0.69   1.27   1.09
can:MD   1.31   1.00   1.31   1.25   1.00   2.42   1.51   1.45   1.25
travel:VB   0.73   0.78   0.73   1.16   0.67   1.02   1.32   0.90   1.13
freely:RB   1.09   1.18   1.09   1.47   0.95   1.09   0.70   1.23   1.47
Europe:NNP   0.96   1.09   0.96   0.99   0.84   0.96   1.04   1.56   0.28
Every:DT   1.31   1.00   1.31   1.25   1.00   1.31   1.23   1.39   1.23
Canadian:JJ   1.02   1.12   1.02   0.86   1.07   1.00   1.22   1.27   1.05
resident:NN   0.71   1.32   0.71   0.91   0.65   0.71   0.49   1.27   1.09
is:VBZ   0.76   0.91   0.76   1.16   0.91   1.06   1.00   1.11   1.16
a:DT   1.31   1.00   1.31   1.25   1.00   1.31   1.25   1.45   1.25
resident:NN   0.71   1.32   0.71   0.91   0.65   0.71   0.49   1.27   1.09
the:DT   1.31   1.00   2.42   1.25   1.00   1.31   1.23   1.45   1.25
North_American:NNP   0.96   0.91   0.96   2.22   1.08   0.96   1.07   1.56   0.99
continent:NNP   0.71   0.67   0.71   1.08   3.06   0.71   0.58   1.04   0.93
NO_WORD   0.82   0.28   0.82   0.05   0.04   1.55   0.17   0.04   0.33

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  1.27 Alignment.score
 1.00  0.42 Alignment.isGood
-1.00  0.55 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.22 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
 2.00  1.00 Modal.yes : actual -> possible
-1.00  1.00 NullPunisher.other : freely
-1.00  1.00 NullPunisher.other : All
-0.05  1.00 NullPunisher.aux : can
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : travel
-7.00  1.00 Quant.expand : Invalid quantifier strengthening: replacing the quantifier "a" by a stronger quantifier "all" does NOT preserve truth.
-2.00  1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
 2.00  1.00 Location.match : Locations match between txt and hyp; both have lemma "Europe"
 4.00  1.00 Location.match&OK.Root.poorlyOrUnAligned :
Hand-tuned score (dot product of above): -4.2912
Threshold: -9.4738


Inference ID: 052

Txt: Each Canadian resident can travel freely within Europe. Every Canadian resident is a resident of the North American continent.

Hyp: Each resident of the North American continent can travel freely within Europe. (don't know)

Each
DT
resident
NN
the
DT
North_American
NN
continent
NN
can
MD
travel
VB
freely
RB
Europe
NNP
Each:DT   2.42   1.00   1.31   1.25   1.00   1.28   1.25   1.45   1.25
Canadian:JJ   1.02   1.11   1.02   0.86   1.07   1.00   1.22   1.27   1.05
resident:NN   0.71   0.28   0.71   0.91   0.65   0.71   0.49   1.27   1.09
can:MD   1.28   1.00   1.31   1.25   1.00   2.42   1.51   1.45   1.25
travel:VB   0.76   0.58   0.73   1.16   0.67   1.02   1.32   0.90   1.13
freely:RB   1.09   1.18   1.09   1.47   0.95   1.09   0.70   1.23   1.47
Europe:NNP   0.96   1.09   0.96   0.99   0.84   0.96   1.04   1.56   0.28
Every:DT   1.28   0.98   1.31   1.25   1.00   1.31   1.23   1.39   1.23
Canadian:JJ   1.02   1.11   1.02   0.86   1.07   1.00   1.22   1.27   1.05
resident:NN   0.71   0.28   0.71   0.91   0.65   0.71   0.49   1.27   1.09
is:VBZ   0.76   0.91   0.76   1.16   0.91   1.06   1.00   1.11   1.16
a:DT   1.31   1.00   1.31   1.25   1.00   1.31   1.25   1.45   1.25
resident:NN   0.71   0.28   0.71   0.91   0.65   0.71   0.49   1.27   1.09
the:DT   1.31   1.00   2.42   1.25   1.00   1.31   1.23   1.45   1.25
North_American:NNP   0.96   0.91   0.96   2.22   1.08   0.96   1.07   1.56   0.99
continent:NNP   0.71   0.65   0.71   1.08   3.06   0.71   0.58   1.04   0.93
NO_WORD   0.82   0.28   0.82   0.05   0.04   1.55   0.17   0.04   0.33

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  1.01 Alignment.score
 1.00  0.36 Alignment.isGood
-1.00  0.61 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.33 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
 2.00  1.00 Modal.yes : actual -> possible
-1.00  1.00 NullPunisher.other : freely
-0.05  1.00 NullPunisher.aux : can
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : resident
-1.00  1.00 NullPunisher.other : Each
-1.00  1.00 NullPunisher.other : travel
-2.00  1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
 2.00  1.00 Location.match : Locations match between txt and hyp; both have lemma "Europe"
 4.00  1.00 Location.match&OK.Root.poorlyOrUnAligned :
Hand-tuned score (dot product of above): 1.2367
Threshold: -9.4738


Inference ID: 053

Txt: The residents of major western countries can travel freely within Europe. All residents of major western countries are residents of western countries.

Hyp: The residents of western countries have the right to live in Europe. (don't know)

The
DT
residents
NNS
western
JJ
countries
NNS
have
VBP
the
DT
right
NN
to
TO
live
VB
Europe
NNP
The:DT   2.42   1.00   1.18   1.00   1.22   0.01   1.00   1.26   1.22   1.25
residents:NNS   0.71   0.28   0.66   0.86   0.82   0.71   0.87   0.71   0.49   1.09
major:JJ   0.76   0.88   0.76   0.70   0.93   0.76   0.88   0.76   0.96   1.11
western:JJ   0.76   0.55   0.74   0.48   0.96   0.76   0.88   0.76   0.80   1.12
countries:NNS   0.71   0.86   0.59   0.28   0.82   0.71   0.95   0.71   0.73   1.17
can:MD   1.31   1.00   1.18   1.00   1.47   1.31   1.00   1.31   1.37   1.25
travel:VB   0.73   0.78   0.94   0.69   0.27   0.73   0.89   0.76   0.59   1.13
freely:RB   1.09   1.18   0.77   1.01   0.91   1.09   1.10   1.09   0.76   1.47
Europe:NNP   0.96   1.09   1.23   1.17   1.07   0.96   1.17   0.96   1.07   0.28
All:DT   1.31   1.00   1.18   1.00   1.22   1.31   1.00   1.31   1.25   1.25
residents:NNS   0.71   0.28   0.66   0.86   0.82   0.71   0.87   0.71   0.49   1.09
major:JJ   0.76   0.88   0.76   0.70   0.93   0.76   0.88   0.76   0.96   1.11
western:JJ   0.76   0.55   0.74   0.48   0.96   0.76   0.88   0.76   0.80   1.12
countries:NNS   0.71   0.86   0.59   0.28   0.82   0.71   0.95   0.71   0.73   1.17
are:VBP   0.68   0.91   1.02   0.91   0.89   0.68   0.91   0.76   0.36   1.14
residents:NNS   0.71   0.28   0.66   0.86   0.82   0.71   0.87   0.71   0.49   1.09
western:JJ   0.76   0.55   0.74   0.48   0.96   0.76   0.88   0.76   0.80   1.12
countries:NNS   0.71   0.86   0.59   0.28   0.82   0.71   0.95   0.71   0.73   1.17
NO_WORD   0.82   0.28   0.11   0.04   0.17   0.82   0.09   1.55   0.12   0.07

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.46 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.20 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Europe" modifying "live"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "Europe" modifying "travel" is dropped on aligned hypothesis word "have"
-2.00  1.00 Modal.dontKnow : possible -> actual
-0.10  1.00 NullPunisher.article : The
-0.10  1.00 NullPunisher.article : the
-3.00  1.00 NullPunisher.entity : Europe
-1.00  1.00 NullPunisher.other : live
-1.00  1.00 NullPunisher.other : western
-1.00  1.00 NullPunisher.other : right
-0.10  1.00 NullPunisher.functionWord : to
Hand-tuned score (dot product of above): -9.3122
Threshold: -9.4738


Inference ID: 054

Txt: No Scandinavian delegate finished the report on time.

Hyp: Some delegate finished the report on time. (don't know)

Some
DT
delegate
NN
finished
VBD
the
DT
report
NN
time
NN
No:DT   1.31   1.00   1.25   1.31   1.00   1.00
Scandinavian:JJ   1.02   1.13   1.22   1.02   1.13   1.13
delegate:NN   0.71   0.28   0.74   0.71   0.74   0.86
finished:VBD   0.76   0.83   1.32   0.76   0.86   0.77
the:DT   1.28   1.00   1.25   2.42   1.00   0.90
report:NN   0.71   0.74   0.77   0.71   0.28   0.91
time:NN   0.60   0.86   0.68   0.61   0.91   0.28
NO_WORD   0.82   0.28   0.17   0.82   0.09   0.02

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.27 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "time" modifying "finished"
-1.00  1.00 NullPunisher.other : Some
-1.00  1.00 NullPunisher.other : finished
-1.00  1.00 NullPunisher.other : time
-1.00  1.00 NullPunisher.other : delegate
-1.00  1.00 NullPunisher.other : report
-0.10  1.00 NullPunisher.article : the
-2.00  1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -8.9897
Threshold: -9.4738


Inference ID: 055

Txt: Some Irish delegates finished the survey on time.

Hyp: Some delegates finished the survey on time. (yes)

Some
DT
delegates
NNS
finished
VBD
the
DT
survey
NN
time
NN
Some:DT   2.42   1.00   1.25   1.28   0.95   0.89
Irish:JJ   1.02   1.13   1.13   1.02   1.11   1.11
delegates:NNS   0.71   0.28   0.78   0.71   0.53   0.86
finished:VBD   0.76   0.87   1.32   0.76   0.91   0.77
the:DT   1.28   1.00   1.25   2.42   1.00   0.90
survey:NN   0.66   0.53   0.82   0.71   0.28   0.84
time:NN   0.60   0.86   0.68   0.61   0.84   0.28
NO_WORD   0.82   0.28   0.17   0.82   0.09   0.02

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.27 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "time" modifying "finished"
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : time
-1.00  1.00 NullPunisher.other : finished
-1.00  1.00 NullPunisher.other : delegates
-1.00  1.00 NullPunisher.other : Some
-1.00  1.00 NullPunisher.other : survey
-2.00  1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -8.9897
Threshold: -9.4738


Inference ID: 056

Txt: Many British delegates obtained interesting results from the survey.

Hyp: Many delegates obtained interesting results from the survey. (don't know)

Many
JJ
delegates
NNS
obtained
VBD
interesting
JJ
results
NNS
the
DT
survey
NN
Many:JJ   0.74   0.88   0.96   0.93   0.88   0.76   0.88
British:JJ   1.19   1.13   1.20   1.17   1.13   1.02   1.13
delegates:NNS   0.99   0.28   0.80   0.88   0.63   0.71   0.53
obtained:VBD   1.02   0.89   1.32   0.96   0.85   0.76   0.61
interesting:JJ   0.93   0.77   0.90   0.74   0.71   0.76   0.74
results:NNS   0.99   0.63   0.76   0.82   0.28   0.71   0.63
the:DT   1.18   1.00   1.25   1.18   1.00   2.42   1.00
survey:NN   0.99   0.53   0.52   0.85   0.63   0.71   0.28
NO_WORD   0.11   0.28   0.17   0.11   0.09   0.82   0.15

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.13 Alignment.score
 1.00  0.19 Alignment.isGood
-1.00  0.79 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "survey" modifying "obtained"
-1.00  1.00 NullPunisher.other : survey
-1.00  1.00 NullPunisher.other : delegates
-1.00  1.00 NullPunisher.other : interesting
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : Many
-1.00  1.00 NullPunisher.other : results
-1.00  1.00 NullPunisher.other : obtained
-2.00  1.00 RootEntailment.unalignedRoot : "obtained" not aligned to anything
Hand-tuned score (dot product of above): -10.2775
Threshold: -9.4738


Inference ID: 057

Txt: Several Portuguese delegates got the results published in major national newspapers.

Hyp: Several delegates got the results published in major national newspapers. (yes)

Several
JJ
delegates
NNS
got
VBD
the
DT
results
NNS
published
VBN
major
JJ
national
JJ
newspapers
NNS
Several:JJ   0.74   0.82   0.96   0.76   0.84   0.96   0.93   0.89   0.86
Portuguese:NNP   1.24   1.04   1.07   0.96   1.06   1.04   1.24   1.24   1.05
delegates:NNS   0.93   0.28   0.82   0.71   0.63   0.71   0.99   0.86   0.56
got:VBD   1.02   0.91   1.32   0.76   0.91   0.74   1.02   1.02   0.86
the:DT   1.18   1.00   1.25   2.42   1.00   1.25   1.18   1.18   1.00
results:NNS   0.95   0.63   0.82   0.71   0.28   0.64   0.99   0.99   0.82
published:VBN   1.02   0.80   0.74   0.76   0.73   1.32   1.01   0.89   0.65
major:JJ   0.93   0.88   0.96   0.76   0.88   0.95   0.74   0.63   0.73
national:JJ   0.89   0.75   0.96   0.76   0.88   0.83   0.63   0.74   0.62
newspapers:NNS   0.97   0.56   0.77   0.71   0.82   0.56   0.84   0.73   0.28
NO_WORD   0.11   0.28   0.17   0.82   0.09   0.16   0.11   0.11   0.07

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.14 Alignment.score
 1.00  0.19 Alignment.isGood
-1.00  0.79 Alignment.isBad
-0.10  9.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "newspapers" modifying "published"
-1.00  1.00 NullPunisher.other : results
-1.00  1.00 NullPunisher.other : Several
-1.00  1.00 NullPunisher.other : delegates
-1.00  1.00 NullPunisher.other : national
-1.00  1.00 NullPunisher.other : newspapers
-1.00  1.00 NullPunisher.other : major
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : got
-1.00  1.00 NullPunisher.other : published
-2.00  1.00 RootEntailment.unalignedRoot : "got" not aligned to anything
Hand-tuned score (dot product of above): -12.4619
Threshold: -9.4738


Inference ID: 058

Txt: Most Europeans who are resident in Europe can travel freely within Europe.

Hyp: Most Europeans can travel freely within Europe. (don't know)

Most
JJS
Europeans
NNPS
can
MD
travel
VB
freely
RB
Europe
NNP
Most:JJS   0.74   1.13   0.76   0.96   1.01   1.13
Europeans:NNS   1.24   2.22   0.96   1.06   1.55   0.46
who:WP   1.39   1.55   1.18   0.97   1.05   1.55
are:VBP   1.02   1.16   1.06   0.98   1.09   1.14
resident:NN   0.99   0.89   0.71   0.49   1.27   1.09
Europe:NNP   1.24   0.46   0.96   1.04   1.56   0.28
can:MD   1.18   1.25   2.42   1.51   1.45   1.25
travel:VB   1.02   1.15   1.02   1.32   0.90   1.13
freely:RB   1.05   1.46   1.09   0.70   1.23   1.47
Europe:NNP   1.24   0.46   0.96   1.04   1.56   0.28
NO_WORD   0.11   0.28   1.55   0.17   0.04   0.33

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.47 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "resident" modifying "Europeans" is dropped on aligned hypothesis word "Europe"
 2.00  1.00 Modal.yes : actual -> possible
 0.00  1.00 NegPolarity.hypNegWord : "Most": tag "JJS" is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : travel
-1.00  1.00 NullPunisher.other : freely
-0.05  1.00 NullPunisher.aux : can
-1.00  1.00 NullPunisher.other : Most
-2.00  1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
Hand-tuned score (dot product of above): -3.8498
Threshold: -9.4738


Inference ID: 059

Txt: A few female committee members are from Scandinavia.

Hyp: At least a few committee members are from Scandinavia. (yes)

At
IN
least
JJS
a
DT
few
JJ
committee_members
NNS
are
VBP
Scandinavia
NNP
A:DT   1.00   1.18   0.13   1.18   1.00   1.25   1.25
few:JJ   1.05   0.93   0.76   0.74   0.88   0.96   1.13
female:JJ   1.05   0.87   0.76   0.91   0.59   0.93   1.13
committee_members:NNS   0.50   0.99   0.71   0.99   0.28   0.82   1.06
are:VBP   1.11   1.02   0.76   1.02   0.91   1.32   1.16
Scandinavia:NNP   0.75   1.24   0.96   1.24   1.06   1.07   0.28
NO_WORD   0.73   0.29   0.82   0.11   0.28   0.17   0.15

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.24 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Scandinavia" modifying "are"
 0.00  1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
 0.00  1.00 NegPolarity.hypNegWord : "few": is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : committee_members
-3.00  1.00 NullPunisher.entity : Scandinavia
-0.05  1.00 NullPunisher.aux : are
-1.00  1.00 NullPunisher.other : At
-1.00  1.00 NullPunisher.other : few
-1.00  1.00 NullPunisher.other : least
-0.10  1.00 NullPunisher.article : a
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -11.1707
Threshold: -9.4738


Inference ID: 060

Txt: Few female committee members are from southern Europe.

Hyp: Few committee members are from southern Europe. (don't know)

Few
JJ
committee_members
NNS
are
VBP
southern
JJ
Europe
NNP
Few:JJ   0.74   0.88   0.96   0.93   1.13
female:JJ   0.91   0.59   0.93   0.93   1.10
committee_members:NNS   0.99   0.28   0.82   0.99   1.11
are:VBP   1.02   0.91   1.32   1.02   1.14
southern:JJ   0.93   0.88   0.96   0.74   1.13
Europe:NNP   1.24   1.11   1.05   1.24   0.28
NO_WORD   0.11   0.28   0.17   0.11   0.15

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00 -0.00 Alignment.score
 1.00  0.17 Alignment.isGood
-1.00  0.81 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Europe" modifying "are"
-3.00  1.00 NullPunisher.entity : Europe
-1.00  1.00 NullPunisher.other : southern
-1.00  1.00 NullPunisher.other : Few
-0.05  1.00 NullPunisher.aux : are
-1.00  1.00 NullPunisher.other : committee_members
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -10.1982
Threshold: -9.4738


Inference ID: 063

Txt: At least three female commissioners spend time at home.

Hyp: At least three commissioners spend time at home. (yes)

At
IN
least
JJS
three
CD
commissioners
NNS
spend
VBP
time
NN
at_home
IN
At:IN   1.59   0.74   1.30   1.31   1.24   1.31   1.05
least:JJS   1.01   0.74   1.13   0.88   0.96   0.88   1.05
three:CD   1.30   1.00   0.85   1.18   1.22   1.01   1.23
female:JJ   1.05   0.87   1.02   0.87   0.96   0.83   1.03
commissioners:NNS   0.50   0.99   1.15   0.28   0.82   0.87   0.50
spend:VBP   1.15   1.02   1.31   0.91   1.32   0.51   1.07
time:NN   0.50   0.99   0.98   0.87   0.42   0.28   0.30
at_home:IN   1.05   0.77   1.23   1.31   1.15   1.11   1.59
NO_WORD   0.84   0.29   0.52   0.28   0.17   0.09   1.16

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.30 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.29 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "female" modifying "commissioners" is dropped on aligned hypothesis word "commissioners"
 0.00  1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : least
-1.00  1.00 NullPunisher.other : at_home
-1.00  1.00 NullPunisher.other : At
-1.00  1.00 NullPunisher.other : spend
-1.00  1.00 NullPunisher.other : time
-2.00  1.00 RootEntailment.unalignedRoot : "spend" not aligned to anything
Hand-tuned score (dot product of above): -7.0125
Threshold: -9.4738


Inference ID: 064

Txt: At most ten female commissioners spend time at home.

Hyp: At most ten commissioners spend time at home. (don't know)

At
IN
most
JJS
ten
NN
commissioners
NNS
spend
VBP
time
NN
at_home
IN
most:JJS   1.05   0.74   0.88   0.88   0.96   0.88   1.05
ten:NN   0.50   0.99   0.28   0.71   0.75   0.71   0.50
female:JJ   1.05   0.93   0.88   0.87   0.96   0.83   1.03
commissioners:NNS   0.50   0.99   0.71   0.28   0.82   0.87   0.50
spend:VBP   1.15   1.02   0.84   0.91   1.32   0.51   1.07
time:NN   0.50   0.99   0.71   0.87   0.42   0.28   0.30
at_home:IN   1.05   0.77   1.31   1.31   1.15   1.11   1.59
NO_WORD   1.16   0.29   0.05   0.28   0.17   0.09   1.16

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.36 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "ten" modifying "commissioners"
 0.00  1.00 NegPolarity.hypNegWord : "most": tag "JJS" is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : At
-1.00  1.00 NullPunisher.other : spend
-1.00  1.00 NullPunisher.other : at_home
-1.00  1.00 NullPunisher.other : ten
-1.00  1.00 NullPunisher.other : commissioners
-1.00  1.00 NullPunisher.other : time
-1.00  1.00 NullPunisher.other : most
-2.00  1.00 RootEntailment.unalignedRoot : "spend" not aligned to anything
Hand-tuned score (dot product of above): -10.8594
Threshold: -9.4738


Inference ID: 065

Txt: A Scandinavian won a Nobel prize. Every Swede is a Scandinavian.

Hyp: A Swede won a Nobel prize. (don't know)

A
DT
Swede
NN
won
VBD
a
DT
Nobel_prize
NN
A:DT   2.42   1.25   1.25   0.13   1.00
Scandinavian:NNP   0.96   0.56   1.07   0.96   1.06
won:VBD   0.76   1.16   1.32   0.76   0.57
a:DT   0.13   1.25   1.25   2.42   1.00
Nobel_prize:NN   0.71   1.07   0.49   0.71   0.28
Every:DT   1.31   1.21   1.25   1.31   1.00
Swede:NN   0.96   0.28   1.07   0.96   1.07
is:VBZ   0.76   1.16   0.88   0.76   0.91
a:DT   0.13   1.25   1.25   2.42   1.00
Scandinavian:NNP   0.96   0.56   1.07   0.96   1.06
NO_WORD   0.82   0.28   0.17   0.82   0.25

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.36 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : Nobel_prize
-3.00  1.00 NullPunisher.entity : Swede
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : won
-0.10  1.00 NullPunisher.article : A
-2.00  1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -7.8694
Threshold: -9.4738


Inference ID: 066

Txt: Every resident of the North American continent can travel freely within Europe. Every Canadian resident is a resident of the North American continent.

Hyp: Every Canadian resident can travel freely within Europe. (yes)

Every
DT
Canadian
JJ
resident
NN
can
MD
travel
VB
freely
RB
Europe
NNP
Every:DT   2.42   1.44   0.98   1.31   1.23   1.39   1.23
resident:NN   0.69   1.22   0.28   0.71   0.49   1.27   1.09
the:DT   1.31   1.44   1.00   1.31   1.23   1.45   1.25
North_American:NN   0.96   0.97   0.91   0.96   1.07   1.56   0.99
continent:NN   0.71   1.18   0.65   0.71   0.58   1.04   0.93
can:MD   1.31   1.42   1.00   2.42   1.51   1.45   1.25
travel:VB   0.74   1.28   0.58   1.02   1.32   0.90   1.13
freely:RB   1.03   1.30   1.18   1.09   0.70   1.23   1.47
Europe:NNP   0.94   1.15   1.09   0.96   1.04   1.56   0.28
Every:DT   2.42   1.44   0.98   1.31   1.23   1.39   1.23
Canadian:JJ   1.02   0.74   1.11   1.00   1.22   1.27   1.05
resident:NN   0.69   1.22   0.28   0.71   0.49   1.27   1.09
is:VBZ   0.76   1.28   0.91   1.06   1.00   1.11   1.16
a:DT   1.31   1.44   1.00   1.31   1.25   1.45   1.25
resident:NN   0.69   1.22   0.28   0.71   0.49   1.27   1.09
the:DT   1.31   1.44   1.00   1.31   1.23   1.45   1.25
North_American:NNP   0.96   0.97   0.91   0.96   1.07   1.56   0.99
continent:NNP   0.71   1.18   0.65   0.71   0.58   1.04   0.93
NO_WORD   0.82   0.11   0.28   1.55   0.17   0.04   0.33

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.31 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.14 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
 2.00  1.00 Modal.yes : actual -> possible
-0.05  1.00 NullPunisher.aux : can
-1.00  1.00 NullPunisher.other : freely
-3.00  1.00 NullPunisher.entity : Canadian
-1.00  1.00 NullPunisher.other : resident
-1.00  1.00 NullPunisher.other : travel
-1.00  1.00 NullPunisher.other : Every
-2.00  1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
 2.00  1.00 Location.match : Locations match between txt and hyp; both have lemma "Europe"
 4.00  1.00 Location.match&OK.Root.poorlyOrUnAligned :
Hand-tuned score (dot product of above): -2.7729
Threshold: -9.4738


Inference ID: 067

Txt: All residents of the North American continent can travel freely within Europe. Every Canadian resident is a resident of the North American continent.

Hyp: All Canadian residents can travel freely within Europe. (yes)

All
DT
Canadian
JJ
residents
NNS
can
MD
travel
VB
freely
RB
Europe
NNP
All:DT   2.42   1.44   1.00   1.31   1.23   1.45   1.25
residents:NNS   0.71   1.23   0.28   0.71   0.69   1.27   1.09
the:DT   1.31   1.44   1.00   1.31   1.23   1.45   1.25
North_American:NN   0.96   0.97   0.91   0.96   1.07   1.56   0.99
continent:NN   0.71   1.18   0.67   0.71   0.58   1.04   0.93
can:MD   1.31   1.42   1.00   2.42   1.51   1.45   1.25
travel:VB   0.73   1.28   0.78   1.02   1.32   0.90   1.13
freely:RB   1.09   1.30   1.18   1.09   0.70   1.23   1.47
Europe:NNP   0.96   1.15   1.09   0.96   1.04   1.56   0.28
Every:DT   1.31   1.44   1.00   1.31   1.23   1.39   1.23
Canadian:JJ   1.02   0.74   1.12   1.00   1.22   1.27   1.05
resident:NN   0.71   1.22   1.32   0.71   0.49   1.27   1.09
is:VBZ   0.76   1.28   0.91   1.06   1.00   1.11   1.16
a:DT   1.31   1.44   1.00   1.31   1.25   1.45   1.25
resident:NN   0.71   1.22   1.32   0.71   0.49   1.27   1.09
the:DT   1.31   1.44   1.00   1.31   1.23   1.45   1.25
North_American:NNP   0.96   0.97   0.91   0.96   1.07   1.56   0.99
continent:NNP   0.71   1.18   0.67   0.71   0.58   1.04   0.93
NO_WORD   0.82   0.11   0.28   1.55   0.17   0.04   0.33

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.53 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.29 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "Canadian" modifying "resident" is dropped on aligned hypothesis word "residents"
 2.00  1.00 Modal.yes : actual -> possible
-1.00  1.00 NullPunisher.other : travel
-0.05  1.00 NullPunisher.aux : can
-1.00  1.00 NullPunisher.other : freely
-1.00  1.00 NullPunisher.other : All
-3.00  1.00 NullPunisher.entity : Canadian
 1.00  1.00 Quant.equivalent : Replacing the quantifier "every" by an equivalent quantifier "all" preserves truth.
-2.00  1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
 2.00  1.00 Location.match : Locations match between txt and hyp; both have lemma "Europe"
 4.00  1.00 Location.match&OK.Root.poorlyOrUnAligned :
Hand-tuned score (dot product of above): 0.1542
Threshold: -9.4738


Inference ID: 068

Txt: Each resident of the North American continent can travel freely within Europe. Every Canadian resident is a resident of the North American continent.

Hyp: Each Canadian resident can travel freely within Europe. (yes)

Each
DT
Canadian
JJ
resident
NN
can
MD
travel
VB
freely
RB
Europe
NNP
Each:DT   2.42   1.44   1.00   1.28   1.25   1.45   1.25
resident:NN   0.71   1.22   0.28   0.71   0.49   1.27   1.09
the:DT   1.31   1.44   1.00   1.31   1.23   1.45   1.25
North_American:NN   0.96   0.97   0.91   0.96   1.07   1.56   0.99
continent:NN   0.71   1.18   0.65   0.71   0.58   1.04   0.93
can:MD   1.28   1.42   1.00   2.42   1.51   1.45   1.25
travel:VB   0.76   1.28   0.58   1.02   1.32   0.90   1.13
freely:RB   1.09   1.30   1.18   1.09   0.70   1.23   1.47
Europe:NNP   0.96   1.15   1.09   0.96   1.04   1.56   0.28
Every:DT   1.28   1.44   0.98   1.31   1.23   1.39   1.23
Canadian:JJ   1.02   0.74   1.11   1.00   1.22   1.27   1.05
resident:NN   0.71   1.22   0.28   0.71   0.49   1.27   1.09
is:VBZ   0.76   1.28   0.91   1.06   1.00   1.11   1.16
a:DT   1.31   1.44   1.00   1.31   1.25   1.45   1.25
resident:NN   0.71   1.22   0.28   0.71   0.49   1.27   1.09
the:DT   1.31   1.44   1.00   1.31   1.23   1.45   1.25
North_American:NNP   0.96   0.97   0.91   0.96   1.07   1.56   0.99
continent:NNP   0.71   1.18   0.65   0.71   0.58   1.04   0.93
NO_WORD   0.82   0.11   0.28   1.55   0.17   0.04   0.33

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.31 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.14 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
 2.00  1.00 Modal.yes : actual -> possible
-0.05  1.00 NullPunisher.aux : can
-1.00  1.00 NullPunisher.other : travel
-1.00  1.00 NullPunisher.other : resident
-1.00  1.00 NullPunisher.other : freely
-1.00  1.00 NullPunisher.other : Each
-3.00  1.00 NullPunisher.entity : Canadian
-2.00  1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
 2.00  1.00 Location.match : Locations match between txt and hyp; both have lemma "Europe"
 4.00  1.00 Location.match&OK.Root.poorlyOrUnAligned :
Hand-tuned score (dot product of above): -2.7729
Threshold: -9.4738


Inference ID: 069

Txt: The residents of western countries can travel freely within Europe. All residents of major western countries are residents of western countries.

Hyp: The residents of major western countries have the right to live in Europe. (yes)

The
DT
residents
NNS
major
JJ
western
JJ
countries
NNS
have
VBP
the
DT
right
NN
to
TO
live
VB
Europe
NNP
The:DT   2.42   1.00   1.18   1.18   1.00   1.22   0.01   1.00   1.26   1.22   1.25
residents:NNS   0.71   0.28   0.99   0.66   0.86   0.82   0.71   0.87   0.71   0.49   1.09
western:JJ   0.76   0.55   0.76   0.74   0.48   0.96   0.76   0.88   0.76   0.80   1.12
countries:NNS   0.71   0.86   0.81   0.59   0.28   0.82   0.71   0.95   0.71   0.73   1.17
can:MD   1.31   1.00   1.18   1.18   1.00   1.47   1.31   1.00   1.31   1.37   1.25
travel:VB   0.73   0.78   0.91   0.94   0.69   0.27   0.73   0.89   0.76   0.59   1.13
freely:RB   1.09   1.18   0.99   0.77   1.01   0.91   1.09   1.10   1.09   0.76   1.47
Europe:NNP   0.96   1.09   1.22   1.23   1.17   1.07   0.96   1.17   0.96   1.07   0.28
All:DT   1.31   1.00   1.18   1.18   1.00   1.22   1.31   1.00   1.31   1.25   1.25
residents:NNS   0.71   0.28   0.99   0.66   0.86   0.82   0.71   0.87   0.71   0.49   1.09
major:JJ   0.76   0.88   0.74   0.76   0.70   0.93   0.76   0.88   0.76   0.96   1.11
western:JJ   0.76   0.55   0.76   0.74   0.48   0.96   0.76   0.88   0.76   0.80   1.12
countries:NNS   0.71   0.86   0.81   0.59   0.28   0.82   0.71   0.95   0.71   0.73   1.17
are:VBP   0.68   0.91   1.02   1.02   0.91   0.89   0.68   0.91   0.76   0.36   1.14
residents:NNS   0.71   0.28   0.99   0.66   0.86   0.82   0.71   0.87   0.71   0.49   1.09
western:JJ   0.76   0.55   0.76   0.74   0.48   0.96   0.76   0.88   0.76   0.80   1.12
countries:NNS   0.71   0.86   0.81   0.59   0.28   0.82   0.71   0.95   0.71   0.73   1.17
NO_WORD   0.82   0.28   0.11   0.11   0.04   0.17   0.82   0.09   1.55   0.12   0.07

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.42 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  8.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.18 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Europe" modifying "live"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "Europe" modifying "travel" is dropped on aligned hypothesis word "have"
-2.00  1.00 Modal.dontKnow : possible -> actual
-0.10  1.00 NullPunisher.article : The
-1.00  1.00 NullPunisher.other : western
-1.00  1.00 NullPunisher.other : live
-1.00  1.00 NullPunisher.other : right
-0.10  1.00 NullPunisher.functionWord : to
-3.00  1.00 NullPunisher.entity : Europe
-1.00  1.00 NullPunisher.other : major
-0.10  1.00 NullPunisher.article : the
Hand-tuned score (dot product of above): -10.4568
Threshold: -9.4738


Inference ID: 070

Txt: No delegate finished the report on time.

Hyp: Some Scandinavian delegate finished the report on time. (don't know)

Some
DT
Scandinavian
JJ
delegate
NN
finished
VBD
the
DT
report
NN
time
NN
No:DT   1.31   1.44   1.00   1.25   1.31   1.00   1.00
delegate:NN   0.71   1.24   0.28   0.74   0.71   0.74   0.86
finished:VBD   0.76   1.28   0.83   1.32   0.76   0.86   0.77
the:DT   1.28   1.44   1.00   1.25   2.42   1.00   0.90
report:NN   0.71   1.24   0.74   0.77   0.71   0.28   0.91
time:NN   0.60   1.24   0.86   0.68   0.61   0.91   0.28
NO_WORD   0.82   0.11   0.28   0.17   0.82   0.09   0.02

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.25 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "time" modifying "finished"
-1.00  1.00 NullPunisher.other : finished
-1.00  1.00 NullPunisher.other : report
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : delegate
-1.00  1.00 NullPunisher.other : Some
-1.00  1.00 NullPunisher.other : time
-3.00  1.00 NullPunisher.entity : Scandinavian
-2.00  1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -12.1191
Threshold: -9.4738


Inference ID: 071

Txt: Some delegates finished the survey on time.

Hyp: Some Irish delegates finished the survey on time. (don't know)

Some
DT
Irish
JJ
delegates
NNS
finished
VBD
the
DT
survey
NN
time
NN
Some:DT   2.42   1.44   1.00   1.25   1.28   0.95   0.89
delegates:NNS   0.71   1.24   0.28   0.78   0.71   0.53   0.86
finished:VBD   0.76   1.19   0.87   1.32   0.76   0.91   0.77
the:DT   1.28   1.44   1.00   1.25   2.42   1.00   0.90
survey:NN   0.66   1.22   0.53   0.82   0.71   0.28   0.84
time:NN   0.60   1.22   0.86   0.68   0.61   0.84   0.28
NO_WORD   0.82   0.11   0.28   0.17   0.82   0.09   0.02

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.25 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "time" modifying "finished"
-1.00  1.00 NullPunisher.other : finished
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : time
-3.00  1.00 NullPunisher.entity : Irish
-1.00  1.00 NullPunisher.other : delegates
-1.00  1.00 NullPunisher.other : Some
-1.00  1.00 NullPunisher.other : survey
-2.00  1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -12.1191
Threshold: -9.4738


Inference ID: 072

Txt: Many delegates obtained interesting results from the survey.

Hyp: Many British delegates obtained interesting results from the survey. (don't know)

Many
JJ
British
JJ
delegates
NNS
obtained
VBD
interesting
JJ
results
NNS
the
DT
survey
NN
Many:JJ   0.74   1.19   0.88   0.96   0.93   0.88   0.76   0.88
delegates:NNS   0.99   1.24   0.28   0.80   0.88   0.63   0.71   0.53
obtained:VBD   1.02   1.26   0.89   1.32   0.96   0.85   0.76   0.61
interesting:JJ   0.93   1.17   0.77   0.90   0.74   0.71   0.76   0.74
results:NNS   0.99   1.24   0.63   0.76   0.82   0.28   0.71   0.63
the:DT   1.18   1.44   1.00   1.25   1.18   1.00   2.42   1.00
survey:NN   0.99   1.24   0.53   0.52   0.85   0.63   0.71   0.28
NO_WORD   0.11   0.11   0.28   0.17   0.11   0.09   0.82   0.15

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.12 Alignment.score
 1.00  0.19 Alignment.isGood
-1.00  0.79 Alignment.isBad
-0.10  8.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "survey" modifying "obtained"
-1.00  1.00 NullPunisher.other : obtained
-3.00  1.00 NullPunisher.entity : British
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : Many
-1.00  1.00 NullPunisher.other : delegates
-1.00  1.00 NullPunisher.other : interesting
-1.00  1.00 NullPunisher.other : survey
-1.00  1.00 NullPunisher.other : results
-2.00  1.00 RootEntailment.unalignedRoot : "obtained" not aligned to anything
Hand-tuned score (dot product of above): -13.3794
Threshold: -9.4738


Inference ID: 073

Txt: Several delegates got the results published in major national newspapers.

Hyp: Several Portuguese delegates got the results published in major national newspapers. (don't know)

Several
JJ
Portuguese
NNP
delegates
NNS
got
VBD
the
DT
results
NNS
published
VBN
major
JJ
national
JJ
newspapers
NNS
Several:JJ   0.74   1.13   0.82   0.96   0.76   0.84   0.96   0.93   0.89   0.86
delegates:NNS   0.93   1.04   0.28   0.82   0.71   0.63   0.71   0.99   0.86   0.56
got:VBD   1.02   1.16   0.91   1.32   0.76   0.91   0.74   1.02   1.02   0.86
the:DT   1.18   1.25   1.00   1.25   2.42   1.00   1.25   1.18   1.18   1.00
results:NNS   0.95   1.06   0.63   0.82   0.71   0.28   0.64   0.99   0.99   0.82
published:VBN   1.02   1.13   0.80   0.74   0.76   0.73   1.32   1.01   0.89   0.65
major:JJ   0.93   1.13   0.88   0.96   0.76   0.88   0.95   0.74   0.63   0.73
national:JJ   0.89   1.13   0.75   0.96   0.76   0.88   0.83   0.63   0.74   0.62
newspapers:NNS   0.97   1.05   0.56   0.77   0.71   0.82   0.56   0.84   0.73   0.28
NO_WORD   0.11   0.05   0.28   0.17   0.82   0.09   0.16   0.11   0.11   0.07

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.12 Alignment.score
 1.00  0.19 Alignment.isGood
-1.00  0.79 Alignment.isBad
-0.10  10.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "newspapers" modifying "published"
-1.00  1.00 NullPunisher.other : national
-1.00  1.00 NullPunisher.other : got
-1.00  1.00 NullPunisher.other : results
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : newspapers
-1.00  1.00 NullPunisher.other : major
-3.00  1.00 NullPunisher.entity : Portuguese
-1.00  1.00 NullPunisher.other : published
-1.00  1.00 NullPunisher.other : delegates
-1.00  1.00 NullPunisher.other : Several
-2.00  1.00 RootEntailment.unalignedRoot : "got" not aligned to anything
Hand-tuned score (dot product of above): -15.5872
Threshold: -9.4738


Inference ID: 074

Txt: Most Europeans can travel freely within Europe.

Hyp: Most Europeans who are resident outside Europe can travel freely within Europe. (don't know)

Most
JJS
Europeans
NNS
who
WP
are
VBP
resident
NN
Europe
NNP
can
MD
travel
VB
freely
RB
Europe
NNP
Most:JJS   0.74   1.13   1.05   0.96   0.88   1.13   0.76   0.96   1.01   1.13
Europeans:NNPS   1.24   2.22   1.84   1.07   0.89   0.46   0.96   1.06   1.55   0.46
can:MD   1.18   1.25   1.24   1.55   1.00   1.25   2.42   1.51   1.45   1.25
travel:VB   1.02   1.15   0.54   0.98   0.58   1.13   1.02   1.32   0.90   1.13
freely:RB   1.05   1.46   1.05   0.89   1.18   1.47   1.09   0.70   1.23   1.47
Europe:NNP   1.24   0.46   1.84   1.05   1.09   0.28   0.96   1.04   1.56   0.28
NO_WORD   0.11   0.28   0.92   1.43   0.19   0.10   1.55   0.17   0.04   0.33

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.65 Alignment.score
 1.00  0.28 Alignment.isGood
-1.00  0.69 Alignment.isBad
-0.10  8.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.10 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "freely" modifying "travel"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "Most" modifying "Europeans" is dropped on aligned hypothesis word "Europeans"
 2.00  1.00 Modal.yes : actual -> possible
 0.00  1.00 NegPolarity.hypNegWord : "Most": tag "JJS" is in NegPolarityMarkers list
-0.05  1.00 NullPunisher.aux : can
-3.00  1.00 NullPunisher.entity : Europe
-1.00  1.00 NullPunisher.other : Most
-0.10  1.00 NullPunisher.functionWord : who
-1.00  1.00 NullPunisher.other : resident
-1.00  1.00 NullPunisher.other : travel
-0.05  1.00 NullPunisher.aux : are
-1.00  1.00 NullPunisher.other : freely
-2.00  1.00 RootEntailment.unalignedRoot : "travel" not aligned to anything
 2.00  1.00 Location.match : Locations match between txt and hyp; both have lemma "Europe"
 4.00  1.00 Location.match&OK.Root.poorlyOrUnAligned :
Hand-tuned score (dot product of above): -2.1558
Threshold: -9.4738


Inference ID: 075

Txt: A few committee members are from Scandinavia.

Hyp: At least a few female committee members are from Scandinavia. (don't know)

At
IN
least
JJS
a
DT
few
JJ
female
JJ
committee_members
NNS
are
VBP
Scandinavia
NNP
A:DT   1.00   1.18   0.13   1.18   1.18   1.00   1.25   1.25
few:JJ   1.05   0.93   0.76   0.74   0.91   0.88   0.96   1.13
committee_members:NNS   0.50   0.99   0.71   0.99   0.70   0.28   0.82   1.06
are:VBP   1.11   1.02   0.76   1.02   0.99   0.91   1.32   1.16
Scandinavia:NNP   0.75   1.24   0.96   1.24   1.24   1.06   1.07   0.28
NO_WORD   0.73   0.29   0.82   0.11   0.11   0.28   0.17   0.15

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.23 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  8.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Scandinavia" modifying "are"
 0.00  1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
 0.00  1.00 NegPolarity.hypNegWord : "few": is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : female
-1.00  1.00 NullPunisher.other : At
-0.05  1.00 NullPunisher.aux : are
-3.00  1.00 NullPunisher.entity : Scandinavia
-1.00  1.00 NullPunisher.other : committee_members
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : few
-1.00  1.00 NullPunisher.other : least
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -12.2924
Threshold: -9.4738


Inference ID: 076

Txt: Few committee members are from southern Europe.

Hyp: Few female committee members are from southern Europe. (yes)

Few
JJ
female
JJ
committee_members
NNS
are
VBP
southern
JJ
Europe
NNP
Few:JJ   0.74   0.91   0.88   0.96   0.93   1.13
committee_members:NNS   0.99   0.70   0.28   0.82   0.99   1.11
are:VBP   1.02   0.99   0.91   1.32   1.02   1.14
southern:JJ   0.93   0.93   0.88   0.96   0.74   1.13
Europe:NNP   1.24   1.21   1.11   1.05   1.24   0.28
NO_WORD   0.11   0.11   0.28   0.17   0.11   0.15

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.01 Alignment.score
 1.00  0.17 Alignment.isGood
-1.00  0.81 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Europe" modifying "are"
-1.00  1.00 NullPunisher.other : Few
-0.05  1.00 NullPunisher.aux : are
-1.00  1.00 NullPunisher.other : committee_members
-1.00  1.00 NullPunisher.other : female
-3.00  1.00 NullPunisher.entity : Europe
-1.00  1.00 NullPunisher.other : southern
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -11.2726
Threshold: -9.4738


Inference ID: 079

Txt: At least three commissioners spend time at home.

Hyp: At least three male commissioners spend time at home. (don't know)

At
IN
least
JJS
three
CD
male
JJ
commissioners
NNS
spend
VBP
time
NN
at_home
IN
At:IN   1.59   0.74   1.30   0.77   1.31   1.24   1.31   1.05
least:JJS   1.01   0.74   1.13   0.91   0.88   0.96   0.88   1.05
three:CD   1.30   1.00   0.85   0.91   1.18   1.22   1.01   1.23
commissioners:NNS   0.50   0.99   1.15   0.99   0.28   0.82   0.87   0.50
spend:VBP   1.15   1.02   1.31   1.02   0.91   1.32   0.51   1.07
time:NN   0.50   0.99   0.98   0.93   0.87   0.42   0.28   0.30
at_home:IN   1.05   0.77   1.23   0.77   1.31   1.15   1.11   1.59
NO_WORD   0.84   0.29   0.52   0.11   0.28   0.17   0.09   1.16

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.28 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "male" modifying "commissioners"
 0.00  1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : male
-1.00  1.00 NullPunisher.other : least
-1.00  1.00 NullPunisher.other : At
-1.00  1.00 NullPunisher.other : spend
-1.00  1.00 NullPunisher.other : time
-1.00  1.00 NullPunisher.other : at_home
-2.00  1.00 RootEntailment.unalignedRoot : "spend" not aligned to anything
Hand-tuned score (dot product of above): -9.7480
Threshold: -9.4738


Inference ID: 080

Txt: At most ten commissioners spend time at home.

Hyp: At most ten female commissioners spend time at home. (yes)

most
JJS
ten
NN
female
JJ
commissioners
NNS
spend
VBP
time
NN
at_home
IN
At:IN   0.77   1.31   0.77   1.31   1.24   1.31   1.05
most:JJS   0.74   0.88   0.93   0.88   0.96   0.88   1.05
ten:NN   0.99   0.28   0.98   0.71   0.75   0.71   0.50
commissioners:NNS   0.99   0.71   0.98   0.28   0.82   0.87   0.50
spend:VBP   1.02   0.84   1.02   0.91   1.32   0.51   1.07
time:NN   0.99   0.71   0.94   0.87   0.42   0.28   0.30
at_home:IN   0.77   1.31   0.75   1.31   1.15   1.11   1.59
NO_WORD   0.11   0.20   0.11   0.28   0.17   0.09   1.16

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.22 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "female" modifying "commissioners"
 0.00  1.00 NegPolarity.hypNegWord : "most": tag "JJS" is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : commissioners
-1.00  1.00 NullPunisher.other : time
-1.00  1.00 NullPunisher.other : ten
-1.00  1.00 NullPunisher.other : most
-1.00  1.00 NullPunisher.other : at_home
-1.00  1.00 NullPunisher.other : female
-1.00  1.00 NullPunisher.other : spend
-2.00  1.00 RootEntailment.unalignedRoot : "spend" not aligned to anything
Hand-tuned score (dot product of above): -11.0461
Threshold: -9.4738


Inference ID: 081

Txt: Smith, Jones and Anderson signed the contract.

Hyp: Jones signed the contract. (yes)

Jones
NNP
signed
VBD
the
DT
contract
NN
Smith:NNP   0.70   1.05   0.96   1.16
Jones:NNP   0.28   1.01   0.96   1.16
Anderson:NNP   0.75   1.07   0.96   1.10
signed:VBD   1.10   1.32   0.76   0.45
the:DT   1.25   1.25   2.42   1.00
contract:NN   1.16   0.36   0.71   0.28
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.20 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : Jones
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : contract
-1.00  1.00 NullPunisher.other : signed
-2.00  1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738


Inference ID: 082

Txt: Smith, Jones and several lawyers signed the contract.

Hyp: Jones signed the contract. (yes)

Jones
NNP
signed
VBD
the
DT
contract
NN
Smith:NNP   0.70   1.05   0.96   1.16
Jones:NNP   0.28   1.01   0.96   1.16
several:JJ   1.13   0.94   0.76   0.83
lawyers:NNS   1.03   0.62   0.71   0.84
signed:VBD   1.10   1.32   0.76   0.45
the:DT   1.25   1.25   2.42   1.00
contract:NN   1.16   0.36   0.71   0.28
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.20 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : Jones
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : signed
-1.00  1.00 NullPunisher.other : contract
-2.00  1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738


Inference ID: 083

Txt: Either Smith, Jones or Anderson signed the contract.

Hyp: Jones signed the contract. (don't know)

Jones
NNP
signed
VBD
the
DT
contract
NN
Either:CC   1.23   1.18   1.23   1.00
Smith:NNP   0.70   1.05   0.96   1.16
Jones:NNP   0.28   1.01   0.96   1.16
Anderson:NNP   0.75   1.07   0.96   1.10
signed:VBD   1.10   1.32   0.76   0.45
the:DT   1.25   1.25   2.42   1.00
contract:NN   1.16   0.36   0.71   0.28
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.20 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : signed
-1.00  1.00 NullPunisher.other : contract
-3.00  1.00 NullPunisher.entity : Jones
-0.10  1.00 NullPunisher.article : the
-2.00  1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738


Inference ID: 084

Txt: Either Smith, Jones or Anderson signed the contract.

Hyp: If Smith and Anderson did not sign the contract, Jones signed the contract. (yes)

If
IN
Smith
NNP
Anderson
NNP
did
VBD
not
RB
sign
VB
the
DT
contract
NN
Jones
NNP
signed
VBD
the
DT
contract
NN
Either:CC   1.05   1.19   1.25   1.25   1.45   1.25   1.23   1.00   1.23   1.18   1.23   1.00
Smith:NNP   0.75   0.28   0.76   1.07   1.56   1.00   0.96   1.16   0.70   1.05   0.96   1.16
Jones:NNP   0.75   0.70   0.75   1.07   1.56   1.07   0.96   1.16   0.28   1.01   0.96   1.16
Anderson:NNP   0.75   0.76   0.28   1.07   1.56   1.07   0.96   1.10   0.75   1.07   0.96   1.10
signed:VBD   1.15   1.14   1.16   0.63   1.11   0.93   0.76   0.45   1.10   1.32   0.76   0.45
the:DT   1.05   1.25   1.25   1.25   1.45   1.25   2.42   1.00   1.25   1.25   2.42   1.00
contract:NN   0.50   1.16   1.10   0.71   1.31   0.57   0.71   0.28   1.16   0.36   0.71   0.28
NO_WORD   0.92   0.28   0.01   1.30   0.24   0.42   0.82   0.09   0.28   0.17   0.82   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.50 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  9.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
 0.00  1.00 NegPolarity.hypNegWord : "sign": has child with relation "neg"
-1.00  1.00 NullPunisher.other : signed
-0.10  1.00 NullPunisher.article : the
-0.05  1.00 NullPunisher.aux : did
-1.00  1.00 NullPunisher.other : contract
-3.00  1.00 NullPunisher.entity : Anderson
-0.10  1.00 NullPunisher.article : the
-3.00  1.00 NullPunisher.entity : Jones
-1.00  1.00 NullPunisher.other : not
-1.00  1.00 NullPunisher.other : If
 1.00  1.00 Quant.contract : Valid quantifier weakening: replacing the quantifier "the" by a weaker quantifier "the" preserves truth.
-2.00  1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -12.0113
Threshold: -9.4738


Inference ID: 085

Txt: Exactly two lawyers and three accountants signed the contract.

Hyp: Six lawyers signed the contract. (don't know)

Six
CD
lawyers
NNS
signed
VBD
the
DT
contract
NN
Exactly:RB   1.36   1.21   0.91   1.09   1.20
two:CD   0.35   1.32   1.17   0.82   1.25
lawyers:NNS   1.29   0.28   0.62   0.71   0.84
three:CD   0.35   1.33   1.26   0.78   1.32
accountants:NNS   1.29   0.37   0.82   0.71   0.81
signed:VBD   1.37   0.70   1.32   0.76   0.45
the:DT   1.18   1.00   1.25   2.42   1.00
contract:NN   1.29   0.84   0.36   0.71   0.28
NO_WORD   0.52   0.28   0.17   0.82   0.09

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.23 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.40 Alignment.txtSpan
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : contract
-1.00  1.00 NullPunisher.other : signed
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '6.0' vs '2.0'
-2.00  1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -10.5021
Threshold: -9.4738


Inference ID: 086

Txt: Exactly two lawyers and three accountants signed the contract.

Hyp: Six accountants signed the contract. (don't know)

Six
CD
accountants
NNS
signed
VBD
the
DT
contract
NN
Exactly:RB   1.36   1.21   0.91   1.09   1.20
two:CD   0.35   1.21   1.17   0.82   1.25
lawyers:NNS   1.29   0.37   0.62   0.71   0.84
three:CD   0.35   1.24   1.26   0.78   1.32
accountants:NNS   1.29   0.28   0.82   0.71   0.81
signed:VBD   1.37   0.91   1.32   0.76   0.45
the:DT   1.18   1.00   1.25   2.42   1.00
contract:NN   1.29   0.81   0.36   0.71   0.28
NO_WORD   0.52   0.28   0.17   0.82   0.09

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.23 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.40 Alignment.txtSpan
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : signed
-1.00  1.00 NullPunisher.other : contract
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '6.0' vs '3.0'
-2.00  1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -10.5021
Threshold: -9.4738


Inference ID: 087

Txt: Every representative and client was at the meeting.

Hyp: Every representative was at the meeting. (yes)

Every
DT
representative
NN
was
VBD
the
DT
meeting
NN
Every:DT   2.42   1.00   1.25   1.31   1.00
representative:NN   0.71   0.28   0.82   0.71   0.82
client:NN   0.69   0.84   0.82   0.71   0.89
was:VBD   0.76   0.91   1.32   0.76   0.91
the:DT   1.31   1.00   1.25   2.42   1.00
meeting:NN   0.71   0.82   0.82   0.71   0.28
NO_WORD   0.82   0.28   0.17   0.82   0.20

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.35 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "meeting" modifying "was"
-1.00  1.00 NullPunisher.other : meeting
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : Every
-0.05  1.00 NullPunisher.aux : was
-1.00  1.00 NullPunisher.other : representative
-2.00  1.00 RootEntailment.unalignedRoot : "was" not aligned to anything
Hand-tuned score (dot product of above): -6.8325
Threshold: -9.4738


Inference ID: 088

Txt: Every representative and client was at the meeting.

Hyp: Every representative was at the meeting. (don't know)

Every
DT
representative
NN
was
VBD
the
DT
meeting
NN
Every:DT   2.42   1.00   1.25   1.31   1.00
representative:NN   0.71   0.28   0.82   0.71   0.82
client:NN   0.69   0.84   0.82   0.71   0.89
was:VBD   0.76   0.91   1.32   0.76   0.91
the:DT   1.31   1.00   1.25   2.42   1.00
meeting:NN   0.71   0.82   0.82   0.71   0.28
NO_WORD   0.82   0.28   0.17   0.82   0.20

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.35 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "meeting" modifying "was"
-1.00  1.00 NullPunisher.other : representative
-1.00  1.00 NullPunisher.other : meeting
-1.00  1.00 NullPunisher.other : Every
-0.05  1.00 NullPunisher.aux : was
-0.10  1.00 NullPunisher.article : the
-2.00  1.00 RootEntailment.unalignedRoot : "was" not aligned to anything
Hand-tuned score (dot product of above): -6.8325
Threshold: -9.4738


Inference ID: 089

Txt: Every representative or client was at the meeting.

Hyp: Every representative and every client was at the meeting. (yes)

Every
DT
representative
NN
every
DT
client
NN
was
VBD
the
DT
meeting
NN
Every:DT   2.42   1.00   0.22   0.98   1.25   1.31   1.00
representative:NN   0.71   0.28   0.71   0.84   0.82   0.71   0.82
client:NN   0.69   0.84   0.69   0.28   0.82   0.71   0.89
was:VBD   0.76   0.91   0.76   0.91   1.32   0.76   0.91
the:DT   1.31   1.00   1.31   1.00   1.25   2.42   1.00
meeting:NN   0.71   0.82   0.71   0.89   0.82   0.71   0.28
NO_WORD   0.82   0.28   0.82   0.01   0.17   0.82   0.20

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.36 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "meeting" modifying "was"
-1.00  1.00 NullPunisher.other : client
-1.00  1.00 NullPunisher.other : meeting
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : Every
-0.05  1.00 NullPunisher.aux : was
-1.00  1.00 NullPunisher.other : every
-1.00  1.00 NullPunisher.other : representative
-2.00  1.00 RootEntailment.unalignedRoot : "was" not aligned to anything
Hand-tuned score (dot product of above): -9.0074
Threshold: -9.4738


Inference ID: 090

Txt: The chairman read out the items on the agenda.

Hyp: The chairman read out every item on the agenda. (yes)

The
DT
chairman
NN
read
VB
every
DT
item
NN
the
DT
agenda
NN
The:DT   2.42   1.00   1.25   1.31   0.96   0.01   1.00
chairman:NN   0.71   0.28   0.82   0.71   0.89   0.71   0.75
read:VB   0.76   0.91   1.32   0.73   0.66   0.76   0.81
the:DT   0.01   1.00   1.25   1.31   0.96   2.42   1.00
items:NNS   0.71   0.96   0.72   0.66   1.06   0.71   0.84
the:DT   0.01   1.00   1.25   1.31   0.96   2.42   1.00
agenda:NN   0.71   0.75   0.72   0.69   0.78   0.71   0.28
NO_WORD   0.82   0.28   0.17   0.82   0.62   0.82   0.02

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.55 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.71 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.14 Alignment.txtSpan
-1.00  1.00 NullPunisher.other : chairman
-1.00  1.00 NullPunisher.other : every
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : read
-0.10  1.00 NullPunisher.article : The
-7.00  1.00 Quant.expand : Invalid quantifier strengthening: replacing the quantifier "the" by a stronger quantifier "every" does NOT preserve truth.
-2.00  1.00 RootEntailment.unalignedRoot : "read" not aligned to anything
Hand-tuned score (dot product of above): -12.4918
Threshold: -9.4738


Inference ID: 091

Txt: The people who were at the meeting voted for a new chairman.

Hyp: Everyone at the meeting voted for a new chairman. (don't know)

Everyone
NN
the
DT
meeting
NN
voted
VBD
a
DT
new
JJ
chairman
NN
The:DT   1.00   0.01   1.00   1.25   1.31   1.18   1.00
people:NNS   0.79   0.71   0.86   0.80   0.71   0.99   1.02
who:WP   1.30   1.04   1.30   0.97   1.11   1.39   1.30
were:VBD   0.87   0.73   0.91   0.76   0.76   0.99   0.91
the:DT   1.00   2.42   1.00   1.25   1.31   1.18   1.00
meeting:NN   0.77   0.71   0.28   0.54   0.71   0.99   0.67
voted:VBN   0.89   0.76   0.63   2.10   0.76   0.99   0.55
a:DT   1.00   1.31   1.00   1.25   2.42   1.18   1.00
new:JJ   0.88   0.76   0.88   0.93   0.76   0.74   0.87
chairman:NN   0.79   0.71   0.67   0.46   0.71   0.98   0.28
NO_WORD   0.28   0.82   0.20   0.17   0.82   0.11   0.05

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.60 Alignment.score
 1.00  0.27 Alignment.isGood
-1.00  0.70 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.14 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "new" modifying "chairman"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "new" modifying "chairman" is dropped on aligned hypothesis word "chairman"
-0.10  1.00 NullPunisher.article : the
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : meeting
-1.00  1.00 NullPunisher.other : Everyone
-1.00  1.00 NullPunisher.other : new
Hand-tuned score (dot product of above): -3.9137
Threshold: -9.4738


Inference ID: 092

Txt: All the people who were at the meeting voted for a new chairman.

Hyp: Everyone at the meeting voted for a new chairman. (yes)

Everyone
NN
the
DT
meeting
NN
voted
VBD
a
DT
new
JJ
chairman
NN
All:PDT   1.00   1.31   1.00   1.25   1.31   1.18   1.00
the:DT   1.00   2.42   1.00   1.25   1.31   1.18   1.00
people:NNS   0.79   0.71   0.86   0.80   0.71   0.99   1.02
who:WP   1.30   1.04   1.30   0.97   1.11   1.39   1.30
were:VBD   0.87   0.73   0.91   0.76   0.76   0.99   0.91
the:DT   1.00   2.42   1.00   1.25   1.31   1.18   1.00
meeting:NN   0.77   0.71   0.28   0.54   0.71   0.99   0.67
voted:VBN   0.89   0.76   0.63   2.10   0.76   0.99   0.55
a:DT   1.00   1.31   1.00   1.25   2.42   1.18   1.00
new:JJ   0.88   0.76   0.88   0.93   0.76   0.74   0.87
chairman:NN   0.79   0.71   0.67   0.46   0.71   0.98   0.28
NO_WORD   0.28   0.82   0.20   0.17   0.82   0.11   0.05

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.60 Alignment.score
 1.00  0.27 Alignment.isGood
-1.00  0.70 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.14 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "new" modifying "chairman"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "new" modifying "chairman" is dropped on aligned hypothesis word "chairman"
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : Everyone
-1.00  1.00 NullPunisher.other : new
-1.00  1.00 NullPunisher.other : meeting
-0.10  1.00 NullPunisher.article : a
Hand-tuned score (dot product of above): -3.9137
Threshold: -9.4738


Inference ID: 093

Txt: The people who were at the meeting all voted for a new chairman.

Hyp: Everyone at the meeting voted for a new chairman. (yes)

Everyone
NN
the
DT
meeting
NN
voted
VBD
a
DT
new
JJ
chairman
NN
The:DT   1.00   0.01   1.00   1.25   1.31   1.18   1.00
people:NNS   0.79   0.71   0.86   0.80   0.71   0.99   1.02
who:WP   1.30   1.04   1.30   0.97   1.11   1.39   1.30
were:VBD   0.87   0.73   0.91   0.76   0.76   0.99   0.91
the:DT   1.00   2.42   1.00   1.25   1.31   1.18   1.00
meeting:NN   0.77   0.71   0.28   0.54   0.71   0.99   0.67
all:DT   1.00   1.31   1.00   1.25   1.31   1.18   1.00
voted:VBN   0.89   0.76   0.63   2.10   0.76   0.99   0.55
a:DT   1.00   1.31   1.00   1.25   2.42   1.18   1.00
new:JJ   0.88   0.76   0.88   0.93   0.76   0.74   0.87
chairman:NN   0.79   0.71   0.67   0.46   0.71   0.98   0.28
NO_WORD   0.28   0.82   0.20   0.17   0.82   0.11   0.05

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.60 Alignment.score
 1.00  0.27 Alignment.isGood
-1.00  0.70 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.14 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "new" modifying "chairman"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "new" modifying "chairman" is dropped on aligned hypothesis word "chairman"
-1.00  1.00 NullPunisher.other : new
-1.00  1.00 NullPunisher.other : meeting
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : Everyone
-0.10  1.00 NullPunisher.article : the
Hand-tuned score (dot product of above): -3.9137
Threshold: -9.4738


Inference ID: 094

Txt: The inhabitants of Cambridge voted for a Labour MP.

Hyp: Every inhabitant of Cambridge voted for a Labour MP. (don't know)

Every
DT
inhabitant
NN
Cambridge
NNP
voted
VBD
a
DT
Labour_MP
NNP
The:DT   1.31   1.00   1.25   1.25   1.31   1.25
inhabitants:NNS   0.71   1.17   1.08   0.82   0.71   1.15
Cambridge:NNP   0.96   1.07   0.28   1.07   0.96   0.99
voted:VBD   0.76   0.91   1.16   1.32   0.76   1.16
a:DT   1.31   1.00   1.25   1.25   2.42   1.25
Labour_MP:NNP   0.96   1.16   0.99   1.07   0.96   0.28
NO_WORD   0.82   0.28   0.04   0.17   0.82   0.05

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.58 Alignment.score
 1.00  0.27 Alignment.isGood
-1.00  0.71 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.33 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Labour_MP" modifying "voted"
-1.00  1.00 NullPunisher.other : voted
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : Every
-3.00  1.00 NullPunisher.entity : Labour_MP
-7.00  1.00 Quant.expand : Invalid quantifier strengthening: replacing the quantifier "the" by a stronger quantifier "every" does NOT preserve truth.
-2.00  1.00 RootEntailment.unalignedRoot : "voted" not aligned to anything
 2.00  1.00 Location.match : Locations match between txt and hyp; both have lemma "Cambridge"
 4.00  1.00 Location.match&OK.Root.poorlyOrUnAligned :
Hand-tuned score (dot product of above): -9.1256
Threshold: -9.4738


Inference ID: 095

Txt: The Ancient Greeks were noted philosophers.

Hyp: Every Ancient Greek was a noted philosopher. (don't know)

Every
DT
Ancient_Greek
NNP
was
VBD
a
DT
noted
JJ
philosopher
NN
The:DT   1.31   1.25   1.25   1.31   1.18   1.00
Ancient:NNP   0.96   0.39   1.07   0.96   1.21   1.00
Greeks:NNP   0.94   0.49   1.07   0.96   1.22   1.10
were:VBD   0.68   1.16   0.13   0.76   0.99   0.91
noted:VBN   0.76   1.16   0.86   0.76   2.37   0.88
philosophers:NNS   0.71   1.06   0.82   0.71   0.93   1.46
NO_WORD   0.82   0.28   1.43   0.82   0.11   0.12

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  1.32 Alignment.score
 1.00  0.43 Alignment.isGood
-1.00  0.54 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.50 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "Ancient" modifying "Greeks" is dropped on aligned hypothesis word "Ancient_Greek"
-1.00  1.00 Apposition.mismatch : no apposition in text between philosopher and Ancient_Greek
-1.00  1.00 Hypernym.posNarrow : Narrowing a term (from "Greeks" to "Ancient_Greek") does NOT preserve truth in a positive context
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : Every
-0.05  1.00 NullPunisher.aux : was
-7.00  1.00 Quant.expand : Invalid quantifier strengthening: replacing the quantifier "the" by a stronger quantifier "every" does NOT preserve truth.
Hand-tuned score (dot product of above): -8.4827
Threshold: -9.4738


Inference ID: 096

Txt: The Ancient Greeks were all noted philosophers.

Hyp: Every Ancient Greek was a noted philosopher. (yes)

Every
DT
Ancient_Greek
NNP
was
VBD
a
DT
noted
JJ
philosopher
NN
The:DT   1.31   1.25   1.25   1.31   1.18   1.00
Ancient:NNP   0.96   0.39   1.07   0.96   1.21   1.00
Greeks:NNP   0.94   0.49   1.07   0.96   1.22   1.10
were:VBD   0.68   1.16   0.13   0.76   0.99   0.91
all:RB   1.09   1.47   0.91   1.09   1.05   1.21
noted:VBN   0.76   1.16   0.86   0.76   2.37   0.88
philosophers:NNS   0.71   1.06   0.82   0.71   0.93   1.46
NO_WORD   0.82   0.28   1.43   0.82   0.11   0.12

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  1.32 Alignment.score
 1.00  0.43 Alignment.isGood
-1.00  0.54 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.33 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "all" modifying "noted" is dropped on aligned hypothesis word "noted"
-1.00  1.00 Apposition.mismatch : no apposition in text between philosopher and Ancient_Greek
-1.00  1.00 Hypernym.posNarrow : Narrowing a term (from "Greeks" to "Ancient_Greek") does NOT preserve truth in a positive context
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : Every
-0.05  1.00 NullPunisher.aux : was
-7.00  1.00 Quant.expand : Invalid quantifier strengthening: replacing the quantifier "the" by a stronger quantifier "every" does NOT preserve truth.
Hand-tuned score (dot product of above): -8.4993
Threshold: -9.4738


Inference ID: 097

Txt: Software faults were blamed for the system failure.

Hyp: The system failure was blamed on one or more software faults. (yes)

The
DT
system
NN
failure
NN
was
VBD
blamed
VBN
one
CD
or
CC
more
JJR
software
NN
faults
NNS
Software:NNP   0.71   0.94   0.89   0.82   0.82   1.29   0.71   0.95   1.67   0.92
faults:NNS   0.71   0.72   0.27   0.79   0.82   1.29   0.71   0.99   0.82   0.28
were:VBD   0.73   0.91   0.89   0.13   0.76   1.36   0.76   0.91   0.87   0.91
blamed:VBN   0.76   0.76   0.50   0.76   1.32   1.39   0.76   1.02   0.91   0.91
the:DT   0.01   1.00   1.00   1.25   1.25   1.11   1.31   1.15   1.00   1.00
system:NN   0.71   0.28   0.52   0.82   0.67   1.29   0.71   0.99   0.49   0.72
failure:NN   0.71   0.52   0.28   0.82   0.41   1.29   0.71   0.97   0.86   0.27
NO_WORD   0.82   0.05   0.15   1.57   0.17   0.52   0.56   0.15   0.05   0.02

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.42 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  9.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.10 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "faults" modifying "blamed"
-1.00  1.00 NullPunisher.other : faults
-1.00  1.00 NullPunisher.other : blamed
-0.10  1.00 NullPunisher.article : The
-3.00  1.00 NullPunisher.entity : one
-1.00  1.00 NullPunisher.other : or
-0.05  1.00 NullPunisher.aux : was
-1.00  1.00 NullPunisher.other : failure
-1.00  1.00 NullPunisher.other : system
-1.00  1.00 NullPunisher.other : more
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '>=1.0' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "blamed" not aligned to anything
Hand-tuned score (dot product of above): -19.0219
Threshold: -9.4738


Inference ID: 098

Txt: Software faults were blamed for the system failure. Bug # 32-985 is a known software fault.

Hyp: Bug # 32-985 was blamed for the system failure. (don't know)

Bug
NN
#
#
32-985
CD
was
VBD
blamed
VBN
the
DT
system
NN
failure
NN
Software:NNP   0.88   0.96   1.29   0.82   0.82   0.71   0.94   0.89
faults:NNS   0.85   0.96   1.29   0.79   0.82   0.71   0.72   0.27
were:VBD   0.91   1.01   1.39   0.13   0.76   0.73   0.91   0.89
blamed:VBN   0.91   0.96   1.39   0.76   1.32   0.76   0.76   0.50
the:DT   1.00   1.56   1.18   1.25   1.25   2.42   1.00   1.00
system:NN   0.83   0.96   1.29   0.82   0.67   0.71   0.28   0.52
failure:NN   0.87   0.96   1.29   0.82   0.41   0.71   0.52   0.28
Bug:NN   0.28   0.96   1.29   0.82   0.82   0.71   0.83   0.87
#:#   1.25   2.42   0.75   1.51   1.46   1.56   1.25   1.25
32-985:CD   1.33   0.46   0.85   1.30   1.30   0.89   1.32   1.33
is:VBZ   0.91   1.01   1.39   0.05   0.76   0.76   0.91   0.91
a:DT   1.00   1.56   1.18   1.25   1.25   1.31   1.00   1.00
known:JJ   0.88   1.02   1.13   0.96   0.86   0.76   0.67   0.62
software:NN   0.88   0.96   1.29   0.82   0.82   0.71   0.49   0.86
fault:NN   0.85   0.96   1.25   0.82   0.53   0.71   0.73   0.06
NO_WORD   0.15   0.56   0.26   1.57   0.17   0.82   0.05   0.05

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.34 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  8.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "failure" modifying "blamed"
-3.00  1.00 NullPunisher.entity : #
-1.00  1.00 NullPunisher.other : failure
-1.00  1.00 NullPunisher.other : Bug
-3.00  1.00 NullPunisher.entity : 32-985
-0.05  1.00 NullPunisher.aux : was
-1.00  1.00 NullPunisher.other : system
-1.00  1.00 NullPunisher.other : blamed
-0.10  1.00 NullPunisher.article : the
-2.00  1.00 RootEntailment.unalignedRoot : "blamed" not aligned to anything
Hand-tuned score (dot product of above): -14.1423
Threshold: -9.4738


Inference ID: 099

Txt: Clients at the demonstration were all impressed by the system's performance. Smith was a client at the demonstration.

Hyp: Smith was impressed by the system's performance. (yes)

Smith
NNP
was
VBD
impressed
VBN
the
DT
system
NN
performance
NN
Clients:NNS   1.01   0.82   0.82   0.71   0.87   0.91
the:DT   1.25   1.25   1.25   2.42   1.00   1.00
demonstration:NN   1.17   0.82   0.72   0.71   0.84   0.85
were:VBD   1.16   0.13   0.89   0.73   0.91   0.91
all:RB   1.47   0.91   0.91   1.09   1.21   1.21
impressed:VBN   1.16   0.89   1.32   0.76   0.89   0.76
the:DT   1.25   1.25   1.25   2.42   1.00   1.00
system:NN   1.05   0.82   0.80   0.71   0.28   0.96
performance:NN   1.15   0.82   0.67   0.71   0.96   0.28
Smith:NNP   0.28   1.07   1.07   0.96   1.05   1.15
was:VBD   1.16   1.32   0.89   0.76   0.91   0.91
a:DT   1.25   1.25   1.25   1.31   1.00   1.00
client:NN   1.02   0.82   0.70   0.71   0.87   0.88
the:DT   1.25   1.25   1.25   2.42   1.00   1.00
demonstration:NN   1.17   0.82   0.72   0.71   0.84   0.85
NO_WORD   0.15   1.57   0.17   0.82   0.12   0.21

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.39 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : impressed
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : performance
-1.00  1.00 NullPunisher.other : system
-0.05  1.00 NullPunisher.aux : was
-3.00  1.00 NullPunisher.entity : Smith
-2.00  1.00 RootEntailment.unalignedRoot : "impressed" not aligned to anything
Hand-tuned score (dot product of above): -8.8786
Threshold: -9.4738


Inference ID: 100

Txt: Clients at the demonstration were impressed by the system's performance.

Hyp: Most clients at the demonstration were impressed by the system's performance. (yes)

Most
JJS
clients
NNS
the
DT
demonstration
NN
were
VBD
impressed
VBN
the
DT
system
NN
performance
NN
Clients:NNS   0.99   1.53   0.71   0.92   0.82   0.82   0.71   0.87   0.91
the:DT   1.18   1.00   2.42   1.00   1.22   1.25   2.42   1.00   1.00
demonstration:NN   0.99   0.92   0.71   0.28   0.82   0.72   0.71   0.84   0.85
were:VBD   1.02   0.91   0.73   0.91   1.32   0.89   0.73   0.91   0.91
impressed:VBN   1.02   0.89   0.76   0.81   0.89   1.32   0.76   0.89   0.76
the:DT   1.18   1.00   2.42   1.00   1.22   1.25   2.42   1.00   1.00
system:NN   0.94   0.87   0.71   0.84   0.82   0.80   0.71   0.28   0.96
performance:NN   0.99   0.82   0.71   0.77   0.82   0.67   0.71   0.96   0.28
NO_WORD   0.11   0.15   0.82   0.20   1.57   0.17   0.82   0.12   0.21

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.57 Alignment.score
 1.00  0.27 Alignment.isGood
-1.00  0.71 Alignment.isBad
-0.10  8.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.11 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "demonstration" modifying "clients"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "demonstration" modifying "Clients" is dropped on aligned hypothesis word "clients"
 0.00  1.00 NegPolarity.hypNegWord : "Most": tag "JJS" is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : Most
-1.00  1.00 NullPunisher.other : demonstration
-0.05  1.00 NullPunisher.aux : were
-1.00  1.00 NullPunisher.other : performance
-0.10  1.00 NullPunisher.article : the
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : system
-1.00  1.00 NullPunisher.other : impressed
-2.00  1.00 RootEntailment.unalignedRoot : "impressed" not aligned to anything
Hand-tuned score (dot product of above): -8.3133
Threshold: -9.4738


Inference ID: 101

Txt: University graduates make poor stock-market traders. Smith is a university graduate.

Hyp: Smith is likely to make a poor stock market trader. (yes)

Smith
NNP
is
VBZ
likely
JJ
to
TO
make
VB
a
DT
poor
JJ
stock-market
NN
trader
NN
University:NNP   1.13   0.82   0.96   0.71   0.82   0.71   0.99   0.87   0.90
graduates:NNS   1.02   0.82   0.90   0.71   0.79   0.71   0.92   0.79   0.71
make:VBP   1.14   0.84   0.78   0.76   2.10   0.76   0.88   0.83   0.86
poor:JJ   1.13   0.96   0.93   0.76   0.82   0.76   0.74   0.78   0.79
stock-market:NN   1.08   0.82   0.86   0.71   0.74   0.71   0.88   0.28   0.60
traders:NNS   1.06   0.82   0.61   0.71   0.80   0.71   0.99   0.53   1.22
Smith:NNP   0.28   1.07   1.24   0.96   1.05   0.96   1.24   1.08   1.06
is:VBZ   1.16   1.32   1.02   0.76   0.92   0.76   1.02   0.91   0.91
a:DT   1.25   1.25   1.18   1.31   1.25   2.42   1.18   1.00   1.00
university:NN   1.13   0.82   0.96   0.71   0.82   0.71   0.99   0.87   0.90
graduate:NN   1.02   0.82   0.99   0.71   0.82   0.71   0.98   0.80   0.72
NO_WORD   0.28   1.43   0.16   1.55   0.34   0.82   0.11   0.05   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.95 Alignment.score
 1.00  0.35 Alignment.isGood
-1.00  0.62 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.33 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "poor" modifying "trader"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "poor" modifying "traders" is dropped on aligned hypothesis word "trader"
-0.10  1.00 NullPunisher.functionWord : to
-0.05  1.00 NullPunisher.aux : is
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : likely
-1.00  1.00 NullPunisher.other : poor
-3.00  1.00 NullPunisher.entity : Smith
-2.00  1.00 RootEntailment.unalignedRoot : "likely" not aligned to anything
Hand-tuned score (dot product of above): -7.4405
Threshold: -9.4738


Inference ID: 102

Txt: University graduates make poor stock-market traders. Smith is a university graduate.

Hyp: Smith will make a poor stock market trader. (don't know)

Smith
NNP
will
MD
make
VB
a
DT
poor
JJ
stock-market
NN
trader
NN
University:NNP   1.13   0.71   0.82   0.71   0.99   0.87   0.90
graduates:NNS   1.02   0.71   0.79   0.71   0.92   0.79   0.71
make:VBP   1.14   1.00   2.10   0.76   0.88   0.83   0.86
poor:JJ   1.13   0.76   0.82   0.76   0.74   0.78   0.79
stock-market:NN   1.08   0.71   0.74   0.71   0.88   0.28   0.60
traders:NNS   1.06   0.71   0.80   0.71   0.99   0.53   1.22
Smith:NNP   0.28   0.94   1.05   0.96   1.24   1.08   1.06
is:VBZ   1.16   1.11   0.92   0.76   1.02   0.91   0.91
a:DT   1.25   1.31   1.25   2.42   1.18   1.00   1.00
university:NN   1.13   0.71   0.82   0.71   0.99   0.87   0.90
graduate:NN   1.02   0.71   0.82   0.71   0.98   0.80   0.72
NO_WORD   0.28   1.55   0.17   0.82   0.11   0.05   0.09

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  1.00 Alignment.score
 1.00  0.36 Alignment.isGood
-1.00  0.61 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.43 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "poor" modifying "trader"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "poor" modifying "traders" is dropped on aligned hypothesis word "trader"
-0.10  1.00 NullPunisher.article : a
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : poor
-0.05  1.00 NullPunisher.aux : will
Hand-tuned score (dot product of above): -4.0655
Threshold: -9.4738


Inference ID: 103

Txt: All APCOM managers have company cars. Jones is an APCOM manager.

Hyp: Jones has a company car. (yes)

Jones
NNP
has
VBZ
a
DT
company
NN
car
NN
All:DT   1.25   1.25   1.31   1.00   1.00
APCOM:NNP   0.95   1.07   0.96   1.07   1.07
managers:NNS   0.93   0.82   0.71   0.85   0.86
have:VBP   1.14   0.19   0.76   0.91   0.87
company:NN   1.20   0.82   0.71   0.28   0.80
cars:NNS   1.09   0.72   0.71   0.86   1.20
Jones:NNP   0.28   1.07   0.96   1.20   1.12
is:VBZ   1.16   1.01   0.76   0.91   0.91
an:DT   1.25   1.21   1.21   1.00   0.95
APCOM:NNP   0.95   1.07   0.96   1.07   1.07
manager:NN   0.95   0.82   0.71   0.96   0.86
NO_WORD   0.28   0.17   0.82   0.05   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.68 Alignment.score
 1.00  0.29 Alignment.isGood
-1.00  0.69 Alignment.isBad
-0.10  1.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.80 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "APCOM" modifying "managers" is dropped on aligned hypothesis word "Jones"
-0.10  1.00 NullPunisher.article : a
Hand-tuned score (dot product of above): 0.8655
Threshold: -9.4738


Inference ID: 104

Txt: All APCOM managers have company cars. Jones is an APCOM manager.

Hyp: Jones has more than one company car. (don't know)

Jones
NNP
has
VBZ
more_than
IN
one
CD
company
NN
car
NN
All:DT   1.25   1.25   1.05   1.18   1.00   1.00
APCOM:NNP   0.95   1.07   0.75   1.20   1.07   1.07
managers:NNS   0.93   0.82   0.48   1.29   0.85   0.86
have:VBP   1.14   0.19   1.15   1.36   0.91   0.87
company:NN   1.20   0.82   0.47   1.29   0.28   0.80
cars:NNS   1.09   0.72   0.50   1.29   0.86   1.20
Jones:NNP   0.28   1.07   0.75   1.09   1.20   1.12
is:VBZ   1.16   1.01   1.15   1.39   0.91   0.91
an:DT   1.25   1.21   1.05   1.14   1.00   0.95
APCOM:NNP   0.95   1.07   0.75   1.20   1.07   1.07
manager:NN   0.95   0.82   0.50   1.29   0.96   0.86
NO_WORD   0.28   0.17   0.84   0.52   0.05   0.09

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.49 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.67 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "one" modifying "car"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "APCOM" modifying "managers" is dropped on aligned hypothesis word "Jones"
-1.00  1.00 NullPunisher.other : more_than
-3.00  1.00 NullPunisher.entity : one
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '>1.0' vs ''
Hand-tuned score (dot product of above): -10.4225
Threshold: -9.4738


Inference ID: 105

Txt: Just one accountant attended the meeting.

Hyp: No accountants attended the meeting. (don't know)

No
DT
accountants
NNS
attended
VBD
the
DT
meeting
NN
Just:RB   1.09   1.21   0.91   1.09   1.21
one:CD   0.85   1.33   1.30   0.82   1.33
accountant:NN   0.71   1.32   0.77   0.71   0.87
attended:VBD   0.76   0.82   1.32   0.76   0.57
the:DT   1.31   1.00   1.25   2.42   1.00
meeting:NN   0.71   0.89   0.48   0.71   0.28
NO_WORD   0.82   0.28   0.17   0.82   0.09

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.64 Alignment.score
 1.00  0.28 Alignment.isGood
-1.00  0.69 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.20 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "one" modifying "accountant" is dropped on aligned hypothesis word "accountants"
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : meeting
-1.00  1.00 NullPunisher.other : No
-1.00  1.00 NullPunisher.other : attended
-6.00  1.00 Quant.oneNo : [one,no]
-2.00  1.00 RootEntailment.unalignedRoot : "attended" not aligned to anything
Hand-tuned score (dot product of above): -10.6506
Threshold: -9.4738


Inference ID: 106

Txt: Just one accountant attended the meeting.

Hyp: No accountant attended the meeting. (don't know)

No
DT
accountant
NN
attended
VBD
the
DT
meeting
NN
Just:RB   1.09   1.21   0.91   1.09   1.21
one:CD   0.85   1.33   1.30   0.82   1.33
accountant:NN   0.71   0.28   0.77   0.71   0.87
attended:VBD   0.76   0.86   1.32   0.76   0.57
the:DT   1.31   1.00   1.25   2.42   1.00
meeting:NN   0.71   0.87   0.48   0.71   0.28
NO_WORD   0.82   0.28   0.17   0.82   0.09

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.32 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : accountant
-1.00  1.00 NullPunisher.other : attended
-1.00  1.00 NullPunisher.other : meeting
-1.00  1.00 NullPunisher.other : No
-2.00  1.00 RootEntailment.unalignedRoot : "attended" not aligned to anything
Hand-tuned score (dot product of above): -6.8134
Threshold: -9.4738


Inference ID: 107

Txt: Just one accountant attended the meeting.

Hyp: Some accountants attended the meeting. (yes)

Some
DT
accountants
NNS
attended
VBD
the
DT
meeting
NN
Just:RB   1.09   1.21   0.91   1.09   1.21
one:CD   0.79   1.33   1.30   0.82   1.33
accountant:NN   0.71   1.32   0.77   0.71   0.87
attended:VBD   0.76   0.82   1.32   0.76   0.57
the:DT   1.28   1.00   1.25   2.42   1.00
meeting:NN   0.71   0.89   0.48   0.71   0.28
NO_WORD   0.82   0.28   0.17   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.64 Alignment.score
 1.00  0.28 Alignment.isGood
-1.00  0.69 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.20 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "one" modifying "accountant" is dropped on aligned hypothesis word "accountants"
-1.00  1.00 NullPunisher.other : attended
-1.00  1.00 NullPunisher.other : meeting
-1.00  1.00 NullPunisher.other : Some
-0.10  1.00 NullPunisher.article : the
 1.00  1.00 Quant.equivalent : Replacing the quantifier "one" by an equivalent quantifier "some" preserves truth.
-2.00  1.00 RootEntailment.unalignedRoot : "attended" not aligned to anything
Hand-tuned score (dot product of above): -3.6506
Threshold: -9.4738


Inference ID: 108

Txt: Just one accountant attended the meeting.

Hyp: Some accountant attended the meeting. (yes)

Some
DT
accountant
NN
attended
VBD
the
DT
meeting
NN
Just:RB   1.09   1.21   0.91   1.09   1.21
one:CD   0.79   1.33   1.30   0.82   1.33
accountant:NN   0.71   0.28   0.77   0.71   0.87
attended:VBD   0.76   0.86   1.32   0.76   0.57
the:DT   1.28   1.00   1.25   2.42   1.00
meeting:NN   0.71   0.87   0.48   0.71   0.28
NO_WORD   0.82   0.28   0.17   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.32 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : accountant
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : meeting
-1.00  1.00 NullPunisher.other : attended
-1.00  1.00 NullPunisher.other : Some
-2.00  1.00 RootEntailment.unalignedRoot : "attended" not aligned to anything
Hand-tuned score (dot product of above): -6.8134
Threshold: -9.4738


Inference ID: 109

Txt: Just one accountant attended the meeting.

Hyp: Some accountants attended the meeting. (don't know)

Some
DT
accountants
NNS
attended
VBD
the
DT
meeting
NN
Just:RB   1.09   1.21   0.91   1.09   1.21
one:CD   0.79   1.33   1.30   0.82   1.33
accountant:NN   0.71   1.32   0.77   0.71   0.87
attended:VBD   0.76   0.82   1.32   0.76   0.57
the:DT   1.28   1.00   1.25   2.42   1.00
meeting:NN   0.71   0.89   0.48   0.71   0.28
NO_WORD   0.82   0.28   0.17   0.82   0.09

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.64 Alignment.score
 1.00  0.28 Alignment.isGood
-1.00  0.69 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.20 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "one" modifying "accountant" is dropped on aligned hypothesis word "accountants"
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : attended
-1.00  1.00 NullPunisher.other : meeting
-1.00  1.00 NullPunisher.other : Some
 1.00  1.00 Quant.equivalent : Replacing the quantifier "one" by an equivalent quantifier "some" preserves truth.
-2.00  1.00 RootEntailment.unalignedRoot : "attended" not aligned to anything
Hand-tuned score (dot product of above): -3.6506
Threshold: -9.4738


Inference ID: 110

Txt: Just one accountant attended the meeting.

Hyp: Some accountant attended the meeting. (yes)

Some
DT
accountant
NN
attended
VBD
the
DT
meeting
NN
Just:RB   1.09   1.21   0.91   1.09   1.21
one:CD   0.79   1.33   1.30   0.82   1.33
accountant:NN   0.71   0.28   0.77   0.71   0.87
attended:VBD   0.76   0.86   1.32   0.76   0.57
the:DT   1.28   1.00   1.25   2.42   1.00
meeting:NN   0.71   0.87   0.48   0.71   0.28
NO_WORD   0.82   0.28   0.17   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.32 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : Some
-1.00  1.00 NullPunisher.other : attended
-1.00  1.00 NullPunisher.other : accountant
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : meeting
-2.00  1.00 RootEntailment.unalignedRoot : "attended" not aligned to anything
Hand-tuned score (dot product of above): -6.8134
Threshold: -9.4738


Inference ID: 111

Txt: Smith signed one contract. Jones signed another contract.

Hyp: Smith and Jones signed two contracts. (yes)

Smith
NNP
Jones
NNP
signed
VBD
two
CD
contracts
NNS
Smith:NNP   0.28   0.70   1.05   1.20   1.16
signed:VBD   1.14   1.10   1.32   1.26   0.57
one:CD   1.24   1.13   1.28   0.48   1.33
contract:NN   1.16   1.16   0.36   1.21   1.44
Jones:NNP   0.70   0.28   1.01   1.20   1.15
signed:VBD   1.14   1.10   1.32   1.26   0.57
another:DT   1.22   1.22   1.24   1.18   1.00
contract:NN   1.16   1.16   0.36   1.21   1.44
NO_WORD   0.28   0.01   0.17   0.52   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.31 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.40 Alignment.txtSpan
-3.00  1.00 NullPunisher.entity : Jones
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : signed
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs '1.0'
-2.00  1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -15.2891
Threshold: -9.4738


Inference ID: 112

Txt: Smith signed two contracts. Jones signed two contracts.

Hyp: Smith and Jones signed two contracts. (yes)

Smith
NNP
Jones
NNP
signed
VBD
two
CD
contracts
NNS
Smith:NNP   0.28   0.70   1.05   1.20   1.16
signed:VBD   1.14   1.10   1.32   1.26   0.57
two:CD   1.24   1.24   1.17   0.85   1.15
contracts:NNS   1.16   1.15   0.48   1.11   0.28
Jones:NNP   0.70   0.28   1.01   1.20   1.15
signed:VBD   1.14   1.10   1.32   1.26   0.57
two:CD   1.24   1.24   1.17   0.85   1.15
contracts:NNS   1.16   1.15   0.48   1.11   0.28
NO_WORD   0.28   0.01   0.17   0.52   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00 -0.11 Alignment.score
 1.00  0.16 Alignment.isGood
-1.00  0.83 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "two" modifying "contracts"
-3.00  1.00 NullPunisher.entity : Smith
-3.00  1.00 NullPunisher.entity : two
-1.00  1.00 NullPunisher.other : signed
-3.00  1.00 NullPunisher.entity : Jones
-1.00  1.00 NullPunisher.other : contracts
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -21.2778
Threshold: -9.4738


Inference ID: 113

Txt: Smith signed two contracts. Jones also signed them.

Hyp: Smith and Jones signed two contracts. (yes)

Smith
NNP
Jones
NNP
signed
VBD
two
CD
contracts
NNS
Smith:NNP   0.28   0.70   1.05   1.20   1.16
signed:VBD   1.14   1.10   1.32   1.26   0.57
two:CD   1.24   1.24   1.17   0.85   1.15
contracts:NNS   1.16   1.15   0.48   1.11   0.28
Jones:NNP   0.70   0.28   1.01   1.20   1.15
also:RB   1.47   1.47   0.91   1.33   1.21
signed:VBD   1.14   1.10   1.32   1.26   0.57
them:PRP   1.55   2.74   1.21   1.43   1.30
NO_WORD   0.28   0.01   0.17   0.52   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00 -0.11 Alignment.score
 1.00  0.16 Alignment.isGood
-1.00  0.83 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "two" modifying "contracts"
-1.00  1.00 NullPunisher.other : signed
-1.00  1.00 NullPunisher.other : contracts
-3.00  1.00 NullPunisher.entity : Smith
-3.00  1.00 NullPunisher.entity : Jones
-3.00  1.00 NullPunisher.entity : two
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -21.2778
Threshold: -9.4738


Inference ID: 114

Txt: Mary used her workstation.

Hyp: Mary's workstation was used. (yes)

Mary
NNP
workstation
NN
was
VBD
used
VBN
Mary:NNP   0.28   1.04   1.04   1.07
used:VBD   1.16   0.69   0.93   1.57
her:PRP$   2.73   1.30   1.27   0.94
workstation:NN   1.04   0.28   0.82   0.60
NO_WORD   0.12   0.15   1.57   0.17

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.89 Alignment.score
 1.00  0.33 Alignment.isGood
-1.00  0.64 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.50 Alignment.txtSpan
-0.05  1.00 NullPunisher.aux : was
-3.00  1.00 NullPunisher.entity : Mary
Hand-tuned score (dot product of above): -2.5091
Threshold: -9.4738


Inference ID: 115

Txt: Mary used her workstation.

Hyp: Mary has a workstation. (yes)

Mary
NNP
has
VBZ
a
DT
workstation
NN
Mary:NNP   0.28   1.04   0.96   1.04
used:VBD   1.16   0.79   0.76   0.69
her:PRP$   2.73   1.20   1.41   1.30
workstation:NN   1.04   0.82   0.71   0.28
NO_WORD   0.28   0.17   0.82   0.25

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.24 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-0.05  1.00 NullPunisher.aux : has
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : workstation
-3.00  1.00 NullPunisher.entity : Mary
-2.00  1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -6.8749
Threshold: -9.4738


Inference ID: 116

Txt: Mary used her workstation.

Hyp: Mary is female. (yes)

Mary
NNP
is
VBZ
female
JJ
Mary:NNP   0.28   1.07   1.20
used:VBD   1.16   0.93   1.02
her:PRP$   2.73   0.97   1.39
workstation:NN   1.04   0.82   0.99
NO_WORD   0.28   1.43   0.16

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.44 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : Mary
-1.00  1.00 NullPunisher.other : female
-0.05  1.00 NullPunisher.aux : is
-2.00  1.00 RootEntailment.unalignedRoot : "female" not aligned to anything
Hand-tuned score (dot product of above): -6.4057
Threshold: -9.4738


Inference ID: 117

Txt: Every student used her workstation. Mary is a student.

Hyp: Mary used her workstation. (yes)

Mary
NNP
used
VBD
her
PRP$
workstation
NN
Every:DT   1.18   1.23   1.11   1.00
student:NN   1.03   0.63   1.58   0.79
used:VBD   1.16   1.32   0.51   0.69
her:PRP$   1.52   1.28   1.05   1.30
workstation:NN   1.04   0.60   1.58   0.28
Mary:NNP   0.28   1.07   1.80   1.04
is:VBZ   1.16   0.93   0.54   0.91
a:DT   1.25   1.25   1.16   1.00
student:NN   1.03   0.63   1.58   0.79
NO_WORD   0.28   0.17   0.93   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.23 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : Mary
-0.10  1.00 NullPunisher.functionWord : her
-1.00  1.00 NullPunisher.other : used
-1.00  1.00 NullPunisher.other : workstation
-2.00  1.00 RootEntailment.unalignedRoot : "used" not aligned to anything
Hand-tuned score (dot product of above): -7.8399
Threshold: -9.4738


Inference ID: 118

Txt: Every student used her workstation. Mary is a student.

Hyp: Mary has a workstation. (yes)

Mary
NNP
has
VBZ
a
DT
workstation
NN
Every:DT   1.18   1.25   1.31   1.00
student:NN   1.03   0.82   0.71   0.79
used:VBD   1.16   0.79   0.76   0.69
her:PRP$   1.52   0.90   1.11   1.30
workstation:NN   1.04   0.82   0.71   0.28
Mary:NNP   0.28   1.04   0.96   1.04
is:VBZ   1.16   1.01   0.76   0.91
a:DT   1.25   1.25   2.42   1.00
student:NN   1.03   0.82   0.71   0.79
NO_WORD   0.28   0.17   0.82   0.25

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.24 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : Mary
-0.05  1.00 NullPunisher.aux : has
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : workstation
-2.00  1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -6.8749
Threshold: -9.4738


Inference ID: 119

Txt: No student used her workstation. Mary is a student.

Hyp: Mary used a workstation. (don't know)

Mary
NNP
used
VBD
a
DT
workstation
NN
No:DT   1.25   1.25   1.32   1.00
student:NN   1.03   0.63   0.71   0.79
used:VBD   1.16   1.32   0.76   0.69
her:PRP$   1.52   1.28   1.11   1.30
workstation:NN   1.04   0.60   0.71   0.28
Mary:NNP   0.28   1.07   0.96   1.04
is:VBZ   1.16   0.93   0.76   0.91
a:DT   1.25   1.25   2.42   1.00
student:NN   1.03   0.63   0.71   0.79
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.20 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-0.10  1.00 NullPunisher.article : a
-3.00  1.00 NullPunisher.entity : Mary
-1.00  1.00 NullPunisher.other : workstation
-1.00  1.00 NullPunisher.other : used
-2.00  1.00 RootEntailment.unalignedRoot : "used" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738


Inference ID: 120

Txt: Smith attended a meeting. She chaired it.

Hyp: Smith chaired a meeting. (yes)

Smith
NNP
chaired
VBD
a
DT
meeting
NN
Smith:NNP   0.28   1.07   0.96   1.15
attended:VBD   1.16   0.43   0.76   0.57
a:DT   1.25   1.25   2.42   1.00
meeting:NN   1.15   0.52   0.71   0.28
She:PRP   2.76   0.97   1.11   1.30
chaired:VBD   1.16   1.32   0.76   0.61
it:PRP   1.52   0.97   1.10   1.30
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.20 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : meeting
-3.00  1.00 NullPunisher.entity : Smith
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : chaired
-2.00  1.00 RootEntailment.unalignedRoot : "chaired" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738


Inference ID: 121

Txt: Smith delivered a report to ITEL. She also delivered them an invoice. And she delivered them a project proposal.

Hyp: Smith delivered a report, an invoice and a project proposal to ITEL. (yes)

Smith
NNP
delivered
VBD
a
DT
report
NN
an
DT
invoice
NN
a
DT
project
NN
proposal
NN
to
TO
ITEL
NNP
Smith:NNP   0.28   1.07   0.96   1.14   0.96   1.15   0.96   1.18   1.16   0.96   0.93
delivered:VBD   1.16   1.32   0.76   0.82   0.76   0.65   0.76   0.91   0.89   0.76   1.16
a:DT   1.25   1.25   2.42   1.00   1.21   1.00   2.42   1.00   1.00   1.31   1.25
report:NN   1.14   0.73   0.71   0.28   0.71   0.54   0.71   0.87   0.76   0.71   1.07
to:TO   1.25   1.25   1.31   1.00   1.40   1.00   1.31   1.00   1.00   2.42   1.25
ITEL:NNP   0.93   1.07   0.96   1.07   0.96   1.07   0.96   1.07   1.07   0.96   0.28
She:PRP   2.76   0.97   1.11   1.30   1.11   1.30   1.11   1.30   1.30   1.35   1.52
also:RB   1.47   0.91   1.09   1.21   1.09   1.21   1.09   1.21   1.21   1.09   1.47
delivered:VBD   1.16   1.32   0.76   0.82   0.76   0.65   0.76   0.91   0.89   0.76   1.16
them:PRP   2.76   0.97   1.11   1.30   1.11   1.30   1.11   1.30   1.30   1.35   1.50
an:DT   1.25   1.25   1.21   1.00   2.42   1.00   1.21   1.00   1.00   1.40   1.25
invoice:NN   1.15   0.56   0.71   0.54   0.71   0.28   0.71   0.93   0.73   0.71   1.07
she:PRP   2.76   0.97   1.11   1.30   1.11   1.30   1.11   1.30   1.30   1.35   1.52
delivered:VBD   1.16   1.32   0.76   0.82   0.76   0.65   0.76   0.91   0.89   0.76   1.16
them:PRP   2.76   0.97   1.11   1.30   1.11   1.30   1.11   1.30   1.30   1.35   1.50
a:DT   1.25   1.25   2.42   1.00   1.21   1.00   2.42   1.00   1.00   1.31   1.25
project:NN   1.18   0.82   0.71   0.87   0.71   0.93   0.71   0.28   0.77   0.71   1.07
proposal:NN   1.16   0.80   0.71   0.76   0.71   0.73   0.71   0.77   0.28   0.71   1.07
NO_WORD   0.28   0.17   0.82   0.09   0.82   0.01   0.82   0.05   0.01   0.56   0.25

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.29 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  11.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "project" modifying "proposal"
-3.00  1.00 NullPunisher.entity : Smith
-0.10  1.00 NullPunisher.functionWord : to
-0.10  1.00 NullPunisher.article : an
-3.00  1.00 NullPunisher.entity : ITEL
-1.00  1.00 NullPunisher.other : report
-1.00  1.00 NullPunisher.other : invoice
-1.00  1.00 NullPunisher.other : proposal
-1.00  1.00 NullPunisher.other : delivered
-0.10  1.00 NullPunisher.article : a
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : project
-2.00  1.00 RootEntailment.unalignedRoot : "delivered" not aligned to anything
Hand-tuned score (dot product of above): -15.7571
Threshold: -9.4738


Inference ID: 122

Txt: Every committee has a chairman. He is appointed its members.

Hyp: Every committee has a chairman appointed by members of the committee. (yes)

Every
DT
committee
NN
has
VBZ
a
DT
chairman
NN
appointed
VBN
members
NNS
the
DT
committee
NN
Every:DT   2.42   1.00   1.25   1.31   1.00   1.25   0.92   1.31   1.00
committee:NN   0.71   0.28   0.82   0.71   0.39   0.71   0.39   0.71   0.28
has:VBZ   0.76   0.91   1.32   0.76   0.91   0.77   0.91   0.76   0.91
a:DT   1.31   1.00   1.25   2.42   1.00   1.25   1.06   1.31   1.00
chairman:NN   0.71   0.39   0.82   0.71   0.28   0.31   0.52   0.71   0.39
He:PRP   1.11   2.50   0.93   1.11   1.56   1.37   1.43   1.25   2.50
is:VBZ   0.76   0.91   1.01   0.76   0.91   0.81   0.91   0.76   0.91
appointed:VBN   0.76   0.80   0.77   0.76   0.40   1.32   0.81   0.76   0.80
its:PRP$   1.26   1.45   1.20   1.25   2.50   0.97   1.30   1.26   1.45
members:NNS   0.63   0.39   0.82   0.77   0.52   0.72   0.28   0.77   0.39
NO_WORD   0.82   0.28   0.17   0.82   0.09   0.16   0.30   0.82   0.04

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.26 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  8.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.11 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "committee" modifying "members"
-1.00  1.00 NullPunisher.other : committee
-1.00  1.00 NullPunisher.other : Every
-0.05  1.00 NullPunisher.aux : has
-1.00  1.00 NullPunisher.other : chairman
-0.10  1.00 NullPunisher.article : a
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : appointed
-1.00  1.00 NullPunisher.other : committee
-2.00  1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -9.2357
Threshold: -9.4738


Inference ID: 123

Txt: ITEL has sent most of the reports Smith needs. They are on her desk.

Hyp: There are some reports from ITEL on Smith's desk. (yes)

There
EX
are
VBP
some
DT
reports
NNS
ITEL
NNP
Smith
NNP
desk
NN
ITEL:NNP   0.94   1.04   0.96   1.07   0.28   0.93   1.07
has:VBZ   0.76   1.05   0.76   0.91   1.16   1.16   0.87
sent:VBN   0.73   0.88   0.70   0.55   1.16   1.09   0.66
most:JJS   0.76   0.96   0.70   0.86   1.13   1.11   0.82
of:IN   1.05   1.24   1.05   1.31   1.57   1.57   1.31
the:DT   1.20   1.18   1.28   1.00   1.22   1.25   1.00
reports:NNS   0.71   0.82   0.71   0.28   1.07   1.14   0.73
Smith:NNP   0.96   1.07   0.94   1.14   0.93   0.28   1.08
needs:VBZ   0.71   0.82   0.76   0.87   1.14   1.16   0.83
They:PRP   0.13   0.94   1.11   1.30   1.50   1.55   1.30
are:VBP   0.70   1.32   0.73   0.91   1.13   1.16   0.91
her:PRP$   1.00   0.97   1.11   1.30   2.73   1.55   1.26
desk:NN   0.68   0.82   0.71   0.73   1.07   1.08   0.28
NO_WORD   0.29   0.17   0.82   0.28   0.15   0.12   0.02

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.14 Alignment.score
 1.00  0.19 Alignment.isGood
-1.00  0.79 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "desk" modifying "ITEL"
-0.05  1.00 NullPunisher.aux : are
-1.00  1.00 NullPunisher.other : reports
-0.10  1.00 NullPunisher.functionWord : There
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : desk
-1.00  1.00 NullPunisher.other : some
-3.00  1.00 NullPunisher.entity : ITEL
-2.00  1.00 RootEntailment.unalignedRoot : "are" not aligned to anything
Hand-tuned score (dot product of above): -13.3141
Threshold: -9.4738


Inference ID: 124

Txt: Two out of ten machines are missing. They have been removed.

Hyp: Two machines have been removed. (yes)

Two
CD
machines
NNS
have
VBP
been
VBN
removed
VBN
Two:CD   0.85   1.33   1.30   1.30   1.30
ten:NN   1.22   0.88   0.82   0.72   0.74
machines:NNS   1.29   0.28   0.82   0.82   0.80
are:VBP   1.39   0.91   0.89   0.13   0.91
missing:VBG   1.39   0.83   1.83   0.82   0.42
They:PRP   1.43   2.50   1.27   1.07   1.12
have:VBP   1.39   0.91   1.32   0.98   0.83
been:VBN   1.39   0.91   0.98   1.32   0.89
removed:VBN   1.39   0.89   0.83   0.89   1.32
NO_WORD   0.52   0.15   1.30   1.57   0.17

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.47 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Two" modifying "machines"
-0.05  1.00 NullPunisher.aux : have
-3.00  1.00 NullPunisher.entity : Two
-1.00  1.00 NullPunisher.other : removed
-0.05  1.00 NullPunisher.aux : been
-1.00  1.00 NullPunisher.other : machines
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "removed" not aligned to anything
Hand-tuned score (dot product of above): -14.6058
Threshold: -9.4738


Inference ID: 125

Txt: Two out of ten machines are missing. They have been removed.

Hyp: Eight machines have been removed. (don't know)

Eight
CD
machines
NNS
have
VBP
been
VBN
removed
VBN
Two:CD   0.35   1.33   1.30   1.30   1.30
ten:NN   1.29   0.88   0.82   0.72   0.74
machines:NNS   1.29   0.28   0.82   0.82   0.80
are:VBP   1.39   0.91   0.89   0.13   0.91
missing:VBG   1.39   0.83   1.83   0.82   0.42
They:PRP   1.61   2.50   1.27   1.07   1.12
have:VBP   1.39   0.91   1.32   0.98   0.83
been:VBN   1.39   0.91   0.98   1.32   0.89
removed:VBN   1.39   0.89   0.83   0.89   1.32
NO_WORD   0.52   0.15   1.30   1.57   0.17

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.51 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.20 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "machines" modifying "Two" is dropped on aligned hypothesis word "Eight"
-0.05  1.00 NullPunisher.aux : been
-1.00  1.00 NullPunisher.other : machines
-1.00  1.00 NullPunisher.other : removed
-0.05  1.00 NullPunisher.aux : have
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '8.0' vs '2.0'
-2.00  1.00 RootEntailment.unalignedRoot : "removed" not aligned to anything
Hand-tuned score (dot product of above): -9.8394
Threshold: -9.4738


Inference ID: 126

Txt: Two out of ten machines are missing. They were all here yesterday.

Hyp: Ten machines were here yesterday. (yes)

Ten
CD
machines
NNS
were
VBD
here
RB
yesterday
NN
Two:CD   0.28   1.33   1.30   1.30   1.33
ten:NN   0.00   0.88   0.78   1.27   0.78
machines:NNS   1.29   0.28   0.82   1.31   0.96
are:VBP   1.39   0.91   0.11   1.02   0.91
missing:VBG   1.39   0.83   0.82   1.11   0.91
They:PRP   1.52   2.50   1.12   1.29   1.30
were:VBD   1.36   0.91   1.32   0.17   0.91
all:DT   1.18   1.00   1.25   1.45   1.00
here:RB   1.33   1.21   0.37   1.23   1.21
yesterday:NN   1.29   0.96   0.82   1.31   0.28
NO_WORD   0.52   0.28   0.31   0.04   0.12

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.08 Alignment.score
 1.00  0.18 Alignment.isGood
-1.00  0.80 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.60 Alignment.txtSpan
-1.00  1.00 NullPunisher.other : yesterday
-1.00  1.00 NullPunisher.other : machines
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '10.0' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "yesterday" not aligned to anything
Hand-tuned score (dot product of above): -10.4837
Threshold: -9.4738


Inference ID: 127

Txt: Smith took a machine on Tuesday, and Jones took a machine on Wednesday. They put them in the lobby.

Hyp: Smith and Jones put two machines in the lobby. (yes)

Smith
NNP
Jones
NNP
put
VBD
two
CD
machines
NNS
the
DT
lobby
NN
Smith:NNP   0.28   0.70   1.07   1.20   1.11   0.96   1.06
took:VBD   1.16   1.14   0.24   0.93   0.91   0.73   0.78
a:DT   1.25   1.25   1.25   1.18   1.00   1.31   1.00
machine:NN   1.11   1.13   0.54   1.29   1.40   0.71   0.81
Tuesday:NNP   1.04   1.05   1.07   1.20   1.18   0.96   1.12
Jones:NNP   0.70   0.28   1.07   1.20   1.08   0.96   1.03
took:VBD   1.16   1.14   0.24   0.93   0.91   0.73   0.78
a:DT   1.25   1.25   1.25   1.18   1.00   1.31   1.00
machine:NN   1.11   1.13   0.54   1.29   1.40   0.71   0.81
Wednesday:NNP   1.03   1.01   1.07   1.20   1.16   0.96   1.11
They:PRP   1.73   1.70   0.97   1.43   2.33   1.30   1.27
put:VBD   1.16   1.16   1.32   1.35   0.85   0.76   0.74
them:PRP   1.73   1.70   0.97   1.43   2.33   0.80   1.30
the:DT   1.25   1.25   1.25   1.11   1.00   2.42   1.00
lobby:NN   1.06   1.03   0.65   1.29   0.81   0.71   0.28
NO_WORD   0.28   0.01   0.17   0.52   0.09   0.82   0.07

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.22 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.14 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "lobby" modifying "put"
-3.00  1.00 NullPunisher.entity : Smith
-3.00  1.00 NullPunisher.entity : two
-1.00  1.00 NullPunisher.other : lobby
-1.00  1.00 NullPunisher.other : put
-0.10  1.00 NullPunisher.article : the
-3.00  1.00 NullPunisher.entity : Jones
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "put" not aligned to anything
Hand-tuned score (dot product of above): -20.9395
Threshold: -9.4738


Inference ID: 128

Txt: John and his colleagues went to a meeting. They hated it.

Hyp: John's colleagues hated the meeting. (yes)

John
NNP
colleagues
NNS
hated
VBZ
the
DT
meeting
NN
John:NNP   0.28   1.08   1.07   0.93   1.18
his:PRP$   2.76   1.30   0.97   1.41   1.30
colleagues:NNS   1.08   0.28   0.80   0.71   0.60
went:VBD   1.16   0.81   0.19   0.76   0.86
a:DT   1.25   1.00   1.25   1.31   1.00
meeting:NN   1.18   0.60   0.82   0.71   0.28
They:PRP   1.67   2.50   1.19   1.07   1.42
hated:VBZ   1.16   0.89   1.32   0.76   0.91
it:PRP   1.55   1.30   0.97   1.11   1.30
NO_WORD   0.12   0.28   0.17   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.18 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : John
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : colleagues
-1.00  1.00 NullPunisher.other : meeting
-1.00  1.00 NullPunisher.other : hated
-2.00  1.00 RootEntailment.unalignedRoot : "hated" not aligned to anything
Hand-tuned score (dot product of above): -9.0010
Threshold: -9.4738


Inference ID: 129

Txt: John and his colleagues went to a meeting. They hated it.

Hyp: John hated the meeting. (yes)

John
NNP
hated
NNP
the
DT
meeting
NN
John:NNP   0.28   1.07   0.93   1.18
his:PRP$   2.76   1.30   1.41   1.30
colleagues:NNS   1.08   0.80   0.71   0.60
went:VBD   1.16   0.40   0.76   0.86
a:DT   1.25   1.00   1.31   1.00
meeting:NN   1.18   0.82   0.71   0.28
They:PRP   1.67   1.51   1.07   1.42
hated:VBZ   1.16   1.75   0.76   0.91
it:PRP   1.55   1.30   1.11   1.30
NO_WORD   0.05   0.12   0.82   0.25

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.69 Alignment.score
 1.00  0.29 Alignment.isGood
-1.00  0.68 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "John" modifying "hated"
-1.00  1.00 NullPunisher.other : meeting
-3.00  1.00 NullPunisher.entity : John
-0.10  1.00 NullPunisher.article : the
Hand-tuned score (dot product of above): -4.9782
Threshold: -9.4738


Inference ID: 130

Txt: John and his colleagues went to a meeting. They hated it.

Hyp: John hated the meeting. (don't know)

John
NNP
hated
NNP
the
DT
meeting
NN
John:NNP   0.28   1.07   0.93   1.18
his:PRP$   2.76   1.30   1.41   1.30
colleagues:NNS   1.08   0.80   0.71   0.60
went:VBD   1.16   0.40   0.76   0.86
a:DT   1.25   1.00   1.31   1.00
meeting:NN   1.18   0.82   0.71   0.28
They:PRP   1.67   1.51   1.07   1.42
hated:VBZ   1.16   1.75   0.76   0.91
it:PRP   1.55   1.30   1.11   1.30
NO_WORD   0.05   0.12   0.82   0.25

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.69 Alignment.score
 1.00  0.29 Alignment.isGood
-1.00  0.68 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "John" modifying "hated"
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : meeting
-3.00  1.00 NullPunisher.entity : John
Hand-tuned score (dot product of above): -4.9782
Threshold: -9.4738


Inference ID: 131

Txt: Each department has a dedicated line. They rent them from BT.

Hyp: Every department rents a line from BT. (yes)

Every
DT
department
NN
rents
VBZ
a
DT
line
NN
BT
NNP
Each:DT   1.28   1.00   1.23   1.31   1.00   1.25
department:NN   0.71   0.28   0.80   0.71   0.79   1.07
has:VBZ   0.76   0.91   0.70   0.76   0.91   1.16
a:DT   1.31   1.00   1.25   2.42   1.00   1.25
dedicated:JJ   0.76   0.82   0.93   0.76   0.83   1.13
line:NN   0.71   0.79   0.71   0.71   0.28   1.07
They:PRP   1.04   1.30   0.97   1.11   2.50   1.55
rent:VB   0.73   0.87   1.51   0.76   0.75   1.16
them:PRP   1.08   1.30   0.97   1.11   2.50   1.55
BT:NNP   0.96   1.07   1.07   0.96   1.07   0.28
NO_WORD   0.82   0.28   0.17   0.82   0.09   0.15

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.47 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "BT" modifying "rents"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "them" modifying "rent" is dropped on aligned hypothesis word "rents"
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : line
-1.00  1.00 NullPunisher.other : department
-3.00  1.00 NullPunisher.entity : BT
-1.00  1.00 NullPunisher.other : Every
Hand-tuned score (dot product of above): -6.9986
Threshold: -9.4738


Inference ID: 132

Txt: Each department has a dedicated line. The sales department rents it from BT.

Hyp: The sales department rents a line from BT. (yes)

The
DT
sales_department
NN
rents
VBZ
a
DT
line
NN
BT
NNP
Each:DT   1.31   1.00   1.23   1.31   1.00   1.25
department:NN   0.71   1.41   0.80   0.71   0.79   1.07
has:VBZ   0.76   0.91   0.70   0.76   0.91   1.16
a:DT   1.31   1.00   1.25   2.42   1.00   1.25
dedicated:JJ   0.76   0.85   0.93   0.76   0.83   1.13
line:NN   0.67   0.76   0.71   0.71   0.28   1.07
The:DT   2.42   1.00   1.25   1.31   0.96   1.25
sales_department:NN   0.71   0.28   0.82   0.71   0.76   1.07
rents:VBZ   0.76   0.91   1.32   0.76   0.80   1.16
it:PRP   1.18   2.50   1.20   1.18   1.45   1.44
BT:NNP   0.96   1.07   1.07   0.96   1.07   0.28
NO_WORD   0.82   0.28   0.17   0.82   0.09   0.15

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.53 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "BT" modifying "rents"
-1.00  1.00 Hypernym.posNarrow : Narrowing a term (from "department" to "sales_department") does NOT preserve truth in a positive context
-3.00  1.00 NullPunisher.entity : BT
-1.00  1.00 NullPunisher.other : rents
-1.00  1.00 NullPunisher.other : line
-0.10  1.00 NullPunisher.article : a
-0.10  1.00 NullPunisher.article : The
-2.00  1.00 RootEntailment.unalignedRoot : "rents" not aligned to anything
Hand-tuned score (dot product of above): -9.5190
Threshold: -9.4738


Inference ID: 133

Txt: GFI owns several computers. ITEL maintains them.

Hyp: ITEL maintains all the computers that GFI owns. (yes)

ITEL
NNP
maintains
VBZ
all
PDT
the
DT
computers
NNS
that
IN
GFI
NNP
owns
VBZ
GFI:NNP   0.82   1.07   0.96   0.96   1.07   0.75   0.28   1.07
owns:VBZ   1.16   0.46   0.76   0.76   0.91   1.15   1.16   1.32
several:JJ   1.11   0.96   0.76   0.76   0.88   1.05   1.13   0.96
computers:NNS   1.07   0.77   0.71   0.71   0.28   0.50   1.07   0.82
ITEL:NNP   0.28   1.07   0.93   0.93   1.07   0.75   0.82   1.07
maintains:VBZ   1.16   1.32   0.76   0.76   0.86   1.15   1.16   0.46
them:PRP   1.50   1.10   1.11   0.59   2.50   1.07   1.55   1.24
NO_WORD   0.28   0.17   0.41   0.82   0.09   0.60   0.28   0.36

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.24 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  8.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "owns" modifying "computers"
-1.00  1.00 NullPunisher.other : computers
-1.00  1.00 NullPunisher.other : all
-3.00  1.00 NullPunisher.entity : ITEL
-3.00  1.00 NullPunisher.entity : GFI
-1.00  1.00 NullPunisher.other : owns
-0.10  1.00 NullPunisher.functionWord : that
-1.00  1.00 NullPunisher.other : maintains
-0.10  1.00 NullPunisher.article : the
-2.00  1.00 RootEntailment.unalignedRoot : "maintains" not aligned to anything
Hand-tuned score (dot product of above): -14.3284
Threshold: -9.4738


Inference ID: 134

Txt: Every customer who owns a computer has a service contract for it. MFI is a customer that owns exactly one computer.

Hyp: MFI has a service contract for all its computers. (yes)

MFI
NNP
has
VBZ
a
DT
service_contract
NN
all
PDT
its
PRP$
computers
NNS
Every:DT   1.25   1.25   1.31   1.00   1.31   1.16   1.00
customer:NN   1.07   0.82   0.71   0.77   0.71   1.58   0.53
who:WP   1.55   0.97   1.11   1.30   1.11   0.71   1.30
owns:VBZ   1.16   0.46   0.76   0.91   0.76   0.51   0.91
a:DT   1.25   1.25   2.42   1.00   1.31   1.15   1.00
computer:NN   1.07   0.82   0.71   0.82   0.71   1.58   1.65
has:VBZ   1.16   1.32   0.76   0.91   0.76   0.47   0.91
a:DT   1.25   1.25   2.42   1.00   1.31   1.15   1.00
service_contract:NN   1.07   0.82   0.71   0.28   0.71   1.58   0.82
it:PRP   1.55   0.97   1.10   1.30   1.11   0.35   1.30
MFI:NNP   0.28   1.07   0.96   1.07   0.96   1.84   1.07
is:VBZ   1.16   1.01   0.76   0.91   0.76   0.18   0.91
a:DT   1.25   1.25   2.42   1.00   1.31   1.15   1.00
customer:NN   1.07   0.82   0.71   0.77   0.71   1.58   0.53
that:WDT   1.25   1.16   1.31   1.00   1.31   1.16   1.00
owns:VBZ   1.16   0.46   0.76   0.91   0.76   0.51   0.91
exactly:RB   1.47   0.91   1.09   1.20   1.09   1.05   1.21
one:CD   1.24   1.30   0.89   1.33   0.89   1.32   1.33
computer:NN   1.07   0.82   0.71   0.82   0.71   1.58   1.65
NO_WORD   0.28   0.17   0.82   0.09   0.41   0.93   0.05

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.54 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.14 Alignment.txtSpan
-1.00  1.00 NullPunisher.other : service_contract
-0.05  1.00 NullPunisher.aux : has
-1.00  1.00 NullPunisher.other : all
-3.00  1.00 NullPunisher.entity : MFI
-0.10  1.00 NullPunisher.article : a
-0.10  1.00 NullPunisher.functionWord : its
-7.00  1.00 Quant.expand : Invalid quantifier strengthening: replacing the quantifier "a" by a stronger quantifier "all" does NOT preserve truth.
-2.00  1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -14.6474
Threshold: -9.4738


Inference ID: 135

Txt: Every customer who owns a computer has a service contract for it. MFI is a customer that owns several computers.

Hyp: MFI has a service contract for all its computers. (yes)

MFI
NNP
has
VBZ
a
DT
service_contract
NN
all
PDT
its
PRP$
computers
NNS
Every:DT   1.25   1.25   1.31   1.00   1.31   1.16   1.00
customer:NN   1.07   0.82   0.71   0.77   0.71   1.58   0.53
who:WP   1.55   0.97   1.11   1.30   1.11   0.71   1.30
owns:VBZ   1.16   0.46   0.76   0.91   0.76   0.51   0.91
a:DT   1.25   1.25   2.42   1.00   1.31   1.15   1.00
computer:NN   1.07   0.82   0.71   0.82   0.71   1.58   1.65
has:VBZ   1.16   1.32   0.76   0.91   0.76   0.47   0.91
a:DT   1.25   1.25   2.42   1.00   1.31   1.15   1.00
service_contract:NN   1.07   0.82   0.71   0.28   0.71   1.58   0.82
it:PRP   1.55   0.97   1.10   1.30   1.11   0.35   1.30
MFI:NNP   0.28   1.07   0.96   1.07   0.96   1.84   1.07
is:VBZ   1.16   1.01   0.76   0.91   0.76   0.18   0.91
a:DT   1.25   1.25   2.42   1.00   1.31   1.15   1.00
customer:NN   1.07   0.82   0.71   0.77   0.71   1.58   0.53
that:WDT   1.25   1.16   1.31   1.00   1.31   1.16   1.00
owns:VBZ   1.16   0.46   0.76   0.91   0.76   0.51   0.91
several:JJ   1.13   0.96   0.76   0.85   0.76   1.05   0.88
computers:NNS   1.07   0.82   0.71   0.82   0.71   1.58   0.28
NO_WORD   0.28   0.17   0.82   0.09   0.41   0.93   0.05

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.54 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.14 Alignment.txtSpan
-0.05  1.00 NullPunisher.aux : has
-1.00  1.00 NullPunisher.other : service_contract
-0.10  1.00 NullPunisher.article : a
-3.00  1.00 NullPunisher.entity : MFI
-0.10  1.00 NullPunisher.functionWord : its
-1.00  1.00 NullPunisher.other : all
-7.00  1.00 Quant.expand : Invalid quantifier strengthening: replacing the quantifier "a" by a stronger quantifier "all" does NOT preserve truth.
-2.00  1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -14.6474
Threshold: -9.4738


Inference ID: 136

Txt: Every executive who had a laptop computer brought it to take notes at the meeting. Smith is a executive who owns five different laptop computers.

Hyp: Smith took five laptop computers to the meeting. (don't know)

Smith
NNP
took
VBD
five
CD
laptop_computers
NNS
the
DT
meeting
NN
Every:DT   1.25   1.25   1.16   1.00   1.31   1.00
executive:NN   0.98   0.79   1.25   0.83   0.71   0.80
who:WP   1.55   0.94   1.46   1.30   1.04   1.30
had:VBD   1.16   0.41   1.39   0.91   0.76   0.91
a:DT   1.25   1.25   1.18   1.00   1.31   1.00
laptop_computer:NN   1.05   0.82   1.28   2.31   0.71   0.86
brought:VBD   1.16   0.18   1.38   0.91   0.76   0.91
it:PRP   1.52   0.97   1.46   1.30   1.11   1.30
to:TO   1.25   1.18   1.18   1.00   1.26   1.00
take_notes:VB   1.16   0.40   1.26   0.84   0.76   0.81
the:DT   1.25   1.22   1.15   1.00   2.42   1.00
meeting:NN   1.15   0.82   1.29   0.86   0.71   0.28
Smith:NNP   0.28   1.07   1.18   1.05   0.96   1.15
is:VBZ   1.16   0.78   1.39   0.91   0.76   0.91
a:DT   1.25   1.25   1.18   1.00   1.31   1.00
executive:NN   0.98   0.79   1.25   0.83   0.71   0.80
who:WP   1.55   0.94   1.46   1.30   1.04   1.30
owns:VBZ   1.16   0.79   1.39   0.91   0.76   0.91
five:CD   1.21   1.04   0.85   1.33   0.86   1.33
different:JJ   1.13   0.96   0.90   0.73   0.76   0.82
laptop_computers:NNS   1.05   0.82   1.29   0.28   0.71   0.86
NO_WORD   0.28   0.17   0.52   0.09   0.82   0.74

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.54 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.33 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "meeting" modifying "took"
-1.00  1.00 NullPunisher.other : meeting
-3.00  1.00 NullPunisher.entity : Smith
-3.00  1.00 NullPunisher.entity : five
-0.10  1.00 NullPunisher.article : the
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '5.0' vs ''
-3.00  1.00 Structure.argsMismatch : args have different parents but same relations: text "laptop_computer" <-dobj-- "had" vs. hyp "laptop_computers" <-dobj-- "took", which aligned to text "brought"
Hand-tuned score (dot product of above): -17.2796
Threshold: -9.4738


Inference ID: 137

Txt: There are 100 companies. ICM is one of the companies and owns 150 computers. It does not have service contracts for any of its computers. Each of the other 99 companies owns one computer. They have service contracts for them.

Hyp: Most companies that own a computer have a service contract for it. (yes)

Most
JJS
companies
NNS
that
WDT
own
VBP
a
DT
computer
NN
have
VBP
a
DT
service_contract
NN
it
PRP
There:EX   1.18   1.00   1.23   1.25   1.31   1.00   1.18   1.31   1.00   1.16
are:VBP   1.02   0.91   0.76   0.96   0.76   0.91   0.89   0.76   0.91   0.54
100:CD   1.00   1.28   0.89   1.30   0.89   1.33   1.30   0.89   1.18   1.32
companies:NNS   0.99   0.28   0.71   0.82   0.71   0.68   0.82   0.71   0.82   1.58
ICM:NNP   1.24   1.07   0.96   1.07   0.96   1.07   1.07   0.96   1.07   1.79
is:VBZ   1.02   0.91   0.76   0.96   0.76   0.91   0.98   0.76   0.91   0.43
one:CD   0.97   1.33   0.89   1.23   0.89   1.33   1.27   0.89   1.33   1.32
the:DT   1.18   1.00   1.21   1.25   1.31   1.00   1.22   1.31   1.00   1.16
companies:NNS   0.99   0.28   0.71   0.82   0.71   0.68   0.82   0.71   0.82   1.58
owns:VBZ   1.02   0.88   0.76   0.67   0.76   0.91   0.50   0.76   0.91   0.54
150:CD   1.00   1.33   0.89   1.30   0.89   1.33   1.30   0.89   1.23   1.32
computers:NNS   0.99   0.78   0.71   0.82   0.71   1.65   0.82   0.71   0.82   1.58
It:PRP   1.39   1.30   1.11   0.97   1.10   1.30   0.97   1.10   1.30   0.73
does:VBZ   0.96   0.89   0.76   0.81   0.76   0.91   0.86   0.76   0.84   0.54
not:RB   0.95   1.21   1.06   0.91   1.09   1.21   0.91   1.09   1.21   1.00
have:VB   1.02   0.91   0.70   0.50   0.76   0.91   0.68   0.76   0.91   0.54
service_contracts:NNS   0.99   0.62   0.71   0.82   0.71   0.82   0.82   0.71   2.11   1.58
any:DT   1.18   1.00   1.31   1.25   1.31   1.00   1.22   1.31   1.00   1.16
its:PRP$   1.36   1.30   1.11   0.97   1.10   1.30   0.97   1.10   1.30   0.35
computers:NNS   0.99   0.78   0.71   0.82   0.71   1.65   0.82   0.71   0.82   1.58
Each:DT   1.18   1.00   1.31   1.25   1.31   1.00   1.20   1.31   1.00   1.16
the:DT   1.18   1.00   1.21   1.25   1.31   1.00   1.22   1.31   1.00   1.16
other:JJ   0.93   0.88   0.69   0.96   0.76   0.83   0.93   0.76   0.88   1.05
99:CD   1.00   1.33   0.89   1.30   0.89   1.23   1.30   0.89   1.13   1.32
companies:NNS   0.99   0.28   0.71   0.82   0.71   0.68   0.82   0.71   0.82   1.58
owns:VBZ   1.02   0.88   0.76   0.67   0.76   0.91   0.50   0.76   0.91   0.54
one:CD   0.97   1.33   0.89   1.23   0.89   1.33   1.27   0.89   1.33   1.32
computer:NN   0.99   0.68   0.71   0.82   0.71   0.28   0.82   0.71   0.82   1.58
They:PRP   1.52   2.50   1.13   1.24   1.24   1.97   1.24   1.24   1.30   0.84
have:VBP   1.02   0.91   0.70   0.50   0.76   0.91   1.32   0.76   0.91   0.54
service_contracts:NNS   0.99   0.62   0.71   0.82   0.71   0.82   0.82   0.71   2.11   1.58
them:PRP   1.52   2.50   1.13   1.24   1.24   1.97   1.24   1.24   1.30   0.98
NO_WORD   0.11   0.28   0.41   0.25   0.82   0.09   0.17   0.82   0.09   0.87

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.95 Alignment.score
 1.00  0.35 Alignment.isGood
-1.00  0.63 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.40 Alignment.txtSpan
 0.00  1.00 NegPolarity.hypNegWord : "Most": tag "JJS" is in NegPolarityMarkers list
 0.00  1.00 NegPolarity.txtNegWord : "have": has child with relation "neg"
 0.00  1.00 NegPolarity.txtNegRoot : "have": has child with relation "neg"
-1.00  1.00 NullPunisher.other : it
-0.10  1.00 NullPunisher.functionWord : that
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : Most
-1.00  1.00 NullPunisher.other : companies
-0.10  1.00 NullPunisher.article : a
-6.00  1.00 Quant.oneNo : [any,a]
 0.00  1.00 NegPolarity.txtNegWord&NegPolarity.hypNegWord :
Hand-tuned score (dot product of above): -8.9901
Threshold: -9.4738


Inference ID: 138

Txt: Every report has a cover page. R-95-103 is a report. Smith signed the cover page.

Hyp: Smith signed the cover page of R-95-103. (yes)

Smith
NNP
signed
VBD
the
DT
cover
NN
page
NN
R-95-103
NN
Every:DT   1.25   1.25   1.31   0.91   1.00   1.00
report:NN   1.14   0.82   0.71   0.76   0.71   0.87
has:VBZ   1.16   0.74   0.76   0.91   0.87   0.91
a:DT   1.25   1.25   1.31   1.00   1.00   1.00
cover:NN   1.06   0.80   0.71   0.28   0.71   0.77
page:NN   1.09   0.77   0.67   0.71   0.28   0.85
R-95-103:NNP   1.12   0.66   0.71   0.77   0.85   3.06
is:VBZ   1.16   0.78   0.76   0.91   0.91   0.91
a:DT   1.25   1.25   1.31   1.00   1.00   1.00
report:NN   1.14   0.82   0.71   0.76   0.71   0.87
Smith:NNP   0.28   1.05   0.96   1.06   1.09   1.12
signed:VBD   1.14   1.32   0.76   0.89   0.86   0.75
the:DT   1.25   1.25   2.42   1.00   0.96   1.00
cover:NN   1.06   0.80   0.71   0.28   0.71   0.77
page:NN   1.09   0.77   0.67   0.71   0.28   0.85
NO_WORD   0.28   0.17   0.82   0.05   0.09   0.04

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.63 Alignment.score
 1.00  0.28 Alignment.isGood
-1.00  0.70 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "cover" modifying "page"
-1.00  1.00 NullPunisher.other : signed
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : page
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : cover
-2.00  1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -9.2654
Threshold: -9.4738


Inference ID: 139

Txt: A company director awarded himself a large payrise.

Hyp: A company director has awarded and been awarded a payrise. (yes)

A
DT
company
NN
director
NN
has
VBZ
awarded
VBN
been
VBN
awarded
VBN
a
DT
payrise
NN
A:DT   2.42   1.00   1.00   1.25   1.25   1.25   1.25   0.13   1.00
company:NN   0.71   0.28   0.92   0.82   0.82   0.82   0.82   0.71   0.82
director:NN   0.71   0.92   0.28   0.82   0.79   0.82   0.79   0.71   0.82
awarded:VBD   0.76   0.91   0.88   0.73   2.02   0.78   2.02   0.76   0.87
himself:PRP   1.11   1.26   1.28   0.97   0.97   0.97   0.97   1.11   1.30
a:DT   0.13   1.00   1.00   1.25   1.25   1.25   1.25   2.42   1.00
large:JJ   0.76   0.82   0.88   0.96   0.89   0.96   0.89   0.76   0.80
payrise:NN   0.71   0.82   0.82   0.82   0.78   0.82   0.78   0.71   0.28
NO_WORD   0.82   0.05   0.15   1.30   0.17   1.57   0.06   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.80 Alignment.score
 1.00  0.31 Alignment.isGood
-1.00  0.66 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.33 Alignment.txtSpan
-1.00  1.00 NullPunisher.other : payrise
-0.05  1.00 NullPunisher.aux : has
-0.10  1.00 NullPunisher.article : A
-1.00  1.00 NullPunisher.other : awarded
-0.10  1.00 NullPunisher.article : a
-0.05  1.00 NullPunisher.aux : been
-3.00  1.00 Structure.clearBadness : for predicate awarded, text actor set [director] overlaps hyp undergoer set [director]
-3.00  1.00 Structure.argsMismatch : awarded (verbal compl of the root) not aligned
Hand-tuned score (dot product of above): -8.2148
Threshold: -9.4738


Inference ID: 140

Txt: John said Bill had hurt himself.

Hyp: John said Bill had been hurt. (yes)

John
NNP
said
VBD
Bill
NNP
had
VBD
been
VBN
hurt
VBN
John:NNP   0.28   1.07   0.92   1.07   1.02   1.07
said:VBD   1.16   1.32   1.16   0.84   1.00   0.72
Bill:NNP   0.92   1.07   0.28   1.07   1.02   1.07
had:VBD   1.16   0.84   1.16   1.32   0.98   0.64
hurt:VBN   1.16   0.72   1.16   0.72   0.72   1.32
himself:PRP   2.76   0.97   1.53   0.97   1.27   0.97
NO_WORD   0.28   0.17   0.15   1.30   1.57   0.36

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.50 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-0.05  1.00 NullPunisher.aux : had
-1.00  1.00 NullPunisher.other : hurt
-0.05  1.00 NullPunisher.aux : been
-3.00  1.00 NullPunisher.entity : John
-3.00  1.00 NullPunisher.entity : Bill
-1.00  1.00 NullPunisher.other : said
-2.00  1.00 RootEntailment.unalignedRoot : "said" not aligned to anything
Hand-tuned score (dot product of above): -10.6755
Threshold: -9.4738


Inference ID: 141

Txt: John said Bill had hurt himself.

Hyp: Someone said John had been hurt. (don't know)

Someone
NN
said
VBD
John
NNP
had
VBD
been
VBN
hurt
VBN
John:NNP   1.14   1.07   0.28   1.07   1.02   1.07
said:VBD   0.91   1.32   1.16   0.84   1.00   0.72
Bill:NNP   1.31   1.07   0.92   1.07   1.02   1.07
had:VBD   0.91   0.84   1.16   1.32   0.98   0.64
hurt:VBN   0.91   0.72   1.16   0.72   0.72   1.32
himself:PRP   1.26   0.97   2.76   0.97   1.27   0.97
NO_WORD   0.28   0.17   0.15   1.30   1.57   0.36

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.50 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-0.05  1.00 NullPunisher.aux : been
-1.00  1.00 NullPunisher.other : Someone
-1.00  1.00 NullPunisher.other : hurt
-1.00  1.00 NullPunisher.other : said
-0.05  1.00 NullPunisher.aux : had
-3.00  1.00 NullPunisher.entity : John
-2.00  1.00 RootEntailment.unalignedRoot : "said" not aligned to anything
Hand-tuned score (dot product of above): -8.6755
Threshold: -9.4738


Inference ID: 142

Txt: John spoke to Mary. So did Bill.

Hyp: Bill spoke to Mary. (yes)

Bill
NNP
spoke
VBD
Mary
NNP
John:NNP   0.92   1.05   0.82
spoke:VBD   1.16   1.32   1.16
Mary:NNP   0.87   1.07   0.28
did:VBD   1.13   0.60   1.16
Bill:NNP   0.28   1.07   0.87
NO_WORD   0.28   0.17   0.74

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.21 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Mary" modifying "spoke"
-3.00  1.00 NullPunisher.entity : Bill
-1.00  1.00 NullPunisher.other : spoke
-3.00  1.00 NullPunisher.entity : Mary
-2.00  1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -10.6630
Threshold: -9.4738


Inference ID: 143

Txt: John spoke to Mary. So did Bill. John spoke to Mary at four o'clock.

Hyp: Bill spoke to Mary at four o'clock. (don't know)

Bill
NNP
spoke
VBD
Mary
NNP
four_o'clock
CD
John:NNP   0.92   1.05   0.82   1.20
spoke:VBD   1.16   1.32   1.16   1.22
Mary:NNP   0.87   1.07   0.28   1.20
did:VBD   1.13   0.60   1.16   1.39
Bill:NNP   0.28   1.07   0.87   1.20
John:NNP   0.92   1.05   0.82   1.20
spoke:VBD   1.16   1.32   1.16   1.22
Mary:NNP   0.87   1.07   0.28   1.20
four_o'clock:CD   1.24   1.13   1.24   0.85
NO_WORD   0.28   0.17   0.74   0.30

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.08 Alignment.score
 1.00  0.18 Alignment.isGood
-1.00  0.80 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "four_o'clock" modifying "spoke"
-1.00  1.00 NullPunisher.other : spoke
-3.00  1.00 NullPunisher.entity : Bill
-3.00  1.00 NullPunisher.entity : Mary
-3.00  1.00 NullPunisher.entity : four_o'clock
-2.00  1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -13.9336
Threshold: -9.4738


Inference ID: 144

Txt: John spoke to Mary at four o'clock. So did Bill.

Hyp: Bill spoke to Mary at four o'clock. (yes)

Bill
NNP
spoke
VBD
Mary
NNP
four_o'clock
CD
John:NNP   0.92   1.05   0.82   1.20
spoke:VBD   1.16   1.32   1.16   1.22
Mary:NNP   0.87   1.07   0.28   1.20
four_o'clock:CD   1.24   1.13   1.24   0.85
did:VBD   1.13   0.60   1.16   1.39
Bill:NNP   0.28   1.07   0.87   1.20
NO_WORD   0.28   0.17   0.74   0.30

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.08 Alignment.score
 1.00  0.18 Alignment.isGood
-1.00  0.80 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "four_o'clock" modifying "spoke"
-3.00  1.00 NullPunisher.entity : Mary
-3.00  1.00 NullPunisher.entity : Bill
-1.00  1.00 NullPunisher.other : spoke
-3.00  1.00 NullPunisher.entity : four_o'clock
-2.00  1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -13.9336
Threshold: -9.4738


Inference ID: 145

Txt: John spoke to Mary at four o'clock. And Bill did at five o'clock.

Hyp: Bill spoke to Mary at five o'clock. (yes)

Bill
NNP
spoke
VBD
Mary
NNP
five
CD
o'clock
RB
John:NNP   0.92   1.05   0.82   1.20   1.56
spoke:VBD   1.16   1.32   1.16   1.37   0.94
Mary:NNP   0.87   1.07   0.28   1.20   1.56
four_o'clock:CD   1.24   1.13   1.24   0.28   0.82
Bill:NNP   0.28   1.07   0.87   1.15   1.56
did:VBD   1.13   0.60   1.16   1.28   1.11
five:CD   1.18   1.28   1.24   0.85   1.30
o'clock:RB   1.47   0.74   1.47   1.36   1.23
NO_WORD   0.28   0.17   0.74   0.30   0.04

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.08 Alignment.score
 1.00  0.18 Alignment.isGood
-1.00  0.80 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.20 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "o'clock" modifying "five"
-1.00  1.00 NullPunisher.other : spoke
-1.00  1.00 NullPunisher.other : o'clock
-3.00  1.00 NullPunisher.entity : Mary
-3.00  1.00 NullPunisher.entity : Bill
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '5.0' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -17.8173
Threshold: -9.4738


Inference ID: 146

Txt: John has spoken to Mary. Bill is going to.

Hyp: Bill will speak to Mary. (yes)

Bill
NNP
will
MD
speak
VB
Mary
NNP
John:NNP   1.18   0.96   1.07   0.82
has:VBZ   0.91   1.05   0.86   1.13
spoken:VBN   0.91   1.00   0.44   1.16
Mary:NNP   1.13   0.96   1.07   0.28
Bill:NNP   0.28   0.57   0.82   1.13
is:VBZ   0.91   1.11   0.92   1.16
going_to:VBG   0.91   0.93   0.74   1.16
NO_WORD   0.28   1.55   0.17   0.74

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.61 Alignment.score
 1.00  0.27 Alignment.isGood
-1.00  0.70 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Mary" modifying "speak"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "Mary" modifying "spoken" is dropped on aligned hypothesis word "speak"
-0.05  1.00 NullPunisher.aux : will
-3.00  1.00 NullPunisher.entity : Mary
-1.00  1.00 NullPunisher.other : Bill
Hand-tuned score (dot product of above): -4.5368
Threshold: -9.4738


Inference ID: 147

Txt: John spoke to Mary on Monday. Bill didn't.

Hyp: Bill spoke to Mary on Monday. (don't know)

Bill
NNP
spoke
VBD
Mary
NNP
on
IN
Monday
NNP
John:NNP   1.18   1.05   0.82   0.68   1.07
spoke:VBD   0.91   1.32   1.16   1.15   1.16
Mary:NNP   1.13   1.07   0.28   0.75   0.98
on:IN   1.31   1.24   1.57   1.59   1.57
Monday:NNP   1.15   1.07   0.98   0.75   0.28
Bill:NNP   0.28   0.82   1.13   0.50   1.15
did:VBD   0.87   0.60   1.16   1.15   1.16
n't:RB   1.21   0.91   1.47   1.05   1.47
NO_WORD   0.28   0.17   0.74   1.16   0.09

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.34 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Monday" modifying "spoke"
-2.00  1.00 Date.hypDateIns : hypothesis date insertion: Monday
-1.00  1.00 NullPunisher.other : Bill
-1.00  1.00 NullPunisher.other : spoke
-3.00  1.00 NullPunisher.entity : Monday
-3.00  1.00 NullPunisher.entity : Mary
-1.00  1.00 NullPunisher.other : on
-6.00  1.00 Numeric.mismatch : DATE mismatch: '01/01/1000' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -20.6883
Threshold: -9.4738


Inference ID: 148

Txt: Has John spoken to Mary? Bill has.

Hyp: Bill has spoken to Mary. (yes)

Bill
NNP
has
VBZ
spoken
VBN
Mary
NNP
Has:NNP   1.07   0.54   0.82   1.04
John:NNP   1.09   1.07   1.03   0.82
spoken:VBN   1.16   0.86   1.32   1.16
Mary:NNP   1.04   1.04   1.07   0.28
Bill:NNP   0.28   1.07   1.07   1.04
has:VBZ   1.16   1.32   0.86   1.13
NO_WORD   0.28   1.30   0.17   0.74

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.48 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Mary" modifying "spoken"
-3.00  1.00 NullPunisher.entity : Mary
-1.00  1.00 NullPunisher.other : spoken
-3.00  1.00 NullPunisher.entity : Bill
-0.05  1.00 NullPunisher.aux : has
-2.00  1.00 RootEntailment.unalignedRoot : "spoken" not aligned to anything
Hand-tuned score (dot product of above): -10.4431
Threshold: -9.4738


Inference ID: 149

Txt: John has spoken to Mary. The students have too.

Hyp: The students have spoken to Mary. (yes)

The
DT
students
NNS
have
VBP
spoken
VBN
Mary
NNP
John:NNP   0.93   1.10   1.07   1.03   0.82
has:VBZ   0.76   0.91   0.19   0.86   1.13
spoken:VBN   0.76   0.81   0.86   1.32   1.16
Mary:NNP   0.96   1.03   1.02   1.07   0.28
The:DT   2.42   1.00   1.22   1.25   1.25
students:NNS   0.71   0.28   0.82   0.72   1.03
have:VBP   0.73   0.91   1.32   0.86   1.11
too:RB   1.02   1.21   0.91   0.91   1.47
NO_WORD   0.82   0.28   1.30   0.17   0.74

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.55 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.71 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Mary" modifying "spoken"
-0.10  1.00 NullPunisher.article : The
-1.00  1.00 NullPunisher.other : spoken
-1.00  1.00 NullPunisher.other : students
-3.00  1.00 NullPunisher.entity : Mary
-0.05  1.00 NullPunisher.aux : have
-2.00  1.00 RootEntailment.unalignedRoot : "spoken" not aligned to anything
Hand-tuned score (dot product of above): -8.5507
Threshold: -9.4738


Inference ID: 150

Txt: John went to Paris by car, and Bill by train.

Hyp: Bill went to Paris by train. (yes)

Bill
NNP
went
VBD
Paris
NNP
train
NN
John:NNP   0.92   1.07   1.01   1.02
went:VBD   1.16   1.32   1.16   0.69
Paris:NNP   1.05   1.07   0.28   1.02
car:NN   1.20   0.63   1.05   0.29
Bill:NNP   0.28   1.07   1.05   1.14
train:NN   1.14   0.60   1.02   0.28
NO_WORD   0.28   0.17   0.74   0.30

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.09 Alignment.score
 1.00  0.18 Alignment.isGood
-1.00  0.80 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Paris" modifying "went"
-3.00  1.00 NullPunisher.entity : Bill
-3.00  1.00 NullPunisher.entity : Paris
-1.00  1.00 NullPunisher.other : went
-2.00  1.00 RootEntailment.unalignedRoot : "went" not aligned to anything
Hand-tuned score (dot product of above): -10.7005
Threshold: -9.4738


Inference ID: 151

Txt: John went to Paris by car, and Bill by train to Berlin.

Hyp: Bill went to Berlin by train. (yes)

Bill
NNP
went
VBD
Berlin
NNP
train
NN
John:NNP   0.92   1.07   0.99   1.02
went:VBD   1.16   1.32   1.16   0.69
Paris:NNP   1.05   1.07   0.57   1.02
car:NN   1.20   0.63   1.09   0.29
Bill:NNP   0.28   1.07   1.00   1.14
train:NN   1.14   0.60   0.96   0.28
Berlin:NNP   1.00   1.07   0.28   0.96
NO_WORD   0.28   0.17   0.74   0.30

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.09 Alignment.score
 1.00  0.18 Alignment.isGood
-1.00  0.80 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Berlin" modifying "went"
-1.00  1.00 NullPunisher.other : went
-3.00  1.00 NullPunisher.entity : Bill
-3.00  1.00 NullPunisher.entity : Berlin
-2.00  1.00 RootEntailment.unalignedRoot : "went" not aligned to anything
Hand-tuned score (dot product of above): -10.7005
Threshold: -9.4738


Inference ID: 152

Txt: John went to Paris by car, and Bill to Berlin.

Hyp: Bill went to Berlin by car. (yes)

Bill
NNP
went
VBD
Berlin
NNP
car
NN
John:NNP   0.92   1.07   0.99   1.09
went:VBD   1.16   1.32   1.16   0.72
Paris:NNP   1.05   1.07   0.57   1.05
car:NN   1.20   0.63   1.09   0.28
Bill:NNP   0.28   1.07   1.00   1.20
to:TO   1.25   1.25   1.25   1.00
Berlin:NNP   1.00   1.07   0.28   1.02
NO_WORD   0.28   0.17   0.74   0.30

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.09 Alignment.score
 1.00  0.18 Alignment.isGood
-1.00  0.80 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Berlin" modifying "went"
-3.00  1.00 NullPunisher.entity : Bill
-1.00  1.00 NullPunisher.other : went
-3.00  1.00 NullPunisher.entity : Berlin
-2.00  1.00 RootEntailment.unalignedRoot : "went" not aligned to anything
Hand-tuned score (dot product of above): -10.7005
Threshold: -9.4738


Inference ID: 153

Txt: John is going to Paris by car, and the students by train.

Hyp: The students are going to Paris by train. (yes)

The
DT
students
NNS
are
VBP
going
VBG
Paris
NNP
train
NN
John:NNP   0.93   1.10   1.07   1.00   1.01   1.02
is:VBZ   0.76   0.91   0.02   1.00   1.13   0.91
going:VBG   0.76   0.89   0.92   1.32   1.16   0.70
Paris:NNP   0.96   1.08   1.02   1.07   0.28   1.02
car:NN   0.71   0.86   0.74   0.81   1.05   0.29
the:DT   0.01   1.00   1.18   1.25   1.25   1.00
students:NNS   0.71   0.28   0.82   0.80   1.08   0.63
train:NN   0.71   0.63   0.82   0.61   1.02   0.28
NO_WORD   0.82   0.28   1.30   0.17   0.74   0.30

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.41 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Paris" modifying "going"
-1.00  1.00 NullPunisher.other : students
-0.10  1.00 NullPunisher.article : The
-1.00  1.00 NullPunisher.other : going
-0.05  1.00 NullPunisher.aux : are
-3.00  1.00 NullPunisher.entity : Paris
-2.00  1.00 RootEntailment.unalignedRoot : "going" not aligned to anything
Hand-tuned score (dot product of above): -8.6256
Threshold: -9.4738


Inference ID: 154

Txt: John went to Paris by car. Bill by train.

Hyp: Bill went to Paris by train. (yes)

Bill
NNP
went
VBD
Paris
NNP
train
NN
John:NNP   1.18   1.07   1.01   1.02
went:VBD   0.91   1.32   1.16   0.69
Paris:NNP   1.14   1.07   0.28   1.02
car:NN   0.94   0.63   1.05   0.29
Bill:VB   0.85   0.78   1.16   0.91
train:NN   0.88   0.60   1.02   0.28
NO_WORD   0.28   0.17   0.74   0.30

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.37 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.50 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Paris" modifying "went"
-3.00  1.00 NullPunisher.entity : Paris
-1.00  1.00 NullPunisher.other : went
-2.00  1.00 RootEntailment.unalignedRoot : "went" not aligned to anything
Hand-tuned score (dot product of above): -7.1988
Threshold: -9.4738


Inference ID: 155

Txt: John owns a car. Bill owns one too.

Hyp: Bill owns a car. (yes)

Bill
NNP
owns
VBZ
a
DT
car
NN
John:NNP   1.06   1.02   0.96   1.09
owns:VBZ   1.16   1.32   0.76   0.91
a:DT   1.25   1.25   2.42   1.00
car:NN   1.20   0.82   0.71   0.28
Bill:NNP   0.28   1.07   0.96   1.20
owns:VBZ   1.16   1.32   0.76   0.91
one:CD   1.24   1.21   0.89   1.33
too:RB   1.47   0.91   1.09   1.21
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.20 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-0.10  1.00 NullPunisher.article : a
-3.00  1.00 NullPunisher.entity : Bill
-1.00  1.00 NullPunisher.other : owns
-1.00  1.00 NullPunisher.other : car
-2.00  1.00 RootEntailment.unalignedRoot : "owns" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738


Inference ID: 156

Txt: John owns a car. Bill owns one too.

Hyp: There is a car that John and Bill own. (don't know)

There
EX
is
VBZ
a
DT
car
NN
that
IN
John
NNP
Bill
NNP
own
VBP
John:NNP   0.96   1.07   0.96   1.09   0.75   0.28   1.06   0.98
owns:VBZ   0.76   0.96   0.76   0.91   1.15   1.11   1.16   0.67
a:DT   1.31   1.25   2.42   1.00   1.05   1.25   1.25   1.25
car:NN   0.71   0.82   0.71   0.28   0.47   1.09   1.20   0.82
Bill:NNP   0.96   1.07   0.96   1.20   0.75   1.06   0.28   1.07
owns:VBZ   0.76   0.96   0.76   0.91   1.15   1.11   1.16   0.67
one:CD   0.89   1.30   0.89   1.33   1.30   1.21   1.24   1.23
too:RB   1.09   0.91   1.09   1.21   1.01   1.44   1.47   0.91
NO_WORD   0.29   0.17   0.82   0.28   0.99   0.28   0.01   0.31

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.38 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-3.00  1.00 NullPunisher.entity : Bill
-0.10  1.00 NullPunisher.article : a
-0.10  1.00 NullPunisher.functionWord : There
-0.05  1.00 NullPunisher.aux : is
-1.00  1.00 NullPunisher.other : car
-0.10  1.00 NullPunisher.functionWord : that
-2.00  1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -6.9581
Threshold: -9.4738


Inference ID: 157

Txt: John owns a red car. Bill owns a blue one.

Hyp: Bill owns a blue car. (yes)

Bill
NNP
owns
VBZ
a
DT
blue
JJ
car
NN
John:NNP   0.92   1.02   0.96   1.24   1.09
owns:VBZ   1.16   1.32   0.76   1.02   0.91
a:DT   1.25   1.25   2.42   1.18   1.00
red:JJ   1.13   0.96   0.76   0.46   0.76
car:NN   1.20   0.82   0.71   0.78   0.28
Bill:NNP   0.28   1.07   0.96   1.19   1.20
owns:VBZ   1.16   1.32   0.76   1.02   0.91
a:DT   1.25   1.25   2.42   1.18   1.00
blue:JJ   1.08   0.96   0.76   0.74   0.67
one:NN   1.15   0.72   0.71   0.95   0.92
NO_WORD   0.28   0.17   0.82   0.11   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.18 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "blue" modifying "car"
-1.00  1.00 NullPunisher.other : owns
-1.00  1.00 NullPunisher.other : car
-1.00  1.00 NullPunisher.other : blue
-0.10  1.00 NullPunisher.article : a
-3.00  1.00 NullPunisher.entity : Bill
-2.00  1.00 RootEntailment.unalignedRoot : "owns" not aligned to anything
Hand-tuned score (dot product of above): -10.0020
Threshold: -9.4738


Inference ID: 158

Txt: John owns a red car. Bill owns a blue one.

Hyp: Bill owns a red car. (don't know)

Bill
NNP
owns
VBZ
a
DT
red
JJ
car
NN
John:NNP   0.92   1.02   0.96   1.24   1.09
owns:VBZ   1.16   1.32   0.76   1.02   0.91
a:DT   1.25   1.25   2.42   1.18   1.00
red:JJ   1.13   0.96   0.76   0.74   0.76
car:NN   1.20   0.82   0.71   0.87   0.28
Bill:NNP   0.28   1.07   0.96   1.24   1.20
owns:VBZ   1.16   1.32   0.76   1.02   0.91
a:DT   1.25   1.25   2.42   1.18   1.00
blue:JJ   1.08   0.96   0.76   0.46   0.67
one:NN   1.15   0.72   0.71   0.99   0.92
NO_WORD   0.28   0.17   0.82   0.11   0.09

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.18 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "red" modifying "car"
-3.00  1.00 NullPunisher.entity : Bill
-1.00  1.00 NullPunisher.other : owns
-1.00  1.00 NullPunisher.other : red
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : car
-2.00  1.00 RootEntailment.unalignedRoot : "owns" not aligned to anything
Hand-tuned score (dot product of above): -10.0020
Threshold: -9.4738


Inference ID: 159

Txt: John owns a red car. Bill owns a fast one.

Hyp: Bill owns a fast car. (yes)

Bill
NNP
owns
VBZ
a
DT
fast
JJ
car
NN
John:NNP   0.92   1.02   0.96   1.24   1.09
owns:VBZ   1.16   1.32   0.76   1.02   0.91
a:DT   1.25   1.25   2.42   1.18   1.00
red:JJ   1.13   0.96   0.76   0.93   0.76
car:NN   1.20   0.82   0.71   0.82   0.28
Bill:NNP   0.28   1.07   0.96   1.24   1.20
owns:VBZ   1.16   1.32   0.76   1.02   0.91
a:DT   1.25   1.25   2.42   1.18   1.00
fast_one:.   1.25   1.25   1.31   0.01   1.00
.:.   1.25   1.25   1.31   1.02   0.97
NO_WORD   0.28   0.17   0.82   0.11   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.18 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "fast" modifying "car"
-1.00  1.00 NullPunisher.other : owns
-1.00  1.00 NullPunisher.other : fast
-3.00  1.00 NullPunisher.entity : Bill
-1.00  1.00 NullPunisher.other : car
-0.10  1.00 NullPunisher.article : a
-2.00  1.00 RootEntailment.unalignedRoot : "owns" not aligned to anything
Hand-tuned score (dot product of above): -10.0020
Threshold: -9.4738


Inference ID: 160

Txt: John owns a red car. Bill owns a fast one.

Hyp: Bill owns a fast red car. (yes)

Bill
NNP
owns
VBZ
a
DT
fast
JJ
red
JJ
car
NN
John:NNP   0.92   1.02   0.96   1.24   1.24   1.09
owns:VBZ   1.16   1.32   0.76   1.02   1.02   0.91
a:DT   1.25   1.25   2.42   1.18   1.18   1.00
red:JJ   1.13   0.96   0.76   0.93   0.74   0.76
car:NN   1.20   0.82   0.71   0.82   0.87   0.28
Bill:NNP   0.28   1.07   0.96   1.24   1.24   1.20
owns:VBZ   1.16   1.32   0.76   1.02   1.02   0.91
a:DT   1.25   1.25   2.42   1.18   1.18   1.00
fast_one:.   1.25   1.25   1.31   0.01   1.18   1.00
.:.   1.25   1.25   1.31   1.02   1.13   0.97
NO_WORD   0.28   0.17   0.82   0.11   0.11   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.17 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "red" modifying "car"
-1.00  1.00 NullPunisher.other : red
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : fast
-1.00  1.00 NullPunisher.other : car
-3.00  1.00 NullPunisher.entity : Bill
-1.00  1.00 NullPunisher.other : owns
-2.00  1.00 RootEntailment.unalignedRoot : "owns" not aligned to anything
Hand-tuned score (dot product of above): -11.1172
Threshold: -9.4738


Inference ID: 161

Txt: John owns a red car. Bill owns a fast one.

Hyp: Bill owns a fast red car. (don't know)

Bill
NNP
owns
VBZ
a
DT
fast
JJ
red
JJ
car
NN
John:NNP   0.92   1.02   0.96   1.24   1.24   1.09
owns:VBZ   1.16   1.32   0.76   1.02   1.02   0.91
a:DT   1.25   1.25   2.42   1.18   1.18   1.00
red:JJ   1.13   0.96   0.76   0.93   0.74   0.76
car:NN   1.20   0.82   0.71   0.82   0.87   0.28
Bill:NNP   0.28   1.07   0.96   1.24   1.24   1.20
owns:VBZ   1.16   1.32   0.76   1.02   1.02   0.91
a:DT   1.25   1.25   2.42   1.18   1.18   1.00
fast_one:.   1.25   1.25   1.31   0.01   1.18   1.00
.:.   1.25   1.25   1.31   1.02   1.13   0.97
NO_WORD   0.28   0.17   0.82   0.11   0.11   0.09

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.17 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "red" modifying "car"
-1.00  1.00 NullPunisher.other : car
-1.00  1.00 NullPunisher.other : fast
-3.00  1.00 NullPunisher.entity : Bill
-1.00  1.00 NullPunisher.other : red
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : owns
-2.00  1.00 RootEntailment.unalignedRoot : "owns" not aligned to anything
Hand-tuned score (dot product of above): -11.1172
Threshold: -9.4738


Inference ID: 162

Txt: John owns a fast red car. Bill owns a slow one.

Hyp: Bill owns a slow red car. (yes)

Bill
NNP
owns
VBZ
a
DT
slow
JJ
red
JJ
car
NN
John:NNP   0.92   1.02   0.96   1.24   1.24   1.09
owns:VBZ   1.16   1.32   0.76   1.02   1.02   0.91
a:DT   1.25   1.25   2.42   1.18   1.18   1.00
fast:JJ   1.13   0.96   0.76   1.36   0.93   0.72
red:JJ   1.13   0.96   0.76   0.93   0.74   0.76
car:NN   1.20   0.82   0.71   0.95   0.87   0.28
Bill:NNP   0.28   1.07   0.96   1.24   1.24   1.20
owns:VBZ   1.16   1.32   0.76   1.02   1.02   0.91
a:DT   1.25   1.25   2.42   1.18   1.18   1.00
slow:JJ   1.13   0.96   0.76   0.74   0.93   0.84
one:NN   1.15   0.72   0.71   0.99   0.99   0.92
NO_WORD   0.28   0.17   0.82   0.11   0.11   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.17 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "red" modifying "car"
-0.10  1.00 NullPunisher.article : a
-3.00  1.00 NullPunisher.entity : Bill
-1.00  1.00 NullPunisher.other : slow
-1.00  1.00 NullPunisher.other : owns
-1.00  1.00 NullPunisher.other : car
-1.00  1.00 NullPunisher.other : red
-2.00  1.00 RootEntailment.unalignedRoot : "owns" not aligned to anything
Hand-tuned score (dot product of above): -11.1172
Threshold: -9.4738


Inference ID: 163

Txt: John had his paper accepted. Bill doesn't know why.

Hyp: Bill knows why John had his paper accepted. (don't know)

Bill
NNP
knows
VBZ
why
WRB
John
NNP
had
VBD
his
PRP$
paper
NN
accepted
VBN
John:NNP   1.18   1.05   0.93   0.28   1.07   1.84   1.17   1.07
had:VBD   0.91   0.71   0.76   1.16   1.32   0.47   0.91   0.45
his:PRP$   1.26   0.97   1.41   2.76   0.90   1.05   1.30   0.97
paper:NN   0.94   0.82   0.71   1.17   0.82   1.58   0.28   0.61
accepted:VBN   0.91   0.74   0.76   1.16   0.37   0.54   0.70   1.32
Bill:NNP   0.28   0.82   0.71   1.18   0.82   1.55   0.94   0.82
does:VBZ   0.91   0.03   0.76   1.11   0.86   0.51   0.88   0.81
n't:RB   1.21   0.79   1.09   1.47   0.91   1.05   1.21   0.91
know:VB   0.91   1.10   0.76   1.16   0.79   0.54   0.91   0.66
why:WRB   1.00   1.25   2.42   1.22   1.25   1.16   1.00   1.25
NO_WORD   0.28   0.17   0.30   0.28   0.36   0.93   0.09   0.16

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.38 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.12 Alignment.txtSpan
 0.00  1.00 NegPolarity.txtNegWord : "know": has child with relation "neg"
 0.00  1.00 NegPolarity.txtNegRoot : "know": has child with relation "neg"
-1.00  1.00 NullPunisher.other : accepted
-3.00  1.00 NullPunisher.entity : John
-0.10  1.00 NullPunisher.functionWord : his
-1.00  1.00 NullPunisher.other : why
-1.00  1.00 NullPunisher.other : paper
-0.05  1.00 NullPunisher.aux : had
-3.00  1.00 Structure.argsMismatch : had (verbal compl of the root) not aligned
Hand-tuned score (dot product of above): -9.6688
Threshold: -9.4738


Inference ID: 164

Txt: John spoke to Mary. And to Sue.

Hyp: John spoke to Sue. (yes)

John
NNP
spoke
VBD
Sue
NNP
John:NNP   0.28   1.05   1.07
spoke:VBD   1.14   1.32   0.85
Mary:NNP   0.82   1.07   1.07
to:TO   1.25   1.25   1.00
Sue:VBZ   1.16   0.64   0.85
NO_WORD   0.28   0.17   0.74

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.25 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.33 Alignment.txtSpan
-1.00  1.00 NullPunisher.other : spoke
-3.00  1.00 NullPunisher.entity : John
-2.00  1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -6.3821
Threshold: -9.4738


Inference ID: 165

Txt: John spoke to Mary. On Friday.

Hyp: John spoke to Mary on Friday. (yes)

John
NNP
spoke
VBD
Mary
NNP
on
IN
Friday
NNP
John:NNP   0.28   1.05   0.82   0.68   1.09
spoke:VBD   1.14   1.32   1.16   1.15   1.16
Mary:NNP   0.82   1.07   0.28   0.75   1.04
On:IN   1.50   1.24   1.57   0.25   1.57
Friday:NNP   1.09   1.07   1.04   0.75   0.28
NO_WORD   0.28   0.17   0.74   1.16   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.34 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Friday" modifying "spoke"
-2.00  1.00 Date.hypDateIns : hypothesis date insertion: Friday
-1.00  1.00 NullPunisher.other : on
-1.00  1.00 NullPunisher.other : spoke
-3.00  1.00 NullPunisher.entity : John
-3.00  1.00 NullPunisher.entity : Mary
-3.00  1.00 NullPunisher.entity : Friday
-6.00  1.00 Numeric.mismatch : DATE mismatch: '01/01/1000' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -22.6883
Threshold: -9.4738


Inference ID: 166

Txt: John spoke to Mary on Thursday. And on Friday.

Hyp: John spoke to Mary on Friday. (yes)

John
NNP
spoke
VBD
Mary
NNP
on
IN
Friday
NNP
John:NNP   0.28   1.05   0.82   0.68   1.09
spoke:VBD   1.14   1.32   1.16   1.15   1.16
Mary:NNP   0.82   1.07   0.28   0.75   1.04
on:IN   1.50   1.24   1.57   1.59   1.57
Thursday:NNP   1.06   1.07   1.02   0.75   0.58
And:CC   1.25   1.25   1.22   1.00   1.25
on:IN   1.50   1.24   1.57   1.59   1.57
Friday:NNP   1.09   1.07   1.04   0.75   0.28
NO_WORD   0.28   0.17   0.74   1.16   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.34 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Friday" modifying "spoke"
-2.00  1.00 Date.hypDateIns : hypothesis date insertion: Friday
-3.00  1.00 NullPunisher.entity : Mary
-3.00  1.00 NullPunisher.entity : John
-1.00  1.00 NullPunisher.other : spoke
-3.00  1.00 NullPunisher.entity : Friday
-1.00  1.00 NullPunisher.other : on
-6.00  1.00 Numeric.mismatch : DATE mismatch: '01/01/1000' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -22.6883
Threshold: -9.4738


Inference ID: 167

Txt: Twenty men work in the Sales Department. But only one woman.

Hyp: Two women work in the Sales Department. (don't know)

Two
CD
women
NNS
work
VBP
the
DT
Sales_Department
NNP
Twenty:CD   0.33   1.31   1.30   0.86   1.24
men:NNS   1.29   1.08   0.72   0.71   1.07
work:VBP   1.36   0.70   1.32   0.76   1.16
the:DT   1.11   1.00   1.25   2.42   1.25
Sales_Department:NNP   1.20   1.07   1.07   0.96   0.28
But:CC   1.18   1.00   1.25   1.31   1.25
only:RB   1.36   1.19   0.91   1.09   1.47
one:CD   0.35   1.27   1.27   0.82   1.24
woman:VBZ   1.39   0.01   0.44   0.76   1.16
NO_WORD   0.52   0.28   0.17   0.82   0.07

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.11 Alignment.score
 1.00  0.19 Alignment.isGood
-1.00  0.79 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.40 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Sales_Department" modifying "work"
-0.10  1.00 NullPunisher.article : the
-3.00  1.00 NullPunisher.entity : Sales_Department
-1.00  1.00 NullPunisher.other : work
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs '1.0'
-2.00  1.00 RootEntailment.unalignedRoot : "work" not aligned to anything
Hand-tuned score (dot product of above): -13.6544
Threshold: -9.4738


Inference ID: 168

Txt: Five men work part time. And forty five women.

Hyp: Forty five women work part time. (yes)

Forty
CD
five
CD
women
NNS
work
VBP
part
NN
time
NN
Five:CD   0.33   1.35   1.30   1.30   1.33   1.22
men:NNS   1.29   1.29   1.08   0.72   1.05   0.92
work:VBP   1.32   1.39   0.70   1.32   0.68   0.69
part:NN   1.22   1.28   1.01   0.59   0.28   1.00
time:NN   1.29   0.93   0.86   0.60   1.00   0.28
And:CC   1.18   1.18   1.00   1.25   0.96   1.00
forty:RB   0.17   0.85   1.16   0.84   1.14   0.91
five:CD   0.33   0.85   1.30   1.30   1.31   0.96
women:VBZ   1.35   1.37   1.75   0.35   0.89   0.83
NO_WORD   0.42   0.52   0.28   0.17   0.05   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.46 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  3.00 Alignment.hypSpan
 0.10  0.50 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "part" modifying "time"
-1.00  1.00 NullPunisher.other : work
-1.00  1.00 NullPunisher.other : part
-1.00  1.00 NullPunisher.other : time
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '0.0' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "work" not aligned to anything
Hand-tuned score (dot product of above): -11.9807
Threshold: -9.4738


Inference ID: 169

Txt: John found Mary before Bill.

Hyp: John found Mary before Bill found Mary. (yes)

John
NNP
found
VBD
Mary
NNP
Bill
NNP
found
VBD
Mary
NNP
John:NNP   0.28   1.00   0.82   0.92   1.00   0.82
found:VBD   1.09   1.32   1.16   1.16   1.32   1.16
Mary:NNP   0.82   1.07   0.28   0.87   1.07   0.28
Bill:NNP   0.92   1.07   0.87   0.28   1.07   0.87
NO_WORD   0.28   0.17   0.28   0.10   0.36   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.03 Alignment.score
 1.00  0.17 Alignment.isGood
-1.00  0.81 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Bill" modifying "Mary"
-3.00  1.00 NullPunisher.entity : Mary
-1.00  1.00 NullPunisher.other : found
-1.00  1.00 NullPunisher.other : found
-3.00  1.00 NullPunisher.entity : Mary
-3.00  1.00 NullPunisher.entity : Bill
-3.00  1.00 NullPunisher.entity : John
-2.00  1.00 RootEntailment.unalignedRoot : "found" not aligned to anything
Hand-tuned score (dot product of above): -18.2039
Threshold: -9.4738


Inference ID: 170

Txt: John found Mary before Bill.

Hyp: John found Mary before John found Bill. (yes)

John
NNP
found
VBD
Mary
NNP
John
NNP
found
VBD
Bill
NNP
John:NNP   0.28   1.00   0.82   0.28   1.00   0.92
found:VBD   1.09   1.32   1.16   1.09   1.32   1.16
Mary:NNP   0.82   1.07   0.28   0.82   1.07   0.87
Bill:NNP   0.92   1.07   0.87   0.92   1.07   0.28
NO_WORD   0.28   0.17   0.28   0.10   0.36   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.03 Alignment.score
 1.00  0.17 Alignment.isGood
-1.00  0.81 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "John" modifying "Mary"
-3.00  1.00 NullPunisher.entity : John
-3.00  1.00 NullPunisher.entity : Bill
-3.00  1.00 NullPunisher.entity : Mary
-3.00  1.00 NullPunisher.entity : John
-1.00  1.00 NullPunisher.other : found
-1.00  1.00 NullPunisher.other : found
-2.00  1.00 RootEntailment.unalignedRoot : "found" not aligned to anything
Hand-tuned score (dot product of above): -18.2039
Threshold: -9.4738


Inference ID: 171

Txt: John wants to know how many men work part time. And women.

Hyp: John wants to know how many women work part time. (yes)

John
NNP
wants
VBZ
to
TO
know
VB
how
WRB
many
JJ
women
NNS
work
VBP
part
NN
time
NN
John:NNP   0.28   1.07   0.96   1.07   0.93   1.24   1.05   1.02   1.27   1.18
wants:VBZ   1.16   1.32   0.76   0.60   0.76   0.95   0.76   0.60   0.74   0.81
to:TO   1.25   1.25   2.42   1.25   1.26   1.18   1.06   1.25   1.00   1.00
know:VB   1.16   0.60   0.76   1.32   0.66   1.02   0.91   0.47   0.91   0.86
how:WRB   1.22   1.25   1.26   1.16   2.42   1.18   1.00   1.22   1.00   1.00
many:JJ   1.13   0.89   0.76   0.96   0.76   0.74   0.85   0.96   0.82   0.88
men:NNS   1.19   0.73   0.71   0.82   0.71   0.89   1.08   0.72   1.05   0.92
work:VBP   1.11   0.60   0.76   0.47   0.73   1.02   0.70   1.32   0.68   0.69
part:NN   1.27   0.65   0.71   0.82   0.71   0.93   1.01   0.59   0.28   1.00
time:NN   1.18   0.72   0.71   0.77   0.71   0.99   0.86   0.60   1.00   0.28
And:CC   1.25   1.20   1.31   1.22   1.31   1.09   1.00   1.25   0.96   1.00
women:RB   1.40   0.77   1.09   0.91   1.09   1.02   2.56   0.70   1.20   1.14
NO_WORD   0.28   0.17   1.55   0.34   0.30   0.29   0.28   0.36   0.05   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.53 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  9.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.10 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "part" modifying "time"
-1.00  1.00 NullPunisher.other : many
-1.00  1.00 NullPunisher.other : how
-3.00  1.00 NullPunisher.entity : John
-1.00  1.00 NullPunisher.other : work
-0.10  1.00 NullPunisher.functionWord : to
-1.00  1.00 NullPunisher.other : wants
-1.00  1.00 NullPunisher.other : part
-1.00  1.00 NullPunisher.other : time
-1.00  1.00 NullPunisher.other : know
-2.00  1.00 RootEntailment.unalignedRoot : "wants" not aligned to anything
Hand-tuned score (dot product of above): -13.8132
Threshold: -9.4738


Inference ID: 172

Txt: John wants to know how many men work part time, and which.

Hyp: John wants to know which men work part time. (yes)

John
NNP
wants
VBZ
to
TO
know
VB
which
WDT
men
NNS
work
VBP
part
NN
time
NN
John:NNP   0.28   1.07   0.96   1.07   0.96   1.19   1.02   1.27   1.18
wants:VBZ   1.16   1.32   0.76   0.60   0.71   0.82   0.60   0.74   0.81
to:TO   1.25   1.25   2.42   1.25   1.31   1.00   1.25   1.00   1.00
know:VB   1.16   0.60   0.76   1.32   0.76   0.91   0.47   0.91   0.86
how:WRB   1.22   1.25   1.26   1.16   1.31   1.00   1.22   1.00   1.00
many:JJ   1.13   0.89   0.76   0.96   0.76   0.78   0.96   0.82   0.88
men:NNS   1.19   0.73   0.71   0.82   0.71   0.28   0.72   1.05   0.92
work:VBP   1.11   0.60   0.76   0.47   0.73   0.81   1.32   0.68   0.69
part:NN   1.27   0.65   0.71   0.82   0.71   1.05   0.59   0.28   1.00
time:NN   1.18   0.72   0.71   0.77   0.68   0.92   0.60   1.00   0.28
which:WDT   1.25   1.21   1.31   1.25   2.42   1.00   1.23   1.00   0.97
NO_WORD   0.28   0.17   1.55   0.34   0.40   0.28   0.36   0.05   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.26 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  9.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "part" modifying "time"
-1.00  1.00 NullPunisher.other : part
-1.00  1.00 NullPunisher.other : wants
-1.00  1.00 NullPunisher.other : time
-1.00  1.00 NullPunisher.other : men
-3.00  1.00 NullPunisher.entity : John
-1.00  1.00 NullPunisher.other : know
-1.00  1.00 NullPunisher.other : work
-0.10  1.00 NullPunisher.functionWord : to
-0.10  1.00 NullPunisher.functionWord : which
-2.00  1.00 RootEntailment.unalignedRoot : "wants" not aligned to anything
Hand-tuned score (dot product of above): -13.4042
Threshold: -9.4738


Inference ID: 173

Txt: Bill spoke to everyone that John did. John spoke to Mary.

Hyp: Bill spoke to Mary. (yes)

Bill
NNP
spoke
VBD
Mary
NNP
Bill:NNP   0.28   0.82   1.13
spoke:VBD   0.91   1.32   1.16
everyone:NN   0.82   0.80   1.07
that:IN   1.31   1.24   1.57
John:NNP   1.18   1.05   0.82
did:VBD   0.87   0.60   1.16
John:NNP   1.18   1.05   0.82
spoke:VBD   0.91   1.32   1.16
Mary:NNP   1.13   1.07   0.28
NO_WORD   0.28   0.17   0.74

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.21 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Mary" modifying "spoke"
-3.00  1.00 NullPunisher.entity : Mary
-1.00  1.00 NullPunisher.other : spoke
-1.00  1.00 NullPunisher.other : Bill
-2.00  1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -8.6630
Threshold: -9.4738


Inference ID: 174

Txt: Bill spoke to everyone that John did. Bill spoke to Mary.

Hyp: John spoke to Mary. (don't know)

John
NNP
spoke
VBD
Mary
NNP
Bill:NNP   1.18   0.82   1.13
spoke:VBD   1.14   1.32   1.16
everyone:NN   1.07   0.80   1.07
that:IN   1.57   1.24   1.57
John:NNP   0.28   1.05   0.82
did:VBD   1.16   0.60   1.16
Bill:NNP   1.18   0.82   1.13
spoke:VBD   1.14   1.32   1.16
Mary:NNP   0.82   1.07   0.28
NO_WORD   0.28   0.17   0.74

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.21 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Mary" modifying "spoke"
-1.00  1.00 NullPunisher.other : spoke
-3.00  1.00 NullPunisher.entity : John
-3.00  1.00 NullPunisher.entity : Mary
-2.00  1.00 RootEntailment.unalignedRoot : "spoke" not aligned to anything
Hand-tuned score (dot product of above): -10.6630
Threshold: -9.4738


Inference ID: 175

Txt: John said Mary wrote a report, and Bill did too.

Hyp: Bill said Mary wrote a report. (yes)

Bill
NNP
said
VBD
Mary
NNP
wrote
VBD
a
DT
report
NN
John:NNP   0.92   1.07   0.82   1.05   0.96   1.17
said:VBD   1.16   1.32   1.11   0.61   0.76   0.91
Mary:NNP   0.87   1.02   0.28   1.07   0.96   1.12
wrote:VBD   1.16   0.61   1.16   1.32   0.76   0.75
a:DT   1.25   1.25   1.25   1.25   2.42   1.00
report:NN   1.01   0.82   1.12   0.66   0.71   0.28
Bill:NNP   0.28   1.07   0.87   1.07   0.96   1.01
did:VBD   1.13   0.65   1.16   0.56   0.76   0.85
too:RB   1.47   0.91   1.47   0.91   1.09   1.21
NO_WORD   0.28   0.17   0.28   0.36   0.82   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.15 Alignment.score
 1.00  0.19 Alignment.isGood
-1.00  0.79 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : report
-3.00  1.00 NullPunisher.entity : Bill
-1.00  1.00 NullPunisher.other : wrote
-3.00  1.00 NullPunisher.entity : Mary
-1.00  1.00 NullPunisher.other : said
-2.00  1.00 RootEntailment.unalignedRoot : "said" not aligned to anything
Hand-tuned score (dot product of above): -12.1485
Threshold: -9.4738


Inference ID: 176

Txt: John said Mary wrote a report, and Bill did too.

Hyp: John said Bill wrote a report. (yes)

John
NNP
said
VBD
Bill
NNP
wrote
VBD
a
DT
report
NN
John:NNP   0.28   1.07   0.92   1.05   0.96   1.17
said:VBD   1.16   1.32   1.16   0.61   0.76   0.91
Mary:NNP   0.82   1.02   0.87   1.07   0.96   1.12
wrote:VBD   1.14   0.61   1.16   1.32   0.76   0.75
a:DT   1.25   1.25   1.25   1.25   2.42   1.00
report:NN   1.17   0.82   1.01   0.66   0.71   0.28
Bill:NNP   0.92   1.07   0.28   1.07   0.96   1.01
did:VBD   1.16   0.65   1.13   0.56   0.76   0.85
too:RB   1.44   0.91   1.47   0.91   1.09   1.21
NO_WORD   0.28   0.17   0.28   0.36   0.82   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.15 Alignment.score
 1.00  0.19 Alignment.isGood
-1.00  0.79 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : Bill
-3.00  1.00 NullPunisher.entity : John
-1.00  1.00 NullPunisher.other : said
-1.00  1.00 NullPunisher.other : report
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : wrote
-2.00  1.00 RootEntailment.unalignedRoot : "said" not aligned to anything
Hand-tuned score (dot product of above): -12.1485
Threshold: -9.4738


Inference ID: 177

Txt: John said that Mary wrote a report, and that Bill did too.

Hyp: Bill said Mary wrote a report. (don't know)

Bill
NNP
said
VBD
Mary
NNP
wrote
VBD
a
DT
report
NN
John:NNP   0.92   1.07   0.82   1.05   0.96   1.17
said:VBD   1.16   1.32   1.11   0.61   0.76   0.91
that:IN   1.57   1.24   1.57   1.21   1.05   1.31
Mary:NNP   0.87   1.02   0.28   1.07   0.96   1.12
wrote:VBD   1.16   0.61   1.16   1.32   0.76   0.75
a:DT   1.25   1.25   1.25   1.25   2.42   1.00
report:NN   1.01   0.82   1.12   0.66   0.71   0.28
that:IN   1.57   1.24   1.57   1.21   1.05   1.31
Bill:NNP   0.28   1.07   0.87   1.07   0.96   1.01
did:VBD   1.13   0.65   1.16   0.56   0.76   0.85
too:RB   1.47   0.91   1.47   0.91   1.09   1.21
NO_WORD   0.28   0.17   0.28   0.36   0.82   0.09

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.15 Alignment.score
 1.00  0.19 Alignment.isGood
-1.00  0.79 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : said
-1.00  1.00 NullPunisher.other : wrote
-3.00  1.00 NullPunisher.entity : Bill
-3.00  1.00 NullPunisher.entity : Mary
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : report
-2.00  1.00 RootEntailment.unalignedRoot : "said" not aligned to anything
Hand-tuned score (dot product of above): -12.1485
Threshold: -9.4738


Inference ID: 178

Txt: John wrote a report, and Bill said Peter did too.

Hyp: Bill said Peter wrote a report. (yes)

Bill
NNP
said
VBD
Peter
NNP
wrote
VBD
a
DT
report
NN
John:NNP   0.92   1.07   0.84   1.05   0.96   1.17
wrote:VBD   1.16   0.61   1.12   1.32   0.76   0.75
a:DT   1.25   1.25   1.25   1.25   2.42   1.00
report:NN   1.01   0.82   1.08   0.66   0.71   0.28
Bill:NNP   0.28   1.07   0.89   1.07   0.96   1.01
said:VBD   1.16   1.32   1.16   0.61   0.76   0.91
Peter:NNP   0.89   1.07   0.28   1.03   0.96   1.08
did:VBD   1.13   0.65   1.16   0.56   0.76   0.85
too:RB   1.47   0.91   1.47   0.91   1.09   1.21
NO_WORD   0.28   0.17   0.28   0.36   0.82   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.15 Alignment.score
 1.00  0.19 Alignment.isGood
-1.00  0.79 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : said
-3.00  1.00 NullPunisher.entity : Peter
-1.00  1.00 NullPunisher.other : wrote
-3.00  1.00 NullPunisher.entity : Bill
-1.00  1.00 NullPunisher.other : report
-2.00  1.00 RootEntailment.unalignedRoot : "said" not aligned to anything
Hand-tuned score (dot product of above): -12.1485
Threshold: -9.4738


Inference ID: 179

Txt: If John wrote a report, then Bill did too. John wrote a report.

Hyp: Bill wrote a report. (yes)

Bill
NNP
wrote
VBD
a
DT
report
NN
If:IN   1.57   1.24   1.05   1.31
John:NNP   0.92   1.05   0.96   1.17
wrote:VBD   1.16   1.32   0.76   0.75
a:DT   1.25   1.25   2.42   1.00
report:NN   1.01   0.66   0.71   0.28
then:RB   1.47   0.91   1.09   1.21
Bill:NNP   0.28   1.07   0.96   1.01
did:VBD   1.13   0.56   0.76   0.85
too:RB   1.47   0.91   1.09   1.21
John:NNP   0.92   1.05   0.96   1.17
wrote:VBD   1.16   1.32   0.76   0.75
a:DT   1.25   1.25   2.42   1.00
report:NN   1.01   0.66   0.71   0.28
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.20 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : report
-1.00  1.00 NullPunisher.other : wrote
-0.10  1.00 NullPunisher.article : a
-3.00  1.00 NullPunisher.entity : Bill
-2.00  1.00 RootEntailment.unalignedRoot : "wrote" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738


Inference ID: 180

Txt: John wanted to buy a car, and he did.

Hyp: John bought a car. (yes)

John
NNP
bought
VBD
a
DT
car
NN
John:NNP   0.28   1.03   0.96   1.09
wanted:VBD   1.16   0.53   0.76   0.73
to:TO   1.25   1.25   1.31   1.00
buy:VB   1.16   0.47   0.76   0.83
a:DT   1.25   1.25   2.42   1.00
car:NN   1.09   0.73   0.71   0.28
he:PRP   2.76   0.97   1.11   1.30
did:VBD   1.16   0.72   0.76   0.72
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.39 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.50 Alignment.txtSpan
 1.00  1.00 Factive.inPositiveEmbedding : embedded positive text
-0.10  1.00 NullPunisher.article : a
-3.00  1.00 NullPunisher.entity : John
Hand-tuned score (dot product of above): -2.2764
Threshold: -9.4738


Inference ID: 181

Txt: John needed to buy a car, and Bill did.

Hyp: Bill bought a car. (don't know)

Bill
NNP
bought
VBD
a
DT
car
NN
John:NNP   0.92   1.03   0.96   1.09
needed:VBD   1.16   0.75   0.76   0.91
to:TO   1.25   1.25   1.31   1.00
buy:VB   1.13   0.47   0.76   0.83
a:DT   1.25   1.25   2.42   1.00
car:NN   1.20   0.73   0.71   0.28
Bill:NNP   0.28   1.07   0.96   1.20
did:VBD   1.13   0.72   0.76   0.72
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.39 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.50 Alignment.txtSpan
 1.00  1.00 Factive.inPositiveEmbedding : embedded positive text
-3.00  1.00 NullPunisher.entity : Bill
-0.10  1.00 NullPunisher.article : a
Hand-tuned score (dot product of above): -2.2764
Threshold: -9.4738


Inference ID: 182

Txt: Smith represents his company and so does Jones.

Hyp: Jones represents Jones' company. (yes)

Jones
NNP
represents
VBZ
Jones
NNP
company
NN
Smith:NNP   0.70   1.07   0.70   1.18
represents:VBZ   1.16   1.32   1.16   0.84
his:PRP$   1.55   0.97   1.55   1.30
company:NN   1.20   0.75   1.20   0.28
so:RB   1.47   0.91   1.47   1.21
does:VBZ   1.04   0.70   1.04   0.83
Jones:NNP   0.28   1.07   0.28   1.20
NO_WORD   0.28   0.17   0.12   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.03 Alignment.score
 1.00  0.17 Alignment.isGood
-1.00  0.81 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : Jones
-1.00  1.00 NullPunisher.other : company
-3.00  1.00 NullPunisher.entity : Jones
-1.00  1.00 NullPunisher.other : represents
-2.00  1.00 RootEntailment.unalignedRoot : "represents" not aligned to anything
Hand-tuned score (dot product of above): -11.0087
Threshold: -9.4738


Inference ID: 183

Txt: Smith represents his company and so does Jones.

Hyp: Jones represents Smith's company. (yes)

Jones
NNP
represents
VBZ
Smith
NNP
company
NN
Smith:NNP   0.70   1.07   0.28   1.18
represents:VBZ   1.16   1.32   1.16   0.84
his:PRP$   1.55   0.97   2.76   1.30
company:NN   1.20   0.75   1.18   0.28
so:RB   1.47   0.91   1.47   1.21
does:VBZ   1.04   0.70   1.16   0.83
Jones:NNP   0.28   1.07   0.70   1.20
NO_WORD   0.28   0.17   0.12   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.03 Alignment.score
 1.00  0.17 Alignment.isGood
-1.00  0.81 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : company
-3.00  1.00 NullPunisher.entity : Jones
-1.00  1.00 NullPunisher.other : represents
-3.00  1.00 NullPunisher.entity : Smith
-2.00  1.00 RootEntailment.unalignedRoot : "represents" not aligned to anything
Hand-tuned score (dot product of above): -11.0087
Threshold: -9.4738


Inference ID: 184

Txt: Smith represents his company and so does Jones.

Hyp: Smith represents Jones' company. (don't know)

Smith
NNP
represents
VBZ
Jones
NNP
company
NN
Smith:NNP   0.28   1.07   0.70   1.18
represents:VBZ   1.16   1.32   1.16   0.84
his:PRP$   2.76   0.97   1.55   1.30
company:NN   1.18   0.75   1.20   0.28
so:RB   1.47   0.91   1.47   1.21
does:VBZ   1.16   0.70   1.04   0.83
Jones:NNP   0.70   1.07   0.28   1.20
NO_WORD   0.28   0.17   0.12   0.09

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.03 Alignment.score
 1.00  0.17 Alignment.isGood
-1.00  0.81 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : represents
-3.00  1.00 NullPunisher.entity : Smith
-3.00  1.00 NullPunisher.entity : Jones
-1.00  1.00 NullPunisher.other : company
-2.00  1.00 RootEntailment.unalignedRoot : "represents" not aligned to anything
Hand-tuned score (dot product of above): -11.0087
Threshold: -9.4738


Inference ID: 185

Txt: Smith claimed he had costed his proposal and so did Jones.

Hyp: Jones claimed he had costed his own proposal. (yes)

Jones
NNP
claimed
VBD
he
PRP
had
VBD
costed
VBN
his
PRP$
own
JJ
proposal
NN
Smith:NNP   0.70   1.07   1.84   1.07   1.05   1.84   1.24   1.16
claimed:VBD   1.16   1.32   0.54   0.78   0.71   0.54   1.02   0.91
he:PRP   1.55   0.97   1.05   0.93   0.97   0.82   1.39   1.30
had:VBD   1.16   0.78   0.50   1.32   0.94   0.47   1.02   0.91
costed:VBN   1.10   0.71   0.54   0.94   1.32   0.54   1.02   0.91
his:PRP$   1.55   0.97   0.58   0.90   0.97   1.05   1.39   1.30
proposal:NN   1.16   0.82   1.58   0.82   0.82   1.58   0.99   0.28
so:RB   1.47   0.91   1.05   0.91   0.91   1.05   1.05   1.21
did:VBD   1.16   0.43   0.54   0.79   0.88   0.47   1.02   0.87
Jones:NNP   0.28   1.07   1.84   1.07   1.01   1.84   1.24   1.16
NO_WORD   0.28   0.17   0.54   1.30   0.36   0.93   0.11   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.40 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  8.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "own" modifying "proposal"
-1.00  1.00 NullPunisher.other : own
-1.00  1.00 NullPunisher.other : costed
-3.00  1.00 NullPunisher.entity : Jones
-0.05  1.00 NullPunisher.aux : had
-1.00  1.00 NullPunisher.other : claimed
-0.10  1.00 NullPunisher.functionWord : his
-1.00  1.00 NullPunisher.other : proposal
-1.00  1.00 NullPunisher.other : he
-2.00  1.00 RootEntailment.unalignedRoot : "claimed" not aligned to anything
Hand-tuned score (dot product of above): -12.0533
Threshold: -9.4738


Inference ID: 186

Txt: Smith claimed he had costed his proposal and so did Jones.

Hyp: Jones claimed he had costed Smith's proposal. (yes)

Jones
NNP
claimed
VBD
he
PRP
had
VBD
costed
VBN
Smith
NNP
proposal
NN
Smith:NNP   0.70   1.07   1.84   1.07   1.05   0.28   1.16
claimed:VBD   1.16   1.32   0.54   0.78   0.71   1.16   0.91
he:PRP   1.55   0.97   1.05   0.93   0.97   2.76   1.30
had:VBD   1.16   0.78   0.50   1.32   0.94   1.16   0.91
costed:VBN   1.10   0.71   0.54   0.94   1.32   1.14   0.91
his:PRP$   1.55   0.97   0.58   0.90   0.97   2.76   1.30
proposal:NN   1.16   0.82   1.58   0.82   0.82   1.16   0.28
so:RB   1.47   0.91   1.05   0.91   0.91   1.47   1.21
did:VBD   1.16   0.43   0.54   0.79   0.88   1.16   0.87
Jones:NNP   0.28   1.07   1.84   1.07   1.01   0.70   1.16
NO_WORD   0.28   0.17   0.54   1.30   0.36   0.12   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.33 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : proposal
-1.00  1.00 NullPunisher.other : costed
-1.00  1.00 NullPunisher.other : he
-3.00  1.00 NullPunisher.entity : Jones
-3.00  1.00 NullPunisher.entity : Smith
-0.05  1.00 NullPunisher.aux : had
-1.00  1.00 NullPunisher.other : claimed
-2.00  1.00 RootEntailment.unalignedRoot : "claimed" not aligned to anything
Hand-tuned score (dot product of above): -12.9557
Threshold: -9.4738


Inference ID: 187

Txt: Smith claimed he had costed his proposal and so did Jones.

Hyp: Jones claimed Smith had costed Smith's proposal. (yes)

Jones
NNP
claimed
VBD
Smith
NNP
had
VBD
costed
VBN
Smith
NNP
proposal
NN
Smith:NNP   0.70   1.07   0.28   1.07   1.05   0.28   1.16
claimed:VBD   1.16   1.32   1.16   0.78   0.71   1.16   0.91
he:PRP   1.55   0.97   2.76   0.93   0.97   2.76   1.30
had:VBD   1.16   0.78   1.16   1.32   0.94   1.16   0.91
costed:VBN   1.10   0.71   1.14   0.94   1.32   1.14   0.91
his:PRP$   1.55   0.97   2.76   0.90   0.97   2.76   1.30
proposal:NN   1.16   0.82   1.16   0.82   0.82   1.16   0.28
so:RB   1.47   0.91   1.47   0.91   0.91   1.47   1.21
did:VBD   1.16   0.43   1.16   0.79   0.88   1.16   0.87
Jones:NNP   0.28   1.07   0.70   1.07   1.01   0.70   1.16
NO_WORD   0.28   0.17   0.28   1.30   0.36   0.12   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.21 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : proposal
-3.00  1.00 NullPunisher.entity : Smith
-0.05  1.00 NullPunisher.aux : had
-3.00  1.00 NullPunisher.entity : Jones
-1.00  1.00 NullPunisher.other : claimed
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : costed
-2.00  1.00 RootEntailment.unalignedRoot : "claimed" not aligned to anything
Hand-tuned score (dot product of above): -15.1125
Threshold: -9.4738


Inference ID: 188

Txt: Smith claimed he had costed his proposal and so did Jones.

Hyp: Jones claimed Smith had costed Jones' proposal. (don't know)

Jones
NNP
claimed
VBD
Smith
NNP
had
VBD
costed
VBN
Jones
NNP
proposal
NN
Smith:NNP   0.70   1.07   0.28   1.07   1.05   0.70   1.16
claimed:VBD   1.16   1.32   1.16   0.78   0.71   1.16   0.91
he:PRP   1.55   0.97   2.76   0.93   0.97   1.55   1.30
had:VBD   1.16   0.78   1.16   1.32   0.94   1.16   0.91
costed:VBN   1.10   0.71   1.14   0.94   1.32   1.10   0.91
his:PRP$   1.55   0.97   2.76   0.90   0.97   1.55   1.30
proposal:NN   1.16   0.82   1.16   0.82   0.82   1.16   0.28
so:RB   1.47   0.91   1.47   0.91   0.91   1.47   1.21
did:VBD   1.16   0.43   1.16   0.79   0.88   1.16   0.87
Jones:NNP   0.28   1.07   0.70   1.07   1.01   0.28   1.16
NO_WORD   0.28   0.17   0.28   1.30   0.36   0.12   0.09

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.21 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : Jones
-1.00  1.00 NullPunisher.other : proposal
-1.00  1.00 NullPunisher.other : costed
-0.05  1.00 NullPunisher.aux : had
-3.00  1.00 NullPunisher.entity : Jones
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : claimed
-2.00  1.00 RootEntailment.unalignedRoot : "claimed" not aligned to anything
Hand-tuned score (dot product of above): -15.1125
Threshold: -9.4738


Inference ID: 189

Txt: John is a man and Mary is a woman. John represents his company and so does Mary.

Hyp: Mary represents her own company. (yes)

Mary
NNP
represents
VBZ
her
PRP$
own
JJ
company
NN
John:NNP   0.82   1.07   1.84   1.15   1.20
is:VBZ   1.16   0.31   0.54   1.02   0.91
a:DT   1.25   1.25   1.16   1.18   1.00
man:NN   0.94   0.82   1.58   0.91   0.85
Mary:NNP   0.28   1.07   1.80   1.24   1.10
is:VBZ   1.16   0.31   0.54   1.02   0.91
a:DT   1.25   1.25   1.16   1.18   1.00
woman:NN   1.02   0.80   1.58   0.93   0.78
John:NNP   0.82   1.07   1.84   1.15   1.20
represents:VBZ   1.16   1.32   0.54   1.02   0.84
his:PRP$   1.55   0.97   0.90   1.99   1.30
company:NN   1.10   0.75   1.58   0.99   0.28
so:RB   1.47   0.91   1.05   1.05   1.21
does:VBZ   1.16   0.70   0.51   0.99   0.83
Mary:NNP   0.28   1.07   1.80   1.24   1.10
NO_WORD   0.28   0.17   0.93   0.11   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.21 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "own" modifying "company"
-3.00  1.00 NullPunisher.entity : Mary
-0.10  1.00 NullPunisher.functionWord : her
-1.00  1.00 NullPunisher.other : company
-1.00  1.00 NullPunisher.other : own
-1.00  1.00 NullPunisher.other : represents
-2.00  1.00 RootEntailment.unalignedRoot : "represents" not aligned to anything
Hand-tuned score (dot product of above): -9.9708
Threshold: -9.4738


Inference ID: 190

Txt: John is a man and Mary is a woman. John represents his company and so does Mary.

Hyp: Mary represents John's company. (yes)

Mary
NNP
represents
VBZ
John
NNP
company
NN
John:NNP   0.82   1.07   0.28   1.20
is:VBZ   1.16   0.31   1.16   0.91
a:DT   1.25   1.25   1.25   1.00
man:NN   0.94   0.82   1.08   0.85
Mary:NNP   0.28   1.07   0.82   1.10
is:VBZ   1.16   0.31   1.16   0.91
a:DT   1.25   1.25   1.25   1.00
woman:NN   1.02   0.80   1.05   0.78
John:NNP   0.82   1.07   0.28   1.20
represents:VBZ   1.16   1.32   1.16   0.84
his:PRP$   1.55   0.97   2.76   1.30
company:NN   1.10   0.75   1.20   0.28
so:RB   1.47   0.91   1.47   1.21
does:VBZ   1.16   0.70   1.11   0.83
Mary:NNP   0.28   1.07   0.82   1.10
NO_WORD   0.28   0.17   0.12   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.03 Alignment.score
 1.00  0.17 Alignment.isGood
-1.00  0.81 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : John
-1.00  1.00 NullPunisher.other : company
-3.00  1.00 NullPunisher.entity : Mary
-1.00  1.00 NullPunisher.other : represents
-2.00  1.00 RootEntailment.unalignedRoot : "represents" not aligned to anything
Hand-tuned score (dot product of above): -11.0087
Threshold: -9.4738


Inference ID: 191

Txt: Bill suggested to Frank's boss that they should go to the meeting together, and Carl to Alan's wife.

Hyp: If it was suggested that Bill and Frank should go together, it was suggested that Carl and Alan should go together. (yes)

If
IN
it
PRP
was
VBD
suggested
VBN
that
IN
Bill
NNP
Frank
NNP
should
MD
go
VB
together
RB
it
PRP
was
VBD
suggested
VBN
that
IN
Carl
NNP
Alan
NNP
should
MD
go
VB
together
RB
Bill:NNP   0.75   1.84   1.07   1.07   0.75   0.28   0.88   0.96   1.07   1.56   1.84   1.07   1.07   0.75   0.76   0.82   0.96   1.07   1.56
suggested:VBD   1.15   0.54   0.86   2.02   1.15   1.16   1.16   0.74   0.82   1.05   0.54   0.86   2.02   1.15   1.16   1.16   0.74   0.82   1.05
Frank:NNP   0.75   1.84   1.07   1.07   0.73   0.88   0.28   0.96   1.07   1.56   1.84   1.07   1.07   0.73   0.79   0.74   0.96   1.07   1.56
boss:NN   0.50   1.58   0.78   0.75   0.50   1.11   1.04   0.71   0.76   1.31   1.58   0.78   0.75   0.50   1.07   1.07   0.71   0.76   1.31
that:IN   1.05   1.57   1.21   1.24   1.59   1.57   1.54   1.05   1.24   1.23   1.57   1.21   1.24   1.59   1.57   1.51   1.05   1.24   1.23
they:PRP   1.05   0.85   1.21   0.97   1.18   1.55   2.76   1.11   0.97   1.01   0.85   1.21   0.97   1.18   1.79   2.04   1.11   0.97   1.01
should:MD   1.05   1.16   1.25   1.24   1.05   1.25   1.25   2.42   1.25   1.45   1.16   1.25   1.24   1.05   1.25   1.25   2.42   1.25   1.45
go:VB   1.15   0.54   0.92   0.82   1.15   1.16   1.16   0.76   1.32   1.11   0.54   0.92   0.82   1.15   1.16   1.16   0.76   1.32   1.11
the:DT   1.05   1.16   1.25   1.25   0.95   1.25   1.25   1.31   1.25   1.43   1.16   1.25   1.25   0.95   1.25   1.25   1.31   1.25   1.43
meeting:NN   0.50   1.58   0.82   0.41   0.50   1.17   1.14   0.71   0.75   1.26   1.58   0.82   0.41   0.50   1.07   1.07   0.71   0.75   1.26
together:RB   1.05   1.05   0.91   0.85   1.05   1.47   1.47   1.09   0.91   1.23   1.05   0.91   0.85   1.05   1.47   1.47   1.09   0.91   1.23
Carl:NNP   0.75   1.84   1.04   1.07   0.75   0.76   0.79   0.96   1.07   1.56   1.84   1.04   1.07   0.75   0.28   0.82   0.96   1.07   1.56
Alan:NNP   0.75   1.84   1.04   1.07   0.70   0.82   0.74   0.96   1.07   1.56   1.84   1.04   1.07   0.70   0.82   0.28   0.96   1.07   1.56
wife:NN   0.42   1.58   0.78   0.75   0.50   1.09   1.03   0.71   0.69   1.31   1.58   0.78   0.75   0.50   1.07   1.07   0.71   0.69   1.31
NO_WORD   0.92   0.67   1.57   0.42   0.60   0.28   0.01   1.55   0.36   0.04   0.67   1.57   0.17   0.60   0.28   0.01   1.55   0.36   0.04

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.65 Alignment.score
 1.00  0.28 Alignment.isGood
-1.00  0.69 Alignment.isBad
-0.10  18.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.05 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "together" modifying "go"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "boss" modifying "suggested" is dropped on aligned hypothesis word "suggested"
-3.00  1.00 NullPunisher.entity : Bill
-1.00  1.00 NullPunisher.other : go
-1.00  1.00 NullPunisher.other : together
-0.05  1.00 NullPunisher.aux : was
-0.10  1.00 NullPunisher.functionWord : that
-0.05  1.00 NullPunisher.aux : should
-1.00  1.00 NullPunisher.other : together
-0.10  1.00 NullPunisher.functionWord : that
-3.00  1.00 NullPunisher.entity : Carl
-0.05  1.00 NullPunisher.aux : should
-1.00  1.00 NullPunisher.other : it
-0.05  1.00 NullPunisher.aux : was
-1.00  1.00 NullPunisher.other : it
-1.00  1.00 NullPunisher.other : suggested
-1.00  1.00 NullPunisher.other : If
-3.00  1.00 NullPunisher.entity : Alan
-3.00  1.00 NullPunisher.entity : Frank
-1.00  1.00 NullPunisher.other : go
-3.00  1.00 Structure.argsMismatch : go (verbal compl of the root) not aligned
Hand-tuned score (dot product of above): -25.3504
Threshold: -9.4738


Inference ID: 192

Txt: Bill suggested to Frank's boss that they should go to the meeting together, and Carl to Alan's wife.

Hyp: If it was suggested that Bill and Frank should go together, it was suggested that Carl and Alan's wife should go together. (don't know)

If
IN
it
PRP
was
VBD
suggested
VBN
that
IN
Bill
NNP
Frank
NNP
should
MD
go
VB
together
RB
it
PRP
was
VBD
suggested
VBN
that
IN
Carl
NNP
Alan
NNP
wife
NN
should
MD
go
VB
together
RB
Bill:NNP   0.75   1.84   1.07   1.07   0.75   0.28   0.88   0.96   1.07   1.56   1.84   1.07   1.07   0.75   0.76   0.82   1.09   0.96   1.07   1.56
suggested:VBD   1.15   0.54   0.86   2.02   1.15   1.16   1.16   0.74   0.82   1.05   0.54   0.86   2.02   1.15   1.16   1.16   0.84   0.74   0.82   1.05
Frank:NNP   0.75   1.84   1.07   1.07   0.73   0.88   0.28   0.96   1.07   1.56   1.84   1.07   1.07   0.73   0.79   0.74   1.03   0.96   1.07   1.56
boss:NN   0.50   1.58   0.78   0.75   0.50   1.11   1.04   0.71   0.76   1.31   1.58   0.78   0.75   0.50   1.07   1.07   0.53   0.71   0.76   1.31
that:IN   1.05   1.57   1.21   1.24   1.59   1.57   1.54   1.05   1.24   1.23   1.57   1.21   1.24   1.59   1.57   1.51   1.31   1.05   1.24   1.23
they:PRP   1.05   0.85   1.21   0.97   1.18   1.55   2.76   1.11   0.97   1.01   0.85   1.21   0.97   1.18   1.79   2.04   1.30   1.11   0.97   1.01
should:MD   1.05   1.16   1.25   1.24   1.05   1.25   1.25   2.42   1.25   1.45   1.16   1.25   1.24   1.05   1.25   1.25   1.00   2.42   1.25   1.45
go:VB   1.15   0.54   0.92   0.82   1.15   1.16   1.16   0.76   1.32   1.11   0.54   0.92   0.82   1.15   1.16   1.16   0.78   0.76   1.32   1.11
the:DT   1.05   1.16   1.25   1.25   0.95   1.25   1.25   1.31   1.25   1.43   1.16   1.25   1.25   0.95   1.25   1.25   0.96   1.31   1.25   1.43
meeting:NN   0.50   1.58   0.82   0.41   0.50   1.17   1.14   0.71   0.75   1.26   1.58   0.82   0.41   0.50   1.07   1.07   0.89   0.71   0.75   1.26
together:RB   1.05   1.05   0.91   0.85   1.05   1.47   1.47   1.09   0.91   1.23   1.05   0.91   0.85   1.05   1.47   1.47   1.21   1.09   0.91   1.23
Carl:NNP   0.75   1.84   1.04   1.07   0.75   0.76   0.79   0.96   1.07   1.56   1.84   1.04   1.07   0.75   0.28   0.82   1.07   0.96   1.07   1.56
Alan:NNP   0.75   1.84   1.04   1.07   0.70   0.82   0.74   0.96   1.07   1.56   1.84   1.04   1.07   0.70   0.82   0.28   1.07   0.96   1.07   1.56
wife:NN   0.42   1.58   0.78   0.75   0.50   1.09   1.03   0.71   0.69   1.31   1.58   0.78   0.75   0.50   1.07   1.07   0.28   0.71   0.69   1.31
NO_WORD   0.92   0.67   1.57   0.42   0.60   0.28   0.01   1.55   0.36   0.04   0.67   1.57   0.17   0.60   0.28   0.12   0.01   1.55   0.36   0.04

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.63 Alignment.score
 1.00  0.28 Alignment.isGood
-1.00  0.70 Alignment.isBad
-0.10  19.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.05 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "together" modifying "go"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "boss" modifying "suggested" is dropped on aligned hypothesis word "suggested"
-1.00  1.00 NullPunisher.other : go
-0.10  1.00 NullPunisher.functionWord : that
-1.00  1.00 NullPunisher.other : If
-0.05  1.00 NullPunisher.aux : should
-1.00  1.00 NullPunisher.other : together
-1.00  1.00 NullPunisher.other : it
-3.00  1.00 NullPunisher.entity : Carl
-1.00  1.00 NullPunisher.other : it
-3.00  1.00 NullPunisher.entity : Frank
-1.00  1.00 NullPunisher.other : wife
-3.00  1.00 NullPunisher.entity : Bill
-0.05  1.00 NullPunisher.aux : was
-1.00  1.00 NullPunisher.other : go
-1.00  1.00 NullPunisher.other : together
-0.05  1.00 NullPunisher.aux : was
-0.10  1.00 NullPunisher.functionWord : that
-1.00  1.00 NullPunisher.other : suggested
-3.00  1.00 NullPunisher.entity : Alan
-0.05  1.00 NullPunisher.aux : should
-3.00  1.00 Structure.argsMismatch : go (verbal compl of the root) not aligned
Hand-tuned score (dot product of above): -26.4886
Threshold: -9.4738


Inference ID: 193

Txt: Bill suggested to Frank's boss that they should go to the meeting together, and Carl to Alan's wife.

Hyp: If it was suggested that Bill and Frank's boss should go together, it was suggested that Carl and Alan's wife should go together. (yes)

If
IN
it
PRP
was
VBD
suggested
VBN
that
IN
Bill
NNP
Frank
NNP
boss
NN
should
MD
go
VB
together
RB
it
PRP
was
VBD
suggested
VBN
that
IN
Carl
NNP
Alan
NNP
wife
NN
should
MD
go
VB
together
RB
Bill:NNP   0.75   1.84   1.07   1.07   0.75   0.28   0.88   1.11   0.96   1.07   1.56   1.84   1.07   1.07   0.75   0.76   0.82   1.09   0.96   1.07   1.56
suggested:VBD   1.15   0.54   0.86   2.02   1.15   1.16   1.16   0.83   0.74   0.82   1.05   0.54   0.86   2.02   1.15   1.16   1.16   0.84   0.74   0.82   1.05
Frank:NNP   0.75   1.84   1.07   1.07   0.73   0.88   0.28   1.04   0.96   1.07   1.56   1.84   1.07   1.07   0.73   0.79   0.74   1.03   0.96   1.07   1.56
boss:NN   0.50   1.58   0.78   0.75   0.50   1.11   1.04   0.28   0.71   0.76   1.31   1.58   0.78   0.75   0.50   1.07   1.07   0.53   0.71   0.76   1.31
that:IN   1.05   1.57   1.21   1.24   1.59   1.57   1.54   1.31   1.05   1.24   1.23   1.57   1.21   1.24   1.59   1.57   1.51   1.31   1.05   1.24   1.23
they:PRP   1.05   0.85   1.21   0.97   1.18   1.55   2.76   1.30   1.11   0.97   1.01   0.85   1.21   0.97   1.18   1.79   2.04   1.30   1.11   0.97   1.01
should:MD   1.05   1.16   1.25   1.24   1.05   1.25   1.25   1.00   2.42   1.25   1.45   1.16   1.25   1.24   1.05   1.25   1.25   1.00   2.42   1.25   1.45
go:VB   1.15   0.54   0.92   0.82   1.15   1.16   1.16   0.85   0.76   1.32   1.11   0.54   0.92   0.82   1.15   1.16   1.16   0.78   0.76   1.32   1.11
the:DT   1.05   1.16   1.25   1.25   0.95   1.25   1.25   1.00   1.31   1.25   1.43   1.16   1.25   1.25   0.95   1.25   1.25   0.96   1.31   1.25   1.43
meeting:NN   0.50   1.58   0.82   0.41   0.50   1.17   1.14   0.91   0.71   0.75   1.26   1.58   0.82   0.41   0.50   1.07   1.07   0.89   0.71   0.75   1.26
together:RB   1.05   1.05   0.91   0.85   1.05   1.47   1.47   1.21   1.09   0.91   1.23   1.05   0.91   0.85   1.05   1.47   1.47   1.21   1.09   0.91   1.23
Carl:NNP   0.75   1.84   1.04   1.07   0.75   0.76   0.79   1.07   0.96   1.07   1.56   1.84   1.04   1.07   0.75   0.28   0.82   1.07   0.96   1.07   1.56
Alan:NNP   0.75   1.84   1.04   1.07   0.70   0.82   0.74   1.07   0.96   1.07   1.56   1.84   1.04   1.07   0.70   0.82   0.28   1.07   0.96   1.07   1.56
wife:NN   0.42   1.58   0.78   0.75   0.50   1.09   1.03   0.53   0.71   0.69   1.31   1.58   0.78   0.75   0.50   1.07   1.07   0.28   0.71   0.69   1.31
NO_WORD   0.92   0.67   1.57   0.42   0.60   0.28   0.12   0.01   1.55   0.36   0.04   0.67   1.57   0.17   0.60   0.28   0.12   0.01   1.55   0.36   0.04

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.60 Alignment.score
 1.00  0.27 Alignment.isGood
-1.00  0.70 Alignment.isBad
-0.10  20.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.05 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "together" modifying "go"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "boss" modifying "suggested" is dropped on aligned hypothesis word "suggested"
-1.00  1.00 NullPunisher.other : go
-1.00  1.00 NullPunisher.other : suggested
-3.00  1.00 NullPunisher.entity : Alan
-1.00  1.00 NullPunisher.other : together
-0.05  1.00 NullPunisher.aux : was
-1.00  1.00 NullPunisher.other : it
-0.05  1.00 NullPunisher.aux : was
-0.05  1.00 NullPunisher.aux : should
-0.10  1.00 NullPunisher.functionWord : that
-0.05  1.00 NullPunisher.aux : should
-1.00  1.00 NullPunisher.other : If
-3.00  1.00 NullPunisher.entity : Carl
-1.00  1.00 NullPunisher.other : boss
-1.00  1.00 NullPunisher.other : wife
-1.00  1.00 NullPunisher.other : it
-3.00  1.00 NullPunisher.entity : Bill
-1.00  1.00 NullPunisher.other : together
-0.10  1.00 NullPunisher.functionWord : that
-1.00  1.00 NullPunisher.other : go
-3.00  1.00 NullPunisher.entity : Frank
-3.00  1.00 Structure.argsMismatch : go (verbal compl of the root) not aligned
Hand-tuned score (dot product of above): -27.6230
Threshold: -9.4738


Inference ID: 194

Txt: Bill suggested to Frank's boss that they should go to the meeting together, and Carl to Alan's wife.

Hyp: If it was suggested that Bill and Frank's boss should go together, it was suggested that Carl and Alan should go together. (don't know)

If
IN
it
PRP
was
VBD
suggested
VBN
that
IN
Bill
NNP
Frank
NNP
boss
NN
should
MD
go
VB
together
RB
it
PRP
was
VBD
suggested
VBN
that
IN
Carl
NNP
Alan
NNP
should
MD
go
VB
together
RB
Bill:NNP   0.75   1.84   1.07   1.07   0.75   0.28   0.88   1.11   0.96   1.07   1.56   1.84   1.07   1.07   0.75   0.76   0.82   0.96   1.07   1.56
suggested:VBD   1.15   0.54   0.86   2.02   1.15   1.16   1.16   0.83   0.74   0.82   1.05   0.54   0.86   2.02   1.15   1.16   1.16   0.74   0.82   1.05
Frank:NNP   0.75   1.84   1.07   1.07   0.73   0.88   0.28   1.04   0.96   1.07   1.56   1.84   1.07   1.07   0.73   0.79   0.74   0.96   1.07   1.56
boss:NN   0.50   1.58   0.78   0.75   0.50   1.11   1.04   0.28   0.71   0.76   1.31   1.58   0.78   0.75   0.50   1.07   1.07   0.71   0.76   1.31
that:IN   1.05   1.57   1.21   1.24   1.59   1.57   1.54   1.31   1.05   1.24   1.23   1.57   1.21   1.24   1.59   1.57   1.51   1.05   1.24   1.23
they:PRP   1.05   0.85   1.21   0.97   1.18   1.55   2.76   1.30   1.11   0.97   1.01   0.85   1.21   0.97   1.18   1.79   2.04   1.11   0.97   1.01
should:MD   1.05   1.16   1.25   1.24   1.05   1.25   1.25   1.00   2.42   1.25   1.45   1.16   1.25   1.24   1.05   1.25   1.25   2.42   1.25   1.45
go:VB   1.15   0.54   0.92   0.82   1.15   1.16   1.16   0.85   0.76   1.32   1.11   0.54   0.92   0.82   1.15   1.16   1.16   0.76   1.32   1.11
the:DT   1.05   1.16   1.25   1.25   0.95   1.25   1.25   1.00   1.31   1.25   1.43   1.16   1.25   1.25   0.95   1.25   1.25   1.31   1.25   1.43
meeting:NN   0.50   1.58   0.82   0.41   0.50   1.17   1.14   0.91   0.71   0.75   1.26   1.58   0.82   0.41   0.50   1.07   1.07   0.71   0.75   1.26
together:RB   1.05   1.05   0.91   0.85   1.05   1.47   1.47   1.21   1.09   0.91   1.23   1.05   0.91   0.85   1.05   1.47   1.47   1.09   0.91   1.23
Carl:NNP   0.75   1.84   1.04   1.07   0.75   0.76   0.79   1.07   0.96   1.07   1.56   1.84   1.04   1.07   0.75   0.28   0.82   0.96   1.07   1.56
Alan:NNP   0.75   1.84   1.04   1.07   0.70   0.82   0.74   1.07   0.96   1.07   1.56   1.84   1.04   1.07   0.70   0.82   0.28   0.96   1.07   1.56
wife:NN   0.42   1.58   0.78   0.75   0.50   1.09   1.03   0.53   0.71   0.69   1.31   1.58   0.78   0.75   0.50   1.07   1.07   0.71   0.69   1.31
NO_WORD   0.92   0.67   1.57   0.42   0.60   0.28   0.12   0.01   1.55   0.36   0.04   0.67   1.57   0.17   0.60   0.28   0.01   1.55   0.36   0.04

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.63 Alignment.score
 1.00  0.28 Alignment.isGood
-1.00  0.70 Alignment.isBad
-0.10  19.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.05 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "together" modifying "go"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "boss" modifying "suggested" is dropped on aligned hypothesis word "suggested"
-1.00  1.00 NullPunisher.other : go
-0.05  1.00 NullPunisher.aux : was
-1.00  1.00 NullPunisher.other : suggested
-0.05  1.00 NullPunisher.aux : should
-3.00  1.00 NullPunisher.entity : Frank
-0.10  1.00 NullPunisher.functionWord : that
-0.05  1.00 NullPunisher.aux : was
-1.00  1.00 NullPunisher.other : it
-3.00  1.00 NullPunisher.entity : Alan
-3.00  1.00 NullPunisher.entity : Carl
-1.00  1.00 NullPunisher.other : together
-1.00  1.00 NullPunisher.other : together
-1.00  1.00 NullPunisher.other : go
-0.05  1.00 NullPunisher.aux : should
-1.00  1.00 NullPunisher.other : boss
-1.00  1.00 NullPunisher.other : If
-0.10  1.00 NullPunisher.functionWord : that
-3.00  1.00 NullPunisher.entity : Bill
-1.00  1.00 NullPunisher.other : it
-3.00  1.00 Structure.argsMismatch : go (verbal compl of the root) not aligned
Hand-tuned score (dot product of above): -26.4886
Threshold: -9.4738


Inference ID: 195

Txt: Bill suggested to Frank's boss that they should go to the meeting together, and Carl to Alan's wife.

Hyp: If it was suggested that Bill, Frank and Frank's boss should go together, it was suggested that Carl, Alan and Alan's wife should go together. (yes)

If
IN
it
PRP
was
VBD
suggested
VBN
that
IN
Bill
NNP
Frank
NNP
Frank
NNP
boss
NN
should
MD
go
VB
together
RB
it
PRP
was
VBD
suggested
VBN
that
IN
Carl
NNP
Alan
NNP
Alan
NNP
wife
NN
should
MD
go
VB
together
RB
Bill:NNP   0.75   1.84   1.07   1.07   0.75   0.28   0.88   0.88   1.11   0.96   1.07   1.56   1.84   1.07   1.07   0.75   0.76   0.82   0.82   1.09   0.96   1.07   1.56
suggested:VBD   1.15   0.54   0.86   2.02   1.15   1.16   1.16   1.16   0.83   0.74   0.82   1.05   0.54   0.86   2.02   1.15   1.16   1.16   1.16   0.84   0.74   0.82   1.05
Frank:NNP   0.75   1.84   1.07   1.07   0.73   0.88   0.28   0.28   1.04   0.96   1.07   1.56   1.84   1.07   1.07   0.73   0.79   0.74   0.74   1.03   0.96   1.07   1.56
boss:NN   0.50   1.58   0.78   0.75   0.50   1.11   1.04   1.04   0.28   0.71   0.76   1.31   1.58   0.78   0.75   0.50   1.07   1.07   1.07   0.53   0.71   0.76   1.31
that:IN   1.05   1.57   1.21   1.24   1.59   1.57   1.54   1.54   1.31   1.05   1.24   1.23   1.57   1.21   1.24   1.59   1.57   1.51   1.51   1.31   1.05   1.24   1.23
they:PRP   1.05   0.85   1.21   0.97   1.18   1.55   2.76   2.76   1.30   1.11   0.97   1.01   0.85   1.21   0.97   1.18   1.79   2.04   2.04   1.30   1.11   0.97   1.01
should:MD   1.05   1.16   1.25   1.24   1.05   1.25   1.25   1.25   1.00   2.42   1.25   1.45   1.16   1.25   1.24   1.05   1.25   1.25   1.25   1.00   2.42   1.25   1.45
go:VB   1.15   0.54   0.92   0.82   1.15   1.16   1.16   1.16   0.85   0.76   1.32   1.11   0.54   0.92   0.82   1.15   1.16   1.16   1.16   0.78   0.76   1.32   1.11
the:DT   1.05   1.16   1.25   1.25   0.95   1.25   1.25   1.25   1.00   1.31   1.25   1.43   1.16   1.25   1.25   0.95   1.25   1.25   1.25   0.96   1.31   1.25   1.43
meeting:NN   0.50   1.58   0.82   0.41   0.50   1.17   1.14   1.14   0.91   0.71   0.75   1.26   1.58   0.82   0.41   0.50   1.07   1.07   1.07   0.89   0.71   0.75   1.26
together:RB   1.05   1.05   0.91   0.85   1.05   1.47   1.47   1.47   1.21   1.09   0.91   1.23   1.05   0.91   0.85   1.05   1.47   1.47   1.47   1.21   1.09   0.91   1.23
Carl:NNP   0.75   1.84   1.04   1.07   0.75   0.76   0.79   0.79   1.07   0.96   1.07   1.56   1.84   1.04   1.07   0.75   0.28   0.82   0.82   1.07   0.96   1.07   1.56
Alan:NNP   0.75   1.84   1.04   1.07   0.70   0.82   0.74   0.74   1.07   0.96   1.07   1.56   1.84   1.04   1.07   0.70   0.82   0.28   0.28   1.07   0.96   1.07   1.56
wife:NN   0.42   1.58   0.78   0.75   0.50   1.09   1.03   1.03   0.53   0.71   0.69   1.31   1.58   0.78   0.75   0.50   1.07   1.07   1.07   0.28   0.71   0.69   1.31
NO_WORD   0.92   0.67   1.57   0.42   0.60   0.05   0.28   0.12   0.01   1.55   0.36   0.04   0.67   1.57   0.17   0.60   0.05   0.28   0.12   0.01   1.55   0.36   0.04

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.55 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.71 Alignment.isBad
-0.10  22.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.04 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "together" modifying "go"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "boss" modifying "suggested" is dropped on aligned hypothesis word "suggested"
-3.00  1.00 NullPunisher.entity : Bill
-3.00  1.00 NullPunisher.entity : Alan
-1.00  1.00 NullPunisher.other : together
-1.00  1.00 NullPunisher.other : together
-1.00  1.00 NullPunisher.other : it
-0.10  1.00 NullPunisher.functionWord : that
-0.05  1.00 NullPunisher.aux : was
-0.10  1.00 NullPunisher.functionWord : that
-1.00  1.00 NullPunisher.other : boss
-3.00  1.00 NullPunisher.entity : Alan
-1.00  1.00 NullPunisher.other : wife
-0.05  1.00 NullPunisher.aux : should
-0.05  1.00 NullPunisher.aux : should
-3.00  1.00 NullPunisher.entity : Frank
-1.00  1.00 NullPunisher.other : it
-1.00  1.00 NullPunisher.other : suggested
-3.00  1.00 NullPunisher.entity : Carl
-0.05  1.00 NullPunisher.aux : was
-3.00  1.00 NullPunisher.entity : Frank
-1.00  1.00 NullPunisher.other : go
-1.00  1.00 NullPunisher.other : go
-1.00  1.00 NullPunisher.other : If
-3.00  1.00 Structure.argsMismatch : go (verbal compl of the root) not aligned
Hand-tuned score (dot product of above): -33.9034
Threshold: -9.4738


Inference ID: 196

Txt: A lawyer signed every report, and so did an auditor. That is, there was one lawyer who signed all the reports.

Hyp: There was one auditor who signed all the reports. (yes)

There
EX
was
VBD
one
CD
auditor
NN
who
WP
signed
VBD
all
PDT
the
DT
reports
NNS
A:DT   1.31   1.25   1.18   1.00   1.16   1.25   1.31   1.31   1.00
lawyer:NN   0.69   0.82   1.29   0.65   1.58   0.60   0.71   0.71   0.92
signed:VBD   0.74   0.78   1.37   0.91   0.54   1.32   0.76   0.76   0.91
every:DT   1.22   1.25   1.18   1.00   1.16   1.25   1.31   1.31   0.96
report:NN   0.69   0.82   1.29   0.57   1.58   0.82   0.71   0.71   1.47
so:RB   1.09   0.91   1.36   1.21   1.00   0.91   1.09   1.09   1.21
did:VBD   0.76   0.87   1.39   0.70   0.54   0.63   0.76   0.76   0.85
an:DT   1.31   1.21   1.14   1.00   1.16   1.25   1.26   1.31   1.00
auditor:NN   0.71   0.82   1.29   0.28   1.58   0.82   0.71   0.71   0.67
That:DT   1.23   1.22   1.18   1.00   1.13   1.25   1.31   1.21   1.00
is:VBZ   0.76   0.05   1.39   0.91   0.54   0.78   0.76   0.76   0.91
there:EX   0.37   1.25   1.18   1.00   1.16   1.23   1.31   1.20   1.00
was:VBD   0.76   1.32   1.39   0.91   0.47   0.78   0.76   0.76   0.91
one:CD   0.89   1.30   0.85   1.33   1.32   1.28   0.89   0.82   1.33
lawyer:NN   0.69   0.82   1.29   0.65   1.58   0.60   0.71   0.71   0.92
who:WP   1.11   0.90   1.46   1.30   1.05   0.97   1.11   1.04   1.30
signed:VBD   0.74   0.78   1.37   0.91   0.54   1.32   0.76   0.76   0.91
all:PDT   1.31   1.25   1.18   1.00   1.16   1.25   2.42   1.31   1.00
the:DT   1.20   1.25   1.11   1.00   1.09   1.25   1.31   2.42   1.00
reports:NNS   0.71   0.82   1.29   0.67   1.58   0.82   0.71   0.71   0.28
NO_WORD   0.29   0.17   0.52   0.28   0.92   0.25   0.41   0.82   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.40 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.22 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "signed" modifying "auditor"
-0.05  1.00 NullPunisher.aux : was
-0.10  1.00 NullPunisher.functionWord : who
-3.00  1.00 NullPunisher.entity : one
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : all
-1.00  1.00 NullPunisher.other : signed
-1.00  1.00 NullPunisher.other : auditor
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '1.0' vs ''
 1.00  1.00 Quant.equivalent : Replacing the quantifier "every" by an equivalent quantifier "all" preserves truth.
-2.00  1.00 RootEntailment.unalignedRoot : "was" not aligned to anything
Hand-tuned score (dot product of above): -14.9360
Threshold: -9.4738


Inference ID: 197

Txt: John has a genuine diamond.

Hyp: John has a diamond. (yes)

John
NNP
has
VBZ
a
DT
diamond
NN
John:NNP   0.28   1.07   0.96   1.05
has:VBZ   1.16   1.32   0.76   0.91
a:DT   1.25   1.25   2.42   1.00
genuine:JJ   1.13   0.96   0.76   0.84
diamond:NN   1.05   0.82   0.71   0.28
NO_WORD   0.28   0.17   0.82   0.25

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.24 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : John
-1.00  1.00 NullPunisher.other : diamond
-0.05  1.00 NullPunisher.aux : has
-0.10  1.00 NullPunisher.article : a
-2.00  1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -6.8749
Threshold: -9.4738


Inference ID: 198

Txt: John is a former university student.

Hyp: John is a university student. (don't know)

John
NNP
is
VBZ
a
DT
university_student
NN
John:NNP   0.28   1.07   0.96   1.07
is:VBZ   1.16   1.32   0.76   0.91
a:DT   1.25   1.25   2.42   1.00
former:JJ   1.13   0.96   0.76   0.88
university_student:NN   1.07   0.82   0.71   0.28
NO_WORD   0.28   1.43   0.82   0.12

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.52 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : university_student
-3.00  1.00 NullPunisher.entity : John
-0.05  1.00 NullPunisher.aux : is
-0.10  1.00 NullPunisher.article : a
-2.00  1.00 RootEntailment.unalignedRoot : "university_student" not aligned to anything
Hand-tuned score (dot product of above): -6.4901
Threshold: -9.4738


Inference ID: 199

Txt: John is a successful former university student.

Hyp: John is successful. (yes)

John
NNP
is
VBZ
successful
JJ
John:NNP   0.28   1.07   1.24
is:VBZ   1.16   1.32   1.02
a:DT   1.25   1.25   1.18
successful:JJ   1.13   0.96   0.74
former:JJ   1.13   0.96   0.93
university_student:NN   1.07   0.82   0.89
NO_WORD   0.28   1.43   0.16

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.44 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-0.05  1.00 NullPunisher.aux : is
-1.00  1.00 NullPunisher.other : successful
-3.00  1.00 NullPunisher.entity : John
-2.00  1.00 RootEntailment.unalignedRoot : "successful" not aligned to anything
Hand-tuned score (dot product of above): -6.4057
Threshold: -9.4738


Inference ID: 200

Txt: John is a former successful university student.

Hyp: John is successful. (don't know)

John
NNP
is
VBZ
successful
JJ
John:NNP   0.28   1.07   1.24
is:VBZ   1.16   1.32   1.02
a:DT   1.25   1.25   1.18
former:JJ   1.13   0.96   0.93
successful:JJ   1.13   0.96   0.74
university_student:NN   1.07   0.82   0.89
NO_WORD   0.28   1.43   0.16

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.44 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : John
-1.00  1.00 NullPunisher.other : successful
-0.05  1.00 NullPunisher.aux : is
-2.00  1.00 RootEntailment.unalignedRoot : "successful" not aligned to anything
Hand-tuned score (dot product of above): -6.4057
Threshold: -9.4738


Inference ID: 201

Txt: John is a former successful university student.

Hyp: John is a university student. (don't know)

John
NNP
is
VBZ
a
DT
university_student
NN
John:NNP   0.28   1.07   0.96   1.07
is:VBZ   1.16   1.32   0.76   0.91
a:DT   1.25   1.25   2.42   1.00
former:JJ   1.13   0.96   0.76   0.88
successful:JJ   1.13   0.96   0.76   0.78
university_student:NN   1.07   0.82   0.71   0.28
NO_WORD   0.28   1.43   0.82   0.12

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.52 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : university_student
-0.05  1.00 NullPunisher.aux : is
-3.00  1.00 NullPunisher.entity : John
-0.10  1.00 NullPunisher.article : a
-2.00  1.00 RootEntailment.unalignedRoot : "university_student" not aligned to anything
Hand-tuned score (dot product of above): -6.4901
Threshold: -9.4738


Inference ID: 202

Txt: Every mammal is an animal.

Hyp: Every four-legged mammal is a four-legged animal. (yes)

Every
DT
four-legged
JJ
mammal
NN
is
VBZ
a
DT
four-legged
JJ
animal
NN
Every:DT   2.42   1.18   1.00   1.25   1.31   1.18   1.00
mammal:NN   0.71   0.99   0.28   0.82   0.71   0.99   0.13
is:VBZ   0.76   1.02   0.91   1.32   0.76   1.02   0.91
an:DT   1.31   1.18   1.00   1.25   1.21   1.18   1.00
animal:NN   0.71   0.99   0.19   0.82   0.71   0.99   0.28
NO_WORD   0.82   0.11   0.28   1.43   0.82   0.11   0.12

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.46 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.14 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "four-legged" modifying "animal"
-1.00  1.00 Hypernym.posNarrow : Narrowing a term (from "animal" to "mammal") does NOT preserve truth in a positive context
-1.00  1.00 NullPunisher.other : four-legged
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : Every
-1.00  1.00 NullPunisher.other : four-legged
-0.05  1.00 NullPunisher.aux : is
-1.00  1.00 NullPunisher.other : animal
-7.00  1.00 Quant.expand : Invalid quantifier strengthening: replacing the quantifier "a" by a stronger quantifier "every" does NOT preserve truth.
-2.00  1.00 RootEntailment.unalignedRoot : "animal" not aligned to anything
Hand-tuned score (dot product of above): -15.6622
Threshold: -9.4738


Inference ID: 203

Txt: Dumbo is a four-legged animal.

Hyp: Dumbo is four-legged. (yes)

Dumbo
NNP
is
VBZ
four-legged
JJ
Dumbo:NNP   0.28   1.07   1.24
is:VBZ   1.16   1.32   1.02
a:DT   1.25   1.25   1.18
four-legged:JJ   1.13   0.96   0.74
animal:NN   1.05   0.82   0.99
NO_WORD   0.28   1.43   0.16

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.44 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : four-legged
-0.05  1.00 NullPunisher.aux : is
-3.00  1.00 NullPunisher.entity : Dumbo
-2.00  1.00 RootEntailment.unalignedRoot : "four-legged" not aligned to anything
Hand-tuned score (dot product of above): -6.4057
Threshold: -9.4738


Inference ID: 204

Txt: Mickey is a small animal.

Hyp: Mickey is a large animal. (don't know)

Mickey
NNP
is
VBZ
a
DT
large
JJ
animal
NN
Mickey:NNP   0.28   1.07   0.96   1.22   1.05
is:VBZ   1.16   1.32   0.76   1.02   0.91
a:DT   1.25   1.25   2.42   1.18   1.00
small:JJ   1.13   0.96   0.76   1.28   0.72
animal:NN   1.05   0.82   0.71   0.94   0.28
NO_WORD   0.28   1.43   0.82   0.11   0.12

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.44 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "large" modifying "animal"
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : animal
-0.05  1.00 NullPunisher.aux : is
-1.00  1.00 NullPunisher.other : large
-3.00  1.00 NullPunisher.entity : Mickey
-2.00  1.00 RootEntailment.unalignedRoot : "animal" not aligned to anything
Hand-tuned score (dot product of above): -8.7032
Threshold: -9.4738


Inference ID: 205

Txt: Dumbo is a large animal.

Hyp: Dumbo is a small animal. (don't know)

Dumbo
NNP
is
VBZ
a
DT
small
JJ
animal
NN
Dumbo:NNP   0.28   1.07   0.96   1.24   1.05
is:VBZ   1.16   1.32   0.76   1.02   0.91
a:DT   1.25   1.25   2.42   1.18   1.00
large:JJ   1.13   0.96   0.76   1.28   0.83
animal:NN   1.05   0.82   0.71   0.83   0.28
NO_WORD   0.28   1.43   0.82   0.11   0.12

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.44 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "small" modifying "animal"
-1.00  1.00 NullPunisher.other : small
-3.00  1.00 NullPunisher.entity : Dumbo
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : animal
-0.05  1.00 NullPunisher.aux : is
-2.00  1.00 RootEntailment.unalignedRoot : "animal" not aligned to anything
Hand-tuned score (dot product of above): -8.7032
Threshold: -9.4738


Inference ID: 206

Txt: Fido is not a small animal.

Hyp: Fido is a large animal. (don't know)

Fido
NNP
is
VBZ
a
DT
large
JJ
animal
NN
Fido:NNP   0.28   0.82   0.71   0.99   0.82
is:VBZ   0.91   1.32   0.76   1.02   0.91
not:RB   1.21   0.91   1.09   1.05   1.21
a:DT   1.00   1.25   2.42   1.18   1.00
small:JJ   0.88   0.96   0.76   1.28   0.72
animal:NN   0.82   0.82   0.71   0.94   0.28
NO_WORD   0.28   1.43   0.82   0.11   0.12

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.44 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "large" modifying "animal"
-1.00  1.00 NullPunisher.other : animal
-1.00  1.00 NullPunisher.other : large
-1.00  1.00 NullPunisher.other : Fido
-0.05  1.00 NullPunisher.aux : is
-0.10  1.00 NullPunisher.article : a
-2.00  1.00 RootEntailment.unalignedRoot : "animal" not aligned to anything
Hand-tuned score (dot product of above): -6.7032
Threshold: -9.4738


Inference ID: 207

Txt: Fido is not a large animal.

Hyp: Fido is a small animal. (don't know)

Fido
NNP
is
VBZ
a
DT
small
JJ
animal
NN
Fido:NNP   0.28   0.82   0.71   0.99   0.82
is:VBZ   0.91   1.32   0.76   1.02   0.91
not:RB   1.21   0.91   1.09   1.05   1.21
a:DT   1.00   1.25   2.42   1.18   1.00
large:JJ   0.88   0.96   0.76   1.28   0.83
animal:NN   0.82   0.82   0.71   0.83   0.28
NO_WORD   0.28   1.43   0.82   0.11   0.12

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.44 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "small" modifying "animal"
-0.05  1.00 NullPunisher.aux : is
-1.00  1.00 NullPunisher.other : small
-1.00  1.00 NullPunisher.other : animal
-1.00  1.00 NullPunisher.other : Fido
-0.10  1.00 NullPunisher.article : a
-2.00  1.00 RootEntailment.unalignedRoot : "animal" not aligned to anything
Hand-tuned score (dot product of above): -6.7032
Threshold: -9.4738


Inference ID: 208

Txt: Mickey is a small animal. Dumbo is a large animal.

Hyp: Mickey is smaller than Dumbo. (yes)

Mickey
NNP
is
VBZ
smaller
JJR
Dumbo
NNP
Mickey:NNP   0.28   1.07   1.19   0.95
is:VBZ   1.16   1.32   1.02   1.16
a:DT   1.25   1.25   1.18   1.25
small:JJ   1.13   0.96   1.81   1.13
animal:NN   1.05   0.82   0.96   1.05
Dumbo:NNP   0.95   1.07   1.24   0.28
is:VBZ   1.16   1.32   1.02   1.16
a:DT   1.25   1.25   1.18   1.25
large:JJ   1.11   0.96   0.29   1.13
animal:NN   1.05   0.82   0.96   1.05
NO_WORD   0.28   1.43   0.16   0.07

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.81 Alignment.score
 1.00  0.32 Alignment.isGood
-1.00  0.66 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Dumbo" modifying "smaller"
-0.05  1.00 NullPunisher.aux : is
-3.00  1.00 NullPunisher.entity : Dumbo
Hand-tuned score (dot product of above): -3.6529
Threshold: -9.4738


Inference ID: 209

Txt: Mickey is a small animal. Dumbo is a large animal.

Hyp: Mickey is larger than Dumbo. (don't know)

Mickey
NNP
is
VBZ
larger
JJR
Dumbo
NNP
Mickey:NNP   0.28   1.07   1.21   0.95
is:VBZ   1.16   1.32   1.02   1.16
a:DT   1.25   1.25   1.18   1.25
small:JJ   1.13   0.96   0.41   1.13
animal:NN   1.05   0.82   0.84   1.05
Dumbo:NNP   0.95   1.07   1.24   0.28
is:VBZ   1.16   1.32   1.02   1.16
a:DT   1.25   1.25   1.18   1.25
large:JJ   1.11   0.96   2.04   1.13
animal:NN   1.05   0.82   0.84   1.05
NO_WORD   0.28   1.43   0.16   0.07

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.78 Alignment.score
 1.00  0.31 Alignment.isGood
-1.00  0.66 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Dumbo" modifying "larger"
-3.00  1.00 NullPunisher.entity : Mickey
-3.00  1.00 NullPunisher.entity : Dumbo
-0.05  1.00 NullPunisher.aux : is
Hand-tuned score (dot product of above): -6.8002
Threshold: -9.4738


Inference ID: 210

Txt: All mice are small animals. Mickey is a large mouse.

Hyp: Mickey is a large animal. (don't know)

Mickey
NNP
is
VBZ
a
DT
large
JJ
animal
NN
All:DT   1.25   1.25   1.31   1.18   0.97
mice:NNS   0.86   0.82   0.71   0.94   0.13
are:VBP   1.16   0.02   0.76   0.91   0.91
small:JJ   1.13   0.96   0.76   1.28   0.72
animals:NNS   1.03   0.82   0.71   0.90   1.51
Mickey:NNP   0.28   1.07   0.96   1.22   1.05
is:VBZ   1.16   1.32   0.76   1.02   0.91
a:DT   1.25   1.25   2.42   1.18   1.00
large:JJ   1.11   0.96   0.76   0.74   0.83
mouse:NN   0.94   0.82   0.71   0.94   0.46
NO_WORD   0.28   1.43   0.82   0.11   0.12

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.74 Alignment.score
 1.00  0.30 Alignment.isGood
-1.00  0.67 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.20 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "large" modifying "animal"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "small" modifying "animals" is dropped on aligned hypothesis word "animal"
-1.00  1.00 NullPunisher.other : large
-0.05  1.00 NullPunisher.aux : is
-0.10  1.00 NullPunisher.article : a
Hand-tuned score (dot product of above): -1.4690
Threshold: -9.4738


Inference ID: 211

Txt: All elephants are large animals. Dumbo is a small elephant.

Hyp: Dumbo is a small animal. (don't know)

Dumbo
NNP
is
VBZ
a
DT
small
JJ
animal
NN
All:DT   1.25   1.25   1.31   1.07   0.97
elephants:NNS   1.07   0.82   0.71   0.70   0.32
are:VBP   1.16   0.02   0.76   1.02   0.91
large:JJ   1.13   0.96   0.76   1.28   0.83
animals:NNS   1.07   0.82   0.71   0.86   1.51
Dumbo:NNP   0.28   1.07   0.96   1.24   1.05
is:VBZ   1.16   1.32   0.76   1.02   0.91
a:DT   1.25   1.25   2.42   1.18   1.00
small:JJ   1.13   0.96   0.76   0.74   0.72
elephant:NN   1.07   0.82   0.71   0.86   0.39
NO_WORD   0.28   1.43   0.82   0.11   0.12

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.72 Alignment.score
 1.00  0.30 Alignment.isGood
-1.00  0.68 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.20 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "small" modifying "animal"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "large" modifying "animals" is dropped on aligned hypothesis word "animal"
-0.05  1.00 NullPunisher.aux : is
-1.00  1.00 NullPunisher.other : small
-0.10  1.00 NullPunisher.article : a
-3.00  1.00 NullPunisher.entity : Dumbo
Hand-tuned score (dot product of above): -4.5939
Threshold: -9.4738


Inference ID: 212

Txt: All mice are small animals. All elephants are large animals. Mickey is a large mouse. Dumbo is a small elephant.

Hyp: Dumbo is larger than Mickey. (yes)

Dumbo
NNP
is
VBZ
larger
JJR
Mickey
NNP
All:DT   1.25   1.25   1.18   1.25
mice:NNS   1.07   0.82   0.95   0.86
are:VBP   1.16   0.02   0.95   1.16
small:JJ   1.13   0.96   0.41   1.13
animals:NNS   1.07   0.82   0.78   1.03
All:DT   1.25   1.25   1.18   1.25
elephants:NNS   1.07   0.82   0.93   0.99
are:VBP   1.16   0.02   0.95   1.16
large:JJ   1.13   0.96   2.04   1.11
animals:NNS   1.07   0.82   0.78   1.03
Mickey:NNP   0.95   1.07   1.21   0.28
is:VBZ   1.16   1.32   1.02   1.16
a:DT   1.25   1.25   1.18   1.25
large:JJ   1.13   0.96   2.04   1.11
mouse:NN   1.07   0.82   0.97   0.94
Dumbo:NNP   0.28   1.07   1.24   0.95
is:VBZ   1.16   1.32   1.02   1.16
a:DT   1.25   1.25   1.18   1.25
small:JJ   1.13   0.96   0.41   1.13
elephant:NN   1.07   0.82   0.93   0.99
NO_WORD   0.28   1.43   0.16   0.07

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.78 Alignment.score
 1.00  0.31 Alignment.isGood
-1.00  0.66 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Mickey" modifying "larger"
-0.05  1.00 NullPunisher.aux : is
-3.00  1.00 NullPunisher.entity : Mickey
-3.00  1.00 NullPunisher.entity : Dumbo
Hand-tuned score (dot product of above): -6.8002
Threshold: -9.4738


Inference ID: 214

Txt: All legal authorities are law lecturers. All law lecturers are legal authorities.

Hyp: All fat legal authorities are fat law lecturers. (yes)

All
DT
fat
JJ
legal
JJ
authorities
NNS
are
VBP
fat
JJ
law
NN
lecturers
NNS
All:DT   2.42   1.18   1.18   1.00   1.18   1.18   1.00   1.00
legal:JJ   0.76   0.93   0.74   0.78   0.96   0.93   0.23   0.88
authorities:NNS   0.71   0.99   0.89   0.28   0.82   0.99   0.69   0.89
are:VBP   0.68   1.02   1.02   0.91   1.32   1.02   0.91   0.91
law:NN   0.71   0.88   0.34   0.63   0.82   0.88   0.28   0.75
lecturers:NNS   0.71   0.96   0.99   0.89   0.82   0.96   0.75   0.28
All:DT   2.42   1.18   1.18   1.00   1.18   1.18   1.00   1.00
law:NN   0.71   0.88   0.34   0.63   0.82   0.88   0.28   0.75
lecturers:NNS   0.71   0.96   0.99   0.89   0.82   0.96   0.75   0.28
are:VBP   0.68   1.02   1.02   0.91   1.32   1.02   0.91   0.91
legal:JJ   0.76   0.93   0.74   0.78   0.96   0.93   0.23   0.88
authorities:NNS   0.71   0.99   0.89   0.28   0.82   0.99   0.69   0.89
NO_WORD   0.82   0.11   0.11   0.28   1.43   0.11   0.05   0.12

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.30 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  8.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "law" modifying "lecturers"
-0.05  1.00 NullPunisher.aux : are
-1.00  1.00 NullPunisher.other : fat
-1.00  1.00 NullPunisher.other : All
-1.00  1.00 NullPunisher.other : lecturers
-1.00  1.00 NullPunisher.other : authorities
-1.00  1.00 NullPunisher.other : fat
-1.00  1.00 NullPunisher.other : law
-1.00  1.00 NullPunisher.other : legal
-2.00  1.00 RootEntailment.unalignedRoot : "lecturers" not aligned to anything
Hand-tuned score (dot product of above): -11.0990
Threshold: -9.4738


Inference ID: 215

Txt: All legal authorities are law lecturers. All law lecturers are legal authorities.

Hyp: All competent legal authorities are competent law lecturers. (don't know)

All
DT
competent
JJ
legal
JJ
authorities
NNS
are
VBP
competent
JJ
law
NN
lecturers
NNS
All:DT   2.42   1.18   1.18   1.00   1.18   1.18   1.00   1.00
legal:JJ   0.76   0.45   0.74   0.78   0.96   0.45   0.23   0.88
authorities:NNS   0.71   0.98   0.89   0.28   0.82   0.98   0.69   0.89
are:VBP   0.68   1.02   1.02   0.91   1.32   1.02   0.91   0.91
law:NN   0.71   0.76   0.34   0.63   0.82   0.76   0.28   0.75
lecturers:NNS   0.71   0.96   0.99   0.89   0.82   0.96   0.75   0.28
All:DT   2.42   1.18   1.18   1.00   1.18   1.18   1.00   1.00
law:NN   0.71   0.76   0.34   0.63   0.82   0.76   0.28   0.75
lecturers:NNS   0.71   0.96   0.99   0.89   0.82   0.96   0.75   0.28
are:VBP   0.68   1.02   1.02   0.91   1.32   1.02   0.91   0.91
legal:JJ   0.76   0.45   0.74   0.78   0.96   0.45   0.23   0.88
authorities:NNS   0.71   0.98   0.89   0.28   0.82   0.98   0.69   0.89
NO_WORD   0.82   0.11   0.11   0.28   1.43   0.11   0.05   0.12

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.30 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  8.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "law" modifying "lecturers"
-1.00  1.00 NullPunisher.other : law
-1.00  1.00 NullPunisher.other : All
-1.00  1.00 NullPunisher.other : competent
-1.00  1.00 NullPunisher.other : competent
-1.00  1.00 NullPunisher.other : legal
-1.00  1.00 NullPunisher.other : authorities
-0.05  1.00 NullPunisher.aux : are
-1.00  1.00 NullPunisher.other : lecturers
-2.00  1.00 RootEntailment.unalignedRoot : "lecturers" not aligned to anything
Hand-tuned score (dot product of above): -11.0990
Threshold: -9.4738


Inference ID: 216

Txt: John is a fatter politician than Bill.

Hyp: John is fatter than Bill. (yes)

John
NNP
is
VBZ
fatter
VBN
Bill
NNP
John:NNP   0.28   1.07   1.07   1.18
is:VBZ   1.16   1.32   0.56   0.91
a:DT   1.25   1.25   1.25   1.00
fatter:JJR   1.13   0.96   2.00   0.88
politician:NN   1.12   0.82   0.82   0.93
Bill:NNP   1.18   0.82   0.82   0.28
NO_WORD   0.15   1.57   0.17   0.07

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.84 Alignment.score
 1.00  0.32 Alignment.isGood
-1.00  0.65 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Bill" modifying "fatter"
-0.05  1.00 NullPunisher.aux : is
-3.00  1.00 NullPunisher.entity : John
-1.00  1.00 NullPunisher.other : Bill
Hand-tuned score (dot product of above): -4.7183
Threshold: -9.4738


Inference ID: 217

Txt: John is a cleverer politician than Bill.

Hyp: John is cleverer than Bill. (don't know)

John
NNP
is
VBZ
cleverer
VB
Bill
NNP
John:NNP   0.28   1.07   1.07   1.18
is:VBZ   1.16   1.32   0.56   0.91
a:DT   1.25   1.25   1.25   1.00
cleverer:NN   1.07   0.82   2.71   0.82
politician:NN   1.12   0.82   0.76   0.93
Bill:NNP   1.18   0.82   0.82   0.28
NO_WORD   0.28   0.17   0.36   0.07

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.63 Alignment.score
 1.00  0.28 Alignment.isGood
-1.00  0.70 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Bill" modifying "cleverer"
-3.00  1.00 NullPunisher.entity : John
-1.00  1.00 NullPunisher.other : Bill
-0.05  1.00 NullPunisher.aux : is
-2.00  1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -7.0099
Threshold: -9.4738


Inference ID: 218

Txt: Kim is a clever person.

Hyp: Kim is clever. (yes)

Kim
NNP
is
VBZ
clever
JJ
Kim:NNP   0.28   1.03   1.24
is:VBZ   1.12   1.32   1.02
a:DT   1.25   1.25   1.18
clever:JJ   1.13   0.96   0.74
person:NN   1.07   0.82   0.87
NO_WORD   0.28   1.43   0.16

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.44 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-0.05  1.00 NullPunisher.aux : is
-3.00  1.00 NullPunisher.entity : Kim
-1.00  1.00 NullPunisher.other : clever
-2.00  1.00 RootEntailment.unalignedRoot : "clever" not aligned to anything
Hand-tuned score (dot product of above): -6.4057
Threshold: -9.4738


Inference ID: 219

Txt: Kim is a clever politician.

Hyp: Kim is clever. (don't know)

Kim
NNP
is
VBZ
clever
JJ
Kim:NNP   0.28   1.03   1.24
is:VBZ   1.12   1.32   1.02
a:DT   1.25   1.25   1.18
clever:JJ   1.13   0.96   0.74
politician:NN   1.07   0.82   0.73
NO_WORD   0.28   1.43   0.16

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.44 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : Kim
-1.00  1.00 NullPunisher.other : clever
-0.05  1.00 NullPunisher.aux : is
-2.00  1.00 RootEntailment.unalignedRoot : "clever" not aligned to anything
Hand-tuned score (dot product of above): -6.4057
Threshold: -9.4738


Inference ID: 220

Txt: The PC-6082 is faster than the ITEL-XZ. The ITEL-XZ is fast.

Hyp: The PC-6082 is fast. (yes)

The
DT
PC-6082
NN
is
VBZ
fast
JJ
The:DT   2.42   1.00   1.25   1.18
PC-6082:NN   0.71   0.28   0.82   0.99
is:VBZ   0.76   0.91   1.32   1.02
faster:RBR   1.09   1.21   0.91   1.14
the:DT   0.01   1.00   1.25   1.18
ITEL-XZ:NNP   0.71   0.82   0.82   0.99
The:DT   2.42   1.00   1.25   1.18
ITEL-XZ:NN   0.71   0.82   0.82   0.99
is:VBZ   0.76   0.91   1.32   1.02
fast:JJ   0.76   0.88   0.96   0.74
NO_WORD   0.82   0.28   1.43   0.16

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.91 Alignment.score
 1.00  0.34 Alignment.isGood
-1.00  0.64 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.50 Alignment.txtSpan
-0.05  1.00 NullPunisher.aux : is
-0.10  1.00 NullPunisher.article : The
Hand-tuned score (dot product of above): 0.4092
Threshold: -9.4738


Inference ID: 221

Txt: The PC-6082 is faster than the ITEL-XZ.

Hyp: The PC-6082 is fast. (don't know)

The
DT
PC-6082
NN
is
VBZ
fast
JJ
The:DT   2.42   1.00   1.25   1.18
PC-6082:NN   0.71   0.28   0.82   0.99
is:VBZ   0.76   0.91   1.32   1.02
faster:RBR   1.09   1.21   0.91   1.14
the:DT   0.01   1.00   1.25   1.18
ITEL-XZ:NNP   0.71   0.82   0.82   0.99
NO_WORD   0.82   0.28   1.43   0.16

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.91 Alignment.score
 1.00  0.34 Alignment.isGood
-1.00  0.64 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.50 Alignment.txtSpan
-0.10  1.00 NullPunisher.article : The
-0.05  1.00 NullPunisher.aux : is
Hand-tuned score (dot product of above): 0.4092
Threshold: -9.4738


Inference ID: 222

Txt: The PC-6082 is faster than the ITEL-XZ. The PC-6082 is fast.

Hyp: The ITEL-XZ is fast. (don't know)

The
DT
ITEL-XZ
NN
is
VBZ
fast
JJ
The:DT   2.42   1.00   1.25   1.18
PC-6082:NN   0.71   0.82   0.82   0.99
is:VBZ   0.76   0.91   1.32   1.02
faster:RBR   1.09   1.21   0.91   1.14
the:DT   0.01   1.00   1.25   1.18
ITEL-XZ:NNP   0.71   1.82   0.82   0.99
The:DT   2.42   1.00   1.25   1.18
PC-6082:NN   0.71   0.82   0.82   0.99
is:VBZ   0.76   0.91   1.32   1.02
fast:JJ   0.76   0.88   0.96   0.74
NO_WORD   0.82   0.28   1.43   0.16

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  1.45 Alignment.score
 1.00  0.47 Alignment.isGood
-1.00  0.50 Alignment.isBad
-0.10  1.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.75 Alignment.txtSpan
-0.05  1.00 NullPunisher.aux : is
-2.00  1.00 Structure.parentsMismatch : args have different parents, different relations: text "ITEL-XZ" <-prep_than-- "is" vs. hyp "ITEL-XZ" <-nsubj-- "fast", which aligned to text "faster"
Hand-tuned score (dot product of above): -0.4689
Threshold: -9.4738


Inference ID: 223

Txt: The PC-6082 is faster than the ITEL-XZ. The PC-6082 is slow.

Hyp: The ITEL-XZ is fast. (don't know)

The
DT
ITEL-XZ
NN
is
VBZ
fast
JJ
The:DT   2.42   1.00   1.25   1.18
PC-6082:NN   0.71   0.82   0.82   0.99
is:VBZ   0.76   0.91   1.32   1.02
faster:RBR   1.09   1.21   0.91   1.14
the:DT   0.01   1.00   1.25   1.18
ITEL-XZ:NNP   0.71   1.82   0.82   0.99
The:DT   2.42   1.00   1.25   1.18
PC-6082:NN   0.71   0.82   0.82   0.99
is:VBZ   0.76   0.91   1.32   1.02
slow:JJ   0.76   0.88   0.96   1.36
NO_WORD   0.82   0.28   1.43   0.16

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  1.45 Alignment.score
 1.00  0.47 Alignment.isGood
-1.00  0.50 Alignment.isBad
-0.10  1.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.75 Alignment.txtSpan
-0.05  1.00 NullPunisher.aux : is
-2.00  1.00 Structure.parentsMismatch : args have different parents, different relations: text "ITEL-XZ" <-prep_than-- "is" vs. hyp "ITEL-XZ" <-nsubj-- "fast", which aligned to text "faster"
Hand-tuned score (dot product of above): -0.4689
Threshold: -9.4738


Inference ID: 224

Txt: The PC-6082 is as fast as the ITEL-XZ. The ITEL-XZ is fast.

Hyp: The PC-6082 is fast. (yes)

The
DT
PC-6082
NN
is
VBZ
fast
JJ
The:DT   2.42   1.00   1.25   1.18
PC-6082:NN   0.71   0.28   0.82   0.99
is:VBZ   0.76   0.91   1.32   1.02
as:RB   1.09   1.21   0.80   0.97
fast:RB   1.09   1.21   0.91   2.87
the:DT   0.01   1.00   1.25   1.18
ITEL-XZ:NNP   0.71   0.82   0.82   0.99
The:DT   2.42   1.00   1.25   1.18
ITEL-XZ:NN   0.71   0.82   0.82   0.99
is:VBZ   0.76   0.91   1.32   1.02
fast:JJ   0.76   0.88   0.96   0.74
NO_WORD   0.82   0.28   1.43   0.16

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  1.34 Alignment.score
 1.00  0.44 Alignment.isGood
-1.00  0.53 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "as" modifying "fast" is dropped on aligned hypothesis word "fast"
-0.10  1.00 NullPunisher.article : The
-0.05  1.00 NullPunisher.aux : is
Hand-tuned score (dot product of above): 1.5225
Threshold: -9.4738


Inference ID: 225

Txt: The PC-6082 is as fast as the ITEL-XZ.

Hyp: The PC-6082 is fast. (don't know)

The
DT
PC-6082
NN
is
VBZ
fast
JJ
The:DT   2.42   1.00   1.25   1.18
PC-6082:NN   0.71   0.28   0.82   0.99
is:VBZ   0.76   0.91   1.32   1.02
as:RB   1.09   1.21   0.80   0.97
fast:RB   1.09   1.21   0.91   2.87
the:DT   0.01   1.00   1.25   1.18
ITEL-XZ:NNP   0.71   0.82   0.82   0.99
NO_WORD   0.82   0.28   1.43   0.16

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  1.34 Alignment.score
 1.00  0.44 Alignment.isGood
-1.00  0.53 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "as" modifying "fast" is dropped on aligned hypothesis word "fast"
-0.10  1.00 NullPunisher.article : The
-0.05  1.00 NullPunisher.aux : is
Hand-tuned score (dot product of above): 1.5225
Threshold: -9.4738


Inference ID: 226

Txt: The PC-6082 is as fast as the ITEL-XZ. The PC-6082 is fast.

Hyp: The ITEL-XZ is fast. (don't know)

The
DT
ITEL-XZ
NN
is
VBZ
fast
JJ
The:DT   2.42   1.00   1.25   1.18
PC-6082:NN   0.71   0.82   0.82   0.99
is:VBZ   0.76   0.91   1.32   1.02
as:RB   1.09   1.21   0.80   0.97
fast:RB   1.09   1.21   0.91   2.87
the:DT   0.01   1.00   1.25   1.18
ITEL-XZ:NNP   0.71   1.82   0.82   0.99
The:DT   2.42   1.00   1.25   1.18
PC-6082:NN   0.71   0.82   0.82   0.99
is:VBZ   0.76   0.91   1.32   1.02
fast:JJ   0.76   0.88   0.96   0.74
NO_WORD   0.82   0.28   1.43   0.16

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  1.88 Alignment.score
 1.00  0.57 Alignment.isGood
-1.00  0.40 Alignment.isBad
-0.10  1.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.75 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "as" modifying "fast" is dropped on aligned hypothesis word "fast"
-0.05  1.00 NullPunisher.aux : is
-2.00  1.00 Structure.parentsMismatch : args have different parents, different relations: text "ITEL-XZ" <-prep_as-- "is" vs. hyp "ITEL-XZ" <-nsubj-- "fast", which aligned to text "fast"
-3.00  1.00 Structure.parentsMismatch&Align.veryGood :
 0.50  1.00 Adjunct.dropPosCxt&Align.veryGood :
Hand-tuned score (dot product of above): -1.8234
Threshold: -9.4738


Inference ID: 227

Txt: The PC-6082 is as fast as the ITEL-XZ. The PC-6082 is slow.

Hyp: The ITEL-XZ is fast. (don't know)

The
DT
ITEL-XZ
NN
is
VBZ
fast
JJ
The:DT   2.42   1.00   1.25   1.18
PC-6082:NN   0.71   0.82   0.82   0.99
is:VBZ   0.76   0.91   1.32   1.02
as:RB   1.09   1.21   0.80   0.97
fast:RB   1.09   1.21   0.91   2.87
the:DT   0.01   1.00   1.25   1.18
ITEL-XZ:NNP   0.71   1.82   0.82   0.99
The:DT   2.42   1.00   1.25   1.18
PC-6082:NN   0.71   0.82   0.82   0.99
is:VBZ   0.76   0.91   1.32   1.02
slow:JJ   0.76   0.88   0.96   1.36
NO_WORD   0.82   0.28   1.43   0.16

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  1.88 Alignment.score
 1.00  0.57 Alignment.isGood
-1.00  0.40 Alignment.isBad
-0.10  1.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.75 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "as" modifying "fast" is dropped on aligned hypothesis word "fast"
-0.05  1.00 NullPunisher.aux : is
-2.00  1.00 Structure.parentsMismatch : args have different parents, different relations: text "ITEL-XZ" <-prep_as-- "is" vs. hyp "ITEL-XZ" <-nsubj-- "fast", which aligned to text "fast"
-3.00  1.00 Structure.parentsMismatch&Align.veryGood :
 0.50  1.00 Adjunct.dropPosCxt&Align.veryGood :
Hand-tuned score (dot product of above): -1.8234
Threshold: -9.4738


Inference ID: 228

Txt: The PC-6082 is as fast as the ITEL-XZ.

Hyp: The PC-6082 is faster than the ITEL-XZ. (don't know)

The
DT
PC-6082
NN
is
VBZ
faster
RBR
the
DT
ITEL-XZ
NNP
The:DT   2.42   1.00   1.25   1.45   0.01   1.00
PC-6082:NN   0.71   0.28   0.82   1.31   0.71   0.82
is:VBZ   0.76   0.91   1.32   1.11   0.76   0.91
as:RB   1.09   1.21   0.80   0.86   1.09   1.21
fast:RB   1.09   1.21   0.91   1.33   1.09   1.21
the:DT   0.01   1.00   1.25   1.45   2.42   1.00
ITEL-XZ:NNP   0.71   0.82   0.82   1.31   0.71   0.28
NO_WORD   0.82   0.28   0.17   0.04   0.82   0.07

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.46 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "ITEL-XZ" modifying "is"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "as" modifying "fast" is dropped on aligned hypothesis word "faster"
-0.05  1.00 NullPunisher.aux : is
-0.10  1.00 NullPunisher.article : The
-1.00  1.00 NullPunisher.other : ITEL-XZ
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : PC-6082
-2.00  1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -5.1535
Threshold: -9.4738


Inference ID: 229

Txt: The PC-6082 is as fast as the ITEL-XZ.

Hyp: The PC-6082 is slower than the ITEL-XZ. (don't know)

The
DT
PC-6082
NN
is
VBZ
slower
JJR
the
DT
ITEL-XZ
NNP
The:DT   2.42   1.00   1.25   1.18   0.01   1.00
PC-6082:NN   0.71   0.28   0.82   0.99   0.71   0.82
is:VBZ   0.76   0.91   1.32   1.02   0.76   0.91
as:RB   1.09   1.21   0.80   1.05   1.09   1.21
fast:RB   1.09   1.21   0.91   0.45   1.09   1.21
the:DT   0.01   1.00   1.25   1.18   2.42   1.00
ITEL-XZ:NNP   0.71   0.82   0.82   0.99   0.71   0.28
NO_WORD   0.82   0.28   1.43   0.16   0.82   0.07

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.48 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "ITEL-XZ" modifying "slower"
-1.00  1.00 NullPunisher.other : slower
-1.00  1.00 NullPunisher.other : ITEL-XZ
-1.00  1.00 NullPunisher.other : PC-6082
-0.10  1.00 NullPunisher.article : The
-0.05  1.00 NullPunisher.aux : is
-0.10  1.00 NullPunisher.article : the
-2.00  1.00 RootEntailment.unalignedRoot : "slower" not aligned to anything
Hand-tuned score (dot product of above): -6.8496
Threshold: -9.4738


Inference ID: 230

Txt: ITEL won more orders than APCOM did.

Hyp: ITEL won some orders. (yes)

ITEL
NNP
won
VBD
some
DT
orders
NNS
ITEL:NNP   0.28   1.07   0.96   1.07
won:VBD   1.16   1.32   0.73   0.91
more:JJR   1.13   0.93   0.65   0.83
orders:NNS   1.07   0.82   0.71   0.28
APCOM:NNP   1.07   0.82   0.68   0.82
did:VBD   1.16   0.71   0.76   0.85
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.20 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : won
-1.00  1.00 NullPunisher.other : some
-3.00  1.00 NullPunisher.entity : ITEL
-1.00  1.00 NullPunisher.other : orders
-2.00  1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -8.7790
Threshold: -9.4738


Inference ID: 231

Txt: ITEL won more orders than APCOM did.

Hyp: APCOM won some orders. (don't know)

APCOM
NNP
won
VBD
some
DT
orders
NNS
ITEL:NNP   1.07   1.07   0.96   1.07
won:VBD   0.91   1.32   0.73   0.91
more:JJR   0.88   0.93   0.65   0.83
orders:NNS   0.82   0.82   0.71   0.28
APCOM:NNP   0.28   0.82   0.68   0.82
did:VBD   0.91   0.71   0.76   0.85
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.20 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : some
-1.00  1.00 NullPunisher.other : APCOM
-1.00  1.00 NullPunisher.other : won
-1.00  1.00 NullPunisher.other : orders
-2.00  1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -6.7790
Threshold: -9.4738


Inference ID: 232

Txt: ITEL won more orders than APCOM did. APCOM won ten orders.

Hyp: ITEL won at least eleven orders. (yes)

ITEL
NNP
won
VBD
at
IN
least
JJS
eleven
NN
orders
NNS
ITEL:NNP   0.28   1.07   0.75   1.24   1.07   1.07
won:VBD   1.16   1.32   1.15   1.02   0.68   0.91
more:JJR   1.13   0.93   1.05   0.93   0.88   0.83
orders:NNS   1.07   0.82   0.50   0.97   0.71   0.28
APCOM:NNP   1.07   0.82   0.50   0.99   0.82   0.82
did:VBD   1.16   0.71   1.15   1.02   0.84   0.85
APCOM:NNP   1.07   0.82   0.50   0.99   0.82   0.82
won:VBD   1.16   1.32   1.15   1.02   0.68   0.91
ten:NN   0.98   0.57   0.50   0.99   0.00   0.87
orders:NNS   1.07   0.82   0.50   0.97   0.71   0.28
NO_WORD   0.28   0.17   0.99   0.29   0.05   0.25

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.24 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
 0.00  1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : won
-1.00  1.00 NullPunisher.other : orders
-1.00  1.00 NullPunisher.other : at
-1.00  1.00 NullPunisher.other : least
-3.00  1.00 NullPunisher.entity : ITEL
-2.00  1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -9.7116
Threshold: -9.4738


Inference ID: 233

Txt: ITEL won more orders than APCOM.

Hyp: ITEL won some orders. (yes)

ITEL
NNP
won
VBD
some
DT
orders
NNS
ITEL:NNP   0.28   1.07   0.96   1.07
won:VBD   1.16   1.32   0.73   0.91
more:JJR   1.13   0.93   0.65   0.83
orders:NNS   1.07   0.82   0.71   0.28
APCOM:NNP   1.07   0.82   0.68   0.82
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.20 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : won
-1.00  1.00 NullPunisher.other : orders
-1.00  1.00 NullPunisher.other : some
-3.00  1.00 NullPunisher.entity : ITEL
-2.00  1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -8.7790
Threshold: -9.4738


Inference ID: 234

Txt: ITEL won more orders than APCOM.

Hyp: APCOM won some orders. (don't know)

APCOM
NNP
won
VBD
some
DT
orders
NNS
ITEL:NNP   1.07   1.07   0.96   1.07
won:VBD   0.91   1.32   0.73   0.91
more:JJR   0.88   0.93   0.65   0.83
orders:NNS   0.82   0.82   0.71   0.28
APCOM:NNP   0.28   0.82   0.68   0.82
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.20 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : orders
-1.00  1.00 NullPunisher.other : APCOM
-1.00  1.00 NullPunisher.other : won
-1.00  1.00 NullPunisher.other : some
-2.00  1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -6.7790
Threshold: -9.4738


Inference ID: 235

Txt: ITEL won more orders than APCOM. APCOM won ten orders.

Hyp: ITEL won at least eleven orders. (yes)

ITEL
NNP
won
VBD
at
IN
least
JJS
eleven
NN
orders
NNS
ITEL:NNP   0.28   1.07   0.75   1.24   1.07   1.07
won:VBD   1.16   1.32   1.15   1.02   0.68   0.91
more:JJR   1.13   0.93   1.05   0.93   0.88   0.83
orders:NNS   1.07   0.82   0.50   0.97   0.71   0.28
APCOM:NNP   1.07   0.82   0.50   0.99   0.82   0.82
APCOM:NNP   1.07   0.82   0.50   0.99   0.82   0.82
won:VBD   1.16   1.32   1.15   1.02   0.68   0.91
ten:NN   0.98   0.57   0.50   0.99   0.00   0.87
orders:NNS   1.07   0.82   0.50   0.97   0.71   0.28
NO_WORD   0.28   0.17   0.99   0.29   0.05   0.25

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.24 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
 0.00  1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : orders
-1.00  1.00 NullPunisher.other : won
-3.00  1.00 NullPunisher.entity : ITEL
-1.00  1.00 NullPunisher.other : at
-1.00  1.00 NullPunisher.other : least
-2.00  1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -9.7116
Threshold: -9.4738


Inference ID: 236

Txt: ITEL won more orders than the APCOM contract.

Hyp: ITEL won the APCOM contract. (yes)

ITEL
NNP
won
VBD
the
DT
APCOM
NNP
contract
NN
ITEL:NNP   0.28   1.07   0.93   1.07   1.07
won:VBD   1.16   1.32   0.76   0.91   0.65
more:JJR   1.13   0.93   0.73   0.88   0.88
orders:NNS   1.07   0.82   0.71   0.82   0.92
the:DT   1.22   1.25   2.42   1.00   1.00
APCOM:NNP   1.07   0.82   0.71   0.28   0.82
contract:NN   1.07   0.56   0.71   0.82   0.28
NO_WORD   0.28   0.17   0.82   0.05   0.25

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.18 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "APCOM" modifying "contract"
-1.00  1.00 NullPunisher.other : won
-1.00  1.00 NullPunisher.other : contract
-3.00  1.00 NullPunisher.entity : ITEL
-1.00  1.00 NullPunisher.other : APCOM
-0.10  1.00 NullPunisher.article : the
-2.00  1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -10.0034
Threshold: -9.4738


Inference ID: 237

Txt: ITEL won more orders than the APCOM contract.

Hyp: ITEL won more than one order. (yes)

ITEL
NNP
won
VBD
more_than
IN
one
CD
order
NN
ITEL:NNP   0.28   1.07   0.75   1.17   1.05
won:VBD   1.16   1.32   1.15   1.32   0.91
more:JJR   1.13   0.93   0.12   1.03   0.80
orders:NNS   1.07   0.82   0.48   1.27   0.95
the:DT   1.22   1.25   1.05   1.11   1.00
APCOM:NNP   1.07   0.82   0.50   1.29   0.82
contract:NN   1.07   0.56   0.46   1.29   0.90
NO_WORD   0.28   0.17   0.84   0.52   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.23 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.20 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "one" modifying "order"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "more" modifying "orders" is dropped on aligned hypothesis word "order"
-3.00  1.00 NullPunisher.entity : one
-1.00  1.00 NullPunisher.other : more_than
-3.00  1.00 NullPunisher.entity : ITEL
-1.00  1.00 NullPunisher.other : won
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '>1.0' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -17.1146
Threshold: -9.4738


Inference ID: 238

Txt: ITEL won twice as many orders than APCOM. APCOM won ten orders.

Hyp: ITEL won twenty orders. (yes)

ITEL
NNP
won
VBD
twenty
CD
orders
NNS
ITEL:NNP   0.28   1.07   1.20   1.07
won:VBD   1.16   1.32   1.36   0.91
twice:RB   1.45   0.54   1.14   1.20
as:RB   1.47   0.91   1.36   1.21
many:JJ   1.13   0.93   1.08   0.88
orders:NNS   1.07   0.82   1.29   0.28
APCOM:NNP   1.07   0.82   1.29   0.82
APCOM:NNP   1.07   0.82   1.29   0.82
won:VBD   1.16   1.32   1.36   0.91
ten:NN   0.98   0.57   0.50   0.87
orders:NNS   1.07   0.82   1.29   0.28
NO_WORD   0.28   0.17   0.52   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00 -0.13 Alignment.score
 1.00  0.15 Alignment.isGood
-1.00  0.83 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 NullPunisher.other : won
-3.00  1.00 NullPunisher.entity : ITEL
-1.00  1.00 NullPunisher.other : orders
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '20.0' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -13.9813
Threshold: -9.4738


Inference ID: 239

Txt: ITEL won more orders than APCOM lost.

Hyp: ITEL won some orders. (yes)

ITEL
NNP
won
VBD
some
DT
orders
NNS
ITEL:NNP   0.28   1.07   0.96   1.07
won:VBD   1.16   1.32   0.73   0.91
more:JJR   1.13   0.93   0.65   0.83
orders:NNS   1.07   0.82   0.71   0.28
APCOM:NNP   1.07   0.82   0.68   0.82
lost:VBD   1.16   1.33   0.70   0.91
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.20 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : some
-3.00  1.00 NullPunisher.entity : ITEL
-1.00  1.00 NullPunisher.other : orders
-1.00  1.00 NullPunisher.other : won
-2.00  1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -8.7790
Threshold: -9.4738


Inference ID: 240

Txt: ITEL won more orders than APCOM lost.

Hyp: APCOM lost some orders. (don't know)

APCOM
NNP
lost
VBD
some
DT
orders
NNS
ITEL:NNP   1.07   1.07   0.96   1.07
won:VBD   0.91   1.33   0.73   0.91
more:JJR   0.88   0.90   0.65   0.83
orders:NNS   0.82   0.82   0.71   0.28
APCOM:NNP   0.28   0.82   0.68   0.82
lost:VBD   0.91   1.32   0.70   0.91
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.20 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : APCOM
-1.00  1.00 NullPunisher.other : orders
-1.00  1.00 NullPunisher.other : lost
-1.00  1.00 NullPunisher.other : some
-2.00  1.00 RootEntailment.unalignedRoot : "lost" not aligned to anything
Hand-tuned score (dot product of above): -6.7790
Threshold: -9.4738


Inference ID: 241

Txt: ITEL won more orders than APCOM lost. APCOM lost ten orders.

Hyp: ITEL won at least eleven orders. (yes)

ITEL
NNP
won
VBD
at
IN
least
JJS
eleven
NN
orders
NNS
ITEL:NNP   0.28   1.07   0.75   1.24   1.07   1.07
won:VBD   1.16   1.32   1.15   1.02   0.68   0.91
more:JJR   1.13   0.93   1.05   0.93   0.88   0.83
orders:NNS   1.07   0.82   0.50   0.97   0.71   0.28
APCOM:NNP   1.07   0.82   0.50   0.99   0.82   0.82
lost:VBD   1.16   1.33   1.15   0.90   0.83   0.91
APCOM:NNP   1.07   0.82   0.50   0.99   0.82   0.82
lost:VBD   1.16   1.33   1.15   0.90   0.83   0.91
ten:NN   0.98   0.57   0.50   0.99   0.00   0.87
orders:NNS   1.07   0.82   0.50   0.97   0.71   0.28
NO_WORD   0.28   0.17   0.99   0.29   0.05   0.25

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.24 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
 0.00  1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : orders
-1.00  1.00 NullPunisher.other : won
-1.00  1.00 NullPunisher.other : least
-1.00  1.00 NullPunisher.other : at
-3.00  1.00 NullPunisher.entity : ITEL
-2.00  1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -9.7116
Threshold: -9.4738


Inference ID: 242

Txt: The PC-6082 is faster than 500 MIPS. The ITEL-ZX is slower than 500 MIPS.

Hyp: The PC-6082 is faster than the ITEL-ZX. (yes)

The
DT
PC-6082
NN
is
VBZ
faster
RBR
the
DT
ITEL-ZX
NNP
The:DT   2.42   1.00   1.25   1.45   0.01   1.00
PC-6082:NN   0.71   0.28   0.82   1.31   0.71   0.82
is:VBZ   0.76   0.91   1.32   1.11   0.76   0.91
faster:RBR   1.09   1.21   0.91   1.23   1.09   1.21
500:CD   0.89   1.33   1.30   1.30   0.89   1.33
MIPS:NNS   0.71   0.85   0.74   1.31   0.71   0.82
The:DT   2.42   1.00   1.25   1.45   0.01   1.00
ITEL-ZX:NN   0.71   0.82   0.82   1.31   0.71   1.82
is:VBZ   0.76   0.91   1.32   1.11   0.76   0.91
slower:JJR   0.76   0.88   0.96   0.23   0.76   0.88
500:CD   0.89   1.33   1.30   1.30   0.89   1.33
MIPS:NNS   0.71   0.85   0.74   1.31   0.71   0.82
NO_WORD   0.82   0.28   0.17   0.04   0.82   0.07

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.66 Alignment.score
 1.00  0.28 Alignment.isGood
-1.00  0.69 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.33 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "faster" modifying "is"
-0.10  1.00 NullPunisher.article : The
-1.00  1.00 NullPunisher.other : PC-6082
-1.00  1.00 NullPunisher.other : faster
-0.05  1.00 NullPunisher.aux : is
-2.00  1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -5.0640
Threshold: -9.4738


Inference ID: 243

Txt: ITEL sold 3000 more computers than APCOM. APCOM sold exactly 2500 computers.

Hyp: ITEL sold 5500 computers. (yes)

ITEL
NNP
sold
VBD
5500
CD
computers
NNS
ITEL:NNP   0.28   1.07   1.20   1.07
sold:VBD   1.16   1.32   1.39   0.80
3000:CD   1.24   1.30   0.02   1.24
more:JJR   1.13   0.90   1.13   0.88
computers:NNS   1.07   0.71   1.19   0.28
APCOM:NNP   1.07   0.82   1.29   0.82
APCOM:NNP   1.07   0.82   1.29   0.82
sold:VBD   1.16   1.32   1.39   0.80
exactly:RB   1.47   0.91   1.32   1.21
2500:CD   1.24   1.30   1.22   1.18
computers:NNS   1.07   0.71   1.19   0.28
NO_WORD   0.28   0.17   0.52   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.30 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 NullPunisher.other : computers
-1.00  1.00 NullPunisher.other : sold
-3.00  1.00 NullPunisher.entity : ITEL
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '5500.0' vs '2500.0'
-2.00  1.00 RootEntailment.unalignedRoot : "sold" not aligned to anything
Hand-tuned score (dot product of above): -13.4173
Threshold: -9.4738


Inference ID: 244

Txt: APCOM has a more important customer than ITEL.

Hyp: APCOM has a more important customer than ITEL is. (yes)

APCOM
NNP
has
VBZ
a
DT
more
RBR
important
JJ
customer
NN
than
IN
ITEL
NNP
is
VBZ
APCOM:NNP   0.28   0.82   0.71   1.31   0.99   0.80   0.50   1.07   0.82
has:VBZ   0.91   1.32   0.76   1.11   1.02   0.91   1.06   1.16   1.01
a:DT   1.00   1.25   2.42   1.45   1.18   1.00   1.05   1.25   1.25
more:RBR   1.21   0.91   1.09   1.23   1.03   1.21   1.05   1.47   0.91
important:JJ   0.88   0.96   0.76   1.00   0.74   0.85   1.05   1.13   0.96
customer:NN   0.80   0.82   0.71   1.31   0.96   0.28   0.50   1.07   0.82
ITEL:NNP   1.07   1.07   0.96   1.56   1.24   1.07   0.75   0.28   1.07
NO_WORD   0.28   0.17   0.82   0.04   0.11   0.09   0.92   0.28   0.31

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.21 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  9.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "important" modifying "customer"
-1.00  1.00 NullPunisher.other : than
-1.00  1.00 NullPunisher.other : customer
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : APCOM
-3.00  1.00 NullPunisher.entity : ITEL
-0.05  1.00 NullPunisher.aux : is
-0.05  1.00 NullPunisher.aux : has
-1.00  1.00 NullPunisher.other : more
-1.00  1.00 NullPunisher.other : important
-2.00  1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -12.4637
Threshold: -9.4738


Inference ID: 245

Txt: APCOM has a more important customer than ITEL.

Hyp: APCOM has a more important customer than ITEL has. (yes)

APCOM
NNP
has
VBZ
a
DT
more
RBR
important
JJ
customer
NN
than
IN
ITEL
NNP
has
VBZ
APCOM:NNP   0.28   0.82   0.71   1.31   0.99   0.80   0.50   1.07   0.82
has:VBZ   0.91   1.32   0.76   1.11   1.02   0.91   1.06   1.16   1.32
a:DT   1.00   1.25   2.42   1.45   1.18   1.00   1.05   1.25   1.25
more:RBR   1.21   0.91   1.09   1.23   1.03   1.21   1.05   1.47   0.91
important:JJ   0.88   0.96   0.76   1.00   0.74   0.85   1.05   1.13   0.96
customer:NN   0.80   0.82   0.71   1.31   0.96   0.28   0.50   1.07   0.82
ITEL:NNP   1.07   1.07   0.96   1.56   1.24   1.07   0.75   0.28   1.07
NO_WORD   0.28   0.17   0.82   0.04   0.11   0.09   0.92   0.28   0.31

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.21 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  9.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "important" modifying "customer"
-1.00  1.00 NullPunisher.other : than
-1.00  1.00 NullPunisher.other : important
-1.00  1.00 NullPunisher.other : APCOM
-0.05  1.00 NullPunisher.aux : has
-1.00  1.00 NullPunisher.other : customer
-0.05  1.00 NullPunisher.aux : has
-1.00  1.00 NullPunisher.other : more
-3.00  1.00 NullPunisher.entity : ITEL
-0.10  1.00 NullPunisher.article : a
-2.00  1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -12.4637
Threshold: -9.4738


Inference ID: 246

Txt: The PC-6082 is faster than every ITEL computer. The ITEL-ZX is an ITEL computer.

Hyp: The PC-6082 is faster than the ITEL-ZX. (yes)

The
DT
PC-6082
NN
is
VBZ
faster
RBR
the
DT
ITEL-ZX
NNP
The:DT   2.42   1.00   1.25   1.45   0.01   1.00
PC-6082:NN   0.71   0.28   0.82   1.31   0.71   0.82
is:VBZ   0.76   0.91   1.32   1.11   0.76   0.91
faster:RBR   1.09   1.21   0.91   1.23   1.09   1.21
every:DT   1.31   1.00   1.25   1.43   1.31   1.00
ITEL:NNP   0.93   1.07   1.07   1.52   0.93   0.14
computer:NN   0.71   0.67   0.82   0.94   0.71   0.82
The:DT   2.42   1.00   1.25   1.45   0.01   1.00
ITEL-ZX:NN   0.71   0.82   0.82   1.31   0.71   1.82
is:VBZ   0.76   0.91   1.32   1.11   0.76   0.91
an:DT   1.31   1.00   1.25   1.45   1.31   1.00
ITEL:NNP   0.93   1.07   1.07   1.52   0.93   0.14
computer:NN   0.71   0.67   0.82   0.94   0.71   0.82
NO_WORD   0.82   0.28   0.17   0.04   0.82   0.07

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.66 Alignment.score
 1.00  0.28 Alignment.isGood
-1.00  0.69 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.33 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "faster" modifying "is"
-1.00  1.00 NullPunisher.other : faster
-0.10  1.00 NullPunisher.article : The
-0.05  1.00 NullPunisher.aux : is
-1.00  1.00 NullPunisher.other : PC-6082
-2.00  1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -5.0640
Threshold: -9.4738


Inference ID: 247

Txt: The PC-6082 is faster than some ITEL computer. The ITEL-ZX is an ITEL computer.

Hyp: The PC-6082 is faster than the ITEL-ZX. (don't know)

The
DT
PC-6082
NN
is
VBZ
faster
RBR
the
DT
ITEL-ZX
NNP
The:DT   2.42   1.00   1.25   1.45   0.01   1.00
PC-6082:NN   0.71   0.28   0.82   1.31   0.71   0.82
is:VBZ   0.76   0.91   1.32   1.11   0.76   0.91
faster:RBR   1.09   1.21   0.91   1.23   1.09   1.21
some:DT   1.28   1.00   1.25   1.45   1.28   1.00
ITEL:NNP   0.93   1.07   1.07   1.52   0.93   0.14
computer:NN   0.71   0.67   0.82   0.94   0.71   0.82
The:DT   2.42   1.00   1.25   1.45   0.01   1.00
ITEL-ZX:NN   0.71   0.82   0.82   1.31   0.71   1.82
is:VBZ   0.76   0.91   1.32   1.11   0.76   0.91
an:DT   1.31   1.00   1.25   1.45   1.31   1.00
ITEL:NNP   0.93   1.07   1.07   1.52   0.93   0.14
computer:NN   0.71   0.67   0.82   0.94   0.71   0.82
NO_WORD   0.82   0.28   0.17   0.04   0.82   0.07

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.66 Alignment.score
 1.00  0.28 Alignment.isGood
-1.00  0.69 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.33 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "faster" modifying "is"
-1.00  1.00 NullPunisher.other : faster
-1.00  1.00 NullPunisher.other : PC-6082
-0.05  1.00 NullPunisher.aux : is
-0.10  1.00 NullPunisher.article : The
-2.00  1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -5.0640
Threshold: -9.4738


Inference ID: 248

Txt: The PC-6082 is faster than any ITEL computer. The ITEL-ZX is an ITEL computer.

Hyp: The PC-6082 is faster than the ITEL-ZX. (yes)

The
DT
PC-6082
NN
is
VBZ
faster
RBR
the
DT
ITEL-ZX
NNP
The:DT   2.42   1.00   1.25   1.45   0.01   1.00
PC-6082:NN   0.71   0.28   0.82   1.31   0.71   0.82
is:VBZ   0.76   0.91   1.32   1.11   0.76   0.91
faster:RBR   1.09   1.21   0.91   1.23   1.09   1.21
any:DT   1.31   1.00   1.25   1.45   1.31   1.00
ITEL:NNP   0.93   1.07   1.07   1.52   0.93   0.14
computer:NN   0.71   0.67   0.82   0.94   0.71   0.82
The:DT   2.42   1.00   1.25   1.45   0.01   1.00
ITEL-ZX:NN   0.71   0.82   0.82   1.31   0.71   1.82
is:VBZ   0.76   0.91   1.32   1.11   0.76   0.91
an:DT   1.31   1.00   1.25   1.45   1.31   1.00
ITEL:NNP   0.93   1.07   1.07   1.52   0.93   0.14
computer:NN   0.71   0.67   0.82   0.94   0.71   0.82
NO_WORD   0.82   0.28   0.17   0.04   0.82   0.07

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.66 Alignment.score
 1.00  0.28 Alignment.isGood
-1.00  0.69 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.33 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "faster" modifying "is"
-0.05  1.00 NullPunisher.aux : is
-1.00  1.00 NullPunisher.other : PC-6082
-1.00  1.00 NullPunisher.other : faster
-0.10  1.00 NullPunisher.article : The
-2.00  1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -5.0640
Threshold: -9.4738


Inference ID: 249

Txt: The PC-6082 is faster than the ITEL-ZX and the ITEL-ZY.

Hyp: The PC-6082 is faster than the ITEL-ZX. (yes)

The
DT
PC-6082
NN
is
VBZ
faster
RBR
the
DT
ITEL-ZX
NNP
The:DT   2.42   1.00   1.25   1.45   0.01   1.00
PC-6082:NN   0.71   0.28   0.82   1.31   0.71   0.82
is:VBZ   0.76   0.91   1.32   1.11   0.76   0.91
faster:RBR   1.09   1.21   0.91   1.23   1.09   1.21
the:DT   0.01   1.00   1.25   1.45   2.42   1.00
ITEL-ZX:NN   0.71   0.82   0.82   1.31   0.71   1.82
the:DT   0.01   1.00   1.25   1.45   2.42   1.00
ITEL-ZY:NN   0.71   0.82   0.82   1.31   0.71   0.98
NO_WORD   0.82   0.28   0.17   0.04   0.82   0.07

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.56 Alignment.score
 1.00  0.27 Alignment.isGood
-1.00  0.71 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "faster" modifying "is"
-0.10  1.00 NullPunisher.article : The
-0.05  1.00 NullPunisher.aux : is
-1.00  1.00 NullPunisher.other : PC-6082
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : faster
-2.00  1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -5.5142
Threshold: -9.4738


Inference ID: 250

Txt: The PC-6082 is faster than the ITEL-ZX or the ITEL-ZY.

Hyp: The PC-6082 is faster than the ITEL-ZX. (yes)

The
DT
PC-6082
NN
is
VBZ
faster
RBR
the
DT
ITEL-ZX
NNP
The:DT   2.42   1.00   1.25   1.45   0.01   1.00
PC-6082:NN   0.71   0.28   0.82   1.31   0.71   0.82
is:VBZ   0.76   0.91   1.32   1.11   0.76   0.91
faster:RBR   1.09   1.21   0.91   1.23   1.09   1.21
the:DT   0.01   1.00   1.25   1.45   2.42   1.00
ITEL-ZX:NN   0.71   0.82   0.82   1.31   0.71   1.82
the:DT   0.01   1.00   1.25   1.45   2.42   1.00
ITEL-ZY:NN   0.71   0.82   0.82   1.31   0.71   0.98
NO_WORD   0.82   0.28   0.17   0.04   0.82   0.07

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.56 Alignment.score
 1.00  0.27 Alignment.isGood
-1.00  0.71 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "faster" modifying "is"
-1.00  1.00 NullPunisher.other : PC-6082
-1.00  1.00 NullPunisher.other : faster
-0.10  1.00 NullPunisher.article : The
-0.10  1.00 NullPunisher.article : the
-0.05  1.00 NullPunisher.aux : is
-2.00  1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -5.5142
Threshold: -9.4738


Inference ID: 251

Txt: ITEL has a factory in Birmingham.

Hyp: ITEL currently has a factory in Birmingham. (yes)

ITEL
NNP
currently
RB
has
VBZ
a
DT
factory
NN
Birmingham
NNP
ITEL:NNP   0.28   1.56   1.07   0.96   1.07   0.99
has:VBZ   1.16   1.11   1.32   0.76   0.91   1.16
a:DT   1.25   1.45   1.25   2.42   1.00   1.25
factory:NN   1.07   1.31   0.82   0.71   0.28   1.06
Birmingham:NNP   0.99   1.53   1.07   0.96   1.06   0.28
NO_WORD   0.28   0.04   0.17   0.82   0.09   0.07

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.13 Alignment.score
 1.00  0.19 Alignment.isGood
-1.00  0.79 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Birmingham" modifying "factory"
-1.00  1.00 NullPunisher.other : currently
-0.10  1.00 NullPunisher.article : a
-0.05  1.00 NullPunisher.aux : has
-3.00  1.00 NullPunisher.entity : ITEL
-3.00  1.00 NullPunisher.entity : Birmingham
-1.00  1.00 NullPunisher.other : factory
-2.00  1.00 RootEntailment.unalignedRoot : "has" not aligned to anything
Hand-tuned score (dot product of above): -12.2229
Threshold: -9.4738


Inference ID: 252

Txt: Since 1992 ITEL has been in Birmingham. It is now 1996.

Hyp: Itel was in Birmingham in 1993. (yes)

Itel
NNP
was
VBD
Birmingham
NNP
1993
CD
1992:CD   1.24   1.30   1.24   1.60
ITEL:NNP   0.54   1.07   0.99   1.20
has:VBZ   1.16   0.84   1.16   1.39
been:VBN   1.11   0.10   1.16   1.39
Birmingham:NNP   0.99   1.07   0.28   1.20
It:PRP   1.48   0.97   1.55   1.46
is:VBZ   1.16   0.05   1.16   1.39
now:RB   1.47   0.91   1.47   1.36
1996:CD   1.24   1.30   1.24   0.95
NO_WORD   0.28   0.17   0.07   0.57

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.56 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.71 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.50 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Birmingham" modifying "was"
-3.00  1.00 Date.dateHeadMismatch : 1993 vs. 1992
-3.00  1.00 NullPunisher.entity : Birmingham
-0.05  1.00 NullPunisher.aux : was
-2.00  1.00 RootEntailment.unalignedRoot : "was" not aligned to anything
Hand-tuned score (dot product of above): -8.9872
Threshold: -9.4738


Inference ID: 253

Txt: ITEL has developed a new editor since 1992. It is now 1996.

Hyp: ITEL developed a new editor in 1993. (don't know)

ITEL
NNP
developed
VBD
a
DT
new
JJ
editor
NN
1993
CD
ITEL:NNP   0.28   1.07   0.96   1.21   1.03   1.20
has:VBZ   1.16   0.79   0.76   1.02   0.91   1.39
developed:VBN   1.16   1.50   0.76   0.95   0.88   1.27
a:DT   1.25   1.25   2.42   1.18   1.00   1.18
new:JJ   1.10   0.89   0.76   0.74   0.84   1.13
editor:NN   1.03   0.79   0.71   0.94   0.28   1.19
1992:CD   1.24   1.13   0.89   1.00   1.32   1.60
It:PRP   2.69   0.97   1.10   1.39   1.30   1.46
is:VBZ   1.16   0.84   0.76   1.02   0.91   1.39
now:RB   1.47   0.91   1.09   0.90   1.21   1.36
1996:CD   1.24   1.30   0.89   0.79   1.26   0.95
NO_WORD   0.28   0.17   0.82   0.11   0.09   0.57

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.84 Alignment.score
 1.00  0.32 Alignment.isGood
-1.00  0.65 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.33 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "new" modifying "editor"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "new" modifying "editor" is dropped on aligned hypothesis word "editor"
-3.00  1.00 Date.dateHeadMismatch : 1993 vs. 1992
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : new
-1.00  1.00 Structure.relMismatch : text "1992" is prep_since of "developed" while hyp "1993" is prep_in of "developed" which aligned to text "developed"
Hand-tuned score (dot product of above): -5.0548
Threshold: -9.4738


Inference ID: 254

Txt: ITEL has expanded since 1992. It is now 1996.

Hyp: ITEL expanded in 1993. (don't know)

ITEL
NNP
expanded
VBD
1993
CD
ITEL:NNP   0.28   1.07   1.20
has:VBZ   1.16   0.74   1.39
expanded:VBN   1.16   2.02   1.11
1992:CD   1.24   1.06   1.60
It:PRP   2.69   0.97   1.46
is:VBZ   1.16   0.79   1.39
now:RB   1.47   0.91   1.36
1996:CD   1.24   1.22   0.95
NO_WORD   0.28   0.17   0.57

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  1.37 Alignment.score
 1.00  0.45 Alignment.isGood
-1.00  0.52 Alignment.isBad
-0.10  0.00 Alignment.nbWordNotAligned
 0.10  3.00 Alignment.hypSpan
 0.10  1.00 Alignment.txtSpan
-3.00  1.00 Date.dateHeadMismatch : 1993 vs. 1992
-1.00  1.00 Structure.relMismatch : text "1992" is prep_since of "expanded" while hyp "1993" is prep_in of "expanded" which aligned to text "expanded"
Hand-tuned score (dot product of above): -2.3128
Threshold: -9.4738


Inference ID: 255

Txt: Since 1992 ITEL has made a loss. It is now 1996.

Hyp: ITEL made a loss in 1993. (yes)

ITEL
NNP
made
VBD
a
DT
loss
NN
1993
CD
1992:CD   1.24   0.91   0.89   1.33   1.60
ITEL:NNP   0.28   1.07   0.96   1.07   1.20
has:VBZ   1.16   0.35   0.76   0.87   1.39
made:VBN   1.16   2.10   0.76   0.76   1.06
a:DT   1.25   1.25   2.42   1.00   1.18
loss:NN   1.07   0.67   0.71   0.28   1.22
It:PRP   1.48   0.97   1.10   1.30   1.46
is:VBZ   1.16   0.92   0.76   0.91   1.39
now:RB   1.47   0.91   1.09   1.18   1.36
1996:CD   1.24   1.30   0.89   1.25   0.95
NO_WORD   0.28   0.17   0.82   0.09   0.57

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  1.11 Alignment.score
 1.00  0.38 Alignment.isGood
-1.00  0.59 Alignment.isBad
-0.10  1.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.80 Alignment.txtSpan
-3.00  1.00 Date.dateHeadMismatch : 1993 vs. 1992
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 Structure.relMismatch : text "1992" is prep_since of "made" while hyp "1993" is prep_in of "made" which aligned to text "made"
Hand-tuned score (dot product of above): -3.0194
Threshold: -9.4738


Inference ID: 256

Txt: ITEL has made a loss since 1992. It is now 1996.

Hyp: ITEL made a loss in 1993. (don't know)

ITEL
NNP
made
VBD
a
DT
loss
NN
1993
CD
ITEL:NNP   0.28   1.07   0.96   1.07   1.20
has:VBZ   1.16   0.35   0.76   0.87   1.39
made:VBN   1.16   2.10   0.76   0.76   1.06
a:DT   1.25   1.25   2.42   1.00   1.18
loss:NN   1.07   0.67   0.71   0.28   1.22
1992:CD   1.24   0.91   0.89   1.33   1.60
It:PRP   2.69   0.97   1.10   1.30   1.46
is:VBZ   1.16   0.92   0.76   0.91   1.39
now:RB   1.47   0.91   1.09   1.18   1.36
1996:CD   1.24   1.30   0.89   1.25   0.95
NO_WORD   0.28   0.17   0.82   0.09   0.57

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  1.11 Alignment.score
 1.00  0.38 Alignment.isGood
-1.00  0.59 Alignment.isBad
-0.10  1.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.80 Alignment.txtSpan
-3.00  1.00 Date.dateHeadMismatch : 1993 vs. 1992
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 Structure.relMismatch : text "1992" is prep_since of "made" while hyp "1993" is prep_in of "made" which aligned to text "made"
Hand-tuned score (dot product of above): -3.0194
Threshold: -9.4738


Inference ID: 257

Txt: ITEL has made a loss since 1992. It is now 1996.

Hyp: ITEL made a loss in 1993. (yes)

ITEL
NNP
made
VBD
a
DT
loss
NN
1993
CD
ITEL:NNP   0.28   1.07   0.96   1.07   1.20
has:VBZ   1.16   0.35   0.76   0.87   1.39
made:VBN   1.16   2.10   0.76   0.76   1.06
a:DT   1.25   1.25   2.42   1.00   1.18
loss:NN   1.07   0.67   0.71   0.28   1.22
1992:CD   1.24   0.91   0.89   1.33   1.60
It:PRP   2.69   0.97   1.10   1.30   1.46
is:VBZ   1.16   0.92   0.76   0.91   1.39
now:RB   1.47   0.91   1.09   1.18   1.36
1996:CD   1.24   1.30   0.89   1.25   0.95
NO_WORD   0.28   0.17   0.82   0.09   0.57

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  1.11 Alignment.score
 1.00  0.38 Alignment.isGood
-1.00  0.59 Alignment.isBad
-0.10  1.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.80 Alignment.txtSpan
-3.00  1.00 Date.dateHeadMismatch : 1993 vs. 1992
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 Structure.relMismatch : text "1992" is prep_since of "made" while hyp "1993" is prep_in of "made" which aligned to text "made"
Hand-tuned score (dot product of above): -3.0194
Threshold: -9.4738


Inference ID: 258

Txt: In March 1993 APCOM founded ITEL.

Hyp: ITEL existed in 1992. (don't know)

ITEL
NNP
existed
VBD
1992
CD
March:NNP   0.99   1.07   1.03
1993:CD   1.24   1.10   1.60
APCOM:NNP   1.07   0.82   1.29
founded:VBD   1.16   0.47   1.10
ITEL:NNP   0.28   1.01   1.20
NO_WORD   0.28   0.17   0.57

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.50 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.33 Alignment.txtSpan
-3.00  1.00 Date.dateHeadMismatch : 1992 vs. 1993
-3.00  1.00 NullPunisher.entity : ITEL
-1.00  1.00 NullPunisher.other : existed
-2.00  1.00 RootEntailment.unalignedRoot : "existed" not aligned to anything
Hand-tuned score (dot product of above): -9.0419
Threshold: -9.4738


Inference ID: 259

Txt: The conference started on July 4th, 1994. It lasted 2 days.

Hyp: The conference was over on July 8th, 1994. (yes)

The
DT
conference
NN
was
VBD
over
IN
on
IN
July
NNP
8th
CD
1994
CD
The:DT   2.42   1.00   1.25   1.01   1.05   1.25   1.11   1.18
conference:NN   0.71   0.28   0.82   0.50   0.50   1.12   1.29   1.29
started:VBD   0.76   0.91   0.90   1.15   1.15   1.16   1.39   1.24
July:NNP   0.96   1.12   1.07   0.75   0.75   0.28   1.03   1.03
4th:CD   0.82   1.33   1.30   1.30   1.30   1.07   0.49   0.34
1994:CD   0.89   1.33   1.30   1.30   1.30   1.07   0.35   0.85
It:PRP   1.23   2.50   0.97   1.28   1.29   1.55   1.42   1.46
lasted:VBD   0.76   0.91   0.60   1.15   1.15   1.16   1.39   1.39
2:CD   0.89   1.33   1.30   1.30   1.30   1.24   0.40   0.38
days:NNS   0.71   0.82   0.72   0.50   0.50   1.01   1.29   1.29
NO_WORD   0.82   0.28   0.17   1.16   0.99   0.25   0.52   0.26

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.42 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.12 Alignment.txtSpan
-2.00  1.00 Date.hypDateIns : hypothesis date insertion: July
-1.00  1.00 NullPunisher.other : on
-1.00  1.00 NullPunisher.other : over
-3.00  1.00 NullPunisher.entity : 1994
-3.00  1.00 NullPunisher.entity : July
-0.05  1.00 NullPunisher.aux : was
-1.00  1.00 NullPunisher.other : conference
-0.10  1.00 NullPunisher.article : The
-6.00  1.00 Numeric.mismatch : DATE mismatch: '07/08/1994' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "was" not aligned to anything
Hand-tuned score (dot product of above): -19.8221
Threshold: -9.4738


Inference ID: 260

Txt: Yesterday APCOM signed the contract. Today is Saturday, July 14th.

Hyp: APCOM signed the contract Friday, 13th. (yes)

APCOM
NNP
signed
VBD
the
DT
contract
NN
Friday
NNP
13th
NN
Yesterday:NNP   0.82   0.80   0.71   0.89   1.02   0.78
APCOM:NNP   0.28   0.82   0.71   0.82   1.07   0.82
signed:VBD   0.91   1.32   0.76   0.45   1.16   0.91
the:DT   1.00   1.25   2.42   1.00   1.25   0.96
contract:NN   0.82   0.36   0.71   0.28   1.18   0.89
Today:NNP   0.82   0.82   0.71   0.90   1.05   0.87
is:VBZ   0.91   0.78   0.76   0.91   1.16   0.91
Saturday:NNP   1.07   1.04   0.96   1.11   0.60   0.90
July:NNP   1.07   1.07   0.96   1.16   0.74   0.98
14th:NNP   1.07   1.07   0.93   1.15   0.67   0.07
NO_WORD   0.28   0.17   0.82   0.09   0.09   0.25

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.16 Alignment.score
 1.00  0.19 Alignment.isGood
-1.00  0.79 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Friday" modifying "signed"
-2.00  1.00 Date.hypDateIns : hypothesis date insertion: Friday
-1.00  1.00 NullPunisher.other : contract
-1.00  1.00 NullPunisher.other : 13th
-1.00  1.00 NullPunisher.other : APCOM
-1.00  1.00 NullPunisher.other : signed
-3.00  1.00 NullPunisher.entity : Friday
-0.10  1.00 NullPunisher.article : the
-6.00  1.00 Numeric.mismatch : DATE mismatch: '01/01/1000' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "signed" not aligned to anything
Hand-tuned score (dot product of above): -19.1308
Threshold: -9.4738


Inference ID: 261

Txt: Smith left before Jones left. Jones left before Anderson left.

Hyp: Smith left before Anderson left. (yes)

Smith
NNP
left
VBD
before
IN
Anderson
NNP
left
VBD
Smith:NNP   0.28   1.05   0.75   0.76   1.05
left:VBD   1.14   1.32   1.11   1.16   1.32
before:IN   1.57   1.20   1.59   1.57   1.20
Jones:NNP   0.70   1.07   0.73   0.75   1.07
left:VBD   1.14   1.32   1.11   1.16   1.32
Jones:NNP   0.70   1.07   0.73   0.75   1.07
left:VBD   1.14   1.32   1.11   1.16   1.32
before:IN   1.57   1.20   1.59   1.57   1.20
Anderson:NNP   0.76   1.07   0.75   0.28   1.07
left:VBD   1.14   1.32   1.11   1.16   1.32
NO_WORD   0.28   0.17   0.92   0.28   0.42

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.19 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : Anderson
-1.00  1.00 NullPunisher.other : before
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : left
-1.00  1.00 NullPunisher.other : left
-2.00  1.00 RootEntailment.unalignedRoot : "left" not aligned to anything
Hand-tuned score (dot product of above): -11.8902
Threshold: -9.4738


Inference ID: 262

Txt: Smith left after Jones left. Jones left after Anderson left.

Hyp: Smith left after Anderson left. (yes)

Smith
NNP
left
VBD
after
IN
Anderson
NNP
left
VBD
Smith:NNP   0.28   1.05   0.75   0.76   1.05
left:VBD   1.14   1.32   1.13   1.16   1.32
after:IN   1.57   1.21   1.59   1.52   1.21
Jones:NNP   0.70   1.07   0.71   0.75   1.07
left:VBD   1.14   1.32   1.13   1.16   1.32
Jones:NNP   0.70   1.07   0.71   0.75   1.07
left:VBD   1.14   1.32   1.13   1.16   1.32
after:IN   1.57   1.21   1.59   1.52   1.21
Anderson:NNP   0.76   1.07   0.70   0.28   1.07
left:VBD   1.14   1.32   1.13   1.16   1.32
NO_WORD   0.28   0.17   0.92   0.28   0.42

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.19 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : left
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : after
-3.00  1.00 NullPunisher.entity : Anderson
-1.00  1.00 NullPunisher.other : left
-2.00  1.00 RootEntailment.unalignedRoot : "left" not aligned to anything
Hand-tuned score (dot product of above): -11.8902
Threshold: -9.4738


Inference ID: 263

Txt: Smith was present after Jones left. Jones left after Anderson was present.

Hyp: Smith was present after Anderson was present. (don't know)

Smith
NNP
was
VBD
present
JJ
after
IN
Anderson
NNP
was
VBD
present
JJ
Smith:NNP   0.28   1.07   1.24   0.75   0.76   1.07   1.24
was:VBD   1.16   1.32   1.02   1.15   1.16   1.32   1.02
present:JJ   1.13   0.96   0.74   1.05   1.09   0.96   0.74
after:IN   1.57   1.24   0.77   1.59   1.52   1.24   0.77
Jones:NNP   0.70   1.07   1.24   0.71   0.75   1.07   1.24
left:VBD   1.14   0.90   1.00   1.13   1.16   0.90   1.00
Jones:NNP   0.70   1.07   1.24   0.71   0.75   1.07   1.24
left:VBD   1.14   0.90   1.00   1.13   1.16   0.90   1.00
after:IN   1.57   1.24   0.77   1.59   1.52   1.24   0.77
Anderson:NNP   0.76   1.07   1.20   0.70   0.28   1.07   1.20
was:VBD   1.16   1.32   1.02   1.15   1.16   1.32   1.02
present:JJ   1.13   0.96   0.74   1.05   1.09   0.96   0.74
NO_WORD   0.28   1.43   0.16   0.92   0.28   1.43   0.35

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.53 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : Anderson
-0.05  1.00 NullPunisher.aux : was
-1.00  1.00 NullPunisher.other : after
-0.05  1.00 NullPunisher.aux : was
-1.00  1.00 NullPunisher.other : present
-1.00  1.00 NullPunisher.other : present
-3.00  1.00 NullPunisher.entity : Smith
-2.00  1.00 RootEntailment.unalignedRoot : "present" not aligned to anything
Hand-tuned score (dot product of above): -11.7224
Threshold: -9.4738


Inference ID: 264

Txt: Smith left. Jones left. Smith left before Jones left.

Hyp: Jones left after Smith left. (yes)

Jones
NNP
left
VBD
after
IN
Smith
NNP
left
VBD
Smith:NNP   0.70   1.05   0.75   0.28   1.05
left:VBD   1.16   1.32   1.13   1.14   1.32
Jones:NNP   0.28   1.07   0.71   0.70   1.07
left:VBD   1.16   1.32   1.13   1.14   1.32
Smith:NNP   0.70   1.05   0.75   0.28   1.05
left:VBD   1.16   1.32   1.13   1.14   1.32
before:IN   1.55   1.20   2.47   1.57   1.20
Jones:NNP   0.28   1.07   0.71   0.70   1.07
left:VBD   1.16   1.32   1.13   1.14   1.32
NO_WORD   0.28   0.17   0.92   0.28   0.42

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.19 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : Jones
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : left
-1.00  1.00 NullPunisher.other : left
-1.00  1.00 NullPunisher.other : after
-2.00  1.00 RootEntailment.unalignedRoot : "left" not aligned to anything
Hand-tuned score (dot product of above): -11.8902
Threshold: -9.4738


Inference ID: 265

Txt: Smith left. Jones left. Smith left after Jones left.

Hyp: Jones left before Smith left. (yes)

Jones
NNP
left
VBD
before
IN
Smith
NNP
left
VBD
Smith:NNP   0.70   1.05   0.75   0.28   1.05
left:VBD   1.16   1.32   1.11   1.14   1.32
Jones:NNP   0.28   1.07   0.73   0.70   1.07
left:VBD   1.16   1.32   1.11   1.14   1.32
Smith:NNP   0.70   1.05   0.75   0.28   1.05
left:VBD   1.16   1.32   1.11   1.14   1.32
after:IN   1.52   1.21   2.47   1.57   1.21
Jones:NNP   0.28   1.07   0.73   0.70   1.07
left:VBD   1.16   1.32   1.11   1.14   1.32
NO_WORD   0.28   0.17   0.92   0.28   0.42

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.19 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : left
-1.00  1.00 NullPunisher.other : before
-3.00  1.00 NullPunisher.entity : Jones
-1.00  1.00 NullPunisher.other : left
-2.00  1.00 RootEntailment.unalignedRoot : "left" not aligned to anything
Hand-tuned score (dot product of above): -11.8902
Threshold: -9.4738


Inference ID: 266

Txt: Smith left. Jones left. Jones left before Smith left.

Hyp: Smith left after Jones left. (yes)

Smith
NNP
left
VBD
after
IN
Jones
NNP
left
VBD
Smith:NNP   0.28   1.05   0.75   0.70   1.05
left:VBD   1.14   1.32   1.13   1.16   1.32
Jones:NNP   0.70   1.07   0.71   0.28   1.07
left:VBD   1.14   1.32   1.13   1.16   1.32
Jones:NNP   0.70   1.07   0.71   0.28   1.07
left:VBD   1.14   1.32   1.13   1.16   1.32
before:IN   1.57   1.20   2.47   1.55   1.20
Smith:NNP   0.28   1.05   0.75   0.70   1.05
left:VBD   1.14   1.32   1.13   1.16   1.32
NO_WORD   0.28   0.17   0.92   0.28   0.42

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.19 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : after
-3.00  1.00 NullPunisher.entity : Jones
-1.00  1.00 NullPunisher.other : left
-1.00  1.00 NullPunisher.other : left
-3.00  1.00 NullPunisher.entity : Smith
-2.00  1.00 RootEntailment.unalignedRoot : "left" not aligned to anything
Hand-tuned score (dot product of above): -11.8902
Threshold: -9.4738


Inference ID: 267

Txt: Jones revised the contract. Smith revised the contract. Jones revised the contract before Smith did.

Hyp: Smith revised the contract after Jones did. (yes)

Smith
NNP
revised
VBD
the
DT
contract
NN
after
IN
Jones
NNP
did
VBD
Jones:NNP   0.70   1.07   0.96   1.16   0.71   0.28   1.07
revised:VBD   1.16   1.32   0.76   0.79   1.15   1.16   0.64
the:DT   1.25   1.25   2.42   1.00   1.16   1.25   1.25
contract:NN   1.16   0.70   0.71   0.28   0.50   1.16   0.71
Smith:NNP   0.28   1.07   0.96   1.16   0.75   0.70   1.07
revised:VBD   1.16   1.32   0.76   0.79   1.15   1.16   0.64
the:DT   1.25   1.25   2.42   1.00   1.16   1.25   1.25
contract:NN   1.16   0.70   0.71   0.28   0.50   1.16   0.71
Jones:NNP   0.70   1.07   0.96   1.16   0.71   0.28   1.07
revised:VBD   1.16   1.32   0.76   0.79   1.15   1.16   0.64
the:DT   1.25   1.25   2.42   1.00   1.16   1.25   1.25
contract:NN   1.16   0.70   0.71   0.28   0.50   1.16   0.71
Smith:NNP   0.28   1.07   0.96   1.16   0.75   0.70   1.07
did:VBD   1.16   0.64   0.76   0.80   1.15   1.16   1.32
NO_WORD   0.28   0.17   0.82   0.09   0.92   0.28   0.42

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.27 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-0.10  1.00 NullPunisher.article : the
-3.00  1.00 NullPunisher.entity : Jones
-1.00  1.00 NullPunisher.other : contract
-0.05  1.00 NullPunisher.aux : did
-1.00  1.00 NullPunisher.other : revised
-1.00  1.00 NullPunisher.other : after
-3.00  1.00 NullPunisher.entity : Smith
-2.00  1.00 RootEntailment.unalignedRoot : "revised" not aligned to anything
Hand-tuned score (dot product of above): -12.1404
Threshold: -9.4738


Inference ID: 268

Txt: Jones revised the contract. Smith revised the contract. Jones revised the contract after Smith did.

Hyp: Smith revised the contract before Jones did. (yes)

Smith
NNP
revised
VBD
the
DT
contract
NN
Jones
NNP
did
VBD
Jones:NNP   0.70   1.07   0.96   1.16   0.28   1.07
revised:VBD   1.16   1.32   0.76   0.79   1.16   0.64
the:DT   1.25   1.25   2.42   1.00   1.25   1.25
contract:NN   1.16   0.70   0.71   0.28   1.16   0.71
Smith:NNP   0.28   1.07   0.96   1.16   0.70   1.07
revised:VBD   1.16   1.32   0.76   0.79   1.16   0.64
the:DT   1.25   1.25   2.42   1.00   1.25   1.25
contract:NN   1.16   0.70   0.71   0.28   1.16   0.71
Jones:NNP   0.70   1.07   0.96   1.16   0.28   1.07
revised:VBD   1.16   1.32   0.76   0.79   1.16   0.64
the:DT   1.25   1.25   2.42   1.00   1.25   1.25
contract:NN   1.16   0.70   0.71   0.28   1.16   0.71
after:IN   1.57   1.24   1.16   1.31   1.52   1.24
Smith:NNP   0.28   1.07   0.96   1.16   0.70   1.07
did:VBD   1.16   0.64   0.76   0.80   1.16   1.32
NO_WORD   0.28   0.17   0.82   0.28   0.10   0.36

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.15 Alignment.score
 1.00  0.19 Alignment.isGood
-1.00  0.79 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Jones" modifying "contract"
-0.10  1.00 NullPunisher.article : the
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : revised
-3.00  1.00 NullPunisher.entity : Jones
-0.05  1.00 NullPunisher.aux : did
-1.00  1.00 NullPunisher.other : contract
-2.00  1.00 RootEntailment.unalignedRoot : "revised" not aligned to anything
Hand-tuned score (dot product of above): -12.1952
Threshold: -9.4738


Inference ID: 269

Txt: Smith swam. Jones swam. Smith swam before Jones swam.

Hyp: Jones swam after Smith swam. (don't know)

Jones
NNP
swam
VB
Smith
NNP
swam
NN
Smith:NNP   0.70   1.05   0.28   1.05
swam:VBP   1.16   2.02   1.14   1.75
Jones:NNP   0.28   1.07   0.70   1.07
swam:VBP   1.16   2.02   1.14   1.75
Smith:NNP   0.70   1.05   0.28   1.05
swam:VB   1.16   1.32   1.14   1.75
Jones:NNP   0.28   1.07   0.70   1.07
swam:NN   1.07   2.77   1.05   0.28
NO_WORD   0.28   0.17   0.05   0.23

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  1.24 Alignment.score
 1.00  0.41 Alignment.isGood
-1.00  0.56 Alignment.isBad
-0.10  0.00 Alignment.nbWordNotAligned
 0.10  4.00 Alignment.hypSpan
 0.10  1.00 Alignment.txtSpan
Hand-tuned score (dot product of above): 1.5946
Threshold: -9.4738


Inference ID: 270

Txt: Smith swam to the shore. Jones swam to the shore. Smith swam to the shore before Jones swam to the shore.

Hyp: Jones swam to the shore after Smith swam to the shore. (yes)

Jones
NNP
swam
VBP
the
DT
shore
NN
after
IN
Smith
NNP
swam
VB
the
DT
shore
NN
Smith:NNP   0.70   1.05   0.96   1.03   0.75   0.28   1.05   0.96   1.03
swam:VB   1.16   2.02   0.76   0.77   1.15   1.14   1.32   0.76   0.77
the:DT   1.25   1.25   2.42   0.94   1.16   1.25   1.25   2.42   0.94
shore:NN   1.04   0.69   0.65   0.28   0.50   1.03   0.69   0.65   0.28
Jones:NNP   0.28   1.07   0.96   1.04   0.71   0.70   1.07   0.96   1.04
swam:VB   1.16   2.02   0.76   0.77   1.15   1.14   1.32   0.76   0.77
the:DT   1.25   1.25   2.42   0.94   1.16   1.25   1.25   2.42   0.94
shore:NN   1.04   0.69   0.65   0.28   0.50   1.03   0.69   0.65   0.28
Smith:NNP   0.70   1.05   0.96   1.03   0.75   0.28   1.05   0.96   1.03
swam:VBP   1.16   1.32   0.76   0.77   1.15   1.14   2.02   0.76   0.77
the:DT   1.25   1.25   2.42   0.94   1.16   1.25   1.25   2.42   0.94
shore:NN   1.04   0.69   0.65   0.28   0.50   1.03   0.69   0.65   0.28
before:IN   1.55   1.24   1.05   1.21   2.47   1.57   1.24   1.05   1.21
Jones:NNP   0.28   1.07   0.96   1.04   0.71   0.70   1.07   0.96   1.04
swam:VB   1.16   2.02   0.76   0.77   1.15   1.14   1.32   0.76   0.77
the:DT   1.25   1.25   2.42   0.94   1.16   1.25   1.25   2.42   0.94
shore:NN   1.04   0.69   0.65   0.28   0.50   1.03   0.69   0.65   0.28
NO_WORD   0.28   0.17   0.82   0.74   0.92   0.28   0.42   0.82   0.74

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.96 Alignment.score
 1.00  0.35 Alignment.isGood
-1.00  0.62 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.44 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "shore" modifying "swam"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "shore" modifying "swam" is dropped on aligned hypothesis word "swam"
-1.00  1.00 NullPunisher.other : shore
-1.00  1.00 NullPunisher.other : after
-1.00  1.00 NullPunisher.other : shore
-0.10  1.00 NullPunisher.article : the
-0.10  1.00 NullPunisher.article : the
Hand-tuned score (dot product of above): -3.2654
Threshold: -9.4738


Inference ID: 271

Txt: Smith was present. Jones was present. Smith was present after Jones was present.

Hyp: Jones was present before Smith was present. (don't know)

Jones
NNP
was
VBD
present
JJ
before
IN
Smith
NNP
was
VBD
present
JJ
Smith:NNP   0.70   1.07   1.24   0.75   0.28   1.07   1.24
was:VBD   1.16   1.32   1.02   1.15   1.16   1.32   1.02
present:JJ   1.13   0.96   0.74   1.03   1.13   0.96   0.74
Jones:NNP   0.28   1.07   1.24   0.73   0.70   1.07   1.24
was:VBD   1.16   1.32   1.02   1.15   1.16   1.32   1.02
present:JJ   1.13   0.96   0.74   1.03   1.13   0.96   0.74
Smith:NNP   0.70   1.07   1.24   0.75   0.28   1.07   1.24
was:VBD   1.16   1.32   1.02   1.15   1.16   1.32   1.02
present:JJ   1.13   0.96   0.74   1.03   1.13   0.96   0.74
after:IN   1.52   1.24   0.77   2.47   1.57   1.24   0.77
Jones:NNP   0.28   1.07   1.24   0.73   0.70   1.07   1.24
was:VBD   1.16   1.32   1.02   1.15   1.16   1.32   1.02
present:JJ   1.13   0.96   0.74   1.03   1.13   0.96   0.74
NO_WORD   0.28   1.43   0.16   0.92   0.28   1.43   0.35

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.53 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-0.05  1.00 NullPunisher.aux : was
-1.00  1.00 NullPunisher.other : present
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : present
-1.00  1.00 NullPunisher.other : before
-3.00  1.00 NullPunisher.entity : Jones
-0.05  1.00 NullPunisher.aux : was
-2.00  1.00 RootEntailment.unalignedRoot : "present" not aligned to anything
Hand-tuned score (dot product of above): -11.7224
Threshold: -9.4738


Inference ID: 272

Txt: Smith was present. Jones was present. Smith was present before Jones was present.

Hyp: Jones was present after Smith was present. (don't know)

Jones
NNP
was
VBD
present
JJ
after
IN
Smith
NNP
was
VBD
present
JJ
Smith:NNP   0.70   1.07   1.24   0.75   0.28   1.07   1.24
was:VBD   1.16   1.32   1.02   1.15   1.16   1.32   1.02
present:JJ   1.13   0.96   0.74   1.05   1.13   0.96   0.74
Jones:NNP   0.28   1.07   1.24   0.71   0.70   1.07   1.24
was:VBD   1.16   1.32   1.02   1.15   1.16   1.32   1.02
present:JJ   1.13   0.96   0.74   1.05   1.13   0.96   0.74
Smith:NNP   0.70   1.07   1.24   0.75   0.28   1.07   1.24
was:VBD   1.16   1.32   1.02   1.15   1.16   1.32   1.02
present:JJ   1.13   0.96   0.74   1.05   1.13   0.96   0.74
before:IN   1.55   1.24   0.75   2.47   1.57   1.24   0.75
Jones:NNP   0.28   1.07   1.24   0.71   0.70   1.07   1.24
was:VBD   1.16   1.32   1.02   1.15   1.16   1.32   1.02
present:JJ   1.13   0.96   0.74   1.05   1.13   0.96   0.74
NO_WORD   0.28   1.43   0.16   0.92   0.28   1.43   0.35

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.53 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : after
-1.00  1.00 NullPunisher.other : present
-1.00  1.00 NullPunisher.other : present
-0.05  1.00 NullPunisher.aux : was
-3.00  1.00 NullPunisher.entity : Jones
-0.05  1.00 NullPunisher.aux : was
-2.00  1.00 RootEntailment.unalignedRoot : "present" not aligned to anything
Hand-tuned score (dot product of above): -11.7224
Threshold: -9.4738


Inference ID: 273

Txt: Smith was writing a report. Jones was writing a report. Smith was writing a report before Jones was writing a report.

Hyp: Jones was writing a report after Smith was writing a report. (don't know)

Jones
NNP
was
VBD
writing
VBG
a
DT
report
NN
after
IN
Smith
NNP
was
VBD
writing
VBG
a
DT
report
NN
Smith:NNP   0.70   1.07   1.04   0.96   1.14   0.75   0.28   1.07   1.04   0.96   1.14
was:VBD   1.16   1.32   0.84   0.76   0.91   1.15   1.16   1.32   0.84   0.76   0.91
writing:VBG   1.16   0.84   1.32   0.76   0.80   1.15   1.13   0.84   1.32   0.76   0.80
a:DT   1.25   1.25   1.25   2.42   1.00   1.12   1.25   1.25   1.25   2.42   1.00
report:NN   1.16   0.82   0.71   0.71   0.28   0.48   1.14   0.82   0.71   0.71   0.28
Jones:NNP   0.28   1.07   1.07   0.96   1.16   0.71   0.70   1.07   1.07   0.96   1.16
was:VBD   1.16   1.32   0.84   0.76   0.91   1.15   1.16   1.32   0.84   0.76   0.91
writing:VBG   1.16   0.84   1.32   0.76   0.80   1.15   1.13   0.84   1.32   0.76   0.80
a:DT   1.25   1.25   1.25   2.42   1.00   1.12   1.25   1.25   1.25   2.42   1.00
report:NN   1.16   0.82   0.71   0.71   0.28   0.48   1.14   0.82   0.71   0.71   0.28
Smith:NNP   0.70   1.07   1.04   0.96   1.14   0.75   0.28   1.07   1.04   0.96   1.14
was:VBD   1.16   1.32   0.84   0.76   0.91   1.15   1.16   1.32   0.84   0.76   0.91
writing:VBG   1.16   0.84   1.32   0.76   0.80   1.15   1.13   0.84   1.32   0.76   0.80
a:DT   1.25   1.25   1.25   2.42   1.00   1.12   1.25   1.25   1.25   2.42   1.00
report:NN   1.16   0.82   0.71   0.71   0.28   0.48   1.14   0.82   0.71   0.71   0.28
before:IN   1.55   1.24   1.24   1.11   1.20   2.47   1.57   1.24   1.24   1.11   1.20
Jones:NNP   0.28   1.07   1.07   0.96   1.16   0.71   0.70   1.07   1.07   0.96   1.16
was:VBD   1.16   1.32   0.84   0.76   0.91   1.15   1.16   1.32   0.84   0.76   0.91
writing:VBG   1.16   0.84   1.32   0.76   0.80   1.15   1.13   0.84   1.32   0.76   0.80
a:DT   1.25   1.25   1.25   2.42   1.00   1.12   1.25   1.25   1.25   2.42   1.00
report:NN   1.16   0.82   0.71   0.71   0.28   0.48   1.14   0.82   0.71   0.71   0.28
NO_WORD   0.28   1.30   0.17   0.82   0.09   0.92   0.28   1.30   0.42   0.82   0.09

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.49 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  11.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-0.10  1.00 NullPunisher.article : a
-0.05  1.00 NullPunisher.aux : was
-1.00  1.00 NullPunisher.other : report
-1.00  1.00 NullPunisher.other : report
-0.05  1.00 NullPunisher.aux : was
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : writing
-3.00  1.00 NullPunisher.entity : Jones
-1.00  1.00 NullPunisher.other : after
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : writing
-2.00  1.00 RootEntailment.unalignedRoot : "writing" not aligned to anything
Hand-tuned score (dot product of above): -14.3875
Threshold: -9.4738


Inference ID: 274

Txt: Smith was writing a report. Jones was writing a report. Smith was writing a report after Jones was writing a report.

Hyp: Jones was writing a report before Smith was writing a report. (don't know)

Jones
NNP
was
VBD
writing
VBG
a
DT
report
NN
before
IN
Smith
NNP
was
VBD
writing
VBG
a
DT
report
NN
Smith:NNP   0.70   1.07   1.04   0.96   1.14   0.75   0.28   1.07   1.04   0.96   1.14
was:VBD   1.16   1.32   0.84   0.76   0.91   1.15   1.16   1.32   0.84   0.76   0.91
writing:VBG   1.16   0.84   1.32   0.76   0.80   1.15   1.13   0.84   1.32   0.76   0.80
a:DT   1.25   1.25   1.25   2.42   1.00   1.11   1.25   1.25   1.25   2.42   1.00
report:NN   1.16   0.82   0.71   0.71   0.28   0.39   1.14   0.82   0.71   0.71   0.28
Jones:NNP   0.28   1.07   1.07   0.96   1.16   0.73   0.70   1.07   1.07   0.96   1.16
was:VBD   1.16   1.32   0.84   0.76   0.91   1.15   1.16   1.32   0.84   0.76   0.91
writing:VBG   1.16   0.84   1.32   0.76   0.80   1.15   1.13   0.84   1.32   0.76   0.80
a:DT   1.25   1.25   1.25   2.42   1.00   1.11   1.25   1.25   1.25   2.42   1.00
report:NN   1.16   0.82   0.71   0.71   0.28   0.39   1.14   0.82   0.71   0.71   0.28
Smith:NNP   0.70   1.07   1.04   0.96   1.14   0.75   0.28   1.07   1.04   0.96   1.14
was:VBD   1.16   1.32   0.84   0.76   0.91   1.15   1.16   1.32   0.84   0.76   0.91
writing:VBG   1.16   0.84   1.32   0.76   0.80   1.15   1.13   0.84   1.32   0.76   0.80
a:DT   1.25   1.25   1.25   2.42   1.00   1.11   1.25   1.25   1.25   2.42   1.00
report:NN   1.16   0.82   0.71   0.71   0.28   0.39   1.14   0.82   0.71   0.71   0.28
after:IN   1.52   1.24   1.24   1.12   1.29   2.47   1.57   1.24   1.24   1.12   1.29
Jones:NNP   0.28   1.07   1.07   0.96   1.16   0.73   0.70   1.07   1.07   0.96   1.16
was:VBD   1.16   1.32   0.84   0.76   0.91   1.15   1.16   1.32   0.84   0.76   0.91
writing:VBG   1.16   0.84   1.32   0.76   0.80   1.15   1.13   0.84   1.32   0.76   0.80
a:DT   1.25   1.25   1.25   2.42   1.00   1.11   1.25   1.25   1.25   2.42   1.00
report:NN   1.16   0.82   0.71   0.71   0.28   0.39   1.14   0.82   0.71   0.71   0.28
NO_WORD   0.28   1.30   0.17   0.82   0.09   0.92   0.28   1.30   0.42   0.82   0.09

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.49 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  11.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : writing
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : before
-1.00  1.00 NullPunisher.other : report
-1.00  1.00 NullPunisher.other : report
-3.00  1.00 NullPunisher.entity : Jones
-1.00  1.00 NullPunisher.other : writing
-3.00  1.00 NullPunisher.entity : Smith
-0.05  1.00 NullPunisher.aux : was
-0.05  1.00 NullPunisher.aux : was
-2.00  1.00 RootEntailment.unalignedRoot : "writing" not aligned to anything
Hand-tuned score (dot product of above): -14.3875
Threshold: -9.4738


Inference ID: 275

Txt: Smith left the meeting before he lost his temper.

Hyp: Smith lost his temper. (don't know)

Smith
NNP
lost
VBD
his
PRP$
temper
NN
Smith:NNP   0.28   1.05   1.84   1.06
left:VBD   1.14   0.41   0.54   0.77
the:DT   1.25   1.25   1.16   0.97
meeting:NN   1.15   0.82   1.58   0.69
before:IN   1.57   1.24   1.57   1.27
he:PRP   2.76   1.21   0.82   1.30
lost:VBD   1.14   1.32   0.51   0.87
his:PRP$   2.76   1.18   1.05   1.30
temper:NN   1.06   0.78   1.58   0.28
NO_WORD   0.28   0.17   0.93   0.09

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.23 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-0.10  1.00 NullPunisher.functionWord : his
-1.00  1.00 NullPunisher.other : temper
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : lost
-2.00  1.00 RootEntailment.unalignedRoot : "lost" not aligned to anything
Hand-tuned score (dot product of above): -7.8399
Threshold: -9.4738


Inference ID: 277

Txt: Smith lived in Birmingham in 1991.

Hyp: Smith lived in Birmingham in 1992. (don't know)

Smith
NNP
lived
VBD
Birmingham
NNP
1992
CD
Smith:NNP   0.28   1.07   0.95   1.20
lived:VBD   1.16   1.32   1.16   1.39
Birmingham:NNP   0.95   1.07   0.28   1.20
1991:CD   1.24   1.17   1.24   1.56
NO_WORD   0.28   0.17   0.07   0.57

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.35 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Birmingham" modifying "lived"
-3.00  1.00 Date.dateHeadMismatch : 1992 vs. 1991
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : lived
-3.00  1.00 NullPunisher.entity : Birmingham
-2.00  1.00 RootEntailment.unalignedRoot : "lived" not aligned to anything
Hand-tuned score (dot product of above): -13.3578
Threshold: -9.4738


Inference ID: 278

Txt: Smith wrote his first novel in 1991.

Hyp: Smith wrote his first novel in 1992. (don't know)

Smith
NNP
wrote
VBD
his
PRP$
first
JJ
novel
NN
1992
CD
Smith:NNP   0.28   1.03   1.84   1.15   1.11   1.20
wrote:VBD   1.12   1.32   0.54   1.23   0.33   1.21
his:PRP$   2.76   1.21   1.05   1.59   1.30   1.46
first:JJ   1.05   1.17   1.25   0.74   1.13   1.04
novel:NN   1.11   0.24   1.58   1.24   0.28   1.22
1991:CD   1.24   0.97   1.32   0.91   1.12   1.56
NO_WORD   0.28   0.17   0.93   0.11   0.09   0.57

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.43 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "first" modifying "novel"
-3.00  1.00 Date.dateHeadMismatch : 1992 vs. 1991
-3.00  1.00 NullPunisher.entity : first
-1.00  1.00 NullPunisher.other : wrote
-1.00  1.00 NullPunisher.other : novel
-3.00  1.00 NullPunisher.entity : Smith
-0.10  1.00 NullPunisher.functionWord : his
-2.00  1.00 RootEntailment.unalignedRoot : "wrote" not aligned to anything
Hand-tuned score (dot product of above): -14.5496
Threshold: -9.4738


Inference ID: 279

Txt: Smith wrote a novel in 1991.

Hyp: Smith wrote it in 1992. (don't know)

Smith
NNP
wrote
VBD
it
PRP
1992
CD
Smith:NNP   0.28   1.03   1.80   1.20
wrote:VBD   1.12   1.32   0.54   1.21
a:DT   1.25   1.25   1.15   1.18
novel:NN   1.11   0.24   1.58   1.22
1991:CD   1.24   0.97   1.32   1.56
NO_WORD   0.28   0.17   0.91   0.57

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.59 Alignment.score
 1.00  0.27 Alignment.isGood
-1.00  0.71 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-3.00  1.00 Date.dateHeadMismatch : 1992 vs. 1991
-1.00  1.00 NullPunisher.other : wrote
-1.00  1.00 NullPunisher.other : it
-3.00  1.00 NullPunisher.entity : Smith
-2.00  1.00 RootEntailment.unalignedRoot : "wrote" not aligned to anything
Hand-tuned score (dot product of above): -10.0219
Threshold: -9.4738


Inference ID: 280

Txt: Smith wrote a novel in 1991.

Hyp: Smith wrote a novel in 1992. (don't know)

Smith
NNP
wrote
VBD
a
DT
novel
NN
1992
CD
Smith:NNP   0.28   1.03   0.96   1.11   1.20
wrote:VBD   1.12   1.32   0.76   0.33   1.21
a:DT   1.25   1.25   2.42   1.00   1.18
novel:NN   1.11   0.24   0.71   0.28   1.22
1991:CD   1.24   0.97   0.89   1.12   1.56
NO_WORD   0.28   0.17   0.82   0.09   0.57

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.47 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.20 Alignment.txtSpan
-3.00  1.00 Date.dateHeadMismatch : 1992 vs. 1991
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : wrote
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : novel
-2.00  1.00 RootEntailment.unalignedRoot : "wrote" not aligned to anything
Hand-tuned score (dot product of above): -10.3910
Threshold: -9.4738


Inference ID: 281

Txt: Smith was running a business in 1991.

Hyp: Smith was running it in 1992. (don't know)

Smith
NNP
was
VBD
running
VBG
it
PRP
1992
CD
Smith:NNP   0.28   1.07   1.07   1.80   1.20
was:VBD   1.16   1.32   0.82   0.54   1.39
running:VBG   1.16   0.74   1.32   0.54   1.02
a:DT   1.25   1.25   1.25   1.15   1.18
business:NN   1.20   0.82   0.77   1.58   1.29
1991:CD   1.24   1.30   1.09   1.32   1.56
NO_WORD   0.28   1.30   0.17   0.91   0.57

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.73 Alignment.score
 1.00  0.30 Alignment.isGood
-1.00  0.68 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.20 Alignment.txtSpan
-3.00  1.00 Date.dateHeadMismatch : 1992 vs. 1991
-1.00  1.00 NullPunisher.other : it
-1.00  1.00 NullPunisher.other : running
-3.00  1.00 NullPunisher.entity : Smith
-0.05  1.00 NullPunisher.aux : was
-2.00  1.00 RootEntailment.unalignedRoot : "running" not aligned to anything
Hand-tuned score (dot product of above): -9.9754
Threshold: -9.4738


Inference ID: 282

Txt: Smith discovered a new species in 1991.

Hyp: Smith discovered it in 1992. (don't know)

Smith
NNP
discovered
VBD
it
PRP
1992
CD
Smith:NNP   0.28   1.07   1.80   1.20
discovered:VBD   1.16   1.32   0.54   1.12
a:DT   1.25   1.25   1.15   1.18
new:JJ   1.13   0.96   1.05   1.13
species:NN   1.05   0.67   1.58   1.29
1991:CD   1.24   1.00   1.32   1.56
NO_WORD   0.28   0.17   0.91   0.57

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.59 Alignment.score
 1.00  0.27 Alignment.isGood
-1.00  0.71 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-3.00  1.00 Date.dateHeadMismatch : 1992 vs. 1991
-1.00  1.00 NullPunisher.other : discovered
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : it
-2.00  1.00 RootEntailment.unalignedRoot : "discovered" not aligned to anything
Hand-tuned score (dot product of above): -10.0219
Threshold: -9.4738


Inference ID: 283

Txt: Smith discovered a new species in 1991.

Hyp: Smith discovered a new species in 1992. (don't know)

Smith
NNP
discovered
VBD
a
DT
new
JJ
species
NN
1992
CD
Smith:NNP   0.28   1.07   0.96   1.24   1.05   1.20
discovered:VBD   1.16   1.32   0.76   1.02   0.76   1.12
a:DT   1.25   1.25   2.42   1.18   1.00   1.18
new:JJ   1.13   0.96   0.76   0.74   0.88   1.13
species:NN   1.05   0.67   0.71   0.99   0.28   1.29
1991:CD   1.24   1.00   0.89   1.00   1.33   1.56
NO_WORD   0.28   0.17   0.82   0.11   0.09   0.57

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.41 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "new" modifying "species"
-3.00  1.00 Date.dateHeadMismatch : 1992 vs. 1991
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : species
-1.00  1.00 NullPunisher.other : new
-1.00  1.00 NullPunisher.other : discovered
-3.00  1.00 NullPunisher.entity : Smith
-2.00  1.00 RootEntailment.unalignedRoot : "discovered" not aligned to anything
Hand-tuned score (dot product of above): -12.5764
Threshold: -9.4738


Inference ID: 284

Txt: Smith wrote a report in two hours. Smith started writing the report at 8 am.

Hyp: Smith had finished writing the report by 11 am. (yes)

Smith
NNP
had
VBD
finished
VBN
writing
VBG
the
DT
report
NN
11
CD
am
RB
Smith:NNP   0.28   1.07   1.06   1.04   0.96   1.14   1.20   1.56
wrote:VBD   1.12   0.75   0.71   0.68   0.76   0.75   1.39   0.91
a:DT   1.25   1.25   1.25   1.25   1.31   1.00   1.18   1.38
report:NN   1.14   0.82   0.77   0.71   0.71   0.28   1.22   1.31
two:CD   1.24   1.30   1.21   1.12   0.82   1.33   0.04   1.27
hours:NNS   1.13   0.82   0.65   0.75   0.71   0.72   1.16   1.31
Smith:NNP   0.28   1.07   1.06   1.04   0.96   1.14   1.20   1.56
started:VBD   1.13   0.84   1.55   0.44   0.76   0.89   1.33   1.11
writing:VBG   1.13   0.75   0.71   1.32   0.76   0.80   1.33   0.97
the:DT   1.25   1.25   1.25   1.25   2.42   1.00   1.18   1.45
report:NN   1.14   0.82   0.77   0.71   0.71   0.28   1.22   1.31
8:CD   1.24   1.30   1.29   1.27   0.89   1.29   0.39   1.24
am:RB   1.47   0.87   0.91   0.78   1.09   1.21   1.27   1.23
NO_WORD   0.28   1.30   0.17   0.16   0.82   0.09   0.81   0.04

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.45 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "am" modifying "11"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "am" modifying "8" is dropped on aligned hypothesis word "11"
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : finished
-0.05  1.00 NullPunisher.aux : had
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : am
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '11.0' vs '8.0'
 1.00  1.00 Quant.equivalent : Replacing the quantifier "a" by an equivalent quantifier "the" preserves truth.
-2.00  1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -12.9719
Threshold: -9.4738


Inference ID: 285

Txt: Smith wrote a report in two hours.

Hyp: Smith spent two hours writing the report. (don't know)

Smith
NNP
spent
VBD
two
CD
hours
NNS
writing
VBG
the
DT
report
NN
Smith:NNP   0.28   1.03   1.20   1.13   1.04   0.96   1.14
wrote:VBD   1.12   0.61   1.30   0.91   0.68   0.76   0.75
a:DT   1.25   1.25   1.18   1.00   1.25   1.31   1.00
report:NN   1.14   0.76   1.29   0.72   0.71   0.71   0.28
two:CD   1.24   0.93   0.85   1.16   1.12   0.82   1.33
hours:NNS   1.13   0.54   1.13   0.28   0.75   0.71   0.72
NO_WORD   0.28   0.17   0.52   0.09   0.16   0.82   0.09

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.21 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.29 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "two" modifying "hours"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "hours" modifying "wrote" is dropped on aligned hypothesis word "writing"
-1.00  1.00 NullPunisher.other : hours
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : spent
-3.00  1.00 NullPunisher.entity : two
-0.10  1.00 NullPunisher.article : the
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs ''
 1.00  1.00 Quant.equivalent : Replacing the quantifier "a" by an equivalent quantifier "the" preserves truth.
-2.00  1.00 RootEntailment.unalignedRoot : "spent" not aligned to anything
Hand-tuned score (dot product of above): -16.3330
Threshold: -9.4738


Inference ID: 286

Txt: Smith wrote a report in two hours.

Hyp: Smith spent more than two hours writing the report. (don't know)

Smith
NNP
spent
VBD
more_than
IN
two
CD
hours
NNS
writing
VBG
the
DT
report
NN
Smith:NNP   0.28   1.03   0.75   1.20   1.13   1.04   0.96   1.14
wrote:VBD   1.12   0.61   1.15   1.30   0.91   0.68   0.76   0.75
a:DT   1.25   1.25   1.05   1.18   1.00   1.25   1.31   1.00
report:NN   1.14   0.76   0.48   1.29   0.72   0.71   0.71   0.28
two:CD   1.24   0.93   1.30   0.85   1.16   1.12   0.82   1.33
hours:NNS   1.13   0.54   0.50   1.13   0.28   0.75   0.71   0.72
NO_WORD   0.28   0.17   0.84   0.52   0.09   0.16   0.82   0.09

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.29 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "two" modifying "hours"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "hours" modifying "wrote" is dropped on aligned hypothesis word "writing"
-1.00  1.00 NullPunisher.other : more_than
-1.00  1.00 NullPunisher.other : hours
-3.00  1.00 NullPunisher.entity : Smith
-3.00  1.00 NullPunisher.entity : two
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : spent
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '>2.0' vs ''
 1.00  1.00 Quant.equivalent : Replacing the quantifier "a" by an equivalent quantifier "the" preserves truth.
-2.00  1.00 RootEntailment.unalignedRoot : "spent" not aligned to anything
Hand-tuned score (dot product of above): -17.3306
Threshold: -9.4738


Inference ID: 287

Txt: Smith wrote a report in two hours.

Hyp: Smith wrote a report in one hour. (don't know)

Smith
NNP
wrote
VBD
a
DT
report
NN
one
CD
hour
NN
Smith:NNP   0.28   1.03   0.96   1.14   1.20   1.12
wrote:VBD   1.12   1.32   0.76   0.75   1.34   0.88
a:DT   1.25   1.25   2.42   1.00   1.18   1.00
report:NN   1.14   0.66   0.71   0.28   1.29   0.77
two:CD   1.24   1.22   0.89   1.33   0.48   1.26
hours:NNS   1.13   0.82   0.71   0.72   1.29   0.99
NO_WORD   0.28   0.17   0.82   0.09   0.52   0.07

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.33 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.33 Alignment.txtSpan
-1.00  1.00 NullPunisher.other : wrote
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : report
-0.10  1.00 NullPunisher.article : a
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '1.0' vs '2.0'
-2.00  1.00 RootEntailment.unalignedRoot : "wrote" not aligned to anything
Hand-tuned score (dot product of above): -13.4655
Threshold: -9.4738


Inference ID: 288

Txt: Smith wrote a report in two hours.

Hyp: Smith wrote a report. (yes)

Smith
NNP
wrote
VBD
a
DT
report
NN
Smith:NNP   0.28   1.03   0.96   1.14
wrote:VBD   1.12   1.32   0.76   0.75
a:DT   1.25   1.25   2.42   1.00
report:NN   1.14   0.66   0.71   0.28
two:CD   1.24   1.22   0.89   1.33
hours:NNS   1.13   0.82   0.71   0.72
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.20 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : Smith
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : report
-1.00  1.00 NullPunisher.other : wrote
-2.00  1.00 RootEntailment.unalignedRoot : "wrote" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738


Inference ID: 289

Txt: Smith discovered a new species in two hours.

Hyp: Smith spent two hours discovering the new species. (don't know)

Smith
NNP
spent
VBD
two
CD
hours
NNS
discovering
VBG
the
DT
new
JJ
species
NN
Smith:NNP   0.28   1.03   1.20   1.13   1.07   0.96   1.24   1.05
discovered:VBD   1.16   0.46   1.10   0.88   0.87   0.76   1.02   0.76
a:DT   1.25   1.25   1.18   1.00   1.25   1.31   1.18   1.00
new:JJ   1.13   0.89   1.11   0.86   0.91   0.76   0.74   0.88
species:NN   1.05   0.74   1.29   0.83   0.79   0.71   0.99   0.28
two:CD   1.24   0.93   0.85   1.16   1.12   0.82   0.98   1.33
hours:NNS   1.13   0.54   1.13   0.28   0.79   0.71   0.97   0.83
NO_WORD   0.28   0.17   0.52   0.09   0.16   0.82   0.11   0.09

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.23 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.12 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "new" modifying "species"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "new" modifying "species" is dropped on aligned hypothesis word "species"
-1.00  1.00 NullPunisher.other : new
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : spent
-3.00  1.00 NullPunisher.entity : two
-1.00  1.00 NullPunisher.other : hours
-0.10  1.00 NullPunisher.article : the
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs ''
 1.00  1.00 Quant.equivalent : Replacing the quantifier "a" by an equivalent quantifier "the" preserves truth.
-2.00  1.00 RootEntailment.unalignedRoot : "spent" not aligned to anything
Hand-tuned score (dot product of above): -17.4325
Threshold: -9.4738


Inference ID: 290

Txt: Smith discovered a new species in two hours.

Hyp: Smith discovered a new species. (yes)

Smith
NNP
discovered
VBD
a
DT
new
JJ
species
NN
Smith:NNP   0.28   1.07   0.96   1.24   1.05
discovered:VBD   1.16   1.32   0.76   1.02   0.76
a:DT   1.25   1.25   2.42   1.18   1.00
new:JJ   1.13   0.96   0.76   0.74   0.88
species:NN   1.05   0.67   0.71   0.99   0.28
two:CD   1.24   1.01   0.89   0.98   1.33
hours:NNS   1.13   0.79   0.71   0.97   0.83
NO_WORD   0.28   0.17   0.82   0.11   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.18 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "new" modifying "species"
-3.00  1.00 NullPunisher.entity : Smith
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : discovered
-1.00  1.00 NullPunisher.other : new
-1.00  1.00 NullPunisher.other : species
-2.00  1.00 RootEntailment.unalignedRoot : "discovered" not aligned to anything
Hand-tuned score (dot product of above): -10.0020
Threshold: -9.4738


Inference ID: 291

Txt: Smith discovered many new species in two hours.

Hyp: Smith spent two hours discovering new species. (yes)

Smith
NNP
spent
VBD
two
CD
hours
NNS
discovering
VBG
new
JJ
species
NNS
Smith:NNP   0.28   1.03   1.20   1.13   1.07   1.24   1.05
discovered:VBD   1.16   0.46   1.10   0.88   0.87   1.02   0.76
many:JJ   1.11   0.93   1.21   0.88   0.96   0.93   0.88
new:JJ   1.13   0.89   1.11   0.86   0.91   0.74   0.88
species:NNS   1.05   0.74   1.29   0.83   0.79   0.99   0.28
two:CD   1.24   0.93   0.85   1.16   1.12   0.98   1.33
hours:NNS   1.13   0.54   1.13   0.28   0.79   0.97   0.83
NO_WORD   0.28   0.17   0.52   0.09   0.16   0.11   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.14 Alignment.score
 1.00  0.19 Alignment.isGood
-1.00  0.79 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.14 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "new" modifying "species"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "new" modifying "species" is dropped on aligned hypothesis word "species"
-3.00  1.00 NullPunisher.entity : Smith
-3.00  1.00 NullPunisher.entity : two
-1.00  1.00 NullPunisher.other : new
-1.00  1.00 NullPunisher.other : spent
-1.00  1.00 NullPunisher.other : hours
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "spent" not aligned to anything
Hand-tuned score (dot product of above): -18.3427
Threshold: -9.4738


Inference ID: 292

Txt: Smith was running his own business in two years.

Hyp: Smith spent two years running his own business. (don't know)

Smith
NNP
spent
VBD
two
CD
years
NNS
running
VBG
his
PRP$
own
JJ
business
NN
Smith:NNP   0.28   1.03   1.20   1.16   1.07   1.84   1.24   1.20
was:VBD   1.16   0.91   1.39   0.85   0.82   0.47   1.02   0.91
running:VBG   1.16   0.46   1.37   0.91   1.32   0.54   1.02   0.86
his:PRP$   2.76   0.97   1.57   1.30   0.97   1.05   1.39   1.30
own:JJ   1.13   0.96   1.06   0.88   0.96   1.05   0.74   0.88
business:NN   1.20   0.72   1.25   0.95   0.77   1.58   0.99   0.28
two:CD   1.24   0.93   0.85   1.07   1.28   1.43   0.93   1.28
years:NNS   1.16   0.58   1.04   0.28   0.82   1.58   0.99   0.95
NO_WORD   0.28   0.17   0.52   0.09   0.16   0.93   0.11   0.09

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.10 Alignment.score
 1.00  0.18 Alignment.isGood
-1.00  0.80 Alignment.isBad
-0.10  8.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "own" modifying "business"
-1.00  1.00 NullPunisher.other : years
-1.00  1.00 NullPunisher.other : own
-3.00  1.00 NullPunisher.entity : Smith
-3.00  1.00 NullPunisher.entity : two
-1.00  1.00 NullPunisher.other : running
-1.00  1.00 NullPunisher.other : spent
-1.00  1.00 NullPunisher.other : business
-0.10  1.00 NullPunisher.functionWord : his
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "spent" not aligned to anything
Hand-tuned score (dot product of above): -21.4173
Threshold: -9.4738


Inference ID: 293

Txt: Smith was running his own business in two years.

Hyp: Smith spent more than two years running his own business. (don't know)

Smith
NNP
spent
VBD
more_than
IN
two
CD
years
NNS
running
VBG
his
PRP$
own
JJ
business
NN
Smith:NNP   0.28   1.03   0.75   1.20   1.16   1.07   1.84   1.24   1.20
was:VBD   1.16   0.91   1.15   1.39   0.85   0.82   0.47   1.02   0.91
running:VBG   1.16   0.46   1.15   1.37   0.91   1.32   0.54   1.02   0.86
his:PRP$   2.76   0.97   1.05   1.57   1.30   0.97   1.05   1.39   1.30
own:JJ   1.13   0.96   1.05   1.06   0.88   0.96   1.05   0.74   0.88
business:NN   1.20   0.72   0.50   1.25   0.95   0.77   1.58   0.99   0.28
two:CD   1.24   0.93   1.30   0.85   1.07   1.28   1.43   0.93   1.28
years:NNS   1.16   0.58   0.50   1.04   0.28   0.82   1.58   0.99   0.95
NO_WORD   0.28   0.17   0.84   0.52   0.09   0.16   0.93   0.11   0.09

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.18 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  9.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "own" modifying "business"
-0.10  1.00 NullPunisher.functionWord : his
-1.00  1.00 NullPunisher.other : business
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : more_than
-1.00  1.00 NullPunisher.other : running
-1.00  1.00 NullPunisher.other : spent
-1.00  1.00 NullPunisher.other : own
-3.00  1.00 NullPunisher.entity : two
-1.00  1.00 NullPunisher.other : years
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '>2.0' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "spent" not aligned to anything
Hand-tuned score (dot product of above): -22.4075
Threshold: -9.4738


Inference ID: 294

Txt: Smith was running his own business in two years.

Hyp: Smith ran his own business. (yes)

Smith
NNP
ran
VBD
his
PRP$
own
JJ
business
NN
Smith:NNP   0.28   1.07   1.84   1.24   1.20
was:VBD   1.16   0.75   0.47   1.02   0.91
running:VBG   1.16   0.65   0.54   1.02   0.86
his:PRP$   2.76   0.97   1.05   1.39   1.30
own:JJ   1.13   0.89   1.05   0.74   0.88
business:NN   1.20   0.77   1.58   0.99   0.28
two:CD   1.24   1.16   1.43   0.93   1.28
years:NNS   1.16   0.68   1.58   0.99   0.95
NO_WORD   0.28   0.17   0.93   0.11   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.52 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.40 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "own" modifying "business"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "own" modifying "business" is dropped on aligned hypothesis word "business"
-0.10  1.00 NullPunisher.functionWord : his
-1.00  1.00 NullPunisher.other : own
Hand-tuned score (dot product of above): -1.4967
Threshold: -9.4738


Inference ID: 295

Txt: In two years Smith owned a chain of businesses.

Hyp: Smith owned a chain of business for two years. (don't know)

Smith
NNP
owned
VBD
a
DT
chain
NN
business
NN
two
CD
years
NNS
two:CD   1.24   1.30   0.89   1.29   1.28   0.85   1.07
years:NNS   1.16   0.71   0.71   0.82   0.95   1.04   0.28
Smith:NNP   0.28   1.07   0.96   1.12   1.20   1.20   1.16
owned:VBD   1.16   1.32   0.76   0.80   0.71   1.39   0.80
a:DT   1.25   1.25   2.42   1.00   1.00   1.18   1.00
chain:NN   1.12   0.71   0.71   0.28   0.58   1.26   0.82
businesses:NNS   1.20   0.75   0.71   0.63   1.10   1.25   0.96
NO_WORD   0.28   0.17   0.82   0.09   0.04   0.52   0.05

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.21 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  3.00 Alignment.hypSpan
 0.10  0.29 Alignment.txtSpan
-1.00  1.00 NullPunisher.other : owned
-1.00  1.00 NullPunisher.other : chain
-3.00  1.00 NullPunisher.entity : Smith
-0.10  1.00 NullPunisher.article : a
-2.00  1.00 RootEntailment.unalignedRoot : "owned" not aligned to anything
Hand-tuned score (dot product of above): -7.5397
Threshold: -9.4738


Inference ID: 296

Txt: In two years Smith owned a chain of businesses.

Hyp: Smith owned a chain of business for more than two years. (don't know)

Smith
NNP
owned
VBD
a
DT
chain
NN
business
NN
more_than
IN
two
CD
years
NNS
two:CD   1.24   1.30   0.89   1.29   1.28   1.30   0.85   1.07
years:NNS   1.16   0.71   0.71   0.82   0.95   0.50   1.04   0.28
Smith:NNP   0.28   1.07   0.96   1.12   1.20   0.75   1.20   1.16
owned:VBD   1.16   1.32   0.76   0.80   0.71   1.15   1.39   0.80
a:DT   1.25   1.25   2.42   1.00   1.00   1.05   1.18   1.00
chain:NN   1.12   0.71   0.71   0.28   0.58   0.50   1.26   0.82
businesses:NNS   1.20   0.75   0.71   0.63   1.10   0.50   1.25   0.96
NO_WORD   0.28   0.17   0.82   0.09   0.04   0.84   0.52   0.05

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.29 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 NullPunisher.other : more_than
-1.00  1.00 NullPunisher.other : owned
-3.00  1.00 NullPunisher.entity : Smith
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : chain
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '>2.0' vs '2.0'
-2.00  1.00 RootEntailment.unalignedRoot : "owned" not aligned to anything
Hand-tuned score (dot product of above): -14.6366
Threshold: -9.4738


Inference ID: 297

Txt: In two years Smith owned a chain of businesses.

Hyp: Smith owned a chain of business. (yes)

Smith
NNP
owned
VBD
a
DT
chain
NN
business
NN
two:CD   1.24   1.30   0.89   1.29   1.28
years:NNS   1.16   0.71   0.71   0.82   0.95
Smith:NNP   0.28   1.07   0.96   1.12   1.20
owned:VBD   1.16   1.32   0.76   0.80   0.71
a:DT   1.25   1.25   2.42   1.00   1.00
chain:NN   1.12   0.71   0.71   0.28   0.58
businesses:NNS   1.20   0.75   0.71   0.63   1.10
NO_WORD   0.28   0.17   0.82   0.09   0.04

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.38 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.20 Alignment.txtSpan
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : chain
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : owned
-2.00  1.00 RootEntailment.unalignedRoot : "owned" not aligned to anything
Hand-tuned score (dot product of above): -7.5171
Threshold: -9.4738


Inference ID: 298

Txt: Smith lived in Birmingham for two years.

Hyp: Smith lived in Birmingham for a year. (yes)

Smith
NNP
lived
VBD
Birmingham
NNP
a
DT
year
NN
Smith:NNP   0.28   1.07   0.95   0.96   1.17
lived:VBD   1.16   1.32   1.16   0.76   0.84
Birmingham:NNP   0.95   1.07   0.28   0.96   1.13
two:CD   1.24   1.00   1.24   0.89   1.31
years:NNS   1.16   0.56   1.12   0.71   0.62
NO_WORD   0.28   0.17   0.07   0.82   0.05

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.25 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.20 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Birmingham" modifying "lived"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "two" modifying "years" is dropped on aligned hypothesis word "year"
-0.10  1.00 NullPunisher.article : a
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : lived
-3.00  1.00 NullPunisher.entity : Birmingham
-2.00  1.00 RootEntailment.unalignedRoot : "lived" not aligned to anything
Hand-tuned score (dot product of above): -10.1892
Threshold: -9.4738


Inference ID: 299

Txt: Smith lived in Birmingham for two years.

Hyp: Smith lived in Birmingham for exactly a year. (don't know)

Smith
NNP
lived
VBD
Birmingham
NNP
for
IN
exactly
RB
a
DT
year
NN
Smith:NNP   0.28   1.07   0.95   0.75   1.56   0.96   1.17
lived:VBD   1.16   1.32   1.16   1.15   1.03   0.76   0.84
Birmingham:NNP   0.95   1.07   0.28   0.75   1.56   0.96   1.13
two:CD   1.24   1.00   1.24   1.30   1.14   0.89   1.31
years:NNS   1.16   0.56   1.12   0.50   1.27   0.71   0.62
NO_WORD   0.28   0.17   0.07   1.16   0.30   0.82   0.25

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.39 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.14 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Birmingham" modifying "lived"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "two" modifying "years" is dropped on aligned hypothesis word "year"
-1.00  1.00 NullPunisher.other : lived
-1.00  1.00 NullPunisher.other : exactly
-3.00  1.00 NullPunisher.entity : Birmingham
-1.00  1.00 NullPunisher.other : for
-0.10  1.00 NullPunisher.article : a
-3.00  1.00 NullPunisher.entity : Smith
-2.00  1.00 RootEntailment.unalignedRoot : "lived" not aligned to anything
Hand-tuned score (dot product of above): -12.2106
Threshold: -9.4738


Inference ID: 300

Txt: Smith lived in Birmingham for two years.

Hyp: Smith lived in Birmingham. (yes)

Smith
NNP
lived
VBD
Birmingham
NNP
Smith:NNP   0.28   1.07   0.95
lived:VBD   1.16   1.32   1.16
Birmingham:NNP   0.95   1.07   0.28
two:CD   1.24   1.00   1.24
years:NNS   1.16   0.56   1.12
NO_WORD   0.28   0.17   0.07

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00 -0.06 Alignment.score
 1.00  0.16 Alignment.isGood
-1.00  0.82 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Birmingham" modifying "lived"
-3.00  1.00 NullPunisher.entity : Smith
-3.00  1.00 NullPunisher.entity : Birmingham
-1.00  1.00 NullPunisher.other : lived
-2.00  1.00 RootEntailment.unalignedRoot : "lived" not aligned to anything
Hand-tuned score (dot product of above): -11.0154
Threshold: -9.4738


Inference ID: 301

Txt: Smith ran his own business for two years.

Hyp: Smith ran his own business for a year. (yes)

Smith
NNP
ran
VBD
his
PRP$
own
JJ
business
NN
a
DT
year
NN
Smith:NNP   0.28   1.07   1.84   1.24   1.20   0.96   1.17
ran:VBD   1.16   1.32   0.54   0.95   0.86   0.76   0.73
his:PRP$   2.76   0.97   1.05   1.39   1.30   1.11   1.30
own:JJ   1.13   0.89   1.05   0.74   0.88   0.76   0.88
business:NN   1.20   0.77   1.58   0.99   0.28   0.71   0.84
two:CD   1.24   1.16   1.43   0.93   1.28   0.89   1.31
years:NNS   1.16   0.68   1.58   0.99   0.95   0.71   0.62
NO_WORD   0.28   0.17   0.93   0.11   0.09   0.82   0.05

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.35 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.14 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "own" modifying "business"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "two" modifying "years" is dropped on aligned hypothesis word "year"
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : business
-1.00  1.00 NullPunisher.other : own
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : ran
-0.10  1.00 NullPunisher.functionWord : his
-2.00  1.00 RootEntailment.unalignedRoot : "ran" not aligned to anything
Hand-tuned score (dot product of above): -9.3601
Threshold: -9.4738


Inference ID: 302

Txt: Smith ran his own business for two years.

Hyp: Smith ran his own business. (yes)

Smith
NNP
ran
VBD
his
PRP$
own
JJ
business
NN
Smith:NNP   0.28   1.07   1.84   1.24   1.20
ran:VBD   1.16   1.32   0.54   0.95   0.86
his:PRP$   2.76   0.97   1.05   1.39   1.30
own:JJ   1.13   0.89   1.05   0.74   0.88
business:NN   1.20   0.77   1.58   0.99   0.28
two:CD   1.24   1.16   1.43   0.93   1.28
years:NNS   1.16   0.68   1.58   0.99   0.95
NO_WORD   0.28   0.17   0.93   0.11   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.21 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "own" modifying "business"
-0.10  1.00 NullPunisher.functionWord : his
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : own
-1.00  1.00 NullPunisher.other : ran
-1.00  1.00 NullPunisher.other : business
-2.00  1.00 RootEntailment.unalignedRoot : "ran" not aligned to anything
Hand-tuned score (dot product of above): -9.9708
Threshold: -9.4738


Inference ID: 303

Txt: Smith wrote a report for two hours.

Hyp: Smith wrote a report for an hour. (yes)

Smith
NNP
wrote
VBD
a
DT
report
NN
an
DT
hour
NN
Smith:NNP   0.28   1.03   0.96   1.14   0.96   1.12
wrote:VBD   1.12   1.32   0.76   0.75   0.76   0.88
a:DT   1.25   1.25   2.42   1.00   1.21   1.00
report:NN   1.14   0.66   0.71   0.28   0.71   0.77
two:CD   1.24   1.22   0.89   1.33   0.89   1.26
hours:NNS   1.13   0.82   0.71   0.72   0.71   0.99
NO_WORD   0.28   0.17   0.82   0.09   0.82   0.05

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.43 Alignment.score
 1.00  0.24 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "two" modifying "hours" is dropped on aligned hypothesis word "hour"
-0.10  1.00 NullPunisher.article : an
-1.00  1.00 NullPunisher.other : report
-1.00  1.00 NullPunisher.other : wrote
-0.10  1.00 NullPunisher.article : a
-3.00  1.00 NullPunisher.entity : Smith
-2.00  1.00 RootEntailment.unalignedRoot : "wrote" not aligned to anything
Hand-tuned score (dot product of above): -7.1453
Threshold: -9.4738


Inference ID: 304

Txt: Smith wrote a report for two hours.

Hyp: Smith wrote a report. (don't know)

Smith
NNP
wrote
VBD
a
DT
report
NN
Smith:NNP   0.28   1.03   0.96   1.14
wrote:VBD   1.12   1.32   0.76   0.75
a:DT   1.25   1.25   2.42   1.00
report:NN   1.14   0.66   0.71   0.28
two:CD   1.24   1.22   0.89   1.33
hours:NNS   1.13   0.82   0.71   0.72
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.20 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : wrote
-0.10  1.00 NullPunisher.article : a
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : report
-2.00  1.00 RootEntailment.unalignedRoot : "wrote" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738


Inference ID: 306

Txt: Smith discovered new species for two years.

Hyp: Smith discovered new species. (yes)

Smith
NNP
discovered
VBD
new
JJ
species
NNS
Smith:NNP   0.28   1.07   1.24   1.05
discovered:VBD   1.16   1.32   1.02   0.76
new:JJ   1.13   0.96   0.74   0.88
species:NNS   1.05   0.67   0.99   0.28
two:CD   1.24   1.01   0.98   1.33
years:NNS   1.16   0.67   0.99   0.68
NO_WORD   0.28   0.17   0.11   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.02 Alignment.score
 1.00  0.17 Alignment.isGood
-1.00  0.81 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "new" modifying "species"
-1.00  1.00 NullPunisher.other : discovered
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : new
-1.00  1.00 NullPunisher.other : species
-2.00  1.00 RootEntailment.unalignedRoot : "discovered" not aligned to anything
Hand-tuned score (dot product of above): -10.0099
Threshold: -9.4738


Inference ID: 307

Txt: In 1994 ITEL sent a progress report every month.

Hyp: ITEL sent a progress report in July 1994. (yes)

ITEL
NNP
sent
VBD
a
DT
progress
NN
report
VB
July
NNP
1994
CD
1994:CD   1.24   1.24   0.89   1.33   1.28   1.07   0.85
ITEL:NNP   0.28   1.07   0.96   1.07   1.07   0.99   1.20
sent:VBD   1.16   1.32   0.76   0.91   0.52   1.16   1.33
a:DT   1.25   1.25   2.42   1.00   1.25   1.25   1.18
progress_report:NN   1.07   0.65   0.71   0.54   1.23   1.07   1.27
every:DT   1.23   1.23   1.31   1.00   1.19   1.23   1.18
month:NN   1.07   0.62   0.71   0.90   0.50   0.96   1.14
NO_WORD   0.28   0.17   0.82   0.28   0.36   0.07   0.52

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.15 Alignment.score
 1.00  0.19 Alignment.isGood
-1.00  0.79 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.14 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1994" modifying "July"
-2.00  1.00 Date.hypDateIns : hypothesis date insertion: July
-3.00  1.00 NullPunisher.entity : ITEL
-3.00  1.00 NullPunisher.entity : July
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : progress
-3.00  1.00 NullPunisher.entity : 1994
-1.00  1.00 NullPunisher.other : sent
-6.00  1.00 Numeric.mismatch : DATE mismatch: '07/01/1994' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "sent" not aligned to anything
Hand-tuned score (dot product of above): -23.0262
Threshold: -9.4738


Inference ID: 311

Txt: Smith had left the house at a quarter past five. Then she took a taxi to the station.

Hyp: Smith left the house before she took a taxi to the station. (yes)

Smith
NNP
left
VBD
the
DT
house
NN
before
IN
she
PRP
took
VBD
a
DT
taxi
NN
the
DT
station
NN
Smith:NNP   0.28   1.05   0.96   1.10   0.75   1.84   1.07   0.96   1.05   0.96   1.03
had:VBD   1.16   0.84   0.76   0.91   1.15   0.54   0.41   0.76   0.87   0.76   0.91
left:VBN   1.14   2.02   0.76   0.91   1.11   0.54   0.14   0.76   0.82   0.76   0.88
the:DT   1.25   1.25   2.42   1.00   1.05   1.01   1.22   1.31   0.96   2.42   1.00
house:NN   1.10   0.82   0.71   0.28   0.48   1.58   0.72   0.71   0.78   0.71   0.79
a:DT   1.25   1.25   1.31   1.00   1.11   1.16   1.25   2.42   1.00   1.31   1.00
quarter:NN   1.15   0.55   0.71   0.95   0.48   1.58   0.65   0.71   0.83   0.71   0.87
five:CD   1.21   1.26   0.86   1.30   1.26   1.29   1.04   0.89   1.28   0.86   1.33
Then:RB   1.47   0.91   0.94   1.21   1.05   0.95   0.86   1.09   1.16   0.94   1.19
she:PRP   2.76   1.21   1.20   1.30   1.05   1.05   0.97   1.11   1.30   1.20   1.30
took:VBD   1.16   0.14   0.73   0.81   1.15   0.54   1.32   0.76   0.83   0.73   0.81
a:DT   1.25   1.25   1.31   1.00   1.11   1.16   1.25   2.42   1.00   1.31   1.00
taxi:NN   1.05   0.73   0.67   0.78   0.50   1.58   0.74   0.71   0.28   0.67   0.21
the:DT   1.25   1.25   2.42   1.00   1.05   1.01   1.22   1.31   0.96   2.42   1.00
station:NN   1.03   0.79   0.71   0.79   0.50   1.58   0.72   0.71   0.21   0.71   0.28
NO_WORD   0.28   0.17   0.82   0.09   0.92   0.54   0.42   0.82   0.09   0.82   0.74

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.70 Alignment.score
 1.00  0.29 Alignment.isGood
-1.00  0.68 Alignment.isBad
-0.10  9.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.18 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "station" modifying "took"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "five" modifying "left" is dropped on aligned hypothesis word "left"
-1.00  1.00 NullPunisher.other : taxi
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : house
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : station
-1.00  1.00 NullPunisher.other : before
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : she
-1.00  1.00 NullPunisher.other : took
Hand-tuned score (dot product of above): -7.1765
Threshold: -9.4738


Inference ID: 312

Txt: ITEL always delivers reports late. In 1993 ITEL delivered reports.

Hyp: ITEL delivered reports late in 1993. (yes)

ITEL
NNP
delivered
VBD
reports
NNS
late
RB
1993
CD
ITEL:NNP   0.28   1.07   1.07   1.51   1.20
always:RB   1.47   0.91   1.20   0.81   1.36
delivers:VBZ   1.16   0.85   0.86   1.08   1.39
reports:NNS   1.07   0.74   0.28   1.31   1.29
late:RB   1.42   0.84   1.21   1.23   1.22
1993:CD   1.24   1.09   1.33   1.16   0.85
ITEL:NNP   0.28   1.07   1.07   1.51   1.20
delivered:VBD   1.16   1.32   0.83   1.04   1.18
reports:NNS   1.07   0.74   0.28   1.31   1.29
NO_WORD   0.28   0.17   0.09   0.04   0.57

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.25 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  3.00 Alignment.hypSpan
 0.10  0.60 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1993" modifying "delivered"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "always" modifying "delivers" is dropped on aligned hypothesis word "delivered"
-2.00  1.00 Date.hypDateIns : hypothesis date insertion: 1993
-1.00  1.00 NullPunisher.other : late
-3.00  1.00 NullPunisher.entity : 1993
-6.00  1.00 Numeric.mismatch : DATE mismatch: '01/01/1993' vs ''
Hand-tuned score (dot product of above): -12.6526
Threshold: -9.4738


Inference ID: 313

Txt: ITEL never delivers reports late. In 1993 ITEL delivered reports.

Hyp: ITEL delivered reports late in 1993. (don't know)

ITEL
NNP
delivered
VBD
reports
NNS
late
RB
1993
CD
ITEL:NNP   0.28   1.07   1.07   1.51   1.20
never:RB   1.45   0.85   1.18   0.83   1.36
delivers:VBZ   1.16   0.85   0.86   1.08   1.39
reports:NNS   1.07   0.74   0.28   1.31   1.29
late:RB   1.42   0.84   1.21   1.23   1.22
1993:CD   1.24   1.09   1.33   1.16   0.85
ITEL:NNP   0.28   1.07   1.07   1.51   1.20
delivered:VBD   1.16   1.32   0.83   1.04   1.18
reports:NNS   1.07   0.74   0.28   1.31   1.29
NO_WORD   0.28   0.17   0.09   0.04   0.57

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.25 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  3.00 Alignment.hypSpan
 0.10  0.60 Alignment.txtSpan
-2.00  1.00 Date.hypDateIns : hypothesis date insertion: 1993
 0.00  1.00 NegPolarity.txtNegWord : "delivers": has child with relation "neg"
 0.00  1.00 NegPolarity.txtNegRoot : "delivers": has child with relation "neg"
-3.00  1.00 NullPunisher.entity : 1993
-1.00  1.00 NullPunisher.other : late
-6.00  1.00 Numeric.mismatch : DATE mismatch: '01/01/1993' vs ''
Hand-tuned score (dot product of above): -12.1526
Threshold: -9.4738


Inference ID: 314

Txt: Smith arrived in Paris on the 5th of May, 1995. Today is the 15th of May, 1995. She is still in Paris.

Hyp: Smith was in Paris on the 7th of May, 1995. (yes)

Smith
NNP
was
VBD
Paris
NNP
the
DT
7th
NN
of
IN
May
NNP
1995
CD
Smith:NNP   0.28   1.07   0.98   0.96   0.98   0.67   1.03   1.20
arrived:VBD   1.16   0.89   1.13   0.76   1.14   1.41   1.16   1.39
Paris:NNP   0.98   1.02   0.28   0.96   1.03   0.67   1.02   1.20
the:DT   1.25   1.25   1.25   2.42   1.18   1.37   1.25   1.18
5th:JJ   0.99   1.22   1.05   0.94   0.17   1.05   0.88   0.87
of:IN   1.48   1.50   1.48   1.37   1.31   1.59   1.31   1.05
May:NNP   1.03   1.00   1.02   0.96   0.72   0.50   0.28   1.03
1995:CD   1.24   1.30   1.24   0.89   1.07   1.05   1.07   0.85
Today:NNP   1.14   0.82   1.13   0.71   1.13   0.75   1.07   1.29
is:VBZ   1.16   0.05   1.13   0.76   1.16   1.41   1.16   1.39
the:DT   1.25   1.25   1.25   2.42   1.18   1.37   1.25   1.18
15th:NN   0.96   1.07   1.03   0.93   0.03   0.50   0.72   0.95
of:IN   1.48   1.50   1.48   1.37   1.31   1.59   1.31   1.05
May:NNP   1.03   1.00   1.02   0.96   0.72   0.50   0.28   1.03
1995:CD   1.24   1.30   1.24   0.89   1.07   1.05   1.07   0.85
She:PRP   1.55   0.97   1.55   0.96   1.55   1.30   1.55   1.46
is:VBZ   1.16   0.05   1.13   0.76   1.16   1.41   1.16   1.39
still:RB   1.38   0.91   1.47   1.09   1.47   1.30   1.47   1.36
Paris:NNP   0.98   1.02   0.28   0.96   1.03   0.67   1.02   1.20
NO_WORD   0.28   0.17   0.07   0.82   0.02   1.16   0.25   0.26

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.23 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.12 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Paris" modifying "was"
-3.00  1.00 Date.dateHeadMismatch : 7th vs. 15th
-3.00  1.00 NullPunisher.entity : 1995
-3.00  1.00 NullPunisher.entity : Smith
-3.00  1.00 NullPunisher.entity : May
-3.00  1.00 NullPunisher.entity : Paris
-0.10  1.00 NullPunisher.article : the
-3.00  1.00 NullPunisher.entity : of
-0.05  1.00 NullPunisher.aux : was
-2.00  1.00 RootEntailment.unalignedRoot : "was" not aligned to anything
Hand-tuned score (dot product of above): -22.0787
Threshold: -9.4738


Inference ID: 315

Txt: When Smith arrived in Katmandu she had been travelling for three days.

Hyp: Smith had been travelling the day before she arrived in Katmandu. (yes)

Smith
NNP
had
VBD
been
VBN
traveling
VBG
the
DT
day
NN
before
IN
she
PRP
arrived
VBD
Katmandu
NNP
When:WRB   1.25   1.22   1.14   1.25   1.21   1.25   1.05   1.07   1.25   1.25
Smith:NNP   0.28   1.07   1.07   1.07   0.96   1.08   0.75   1.84   1.07   0.99
arrived:VBD   1.16   0.84   0.89   0.25   0.76   0.79   1.14   0.54   1.32   1.15
Katmandu:NNP   0.99   1.07   1.07   1.06   0.96   0.99   0.75   1.84   1.06   0.28
she:PRP   2.76   0.97   0.94   0.97   1.20   1.55   1.05   1.05   0.97   1.55
had:VBD   1.16   1.32   0.98   0.94   0.76   1.09   1.15   0.54   0.84   1.16
been:VBN   1.16   0.98   1.32   1.00   0.73   1.16   1.11   0.51   0.89   1.16
traveling:VBG   1.16   0.94   1.00   1.32   0.76   0.98   1.15   0.54   0.25   1.15
three:CD   1.24   1.30   1.28   1.30   0.78   1.15   1.28   1.26   1.14   1.24
days:NNS   1.16   0.78   0.82   0.66   0.71   0.18   0.50   1.58   0.66   1.07
NO_WORD   0.28   1.30   1.30   0.17   0.82   0.09   0.92   0.54   0.42   0.07

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.53 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  9.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.10 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "Katmandu" modifying "arrived"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "three" modifying "days" is dropped on aligned hypothesis word "day"
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : before
-1.00  1.00 NullPunisher.other : arrived
-0.10  1.00 NullPunisher.article : the
-0.05  1.00 NullPunisher.aux : been
-1.00  1.00 NullPunisher.other : traveling
-0.05  1.00 NullPunisher.aux : had
-1.00  1.00 NullPunisher.other : she
-3.00  1.00 NullPunisher.entity : Katmandu
-6.00  1.00 Numeric.mismatch : TIME mismatch: 'D' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "traveling" not aligned to anything
Hand-tuned score (dot product of above): -19.4185
Threshold: -9.4738


Inference ID: 316

Txt: Jones graduated in March and has been employed ever since. Jones has been unemployed in the past.

Hyp: Jones was unemployed at some time before he graduated. (yes)

Jones
NNP
was
VBD
unemployed
VBN
some
DT
time
NN
before
IN
he
PRP
graduated
VBD
Jones:NNP   0.28   1.07   1.07   0.89   1.14   0.73   1.84   1.07
graduated:VBD   1.16   0.68   0.50   0.76   0.85   1.15   0.54   1.32
March:NNP   1.05   1.07   1.07   0.96   1.15   0.75   1.84   1.07
has:VBZ   1.16   0.84   0.56   0.76   0.91   1.15   0.50   0.65
been:VBN   1.14   0.10   0.56   0.76   0.91   1.11   0.54   0.68
employed:VBN   1.15   0.93   2.04   0.76   0.91   1.12   0.54   0.59
ever:RB   1.45   0.91   0.91   1.09   1.21   1.00   1.05   0.91
since:RB   1.43   0.91   0.91   1.02   1.14   1.03   1.05   0.91
Jones:NNP   0.28   1.07   1.07   0.89   1.14   0.73   1.84   1.07
has:VBZ   1.16   0.84   0.56   0.76   0.91   1.15   0.50   0.65
been:VBN   1.14   0.10   0.56   0.76   0.91   1.11   0.54   0.68
unemployed:VBN   1.16   0.56   1.32   0.76   0.89   1.13   0.54   0.50
the:DT   1.25   1.25   1.25   1.28   0.90   1.05   0.80   1.25
past:NN   1.17   0.72   0.82   0.71   0.61   0.50   1.58   0.77
NO_WORD   0.15   1.57   0.17   0.82   0.20   0.92   0.54   0.42

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.81 Alignment.score
 1.00  0.32 Alignment.isGood
-1.00  0.66 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  3.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "time" modifying "unemployed"
-8.00  1.00 Antonym.samePol : Antonyms found: "unemployed" & "employed", text and hypothesis sentences are of the same polarity.
-1.00  1.00 NullPunisher.other : some
-1.00  1.00 NullPunisher.other : before
-1.00  1.00 NullPunisher.other : graduated
-1.00  1.00 NullPunisher.other : time
-1.00  1.00 NullPunisher.other : he
Hand-tuned score (dot product of above): -13.7044
Threshold: -9.4738


Inference ID: 317

Txt: Every representative has read this report. No two representatives have read it at the same time. No representative took less than half a day to read the report. There are sixteen representatives.

Hyp: It took the representatives more than a week to read the report. (yes)

It
PRP
took
VBD
the
DT
representatives
NNS
more_than
RBR
a
DT
week
NN
to
TO
read
VB
the
DT
report
NN
Every:DT   1.16   1.25   1.31   1.00   1.45   1.31   0.97   1.31   1.23   1.31   0.94
representative:NN   1.58   0.82   0.71   1.36   1.30   0.71   0.89   0.71   0.82   0.71   0.75
has:VBZ   0.54   0.41   0.76   0.91   1.11   0.76   0.91   0.76   0.70   0.76   0.91
read:VBN   0.54   0.23   0.76   0.91   1.10   0.76   0.69   0.76   2.02   0.76   0.66
this:DT   1.16   1.20   1.21   1.00   1.45   1.31   1.00   1.31   1.25   1.21   1.00
report:NN   1.58   0.82   0.71   0.78   1.29   0.71   0.53   0.71   0.57   0.71   0.28
No:DT   1.11   1.25   1.31   1.00   1.45   1.32   1.00   1.20   1.25   1.31   1.00
two:CD   1.32   0.84   0.82   1.21   1.30   0.89   1.15   0.52   1.27   0.82   1.33
representatives:NNS   1.58   0.82   0.71   0.28   1.31   0.71   0.71   0.71   0.82   0.71   0.78
have:VBP   0.54   0.41   0.73   0.91   1.11   0.76   0.91   0.76   0.73   0.73   0.91
read:VBN   0.54   0.23   0.76   0.91   1.10   0.76   0.69   0.76   2.02   0.76   0.66
it:PRP   0.73   0.97   1.11   1.30   1.05   1.10   1.30   1.11   0.97   1.11   1.30
the:DT   1.16   1.22   2.42   1.00   1.45   1.31   0.96   1.26   1.25   2.42   1.00
same:JJ   1.05   0.96   0.73   0.88   1.01   0.76   0.88   0.76   0.96   0.73   0.88
time:NN   1.58   0.35   0.61   0.98   1.31   0.71   0.82   0.71   0.66   0.61   0.91
No:DT   1.11   1.25   1.31   1.00   1.45   1.32   1.00   1.20   1.25   1.31   1.00
representative:NN   1.58   0.82   0.71   1.36   1.30   0.71   0.89   0.71   0.82   0.71   0.75
took:VBD   0.54   1.32   0.73   0.91   1.11   0.76   0.76   0.68   0.23   0.73   0.91
less:JJR   1.05   0.96   0.76   0.88   1.01   0.76   0.82   0.76   0.90   0.76   0.88
than:IN   1.57   1.18   0.95   1.31   0.06   1.05   1.31   1.05   1.18   0.95   1.31
half:PDT   1.15   0.87   1.31   1.00   1.45   1.21   1.00   1.31   1.25   1.31   1.00
a:DT   1.15   1.25   1.31   1.00   1.45   2.42   1.00   1.31   1.25   1.31   1.00
day:NN   1.84   0.78   0.96   1.25   1.56   0.96   0.69   0.96   0.76   0.96   1.05
to:TO   1.16   1.18   1.26   1.00   1.45   1.31   1.00   2.42   1.25   1.26   1.00
read:VB   0.54   0.23   0.76   0.91   1.10   0.76   0.69   0.76   1.32   0.76   0.66
the:DT   1.16   1.22   2.42   1.00   1.45   1.31   0.96   1.26   1.25   2.42   1.00
report:NN   1.58   0.82   0.71   0.78   1.29   0.71   0.53   0.71   0.57   0.71   0.28
There:EX   1.16   1.23   1.20   1.00   1.45   1.31   0.97   1.31   1.23   1.20   0.98
are:VBP   0.54   0.78   0.68   0.91   1.11   0.76   0.87   0.76   0.74   0.68   0.91
sixteen:NN   1.58   0.67   0.71   0.73   1.28   0.71   0.67   0.71   0.75   0.71   0.82
representatives:NNS   1.58   0.82   0.71   0.28   1.31   0.71   0.71   0.71   0.82   0.71   0.78
NO_WORD   0.54   0.17   0.82   0.09   0.04   0.82   0.09   1.55   0.34   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.76 Alignment.score
 1.00  0.30 Alignment.isGood
-1.00  0.67 Alignment.isBad
-0.10  8.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.18 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "week" modifying "took"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "time" modifying "read" is dropped on aligned hypothesis word "read"
-1.00  1.00 NullPunisher.other : more_than
-0.10  1.00 NullPunisher.article : the
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : report
-1.00  1.00 NullPunisher.other : week
-1.00  1.00 NullPunisher.other : took
-0.10  1.00 NullPunisher.functionWord : to
-0.10  1.00 NullPunisher.article : the
-2.00  1.00 Person.mismatch : person mismatch between It and it
 1.00  1.00 Quant.contract : Valid quantifier weakening: replacing the quantifier "every" by a weaker quantifier "the" preserves truth.
-2.00  1.00 RootEntailment.unalignedRoot : "took" not aligned to anything
Hand-tuned score (dot product of above): -8.1890
Threshold: -9.4738


Inference ID: 318

Txt: While Jones was updating the program, Mary came in and told him about the board meeting. She finished before he did.

Hyp: Mary's story lasted as long as Jones's updating the program. (don't know)

Mary
NNP
story
NN
lasted
VBD
as
RB
long
RB
as
IN
Jones
NNP
's
VBZ
updating
VBG
the
DT
program
NN
While:IN   1.57   1.31   1.22   1.23   1.23   1.05   1.57   1.24   1.24   0.99   1.31
Jones:NNP   0.77   1.13   1.05   1.56   1.49   0.75   0.28   1.07   1.07   0.96   1.20
was:VBD   1.13   0.91   0.67   0.75   1.11   0.79   1.16   0.01   0.71   0.76   0.91
updating:VBG   1.16   0.70   0.50   1.11   0.98   1.15   1.16   0.56   1.32   0.76   0.65
the:DT   1.25   1.00   1.25   1.45   1.45   1.05   1.25   1.25   1.25   2.42   1.00
program:NN   1.16   0.88   0.82   1.31   1.31   0.50   1.20   0.76   0.56   0.71   0.28
Mary:NNP   0.28   1.03   1.07   1.56   1.56   0.75   0.77   1.07   1.07   0.96   1.16
came_in:VBD   1.16   0.86   0.39   1.11   0.81   1.15   1.16   0.55   0.48   0.76   0.91
told:VBD   1.16   0.81   0.72   1.11   1.06   1.15   1.14   0.44   0.63   0.73   0.90
him:PRP   1.55   1.30   0.97   1.05   1.05   1.05   1.55   0.97   0.97   1.11   1.30
the:DT   1.25   1.00   1.25   1.45   1.45   1.05   1.25   1.25   1.25   2.42   1.00
board_meeting:NN   1.07   0.82   0.76   1.31   1.31   0.50   1.07   0.70   0.76   0.71   0.82
She:PRP   2.76   1.60   0.95   1.35   1.05   1.35   1.55   0.97   0.97   0.96   1.30
finished:VBD   1.16   0.91   0.42   1.11   1.11   1.15   1.15   0.42   0.58   0.76   0.73
before:IN   1.57   1.25   1.24   1.23   1.23   1.05   1.55   1.24   1.24   1.05   1.29
he:PRP   1.55   1.30   0.97   1.29   1.53   1.29   2.76   1.21   0.97   0.99   1.30
did:VBD   1.16   0.91   0.64   1.11   0.98   1.15   1.16   0.48   0.65   0.76   0.91
NO_WORD   0.12   0.28   0.17   0.04   0.04   0.92   0.28   1.30   0.31   0.82   0.09

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.30 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  11.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "long" modifying "lasted"
-0.05  1.00 NullPunisher.aux : 's
-0.10  1.00 NullPunisher.article : the
-0.10  1.00 NullPunisher.functionWord : as
-1.00  1.00 NullPunisher.other : updating
-1.00  1.00 NullPunisher.other : lasted
-1.00  1.00 NullPunisher.other : story
-1.00  1.00 NullPunisher.other : program
-3.00  1.00 NullPunisher.entity : Jones
-0.10  1.00 NullPunisher.functionWord : as
-1.00  1.00 NullPunisher.other : long
-3.00  1.00 NullPunisher.entity : Mary
-2.00  1.00 RootEntailment.unalignedRoot : "lasted" not aligned to anything
Hand-tuned score (dot product of above): -15.7013
Threshold: -9.4738


Inference ID: 319

Txt: Before APCOM bought its present office building, it had been paying mortgage interest on the previous one for 8 years. Since APCOM bought its present office building it has been paying mortgage interest on it for more than 10 years.

Hyp: APCOM has been paying mortgage interest for a total of 15 years or more. (yes)

APCOM
NNP
has
VBZ
been
VBN
paying
VBG
mortgage
NN
interest
NN
a
DT
total
NN
15
CD
years
NNS
or
CC
more
JJR
Before_APCOM:NNP   0.34   1.07   1.07   1.07   1.05   1.07   0.96   1.07   1.20   1.07   0.96   1.24
bought:VBD   1.16   0.85   0.91   0.56   0.76   0.87   0.76   0.76   1.35   0.81   0.76   1.02
its:PRP$   2.06   0.90   1.17   0.97   1.50   1.48   1.10   1.40   1.46   1.50   1.30   1.69
present:JJ   1.13   0.96   0.90   0.91   0.86   0.77   0.76   0.83   1.11   0.75   0.76   0.93
office_building:NN   1.07   0.82   0.82   0.82   0.61   0.73   0.71   0.82   1.26   0.70   0.71   0.99
it:PRP   2.06   0.97   1.17   0.97   1.50   1.50   1.10   1.40   1.46   1.50   1.30   1.69
had:VBD   1.16   0.17   0.98   0.87   0.91   0.91   0.76   0.91   1.39   0.91   0.76   1.02
been:VBN   1.16   0.98   1.32   0.93   0.91   0.91   0.76   0.91   1.39   0.88   0.76   1.02
paying:VBG   1.14   0.87   0.93   1.32   0.72   0.70   0.76   0.79   1.29   0.91   0.76   1.02
mortgage:NN   1.07   0.82   0.82   0.63   0.28   0.46   0.71   0.83   1.29   0.87   0.71   0.91
interest:NN   1.07   0.82   0.82   0.62   0.53   0.28   0.71   0.92   1.29   0.94   0.71   0.99
the:DT   1.25   1.25   1.22   1.25   1.00   1.00   1.31   1.00   1.18   1.00   1.31   1.15
previous:JJ   1.13   0.96   0.96   0.89   0.88   0.88   0.76   0.47   1.01   0.72   0.76   0.93
one:NN   1.07   0.82   0.78   0.82   0.88   0.92   0.71   0.88   1.29   0.80   0.66   0.89
8:CD   1.24   1.30   1.30   1.30   1.33   1.33   0.89   1.08   0.28   1.33   0.89   1.00
years:NNS   1.07   0.76   0.79   0.82   0.87   0.94   0.71   0.80   1.29   0.28   0.71   0.96
Since:IN   1.57   1.24   1.24   1.22   1.31   1.29   1.05   1.31   1.30   1.31   1.05   0.75
APCOM:NNP   0.28   1.07   1.07   1.05   1.07   1.07   0.96   1.07   1.20   1.07   0.96   1.24
bought:VBD   1.16   0.85   0.91   0.56   0.76   0.87   0.76   0.76   1.35   0.81   0.76   1.02
its:PRP$   2.76   0.90   0.97   0.97   1.30   1.28   1.10   1.30   1.46   1.30   1.10   1.39
present:JJ   1.13   0.96   0.90   0.91   0.86   0.77   0.76   0.83   1.11   0.75   0.76   0.93
office_building:NN   1.07   0.82   0.82   0.82   0.61   0.73   0.71   0.82   1.26   0.70   0.71   0.99
it:PRP   2.76   0.97   0.97   0.97   1.30   1.30   1.10   1.30   1.46   1.30   1.10   1.39
has:VBZ   1.16   1.32   0.98   0.87   0.91   0.91   0.76   0.91   1.39   0.85   0.76   1.02
been:VBN   1.16   0.98   1.32   0.93   0.91   0.91   0.76   0.91   1.39   0.88   0.76   1.02
paying:VBG   1.14   0.87   0.93   1.32   0.72   0.70   0.76   0.79   1.29   0.91   0.76   1.02
mortgage:NN   1.07   0.82   0.82   0.63   0.28   0.46   0.71   0.83   1.29   0.87   0.71   0.91
interest:NN   1.07   0.82   0.82   0.62   0.53   0.28   0.71   0.92   1.29   0.94   0.71   0.99
on:IN   1.57   1.24   1.24   1.24   1.31   1.31   1.05   1.31   1.30   1.31   0.94   0.77
it:PRP   2.76   0.97   0.97   0.97   1.30   1.30   1.10   1.30   1.46   1.30   1.10   1.39
more_than:IN   1.57   1.24   1.24   1.24   1.22   1.30   1.05   1.28   1.30   1.31   1.05   0.40
10:CD   1.24   1.30   1.30   1.22   1.33   1.33   0.89   1.06   0.58   1.29   0.89   1.00
years:NNS   1.07   0.76   0.79   0.82   0.87   0.94   0.71   0.80   1.29   0.28   0.71   0.96
NO_WORD   0.28   1.30   1.30   0.17   0.05   0.09   0.82   0.05   0.52   0.04   0.56   0.03

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.47 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.73 Alignment.isBad
-0.10  9.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.17 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "years" modifying "total"
-0.05  1.00 NullPunisher.aux : has
-1.00  1.00 NullPunisher.other : interest
-1.00  1.00 NullPunisher.other : years
-0.05  1.00 NullPunisher.aux : been
-1.00  1.00 NullPunisher.other : total
-1.00  1.00 NullPunisher.other : or
-1.00  1.00 NullPunisher.other : mortgage
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : paying
-2.00  1.00 RootEntailment.unalignedRoot : "paying" not aligned to anything
Hand-tuned score (dot product of above): -10.0022
Threshold: -9.4738


Inference ID: 320

Txt: When Jones got his job at the CIA, he knew that he would never be allowed to write his memoirs.

Hyp: It is the case that Jones is not and will never be allowed to write his memoirs. (yes)

It
PRP
is
VBZ
the
DT
case
NN
that
IN
Jones
NNP
is
VBZ
not
RB
will
MD
never
RB
be
VB
allowed
VBN
to
TO
write
VB
his
PRP$
memoirs
NNS
When:WRB   1.16   1.25   1.21   1.00   0.99   1.23   1.25   1.45   1.25   1.43   1.25   1.25   1.31   1.23   1.13   1.00
Jones:NNP   1.84   1.07   0.96   1.16   0.75   0.28   1.07   1.56   0.96   1.52   1.07   1.04   0.96   1.07   1.84   1.05
got:VBD   0.50   0.92   0.76   0.91   1.12   1.16   0.92   0.97   1.06   1.11   0.93   0.77   0.71   0.69   0.54   0.91
his:PRP$   0.67   0.85   1.35   1.51   1.01   2.76   0.85   1.29   1.08   1.29   1.21   0.97   1.35   0.97   1.05   1.30
job:NN   1.58   0.82   0.71   0.91   0.50   1.16   0.82   1.23   0.71   1.31   0.82   0.82   0.66   0.75   1.58   0.83
the:DT   1.16   1.25   2.42   0.96   0.95   1.25   1.25   1.45   1.31   1.45   1.21   1.25   1.26   1.25   1.16   1.00
CIA:NNP   1.79   1.03   0.96   1.09   0.72   0.99   1.03   1.56   0.93   1.56   1.07   1.07   0.96   1.07   1.76   1.06
he:PRP   0.71   1.21   0.99   1.54   1.05   2.76   1.21   1.29   1.11   1.29   1.10   0.97   1.35   0.97   0.82   1.30
knew:VBD   0.54   0.84   0.73   0.78   1.15   1.09   0.84   1.08   0.94   1.09   0.84   0.69   0.76   0.43   0.54   0.74
that:IN   1.57   1.24   0.95   1.31   1.59   1.57   1.24   1.20   1.05   1.23   1.24   1.24   1.05   1.21   1.54   1.31
he:PRP   0.71   1.21   0.99   1.54   1.05   2.76   1.21   1.29   1.11   1.29   1.10   0.97   1.35   0.97   0.82   1.30
would:MD   1.16   1.25   1.31   1.00   1.05   1.21   1.25   1.45   1.23   1.45   1.32   1.22   1.31   1.21   1.16   1.00
never:RB   1.05   0.91   1.09   1.19   1.05   1.43   0.91   0.86   1.09   1.23   0.91   0.91   1.20   0.91   1.05   1.18
be:VB   0.54   0.10   0.71   0.91   1.15   1.16   0.10   1.11   1.11   1.11   1.32   0.84   0.76   0.84   0.54   0.91
allowed:VBN   0.54   0.84   0.76   0.77   1.15   1.13   0.84   1.11   0.94   1.11   0.84   1.32   0.76   0.69   0.54   0.87
to:TO   1.16   1.25   1.26   1.00   1.05   1.25   1.25   1.41   1.31   1.56   1.25   1.25   2.42   1.25   1.16   1.00
write:VB   0.51   0.84   0.76   0.88   1.13   1.16   0.84   1.11   0.86   1.11   0.84   0.69   0.76   1.32   0.54   0.78
his:PRP$   0.67   0.85   1.35   1.51   1.01   2.76   0.85   1.29   1.08   1.29   1.21   0.97   1.35   0.97   1.05   1.30
memoirs:NNS   1.58   0.82   0.71   0.83   0.50   1.05   0.82   1.31   0.71   1.27   0.82   0.78   0.71   0.69   1.58   0.28
NO_WORD   0.54   1.43   0.82   0.12   0.99   0.15   0.31   0.24   1.55   0.24   1.57   0.06   1.55   0.09   0.93   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.65 Alignment.score
 1.00  0.28 Alignment.isGood
-1.00  0.69 Alignment.isBad
-0.10  16.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
 0.00  1.00 NegPolarity.hypNegWord : "is": has child with relation "neg"
 0.00  1.00 NegPolarity.hypNegWord : "never": is in NegPolarityMarkers list
 0.00  1.00 NegPolarity.hypNegWord : "allowed": has child with relation "neg"
-1.00  1.00 NullPunisher.other : It
-1.00  1.00 NullPunisher.other : not
-0.10  1.00 NullPunisher.functionWord : to
-3.00  1.00 NullPunisher.entity : Jones
-0.05  1.00 NullPunisher.aux : will
-1.00  1.00 NullPunisher.other : memoirs
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : case
-1.00  1.00 NullPunisher.other : never
-0.05  1.00 NullPunisher.aux : is
-0.05  1.00 NullPunisher.aux : is
-0.10  1.00 NullPunisher.functionWord : his
-0.05  1.00 NullPunisher.aux : be
-0.10  1.00 NullPunisher.functionWord : that
-1.00  1.00 NullPunisher.other : write
-1.00  1.00 NullPunisher.other : allowed
-2.00  1.00 RootEntailment.unalignedRoot : "case" not aligned to anything
Hand-tuned score (dot product of above): -13.9624
Threshold: -9.4738


Inference ID: 321

Txt: Smith has been to Florence twice in the past. Smith will go to Florence twice in the coming year.

Hyp: Two years from now Smith will have been to Florence at least four times. (yes)

Two
CD
years
NNS
now
JJ
Smith
NNP
will
MD
have
VB
been
VBN
Florence
NNP
at
IN
least
JJS
four
CD
times
NNS
Smith:NNP   1.20   1.16   1.24   0.28   0.94   1.07   1.07   0.96   0.75   1.24   1.20   1.13
has:VBZ   1.39   0.85   1.02   1.16   1.05   0.19   0.98   1.16   1.11   0.96   1.39   0.91
been:VBN   1.39   0.88   1.02   1.16   1.11   0.98   1.32   1.16   1.15   0.99   1.39   0.88
Florence:NNP   1.20   1.11   1.24   0.96   0.96   1.07   1.07   0.28   0.75   1.23   1.20   1.09
twice:RB   1.30   1.08   1.05   1.43   1.02   0.89   0.91   1.45   1.05   1.05   0.97   0.84
the:DT   1.11   1.00   1.18   1.25   1.31   1.22   1.22   1.25   1.38   1.18   1.18   0.94
past:NN   1.29   0.46   0.99   1.13   0.71   0.76   0.82   1.11   0.42   0.25   0.55   0.70
Smith:NNP   1.20   1.16   1.24   0.28   0.94   1.07   1.07   0.96   0.75   1.24   1.20   1.13
will:MD   1.18   1.00   1.18   1.23   2.42   1.55   1.60   1.25   1.05   1.18   1.18   0.97
go:VB   1.35   0.91   0.97   1.16   1.07   0.94   0.92   1.16   1.15   1.02   1.39   0.91
Florence:NNP   1.20   1.11   1.24   0.96   0.96   1.07   1.07   0.28   0.75   1.23   1.20   1.09
twice:RB   1.30   1.08   1.05   1.43   1.02   0.89   0.91   1.45   1.05   1.05   0.97   0.84
in:IN   1.30   1.31   0.77   1.57   1.05   1.24   1.24   1.57   2.26   0.77   1.30   1.31
the:DT   1.11   1.00   1.18   1.25   1.31   1.22   1.22   1.25   1.38   1.18   1.18   0.94
coming:JJ   1.13   0.88   0.93   1.07   0.76   0.96   0.96   1.10   1.05   0.93   0.88   0.83
year:NN   1.29   0.62   0.99   1.17   0.71   0.82   0.76   1.12   0.50   0.91   1.15   0.74
NO_WORD   0.52   0.28   0.11   0.15   1.55   1.30   0.17   0.74   0.84   0.29   0.52   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.38 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  11.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.08 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "four" modifying "times"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "coming" modifying "year" is dropped on aligned hypothesis word "years"
 0.00  1.00 NegPolarity.hypNegWord : "least": tag "JJS" is in NegPolarityMarkers list
-0.05  1.00 NullPunisher.aux : have
-1.00  1.00 NullPunisher.other : times
-1.00  1.00 NullPunisher.other : least
-0.05  1.00 NullPunisher.aux : been
-1.00  1.00 NullPunisher.other : at
-1.00  1.00 NullPunisher.other : now
-3.00  1.00 NullPunisher.entity : Two
-0.05  1.00 NullPunisher.aux : will
-3.00  1.00 NullPunisher.entity : four
-3.00  1.00 NullPunisher.entity : Smith
-3.00  1.00 NullPunisher.entity : Florence
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '2.0' vs ''
-6.00  1.00 Numeric.mismatch : NUMBER mismatch: '>=4.0' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "been" not aligned to anything
Hand-tuned score (dot product of above): -31.7802
Threshold: -9.4738


Inference ID: 322

Txt: Last week I already knew that when, in a month's time, Smith would discover that she had been duped she would be furious.

Hyp: It will be the case that in a few weeks Smith will discover that she has been duped; and she will be furious. (yes)

It
PRP
will
MD
be
VB
the
DT
case
NN
that
IN
a
DT
few
JJ
weeks
NNS
Smith
NNP
will
MD
discover
VB
that
IN
she
PRP
has
VBZ
been
VBN
duped
VBN
she
PRP
will
MD
be
VB
furious
JJ
Last:JJ   1.05   0.76   0.96   0.76   0.77   0.99   0.76   0.93   0.88   1.11   0.76   0.96   0.99   1.05   0.86   0.96   0.96   1.05   0.76   0.96   0.93
week:NN   1.58   0.65   0.82   0.67   0.94   0.50   0.71   0.95   1.07   1.15   0.65   0.82   0.50   1.55   0.82   0.71   0.76   1.55   0.65   0.82   0.92
I:PRP   0.62   1.11   0.97   1.11   1.30   1.05   1.14   1.39   1.30   1.55   1.11   0.97   1.05   0.71   0.97   0.97   0.97   0.71   1.11   0.97   1.39
already:RB   1.05   1.09   0.91   1.09   1.21   1.05   1.09   1.05   1.21   1.47   1.09   0.91   1.05   1.05   0.91   0.91   0.88   1.05   1.09   0.91   1.01
knew:VBD   0.54   0.94   0.84   0.73   0.78   1.15   0.76   0.92   0.88   1.16   0.94   0.36   1.15   0.51   0.79   0.78   0.36   0.51   0.94   0.84   0.75
that:IN   1.57   1.05   1.24   0.95   1.31   1.59   1.05   0.77   1.31   1.54   1.05   1.24   1.59   1.62   1.15   1.24   1.24   1.62   1.05   1.24   0.77
when:WRB   1.16   1.25   1.25   1.21   1.00   0.99   1.31   1.15   0.92   1.25   1.25   1.25   0.99   1.07   1.22   1.14   1.23   1.07   1.25   1.25   1.18
a:DT   1.15   1.31   1.25   1.31   1.00   1.05   2.42   1.18   1.00   1.25   1.31   1.25   1.05   1.16   1.25   1.25   1.25   1.16   1.31   1.25   1.18
month:NN   1.58   0.71   0.82   0.71   0.91   0.47   0.71   0.99   0.47   1.10   0.71   0.82   0.47   1.58   0.82   0.82   0.82   1.58   0.71   0.82   0.89
time:NN   1.58   0.65   0.82   0.61   0.50   0.44   0.71   0.99   0.90   1.13   0.65   0.61   0.44   1.55   0.82   0.82   0.78   1.55   0.65   0.82   0.89
Smith:NNP   1.80   0.94   1.07   0.96   1.17   0.73   0.96   1.24   1.15   0.28   0.94   1.07   0.73   1.84   1.07   1.07   1.07   1.84   0.94   1.07   1.24
would:MD   1.16   1.23   1.32   1.31   1.00   1.05   1.31   1.18   0.95   1.25   1.23   1.25   1.05   1.16   1.34   1.25   1.21   1.16   1.23   1.32   1.18
discover:VB   0.54   0.95   0.85   0.76   0.84   1.15   0.76   1.02   0.91   1.16   0.95   1.32   1.15   0.54   0.80   0.85   0.52   0.54   0.95   0.85   1.00
that:IN   1.57   1.05   1.24   0.95   1.31   1.59   1.05   0.77   1.31   1.54   1.05   1.24   1.59   1.62   1.15   1.24   1.24   1.62   1.05   1.24   0.77
she:PRP   0.71   1.41   1.23   1.26   1.26   1.10   1.11   1.69   2.20   1.55   1.41   0.97   1.10   1.05   0.97   1.54   0.97   1.05   1.41   1.23   1.39
had:VBD   0.54   1.05   0.98   0.76   0.87   1.06   0.76   1.02   0.91   1.16   1.05   0.80   1.06   0.54   0.17   0.98   0.66   0.54   1.05   0.98   1.02
been:VBN   0.54   1.11   0.17   0.73   0.91   1.15   0.76   0.99   0.83   1.16   1.11   0.85   1.15   0.51   0.98   1.32   0.67   0.51   1.11   0.17   1.02
duped:VBN   0.54   0.83   0.69   0.76   0.73   1.15   0.76   1.02   0.91   1.16   0.83   0.52   1.15   0.54   0.66   0.67   1.32   0.54   0.83   0.69   0.84
she:PRP   0.71   1.41   1.23   1.26   1.26   1.10   1.11   1.69   2.20   1.55   1.41   0.97   1.10   1.05   0.97   1.54   0.97   1.05   1.41   1.23   1.39
would:MD   1.16   1.23   1.32   1.31   1.00   1.05   1.31   1.18   0.95   1.25   1.23   1.25   1.05   1.16   1.34   1.25   1.21   1.16   1.23   1.32   1.18
be:VB   0.54   1.11   1.32   0.71   0.91   1.15   0.76   0.97   0.91   1.16   1.11   0.85   1.15   0.50   0.98   0.17   0.69   0.50   1.11   1.32   1.02
furious:JJ   1.05   0.76   0.96   0.76   0.86   1.05   0.76   0.93   0.88   1.13   0.76   0.94   1.05   1.05   0.96   0.96   0.78   1.05   0.76   0.96   0.74
NO_WORD   0.54   1.55   1.43   0.82   0.12   0.60   0.82   0.11   0.07   0.28   1.55   0.36   0.60   0.67   1.30   1.57   0.36   0.54   1.55   1.43   0.35

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.81 Alignment.score
 1.00  0.32 Alignment.isGood
-1.00  0.66 Alignment.isBad
-0.10  20.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.05 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "discover" modifying "case"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "Last" modifying "week" is dropped on aligned hypothesis word "weeks"
 0.00  1.00 NegPolarity.hypNegWord : "few": is in NegPolarityMarkers list
-1.00  1.00 NullPunisher.other : discover
-0.05  1.00 NullPunisher.aux : been
-1.00  1.00 NullPunisher.other : case
-0.10  1.00 NullPunisher.article : a
-0.05  1.00 NullPunisher.aux : has
-0.05  1.00 NullPunisher.aux : be
-0.05  1.00 NullPunisher.aux : will
-0.05  1.00 NullPunisher.aux : be
-1.00  1.00 NullPunisher.other : she
-1.00  1.00 NullPunisher.other : few
-1.00  1.00 NullPunisher.other : furious
-0.05  1.00 NullPunisher.aux : will
-0.10  1.00 NullPunisher.functionWord : that
-0.10  1.00 NullPunisher.functionWord : that
-1.00  1.00 NullPunisher.other : It
-1.00  1.00 NullPunisher.other : duped
-1.00  1.00 NullPunisher.other : she
-0.05  1.00 NullPunisher.aux : will
-3.00  1.00 NullPunisher.entity : Smith
-0.10  1.00 NullPunisher.article : the
-2.00  1.00 RootEntailment.unalignedRoot : "case" not aligned to anything
Hand-tuned score (dot product of above): -15.6721
Threshold: -9.4738


Inference ID: 323

Txt: No one gambling seriously stops until he is broke. No one can gamble when he is broke.

Hyp: Everyone who starts gambling seriously stops the moment he is broke. (yes)

Everyone
NN
who
WP
starts
VBZ
gambling
NN
seriously
RB
stops
VBZ
the
DT
moment
NN
he
PRP
is
VBZ
broke
RB
No:DT   1.00   1.12   1.25   1.00   1.45   1.25   1.31   1.00   1.16   1.25   1.45
one:CD   1.31   1.32   1.30   1.33   1.30   1.30   0.82   1.30   1.27   1.30   1.25
gambling:NN   0.79   1.58   0.82   0.28   1.30   0.76   0.71   0.86   1.58   0.82   1.31
seriously:RB   1.18   1.05   0.88   1.21   1.23   0.82   1.09   0.99   1.05   0.91   0.77
stops:VBZ   0.91   0.54   1.56   0.85   1.02   1.32   0.76   0.72   0.54   0.82   1.07
until:IN   1.31   1.57   1.24   1.31   1.23   1.24   1.05   1.31   1.57   1.24   1.23
he:PRP   1.45   0.83   0.97   2.50   1.05   0.97   0.74   1.60   1.05   1.12   1.20
is:VBZ   0.91   0.54   0.90   0.91   1.11   0.82   0.76   0.91   0.54   1.32   1.11
broke:RB   1.16   1.05   0.89   1.21   0.77   0.87   1.09   0.96   1.05   0.91   1.23
No:DT   1.00   1.12   1.25   1.00   1.45   1.25   1.31   1.00   1.16   1.25   1.45
one:NN   0.80   1.58   0.82   0.88   1.31   0.82   0.63   0.78   1.53   0.82   1.25
can:MD   1.00   1.24   1.44   1.00   1.45   1.39   1.31   1.00   1.16   1.55   1.45
gamble:VB   0.91   0.54   0.66   0.10   0.97   0.62   0.76   0.79   0.54   0.74   1.09
when:WRB   1.00   1.07   1.25   1.00   1.45   1.25   1.21   0.95   1.09   1.25   1.45
he:PRP   1.45   0.83   0.97   2.50   1.05   0.97   0.74   1.60   1.05   1.12   1.20
is:VBZ   0.91   0.54   0.90   0.91   1.11   0.82   0.76   0.91   0.54   1.32   1.11
broke:RB   1.16   1.05   0.89   1.21   0.77   0.87   1.09   0.96   1.05   0.91   1.23
NO_WORD   0.28   0.92   0.25   0.09   0.04   0.17   0.82   0.09   0.54   0.25   0.04

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.27 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  11.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "broke" modifying "is"
-1.00  1.00 NullPunisher.other : he
-1.00  1.00 NullPunisher.other : starts
-0.10  1.00 NullPunisher.functionWord : who
-1.00  1.00 NullPunisher.other : broke
-1.00  1.00 NullPunisher.other : Everyone
-1.00  1.00 NullPunisher.other : stops
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : gambling
-1.00  1.00 NullPunisher.other : moment
-1.00  1.00 NullPunisher.other : seriously
-0.05  1.00 NullPunisher.aux : is
-2.00  1.00 RootEntailment.unalignedRoot : "stops" not aligned to anything
Hand-tuned score (dot product of above): -12.6413
Threshold: -9.4738


Inference ID: 324

Txt: No one who starts gambling seriously stops until he is broke.

Hyp: Everyone who starts gambling seriously continues until he is broke. (yes)

Everyone
NN
who
WP
starts
VBZ
gambling
NN
seriously
RB
continues
VBZ
until
IN
he
PRP
is
VBZ
broke
RB
No:DT   1.00   1.12   1.25   1.00   1.45   1.25   1.05   1.16   1.25   1.45
one:NN   0.80   1.58   0.82   0.88   1.31   0.82   0.50   1.53   0.82   1.25
who:WP   1.30   1.05   0.97   1.30   1.05   0.97   1.05   0.83   0.97   1.05
starts:VBZ   0.91   0.54   1.32   0.91   1.08   0.57   1.15   0.54   0.90   1.09
gambling:NN   0.79   1.58   0.82   0.28   1.30   0.69   0.50   1.58   0.82   1.31
seriously:RB   1.18   1.05   0.88   1.21   1.23   0.76   1.05   1.05   0.91   0.77
stops:VBZ   0.91   0.54   1.56   0.85   1.02   1.71   1.15   0.54   0.82   1.07
until:IN   1.31   1.57   1.24   1.31   1.23   1.21   1.59   1.57   1.24   1.23
he:PRP   1.30   0.83   0.97   1.30   1.05   0.97   1.05   1.05   0.97   1.05
is:VBZ   0.91   0.54   0.90   0.91   1.11   0.88   1.15   0.54   1.32   1.11
broke:RB   1.16   1.05   0.89   1.21   0.77   0.91   1.05   1.05   0.91   1.23
NO_WORD   0.28   0.92   0.25   0.09   0.04   0.17   0.92   0.54   0.42   0.04

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.31 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.76 Alignment.isBad
-0.10  10.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "broke" modifying "is"
-1.00  1.00 NullPunisher.other : gambling
-1.00  1.00 NullPunisher.other : seriously
-1.00  1.00 NullPunisher.other : starts
-1.00  1.00 NullPunisher.other : until
-0.05  1.00 NullPunisher.aux : is
-1.00  1.00 NullPunisher.other : he
-1.00  1.00 NullPunisher.other : Everyone
-1.00  1.00 NullPunisher.other : broke
-1.00  1.00 NullPunisher.other : continues
-0.10  1.00 NullPunisher.functionWord : who
-2.00  1.00 RootEntailment.unalignedRoot : "continues" not aligned to anything
Hand-tuned score (dot product of above): -12.3798
Threshold: -9.4738


Inference ID: 325

Txt: Nobody who is asleep ever knows that he is asleep. But some people know that they have been asleep after they have been asleep.

Hyp: Some people discover that they have been asleep. (yes)

Some
DT
people
NNS
discover
VBP
that
IN
they
PRP
have
VBP
been
VBN
asleep
RB
Nobody:NN   0.71   0.92   0.82   0.50   1.58   0.82   0.82   1.31
who:WP   1.11   1.30   0.97   1.01   0.78   0.97   0.97   1.05
is:VBZ   0.76   0.91   0.85   1.15   0.54   0.98   0.10   1.11
asleep:RB   1.05   1.13   0.72   1.05   1.05   0.91   0.87   1.23
ever:RB   1.09   1.21   0.88   1.05   0.99   0.86   0.86   0.86
knows:VBZ   0.73   0.77   0.64   1.15   0.54   0.79   0.84   1.09
that:IN   1.05   1.31   1.24   1.59   1.46   1.18   1.24   1.23
he:PRP   1.31   1.30   0.97   1.05   1.06   0.90   1.17   1.05
is:VBZ   0.76   0.91   0.85   1.15   0.54   0.98   0.10   1.11
asleep:RB   1.05   1.13   0.72   1.05   1.05   0.91   0.87   1.23
some:DT   0.01   0.95   1.22   1.05   1.16   1.20   1.25   1.41
people:NNS   0.66   0.28   0.69   0.50   1.66   0.82   0.82   1.22
know:VBP   0.76   0.51   0.47   1.15   0.54   0.79   0.84   1.03
that:IN   1.05   1.31   1.24   1.59   1.46   1.18   1.24   1.23
they:PRP   1.51   2.58   0.97   0.94   1.05   1.17   1.12   1.05
have:VBP   0.70   0.91   0.80   1.10   0.54   1.32   0.98   1.11
been:VBN   0.76   0.91   0.85   1.15   0.49   0.98   1.32   1.07
asleep:JJ   0.72   0.79   0.76   1.05   1.05   0.96   0.91   2.55
after:IN   1.02   1.31   1.22   1.05   1.54   1.21   1.21   1.17
they:PRP   1.51   2.58   0.97   0.94   1.05   1.17   1.12   1.05
have:VBP   0.70   0.91   0.80   1.10   0.54   1.32   0.98   1.11
been:VBN   0.76   0.91   0.85   1.15   0.49   0.98   1.32   1.07
asleep:RB   1.05   1.13   0.72   1.05   1.05   0.91   0.87   1.23
NO_WORD   0.82   0.28   0.17   0.60   0.54   1.30   0.31   0.36

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.75 Alignment.score
 1.00  0.30 Alignment.isGood
-1.00  0.67 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.12 Alignment.txtSpan
-1.00  1.00 NullPunisher.other : discover
-0.05  1.00 NullPunisher.aux : have
-0.10  1.00 NullPunisher.functionWord : that
-1.00  1.00 NullPunisher.other : people
-0.05  1.00 NullPunisher.aux : been
-1.00  1.00 NullPunisher.other : Some
-1.00  1.00 NullPunisher.other : they
-2.00  1.00 RootEntailment.unalignedRoot : "discover" not aligned to anything
Hand-tuned score (dot product of above): -6.4037
Threshold: -9.4738


Inference ID: 326

Txt: ITEL built MTALK in 1993.

Hyp: ITEL finished MTALK in 1993. (yes)

ITEL
NNP
finished
VBD
MTALK
NNP
1993
CD
ITEL:NNP   0.28   1.07   1.00   1.20
built:VBD   1.14   0.68   0.86   1.27
MTALK:NNP   1.00   0.82   0.28   1.29
1993:CD   1.24   1.16   1.33   0.85
NO_WORD   0.28   0.17   0.09   0.57

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00 -0.15 Alignment.score
 1.00  0.15 Alignment.isGood
-1.00  0.83 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1993" modifying "MTALK"
-2.00  1.00 Date.hypDateIns : hypothesis date insertion: 1993
-3.00  1.00 NullPunisher.entity : 1993
-1.00  1.00 NullPunisher.other : MTALK
-1.00  1.00 NullPunisher.other : finished
-3.00  1.00 NullPunisher.entity : ITEL
-6.00  1.00 Numeric.mismatch : DATE mismatch: '01/01/1993' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -20.2304
Threshold: -9.4738


Inference ID: 327

Txt: ITEL was building MTALK in 1993.

Hyp: ITEL finished MTALK in 1993. (don't know)

ITEL
NNP
finished
VBD
MTALK
NNP
1993
CD
ITEL:NNP   0.28   1.07   1.00   1.20
was:VBD   1.16   0.92   0.91   1.39
building:VBG   1.16   0.79   0.91   1.38
MTALK:NNP   1.00   0.82   0.28   1.29
1993:CD   1.24   1.16   1.33   0.85
NO_WORD   0.28   0.17   0.09   0.57

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00 -0.15 Alignment.score
 1.00  0.15 Alignment.isGood
-1.00  0.83 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1993" modifying "MTALK"
-2.00  1.00 Date.hypDateIns : hypothesis date insertion: 1993
-3.00  1.00 NullPunisher.entity : ITEL
-1.00  1.00 NullPunisher.other : finished
-1.00  1.00 NullPunisher.other : MTALK
-3.00  1.00 NullPunisher.entity : 1993
-6.00  1.00 Numeric.mismatch : DATE mismatch: '01/01/1993' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "finished" not aligned to anything
Hand-tuned score (dot product of above): -20.2304
Threshold: -9.4738


Inference ID: 328

Txt: ITEL won the contract from APCOM in 1993.

Hyp: ITEL won a contract in 1993. (yes)

ITEL
NNP
won
VBD
a
DT
contract
NN
1993
CD
ITEL:NNP   0.28   1.07   0.96   1.07   1.20
won:VBD   1.16   1.32   0.76   0.65   1.18
the:DT   1.22   1.25   1.31   1.00   1.18
contract:NN   1.07   0.56   0.71   0.28   1.19
APCOM:NNP   1.07   0.82   0.71   0.82   1.29
1993:CD   1.24   1.09   0.89   1.23   0.85
NO_WORD   0.28   0.17   0.82   0.09   0.57

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.05 Alignment.score
 1.00  0.18 Alignment.isGood
-1.00  0.80 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1993" modifying "won"
-2.00  1.00 Date.hypDateIns : hypothesis date insertion: 1993
-1.00  1.00 NullPunisher.other : contract
-3.00  1.00 NullPunisher.entity : 1993
-1.00  1.00 NullPunisher.other : won
-3.00  1.00 NullPunisher.entity : ITEL
-0.10  1.00 NullPunisher.article : a
-6.00  1.00 Numeric.mismatch : DATE mismatch: '01/01/1993' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -20.1829
Threshold: -9.4738


Inference ID: 329

Txt: ITEL was winning the contract from APCOM in 1993.

Hyp: ITEL won a contract in 1993. (don't know)

ITEL
NNP
won
VBD
a
DT
contract
NN
1993
CD
ITEL:NNP   0.28   1.07   0.96   1.07   1.20
was:VBD   1.16   0.80   0.76   0.91   1.39
winning:VBG   1.16   0.70   0.76   0.80   1.30
the:DT   1.22   1.25   1.31   1.00   1.18
contract:NN   1.07   0.56   0.71   0.28   1.19
APCOM:NNP   1.07   0.82   0.71   0.82   1.29
1993:CD   1.24   1.09   0.89   1.23   0.85
NO_WORD   0.28   0.17   0.82   0.09   0.57

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.37 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.60 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1993" modifying "won"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "APCOM" modifying "winning" is dropped on aligned hypothesis word "won"
-2.00  1.00 Date.hypDateIns : hypothesis date insertion: 1993
-3.00  1.00 NullPunisher.entity : 1993
-0.10  1.00 NullPunisher.article : a
-6.00  1.00 Numeric.mismatch : DATE mismatch: '01/01/1993' vs ''
 1.00  1.00 Quant.equivalent : Replacing the quantifier "the" by an equivalent quantifier "a" preserves truth.
Hand-tuned score (dot product of above): -10.6849
Threshold: -9.4738


Inference ID: 330

Txt: ITEL owned APCOM from 1988 to 1992.

Hyp: ITEL owned APCOM in 1990. (yes)

ITEL
NNP
owned
VBD
APCOM
NNP
1990
CD
ITEL:NNP   0.28   1.05   1.07   1.20
owned:VBD   1.14   1.32   0.91   1.15
APCOM:NNP   1.07   0.82   0.28   1.29
1988:CD   1.24   1.15   1.33   0.49
1992:CD   1.24   1.12   1.33   1.61
NO_WORD   0.28   0.17   0.09   0.57

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.40 Alignment.score
 1.00  0.23 Alignment.isGood
-1.00  0.74 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.25 Alignment.txtSpan
-3.00  1.00 Date.dateHeadMismatch : 1990 vs. 1992
-3.00  1.00 NullPunisher.entity : ITEL
-1.00  1.00 NullPunisher.other : owned
-1.00  1.00 NullPunisher.other : APCOM
-2.00  1.00 RootEntailment.unalignedRoot : "owned" not aligned to anything
Hand-tuned score (dot product of above): -10.2847
Threshold: -9.4738


Inference ID: 331

Txt: Smith and Jones left the meeting.

Hyp: Smith left the meeting. (yes)

Smith
NNP
left
VBD
the
DT
meeting
NN
Smith:NNP   0.28   1.05   0.96   1.15
Jones:NNP   0.70   1.07   0.96   1.17
left:VBD   1.14   1.32   0.76   0.77
the:DT   1.25   1.25   2.42   1.00
meeting:NN   1.15   0.68   0.71   0.28
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.20 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 NullPunisher.other : left
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : meeting
-0.10  1.00 NullPunisher.article : the
-2.00  1.00 RootEntailment.unalignedRoot : "left" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738


Inference ID: 332

Txt: Smith and Jones left the meeting.

Hyp: Jones left the meeting. (yes)

Jones
NNP
left
VBD
the
DT
meeting
NN
Smith:NNP   0.70   1.05   0.96   1.15
Jones:NNP   0.28   1.07   0.96   1.17
left:VBD   1.16   1.32   0.76   0.77
the:DT   1.25   1.25   2.42   1.00
meeting:NN   1.17   0.68   0.71   0.28
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.20 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  4.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-0.10  1.00 NullPunisher.article : the
-3.00  1.00 NullPunisher.entity : Jones
-1.00  1.00 NullPunisher.other : left
-1.00  1.00 NullPunisher.other : meeting
-2.00  1.00 RootEntailment.unalignedRoot : "left" not aligned to anything
Hand-tuned score (dot product of above): -7.8790
Threshold: -9.4738


Inference ID: 333

Txt: Smith, Anderson and Jones met.

Hyp: There was a group of people that met. (yes)

There
EX
was
VBD
a
DT
group
NN
people
NNS
that
WDT
met
VBD
Smith:NNP   0.96   1.07   0.96   1.25   1.20   0.94   1.02
Anderson:NNP   0.95   1.07   0.96   1.20   1.16   0.96   1.07
Jones:NNP   0.96   1.07   0.96   1.28   1.20   0.96   1.07
met:VBD   0.76   0.78   0.76   0.81   0.91   0.73   1.32
NO_WORD   0.29   0.17   0.82   0.28   0.04   0.41   0.25

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.24 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  7.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "met" modifying "people"
-0.10  1.00 NullPunisher.article : a
-1.00  1.00 NullPunisher.other : group
-0.10  1.00 NullPunisher.functionWord : that
-1.00  1.00 NullPunisher.other : people
-0.10  1.00 NullPunisher.functionWord : There
-1.00  1.00 NullPunisher.other : met
-0.05  1.00 NullPunisher.aux : was
-2.00  1.00 RootEntailment.unalignedRoot : "was" not aligned to anything
Hand-tuned score (dot product of above): -7.3731
Threshold: -9.4738


Inference ID: 334

Txt: Smith knew that ITEL had won the contract in 1992.

Hyp: ITEL won the contract in 1992. (yes)

ITEL
NNP
won
VBD
the
DT
contract
NN
1992
CD
Smith:NNP   0.93   1.07   0.96   1.16   1.20
knew:VBD   1.11   0.69   0.73   0.86   1.16
that:IN   1.57   1.24   0.95   1.27   1.30
ITEL:NNP   0.28   1.07   0.93   1.07   1.20
had:VBD   1.16   0.82   0.76   0.91   1.39
won:VBN   1.16   1.50   0.76   0.65   1.11
the:DT   1.22   1.25   2.42   1.00   1.18
contract:NN   1.07   0.56   0.71   0.28   1.18
1992:CD   1.24   1.02   0.89   1.22   0.85
NO_WORD   0.28   0.17   0.82   0.09   0.57

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.53 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.60 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1992" modifying "won"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "1992" modifying "contract" is dropped on aligned hypothesis word "contract"
-2.00  1.00 Date.hypDateIns : hypothesis date insertion: 1992
-0.10  1.00 NullPunisher.article : the
-3.00  1.00 NullPunisher.entity : 1992
-6.00  1.00 Numeric.mismatch : DATE mismatch: '01/01/1992' vs ''
Hand-tuned score (dot product of above): -11.4634
Threshold: -9.4738


Inference ID: 335

Txt: Smith believed that ITEL had won the contract in 1992.

Hyp: ITEL won the contract in 1992. (don't know)

ITEL
NNP
won
VBD
the
DT
contract
NN
1992
CD
Smith:NNP   0.93   1.07   0.96   1.16   1.20
believed:VBD   1.16   0.67   0.76   0.90   1.10
that:IN   1.57   1.24   0.95   1.27   1.30
ITEL:NNP   0.28   1.07   0.93   1.07   1.20
had:VBD   1.16   0.82   0.76   0.91   1.39
won:VBN   1.16   1.50   0.76   0.65   1.11
the:DT   1.22   1.25   2.42   1.00   1.18
contract:NN   1.07   0.56   0.71   0.28   1.18
1992:CD   1.24   1.02   0.89   1.22   0.85
NO_WORD   0.28   0.17   0.82   0.09   0.57

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.53 Alignment.score
 1.00  0.26 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  2.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.60 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1992" modifying "won"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "1992" modifying "contract" is dropped on aligned hypothesis word "contract"
-2.00  1.00 Date.hypDateIns : hypothesis date insertion: 1992
 0.50  1.00 Factive.unknownPassage : Hyp aligned with txt under non-factive verb (unknown): believed
-0.10  1.00 NullPunisher.article : the
-3.00  1.00 NullPunisher.entity : 1992
-6.00  1.00 Numeric.mismatch : DATE mismatch: '01/01/1992' vs ''
Hand-tuned score (dot product of above): -10.9634
Threshold: -9.4738


Inference ID: 336

Txt: ITEL managed to win the contract in 1992.

Hyp: ITEL won the contract in 1992. (yes)

ITEL
NNP
won
VBD
the
DT
contract
NN
1992
CD
ITEL:NNP   0.28   1.07   0.93   1.07   1.20
managed:VBD   1.16   0.63   0.76   0.82   1.25
to:TO   1.25   1.21   1.26   1.00   1.18
win:VB   1.16   0.74   0.76   0.84   1.22
the:DT   1.22   1.25   2.42   1.00   1.18
contract:NN   1.07   0.56   0.71   0.28   1.18
1992:CD   1.24   1.02   0.89   1.22   0.85
NO_WORD   0.28   0.17   0.82   0.09   0.57

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.25 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.40 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1992" modifying "won"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "1992" modifying "contract" is dropped on aligned hypothesis word "contract"
-2.00  1.00 Date.hypDateIns : hypothesis date insertion: 1992
 1.00  1.00 Factive.positiveStatement : Valid pattern: "managed" X entails X
-3.00  1.00 NullPunisher.entity : ITEL
-0.10  1.00 NullPunisher.article : the
-3.00  1.00 NullPunisher.entity : 1992
-6.00  1.00 Numeric.mismatch : DATE mismatch: '01/01/1992' vs ''
Hand-tuned score (dot product of above): -14.0736
Threshold: -9.4738


Inference ID: 337

Txt: ITEL tried to win the contract in 1992.

Hyp: ITEL won the contract in 1992. (don't know)

ITEL
NNP
won
VBD
the
DT
contract
NN
1992
CD
ITEL:NNP   0.28   1.07   0.93   1.07   1.20
tried:VBD   1.14   0.63   0.70   0.84   1.03
to:TO   1.25   1.21   1.26   1.00   1.18
win:VB   1.16   0.74   0.76   0.84   1.22
the:DT   1.22   1.25   2.42   1.00   1.18
contract:NN   1.07   0.56   0.71   0.28   1.18
1992:CD   1.24   1.02   0.89   1.22   0.85
NO_WORD   0.28   0.17   0.82   0.09   0.57

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.25 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.40 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1992" modifying "won"
 0.50  1.00 Adjunct.dropPosCxt : It is okay that text word "1992" modifying "contract" is dropped on aligned hypothesis word "contract"
-2.00  1.00 Date.hypDateIns : hypothesis date insertion: 1992
 0.50  1.00 Factive.unknownPassage : Hyp aligned with txt under non-factive verb (unknown): tried
-0.10  1.00 NullPunisher.article : the
-3.00  1.00 NullPunisher.entity : ITEL
-3.00  1.00 NullPunisher.entity : 1992
-6.00  1.00 Numeric.mismatch : DATE mismatch: '01/01/1992' vs ''
Hand-tuned score (dot product of above): -14.5736
Threshold: -9.4738


Inference ID: 338

Txt: It is true that ITEL won the contract in 1992.

Hyp: ITEL won the contract in 1992. (yes)

ITEL
NNP
won
VBD
the
DT
contract
NN
1992
CD
It:PRP   1.48   0.97   1.11   1.30   1.46
is:VBZ   1.16   0.88   0.76   0.91   1.39
true:JJ   1.13   0.92   0.67   0.84   1.01
that:IN   1.57   1.24   0.95   1.27   1.30
ITEL:NNP   0.28   1.07   0.93   1.07   1.20
won:VBD   1.16   1.32   0.76   0.65   1.11
the:DT   1.22   1.25   2.42   1.00   1.18
contract:NN   1.07   0.56   0.71   0.28   1.18
1992:CD   1.24   1.02   0.89   1.22   0.85
NO_WORD   0.28   0.17   0.82   0.09   0.57

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.05 Alignment.score
 1.00  0.18 Alignment.isGood
-1.00  0.80 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1992" modifying "won"
-2.00  1.00 Date.hypDateIns : hypothesis date insertion: 1992
-0.10  1.00 NullPunisher.article : the
-3.00  1.00 NullPunisher.entity : ITEL
-1.00  1.00 NullPunisher.other : won
-1.00  1.00 NullPunisher.other : contract
-3.00  1.00 NullPunisher.entity : 1992
-6.00  1.00 Numeric.mismatch : DATE mismatch: '01/01/1992' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -20.1829
Threshold: -9.4738


Inference ID: 339

Txt: It is false that ITEL won the contract in 1992.

Hyp: ITEL won the contract in 1992. (don't know)

ITEL
NNP
won
VBD
the
DT
contract
NN
1992
CD
It:PRP   1.48   0.97   1.11   1.30   1.46
is:VBZ   1.16   0.88   0.76   0.91   1.39
false:JJ   1.13   0.96   0.76   0.88   0.92
that:IN   1.57   1.24   0.95   1.27   1.30
ITEL:NNP   0.28   1.07   0.93   1.07   1.20
won:VBD   1.16   1.32   0.76   0.65   1.11
the:DT   1.22   1.25   2.42   1.00   1.18
contract:NN   1.07   0.56   0.71   0.28   1.18
1992:CD   1.24   1.02   0.89   1.22   0.85
NO_WORD   0.28   0.17   0.82   0.09   0.57

Response: dontknow (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.05 Alignment.score
 1.00  0.18 Alignment.isGood
-1.00  0.80 Alignment.isBad
-0.10  5.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "1992" modifying "won"
-2.00  1.00 Date.hypDateIns : hypothesis date insertion: 1992
-1.00  1.00 NullPunisher.other : won
-1.00  1.00 NullPunisher.other : contract
-3.00  1.00 NullPunisher.entity : ITEL
-0.10  1.00 NullPunisher.article : the
-3.00  1.00 NullPunisher.entity : 1992
-6.00  1.00 Numeric.mismatch : DATE mismatch: '01/01/1992' vs ''
-2.00  1.00 RootEntailment.unalignedRoot : "won" not aligned to anything
Hand-tuned score (dot product of above): -20.1829
Threshold: -9.4738


Inference ID: 340

Txt: Smith saw Jones sign the contract. If Jones signed the contract, his heart was beating.

Hyp: Smith saw Jones' heart beat. (don't know)

Smith
NNP
saw
VBD
Jones
NNP
heart
NN
beat
VBD
Smith:NNP   0.28   1.07   0.70   1.11   1.05
saw:VBD   1.16   1.32   1.16   0.78   0.50
Jones:NNP   0.70   1.07   0.28   1.12   1.07
sign:VB   1.09   0.55   1.16   0.91   0.65
the:DT   1.25   1.25   1.25   1.00   1.25
contract:NN   1.16   0.82   1.16   0.86   0.82
If:IN   1.57   1.24   1.57   1.31   1.24
Jones:NNP   0.70   1.07   0.28   1.12   1.07
signed:VBD   1.14   0.62   1.10   0.91   0.65
the:DT   1.25   1.25   1.25   1.00   1.25
contract:NN   1.16   0.82   1.16   0.86   0.82
his:PRP$   1.55   0.97   2.76   1.30   0.97
heart:NN   1.11   0.69   1.12   0.28   0.69
was:VBD   1.16   0.69   1.16   0.91   0.80
beating:VBG   1.16   0.48   1.16   0.81   0.92
NO_WORD   0.28   0.17   0.12   0.28   0.36

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.27 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.40 Alignment.txtSpan
-3.00  1.00 NullPunisher.entity : Smith
-3.00  1.00 NullPunisher.entity : Jones
-1.00  1.00 NullPunisher.other : saw
-2.00  1.00 RootEntailment.unalignedRoot : "saw" not aligned to anything
Hand-tuned score (dot product of above): -9.3519
Threshold: -9.4738


Inference ID: 341

Txt: Smith saw Jones sign the contract. When Jones signed the contract, his heart was beating.

Hyp: Smith saw Jones' heart beat. (don't know)

Smith
NNP
saw
VBD
Jones
NNP
heart
NN
beat
VBD
Smith:NNP   0.28   1.07   0.70   1.11   1.05
saw:VBD   1.16   1.32   1.16   0.78   0.50
Jones:NNP   0.70   1.07   0.28   1.12   1.07
sign:VB   1.09   0.55   1.16   0.91   0.65
the:DT   1.25   1.25   1.25   1.00   1.25
contract:NN   1.16   0.82   1.16   0.86   0.82
When:WRB   1.25   1.25   1.23   0.97   1.25
Jones:NNP   0.70   1.07   0.28   1.12   1.07
signed:VBD   1.14   0.62   1.10   0.91   0.65
the:DT   1.25   1.25   1.25   1.00   1.25
contract:NN   1.16   0.82   1.16   0.86   0.82
his:PRP$   1.55   0.97   2.76   1.30   0.97
heart:NN   1.11   0.69   1.12   0.28   0.69
was:VBD   1.16   0.69   1.16   0.91   0.80
beating:VBG   1.16   0.48   1.16   0.81   0.92
NO_WORD   0.28   0.17   0.12   0.28   0.36

Response: yes (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.27 Alignment.score
 1.00  0.21 Alignment.isGood
-1.00  0.77 Alignment.isBad
-0.10  3.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.40 Alignment.txtSpan
-1.00  1.00 NullPunisher.other : saw
-3.00  1.00 NullPunisher.entity : Jones
-3.00  1.00 NullPunisher.entity : Smith
-2.00  1.00 RootEntailment.unalignedRoot : "saw" not aligned to anything
Hand-tuned score (dot product of above): -9.3519
Threshold: -9.4738


Inference ID: 342

Txt: Smith saw Jones sign the contract.

Hyp: Jones signed the contract. (yes)

Jones
NNP
signed
VBD
the
DT
contract
NN
Smith:NNP   0.70   1.05   0.96   1.16
saw:VBD   1.16   0.62   0.76   0.91
Jones:NNP   0.28   1.01   0.96   1.16
sign:VB   1.16   0.93   0.76   0.66
the:DT   1.25   1.25   2.42   1.00
contract:NN   1.16   0.36   0.71   0.28
NO_WORD   0.28   0.17   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.67 Alignment.score
 1.00  0.29 Alignment.isGood
-1.00  0.69 Alignment.isBad
-0.10  1.00 Alignment.nbWordNotAligned
 0.10  2.00 Alignment.hypSpan
 0.10  0.75 Alignment.txtSpan
-0.10  1.00 NullPunisher.article : the
Hand-tuned score (dot product of above): 0.3416
Threshold: -9.4738


Inference ID: 343

Txt: Smith saw Jones sign the contract. Jones is the chairman of ITEL.

Hyp: Smith saw the chairman of ITEL sign the contract. (yes)

Smith
NNP
saw
VBD
the
DT
chairman
NN
ITEL
NNP
sign
VB
the
DT
contract
NN
Smith:NNP   0.28   1.07   0.96   0.98   0.93   1.00   0.96   1.16
saw:VBD   1.16   1.32   0.76   0.87   1.16   0.55   0.76   0.91
Jones:NNP   0.70   1.07   0.96   0.99   0.93   1.07   0.96   1.16
sign:VB   1.09   0.55   0.76   0.91   1.16   1.32   0.76   0.66
the:DT   1.25   1.25   2.42   1.00   1.22   1.25   2.42   1.00
contract:NN   1.16   0.82   0.71   0.87   1.07   0.57   0.71   0.28
Jones:NNP   0.70   1.07   0.96   0.99   0.93   1.07   0.96   1.16
is:VBZ   1.16   0.76   0.76   0.91   1.16   0.78   0.76   0.91
the:DT   1.25   1.25   2.42   1.00   1.22   1.25   2.42   1.00
chairman:NN   0.98   0.78   0.71   0.28   1.07   0.82   0.71   0.87
ITEL:NNP   0.93   1.07   0.93   1.07   0.28   1.07   0.93   1.07
NO_WORD   0.28   0.17   0.82   0.28   0.04   0.36   0.82   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.22 Alignment.score
 1.00  0.20 Alignment.isGood
-1.00  0.78 Alignment.isBad
-0.10  8.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "ITEL" modifying "chairman"
-1.00  1.00 NullPunisher.other : saw
-1.00  1.00 NullPunisher.other : chairman
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : contract
-3.00  1.00 NullPunisher.entity : ITEL
-1.00  1.00 NullPunisher.other : sign
-0.10  1.00 NullPunisher.article : the
-3.00  1.00 NullPunisher.entity : Smith
-2.00  1.00 RootEntailment.unalignedRoot : "saw" not aligned to anything
Hand-tuned score (dot product of above): -14.3558
Threshold: -9.4738


Inference ID: 344

Txt: Helen saw the chairman of the department answer the phone. The chairman of the department is a person.

Hyp: There is someone whom Helen saw answer the phone. (yes)

There
EX
is
VBZ
someone
NN
whom
WP
Helen
NNP
saw
VBD
answer
VB
the
DT
phone
NN
Helen:NNP   0.88   1.07   1.13   1.84   0.28   1.07   1.05   0.96   1.05
saw:VBD   0.76   0.76   0.91   0.54   1.16   1.32   0.64   0.76   0.78
the:DT   1.20   1.25   1.00   1.13   1.25   1.25   1.25   2.42   0.94
chairman:NN   0.69   0.82   0.83   1.58   1.10   0.78   0.82   0.71   0.88
the:DT   1.20   1.25   1.00   1.13   1.25   1.25   1.25   2.42   0.94
department:NN   0.71   0.82   1.10   1.58   1.09   0.78   0.77   0.71   0.95
answer:VBP   0.74   0.77   0.91   0.54   1.14   0.64   2.02   0.76   0.75
the:DT   1.20   1.25   1.00   1.13   1.25   1.25   1.25   2.42   0.94
phone:NN   0.62   0.82   0.87   1.51   1.05   0.69   0.66   0.65   0.28
The:DT   1.20   1.25   1.00   1.13   1.25   1.25   1.25   0.01   0.94
chairman:NN   0.69   0.82   0.83   1.58   1.10   0.78   0.82   0.71   0.88
the:DT   1.20   1.25   1.00   1.13   1.25   1.25   1.25   2.42   0.94
department:NN   0.71   0.82   1.10   1.58   1.09   0.78   0.77   0.71   0.95
is:VBZ   0.76   1.32   0.91   0.54   1.16   0.76   0.82   0.76   0.91
a:DT   1.31   1.25   1.00   1.16   1.25   1.25   1.25   1.31   1.00
person:NN   0.69   0.82   0.22   1.58   1.11   0.82   0.61   0.71   0.76
NO_WORD   0.29   0.17   0.28   0.92   0.28   0.25   0.34   0.82   0.09

Response: yes (CORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.50 Alignment.score
 1.00  0.25 Alignment.isGood
-1.00  0.72 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.33 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "saw" modifying "someone"
-1.00  1.00 NullPunisher.other : whom
-0.10  1.00 NullPunisher.functionWord : There
-3.00  1.00 NullPunisher.entity : Helen
-0.05  1.00 NullPunisher.aux : is
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : saw
-2.00  1.00 RootEntailment.unalignedRoot : "is" not aligned to anything
Hand-tuned score (dot product of above): -8.6877
Threshold: -9.4738


Inference ID: 345

Txt: Smith saw Jones sign the contract and his secretary make a copy.

Hyp: Smith saw Jones sign the contract. (yes)

Smith
NNP
saw
VBD
Jones
NNP
sign
VB
the
DT
contract
NN
Smith:NNP   0.28   1.07   0.70   1.00   0.96   1.16
saw:VBD   1.16   1.32   1.16   0.55   0.76   0.91
Jones:NNP   0.70   1.07   0.28   1.07   0.96   1.16
sign:VB   1.09   0.55   1.16   1.32   0.76   0.66
the:DT   1.25   1.25   1.25   1.25   2.42   1.00
contract:NN   1.16   0.82   1.16   0.57   0.71   0.28
his:PRP$   2.76   0.97   1.55   1.18   1.35   1.30
secretary:NN   0.96   0.82   0.97   0.71   0.71   0.88
make:VBP   1.14   0.58   1.14   0.57   0.73   0.90
a:DT   1.25   1.25   1.25   1.25   1.31   1.00
copy:NN   1.11   0.82   1.10   0.78   0.71   0.71
NO_WORD   0.28   0.17   0.28   0.36   0.82   0.09

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.15 Alignment.score
 1.00  0.19 Alignment.isGood
-1.00  0.79 Alignment.isBad
-0.10  6.00 Alignment.nbWordNotAligned
 0.10  0.00 Alignment.hypSpan
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : sign
-1.00  1.00 NullPunisher.other : contract
-0.10  1.00 NullPunisher.article : the
-1.00  1.00 NullPunisher.other : saw
-3.00  1.00 NullPunisher.entity : Jones
-2.00  1.00 RootEntailment.unalignedRoot : "saw" not aligned to anything
Hand-tuned score (dot product of above): -12.1485
Threshold: -9.4738


Inference ID: 346

Txt: Smith saw Jones sign the contract or cross out the crucial clause.

Hyp: Smith either saw Jones sign the contract or saw Jones cross out the crucial clause. (yes)

Smith
NNP
either
RB
saw
VBD
Jones
NNP
sign
VB
the
DT
contract
NN
saw
VBD
Jones
NNP
cross
VB
the
DT
crucial
JJ
clause
NN
Smith:NNP   0.28   1.50   1.07   0.70   1.00   0.96   1.16   1.07   0.70   1.07   0.96   1.24   1.10
saw:VBD   1.16   1.11   1.32   1.16   0.55   0.76   0.91   1.32   1.16   0.60   0.76   1.01   0.91
Jones:NNP   0.70   1.54   1.07   0.28   1.07   0.96   1.16   1.07   0.28   1.03   0.96   1.24   1.09
sign:VB   1.09   1.11   0.55   1.16   1.32   0.76   0.66   0.55   1.16   0.68   0.76   0.76   0.76
the:DT   1.25   1.38   1.25   1.25   1.25   2.42   1.00   1.25   1.25   1.25   2.42   1.18   1.00
contract:NN   1.16   1.31   0.82   1.16   0.57   0.71   0.28   0.82   1.16   0.76   0.71   0.92   0.55
cross_out:VB   1.16   1.11   0.54   1.16   0.62   0.76   0.80   0.54   1.16   1.45   0.76   0.99   0.89
the:DT   1.25   1.38   1.25   1.25   1.25   2.42   1.00   1.25   1.25   1.25   2.42   1.18   1.00
crucial:JJ   1.13   1.01   0.95   1.13   0.70   0.76   0.81   0.95   1.13   0.92   0.76   0.74   0.71
clause:NN   1.10   1.31   0.82   1.09   0.67   0.71   0.55   0.82   1.09   0.75   0.71   0.82   0.28
NO_WORD   0.28   0.04   0.17   0.28   0.36   0.82   0.09   0.80   0.28   0.36   0.82   0.11   0.62

Response: dontknow (INCORRECT)
Justification:
Features matched (wt val name just):
 1.00  0.34 Alignment.score
 1.00  0.22 Alignment.isGood
-1.00  0.75 Alignment.isBad
-0.10  12.00 Alignment.nbWordNotAligned
 0.10  1.00 Alignment.hypSpan
 0.10  0.08 Alignment.txtSpan
-1.00  1.00 Adjunct.addPosCxt : It is not okay that the hypothesis added the word "clause" modifying "cross"
-1.00  1.00 NullPunisher.other : contract
-3.00  1.00 NullPunisher.entity : Jones
-1.00  1.00 NullPunisher.other : saw
-0.10  1.00 NullPunisher.article : the
-3.00  1.00 NullPunisher.entity : Smith
-1.00  1.00 NullPunisher.other : either
-1.00  1.00 NullPunisher.other : crucial
-1.00  1.00 NullPunisher.other : saw
-1.00  1.00 NullPunisher.other : sign
-1.00  1.00 NullPunisher.other : clause
-3.00  1.00 NullPunisher.entity : Jones
-0.10  1.00 NullPunisher.article : the
-2.00  1.00 RootEntailment.unalignedRoot : "saw" not aligned to anything
Hand-tuned score (dot product of above): -20.4803
Threshold: -9.4738


Hand-set weights Accuracy: 161/334 = 0.4820


Word similarity table built on Sun Jul 29 19:17:00 PDT 2007 using command:
java edu.stanford.nlp.rte.WordSimilarityGenerator -info /u/nlp/rte/data/byformat/align/simple/fracas-joined-premises.rte.align.xml -output /u/nlp/rte/data/byformat/wordsim/simple/fracas-joined-premises.rte.wordsim.html