Txt/Hyp term similarities

In each matrix below, the rows are the txt (text) words and the columns are the hyp (hypothesis) words.
Resource summary:
Acronym: AcronymLexicalResource
BasicWN: BasicWNLexicalResource
Country: CountryLexicalResource
Cyc: null
DekangLin: DekangLinLexicalResource
Google: null
InfoMap: InfoMapLexicalResource
NomBank: NomBankLexicalResource
Number: NumberLexicalResource
Ordinal: OrdinalLexicalResource
Preposition: PrepositionLexicalResource
Ravichandran: RavichandranLexicalResource
ResnikWN: ResnikWNLexicalResource
StringSim: StringSimLexicalResource



Inference ID: 1

Txt: Some students came to school by car.

Hyp: Some students came to school. (Yes.)

Some:DT students:NNS came:VBD school:NN .:.
Some:DT   0.00 20.00 20.00 20.00 10.00
students:NNS 20.00   0.00 15.00   0.75 20.00
came:VBD 20.00 15.00   0.00 12.45 17.90
school:NN 20.00   0.75 12.45   0.00 19.99
car:NN 20.00   8.95 13.49   7.06 19.69
.:. 10.00 20.00 17.90 19.99   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00
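
Each matrix row above is a label followed by one value per hyp word. For readers who want to process these rows programmatically, here is a minimal Python sketch. It assumes labels have the form word:TAG (NO_WORD has no tag) and that values are whitespace-separated; parse_matrix_row is a hypothetical helper, not part of the system that produced this log.

```python
def parse_matrix_row(line):
    """Split one matrix row into (word, tag, values).

    Assumes the layout seen in this log: a "word:TAG" label (or the bare
    label NO_WORD) followed by whitespace-separated numbers.
    """
    label, *values = line.split()
    if ":" in label:
        # Split on the last colon so punctuation rows such as ".:." and ",:," work.
        word, tag = label.rsplit(":", 1)
    else:
        word, tag = label, None
    return word, tag, [float(v) for v in values]


row = parse_matrix_row("students:NNS 20.00   0.00 15.00   0.75 20.00")
print(row)
```

Splitting on the last colon rather than the first is a design choice so that punctuation tokens keep their word and tag apart.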

Response: yes (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "car" of "came" dropped on aligned hyp word "came"; Location.mismatch: no clear info of matching: come(X, prep_to); Quant.contract: [some,some]; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 1.0000
Threshold: -3.3437
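
The responses in this log are consistent with a simple rule: the answer is "yes" when the hand-tuned score is above the threshold and "dontknow" otherwise. The sketch below encodes that reading. It is an inference from the logged values, not the system's confirmed logic; the function name predict is hypothetical, and the behaviour when the score equals the threshold is an assumption.

```python
def predict(hand_tuned_score, threshold=-3.3437):
    """Return the response implied by comparing score and threshold.

    Assumption: scores strictly above the threshold yield "yes"; the log
    contains no entry where the two are equal, so that case is a guess.
    """
    return "yes" if hand_tuned_score > threshold else "dontknow"


print(predict(1.0))     # score logged for inference 1
print(predict(-14.5))   # score logged for inference -1
```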


Inference ID: -1

Txt: Some students came to school by car.

Hyp: No students came to school. (Don't know.)

No:DT students:NNS came:VBD school:NN .:.
Some:DT 10.00 20.00 20.00 20.00 10.00
students:NNS 20.00   0.00 15.00   0.75 20.00
came:VBD 20.00 15.00   0.00 12.45 17.90
school:NN 20.00   0.75 12.45   0.00 19.99
car:NN 20.00   8.95 13.49   7.06 19.69
.:. 10.00 20.00 17.90 19.99   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -10.0000
Features matched: Adjunct.dropPosCxt: text adjunct "car" of "came" dropped on aligned hyp word "came"; Antonym.samePol: matching polarity with antonyms: No & Some; Location.mismatch: no clear info of matching: come(X, prep_to); Quant.oneNo: [some,no]
Hand-tuned score: -14.5000
Threshold: -3.3437


Inference ID: 2

Txt: No students came to school by car.

Hyp: Some students came to school. (Don't know.)

Some:DT students:NNS came:VBD school:NN .:.
No:DT 10.00 20.00 20.00 20.00 10.00
students:NNS 20.00   0.00 15.00   0.75 20.00
came:VBD 20.00 15.00   0.00 12.45 17.90
school:NN 20.00   0.75 12.45   0.00 19.99
car:NN 20.00   8.95 13.49   7.06 19.69
.:. 10.00 20.00 17.90 19.99   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -10.0000
Features matched: Adjunct.dropPosCxt: text adjunct "car" of "came" dropped on aligned hyp word "came"; Antonym.samePol: matching polarity with antonyms: Some & No; Location.mismatch: no clear info of matching: come(X, prep_to); Quant.oneNo: [no,some]
Hand-tuned score: -14.5000
Threshold: -3.3437


Inference ID: -2

Txt: No students came to school by car.

Hyp: No students came to school. (Don't know.)

No:DT students:NNS came:VBD school:NN .:.
No:DT   0.00 20.00 20.00 20.00 10.00
students:NNS 20.00   0.00 15.00   0.75 20.00
came:VBD 20.00 15.00   0.00 12.45 17.90
school:NN 20.00   0.75 12.45   0.00 19.99
car:NN 20.00   8.95 13.49   7.06 19.69
.:. 10.00 20.00 17.90 19.99   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "car" of "came" dropped on aligned hyp word "came"; Location.mismatch: no clear info of matching: come(X, prep_to); Quant.bothNo: [no,no]; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 1.0000
Threshold: -3.3437
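
Each entry's Response line records the system's answer and, in parentheses, whether it agreed with the expected answer given after the Hyp sentence. A minimal sketch for tallying those outcomes over the lines of a log like this one follows. The helper name tally_responses and the regular expression are assumptions based on the line format visible here.

```python
import re
from collections import Counter

# Matches lines such as "Response: yes (CORRECT)" or "Response: dontknow (INCORRECT)".
RESPONSE_RE = re.compile(r"^Response: (\w+) \((CORRECT|INCORRECT)\)$")


def tally_responses(lines):
    """Count CORRECT and INCORRECT outcomes among Response lines."""
    counts = Counter()
    for line in lines:
        match = RESPONSE_RE.match(line.strip())
        if match:
            counts[match.group(2)] += 1
    return counts


sample = [
    "Response: yes (CORRECT)",
    "Alignment score: 0.0000",
    "Response: yes (INCORRECT)",
]
print(tally_responses(sample))
```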


Inference ID: 3

Txt: John drove legally.

Hyp: John drove. (Yes.)

John:NNP drove:VBD .:.
John:NNP   0.00 13.53 20.50
drove:VBD 13.53   0.00 18.77
legally:RB 15.46 19.96 19.03
.:. 20.50 18.77   0.00
NO_WORD 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "legally" of "drove" dropped on aligned hyp word "drove"; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 2.0000
Threshold: -3.3437


Inference ID: -3

Txt: John drove legally.

Hyp: John did not drive. (Don't know.)

John:NNP did:VBD not:RB drive:VB .:.
John:NNP   0.00 13.35 15.46 13.53 20.50
drove:VBD 13.53   8.93 19.96   0.50 18.77
legally:RB 15.46 17.86   9.96 19.50 19.03
.:. 20.50 17.99 20.00 19.58   0.00
NO_WORD 10.00   1.00   9.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -10.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "drive": neg; NullPunisher.aux: did; NullPunisher.other: not
Hand-tuned score: -5.0500
Threshold: -3.3437


Inference ID: 4

Txt: John drove predictably.

Hyp: John drove. (Yes.)

John:NNP drove:VBD .:.
John:NNP   0.00 13.53 20.50
drove:VBD 13.53   0.00 18.77
predictably:RB 15.46 19.94 20.00
.:. 20.50 18.77   0.00
NO_WORD 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "predictably" of "drove" dropped on aligned hyp word "drove"; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 2.0000
Threshold: -3.3437


Inference ID: -4

Txt: John drove predictably.

Hyp: John did not drive. (Don't know.)

John:NNP did:VBD not:RB drive:VB .:.
John:NNP   0.00 13.35 15.46 13.53 20.50
drove:VBD 13.53   8.93 19.96   0.50 18.77
predictably:RB 15.46 19.17   9.96 19.68 20.00
.:. 20.50 17.99 20.00 19.58   0.00
NO_WORD 10.00   1.00   9.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -10.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "drive": neg; NullPunisher.aux: did; NullPunisher.other: not
Hand-tuned score: -5.0500
Threshold: -3.3437


Inference ID: 5

Txt: Legally, John could drive.

Hyp: John drove. (Don't know.)

John:NNP drove:VBD .:.
Legally:RB 15.46 19.96 20.00
,:, 20.50 18.13   5.73
John:NNP   0.00 13.53 20.50
could:MD 20.46 19.96 10.00
drive:VB 13.53   0.50 19.58
.:. 20.50 18.77   0.00
NO_WORD 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -0.5000
Features matched: Adjunct.dropPosCxt: text adjunct "Legally" of "drive" dropped on aligned hyp word "drove"; Modal.dontKnow: possible -> actual; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 0.0000
Threshold: -3.3437


Inference ID: -5

Txt: Legally, John could drive.

Hyp: John did not drive. (Don't know.)

John:NNP did:VBD not:RB drive:VB .:.
Legally:RB 15.46 19.96   9.96 19.96 20.00
,:, 20.50 19.80 20.00 19.25   5.73
John:NNP   0.00 13.35 15.46 13.53 20.50
could:MD 20.46 17.84 19.96 19.96 10.00
drive:VB 13.53   6.26 19.96   0.00 19.58
.:. 20.50 17.99 20.00 19.58   0.00
NO_WORD 10.00   1.00   9.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -10.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Modal.dontKnow: possible -> not actual; Polarity.hypNegMarker: "drive": neg; NullPunisher.other: not; NullPunisher.aux: did
Hand-tuned score: -7.0500
Threshold: -3.3437


Inference ID: 6

Txt: Predictably, John drove.

Hyp: John drove. (Yes.)

John:NNP drove:VBD .:.
Predictably:RB 15.46 19.96 20.00
,:, 20.50 18.13   5.73
John:NNP   0.00 13.53 20.50
drove:VBD 13.53   0.00 18.77
.:. 20.50 18.77   0.00
NO_WORD 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "Predictably" of "drove" dropped on aligned hyp word "drove"; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 2.0000
Threshold: -3.3437


Inference ID: -6

Txt: Predictably, John drove.

Hyp: John did not drive. (Don't know.)

John:NNP did:VBD not:RB drive:VB .:.
Predictably:RB 15.46 19.96   9.96 19.96 20.00
,:, 20.50 19.80 20.00 19.25   5.73
John:NNP   0.00 13.35 15.46 13.53 20.50
drove:VBD 13.53   8.93 19.96   0.50 18.77
.:. 20.50 17.99 20.00 19.58   0.00
NO_WORD 10.00   1.00   9.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -10.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "drive": neg; NullPunisher.other: not; NullPunisher.aux: did
Hand-tuned score: -5.0500
Threshold: -3.3437


Inference ID: 7

Txt: The technician cooled the room.

Hyp: The technician lowered the temperature of the room. (Yes.)

The:DT technician:NN lowered:VBD the:DT temperature:NN the:DT room:NN .:.
The:DT   0.00 20.00 20.00   0.00 20.00   0.00 20.00 10.00
technician:NN 20.00   0.00 13.95 20.00   8.15 20.00   8.71 20.00
cooled:VBD 20.00 13.08   7.62 20.00   9.59 20.00 13.15 19.82
the:DT   0.00 20.00 20.00   0.00 20.00   0.00 20.00 10.00
room:NN 20.00   8.71 12.93 20.00   6.97 20.00   0.00 19.15
.:. 10.00 20.00 20.00 10.00 20.00 10.00 19.15   0.00
NO_WORD   1.00 10.00 10.00   1.00 10.00   1.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -16.5893
Features matched: RootEntailment.poorlyAlignedRoot: "lowered" aligned badly to "cooled"
Hand-tuned score: -1.0000
Threshold: -3.3437


Inference ID: -7

Txt: The technician cooled the room.

Hyp: The technician did not lower the temperature of the room. (Don't know.)

The:DT technician:NN did:VBD not:RB lower:VB the:DT temperature:NN the:DT room:NN .:.
The:DT   0.00 20.00 20.00 20.00 20.00   0.00 20.00   0.00 20.00 10.00
technician:NN 20.00   0.00 12.84 14.96 13.95 20.00   8.15 20.00   8.71 20.00
cooled:VBD 20.00 13.08   7.53 19.96   7.62 20.00   9.59 20.00 13.15 19.82
the:DT   0.00 20.00 20.00 20.00 20.00   0.00 20.00   0.00 20.00 10.00
room:NN 20.00   8.71 13.03 14.96 11.31 20.00   6.97 20.00   0.00 19.15
.:. 10.00 20.00 17.99 20.00 19.18 10.00 20.00 10.00 19.15   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00   1.00 10.00   1.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -26.5893
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "lower": neg; NullPunisher.aux: did; NullPunisher.other: not; RootEntailment.poorlyAlignedRoot: "lower" aligned badly to "cooled"
Hand-tuned score: -7.0500
Threshold: -3.3437


Inference ID: 8

Txt: The technician raised the temperature of the room.

Hyp: The technician cooled the room. (Don't know.)

The:DT technician:NN cooled:VBD the:DT room:NN .:.
The:DT   0.00 20.00 20.00   0.00 20.00 10.00
technician:NN 20.00   0.00 13.08 20.00   8.71 20.00
raised:VBD 20.00 13.95   6.95 20.00 13.69 20.00
the:DT   0.00 20.00 20.00   0.00 20.00 10.00
temperature:NN 20.00   8.15   9.59 20.00   6.97 20.00
the:DT   0.00 20.00 20.00   0.00 20.00 10.00
room:NN 20.00   8.71 13.15 20.00   0.00 19.15
.:. 10.00 20.00 19.82 10.00 19.15   0.00
NO_WORD   1.00 10.00 10.00   1.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -8.9537
Features matched: RootEntailment.poorlyAlignedRoot: "cooled" aligned badly to "raised"; Structure.parentsMismatch: args have different parents, different relations: text "room" <-prep_of-- "temperature" vs. hyp "room" <-dobj-- "cooled", which aligned to text "raised"
Hand-tuned score: -4.0000
Threshold: -3.3437


Inference ID: -8

Txt: The technician raised the temperature of the room.

Hyp: The technician did not cool the room. (Yes.)

The:DT technician:NN did:VBD not:RB cool:VB the:DT room:NN .:.
The:DT   0.00 20.00 20.00 20.00 20.00   0.00 20.00 10.00
technician:NN 20.00   0.00 12.84 14.96 13.08 20.00   8.71 20.00
raised:VBD 20.00 13.95   5.44 19.96   6.95 20.00 13.69 20.00
the:DT   0.00 20.00 20.00 20.00 20.00   0.00 20.00 10.00
temperature:NN 20.00   8.15 12.53 14.96   9.59 20.00   6.97 20.00
the:DT   0.00 20.00 20.00 20.00 20.00   0.00 20.00 10.00
room:NN 20.00   8.71 13.03 14.96 11.00 20.00   0.00 19.15
.:. 10.00 20.00 17.99 20.00 20.00 10.00 19.15   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00   1.00 10.00 10.00

Response: dontknow (INCORRECT)
Justification:
Alignment score: -18.9537
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "cool": neg; NullPunisher.aux: did; NullPunisher.other: not; RootEntailment.poorlyAlignedRoot: "cool" aligned badly to "raised"; Structure.parentsMismatch: args have different parents, different relations: text "room" <-prep_of-- "temperature" vs. hyp "room" <-dobj-- "cool", which aligned to text "raised"
Hand-tuned score: -10.0500
Threshold: -3.3437


Inference ID: 9

Txt: The president visited Iraq in September.

Hyp: The president has gone to Iraq. (Yes.)

The:DT president:NN has:VBZ gone:VBN Iraq:NNP .:.
The:DT   0.00 20.00 20.00 20.00 20.50 10.00
president:NN 20.00   0.00 14.34 12.72   9.84 20.00
visited:VBD 20.00 13.07 10.00   7.43 15.50 19.78
Iraq:NNP 20.50   9.84 13.02 14.84   0.00 20.50
September:NNP 20.50 10.50 14.19 12.73 15.00 20.50
.:. 10.00 20.00 20.00 19.35 20.50   0.00
NO_WORD   1.00 10.00   1.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -10.4338
Features matched: Adjunct.dropPosCxt: text adjunct "September" of "Iraq" dropped on aligned hyp word "Iraq"; NullPunisher.aux: has; RootEntailment.poorlyAlignedRoot: "gone" aligned badly to "visited"; Structure.relMismatch: text "Iraq" is dobj of "visited" while hyp "Iraq" is prep_to of "gone" which aligned to text "visited"
Hand-tuned score: -1.5500
Threshold: -3.3437


Inference ID: -9

Txt: The president visited Iraq in September.

Hyp: The president has not gone to Iraq. (Don't know.)

The:DT president:NN has:VBZ not:RB gone:VBN Iraq:NNP .:.
The:DT   0.00 20.00 20.00 20.00 20.00 20.50 10.00
president:NN 20.00   0.00 14.34 14.96 12.72   9.84 20.00
visited:VBD 20.00 13.07 10.00 19.96   7.43 15.50 19.78
Iraq:NNP 20.50   9.84 13.02 15.46 14.84   0.00 20.50
September:NNP 20.50 10.50 14.19 15.46 12.73 15.00 20.50
.:. 10.00 20.00 20.00 20.00 19.35 20.50   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -19.4338
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "gone": neg; NullPunisher.other: not; NullPunisher.aux: has; RootEntailment.poorlyAlignedRoot: "gone" aligned badly to "visited"; Structure.relMismatch: text "Iraq" is dobj of "visited" while hyp "Iraq" is prep_to of "gone" which aligned to text "visited"
Hand-tuned score: -8.0500
Threshold: -3.3437


Inference ID: 10

Txt: Jones has visited Iraq.

Hyp: Jones visited Iraq in September. (Don't know.)

Jones:NNP visited:VBD Iraq:NNP September:NNP .:.
Jones:NNP   0.00 15.50 14.34 15.00 20.50
has:VBZ 14.84 10.00 13.02 14.19 20.00
visited:VBN 15.50   0.00 15.50 15.50 19.78
Iraq:NNP 14.34 15.50   0.00 15.00 20.50
.:. 20.50 19.78 20.50 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -10.0000
Features matched: Adjunct.addPosCxt: hyp added September[September-NNP]; Date.hypDateIns: hypothesis date insertion: September; NullPunisher.other: September
Hand-tuned score: -3.0000
Threshold: -3.3437


Inference ID: -10

Txt: Jones has visited Iraq.

Hyp: Jones did not visit Iraq in September. (Don't know.)

Jones:NNP did:VBD not:RB visit:VB Iraq:NNP September:NNP .:.
Jones:NNP   0.00 15.50 15.46 15.50 14.34 15.00 20.50
has:VBZ 14.84   7.53 19.96 10.00 13.02 14.19 20.00
visited:VBN 15.50   7.62 19.96   0.31 15.50 15.50 19.78
Iraq:NNP 14.34 15.50 15.46 15.50   0.00 15.00 20.50
.:. 20.50 17.99 20.00 20.00 20.50 20.50   0.00
NO_WORD 10.00   1.00   9.00 10.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -20.3094
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Date.hypDateIns: hypothesis date insertion: September; Polarity.hypNegMarker: "visit": neg; NullPunisher.other: not; NullPunisher.aux: did; NullPunisher.other: September
Hand-tuned score: -8.0500
Threshold: -3.3437


Inference ID: 11

Txt: Jones arrived in Paris in September last year.

Hyp: Jones arrived in Paris last year. (Yes.)

Jones:NNP arrived:VBD Paris:NNP last:JJ year:NN .:.
Jones:NNP   0.00 15.00   9.84 11.45 10.50 20.00
arrived:VBD 15.00   0.00 15.50 12.50 15.28 20.00
Paris:NNP   9.84 15.50   0.00 16.34 13.13 20.50
September:NNP 10.50 15.50 15.00   9.84   7.23 20.50
last:JJ 11.45 12.50 16.34   0.00   9.84 20.50
year:NN 10.50 15.28 13.13   9.84   0.00 17.60
.:. 20.00 20.00 20.50 20.50 17.60   0.00
NO_WORD 10.00 10.00 10.00   9.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "September" of "Paris" dropped on aligned hyp word "Paris"; Date.matchDatesByGraph: hyp/txt matching, by graph: year and children; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 3.0000
Threshold: -3.3437


Inference ID: -11

Txt: Jones arrived in Paris in September last year.

Hyp: Jones did not arrive in Paris last year. (Don't know.)

Jones:NNP did:VBD not:RB arrive:VB Paris:NNP last:JJ year:NN .:.
Jones:NNP   0.50 15.00 14.96 15.00   9.84 11.45 10.50 20.00
arrived:VBD 15.50   7.47 19.96   0.50 15.50 12.50 15.28 20.00
Paris:NNP 14.34 15.50 15.46 15.50   0.00 16.34 13.13 20.50
September:NNP 15.00 14.19 15.46 15.50 15.00   9.84   7.23 20.50
last:JJ 15.95 10.19 12.46 12.50 16.34   0.00   9.84 20.50
year:NN 15.00 14.19 15.46 15.50 13.13   9.84   0.00 17.60
.:. 20.50 17.99 20.00 20.00 20.50 20.50 17.60   0.00
NO_WORD 10.00   1.00   9.00 10.00 10.00   9.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Date.matchDatesByGraph: hyp/txt matching, by graph: year and children; Polarity.hypNegMarker: "arrive": neg; NullPunisher.aux: did; NullPunisher.other: not
Hand-tuned score: -4.0500
Threshold: -3.3437


Inference ID: 12

Txt: Jones arrived in Paris in September last year.

Hyp: Jones arrived in Paris in September. (Don't know.)

Jones:NNP arrived:VBD Paris:NNP September:NNP .:.
Jones:NNP   0.00 15.00   9.84 10.50 20.00
arrived:VBD 15.00   0.00 15.50 15.50 20.00
Paris:NNP   9.84 15.50   0.00 15.00 20.50
September:NNP 10.50 15.50 15.00   0.00 20.50
last:JJ 11.45 12.50 16.34   9.84 20.50
year:NN 10.50 15.28 13.13   7.23 17.60
.:. 20.00 20.00 20.50 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -2.0000
Features matched: Adjunct.dropPosCxt: text adjunct "year" of "arrived" dropped on aligned hyp word "arrived"; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 09/01/1000; Structure.argsMismatch: args have different parents but same relations: text "September" <-prep_in-- "Paris" vs. hyp "September" <-prep_in-- "arrived", which aligned to text "arrived"
Hand-tuned score: -0.5000
Threshold: -3.3437


Inference ID: -12

Txt: Jones arrived in Paris in September last year.

Hyp: Jones did not arrive in Paris in September. (Don't know.)

Jones:NNP did:VBD not:RB arrive:VB Paris:NNP September:NNP .:.
Jones:NNP   0.50 15.00 14.96 15.00   9.84 10.50 20.00
arrived:VBD 15.50   7.47 19.96   0.50 15.50 15.50 20.00
Paris:NNP 14.34 15.50 15.46 15.50   0.00 15.00 20.50
September:NNP 15.00 14.19 15.46 15.50 15.00   0.00 20.50
last:JJ 15.95 10.19 12.46 12.50 16.34   9.84 20.50
year:NN 15.00 14.19 15.46 15.50 13.13   7.23 17.60
.:. 20.50 17.99 20.00 20.00 20.50 20.50   0.00
NO_WORD 10.00   1.00   9.00 10.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 09/01/1000; Polarity.hypNegMarker: "arrive": neg; NullPunisher.aux: did; NullPunisher.other: not
Hand-tuned score: -4.0500
Threshold: -3.3437


Inference ID: 13

Txt: Jones arrived on a Sunday in September.

Hyp: Jones arrived on a Sunday. (Yes.)

Jones:NNP arrived:VBD a:DT Sunday:NNP .:.
Jones:NNP   0.00 15.00 20.00 10.50 20.00
arrived:VBD 15.00   0.00 20.00 15.50 20.00
a:DT 20.00 20.00   0.00 20.50 10.00
Sunday:NNP 10.50 15.50 20.50   0.00 20.50
September:NNP 10.50 15.50 20.50   7.23 20.50
.:. 20.00 20.00 10.00 20.50   0.00
NO_WORD 10.00 10.00   1.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "September" of "arrived" dropped on aligned hyp word "arrived"; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Quant.contract: [a,a]; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 4.0000
Threshold: -3.3437


Inference ID: -13

Txt: Jones arrived on a Sunday in September.

Hyp: Jones did not arrive on a Sunday. (Don't know.)

Jones:NNP did:VBD not:RB arrive:VB a:DT Sunday:NNP .:.
Jones:NNP   0.50 15.00 14.96 15.00 20.00 10.50 20.00
arrived:VBD 15.50   7.47 19.96   0.50 20.00 15.50 20.00
a:DT 20.50 20.00 20.00 20.00   0.00 20.50 10.00
Sunday:NNP 15.00 10.20 15.46 15.50 20.50   0.00 20.50
September:NNP 15.00 14.19 15.46 15.50 20.50   7.23 20.50
.:. 20.50 17.99 20.00 20.00 10.00 20.50   0.00
NO_WORD 10.00   1.00   9.00 10.00   1.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Polarity.hypNegMarker: "arrive": neg; NullPunisher.other: not; NullPunisher.aux: did; Quant.contract: [a,a]
Hand-tuned score: -3.0500
Threshold: -3.3437


Inference ID: 14

Txt: Jones arrived on a Sunday in September.

Hyp: Jones arrived in September. (Yes.)

Jones:NNP arrived:VBD September:NNP .:.
Jones:NNP   0.00 15.00 10.50 20.00
arrived:VBD 15.00   0.00 15.50 20.00
a:DT 20.00 20.00 20.50 10.00
Sunday:NNP 10.50 15.50   7.23 20.50
September:NNP 10.50 15.50   0.00 20.50
.:. 20.00 20.00 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "Sunday" of "arrived" dropped on aligned hyp word "arrived"; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 09/01/1000; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 3.0000
Threshold: -3.3437


Inference ID: -14

Txt: Jones arrived on a Sunday in September.

Hyp: Jones did not arrive in September. (Don't know.)

Jones:NNP did:VBD not:RB arrive:VB September:NNP .:.
Jones:NNP   0.50 15.00 14.96 15.00 10.50 20.00
arrived:VBD 15.50   7.47 19.96   0.50 15.50 20.00
a:DT 20.50 20.00 20.00 20.00 20.50 10.00
Sunday:NNP 15.00 10.20 15.46 15.50   7.23 20.50
September:NNP 15.00 14.19 15.46 15.50   0.00 20.50
.:. 20.50 17.99 20.00 20.00 20.50   0.00
NO_WORD 10.00   1.00   9.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 09/01/1000; Polarity.hypNegMarker: "arrive": neg; NullPunisher.aux: did; NullPunisher.other: not
Hand-tuned score: -4.0500
Threshold: -3.3437


Inference ID: 15

Txt: The president left after the diplomat arrived.

Hyp: The diplomat arrived before the president left. (Yes.)

The:DT diplomat:NN arrived:VBD before:IN the:DT president:NN left:VBD .:.
The:DT   0.00 20.00 20.00 20.00   0.00 20.00 20.00 10.00
president:NN 20.00   3.94 13.72 20.00 20.00   0.00 13.35 20.00
left:VBD 20.00 14.34   6.03 20.00 20.00 13.35   0.00 19.46
after:IN 20.00 20.00 20.00   0.00 17.88 20.00 20.00 20.00
the:DT   0.00 20.00 20.00 20.00   0.00 20.00 20.00 10.00
diplomat:NN 20.00   0.00 13.41 20.00 20.00   3.94 14.34 19.87
arrived:VBD 20.00 13.41   0.00 20.00 20.00 13.72   6.03 20.00
.:. 10.00 19.87 20.00 20.00 10.00 20.00 19.46   0.00
NO_WORD   1.00 10.00 10.00 10.00   1.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -6.0000
Features matched: Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "left" vs. hyp "." <-punct-- "arrived", which aligned to text "arrived"
Hand-tuned score: -2.0000
Threshold: -3.3437


Inference ID: -15

Txt: The president left after the diplomat arrived.

Hyp: The diplomat did not arrive before the president left. (Don't know.)

The:DT diplomat:NN did:VBD not:RB arrive:VB before:IN the:DT president:NN left:VBD .:.
The:DT   0.00 20.00 20.00 20.00 20.00 20.00   0.00 20.00 20.00 10.00
president:NN 20.00   3.94 15.00 14.96 14.47 20.00 20.00   0.00 13.35 20.00
left:VBD 20.00 14.34   7.37 19.96   9.62 20.00 20.00 13.35   0.00 19.46
after:IN 20.00 20.00 20.00 20.00 20.00   0.00 17.88 20.00 20.00 20.00
the:DT   0.00 20.00 20.00 20.00 20.00 20.00   0.00 20.00 20.00 10.00
diplomat:NN 20.00   0.00 15.00 14.96 13.56 20.00 20.00   3.94 14.34 19.87
arrived:VBD 20.00 13.41   7.47 19.96   0.50 20.00 20.00 13.72   6.03 20.00
.:. 10.00 19.87 17.99 20.00 20.00 20.00 10.00 20.00 19.46   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00 10.00   1.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -16.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "arrive": neg; NullPunisher.other: not; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "left" vs. hyp "." <-punct-- "arrive", which aligned to text "arrived"
Hand-tuned score: -8.0500
Threshold: -3.3437


Inference ID: 16

Txt: No US congressman has visited Iraq since the war ended.

Hyp: Jones, a US Congressman, has visited Iraq after the war ended. (Don't know.)

Jones:NNP ,:, a:DT US_Congressman:NNP ,:, has:VBZ visited:VBN Iraq:NNP after:IN the:DT war:NN ended:VBD .:.
No:DT 20.50 10.00 10.00 20.50 10.00 20.00 20.00 20.50 20.00 10.00 20.00 20.00 10.00
US:NNP 14.34 20.50 20.50   5.00 20.50 13.02 15.50   5.34 20.50 20.50 10.50 13.02 20.50
congressman:NN 10.19 19.74 20.50   5.00 19.74 14.84 15.32 14.34 20.50 20.50   9.08 13.55 20.50
has:VBZ 14.84 20.00 20.00 14.26 20.00   0.00 10.00 13.02 20.00 20.00 15.00   7.52 20.00
visited:VBN 15.50 19.44 20.00 15.46 19.44 10.00   0.00 15.50 20.00 20.00 12.10   7.62 19.78
Iraq:NNP 14.34 20.50 20.50 12.67 20.50 13.02 15.50   0.00 20.50 20.50 10.50 13.02 20.50
since:IN 20.50 20.00 20.00 20.50 20.00 20.00 20.00 20.50   0.00 20.00 20.00 20.00 20.00
the:DT 20.50 10.00 10.00 20.50 10.00 20.00 20.00 20.50 17.88   0.00 20.00 20.00 10.00
war:NN 10.50 20.00 20.00 10.46 20.00 15.00 12.10 10.50 20.00 20.00   0.00 12.44 19.45
ended:VBD   9.59 18.30 20.00 14.26 18.30   7.52   7.62 13.02 20.00 20.00 12.44   0.00 20.00
.:. 20.50   5.73 10.00 20.50   5.73 20.00 19.78 20.50 20.00 10.00 19.45 20.00   0.00
NO_WORD 10.00 10.00   1.00 10.00 10.00   1.00 10.00 10.00 10.00   1.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -33.4591
Features matched: Adjunct.dropPosCxt: text adjunct "US" of "congressman" dropped on aligned hyp word "US_Congressman"; NullPunisher.other: Jones; NullPunisher.article: a; Quant.oneNo: [no,a]
Hand-tuned score: -5.6000
Threshold: -3.3437


Inference ID: -16

Txt: No US congressman has visited Iraq since the war ended.

Hyp: Jones, a US Congressman, has not visited Iraq after the war ended. (Yes.)

Jones:NNP ,:, a:DT US_Congressman:NNP ,:, has:VBZ not:RB visited:VBN Iraq:NNP after:IN the:DT war:NN ended:VBD .:.
No:DT 20.50 10.00 10.00 20.50 10.00 20.00 20.00 20.00 20.50 20.00 10.00 20.00 20.00 10.00
US:NNP 14.34 20.50 20.50   5.00 20.50 13.02 15.46 15.50   5.34 20.50 20.50 10.50 13.02 20.50
congressman:NN 10.19 19.74 20.50   5.00 19.74 14.84 15.46 15.32 14.34 20.50 20.50   9.08 13.55 20.50
has:VBZ 14.84 20.00 20.00 14.26 20.00   0.00 19.96 10.00 13.02 20.00 20.00 15.00   7.52 20.00
visited:VBN 15.50 19.44 20.00 15.46 19.44 10.00 19.96   0.00 15.50 20.00 20.00 12.10   7.62 19.78
Iraq:NNP 14.34 20.50 20.50 12.67 20.50 13.02 15.46 15.50   0.00 20.50 20.50 10.50 13.02 20.50
since:IN 20.50 20.00 20.00 20.50 20.00 20.00 20.00 20.00 20.50   0.00 20.00 20.00 20.00 20.00
the:DT 20.50 10.00 10.00 20.50 10.00 20.00 20.00 20.00 20.50 17.88   0.00 20.00 20.00 10.00
war:NN 10.50 20.00 20.00 10.46 20.00 15.00 14.96 12.10 10.50 20.00 20.00   0.00 12.44 19.45
ended:VBD   9.59 18.30 20.00 14.26 18.30   7.52 19.96   7.62 13.02 20.00 20.00 12.44   0.00 20.00
.:. 20.50   5.73 10.00 20.50   5.73 20.00 20.00 19.78 20.50 20.00 10.00 19.45 20.00   0.00
NO_WORD 10.00 10.00   1.00 10.00 10.00   1.00   9.00 10.00 10.00 10.00   1.00 10.00 10.00 10.00

Response: dontknow (INCORRECT)
Justification:
Alignment score: -42.4591
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "visited": neg; NullPunisher.other: not; NullPunisher.other: Jones; NullPunisher.article: a; Quant.oneNo: [no,a]
Hand-tuned score: -12.1000
Threshold: -3.3437


Inference ID: 17

Txt: No US congressman has visited Iraq since the war.

Hyp: Jones, a US Congressman, visited Iraq before the war. (Don't know.)

Jones:NNP ,:, a:DT US_Congressman:NNP ,:, visited:VBD Iraq:NNP the:DT war:NN .:.
No:DT 20.50 10.00 10.00 20.50 10.00 20.00 20.50 10.00 20.00 10.00
US:NNP 14.34 20.50 20.50   5.00 20.50 15.50   5.34 20.50 10.50 20.50
congressman:NN 10.19 19.74 20.50   5.00 19.74 15.32 14.34 20.50   9.08 20.50
has:VBZ 14.84 20.00 20.00 14.26 20.00 10.00 13.02 20.00 15.00 20.00
visited:VBN 15.50 19.44 20.00 15.46 19.44   0.00 15.50 20.00 12.10 19.78
Iraq:NNP 14.34 20.50 20.50 12.67 20.50 15.50   0.00 20.50 10.50 20.50
the:DT 20.50 10.00 10.00 20.50 10.00 20.00 20.50   0.00 20.00 10.00
war:NN 10.50 20.00 20.00 10.46 20.00 12.10 10.50 20.00   0.00 19.45
.:. 20.50   5.73 10.00 20.50   5.73 19.78 20.50 10.00 19.45   0.00
NO_WORD 10.00 10.00   1.00 10.00 10.00 10.00 10.00   1.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -35.4591
Features matched: Adjunct.dropPosCxt: text adjunct "US" of "congressman" dropped on aligned hyp word "US_Congressman"; NullPunisher.article: a; NullPunisher.other: Jones; Quant.oneNo: [no,a]; Structure.parentsMismatch: args have different parents, different relations: text "war" <-prep_since-- "Iraq" vs. hyp "war" <-prep_before-- "visited", which aligned to text "visited"
Hand-tuned score: -8.6000
Threshold: -3.3437


Inference ID: -17

Txt: No US congressman has visited Iraq since the war.

Hyp: Jones, a US Congressman, did not visit Iraq before the war. (Don't know.)

Jones:NNP ,:, a:DT US_Congressman:NNP ,:, did:VBD not:RB visit:VB Iraq:NNP the:DT war:NN .:.
No:DT 20.50 10.00 10.00 20.50 10.00 20.00 20.00 20.00 20.50 10.00 20.00 10.00
US:NNP 14.34 20.50 20.50   5.00 20.50 15.50 15.46 15.50   5.34 20.50 10.50 20.50
congressman:NN 10.19 19.74 20.50   5.00 19.74 14.41 15.46 15.50 14.34 20.50   9.08 20.50
has:VBZ 14.84 20.00 20.00 14.26 20.00   7.53 19.96 10.00 13.02 20.00 15.00 20.00
visited:VBN 15.50 19.44 20.00 15.46 19.44   7.62 19.96   0.31 15.50 20.00 12.10 19.78
Iraq:NNP 14.34 20.50 20.50 12.67 20.50 15.50 15.46 15.50   0.00 20.50 10.50 20.50
the:DT 20.50 10.00 10.00 20.50 10.00 20.00 20.00 20.00 20.50   0.00 20.00 10.00
war:NN 10.50 20.00 20.00 10.46 20.00 12.69 14.96 12.10 10.50 20.00   0.00 19.45
.:. 20.50   5.73 10.00 20.50   5.73 17.99 20.00 20.00 20.50 10.00 19.45   0.00
NO_WORD 10.00 10.00   1.00 10.00 10.00   1.00   9.00 10.00 10.00   1.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -45.7685
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "visit": neg; NullPunisher.article: a; NullPunisher.other: not; NullPunisher.aux: did; NullPunisher.other: Jones; Quant.oneNo: [no,a]; Structure.parentsMismatch: args have different parents, different relations: text "war" <-prep_since-- "Iraq" vs. hyp "war" <-prep_before-- "visit", which aligned to text "visited"
Hand-tuned score: -15.1500
Threshold: -3.3437


Inference ID: 18

Txt: No US congressman visited Iraq until the war.

Hyp: Some US congressman visited Iraq before the war. (Don't know.)

Some:DT US:NNP congressman:NN visited:VBD Iraq:NNP the:DT war:NN .:.
No:DT 10.00 20.50 20.00 20.00 20.50 10.00 20.00 10.00
US:NNP 20.50   0.00   9.84 15.50   5.34 20.50 10.50 20.50
congressman:NN 20.00   9.84   0.00 14.82   9.84 20.00   8.58 20.00
visited:VBD 20.00 15.50 14.82   0.00 15.50 20.00 12.10 19.78
Iraq:NNP 20.50   5.34   9.84 15.50   0.00 20.50 10.50 20.50
the:DT 10.00 20.50 20.00 20.00 20.50   0.00 20.00 10.00
war:NN 20.00 10.50   8.58 12.10 10.50 20.00   0.00 19.45
.:. 10.00 20.50 20.00 19.78 20.50 10.00 19.45   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00   1.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Antonym.samePol: matching polarity with antonyms: Some & No; Quant.oneNo: [no,some[; Structure.relMismatch: text "war" is prep_until of "visited" while hyp "war" is prep_before of "visited" which aligned to text "visited"
Hand-tuned score: -14.0000
Threshold: -3.3437


Inference ID: -18

Txt: No US congressman visited Iraq until the war.

Hyp: No US congressman visited Iraq before the war. (Yes.)

No
DT
US
NNP
congressman
NN
visited
VBD
Iraq
NNP
the
DT
war
NN
.
.
No:DT   0.00 20.50 20.00 20.00 20.50 10.00 20.00 10.00
US:NNP 20.50   0.00   9.84 15.50   5.34 20.50 10.50 20.50
congressman:NN 20.00   9.84   0.00 14.82   9.84 20.00   8.58 20.00
visited:VBD 20.00 15.50 14.82   0.00 15.50 20.00 12.10 19.78
Iraq:NNP 20.50   5.34   9.84 15.50   0.00 20.50 10.50 20.50
the:DT 10.00 20.50 20.00 20.00 20.50   0.00 20.00 10.00
war:NN 20.00 10.50   8.58 12.10 10.50 20.00   0.00 19.45
.:. 10.00 20.50 20.00 19.78 20.50 10.00 19.45   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00   1.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -1.0000
Features matched: Quant.bothNo: [no,no]; Structure.relMismatch: text "war" is prep_until of "visited" while hyp "war" is prep_before of "visited" which aligned to text "visited"
Hand-tuned score: 1.0000
Threshold: -3.3437


Inference ID: 19

Txt: Some students arrived at the school on Sunday.

Hyp: There were some students at the school on Sunday. (Yes.)

There
EX
were
VBD
some
DT
students
NNS
the
DT
school
NN
Sunday
NNP
.
.
Some:DT 10.00 20.00   0.00 20.00 10.00 20.00 20.50 10.00
students:NNS 20.00 14.34 20.00   0.00 20.00   0.75 10.50 20.00
arrived:VBD 20.00 10.00 20.00 14.29 20.00 13.50 15.50 20.00
the:DT 10.00 20.00 10.00 20.00   0.00 20.00 20.50 10.00
school:NN 20.00 14.34 20.00   0.75 20.00   0.00   7.73 19.99
Sunday:NNP 20.50 15.50 20.50 10.50 20.50   7.73   0.00 20.50
.:. 10.00 20.00 10.00 20.00 10.00 19.99 20.50   0.00
NO_WORD   1.00 10.00 10.00 10.00   1.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -15.0000
Features matched: Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; NullPunisher.functionWord: There; Quant.contract: [some,some]; RootEntailment.poorlyAlignedRoot: "were" aligned badly to "arrived"
Hand-tuned score: 0.9000
Threshold: -3.3437


Inference ID: -19

Txt: Some students arrived at the school on Sunday.

Hyp: There were no students at the school on Sunday. (Don't know.)

There
EX
were
VBD
no
DT
students
NNS
the
DT
school
NN
Sunday
NNP
.
.
Some:DT 10.00 20.00 10.00 20.00 10.00 20.00 20.50 10.00
students:NNS 20.00 14.34 20.00   0.00 20.00   0.75 10.50 20.00
arrived:VBD 20.00 10.00 20.00 14.29 20.00 13.50 15.50 20.00
the:DT 10.00 20.00 10.00 20.00   0.00 20.00 20.50 10.00
school:NN 20.00 14.34 20.00   0.75 20.00   0.00   7.73 19.99
Sunday:NNP 20.50 15.50 20.50 10.50 20.50   7.73   0.00 20.50
.:. 10.00 20.00 10.00 20.00 10.00 19.99 20.50   0.00
NO_WORD   1.00 10.00 10.00 10.00   1.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -25.0000
Features matched: Antonym.samePol: matching polarity with antonyms: no & Some; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Polarity.hypNegMarker: "no": no; NullPunisher.functionWord: There; Quant.oneNo: [some,no[; RootEntailment.poorlyAlignedRoot: "were" aligned badly to "arrived"
Hand-tuned score: -14.1000
Threshold: -3.3437


Inference ID: 20

Txt: No students arrived at the school on Sunday.

Hyp: There were some students at the school on Sunday. (Don't know.)

There
EX
were
VBD
some
DT
students
NNS
the
DT
school
NN
Sunday
NNP
.
.
No:DT 10.00 20.00 10.00 20.00 10.00 20.00 20.50 10.00
students:NNS 20.00 14.34 20.00   0.00 20.00   0.75 10.50 20.00
arrived:VBD 20.00 10.00 20.00 14.29 20.00 13.50 15.50 20.00
the:DT 10.00 20.00 10.00 20.00   0.00 20.00 20.50 10.00
school:NN 20.00 14.34 20.00   0.75 20.00   0.00   7.73 19.99
Sunday:NNP 20.50 15.50 20.50 10.50 20.50   7.73   0.00 20.50
.:. 10.00 20.00 10.00 20.00 10.00 19.99 20.50   0.00
NO_WORD   1.00 10.00 10.00 10.00   1.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -25.0000
Features matched: Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; NullPunisher.functionWord: There; NullPunisher.other: some; Quant.oneNo: [no,some[; RootEntailment.poorlyAlignedRoot: "were" aligned badly to "arrived"
Hand-tuned score: -7.1000
Threshold: -3.3437


Inference ID: -20

Txt: No students arrived at the school on Sunday.

Hyp: There were no students at the school on Sunday. (Don't know.)

There
EX
were
VBD
no
DT
students
NNS
the
DT
school
NN
Sunday
NNP
.
.
No:DT 10.00 20.00   0.00 20.00 10.00 20.00 20.50 10.00
students:NNS 20.00 14.34 20.00   0.00 20.00   0.75 10.50 20.00
arrived:VBD 20.00 10.00 20.00 14.29 20.00 13.50 15.50 20.00
the:DT 10.00 20.00 10.00 20.00   0.00 20.00 20.50 10.00
school:NN 20.00 14.34 20.00   0.75 20.00   0.00   7.73 19.99
Sunday:NNP 20.50 15.50 20.50 10.50 20.50   7.73   0.00 20.50
.:. 10.00 20.00 10.00 20.00 10.00 19.99 20.50   0.00
NO_WORD   1.00 10.00 10.00 10.00   1.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -15.0000
Features matched: Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Polarity.hypNegMarker: "no": no; NullPunisher.functionWord: There; Quant.bothNo: [no,no]; RootEntailment.poorlyAlignedRoot: "were" aligned badly to "arrived"
Hand-tuned score: 0.9000
Threshold: -3.3437


Inference ID: 21

Txt: There were no students at the school on Sunday.

Hyp: Some students arrived at the school on Sunday. (Don't know.)

Some
DT
students
NNS
arrived
VBD
the
DT
school
NN
Sunday
NNP
.
.
There:EX 10.00 20.00 20.00 10.00 20.00 20.50 10.00
were:VBD 20.00 14.34 10.00 20.00 14.34 15.50 20.00
no:DT 10.00 20.00 20.00 10.00 20.00 20.50 10.00
students:NNS 20.00   0.00 14.29 20.00   0.75 10.50 20.00
the:DT 10.00 20.00 20.00   0.00 20.00 20.50 10.00
school:NN 20.00   0.75 13.50 20.00   0.00   7.73 19.99
Sunday:NNP 20.50 10.50 15.50 20.50   7.73   0.00 20.50
.:. 10.00 20.00 20.00 10.00 19.99 20.50   0.00
NO_WORD 10.00 10.00 10.00   1.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -24.0000
Features matched: Antonym.samePol: matching polarity with antonyms: Some & no; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Polarity.txtNegMarker: "no": no; Quant.oneNo: [no,some[; RootEntailment.poorlyAlignedRoot: "arrived" aligned badly to "were"; Structure.argsMismatch: args have different parents but same relations: text "school" <-prep_at-- "students vs. hyp "school" <-prep_at-- "arrived", which aligned to text "were" args have different parents but same relations: text "Sunday" <-prep_on-- "school vs. hyp "Sunday" <-prep_on-- "arrived", which aligned to text "were"
Hand-tuned score: -17.0000
Threshold: -3.3437


Inference ID: -21

Txt: There were no students at the school on Sunday.

Hyp: No students arrived at the school on Sunday. (Yes.)

No
DT
students
NNS
arrived
VBD
the
DT
school
NN
Sunday
NNP
.
.
There:EX 10.00 20.00 20.00 10.00 20.00 20.50 10.00
were:VBD 20.00 14.34 10.00 20.00 14.34 15.50 20.00
no:DT   0.00 20.00 20.00 10.00 20.00 20.50 10.00
students:NNS 20.00   0.00 14.29 20.00   0.75 10.50 20.00
the:DT 10.00 20.00 20.00   0.00 20.00 20.50 10.00
school:NN 20.00   0.75 13.50 20.00   0.00   7.73 19.99
Sunday:NNP 20.50 10.50 15.50 20.50   7.73   0.00 20.50
.:. 10.00 20.00 20.00 10.00 19.99 20.50   0.00
NO_WORD 10.00 10.00 10.00   1.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -14.0000
Features matched: Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Polarity.txtNegMarker: "no": no; Quant.bothNo: [no,no]; RootEntailment.poorlyAlignedRoot: "arrived" aligned badly to "were"; Structure.argsMismatch: args have different parents but same relations: text "school" <-prep_at-- "students vs. hyp "school" <-prep_at-- "arrived", which aligned to text "were" args have different parents but same relations: text "Sunday" <-prep_on-- "school vs. hyp "Sunday" <-prep_on-- "arrived", which aligned to text "were"
Hand-tuned score: -2.0000
Threshold: -3.3437


Inference ID: 22

Txt: The diplomat left Baghdad last week.

Hyp: The diplomat has been to Baghdad. (Yes.)

The
DT
diplomat
NN
has
VBZ
been
VBN
Baghdad
NNP
.
.
The:DT   0.00 20.00 20.00 20.00 20.50 10.00
diplomat:NN 20.00   0.00 14.34 14.34   9.84 19.87
left:VBD 20.00 14.34   7.52   9.34 11.79 19.46
Baghdad:NNP 20.50   9.84 13.02 14.84   0.00 20.50
last:JJ 20.50 11.45 11.19 10.83 16.34 20.50
week:NN 20.50 10.50 14.19 15.50 15.00 17.43
.:. 10.00 19.87 20.00 20.00 20.50   0.00
NO_WORD   1.00 10.00   1.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -12.3420
Features matched: Adjunct.dropPosCxt: text adjunct "week" of "left" dropped on aligned hyp word "been"; NullPunisher.aux: has; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "left"; Structure.relMismatch: text "Baghdad" is dobj of "left" while hyp "Baghdad" is prep_to of "been" which aligned to text "left"
Hand-tuned score: -1.5500
Threshold: -3.3437


Inference ID: -22

Txt: The diplomat left Baghdad last week.

Hyp: The diplomat has not been to Baghdad. (Don't know.)

The
DT
diplomat
NN
has
VBZ
not
RB
been
VBN
Baghdad
NNP
.
.
The:DT   0.00 20.00 20.00 20.00 20.00 20.50 10.00
diplomat:NN 20.00   0.00 14.34 14.96 14.34   9.84 19.87
left:VBD 20.00 14.34   7.52 19.96   9.34 11.79 19.46
Baghdad:NNP 20.50   9.84 13.02 15.46 14.84   0.00 20.50
last:JJ 20.50 11.45 11.19 12.46 10.83 16.34 20.50
week:NN 20.50 10.50 14.19 15.46 15.50 15.00 17.43
.:. 10.00 19.87 20.00 20.00 20.00 20.50   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -21.3420
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "been": neg; NullPunisher.other: not; NullPunisher.aux: has; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "left"; Structure.relMismatch: text "Baghdad" is dobj of "left" while hyp "Baghdad" is prep_to of "been" which aligned to text "left"
Hand-tuned score: -8.0500
Threshold: -3.3437


Inference ID: 23

Txt: The diplomat will arrive in Baghdad next week.

Hyp: The diplomat has been to Baghdad. (Don't know.)

The
DT
diplomat
NN
has
VBZ
been
VBN
Baghdad
NNP
.
.
The:DT   0.00 20.00 20.00 20.00 20.50 10.00
diplomat:NN 20.00   0.00 14.34 14.34   9.84 19.87
will:MD 10.00 20.00 18.69 20.00 20.50 10.00
arrive:VB 20.00 13.56 10.00 10.00 15.50 20.00
Baghdad:NNP 20.50   9.84 13.02 14.84   0.00 20.50
next:JJ 20.00 11.96 11.96 11.96 12.46 20.00
week:NN 20.00 10.00 13.69 15.00 10.50 16.93
.:. 10.00 19.87 20.00 20.00 20.50   0.00
NO_WORD   1.00 10.00   1.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -12.0000
Features matched: Adjunct.dropPosCxt: text adjunct "week" of "arrive" dropped on aligned hyp word "been"; NullPunisher.aux: has; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "arrive"; Structure.relMismatch: text "Baghdad" is prep_in of "arrive" while hyp "Baghdad" is prep_to of "been" which aligned to text "arrive"
Hand-tuned score: -1.5500
Threshold: -3.3437


Inference ID: -23

Txt: The diplomat will arrive in Baghdad next week.

Hyp: The diplomat has not been to Baghdad. (Don't know.)

The
DT
diplomat
NN
has
VBZ
not
RB
been
VBN
Baghdad
NNP
.
.
The:DT   0.00 20.00 20.00 20.00 20.00 20.50 10.00
diplomat:NN 20.00   0.00 14.34 14.96 14.34   9.84 19.87
will:MD 10.00 20.00 18.69 19.96 20.00 20.50 10.00
arrive:VB 20.00 13.56 10.00 19.96 10.00 15.50 20.00
Baghdad:NNP 20.50   9.84 13.02 15.46 14.84   0.00 20.50
next:JJ 20.00 11.96 11.96 11.96 11.96 12.46 20.00
week:NN 20.00 10.00 13.69 14.96 15.00 10.50 16.93
.:. 10.00 19.87 20.00 20.00 20.00 20.50   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -21.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "been": neg; NullPunisher.aux: has; NullPunisher.other: not; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "arrive"; Structure.relMismatch: text "Baghdad" is prep_in of "arrive" while hyp "Baghdad" is prep_to of "been" which aligned to text "arrive"
Hand-tuned score: -8.0500
Threshold: -3.3437


Inference ID: 24

Txt: The president knows that the diplomat left Baghdad.

Hyp: The diplomat has been to Baghdad. (Yes.)

The
DT
diplomat
NN
has
VBZ
been
VBN
Baghdad
NNP
.
.
The:DT   0.00 20.00 20.00 20.00 20.50 10.00
president:NN 20.00   3.94 14.34 14.34   9.84 20.00
knows:VBZ 20.00 13.88 10.00   8.07 15.50 20.00
that:IN 20.00 20.00 20.00 20.00 20.50 20.00
the:DT   0.00 20.00 20.00 20.00 20.50 10.00
diplomat:NN 20.00   0.00 14.34 14.34   9.84 19.87
left:VBD 20.00 14.34   7.52   9.34 11.79 19.46
Baghdad:NNP 20.50   9.84 13.02 14.84   0.00 20.50
.:. 10.00 19.87 20.00 20.00 20.50   0.00
NO_WORD   1.00 10.00   1.00 10.00 10.00 10.00

Response: dontknow (INCORRECT)
Justification:
Alignment score: -13.0669
Features matched: NullPunisher.aux: has; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "knows"; Structure.argsMismatch: args have different parents but same relations: text "diplomat" <-nsubj-- "left vs. hyp "diplomat" <-nsubj-- "been", which aligned to text "knows" args have different parents, different relations: text "Baghdad" <-dobj-- "left" vs. hyp "Baghdad" <-prep_to-- "been", which aligned to text "knows"
Hand-tuned score: -4.0500
Threshold: -3.3437


Inference ID: -24

Txt: The president knows that the diplomat left Baghdad.

Hyp: The diplomat has not been to Baghdad. (Don't know.)

The
DT
diplomat
NN
has
VBZ
not
RB
been
VBN
Baghdad
NNP
.
.
The:DT   0.00 20.00 20.00 20.00 20.00 20.50 10.00
president:NN 20.00   3.94 14.34 14.96 14.34   9.84 20.00
knows:VBZ 20.00 13.88 10.00 19.96   8.07 15.50 20.00
that:IN 20.00 20.00 20.00 20.00 20.00 20.50 20.00
the:DT   0.00 20.00 20.00 20.00 20.00 20.50 10.00
diplomat:NN 20.00   0.00 14.34 14.96 14.34   9.84 19.87
left:VBD 20.00 14.34   7.52 19.96   9.34 11.79 19.46
Baghdad:NNP 20.50   9.84 13.02 15.46 14.84   0.00 20.50
.:. 10.00 19.87 20.00 20.00 20.00 20.50   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -22.0669
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "been": neg; NullPunisher.aux: has; NullPunisher.other: not; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "knows"; Structure.argsMismatch: args have different parents but same relations: text "diplomat" <-nsubj-- "left vs. hyp "diplomat" <-nsubj-- "been", which aligned to text "knows" args have different parents, different relations: text "Baghdad" <-dobj-- "left" vs. hyp "Baghdad" <-prep_to-- "been", which aligned to text "knows"
Hand-tuned score: -10.0500
Threshold: -3.3437


Inference ID: 25

Txt: The president hasn't gone to Iraq since the diplomat left Baghdad.

Hyp: The diplomat has been to Baghdad. (Yes.)

The
DT
diplomat
NN
has
VBZ
been
VBN
Baghdad
NNP
.
.
The:DT   0.00 20.00 20.00 20.00 20.50 10.00
president:NN 20.00   3.94 14.34 14.34   9.84 20.00
has:VBZ 20.00 14.34   0.00   9.34 13.02 20.00
n't:RB 20.00 14.17 19.96 19.96 15.46 17.90
gone:VBN 20.00 13.08   8.69   6.07 14.84 19.35
Iraq:NNP 20.50   9.84 13.02 14.84   2.00 20.50
since:IN 20.00 20.00 20.00 20.00 20.50 20.00
the:DT   0.00 20.00 20.00 20.00 20.50 10.00
diplomat:NN 20.00   0.00 14.34 14.34   9.84 19.87
left:VBD 20.00 14.34   7.52   9.34 11.79 19.46
Baghdad:NNP 20.50   9.84 13.02 14.84   0.00 20.50
.:. 10.00 19.87 20.00 20.00 20.50   0.00
NO_WORD   1.00 10.00   1.00 10.00 10.00 10.00

Response: dontknow (INCORRECT)
Justification:
Alignment score: -10.0708
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.txtNegMarker: "gone": neg; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "gone"; Structure.argsMismatch: args have different parents but same relations: text "diplomat" <-nsubj-- "left vs. hyp "diplomat" <-nsubj-- "been", which aligned to text "gone" args have different parents, different relations: text "Baghdad" <-dobj-- "left" vs. hyp "Baghdad" <-prep_to-- "been", which aligned to text "gone"
Hand-tuned score: -9.0000
Threshold: -3.3437


Inference ID: -25

Txt: The president hasn't gone to Iraq since the diplomat left Baghdad.

Hyp: The diplomat has not been to Baghdad. (Don't know.)

The
DT
diplomat
NN
has
VBZ
not
RB
been
VBN
Baghdad
NNP
.
.
The:DT   0.00 20.00 20.00 20.00 20.00 20.50 10.00
president:NN 20.00   3.94 14.34 14.96 14.34   9.84 20.00
has:VBZ 20.00 14.34   0.00 19.96   9.34 13.02 20.00
n't:RB 20.00 14.17 19.96   0.50 19.96 15.46 17.90
gone:VBN 20.00 13.08   8.69 19.96   6.07 14.84 19.35
Iraq:NNP 20.50   9.84 13.02 15.46 14.84   2.00 20.50
since:IN 20.00 20.00 20.00 20.00 20.00 20.50 20.00
the:DT   0.00 20.00 20.00 20.00 20.00 20.50 10.00
diplomat:NN 20.00   0.00 14.34 14.96 14.34   9.84 19.87
left:VBD 20.00 14.34   7.52 19.96   9.34 11.79 19.46
Baghdad:NNP 20.50   9.84 13.02 15.46 14.84   0.00 20.50
.:. 10.00 19.87 20.00 20.00 20.00 20.50   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -10.5708
Features matched: Adjunct.dropNegCxt: text adjunct "Iraq" of "gone" dropped on aligned hyp word "been"; Polarity.hypNegMarker: "been": neg; Polarity.txtNegMarker: "gone": neg; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "gone"; Structure.argsMismatch: args have different parents but same relations: text "diplomat" <-nsubj-- "left vs. hyp "diplomat" <-nsubj-- "been", which aligned to text "gone" args have different parents, different relations: text "Baghdad" <-dobj-- "left" vs. hyp "Baghdad" <-prep_to-- "been", which aligned to text "gone" ; Polarity.txtNegMarker&PolarityhypNegMarker:
Hand-tuned score: -7.0000
Threshold: -3.3437


Inference ID: 26

Txt: The president hasn't gone to Iraq since the diplomat left Baghdad.

Hyp: The president has been to Iraq. (Don't know.)

The
DT
president
NN
has
VBZ
been
VBN
Iraq
NNP
.
.
The:DT   0.00 20.00 20.00 20.00 20.50 10.00
president:NN 20.00   0.00 14.34 14.34   9.84 20.00
has:VBZ 20.00 14.34   0.00   9.34 13.02 20.00
n't:RB 20.00 14.96 19.96 19.96 15.46 17.90
gone:VBN 20.00 12.72   8.69   6.07 14.84 19.35
Iraq:NNP 20.50   9.84 13.02 14.84   0.00 20.50
since:IN 20.00 20.00 20.00 20.00 20.50 20.00
the:DT   0.00 20.00 20.00 20.00 20.50 10.00
diplomat:NN 20.00   3.94 14.34 14.34   9.84 19.87
left:VBD 20.00 13.35   7.52   9.34 12.61 19.46
Baghdad:NNP 20.50   9.84 13.02 14.84   2.00 20.50
.:. 10.00 20.00 20.00 20.00 20.50   0.00
NO_WORD   1.00 10.00   1.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -6.0708
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.txtNegMarker: "gone": neg; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "gone"
Hand-tuned score: -6.0000
Threshold: -3.3437


Inference ID: -26

Txt: The president hasn't gone to Iraq since the diplomat left Baghdad.

Hyp: The president has not been to Iraq. (Don't know.)

The
DT
president
NN
has
VBZ
not
RB
been
VBN
Iraq
NNP
.
.
The:DT   0.00 20.00 20.00 20.00 20.00 20.50 10.00
president:NN 20.00   0.00 14.34 14.96 14.34   9.84 20.00
has:VBZ 20.00 14.34   0.00 19.96   9.34 13.02 20.00
n't:RB 20.00 14.96 19.96   0.50 19.96 15.46 17.90
gone:VBN 20.00 12.72   8.69 19.96   6.07 14.84 19.35
Iraq:NNP 20.50   9.84 13.02 15.46 14.84   0.00 20.50
since:IN 20.00 20.00 20.00 20.00 20.00 20.50 20.00
the:DT   0.00 20.00 20.00 20.00 20.00 20.50 10.00
diplomat:NN 20.00   3.94 14.34 14.96 14.34   9.84 19.87
left:VBD 20.00 13.35   7.52 19.96   9.34 12.61 19.46
Baghdad:NNP 20.50   9.84 13.02 15.46 14.84   2.00 20.50
.:. 10.00 20.00 20.00 20.00 20.00 20.50   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -6.5708
Features matched: Polarity.hypNegMarker: "been": neg; Polarity.txtNegMarker: "gone": neg; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "gone"; Polarity.txtNegMarker&PolarityhypNegMarker:
Hand-tuned score: -1.0000
Threshold: -3.3437


Inference ID: 27

Txt: The diplomat didn't manage to leave Baghdad.

Hyp: The diplomat has been to Baghdad. (Yes.)

The
DT
diplomat
NN
has
VBZ
been
VBN
Baghdad
NNP
.
.
The:DT   0.00 20.00 20.00 20.00 20.50 10.00
diplomat:NN 20.00   0.00 14.34 14.34   9.84 19.87
did:VBD 20.00 15.00   7.53   6.07 15.50 17.99
n't:RB 20.00 14.17 19.96 19.96 15.46 17.90
manage:VB 20.00 15.00 10.00   8.07 15.50 19.98
to:TO 10.00 20.00 20.00 20.00 20.50 10.00
leave:VB 20.00 15.00   8.69   7.74 15.50 19.32
Baghdad:NNP 20.50   9.84 13.02 14.84   0.00 20.50
.:. 10.00 19.87 20.00 20.00 20.50   0.00
NO_WORD   1.00 10.00   1.00 10.00 10.00 10.00

Response: dontknow (INCORRECT)
Justification:
Alignment score: -11.0669
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.txtNegMarker: "manage": neg; NullPunisher.aux: has; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "manage"; Structure.parentsMismatch: args have different parents, different relations: text "Baghdad" <-dobj-- "leave" vs. hyp "Baghdad" <-prep_to-- "been", which aligned to text "manage"
Hand-tuned score: -9.0500
Threshold: -3.3437


Inference ID: -27

Txt: The diplomat didn't manage to leave Baghdad.

Hyp: The diplomat has not been to Baghdad. (Don't know.)

The
DT
diplomat
NN
has
VBZ
not
RB
been
VBN
Baghdad
NNP
.
.
The:DT   0.00 20.00 20.00 20.00 20.00 20.50 10.00
diplomat:NN 20.00   0.00 14.34 14.96 14.34   9.84 19.87
did:VBD 20.00 15.00   7.53 19.96   6.07 15.50 17.99
n't:RB 20.00 14.17 19.96   0.50 19.96 15.46 17.90
manage:VB 20.00 15.00 10.00 19.96   8.07 15.50 19.98
to:TO 10.00 20.00 20.00 20.00 20.00 20.50 10.00
leave:VB 20.00 15.00   8.69 19.96   7.74 15.50 19.32
Baghdad:NNP 20.50   9.84 13.02 15.46 14.84   0.00 20.50
.:. 10.00 19.87 20.00 20.00 20.00 20.50   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -11.5669
Features matched: Polarity.hypNegMarker: "been": neg; Polarity.txtNegMarker: "manage": neg; NullPunisher.aux: has; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "manage"; Structure.parentsMismatch: args have different parents, different relations: text "Baghdad" <-dobj-- "leave" vs. hyp "Baghdad" <-prep_to-- "been", which aligned to text "manage" ; Polarity.txtNegMarker&PolarityhypNegMarker:
Hand-tuned score: -4.0500
Threshold: -3.3437


Inference ID: 28

Txt: The diplomat hasn't managed to leave Baghdad.

Hyp: The diplomat is in Baghdad now. (Yes.)

The
DT
diplomat
NN
is
VBZ
Baghdad
NNP
now
RB
.
.
The:DT   0.00 20.00 20.00 20.50 20.00 10.00
diplomat:NN 20.00   0.00 14.34   9.84 15.00 19.87
has:VBZ 20.00 14.34   8.64 13.02 18.69 20.00
n't:RB 20.00 14.17 19.96 15.46   9.96 17.90
managed:VBN 20.00 15.00   8.07 15.50 20.00 20.00
to:TO 10.00 20.00 20.00 20.50 20.00 10.00
leave:VB 20.00 15.00   7.74 15.50 18.69 19.32
Baghdad:NNP 20.50   9.84 14.84   0.00 15.50 20.50
.:. 10.00 19.87 20.00 20.50 20.00   0.00
NO_WORD   1.00 10.00 10.00 10.00   9.00 10.00

Response: dontknow (INCORRECT)
Justification:
Alignment score: -19.0669
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.txtNegMarker: "managed": neg; NullPunisher.other: now; RootEntailment.poorlyAlignedRoot: "is" aligned badly to "managed"; Structure.parentsMismatch: args have different parents, different relations: text "Baghdad" <-dobj-- "leave" vs. hyp "Baghdad" <-prep_in-- "is", which aligned to text "managed"
Hand-tuned score: -10.0000
Threshold: -3.3437


Inference ID: -28

Txt: The diplomat hasn't managed to leave Baghdad.

Hyp: The diplomat is not in Baghdad now. (Don't know.)

The
DT
diplomat
NN
is
VBZ
not
RB
Baghdad
NNP
now
RB
.
.
The:DT   0.00 20.00 20.00 20.00 20.50 20.00 10.00
diplomat:NN 20.00   0.00 14.34 14.96   9.84 15.00 19.87
has:VBZ 20.00 14.34   8.64 19.96 13.02 18.69 20.00
n't:RB 20.00 14.17 19.96   0.50 15.46   9.96 17.90
managed:VBN 20.00 15.00   8.07 19.96 15.50 20.00 20.00
to:TO 10.00 20.00 20.00 20.00 20.50 20.00 10.00
leave:VB 20.00 15.00   7.74 19.96 15.50 18.69 19.32
Baghdad:NNP 20.50   9.84 14.84 15.46   0.00 15.50 20.50
.:. 10.00 19.87 20.00 20.00 20.50 20.00   0.00
NO_WORD   1.00 10.00 10.00   9.00 10.00   9.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -19.5669
Features matched: Adjunct.addNegCxt: hyp added now[now-RB]; Polarity.hypNegMarker: "is": neg; Polarity.txtNegMarker: "managed": neg; NullPunisher.other: now; RootEntailment.poorlyAlignedRoot: "is" aligned badly to "managed"; Structure.parentsMismatch: args have different parents, different relations: text "Baghdad" <-dobj-- "leave" vs. hyp "Baghdad" <-prep_in-- "is", which aligned to text "managed" ; Polarity.txtNegMarker&PolarityhypNegMarker:
Hand-tuned score: -4.0000
Threshold: -3.3437


Inference ID: 29

Txt: The room was full of intelligent women.

Hyp: The room was full of women. (Yes.)

The
DT
room
NN
was
VBD
full
JJ
women
NNS
.
.
The:DT   0.00 20.00 20.00 20.00 20.00 10.00
room:NN 20.00   0.00 14.34 10.93   8.13 19.15
was:VBD 20.00 14.34   0.00 12.00 14.34 20.00
full:JJ 20.00 10.93 12.00   0.00 11.15 17.26
intelligent:JJ 20.00 11.96 11.96   9.96   9.83 19.37
women:NNS 20.00   8.13 14.34 11.15   0.00 19.64
.:. 10.00 19.15 20.00 17.26 19.64   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "intelligent" of "women" dropped on aligned hyp word "women"; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 2.0000
Threshold: -3.3437


Inference ID: -29

Txt: The room was full of intelligent women.

Hyp: The room was not full of women. (Don't know.)

The
DT
room
NN
was
VBD
not
RB
full
JJ
women
NNS
.
.
The:DT   0.00 20.00 20.00 20.00 20.00 20.00 10.00
room:NN 20.00   0.00 14.34 14.96 10.93   8.13 19.15
was:VBD 20.00 14.34   0.00 19.96 12.00 14.34 20.00
full:JJ 20.00 10.93 12.00 11.96   0.00 11.15 17.26
intelligent:JJ 20.00 11.96 11.96 11.96   9.96   9.83 19.37
women:NNS 20.00   8.13 14.34 14.96 11.15   0.00 19.64
.:. 10.00 19.15 20.00 20.00 17.26 19.64   0.00
NO_WORD   1.00 10.00   1.00   9.00   9.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -9.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "full": neg; NullPunisher.other: not
Hand-tuned score: -5.0000
Threshold: -3.3437


Inference ID: 30

Txt: The room was full of women.

Hyp: The room was full of intelligent women. (Don't know.)

The
DT
room
NN
was
VBD
full
JJ
intelligent
JJ
women
NNS
.
.
The:DT   0.00 20.00 20.00 20.00 20.00 20.00 10.00
room:NN 20.00   0.00 14.34 10.93 11.96   8.13 19.15
was:VBD 20.00 14.34   0.00 12.00 11.96 14.34 20.00
full:JJ 20.00 10.93 12.00   0.00   9.96 11.15 17.26
women:NNS 20.00   8.13 14.34 11.15   9.83   0.00 19.64
.:. 10.00 19.15 20.00 17.26 19.37 19.64   0.00
NO_WORD   1.00 10.00   1.00   9.00   9.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -9.0000
Features matched: Adjunct.addPosCxt: hyp added intelligent[intelligent-JJ]; NullPunisher.other: intelligent
Hand-tuned score: -1.0000
Threshold: -3.3437


Inference ID: -30

Txt: The room was full of women.

Hyp: The room was not full of intelligent women. (Don't know.)

The
DT
room
NN
was
VBD
not
RB
full
JJ
intelligent
JJ
women
NNS
.
.
The:DT   0.00 20.00 20.00 20.00 20.00 20.00 20.00 10.00
room:NN 20.00   0.00 14.34 14.96 10.93 11.96   8.13 19.15
was:VBD 20.00 14.34   0.00 19.96 12.00 11.96 14.34 20.00
full:JJ 20.00 10.93 12.00 11.96   0.00   9.96 11.15 17.26
women:NNS 20.00   8.13 14.34 14.96 11.15   9.83   0.00 19.64
.:. 10.00 19.15 20.00 20.00 17.26 19.37 19.64   0.00
NO_WORD   1.00 10.00   1.00   9.00   9.00   9.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -18.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "full": neg; NullPunisher.other: not; NullPunisher.other: intelligent
Hand-tuned score: -6.0000
Threshold: -3.3437


Inference ID: 31

Txt: Children are not admitted to the theatre.

Hyp: Small children are admitted to the theatre. (Don't know.)

Small
JJ
children
NNS
are
VBP
admitted
VBN
the
DT
theater
NN
.
.
Children:NNP 11.34   0.00 15.00 15.00 20.00   8.95 20.00
are:VBP 10.69 15.00   0.00 10.00 20.00 15.00 20.00
not:RB 11.96 14.96 19.96 19.96 20.00 14.96 20.00
admitted:VBN 12.00 12.85 10.00   0.00 20.00 15.00 19.33
the:DT 20.00 20.00 20.00 20.00   0.00 20.00 10.00
theater:NN 11.34   8.95 15.00 15.00 20.00   0.00 20.00
.:. 20.00 19.49 20.00 19.33 10.00 20.00   0.00
NO_WORD   9.00 10.00   1.00 10.00   1.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -9.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.txtNegMarker: "admitted": neg; NullPunisher.other: Small
Hand-tuned score: -5.0000
Threshold: -3.3437


Inference ID: -31

Txt: Children are not admitted to the theatre.

Hyp: Small children are not admitted to the theatre. (Yes.)

Small
JJ
children
NNS
are
VBP
not
RB
admitted
VBN
the
DT
theater
NN
.
.
Children:NNP 11.34   0.00 15.00 14.96 15.00 20.00   8.95 20.00
are:VBP 10.69 15.00   0.00 19.96 10.00 20.00 15.00 20.00
not:RB 11.96 14.96 19.96   0.00 19.96 20.00 14.96 20.00
admitted:VBN 12.00 12.85 10.00 19.96   0.00 20.00 15.00 19.33
the:DT 20.00 20.00 20.00 20.00 20.00   0.00 20.00 10.00
theater:NN 11.34   8.95 15.00 14.96 15.00 20.00   0.00 20.00
.:. 20.00 19.49 20.00 20.00 19.33 10.00 20.00   0.00
NO_WORD   9.00 10.00   1.00   9.00 10.00   1.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -9.0000
Features matched: Adjunct.addNegCxt: hyp added Small[Small-JJ]; Polarity.hypNegMarker: "admitted": neg; Polarity.txtNegMarker: "admitted": neg; NullPunisher.other: Small; Polarity.txtNegMarker&PolarityhypNegMarker:
Hand-tuned score: 1.0000
Threshold: -3.3437


Inference ID: 32

Txt: Small children are not admitted to the theatre.

Hyp: Children are admitted to the theatre. (Don't know.)

Children
NNP
are
VBP
admitted
VBN
the
DT
theater
NN
.
.
Small:JJ 11.34 10.69 12.00 20.00 11.34 20.00
children:NNS   0.00 15.00 12.85 20.00   8.95 19.49
are:VBP 15.00   0.00 10.00 20.00 15.00 20.00
not:RB 14.96 19.96 19.96 20.00 14.96 20.00
admitted:VBN 15.00 10.00   0.00 20.00 15.00 19.33
the:DT 20.00 20.00 20.00   0.00 20.00 10.00
theater:NN   8.95 15.00 15.00 20.00   0.00 20.00
.:. 20.00 20.00 19.33 10.00 20.00   0.00
NO_WORD 10.00   1.00 10.00   1.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.txtNegMarker: "admitted": neg
Hand-tuned score: -4.0000
Threshold: -3.3437


Inference ID: -32

Txt: Small children are not admitted to the theatre.

Hyp: Children are not admitted to the theatre. (Don't know.)

Children
NNP
are
VBP
not
RB
admitted
VBN
the
DT
theater
NN
.
.
Small:JJ 11.34 10.69 11.96 12.00 20.00 11.34 20.00
children:NNS   0.00 15.00 14.96 12.85 20.00   8.95 19.49
are:VBP 15.00   0.00 19.96 10.00 20.00 15.00 20.00
not:RB 14.96 19.96   0.00 19.96 20.00 14.96 20.00
admitted:VBN 15.00 10.00 19.96   0.00 20.00 15.00 19.33
the:DT 20.00 20.00 20.00 20.00   0.00 20.00 10.00
theater:NN   8.95 15.00 14.96 15.00 20.00   0.00 20.00
.:. 20.00 20.00 20.00 19.33 10.00 20.00   0.00
NO_WORD 10.00   1.00   9.00 10.00   1.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropNegCxt: text adjunct "Small" of "children" dropped on aligned hyp word "Children"; Polarity.hypNegMarker: "admitted": neg; Polarity.txtNegMarker: "admitted": neg; Polarity.txtNegMarker&PolarityhypNegMarker:
Hand-tuned score: -2.0000
Threshold: -3.3437
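
Note on reading the Response lines: in every record of this log, the response is "yes" when the hand-tuned score is above the threshold and "dontknow" when it is below, and the CORRECT/INCORRECT flag agrees with comparing that response to the gold label in parentheses on the Hyp line. The Python sketch below re-derives both values from a record's text. It is an illustration inferred from the log only, not the system's actual code: the function names decide and check_record are hypothetical, and treating a score exactly equal to the threshold as "yes" is an assumption, since no record here shows that case.

```python
import re

def decide(hand_tuned_score: float, threshold: float) -> str:
    # Assumption: a score at or above the threshold yields "yes".
    # The log only shows scores strictly above or below the threshold.
    return "yes" if hand_tuned_score >= threshold else "dontknow"

def check_record(record: str) -> dict:
    """Re-derive the response and its correctness from one log record."""
    # Gold label is the parenthesised answer at the end of the Hyp line.
    gold = re.search(r"^Hyp: .*\((Yes|Don't know)\.\)\s*$", record, re.M).group(1)
    score = float(re.search(r"^Hand-tuned score: (-?\d+(?:\.\d+)?)", record, re.M).group(1))
    threshold = float(re.search(r"^Threshold: (-?\d+(?:\.\d+)?)", record, re.M).group(1))
    response = decide(score, threshold)
    gold_label = "yes" if gold == "Yes" else "dontknow"
    return {"response": response, "correct": response == gold_label}

# Abbreviated copies of the records for Inference ID 32 and -32 above.
record_32 = (
    "Hyp: Children are admitted to the theatre. (Don't know.)\n"
    "Hand-tuned score: -4.0000\n"
    "Threshold: -3.3437\n"
)
record_neg32 = (
    "Hyp: Children are not admitted to the theatre. (Don't know.)\n"
    "Hand-tuned score: -2.0000\n"
    "Threshold: -3.3437\n"
)
print(check_record(record_32))
print(check_record(record_neg32))
```

For the two records above, this reproduces the logged outcomes: ID 32 scores -4.0000, below the threshold, so the response is dontknow (CORRECT); ID -32 scores -2.0000, above the threshold, so the response is yes while the gold label is "Don't know" (INCORRECT).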


Inference ID: 33

Txt: All companies have to file annual reports.

Hyp: All Fortune 500 companies have to file annual reports. (Yes.)

All
DT
Fortune
JJ
500
CD
companies
NNS
have
VBP
to
TO
file
VB
annual
JJ
reports
NNS
.
.
All:DT   0.00 20.00 20.50 20.00 20.00 10.00 20.00 20.00 20.00 10.00
companies:NNS 20.00   9.44 19.02   0.00 12.80 20.00 13.13 10.11   7.80 19.68
have:VBP 20.00 12.00 20.50 12.80   0.00 20.00   7.02 10.11 12.80 20.00
to:TO 10.00 20.00 20.50 20.00 20.00   0.00 20.00 20.00 20.00 10.00
file:VB 20.00 12.00 19.19 13.13   7.02 20.00   0.00 10.03 12.37 20.00
annual:JJ 20.00 10.00 19.41 10.11 10.11 20.00 10.03   0.00 10.23 20.00
reports:NNS 20.00 12.00 19.19   7.80 12.80 20.00 12.37 10.23   0.00 19.70
.:. 10.00 20.00 19.65 19.68 20.00 10.00 20.00 20.00 19.70   0.00
NO_WORD 10.00   9.00 10.00 10.00 10.00 10.00 10.00   9.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -19.0000
Features matched: Adjunct.addAllCxt: all cxt -- hyp adds 500[500-CD];, hyp added 500[500-CD]; Modal.weakYes: necessary -> necessary; NullPunisher.other: 500; NullPunisher.other: Fortune; Quant.contract: [all,all]
Hand-tuned score: 4.0000
Threshold: -3.3437


Inference ID: -33

Txt: All companies have to file annual reports.

Hyp: Not all Fortune 500 companies have to file annual reports. (Don't know.)

Not
RB
all
DT
Fortune
JJ
500
CD
companies
NNS
have
VBP
to
TO
file
VB
annual
JJ
reports
NNS
.
.
All:DT 20.00   0.00 20.00 20.50 20.00 20.00 10.00 20.00 20.00 20.00 10.00
companies:NNS 14.96 20.00   9.44 19.02   0.00 12.80 20.00 13.13 10.11   7.80 19.68
have:VBP 19.96 20.00 12.00 20.50 12.80   0.00 20.00   7.02 10.11 12.80 20.00
to:TO 20.00 10.00 20.00 20.50 20.00 20.00   0.00 20.00 20.00 20.00 10.00
file:VB 19.96 20.00 12.00 19.19 13.13   7.02 20.00   0.00 10.03 12.37 20.00
annual:JJ 11.96 20.00 10.00 19.41 10.11 10.11 20.00 10.03   0.00 10.23 20.00
reports:NNS 14.96 20.00 12.00 19.19   7.80 12.80 20.00 12.37 10.23   0.00 19.70
.:. 20.00 10.00 20.00 19.65 19.68 20.00 10.00 20.00 20.00 19.70   0.00
NO_WORD   9.00 10.00   9.00 10.00 10.00 10.00 10.00 10.00   9.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -28.0000
Features matched: Adjunct.addAllCxt: all cxt -- hyp adds 500[500-CD];, hyp added 500[500-CD]; Modal.weakYes: necessary -> necessary; NullPunisher.other: Fortune; NullPunisher.other: Not; NullPunisher.other: 500; Quant.contract: [all,all]
Hand-tuned score: 3.0000
Threshold: -3.3437


Inference ID: 34

Txt: All Fortune 500 companies have to file annual reports.

Hyp: All companies have to file annual reports. (Don't know.)

All
DT
companies
NNS
have
VBP
to
TO
file
VB
annual
JJ
reports
NNS
.
.
All:DT   0.00 20.00 20.00 10.00 20.00 20.00 20.00 10.00
Fortune:JJ 20.00   9.44 12.00 20.00 12.00 10.00 12.00 20.00
500:CD 20.50 19.02 20.50 20.50 19.19 19.41 19.19 19.65
companies:NNS 20.00   0.00 12.80 20.00 13.13 10.11   7.80 19.68
have:VBP 20.00 12.80   0.00 20.00   7.02 10.11 12.80 20.00
to:TO 10.00 20.00 20.00   0.00 20.00 20.00 20.00 10.00
file:VB 20.00 13.13   7.02 20.00   0.00 10.03 12.37 20.00
annual:JJ 20.00 10.11 10.11 20.00 10.03   0.00 10.23 20.00
reports:NNS 20.00   7.80 12.80 20.00 12.37 10.23   0.00 19.70
.:. 10.00 19.68 20.00 10.00 20.00 20.00 19.70   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00   9.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropAllCxt: all cxt -- hyp drops adjunct 500 of companies aligned to hyp companies, text adjunct "500" of "companies" dropped on aligned hyp word "companies"; Modal.weakYes: necessary -> necessary; Quant.contract: [all,all]
Hand-tuned score: 0.0000
Threshold: -3.3437


Inference ID: -34

Txt: All Fortune 500 companies have to file annual reports.

Hyp: Not all companies have to file annual reports. (Don't know.)

Not
RB
all
DT
companies
NNS
have
VBP
to
TO
file
VB
annual
JJ
reports
NNS
.
.
All:DT 20.00   0.00 20.00 20.00 10.00 20.00 20.00 20.00 10.00
Fortune:JJ 11.96 20.00   9.44 12.00 20.00 12.00 10.00 12.00 20.00
500:CD 20.46 20.50 19.02 20.50 20.50 19.19 19.41 19.19 19.65
companies:NNS 14.96 20.00   0.00 12.80 20.00 13.13 10.11   7.80 19.68
have:VBP 19.96 20.00 12.80   0.00 20.00   7.02 10.11 12.80 20.00
to:TO 20.00 10.00 20.00 20.00   0.00 20.00 20.00 20.00 10.00
file:VB 19.96 20.00 13.13   7.02 20.00   0.00 10.03 12.37 20.00
annual:JJ 11.96 20.00 10.11 10.11 20.00 10.03   0.00 10.23 20.00
reports:NNS 14.96 20.00   7.80 12.80 20.00 12.37 10.23   0.00 19.70
.:. 20.00 10.00 19.68 20.00 10.00 20.00 20.00 19.70   0.00
NO_WORD   9.00 10.00 10.00 10.00 10.00 10.00   9.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -9.0000
Features matched: Adjunct.dropAllCxt: all cxt -- hyp drops adjunct 500 of companies aligned to hyp companies, text adjunct "500" of "companies" dropped on aligned hyp word "companies", hyp added Not[Not-RB]; Modal.weakYes: necessary -> necessary; NullPunisher.other: Not; Quant.contract: [all,all]
Hand-tuned score: -1.0000
Threshold: -3.3437


Inference ID: 35

Txt: All companies have to file annual reports to the sec.

Hyp: All companies have to file annual reports. (Yes.)

All
DT
companies
NNS
have
VBP
to
TO
file
VB
annual
JJ
reports
NNS
.
.
All:DT   0.00 20.00 20.00 10.00 20.00 20.00 20.00 10.00
companies:NNS 20.00   0.00 12.80 20.00 13.13 10.11   7.80 19.68
have:VBP 20.00 12.80   0.00 20.00   7.02 10.11 12.80 20.00
to:TO 10.00 20.00 20.00   0.00 20.00 20.00 20.00 10.00
file:VB 20.00 13.13   7.02 20.00   0.00 10.03 12.37 20.00
annual:JJ 20.00 10.11 10.11 20.00 10.03   0.00 10.23 20.00
reports:NNS 20.00   7.80 12.80 20.00 12.37 10.23   0.00 19.70
the:DT 10.00 20.00 20.00 10.00 20.00 20.00 20.00 10.00
sec:NN 20.00   6.69 15.00 20.00 12.11 12.00   7.76 19.27
.:. 10.00 19.68 20.00 10.00 20.00 20.00 19.70   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00   9.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "sec" of "file" dropped on aligned hyp word "file"; Modal.weakYes: necessary -> necessary; Quant.contract: [all,all]; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 4.0000
Threshold: -3.3437


Inference ID: -35

Txt: All companies have to file annual reports to the sec.

Hyp: Not all companies have to file annual reports. (Don't know.)

Not
RB
all
DT
companies
NNS
have
VBP
to
TO
file
VB
annual
JJ
reports
NNS
.
.
All:DT 20.00   0.00 20.00 20.00 10.00 20.00 20.00 20.00 10.00
companies:NNS 14.96 20.00   0.00 12.80 20.00 13.13 10.11   7.80 19.68
have:VBP 19.96 20.00 12.80   0.00 20.00   7.02 10.11 12.80 20.00
to:TO 20.00 10.00 20.00 20.00   0.00 20.00 20.00 20.00 10.00
file:VB 19.96 20.00 13.13   7.02 20.00   0.00 10.03 12.37 20.00
annual:JJ 11.96 20.00 10.11 10.11 20.00 10.03   0.00 10.23 20.00
reports:NNS 14.96 20.00   7.80 12.80 20.00 12.37 10.23   0.00 19.70
the:DT 20.00 10.00 20.00 20.00 10.00 20.00 20.00 20.00 10.00
sec:NN 14.96 20.00   6.69 15.00 20.00 12.11 12.00   7.76 19.27
.:. 20.00 10.00 19.68 20.00 10.00 20.00 20.00 19.70   0.00
NO_WORD   9.00 10.00 10.00 10.00 10.00 10.00   9.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -9.0000
Features matched: Adjunct.addAllCxt: all cxt -- hyp adds Not[Not-RB];, text adjunct "sec" of "file" dropped on aligned hyp word "file", hyp added Not[Not-RB]; Modal.weakYes: necessary -> necessary; NullPunisher.other: Not; Quant.contract: [all,all]
Hand-tuned score: 5.0000
Threshold: -3.3437


Inference ID: 36

Txt: All companies have to file annual reports.

Hyp: All companies have to file annual reports to the sec. (Don't know.)

All
DT
companies
NNS
have
VBP
to
TO
file
VB
annual
JJ
reports
NNS
the
DT
sec
NN
.
.
All:DT   0.00 20.00 20.00 10.00 20.00 20.00 20.00 10.00 20.00 10.00
companies:NNS 20.00   0.00 12.80 20.00 13.13 10.11   7.80 20.00   6.69 19.68
have:VBP 20.00 12.80   0.00 20.00   7.02 10.11 12.80 20.00 15.00 20.00
to:TO 10.00 20.00 20.00   0.00 20.00 20.00 20.00 10.00 20.00 10.00
file:VB 20.00 13.13   7.02 20.00   0.00 10.03 12.37 20.00 12.11 20.00
annual:JJ 20.00 10.11 10.11 20.00 10.03   0.00 10.23 20.00 12.00 20.00
reports:NNS 20.00   7.80 12.80 20.00 12.37 10.23   0.00 20.00   7.76 19.70
.:. 10.00 19.68 20.00 10.00 20.00 20.00 19.70 10.00 19.27   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00   9.00 10.00   1.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -9.6929
Features matched: Modal.weakYes: necessary -> necessary; NullPunisher.article: the; Quant.contract: [all,all]; Quant.contract: [all,the]
Hand-tuned score: 3.9000
Threshold: -3.3437


Inference ID: -36

Txt: All companies have to file annual reports.

Hyp: Not all companies have to file annual reports to the sec. (Don't know.)

Not
RB
all
DT
companies
NNS
have
VBP
to
TO
file
VB
annual
JJ
reports
NNS
the
DT
sec
NN
.
.
All:DT 20.00   0.00 20.00 20.00 10.00 20.00 20.00 20.00 10.00 20.00 10.00
companies:NNS 14.96 20.00   0.00 12.80 20.00 13.13 10.11   7.80 20.00   6.69 19.68
have:VBP 19.96 20.00 12.80   0.00 20.00   7.02 10.11 12.80 20.00 15.00 20.00
to:TO 20.00 10.00 20.00 20.00   0.00 20.00 20.00 20.00 10.00 20.00 10.00
file:VB 19.96 20.00 13.13   7.02 20.00   0.00 10.03 12.37 20.00 12.11 20.00
annual:JJ 11.96 20.00 10.11 10.11 20.00 10.03   0.00 10.23 20.00 12.00 20.00
reports:NNS 14.96 20.00   7.80 12.80 20.00 12.37 10.23   0.00 20.00   7.76 19.70
.:. 20.00 10.00 19.68 20.00 10.00 20.00 20.00 19.70 10.00 19.27   0.00
NO_WORD   9.00 10.00 10.00 10.00 10.00 10.00   9.00 10.00   1.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -18.6929
Features matched: Adjunct.addAllCxt: all cxt -- hyp adds Not[Not-RB];, hyp added Not[Not-RB]; Modal.weakYes: necessary -> necessary; NullPunisher.article: the; NullPunisher.other: Not; Quant.contract: [all,all]; Quant.contract: [all,the]
Hand-tuned score: 5.9000
Threshold: -3.3437


Inference ID: 37

Txt: No delegates finished the report.

Hyp: Some delegates finished the report on time. (Don't know.)

Some
DT
delegates
NNS
finished
VBD
the
DT
report
NN
on_time
IN
.
.
No:DT 10.00 20.00 20.00 10.00 20.00 20.00 10.00
delegates:NNS 20.00   0.00 14.34 20.00   9.57 20.00 20.00
finished:VBD 20.00 14.34   0.00 20.00 11.89 20.00 18.93
the:DT 10.00 20.00 20.00   0.00 20.00 20.00 10.00
report:NN 20.00   9.57 11.89 20.00   0.00 20.00 19.87
.:. 10.00 20.00 18.93 10.00 19.87 20.00   0.00
NO_WORD 10.00 10.00 10.00   1.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -20.0000
Features matched: NullPunisher.other: Some; NullPunisher.other: on_time; Quant.oneNo: [no,some[
Hand-tuned score: -7.0000
Threshold: -3.3437


Inference ID: -37

Txt: No delegates finished the report.

Hyp: No delegates finish the report on time. (Yes.)

No
DT
delegates
NNS
finish
VBP
the
DT
report
NN
on_time
IN
.
.
No:DT   0.00 20.00 20.00 10.00 20.00 20.00 10.00
delegates:NNS 20.00   0.00 13.85 20.00   9.57 20.00 20.00
finished:VBD 20.00 14.34   0.50 20.00 11.89 20.00 18.93
the:DT 10.00 20.00 20.00   0.00 20.00 20.00 10.00
report:NN 20.00   9.57 11.89 20.00   0.00 20.00 19.87
.:. 10.00 20.00 18.26 10.00 19.87 20.00   0.00
NO_WORD 10.00 10.00 10.00   1.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -10.5000
Features matched: NullPunisher.other: on_time; Quant.bothNo: [no,no]
Hand-tuned score: 1.0000
Threshold: -3.3437


Inference ID: 38

Txt: The US troops stayed in Iraq although the war was over.

Hyp: The war was over. (Yes.)

The
DT
war
NN
was
VBD
over
RP
.
.
The:DT   0.00 20.00 20.00 10.00 10.00
US:NNP 20.50 10.50 11.83 20.50 20.50
troops:NNS 20.00   5.07 15.00 20.00 20.00
stayed:VBD 20.00 12.44   9.34 18.69 18.18
Iraq:NNP 20.50 10.50 11.83 20.50 20.50
although:IN 20.00 20.00 20.00 20.00 20.00
the:DT   0.00 20.00 20.00 10.00 10.00
war:NN 20.00   0.00 15.00 20.00 19.45
was:VBD 20.00 15.00   0.00 20.00 20.00
over:RP 10.00 20.00 20.00   0.00 10.00
.:. 10.00 19.45 20.00 10.00   0.00
NO_WORD   1.00 10.00   1.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "stayed vs. hyp "." <-punct-- "over", which aligned to text "over"
Hand-tuned score: -2.0000
Threshold: -3.3437


Inference ID: -38

Txt: The US troops stayed in Iraq although the war was over.

Hyp: The war was not over. (Don't know.)

The
DT
war
NN
was
VBD
not
RB
over
RP
.
.
The:DT   0.00 20.00 20.00 20.00 10.00 10.00
US:NNP 20.50 10.50 11.83 15.46 20.50 20.50
troops:NNS 20.00   5.07 15.00 14.96 20.00 20.00
stayed:VBD 20.00 12.44   9.34 19.96 18.69 18.18
Iraq:NNP 20.50 10.50 11.83 15.46 20.50 20.50
although:IN 20.00 20.00 20.00 20.00 20.00 20.00
the:DT   0.00 20.00 20.00 20.00 10.00 10.00
war:NN 20.00   0.00 15.00 14.96 20.00 19.45
was:VBD 20.00 15.00   0.00 19.96 20.00 20.00
over:RP 10.00 20.00 20.00 19.96   0.00 10.00
.:. 10.00 19.45 20.00 20.00 10.00   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "over": neg; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "stayed vs. hyp "." <-punct-- "over", which aligned to text "over"
Hand-tuned score: -8.0000
Threshold: -3.3437


Inference ID: 39

Txt: Since it was cold, he closed the window.

Hyp: It was cold. (Yes.)

It
PRP
was
VBD
cold
JJ
.
.
Since:IN 20.00 20.00 20.00 20.00
it:PRP   0.00 15.00 15.00 20.00
was:VBD 15.00   0.00 11.34 20.00
cold:JJ 15.00 11.34   0.00 19.61
,:, 20.00 20.00 20.00   5.73
he:PRP 10.00 15.00 15.00 20.00
closed:VBD 15.00 10.00   9.84 19.49
the:DT 20.00 20.00 20.00 10.00
window:NN 12.00 12.52   9.28 19.62
.:. 20.00 20.00 19.61   0.00
NO_WORD 10.00   1.00   9.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "closed vs. hyp "." <-punct-- "cold", which aligned to text "cold"
Hand-tuned score: -2.0000
Threshold: -3.3437


Inference ID: -39

Txt: Since it was cold, he closed the window.

Hyp: It was not cold. (Don't know.)

It
PRP
was
VBD
not
RB
cold
JJ
.
.
Since:IN 20.00 20.00 20.00 20.00 20.00
it:PRP   0.00 15.00 20.00 15.00 20.00
was:VBD 15.00   0.00 19.96 11.34 20.00
cold:JJ 15.00 11.34 11.96   0.00 19.61
,:, 20.00 20.00 20.00 20.00   5.73
he:PRP 10.00 15.00 20.00 15.00 20.00
closed:VBD 15.00 10.00 19.96   9.84 19.49
the:DT 20.00 20.00 20.00 20.00 10.00
window:NN 12.00 12.52 14.96   9.28 19.62
.:. 20.00 20.00 20.00 19.61   0.00
NO_WORD 10.00   1.00   9.00   9.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "cold": neg; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "closed vs. hyp "." <-punct-- "cold", which aligned to text "cold"
Hand-tuned score: -8.0000
Threshold: -3.3437


Inference ID: 40

Txt: John didn't visit us after he returned from Spain.

Hyp: John returned from Spain. (Yes.)

John
NNP
returned
VBD
Spain
NNP
.
.
John:NNP   0.00 12.27 14.34 20.50
did:VBD 13.35   6.77 15.50 17.99
n't:RB 15.46 19.96 15.46 17.90
visit:VB 15.50   7.10 15.50 20.00
us:PRP 12.50 15.00 12.50 20.00
after:IN 20.50 20.00 20.50 20.00
he:PRP 12.50 15.00 12.50 20.00
returned:VBD 12.27   0.00 14.84 19.41
Spain:NNP 14.34 14.84   0.00 20.50
.:. 20.50 19.41 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -4.0000
Features matched: Structure.argsMismatch: args have different parents but same relations: text "John" <-nsubj-- "visit vs. hyp "John" <-nsubj-- "returned", which aligned to text "returned" args have different parents but same relations: text "." <-punct-- "visit vs. hyp "." <-punct-- "returned", which aligned to text "returned"
Hand-tuned score: -2.0000
Threshold: -3.3437


Inference ID: -40

Txt: John didn't visit us after he returned from Spain.

Hyp: John did not return from Spain. (Don't know.)

John
NNP
did
VBD
not
RB
return
VB
Spain
NNP
.
.
John:NNP   0.00 13.35 15.46 12.27 14.34 20.50
did:VBD 13.35   0.00 19.96   5.85 15.50 17.99
n't:RB 15.46 13.27   0.50 17.74 15.46 17.90
visit:VB 15.50   7.62 19.96   7.10 15.50 20.00
us:PRP 12.50 15.00 20.00 15.00 12.50 20.00
after:IN 20.50 20.00 20.00 20.00 20.50 20.00
he:PRP 12.50 15.00 20.00 15.00 12.50 20.00
returned:VBD 12.27   6.77 19.96   0.31 14.84 19.41
Spain:NNP 14.34 15.50 15.46 14.84   0.00 20.50
.:. 20.50 17.99 20.00 19.01 20.50   0.00
NO_WORD 10.00   1.00   9.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -7.8094
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "return": neg; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "John" <-nsubj-- "visit vs. hyp "John" <-nsubj-- "return", which aligned to text "returned" args have different parents but same relations: text "n't" <-neg-- "visit vs. hyp "not" <-neg-- "return", which aligned to text "returned" args have different parents but same relations: text "." <-punct-- "visit vs. hyp "." <-punct-- "return", which aligned to text "returned"
Hand-tuned score: -7.0500
Threshold: -3.3437


Inference ID: 41

Txt: Hanssen, who sold FBI secrets to the Russians, could face the death penalty.

Hyp: Hanssen sold FBI secrets to the Russians. (Yes.)

Hanssen
NNP
sold
VBD
FBI
NNP
secrets
NNS
the
DT
Russians
NNPS
.
.
Hanssen:NNP   0.00 15.46 14.96 10.46 20.50 14.96 20.50
,:, 20.50 19.81 20.50 20.00 10.00 20.50   5.73
who:WP 12.50 15.00 12.50 12.00 20.00 12.50 20.00
sold:VBD 15.46   0.00 15.50 13.55 20.00 15.50 19.42
FBI:NNP 14.96 15.50   0.00 10.50 20.50 15.00 20.50
secrets:NNS 10.46 13.55 10.50   0.00 20.00   8.35 20.00
the:DT 20.50 20.00 20.50 20.00   0.00 20.50 10.00
Russians:NNPS 14.96 15.50 15.00   8.35 20.50   0.00 20.50
,:, 20.50 19.81 20.50 20.00 10.00 20.50   5.73
could:MD 20.46 19.96 20.46 19.96 10.00 20.46 10.00
face:VB 15.46   8.07 15.50 12.44 20.00 13.35 17.99
the:DT 20.50 20.00 20.50 20.00   0.00 20.50 10.00
death_penalty:NN 10.46 12.10 10.50   8.72 20.00   9.84 20.00
.:. 20.50 19.42 20.50 20.00 10.00 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00   1.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Structure.argsMismatch: args have different parents but same relations: text "Hanssen" <-nsubj-- "face vs. hyp "Hanssen" <-nsubj-- "sold", which aligned to text "sold" args have different parents but same relations: text "." <-punct-- "face vs. hyp "." <-punct-- "sold", which aligned to text "sold"
Hand-tuned score: -2.0000
Threshold: -3.3437


Inference ID: -41

Txt: Hanssen, who sold FBI secrets to the Russians, could face the death penalty.

Hyp: Hanssen did not sell FBI secrets to the Russians. (Don't know.)

Hanssen
NNP
did
VBD
not
RB
sell
VB
FBI
NNP
secrets
NNS
the
DT
Russians
NNPS
.
.
Hanssen:NNP   0.00 15.46 15.46 15.46 14.96 10.46 20.50 14.96 20.50
,:, 20.50 19.80 20.00 19.98 20.50 20.00 10.00 20.50   5.73
who:WP 12.50 15.00 20.00 15.00 12.50 12.00 20.00 12.50 20.00
sold:VBD 15.46   7.69 19.96   0.50 15.50 13.55 20.00 15.50 19.42
FBI:NNP 14.96 15.50 15.46 15.50   0.00 10.50 20.50 15.00 20.50
secrets:NNS 10.46 12.85 14.96 14.18 10.50   0.00 20.00   8.35 20.00
the:DT 20.50 20.00 20.00 20.00 20.50 20.00   0.00 20.50 10.00
Russians:NNPS 14.96 13.35 15.46 15.50 15.00   8.35 20.50   0.00 20.50
,:, 20.50 19.80 20.00 19.98 20.50 20.00 10.00 20.50   5.73
could:MD 20.46 17.84 19.96 19.96 20.46 19.96 10.00 20.46 10.00
face:VB 15.46   4.55 19.96   8.07 15.50 12.44 20.00 13.35 17.99
the:DT 20.50 20.00 20.00 20.00 20.50 20.00   0.00 20.50 10.00
death_penalty:NN 10.46 12.69 14.96 12.10 10.50   8.72 20.00   9.84 20.00
.:. 20.50 17.99 20.00 19.05 20.50 20.00 10.00 20.50   0.00
NO_WORD 10.00   1.00   9.00 10.00 10.00 10.00   1.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -12.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "sell": neg; NullPunisher.aux: did; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "Hanssen" <-nsubj-- "face vs. hyp "Hanssen" <-nsubj-- "sell", which aligned to text "sold" args have different parents but same relations: text "." <-punct-- "face vs. hyp "." <-punct-- "sell", which aligned to text "sold"
Hand-tuned score: -8.0500
Threshold: -3.3437


Inference ID: 42

Txt: The New York Times reported that Hanssen, who sold fbi secrets to the Russians, could face the death penalty.

Hyp: Hanssen sold fbi secrets to the Russians. (Yes.)

Hanssen
NNP
sold
VBD
fbi
NN
secrets
NNS
the
DT
Russians
NNPS
.
.
The:DT 20.50 20.00 20.00 20.00   0.00 20.50 10.00
New_York_Times:NNPS 14.96 14.68   9.41   9.84 20.50 14.34 20.50
reported:VBD 15.46   7.69 12.96 11.05 20.00 13.35 19.71
that:IN 20.50 20.00 20.00 20.00 20.00 20.50 20.00
Hanssen:NNP   0.00 15.46 10.46 10.46 20.50 14.96 20.50
,:, 20.50 19.81 20.00 20.00 10.00 20.50   5.73
who:WP 12.50 15.00 12.00 12.00 20.00 12.50 20.00
sold:VBD 15.46   0.00 14.77 13.55 20.00 15.50 19.42
fbi:NN 10.46 14.77   0.00   7.33 20.00 10.50 19.70
secrets:NNS 10.46 13.55   7.33   0.00 20.00   8.35 20.00
the:DT 20.50 20.00 20.00 20.00   0.00 20.50 10.00
Russians:NNPS 14.96 15.50 10.50   8.35 20.50   0.00 20.50
,:, 20.50 19.81 20.00 20.00 10.00 20.50   5.73
could:MD 20.46 19.96 19.96 19.96 10.00 20.46 10.00
face:VB 15.46   8.07 15.00 12.44 20.00 13.35 17.99
the:DT 20.50 20.00 20.00 20.00   0.00 20.50 10.00
death_penalty:NN 10.46 12.10 10.00   8.72 20.00   9.84 20.00
.:. 20.50 19.42 19.70 20.00 10.00 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00   1.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Structure.argsMismatch: args have different parents but same relations: text "Hanssen" <-nsubj-- "face vs. hyp "Hanssen" <-nsubj-- "sold", which aligned to text "sold" args have different parents but same relations: text "." <-punct-- "reported vs. hyp "." <-punct-- "sold", which aligned to text "sold"
Hand-tuned score: -2.0000
Threshold: -3.3437


Inference ID: -42

Txt: The New York Times reported that Hanssen, who sold fbi secrets to the Russians, could face the death penalty.

Hyp: Hanssen did not sell fbi secrets to the Russians. (Don't know.)

Hanssen
NNP
did
VBD
not
RB
sell
VB
fbi
NN
secrets
NNS
the
DT
Russians
NNPS
.
.
The:DT 20.50 20.00 20.00 20.00 20.00 20.00   0.00 20.50 10.00
New_York_Times:NNPS 14.96 14.84 15.46 14.68   9.41   9.84 20.50 14.34 20.50
reported:VBD 15.46   7.69 19.96   7.69 12.96 11.05 20.00 13.35 19.71
that:IN 20.50 20.00 20.00 20.00 20.00 20.00 20.00 20.50 20.00
Hanssen:NNP   0.00 15.46 15.46 15.46 10.46 10.46 20.50 14.96 20.50
,:, 20.50 19.80 20.00 19.98 20.00 20.00 10.00 20.50   5.73
who:WP 12.50 15.00 20.00 15.00 12.00 12.00 20.00 12.50 20.00
sold:VBD 15.46   7.69 19.96   0.50 14.77 13.55 20.00 15.50 19.42
fbi:NN 10.46 13.20 14.96 15.00   0.00   7.33 20.00 10.50 19.70
secrets:NNS 10.46 12.85 14.96 14.18   7.33   0.00 20.00   8.35 20.00
the:DT 20.50 20.00 20.00 20.00 20.00 20.00   0.00 20.50 10.00
Russians:NNPS 14.96 13.35 15.46 15.50 10.50   8.35 20.50   0.00 20.50
,:, 20.50 19.80 20.00 19.98 20.00 20.00 10.00 20.50   5.73
could:MD 20.46 17.84 19.96 19.96 19.96 19.96 10.00 20.46 10.00
face:VB 15.46   4.55 19.96   8.07 15.00 12.44 20.00 13.35 17.99
the:DT 20.50 20.00 20.00 20.00 20.00 20.00   0.00 20.50 10.00
death_penalty:NN 10.46 12.69 14.96 12.10 10.00   8.72 20.00   9.84 20.00
.:. 20.50 17.99 20.00 19.05 19.70 20.00 10.00 20.50   0.00
NO_WORD 10.00   1.00   9.00 10.00 10.00 10.00   1.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -12.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "sell": neg; NullPunisher.other: not; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "Hanssen" <-nsubj-- "face vs. hyp "Hanssen" <-nsubj-- "sell", which aligned to text "sold" args have different parents but same relations: text "." <-punct-- "reported vs. hyp "." <-punct-- "sell", which aligned to text "sold"
Hand-tuned score: -8.0500
Threshold: -3.3437


Inference ID: 43

Txt: The New York Times reported that Hanssen sold fbi secrets to the Russians and could face the death penalty.

Hyp: Hanssen sold fbi secrets to the Russians. (Don't know.)

Hanssen
NNP
sold
VBD
fbi
NN
secrets
NNS
the
DT
Russians
NNPS
.
.
The:DT 20.50 20.00 20.00 20.00   0.00 20.50 10.00
New_York_Times:NNPS 14.96 14.68   9.41   9.84 20.50 14.34 20.50
reported:VBD 15.46   7.69 12.96 11.05 20.00 13.35 19.71
that:IN 20.50 20.00 20.00 20.00 20.00 20.50 20.00
Hanssen:NNP   0.00 15.46 10.46 10.46 20.50 14.96 20.50
sold:VBD 15.46   0.00 14.77 13.55 20.00 15.50 19.42
fbi:NN 10.46 14.77   0.00   7.33 20.00 10.50 19.70
secrets:NNS 10.46 13.55   7.33   0.00 20.00   8.35 20.00
the:DT 20.50 20.00 20.00 20.00   0.00 20.50 10.00
Russians:NNPS 14.96 15.50 10.50   8.35 20.50   0.00 20.50
could:MD 20.46 19.96 19.96 19.96 10.00 20.46 10.00
face:VB 15.46   8.07 15.00 12.44 20.00 13.35 17.99
the:DT 20.50 20.00 20.00 20.00   0.00 20.50 10.00
death_penalty:NN 10.46 12.10 10.00   8.72 20.00   9.84 20.00
.:. 20.50 19.42 19.70 20.00 10.00 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00   1.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.unknownPassage: non factive text -- unknown: reported-VBD; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "reported vs. hyp "." <-punct-- "sold", which aligned to text "sold"
Hand-tuned score: -1.5000
Threshold: -3.3437


Inference ID: -43

Txt: The New York Times reported that Hanssen sold fbi secrets to the Russians and could face the death penalty.

Hyp: Hanssen did not sell fbi secrets to the Russians. (Don't know.)

Hanssen
NNP
did
VBD
not
RB
sell
VB
fbi
NN
secrets
NNS
the
DT
Russians
NNPS
.
.
The:DT 20.50 20.00 20.00 20.00 20.00 20.00   0.00 20.50 10.00
New_York_Times:NNPS 14.96 14.84 15.46 14.68   9.41   9.84 20.50 14.34 20.50
reported:VBD 15.46   7.69 19.96   7.69 12.96 11.05 20.00 13.35 19.71
that:IN 20.50 20.00 20.00 20.00 20.00 20.00 20.00 20.50 20.00
Hanssen:NNP   0.00 15.46 15.46 15.46 10.46 10.46 20.50 14.96 20.50
sold:VBD 15.46   7.69 19.96   0.50 14.77 13.55 20.00 15.50 19.42
fbi:NN 10.46 13.20 14.96 15.00   0.00   7.33 20.00 10.50 19.70
secrets:NNS 10.46 12.85 14.96 14.18   7.33   0.00 20.00   8.35 20.00
the:DT 20.50 20.00 20.00 20.00 20.00 20.00   0.00 20.50 10.00
Russians:NNPS 14.96 13.35 15.46 15.50 10.50   8.35 20.50   0.00 20.50
could:MD 20.46 17.84 19.96 19.96 19.96 19.96 10.00 20.46 10.00
face:VB 15.46   4.55 19.96   8.07 15.00 12.44 20.00 13.35 17.99
the:DT 20.50 20.00 20.00 20.00 20.00 20.00   0.00 20.50 10.00
death_penalty:NN 10.46 12.69 14.96 12.10 10.00   8.72 20.00   9.84 20.00
.:. 20.50 17.99 20.00 19.05 19.70 20.00 10.00 20.50   0.00
NO_WORD 10.00   1.00   9.00 10.00 10.00 10.00   1.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -12.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.unknownPassage: non factive text -- unknown: reported-VBD; Polarity.hypNegMarker: "sell": neg; NullPunisher.other: not; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "reported vs. hyp "." <-punct-- "sell", which aligned to text "sold"
Hand-tuned score: -7.5500
Threshold: -3.3437


Inference ID: 44

Txt: Bush said that it was Khan who sold centrifuges to North Korea.

Hyp: Centrifuges were sold to North Korea. (Yes.)

Centrifuges
NNS
were
VBD
sold
VBN
North_Korea
NNP
.
.
Bush:NNP   9.45 14.84 12.74 12.11 20.50
said:VBD 15.00   6.24   7.80 15.50 18.58
that:IN 20.00 20.00 20.00 20.50 20.00
it:PRP 12.00 15.00 15.00 12.50 20.00
was:VBD 14.34   0.50 10.00 11.83 20.00
Khan:NNP   8.53 14.84 15.50 14.02 20.50
who:WP 12.00 15.00 15.00 12.50 20.00
sold:VBD 15.00   7.80   0.00 15.50 19.42
centrifuges:NNS   0.00 14.34 14.23   9.84 19.93
North_Korea:NNP   9.84 14.84 15.50   0.00 20.50
.:. 20.00 20.00 19.42 20.50   0.00
NO_WORD 10.00   1.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -3.0000
Features matched: NullPunisher.aux: were; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "said vs. hyp "." <-punct-- "sold", which aligned to text "sold"
Hand-tuned score: -2.0500
Threshold: -3.3437


Inference ID: -44

Txt: Bush said that it was Khan who sold centrifuges to North Korea.

Hyp: Centrifuges were not sold to North Korea. (Don't know.)

Centrifuges
NNS
were
VBD
not
RB
sold
VBN
North_Korea
NNP
.
.
Bush:NNP   9.45 14.84 15.46 12.74 12.11 20.50
said:VBD 15.00   6.24 19.96   7.80 15.50 18.58
that:IN 20.00 20.00 20.00 20.00 20.50 20.00
it:PRP 12.00 15.00 20.00 15.00 12.50 20.00
was:VBD 14.34   0.50 19.96 10.00 11.83 20.00
Khan:NNP   8.53 14.84 15.46 15.50 14.02 20.50
who:WP 12.00 15.00 20.00 15.00 12.50 20.00
sold:VBD 15.00   7.80 19.96   0.00 15.50 19.42
centrifuges:NNS   0.00 14.34 14.96 14.23   9.84 19.93
North_Korea:NNP   9.84 14.84 15.46 15.50   0.00 20.50
.:. 20.00 20.00 20.00 19.42 20.50   0.00
NO_WORD 10.00   1.00   9.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -12.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "sold": neg; NullPunisher.aux: were; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "said vs. hyp "." <-punct-- "sold", which aligned to text "sold"
Hand-tuned score: -8.0500
Threshold: -3.3437


Inference ID: 45

Txt: Bush said that Khan sold centrifuges to North Korea.

Hyp: Centrifuges were sold to North Korea. (Don't know.)

Centrifuges
NNS
were
VBD
sold
VBN
North_Korea
NNP
.
.
Bush:NNP   9.45 14.84 12.74 12.11 20.50
said:VBD 15.00   6.24   7.80 15.50 18.58
that:IN 20.00 20.00 20.00 20.50 20.00
Khan:NNP   8.53 14.84 15.50 14.02 20.50
sold:VBD 15.00   7.80   0.00 15.50 19.42
centrifuges:NNS   0.00 14.34 14.23   9.84 19.93
North_Korea:NNP   9.84 14.84 15.50   0.00 20.50
.:. 20.00 20.00 19.42 20.50   0.00
NO_WORD 10.00   1.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -3.0000
Features matched: Factive.unknownPassage: non factive text -- unknown: said-VBD; NullPunisher.aux: were; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "said vs. hyp "." <-punct-- "sold", which aligned to text "sold"
Hand-tuned score: -1.5500
Threshold: -3.3437


Inference ID: -45

Txt: Bush said that Khan sold centrifuges to North Korea.

Hyp: Centrifuges were not sold to North Korea. (Don't know.)

Centrifuges
NNS
were
VBD
not
RB
sold
VBN
North_Korea
NNP
.
.
Bush:NNP   9.45 14.84 15.46 12.74 12.11 20.50
said:VBD 15.00   6.24 19.96   7.80 15.50 18.58
that:IN 20.00 20.00 20.00 20.00 20.50 20.00
Khan:NNP   8.53 14.84 15.46 15.50 14.02 20.50
sold:VBD 15.00   7.80 19.96   0.00 15.50 19.42
centrifuges:NNS   0.00 14.34 14.96 14.23   9.84 19.93
North_Korea:NNP   9.84 14.84 15.46 15.50   0.00 20.50
.:. 20.00 20.00 20.00 19.42 20.50   0.00
NO_WORD 10.00   1.00   9.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -12.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.unknownPassage: non factive text -- unknown: said-VBD; Polarity.hypNegMarker: "sold": neg; NullPunisher.aux: were; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "said vs. hyp "." <-punct-- "sold", which aligned to text "sold"
Hand-tuned score: -7.5500
Threshold: -3.3437
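
Note on reading the Response lines: in every entry of this log, the response is "yes" when the hand-tuned score is above the printed threshold and "dontknow" when it is below. The sketch below restates that observed pattern and reproduces the verdicts of entries 45 and -45. It is not taken from the system that produced this log. The names predict and verdict are hypothetical, the behavior at exact equality with the threshold is an assumption (no entry here sits on the threshold), and the gold labels "(Yes.)" and "(Don't know.)" are assumed to map to "yes" and "dontknow".

```python
# Hypothetical helpers: a sketch of the pattern visible in this log,
# not the original system's code.
THRESHOLD = -3.3437  # value printed on every "Threshold:" line in this log


def predict(hand_tuned_score: float, threshold: float = THRESHOLD) -> str:
    """Return "yes" at or above the threshold (equality is an assumption), else "dontknow"."""
    return "yes" if hand_tuned_score >= threshold else "dontknow"


def verdict(response: str, gold: str) -> str:
    """Compare a response with the gold label given in parentheses after Hyp."""
    return "CORRECT" if response == gold else "INCORRECT"


# Entries 45 and -45 from this log: (hand-tuned score, gold label).
for score, gold in [(-1.55, "dontknow"), (-7.55, "dontknow")]:
    response = predict(score)
    print(f"{score:.4f} -> {response} ({verdict(response, gold)})")
```

Under these assumptions the loop prints "yes (INCORRECT)" for entry 45 and "dontknow (CORRECT)" for entry -45, matching the Response lines above.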


Inference ID: 46

Txt: What we found in Iraq was rusted shrapnel.

Hyp: We found something in Iraq. (Yes.)

We
PRP
found
VBD
something
NN
Iraq
NNP
.
.
What:WP 10.00 15.00 12.00 12.50 20.00
we:PRP   0.00 15.00 12.00 12.50 20.00
found:VBD 15.00   0.00 15.00 15.50 19.57
Iraq:NNP 12.50 15.50   9.84   0.00 20.50
was:VBD 15.00 10.00 14.34 11.83 20.00
rusted:VBN 15.00   9.02 14.34 14.84 20.00
shrapnel:JJ 15.00   8.64 11.34 11.84 20.00
.:. 20.00 19.57 20.00 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -12.0000
Features matched: NullPunisher.other: something; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "rusted" vs. hyp "." <-punct-- "found", which aligned to text "found"
Hand-tuned score: -3.0000
Threshold: -3.3437


Inference ID: -46

Txt: What we found in Iraq was rusted shrapnel.

Hyp: We found nothing in Iraq. (Don't know.)

We
PRP
found
VBD
nothing
NN
Iraq
NNP
.
.
What:WP 10.00 15.00 12.00 12.50 20.00
we:PRP   0.00 15.00 12.00 12.50 20.00
found:VBD 15.00   0.00 15.00 15.50 19.57
Iraq:NNP 12.50 15.50   9.84   0.00 20.50
was:VBD 15.00 10.00 14.34 11.83 20.00
rusted:VBN 15.00   9.02 14.34 14.84 20.00
shrapnel:JJ 15.00   8.64 11.34 11.84 20.00
.:. 20.00 19.57 20.00 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -12.0000
Features matched: NullPunisher.other: nothing; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "rusted" vs. hyp "." <-punct-- "found", which aligned to text "found"
Hand-tuned score: -3.0000
Threshold: -3.3437


Inference ID: 47

Txt: The fact that Bin Laden was in Tora Bora lead to the suspicion that the Afghan campaign was mismanaged.

Hyp: Bin Laden was in Tora Bora. (Yes.)

Bin_Laden
NNP
was
VBD
Tora_Bora
NNP
.
.
The:DT 20.50 20.00 20.50 10.00
fact:NN   9.42 15.00 10.46 17.53
that:IN 20.50 20.00 20.50 20.00
Bin_Laden:NNP   0.00 14.84 14.96 20.50
was:VBD 14.84   0.00 15.46 20.00
Tora_Bora:NNP 14.96 15.46   0.00 20.50
lead:VBP 13.55   7.52 15.46 19.02
the:DT 20.50 20.00 20.50 10.00
suspicion:NN   9.84 15.00 10.46 19.99
that:IN 20.50 20.00 20.50 20.00
the:DT 20.50 20.00 20.50 10.00
Afghan:JJ 15.05 11.84 16.96 20.50
campaign:NN 10.50 15.00 10.46 19.85
was:VBD 14.84   0.00 15.46 20.00
mismanaged:VBN 15.50 10.00 15.46 20.00
.:. 20.50 20.00 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.factivePassage: factive entails : fact-NN; Location.mismatch: no clear info of matching: be(X, prep_in); Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "lead" vs. hyp "." <-punct-- "was", which aligned to text "was"
Hand-tuned score: -3.0000
Threshold: -3.3437


Inference ID: -47

Txt: The fact that Bin Laden was in Tora Bora lead to the suspicion that the Afghan campaign was mismanaged.

Hyp: Bin Laden was not in Tora Bora. (Don't know.)

Bin_Laden
NNP
was
VBD
not
RB
Tora_Bora
NNP
.
.
The:DT 20.50 20.00 20.00 20.50 10.00
fact:NN   9.42 15.00 14.96 10.46 17.53
that:IN 20.50 20.00 20.00 20.50 20.00
Bin_Laden:NNP   0.00 14.84 15.46 14.96 20.50
was:VBD 14.84   0.00 19.96 15.46 20.00
Tora_Bora:NNP 14.96 15.46 15.46   0.00 20.50
lead:VBP 13.55   7.52 19.96 15.46 19.02
the:DT 20.50 20.00 20.00 20.50 10.00
suspicion:NN   9.84 15.00 14.96 10.46 19.99
that:IN 20.50 20.00 20.00 20.50 20.00
the:DT 20.50 20.00 20.00 20.50 10.00
Afghan:JJ 15.05 11.84 12.46 16.96 20.50
campaign:NN 10.50 15.00 14.96 10.46 19.85
was:VBD 14.84   0.00 19.96 15.46 20.00
mismanaged:VBN 15.50 10.00 19.96 15.46 20.00
.:. 20.50 20.00 20.00 20.50   0.00
NO_WORD 10.00 10.00   9.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.factivePassage: factive entails : fact-NN; Polarity.hypNegMarker: "was": neg; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "lead" vs. hyp "." <-punct-- "was", which aligned to text "was"
Hand-tuned score: -7.0000
Threshold: -3.3437


Inference ID: 48

Txt: The fact that Bin Laden was in Tora Bora lead to the suspicion that the Afghan campaign was mismanaged.

Hyp: The Afghan campaign was mismanaged. (Don't know.)

The
DT
Afghan
JJ
campaign
NN
was
VBD
mismanaged
VBN
.
.
The:DT   0.00 20.50 20.00 20.00 20.00 10.00
fact:NN 20.00 10.35   8.94 15.00 14.66 17.53
that:IN 20.00 20.50 20.00 20.00 20.00 20.00
Bin_Laden:NNP 20.50 15.05 10.50 14.84 15.50 20.50
was:VBD 20.00 11.84 15.00   0.00 10.00 20.00
Tora_Bora:NNP 20.50 16.96 10.46 15.46 15.46 20.50
lead:VBP 20.00 10.35 12.72   7.52   6.73 19.02
the:DT   0.00 20.50 20.00 20.00 20.00 10.00
suspicion:NN 20.00 11.19   8.40 15.00 12.80 19.99
that:IN 20.00 20.50 20.00 20.00 20.00 20.00
the:DT   0.00 20.50 20.00 20.00 20.00 10.00
Afghan:JJ 20.50   0.00 12.50 11.84 12.50 20.50
campaign:NN 20.00 12.50   0.00 15.00 14.12 19.85
was:VBD 20.00 11.84 15.00   0.00 10.00 20.00
mismanaged:VBN 20.00 12.50 14.12 10.00   0.00 20.00
.:. 10.00 20.50 19.85 20.00 20.00   0.00
NO_WORD   1.00   9.00 10.00   1.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.inPositiveEmbedding: embedded positive text; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "lead" vs. hyp "." <-punct-- "mismanaged", which aligned to text "mismanaged"
Hand-tuned score: -1.0000
Threshold: -3.3437


Inference ID: -48

Txt: The fact that Bin Laden was in Tora Bora lead to the suspicion that the Afghan campaign was mismanaged.

Hyp: The Afghan campaign was not mismanaged. (Don't know.)

The
DT
Afghan
JJ
campaign
NN
was
VBD
not
RB
mismanaged
VBN
.
.
The:DT   0.00 20.50 20.00 20.00 20.00 20.00 10.00
fact:NN 20.00 10.35   8.94 15.00 14.96 14.66 17.53
that:IN 20.00 20.50 20.00 20.00 20.00 20.00 20.00
Bin_Laden:NNP 20.50 15.05 10.50 14.84 15.46 15.50 20.50
was:VBD 20.00 11.84 15.00   0.00 19.96 10.00 20.00
Tora_Bora:NNP 20.50 16.96 10.46 15.46 15.46 15.46 20.50
lead:VBP 20.00 10.35 12.72   7.52 19.96   6.73 19.02
the:DT   0.00 20.50 20.00 20.00 20.00 20.00 10.00
suspicion:NN 20.00 11.19   8.40 15.00 14.96 12.80 19.99
that:IN 20.00 20.50 20.00 20.00 20.00 20.00 20.00
the:DT   0.00 20.50 20.00 20.00 20.00 20.00 10.00
Afghan:JJ 20.50   0.00 12.50 11.84 12.46 12.50 20.50
campaign:NN 20.00 12.50   0.00 15.00 14.96 14.12 19.85
was:VBD 20.00 11.84 15.00   0.00 19.96 10.00 20.00
mismanaged:VBN 20.00 12.50 14.12 10.00 19.96   0.00 20.00
.:. 10.00 20.50 19.85 20.00 20.00 20.00   0.00
NO_WORD   1.00   9.00 10.00   1.00   9.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.inPositiveEmbedding: embedded positive text; Polarity.hypNegMarker: "mismanaged": neg; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "lead" vs. hyp "." <-punct-- "mismanaged", which aligned to text "mismanaged"
Hand-tuned score: -7.0000
Threshold: -3.3437


Inference ID: 49

Txt: The paper concluded that the election had been rigged.

Hyp: The election was rigged. (Don't know.)

The
DT
election
NN
was
VBD
rigged
VBN
.
.
The:DT   0.00 20.00 20.00 20.00 10.00
paper:NN 20.00   8.35 14.34 12.12 18.64
concluded:VBD 20.00 15.00 10.00 10.00 19.24
that:IN 20.00 20.00 20.00 20.00 20.00
the:DT   0.00 20.00 20.00 20.00 10.00
election:NN 20.00   0.00 15.00 11.41 20.00
had:VBD 20.00 15.00   9.34   5.73 20.00
been:VBN 20.00 15.00   0.50   7.80 20.00
rigged:VBN 20.00 11.41   9.34   0.00 20.00
.:. 10.00 20.00 20.00 20.00   0.00
NO_WORD   1.00 10.00   1.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -2.5000
Features matched: Factive.unknownPassage: non factive text -- unknown: concluded-VBD; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "concluded" vs. hyp "." <-punct-- "rigged", which aligned to text "rigged"
Hand-tuned score: -1.5000
Threshold: -3.3437


Inference ID: -49

Txt: The paper concluded that the election had been rigged.

Hyp: The election was not rigged. (Don't know.)

The
DT
election
NN
was
VBD
not
RB
rigged
VBN
.
.
The:DT   0.00 20.00 20.00 20.00 20.00 10.00
paper:NN 20.00   8.35 14.34 14.96 12.12 18.64
concluded:VBD 20.00 15.00 10.00 19.96 10.00 19.24
that:IN 20.00 20.00 20.00 20.00 20.00 20.00
the:DT   0.00 20.00 20.00 20.00 20.00 10.00
election:NN 20.00   0.00 15.00 14.96 11.41 20.00
had:VBD 20.00 15.00   9.34 19.96   5.73 20.00
been:VBN 20.00 15.00   0.50 19.96   7.80 20.00
rigged:VBN 20.00 11.41   9.34 19.96   0.00 20.00
.:. 10.00 20.00 20.00 20.00 20.00   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -11.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.unknownPassage: non factive text -- unknown: concluded-VBD; Polarity.hypNegMarker: "rigged": neg; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "concluded" vs. hyp "." <-punct-- "rigged", which aligned to text "rigged"
Hand-tuned score: -7.5000
Threshold: -3.3437


Inference ID: 50

Txt: Ames was, as the press reported, a successful spy.

Hyp: Ames was a successful spy. (Yes.)

Ames
NNP
was
VBD
a
DT
successful
JJ
spy
NN
.
.
Ames:NNP   0.00 15.46 20.50 12.46 10.46 20.50
was:VBD 15.46   0.00 20.00 11.96 14.34 20.00
,:, 20.50 20.00 10.00 19.58 20.00   5.73
the:DT 20.50 20.00 10.00 20.00 20.00 10.00
press:NN 10.46 14.34 20.00 11.96   8.50 19.26
reported:VBN 15.46 10.00 20.00 11.96 14.05 19.71
,:, 20.50 20.00 10.00 19.58 20.00   5.73
a:DT 20.50 20.00   0.00 20.00 20.00 10.00
successful:JJ 12.46 11.96 20.00   0.00 11.78 18.38
spy:NN 10.46 14.34 20.00 11.78   0.00 20.00
.:. 20.50 20.00 10.00 18.38 20.00   0.00
NO_WORD 10.00   1.00   1.00   9.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -5.0000
Features matched: Factive.unknownPassage: non factive text -- unknown: reported-VBN; NullPunisher.aux: was; Quant.contract: [a,a]; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "reported" vs. hyp "." <-punct-- "spy", which aligned to text "spy" args have different parents, different relations: text "Ames" <-nsubjpass-- "reported" vs. hyp "Ames" <-nsubj-- "spy", which aligned to text "spy"
Hand-tuned score: -0.5500
Threshold: -3.3437


Inference ID: -50

Txt: Ames was, as the press reported, a successful spy.

Hyp: Ames was not a successful spy. (Don't know.)

Ames
NNP
was
VBD
not
RB
a
DT
successful
JJ
spy
NN
.
.
Ames:NNP   0.00 15.46 15.46 20.50 12.46 10.46 20.50
was:VBD 15.46   0.00 19.96 20.00 11.96 14.34 20.00
,:, 20.50 20.00 20.00 10.00 19.58 20.00   5.73
the:DT 20.50 20.00 20.00 10.00 20.00 20.00 10.00
press:NN 10.46 14.34 14.96 20.00 11.96   8.50 19.26
reported:VBN 15.46 10.00 19.96 20.00 11.96 14.05 19.71
,:, 20.50 20.00 20.00 10.00 19.58 20.00   5.73
a:DT 20.50 20.00 20.00   0.00 20.00 20.00 10.00
successful:JJ 12.46 11.96 11.96 20.00   0.00 11.78 18.38
spy:NN 10.46 14.34 14.96 20.00 11.78   0.00 20.00
.:. 20.50 20.00 20.00 10.00 18.38 20.00   0.00
NO_WORD 10.00   1.00   9.00   1.00   9.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -14.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.unknownPassage: non factive text -- unknown: reported-VBN; Polarity.hypNegMarker: "spy": neg; NullPunisher.aux: was; NullPunisher.other: not; Quant.contract: [a,a]; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "reported" vs. hyp "." <-punct-- "spy", which aligned to text "spy" args have different parents, different relations: text "Ames" <-nsubjpass-- "reported" vs. hyp "Ames" <-nsubj-- "spy", which aligned to text "spy"
Hand-tuned score: -6.5500
Threshold: -3.3437


Inference ID: 51

Txt: The press reported that Ames was a successful spy.

Hyp: Ames was a successful spy. (Don't know.)

Ames
NNP
was
VBD
a
DT
successful
JJ
spy
NN
.
.
The:DT 20.50 20.00 10.00 20.00 20.00 10.00
press:NN 10.46 14.34 20.00 11.96   8.50 19.26
reported:VBD 15.46 10.00 20.00 11.96 14.05 19.71
that:IN 20.50 20.00 20.00 20.00 20.00 20.00
Ames:NNP   0.00 15.46 20.50 12.46 10.46 20.50
was:VBD 15.46   0.00 20.00 11.96 14.34 20.00
a:DT 20.50 20.00   0.00 20.00 20.00 10.00
successful:JJ 12.46 11.96 20.00   0.00 11.78 18.38
spy:NN 10.46 14.34 20.00 11.78   0.00 20.00
.:. 20.50 20.00 10.00 18.38 20.00   0.00
NO_WORD 10.00   1.00   1.00   9.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.unknownPassage: non factive text -- unknown: reported-VBD; Quant.contract: [a,a]; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "reported" vs. hyp "." <-punct-- "spy", which aligned to text "spy"
Hand-tuned score: -0.5000
Threshold: -3.3437


Inference ID: -51

Txt: The press reported that Ames was a successful spy.

Hyp: Ames was not a successful spy. (Don't know.)

Ames
NNP
was
VBD
not
RB
a
DT
successful
JJ
spy
NN
.
.
The:DT 20.50 20.00 20.00 10.00 20.00 20.00 10.00
press:NN 10.46 14.34 14.96 20.00 11.96   8.50 19.26
reported:VBD 15.46 10.00 19.96 20.00 11.96 14.05 19.71
that:IN 20.50 20.00 20.00 20.00 20.00 20.00 20.00
Ames:NNP   0.00 15.46 15.46 20.50 12.46 10.46 20.50
was:VBD 15.46   0.00 19.96 20.00 11.96 14.34 20.00
a:DT 20.50 20.00 20.00   0.00 20.00 20.00 10.00
successful:JJ 12.46 11.96 11.96 20.00   0.00 11.78 18.38
spy:NN 10.46 14.34 14.96 20.00 11.78   0.00 20.00
.:. 20.50 20.00 20.00 10.00 18.38 20.00   0.00
NO_WORD 10.00   1.00   9.00   1.00   9.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.unknownPassage: non factive text -- unknown: reported-VBD; Polarity.hypNegMarker: "spy": neg; NullPunisher.other: not; Quant.contract: [a,a]; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "reported" vs. hyp "." <-punct-- "spy", which aligned to text "spy"
Hand-tuned score: -6.5000
Threshold: -3.3437


Inference ID: 52

Txt: The US forgot that the Afghans speak several different languages.

Hyp: The Afghans speak several different languages. (Yes.)

The
DT
Afghans
NNPS
speak
VBP
several
JJ
different
JJ
languages
NNS
.
.
The:DT   0.00 20.50 20.00 20.00 20.00 20.00 10.00
US:NNP 20.50 14.34 15.50 12.46 12.46 10.50 20.50
forgot:VBD 20.00 15.50   9.43 11.96 11.96 15.00 19.56
that:IN 20.00 20.50 20.00 20.00 20.00 20.00 20.00
the:DT   0.00 20.50 20.00 20.00 20.00 20.00 10.00
Afghans:NNPS 20.50   0.00 15.50 12.46 12.46   5.87 20.50
speak:VBP 20.00 15.50   0.00 11.96 11.39 12.82 18.41
several:JJ 20.00 12.46 11.96   0.00   9.96 11.96 20.00
different:JJ 20.00 12.46 11.39   9.96   0.00   8.76 17.27
languages:NNS 20.00   5.87 12.82 11.96   8.76   0.00 19.86
.:. 10.00 20.50 18.41 20.00 17.27 19.86   0.00
NO_WORD   1.00 10.00 10.00   9.00   9.00 10.00 10.00

Response: dontknow (INCORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.negativeStatement: non factive text : forgot-VBD; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "forgot" vs. hyp "." <-punct-- "speak", which aligned to text "speak"
Hand-tuned score: -7.0000
Threshold: -3.3437


Inference ID: -52

Txt: The US forgot that the Afghans speak several different languages.

Hyp: The Afghans do not speak several different languages. (Don't know.)

The
DT
Afghans
NNPS
do
VBP
not
RB
speak
VB
several
JJ
different
JJ
languages
NNS
.
.
The:DT   0.00 20.50 20.00 20.00 20.00 20.00 20.00 20.00 10.00
US:NNP 20.50 14.34 15.50 15.46 15.50 12.46 12.46 10.50 20.50
forgot:VBD 20.00 15.50   8.48 19.96   9.43 11.96 11.96 15.00 19.56
that:IN 20.00 20.50 20.00 20.00 20.00 20.00 20.00 20.00 20.00
the:DT   0.00 20.50 20.00 20.00 20.00 20.00 20.00 20.00 10.00
Afghans:NNPS 20.50   0.00 13.35 15.46 15.50 12.46 12.46   5.87 20.50
speak:VBP 20.00 15.50   5.23 19.96   0.00 11.96 11.39 12.82 18.41
several:JJ 20.00 12.46 11.96 11.96 11.96   0.00   9.96 11.96 20.00
different:JJ 20.00 12.46 10.27 11.96 11.39   9.96   0.00   8.76 17.27
languages:NNS 20.00   5.87 11.24 14.96 12.82 11.96   8.76   0.00 19.86
.:. 10.00 20.50 18.81 20.00 18.41 20.00 17.27 19.86   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00   9.00   9.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -12.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.negativeStatement: non factive text : forgot-VBD; Polarity.hypNegMarker: "speak": neg; NullPunisher.aux: do; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "forgot" vs. hyp "." <-punct-- "speak", which aligned to text "speak"
Hand-tuned score: -13.0500
Threshold: -3.3437


Inference ID: 53

Txt: Bush realized that the US Army had to be transformed to meet new threats.

Hyp: The US Army had to be transformed to meet new threats. (Yes.)

The
DT
US_Army
NNP
had
VBD
to
TO
be
VB
transformed
VBN
to
TO
meet
VB
new
JJ
threats
NNS
.
.
Bush:NNP 20.00   8.63 13.05 20.00 14.34 15.00 20.00 12.02 11.96   8.05 20.00
realized:VBD 20.00 15.50   6.84 20.00 10.00   7.20 20.00   7.24 11.96 14.16 17.47
that:IN 20.00 20.50 20.00 20.00 20.00 20.00 20.00 20.00 20.00 20.00 20.00
the:DT   0.00 20.50 20.00 10.00 20.00 20.00 10.00 20.00 20.00 20.00 10.00
US_Army:NNP 20.50   0.00 15.17 20.50 15.17 15.50 20.50 15.50 12.46 10.17 20.50
had:VBD 20.00 15.17   0.00 20.00   7.80   7.61 20.00   3.72 11.96 13.05 20.00
to:TO 10.00 20.50 20.00   0.00 20.00 20.00   0.00 20.00 20.00 20.00 10.00
be:VB 20.00 15.17   7.80 20.00   0.00 10.00 20.00   2.00 11.96 14.34 20.00
transformed:VBN 20.00 15.50   7.61 20.00 10.00   0.00 20.00   7.61 10.33 14.61 20.00
to:TO 10.00 20.50 20.00   0.00 20.00 20.00   0.00 20.00 20.00 20.00 10.00
meet:VB 20.00 15.50   3.72 20.00   6.07   7.61 20.00   0.00 11.24 14.50 19.52
new:JJ 20.00 12.46 11.96 20.00 11.96 10.33 20.00 11.24   0.00 11.96 20.00
threats:NNS 20.00 10.17 13.05 20.00 14.34 14.61 20.00 14.50 11.96   0.00 19.03
.:. 10.00 20.50 20.00 10.00 20.00 20.00 10.00 19.52 20.00 19.03   0.00
NO_WORD   1.00 10.00 10.00 10.00   1.00 10.00 10.00 10.00   9.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.factivePassage: factive entails : realized-VBD; Modal.weakYes: necessary -> necessary; Structure.argsMismatch: args have different parents but same relations: text "US_Army" <-xsubj-- "transformed" vs. hyp "US_Army" <-nsubj-- "had", which aligned to text "had" args have different parents but same relations: text "." <-punct-- "realized" vs. hyp "." <-punct-- "had", which aligned to text "had"
Hand-tuned score: 0.0000
Threshold: -3.3437


Inference ID: -53

Txt: Bush realized that the US Army had to be transformed to meet new threats.

Hyp: The US Army did not have to be transformed to meet new threats. (Don't know.)

The
DT
US_Army
NNP
did
VBD
not
RB
have
VB
to
TO
be
VB
transformed
VBN
to
TO
meet
VB
new
JJ
threats
NNS
.
.
Bush:NNP 20.00   8.63 15.00 14.96 13.05 20.00 14.34 15.00 20.00 12.02 11.96   8.05 20.00
realized:VBD 20.00 15.50   7.32 19.96   6.84 20.00 10.00   7.20 20.00   7.24 11.96 14.16 17.47
that:IN 20.00 20.50 20.00 20.00 20.00 20.00 20.00 20.00 20.00 20.00 20.00 20.00 20.00
the:DT   0.00 20.50 20.00 20.00 20.00 10.00 20.00 20.00 10.00 20.00 20.00 20.00 10.00
US_Army:NNP 20.50   0.00 15.50 15.46 15.17 20.50 15.17 15.50 20.50 15.50 12.46 10.17 20.50
had:VBD 20.00 15.17   7.32 19.96   0.50 20.00   7.80   7.61 20.00   3.72 11.96 13.05 20.00
to:TO 10.00 20.50 20.00 20.00 20.00   0.00 20.00 20.00   0.00 20.00 20.00 20.00 10.00
be:VB 20.00 15.17   6.07 19.96   7.80 20.00   0.00 10.00 20.00   2.00 11.96 14.34 20.00
transformed:VBN 20.00 15.50   7.62 19.96   7.61 20.00 10.00   0.00 20.00   7.61 10.33 14.61 20.00
to:TO 10.00 20.50 20.00 20.00 20.00   0.00 20.00 20.00   0.00 20.00 20.00 20.00 10.00
meet:VB 20.00 15.50   5.11 19.96   3.72 20.00   6.07   7.61 20.00   0.00 11.24 14.50 19.52
new:JJ 20.00 12.46 11.96 11.96 11.96 20.00 11.96 10.33 20.00 11.24   0.00 11.96 20.00
threats:NNS 20.00 10.17 12.85 14.96 13.05 20.00 14.34 14.61 20.00 14.50 11.96   0.00 19.03
.:. 10.00 20.50 17.99 20.00 20.00 10.00 20.00 20.00 10.00 19.52 20.00 19.03   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00 10.00   1.00 10.00 10.00 10.00   9.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -12.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.factivePassage: factive entails : realized-VBD; Modal.no: necessary -> not necessary; Polarity.hypNegMarker: "have": neg; NullPunisher.other: not; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "US_Army" <-xsubj-- "transformed" vs. hyp "US_Army" <-nsubj-- "have", which aligned to text "had" args have different parents but same relations: text "." <-punct-- "realized" vs. hyp "." <-punct-- "have", which aligned to text "had"
Hand-tuned score: -13.0500
Threshold: -3.3437


Inference ID: 54

Txt: Bush didn't realize that Afghanistan is land-locked.

Hyp: Afghanistan is land-locked. (Yes.)

Afghanistan
NNP
is
VBZ
land
NN
-
:
locked
VBN
.
.
Bush:NNP   7.61 14.34   7.11 20.00   8.87 20.00
did:VBD 15.50   6.07 12.62 20.00   7.32 17.99
n't:RB 15.46 19.96 14.96 20.00 19.96 17.90
realize:VB 15.50 10.00 14.42 20.00   7.32 17.34
that:IN 20.50 20.00 20.00 20.00 20.00 20.00
Afghanistan:NNP   0.00 14.84   5.84 20.50 14.84 20.50
is:VBZ 14.84   0.00 14.34 20.00   9.34 20.00
land:NN   2.50 14.34   0.00 20.00 12.65 18.77
-:: 20.50 20.00 20.00   0.00 20.00 10.00
locked:VBN 14.84   9.34 12.65 20.00   0.00 19.17
.:. 20.50 20.00 18.77 10.00 19.17   0.00
NO_WORD 10.00   1.00 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.factivePassage: factive entails : realize-VB; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "realize" vs. hyp "." <-punct-- "land", which aligned to text "land"
Hand-tuned score: -1.0000
Threshold: -3.3437


Inference ID: -54

Txt: Bush didn't realize that Afghanistan is land-locked.

Hyp: Afghanistan is not land-locked. (Don't know.)

Afghanistan
NNP
is
VBZ
not
RB
land
NN
-
:
locked
VBN
.
.
Bush:NNP   7.61 14.34 14.96   7.11 20.00   8.87 20.00
did:VBD 15.50   6.07 19.96 12.62 20.00   7.32 17.99
n't:RB 15.46 19.96   0.50 14.96 20.00 19.96 17.90
realize:VB 15.50 10.00 19.96 14.42 20.00   7.32 17.34
that:IN 20.50 20.00 20.00 20.00 20.00 20.00 20.00
Afghanistan:NNP   0.00 14.84 15.46   5.84 20.50 14.84 20.50
is:VBZ 14.84   0.00 19.96 14.34 20.00   9.34 20.00
land:NN   2.50 14.34 14.96   0.00 20.00 12.65 18.77
-:: 20.50 20.00 20.00 20.00   0.00 20.00 10.00
locked:VBN 14.84   9.34 19.96 12.65 20.00   0.00 19.17
.:. 20.50 20.00 20.00 18.77 10.00 19.17   0.00
NO_WORD 10.00   1.00   9.00 10.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -8.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.factivePassage: factive entails : realize-VB; Polarity.hypNegMarker: "land": neg; Structure.argsMismatch: args have different parents but same relations: text "n't" <-neg-- "realize" vs. hyp "not" <-neg-- "land", which aligned to text "land" args have different parents but same relations: text "." <-punct-- "realize" vs. hyp "." <-punct-- "land", which aligned to text "land" text "locked" is amod of "land" while hyp "locked" is partmod of "land" which aligned to text "land"
Hand-tuned score: -6.0000
Threshold: -3.3437


Inference ID: 55

Txt: There is a belief that the US will invade Syria.

Hyp: The US will invade Syria. (Don't know.)

The
DT
US
NNP
will
MD
invade
VB
Syria
NNP
.
.
There:EX 10.00 20.50 10.00 20.00 20.50 10.00
is:VBZ 20.00 14.84 20.00   6.70 14.84 20.00
a:DT 10.00 20.50 10.00 20.00 20.50 10.00
belief:NN 20.00 10.50 17.47 14.83 10.50 18.82
that:IN 20.00 20.50 20.00 20.00 20.50 20.00
the:DT   0.00 20.50 10.00 20.00 20.50 10.00
US:NNP 20.50   0.00 20.50 15.50   5.34 20.50
will:MD 10.00 20.50   0.00 20.00 20.50 10.00
invade:VB 20.00 15.50 20.00   0.00 15.50 20.00
Syria:NNP 20.50   5.34 20.50 15.50   0.00 20.50
.:. 10.00 20.50 10.00 20.00 20.50   0.00
NO_WORD   1.00 10.00 10.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.unknownPassage: non factive text -- unknown: belief-NN; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "is" vs. hyp "." <-punct-- "invade", which aligned to text "invade"
Hand-tuned score: -1.5000
Threshold: -3.3437


Inference ID: -55

Txt: There is a belief that the US will invade Syria.

Hyp: The US will not invade Syria. (Don't know.)

The
DT
US
NNP
will
MD
not
RB
invade
VB
Syria
NNP
.
.
There:EX 10.00 20.50 10.00 20.00 20.00 20.50 10.00
is:VBZ 20.00 14.84 20.00 19.96   6.70 14.84 20.00
a:DT 10.00 20.50 10.00 20.00 20.00 20.50 10.00
belief:NN 20.00 10.50 17.47 14.96 14.83 10.50 18.82
that:IN 20.00 20.50 20.00 20.00 20.00 20.50 20.00
the:DT   0.00 20.50 10.00 20.00 20.00 20.50 10.00
US:NNP 20.50   0.00 20.50 15.46 15.50   5.34 20.50
will:MD 10.00 20.50   0.00 19.96 20.00 20.50 10.00
invade:VB 20.00 15.50 20.00 19.96   0.00 15.50 20.00
Syria:NNP 20.50   5.34 20.50 15.46 15.50   0.00 20.50
.:. 10.00 20.50 10.00 20.00 20.00 20.50   0.00
NO_WORD   1.00 10.00 10.00   9.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.unknownPassage: non factive text -- unknown: belief-NN; Polarity.hypNegMarker: "invade": neg; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "is" vs. hyp "." <-punct-- "invade", which aligned to text "invade"
Hand-tuned score: -7.5000
Threshold: -3.3437


Inference ID: 56

Txt: It is not surprising that Bush has the lead in Ohio.

Hyp: Bush has the lead in Ohio. (Yes.)

Bush
NNP
has
VBZ
the
DT
lead
NN
Ohio
NNP
.
.
It:PRP 12.50 15.00 20.00 12.00 12.50 20.00
is:VBZ 14.84   8.64 20.00   9.39 14.84 20.00
not:RB 15.46 19.96 20.00 14.96 15.46 20.00
surprising:JJ 12.50 12.00 20.00   9.70 12.50 19.84
that:IN 20.50 20.00 20.00 20.00 20.50 20.00
Bush:NNP   0.00 13.02 20.50   8.02 12.11 20.50
has:VBZ 13.02   0.00 20.00   7.65 13.02 20.00
the:DT 20.50 20.00   0.00 20.00 20.50 10.00
lead:NN   8.02   7.65 20.00   0.00   8.02 19.02
Ohio:NNP 12.11 13.02 20.50   8.02   0.00 20.50
.:. 20.50 20.00 10.00 19.02 20.50   0.00
NO_WORD 10.00 10.00   1.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.factivePassage: factive entails : surprising-JJ; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "surprising" vs. hyp "." <-punct-- "has", which aligned to text "has"
Hand-tuned score: -1.0000
Threshold: -3.3437


Inference ID: -56

Txt: It is not surprising that Bush has the lead in Ohio.

Hyp: Bush does not have the lead in Ohio. (Don't know.)

Bush
NNP
does
VBZ
not
RB
have
VB
the
DT
lead
NN
Ohio
NNP
.
.
It:PRP 12.50 15.00 20.00 15.00 20.00 12.00 12.50 20.00
is:VBZ 14.84   9.34 19.96   7.80 20.00   9.39 14.84 20.00
not:RB 15.46 19.96   0.00 19.96 20.00 14.96 15.46 20.00
surprising:JJ 12.50 11.87 11.96 10.07 20.00   9.70 12.50 19.84
that:IN 20.50 20.00 20.00 20.00 20.00 20.00 20.50 20.00
Bush:NNP   0.00 13.61 15.46 13.55 20.50   8.02 12.11 20.50
has:VBZ 13.02   9.34 19.96   0.50 20.00   7.65 13.02 20.00
the:DT 20.50 18.65 20.00 20.00   0.00 20.00 20.50 10.00
lead:NN   8.02 13.11 14.96 10.68 20.00   0.00   8.02 19.02
Ohio:NNP 12.11 14.84 15.46 14.84 20.50   8.02   0.00 20.50
.:. 20.50 20.00 20.00 20.00 10.00 19.02 20.50   0.00
NO_WORD 10.00   1.00   9.00 10.00   1.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -5.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.factivePassage: factive entails : surprising-JJ; Polarity.hypNegMarker: "have": neg; NullPunisher.aux: does; Structure.argsMismatch: args have different parents but same relations: text "not" <-neg-- "surprising" vs. hyp "not" <-neg-- "have", which aligned to text "has" args have different parents but same relations: text "." <-punct-- "surprising" vs. hyp "." <-punct-- "have", which aligned to text "has"
Hand-tuned score: -6.0500
Threshold: -3.3437


Inference ID: 57

Txt: It is not likely that Bush has the lead in Ohio.

Hyp: Bush has the lead in Ohio. (Don't know.)

Bush
NNP
has
VBZ
the
DT
lead
NN
Ohio
NNP
.
.
It:PRP 12.50 15.00 20.00 12.00 12.50 20.00
is:VBZ 14.84   8.64 20.00   9.39 14.84 20.00
not:RB 15.46 19.96 20.00 14.96 15.46 20.00
likely:JJ 12.46 11.96 20.00 10.90 12.46 19.92
that:IN 20.50 20.00 20.00 20.00 20.50 20.00
Bush:NNP   0.00 13.02 20.50   8.02 12.11 20.50
has:VBZ 13.02   0.00 20.00   7.65 13.02 20.00
the:DT 20.50 20.00   0.00 20.00 20.50 10.00
lead:NN   8.02   7.65 20.00   0.00   8.02 19.02
Ohio:NNP 12.11 13.02 20.50   8.02   0.00 20.50
.:. 20.50 20.00 10.00 19.02 20.50   0.00
NO_WORD 10.00 10.00   1.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.unknownPassage: non factive text -- unknown: likely-JJ; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "likely" vs. hyp "." <-punct-- "has", which aligned to text "has"
Hand-tuned score: -1.5000
Threshold: -3.3437


Inference ID: -57

Txt: It is not likely that Bush has the lead in Ohio.

Hyp: Bush does not have the lead in Ohio. (Don't know.)

Bush
NNP
does
VBZ
not
RB
have
VB
the
DT
lead
NN
Ohio
NNP
.
.
It:PRP 12.50 15.00 20.00 15.00 20.00 12.00 12.50 20.00
is:VBZ 14.84   9.34 19.96   7.80 20.00   9.39 14.84 20.00
not:RB 15.46 19.96   0.00 19.96 20.00 14.96 15.46 20.00
likely:JJ 12.46   9.43 11.96 11.96 20.00 10.90 12.46 19.92
that:IN 20.50 20.00 20.00 20.00 20.00 20.00 20.50 20.00
Bush:NNP   0.00 13.61 15.46 13.55 20.50   8.02 12.11 20.50
has:VBZ 13.02   9.34 19.96   0.50 20.00   7.65 13.02 20.00
the:DT 20.50 18.65 20.00 20.00   0.00 20.00 20.50 10.00
lead:NN   8.02 13.11 14.96 10.68 20.00   0.00   8.02 19.02
Ohio:NNP 12.11 14.84 15.46 14.84 20.50   8.02   0.00 20.50
.:. 20.50 20.00 20.00 20.00 10.00 19.02 20.50   0.00
NO_WORD 10.00   1.00   9.00 10.00   1.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -5.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.unknownPassage: non factive text -- unknown: likely-JJ; Polarity.hypNegMarker: "have": neg; NullPunisher.aux: does; Structure.argsMismatch: args have different parents but same relations: text "not" <-neg-- "likely" vs. hyp "not" <-neg-- "have", which aligned to text "has" args have different parents but same relations: text "." <-punct-- "likely" vs. hyp "." <-punct-- "have", which aligned to text "has"
Hand-tuned score: -6.5500
Threshold: -3.3437


Inference ID: 58

Txt: Kerry knew that Edwards would accept the nomination.

Hyp: Kerry knew whether Edwards would accept the nomination. (Yes.)

Kerry
NNP
knew
VBD
whether
IN
Edwards
NNP
would
MD
accept
VB
the
DT
nomination
NN
.
.
Kerry:NNP   0.00 15.46 20.50   9.07 20.46 15.46 20.50 10.46 20.50
knew:VBD 15.46   0.00 20.00 15.50 19.96   5.35 20.00 14.92 17.19
that:IN 20.50 20.00 10.00 20.50 20.00 20.00 20.00 20.00 20.00
Edwards:NNP   9.07 15.50 20.50   0.00 20.46 15.50 20.50 10.50 20.50
would:MD 20.46 19.96 20.00 20.46   0.00 19.96 10.00 19.96 10.00
accept:VB 15.46   5.35 20.00 15.50 19.96   0.00 20.00 15.00 18.61
the:DT 20.50 20.00 20.00 20.50 10.00 20.00   0.00 20.00 10.00
nomination:NN 10.46 14.92 20.00 10.50 19.96 15.00 20.00   0.00 19.99
.:. 20.50 17.19 20.00 20.50 10.00 18.61 10.00 19.99   0.00
NO_WORD 10.00 10.00   1.00 10.00 10.00 10.00   1.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -1.0000
Features matched: NullPunisher.functionWord: whether
Hand-tuned score: 0.9000
Threshold: -3.3437
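
Note on reading the records: the log prints a hand-tuned score and a threshold for each inference but does not state how the response is derived from them. In the records of this section the response is "yes" exactly when the hand-tuned score is above the threshold, and "dontknow" otherwise. The sketch below encodes that reading; the rule is an assumption inferred from the printed values, and the function name predict is hypothetical, not taken from the system that produced this log.

```python
# Threshold value as printed in every record of this log.
THRESHOLD = -3.3437


def predict(hand_tuned_score: float, threshold: float = THRESHOLD) -> str:
    """Assumed decision rule: "yes" if the score exceeds the threshold."""
    return "yes" if hand_tuned_score > threshold else "dontknow"


# (inference ID, hand-tuned score, response) copied from records in this log.
records = [
    (58, 0.9000, "yes"),
    (-58, -5.1500, "dontknow"),
    (62, -3.0000, "yes"),
    (63, -3.5000, "dontknow"),
]

for inference_id, score, response in records:
    # Each logged response agrees with the assumed rule.
    assert predict(score) == response, inference_id
```

Under this reading, inference 62 (score -3.0000) and inference 63 (score -3.5000) fall on opposite sides of the threshold, which matches their logged responses.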


Inference ID: -58

Txt: Kerry knew that Edwards would accept the nomination.

Hyp: Kerry did not know whether Edwards would accept the nomination. (Don't know.)

Kerry
NNP
did
VBD
not
RB
know
VB
whether
IN
Edwards
NNP
would
MD
accept
VB
the
DT
nomination
NN
.
.
Kerry:NNP   0.00 15.46 15.46 15.46 20.50   9.07 20.46 15.46 20.50 10.46 20.50
knew:VBD 15.46   4.04 19.96   0.50 20.00 15.50 19.96   5.35 20.00 14.92 17.19
that:IN 20.50 20.00 20.00 20.00 10.00 20.50 20.00 20.00 20.00 20.00 20.00
Edwards:NNP   9.07 15.50 15.46 15.50 20.50   0.00 20.46 15.50 20.50 10.50 20.50
would:MD 20.46 18.57 19.96 19.96 20.00 20.46   0.00 19.96 10.00 19.96 10.00
accept:VB 15.46   7.47 19.96   1.00 20.00 15.50 19.96   0.00 20.00 15.00 18.61
the:DT 20.50 20.00 20.00 20.00 20.00 20.50 10.00 20.00   0.00 20.00 10.00
nomination:NN 10.46 14.36 14.96 14.84 20.00 10.50 19.96 15.00 20.00   0.00 19.99
.:. 20.50 17.99 20.00 17.93 20.00 20.50 10.00 18.61 10.00 19.99   0.00
NO_WORD 10.00   1.00   9.00 10.00   1.00 10.00 10.00 10.00   1.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -11.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "know": neg; NullPunisher.other: not; NullPunisher.functionWord: whether; NullPunisher.aux: did
Hand-tuned score: -5.1500
Threshold: -3.3437


Inference ID: 59

Txt: Tom knows that Naples is in Campania.

Hyp: Tom knows where Naples is. (Yes.)

Tom
NNP
knows
VBZ
where
WRB
Naples
NNP
is
VBZ
.
.
Tom:NNP   0.00 15.00 19.96   9.84 14.34 20.00
knows:VBZ 15.00   0.00 19.96 15.50   8.07 20.00
that:IN 20.00 20.00 18.69 20.50 20.00 20.00
Naples:NNP   9.84 15.50 20.46   0.00 14.84 20.50
is:VBZ 14.34   8.07 19.96 14.84   0.00 20.00
Campania:NNP   9.84 15.50 20.46   2.00 14.84 20.50
.:. 20.00 20.00 10.00 20.50 20.00   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -10.0000
Features matched: Adjunct.addPosCxt: hyp added where[where-WRB]; Adjunct.dropPosCxt: text adjunct "Campania" of "is" dropped on aligned hyp word "is"; NullPunisher.other: where
Hand-tuned score: -0.5000
Threshold: -3.3437


Inference ID: -59

Txt: Tom knows that Naples is in Campania.

Hyp: Tom does not know where Naples is. (Don't know.)

Tom
NNP
does
VBZ
not
RB
know
VB
where
WRB
Naples
NNP
is
VBZ
.
.
Tom:NNP   0.00 10.17 14.96 15.00 19.96   9.84 14.34 20.00
knows:VBZ 15.00   2.16 19.96   0.50 19.96 15.50   8.07 20.00
that:IN 20.00 20.00 20.00 20.00 18.69 20.50 20.00 20.00
Naples:NNP   9.84 14.84 15.46 15.50 20.46   0.00 14.84 20.50
is:VBZ 14.34   9.34 19.96   8.07 19.96 14.84   0.00 20.00
Campania:NNP   9.84 14.84 15.46 15.50 20.46   2.00 14.84 20.50
.:. 20.00 20.00 20.00 17.93 10.00 20.50 20.00   0.00
NO_WORD 10.00   1.00   9.00 10.00 10.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -20.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "know": neg; NullPunisher.aux: does; NullPunisher.other: not; NullPunisher.other: where
Hand-tuned score: -6.0500
Threshold: -3.3437


Inference ID: 60

Txt: We met in September during the feast.

Hyp: The feast took place in September. (Yes.)

The
DT
feast
NN
took_place
VBD
September
NNP
.
.
We:PRP 20.00 12.00 15.00 12.50 20.00
met:VBD 20.00 10.15   8.51 15.50 19.15
September:NNP 20.50 10.50 14.84   0.00 20.50
the:DT   0.00 20.00 20.00 20.50 10.00
feast:NN 20.00   0.00 13.28 10.50 20.00
.:. 10.00 20.00 20.00 20.50   0.00
NO_WORD   1.00 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -10.5097
Features matched: Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 09/01/1000; RootEntailment.poorlyAlignedRoot: "took_place" aligned badly to "met"; Structure.relMismatch: text "feast" is prep_during of "met" while hyp "feast" is nsubj of "took_place" which aligned to text "met"
Hand-tuned score: -1.0000
Threshold: -3.3437


Inference ID: -60

Txt: We met in September during the feast.

Hyp: The feast did not take place in September. (Don't know.)

The
DT
feast
NN
did
VBD
not
RB
take_place
VB
September
NNP
.
.
We:PRP 20.00 12.00 15.00 20.00 15.00 12.50 20.00
met:VBD 20.00 10.15   5.11 19.96   8.51 15.50 19.15
September:NNP 20.50 10.50 14.19 15.46 14.84   0.00 20.50
the:DT   0.00 20.00 20.00 20.00 20.00 20.50 10.00
feast:NN 20.00   0.00   9.07 14.96 13.28 10.50 20.00
.:. 10.00 20.00 17.99 20.00 20.00 20.50   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -20.5097
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 09/01/1000; Polarity.hypNegMarker: "take_place": neg; NullPunisher.other: not; NullPunisher.aux: did; RootEntailment.poorlyAlignedRoot: "take_place" aligned badly to "met"; Structure.relMismatch: text "feast" is prep_during of "met" while hyp "feast" is nsubj of "take_place" which aligned to text "met"
Hand-tuned score: -7.0500
Threshold: -3.3437


Inference ID: 61

Txt: It is false that Bin Laden was seen in Tora Bora.

Hyp: Bin Laden was seen in Tora Bora. (Don't know.)

Bin_Laden
NNP
was
VBD
seen
VBN
Tora_Bora
NNP
.
.
It:PRP 12.50 15.00 15.00 12.50 20.00
is:VBZ 14.84   0.50   7.74 15.46 20.00
false:JJ 12.46 11.96 11.96 12.46 18.76
that:IN 20.50 20.00 20.00 20.50 20.00
Bin_Laden:NNP   0.00 14.84 14.84 14.96 20.50
was:VBD 14.84   0.00   7.11 15.46 20.00
seen:VBN 14.84   7.11   0.00 15.46 18.17
Tora_Bora:NNP 14.96 15.46 15.46   0.00 20.50
.:. 20.50 20.00 18.17 20.50   0.00
NO_WORD 10.00   1.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.negativeStatement: non factive text : false-JJ; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "false" vs. hyp "." <-punct-- "seen", which aligned to text "seen"
Hand-tuned score: -7.0000
Threshold: -3.3437


Inference ID: -61

Txt: It is false that Bin Laden was seen in Tora Bora.

Hyp: Bin Laden was not seen in Tora Bora. (Yes.)

Bin_Laden
NNP
was
VBD
not
RB
seen
VBN
Tora_Bora
NNP
.
.
It:PRP 12.50 15.00 20.00 15.00 12.50 20.00
is:VBZ 14.84   0.50 19.96   7.74 15.46 20.00
false:JJ 12.46 11.96 11.96 11.96 12.46 18.76
that:IN 20.50 20.00 20.00 20.00 20.50 20.00
Bin_Laden:NNP   0.00 14.84 15.46 14.84 14.96 20.50
was:VBD 14.84   0.00 19.96   7.11 15.46 20.00
seen:VBN 14.84   7.11 19.96   0.00 15.46 18.17
Tora_Bora:NNP 14.96 15.46 15.46 15.46   0.00 20.50
.:. 20.50 20.00 20.00 18.17 20.50   0.00
NO_WORD 10.00   1.00   9.00 10.00 10.00 10.00

Response: dontknow (INCORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.negativeStatement: non factive text : false-JJ; Polarity.hypNegMarker: "seen": neg; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "false" vs. hyp "." <-punct-- "seen", which aligned to text "seen"
Hand-tuned score: -13.0000
Threshold: -3.3437


Inference ID: 62

Txt: It follows that Bin Laden was in Tora Bora.

Hyp: Bin Laden was in Tora Bora. (Yes.)

Bin_Laden
NNP
was
VBD
Tora_Bora
NNP
.
.
It:PRP 12.50 15.00 12.50 20.00
follows:VBZ 15.50 10.00 15.46 17.41
that:IN 20.50 20.00 20.50 20.00
Bin_Laden:NNP   0.00 14.84 14.96 20.50
was:VBD 14.84   0.00 15.46 20.00
Tora_Bora:NNP 14.96 15.46   0.00 20.50
.:. 20.50 20.00 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.factivePassage: factive entails : follows-VBZ; Location.mismatch: no clear info of matching: be(X, prep_in); Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "follows" vs. hyp "." <-punct-- "was", which aligned to text "was"
Hand-tuned score: -3.0000
Threshold: -3.3437


Inference ID: -62

Txt: It follows that Bin Laden was in Tora Bora.

Hyp: Bin Laden was not in Tora Bora. (Don't know.)

Bin_Laden
NNP
was
VBD
not
RB
Tora_Bora
NNP
.
.
It:PRP 12.50 15.00 20.00 12.50 20.00
follows:VBZ 15.50 10.00 19.96 15.46 17.41
that:IN 20.50 20.00 20.00 20.50 20.00
Bin_Laden:NNP   0.00 14.84 15.46 14.96 20.50
was:VBD 14.84   0.00 19.96 15.46 20.00
Tora_Bora:NNP 14.96 15.46 15.46   0.00 20.50
.:. 20.50 20.00 20.00 20.50   0.00
NO_WORD 10.00 10.00   9.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.factivePassage: factive entails : follows-VBZ; Polarity.hypNegMarker: "was": neg; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "follows" vs. hyp "." <-punct-- "was", which aligned to text "was"
Hand-tuned score: -7.0000
Threshold: -3.3437


Inference ID: 63

Txt: It is likely that Bin Laden was in Tora Bora.

Hyp: Bin Laden was in Tora Bora. (Don't know.)

Bin_Laden
NNP
was
VBD
Tora_Bora
NNP
.
.
It:PRP 12.50 15.00 12.50 20.00
is:VBZ 14.84   0.50 15.46 20.00
likely:JJ 12.46 11.96 12.46 19.92
that:IN 20.50 20.00 20.50 20.00
Bin_Laden:NNP   0.00 14.84 14.96 20.50
was:VBD 14.84   0.00 15.46 20.00
Tora_Bora:NNP 14.96 15.46   0.00 20.50
.:. 20.50 20.00 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.unknownPassage: non factive text -- unknown: likely-JJ; Location.mismatch: no clear info of matching: be(X, prep_in); Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "likely" vs. hyp "." <-punct-- "was", which aligned to text "was"
Hand-tuned score: -3.5000
Threshold: -3.3437


Inference ID: -63

Txt: It is likely that Bin Laden was in Tora Bora.

Hyp: Bin Laden was not in Tora Bora. (Don't know.)

Bin_Laden
NNP
was
VBD
not
RB
Tora_Bora
NNP
.
.
It:PRP 12.50 15.00 20.00 12.50 20.00
is:VBZ 14.84   0.50 19.96 15.46 20.00
likely:JJ 12.46 11.96 11.96 12.46 19.92
that:IN 20.50 20.00 20.00 20.50 20.00
Bin_Laden:NNP   0.00 14.84 15.46 14.96 20.50
was:VBD 14.84   0.00 19.96 15.46 20.00
Tora_Bora:NNP 14.96 15.46 15.46   0.00 20.50
.:. 20.50 20.00 20.00 20.50   0.00
NO_WORD 10.00 10.00   9.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.unknownPassage: non factive text -- unknown: likely-JJ; Polarity.hypNegMarker: "was": neg; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "likely" vs. hyp "." <-punct-- "was", which aligned to text "was"
Hand-tuned score: -7.5000
Threshold: -3.3437


Inference ID: 64

Txt: Tony Hall left Amman on Sunday.

Hyp: Tony Hall was in Amman on Sunday. (Yes.)

Tony_Hall
NNP
was
VBD
Amman
NNP
Sunday
NNP
.
.
Tony_Hall:NNP   0.00 15.17 14.67 14.96 20.50
left:VBD 15.17   7.11 11.79 15.50 19.46
Amman:NNP 14.67 11.83   0.00 15.00 20.50
Sunday:NNP 14.96 15.50 15.00   0.00 20.50
.:. 20.50 20.00 20.50 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -9.1114
Features matched: Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Location.mismatch: no clear info of matching: be(X, prep_in); RootEntailment.poorlyAlignedRoot: "was" aligned badly to "left"; Structure.relMismatch: text "Amman" is dobj of "left" while hyp "Amman" is prep_in of "was" which aligned to text "left"
Hand-tuned score: -3.0000
Threshold: -3.3437


Inference ID: -64

Txt: Tony Hall left Amman on Sunday.

Hyp: Tony Hall was not in Amman on Sunday. (Don't know.)

Tony_Hall
NNP
was
VBD
not
RB
Amman
NNP
Sunday
NNP
.
.
Tony_Hall:NNP   0.00 15.17 15.46 14.67 14.96 20.50
left:VBD 15.17   7.11 19.96 11.79 15.50 19.46
Amman:NNP 14.67 11.83 15.46   0.00 15.00 20.50
Sunday:NNP 14.96 15.50 15.46 15.00   0.00 20.50
.:. 20.50 20.00 20.00 20.50 20.50   0.00
NO_WORD 10.00 10.00   9.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -18.1114
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Polarity.hypNegMarker: "was": neg; NullPunisher.other: not; RootEntailment.poorlyAlignedRoot: "was" aligned badly to "left"; Structure.relMismatch: text "Amman" is dobj of "left" while hyp "Amman" is prep_in of "was" which aligned to text "left"
Hand-tuned score: -7.0000
Threshold: -3.3437


Inference ID: 65

Txt: Tony Hall left Amman on Sunday.

Hyp: Tony Hall was in Amman on Saturday. (Don't know.)

Tony_Hall
NNP
was
VBD
Amman
NNP
Saturday
NNP
.
.
Tony_Hall:NNP   0.00 15.17 14.67 14.96 20.50
left:VBD 15.17   7.11 11.79 15.50 19.46
Amman:NNP 14.67 11.83   0.00 15.00 20.50
Sunday:NNP 14.96 15.50 15.00   4.13 20.50
.:. 20.50 20.00 20.50 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -13.2437
Features matched: Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Location.mismatch: no clear info of matching: be(X, prep_in); Numeric.mismatch: date Saturday != Sunday; RootEntailment.poorlyAlignedRoot: "was" aligned badly to "left"; Structure.relMismatch: text "Amman" is dobj of "left" while hyp "Amman" is prep_in of "was" which aligned to text "left"
Hand-tuned score: -9.0000
Threshold: -3.3437


Inference ID: -65

Txt: Tony Hall left Amman on Sunday.

Hyp: Tony Hall was not in Amman on Saturday. (Don't know.)

Tony_Hall
NNP
was
VBD
not
RB
Amman
NNP
Saturday
NNP
.
.
Tony_Hall:NNP   0.00 15.17 15.46 14.67 14.96 20.50
left:VBD 15.17   7.11 19.96 11.79 15.50 19.46
Amman:NNP 14.67 11.83 15.46   0.00 15.00 20.50
Sunday:NNP 14.96 15.50 15.46 15.00   4.13 20.50
.:. 20.50 20.00 20.00 20.50 20.50   0.00
NO_WORD 10.00 10.00   9.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -22.2437
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Polarity.hypNegMarker: "was": neg; NullPunisher.other: not; Numeric.mismatch: date Saturday != Sunday; RootEntailment.poorlyAlignedRoot: "was" aligned badly to "left"; Structure.relMismatch: text "Amman" is dobj of "left" while hyp "Amman" is prep_in of "was" which aligned to text "left"
Hand-tuned score: -13.0000
Threshold: -3.3437


Inference ID: 66

Txt: Khan sold 10 centrifuges to North Korea.

Hyp: North Korea bought 10 centrifuges. (Yes.)

North_Korea
NNP
bought
VBD
10
CD
centrifuges
NNS
.
.
Khan:NNP 14.02 15.50 25.00   8.53 20.50
sold:VBD 15.50   2.96 20.00 14.23 19.42
10:CD 24.34 19.52   0.00 20.50 19.16
centrifuges:NNS   9.84 15.00 20.50   0.00 19.93
North_Korea:NNP   0.00 15.50 24.34   9.84 20.50
.:. 20.50 19.32 19.16 19.93   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -4.9562
Features matched: Structure.relMismatch: text "North_Korea" is prep_to of "sold" while hyp "North_Korea" is nsubj of "bought" which aligned to text "sold"
Hand-tuned score: 0.0000
Threshold: -3.3437


Inference ID: -66

Txt: Khan sold 10 centrifuges to North Korea.

Hyp: North Korea did not buy 10 centrifuges. (Don't know.)

North_Korea
NNP
did
VBD
not
RB
buy
VB
10
CD
centrifuges
NNS
.
.
Khan:NNP 14.02 15.50 15.46 15.50 25.00   8.53 20.50
sold:VBD 15.50   7.69 19.96   6.51 20.00 14.23 19.42
10:CD 24.34 19.19 20.46 20.03   0.00 20.50 19.16
centrifuges:NNS   9.84 15.00 14.96 14.88 20.50   0.00 19.93
North_Korea:NNP   0.00 14.52 15.46 15.50 24.34   9.84 20.50
.:. 20.50 17.99 20.00 20.00 19.16 19.93   0.00
NO_WORD 10.00   1.00   9.00 10.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -18.5110
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "buy": neg; NullPunisher.other: not; NullPunisher.aux: did; RootEntailment.poorlyAlignedRoot: "buy" aligned badly to "sold"; Structure.relMismatch: text "North_Korea" is prep_to of "sold" while hyp "North_Korea" is nsubj of "buy" which aligned to text "sold"
Hand-tuned score: -8.0500
Threshold: -3.3437


Inference ID: 67

Txt: The US invasion of Afghanistan prevented Al-Qaida from attacking Ryad in 2002.

Hyp: Al-Qaida attacked Ryad in 2002. (Don't know.)

Al-Qaida
NNP
attacked
VBD
Ryad
NNP
2002
CD
.
.
The:DT 20.50 20.00 20.00 20.50 10.00
US:NNP 10.00 15.50 10.46 24.96 20.50
invasion:NN 10.50 10.39   9.96 20.46 20.00
Afghanistan:NNP 10.00 15.50 10.46 24.96 20.50
prevented:VBD 15.50   6.64 14.96 20.37 18.96
Al-Qaida:NNP   0.50 15.00   9.96 20.46 20.00
from:IN 20.50 20.00 20.00 20.50 20.00
attacking:VBG 15.50   0.50 14.96 20.26 19.81
Ryad:NNP   9.96 15.46   0.50 24.96 20.50
2002:CD 24.96 19.46 20.46   0.00 20.50
.:. 20.50 20.00 20.00 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -5.5000
Features matched: Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/2002; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "prevented" vs. hyp "." <-punct-- "attacked", which aligned to text "attacking" args have different parents, different relations: text "Al-Qaida" <-dobj-- "prevented" vs. hyp "Al-Qaida" <-nsubj-- "attacked", which aligned to text "attacking"
Hand-tuned score: -1.0000
Threshold: -3.3437


Inference ID: -67

Txt: The US invasion of Afghanistan prevented Al-Qaida from attacking Ryad in 2002.

Hyp: Al-Qaida did not attack Ryad in 2002. (Yes.)

Al-Qaida
NNP
did
VBD
not
RB
attack
VB
Ryad
NNP
2002
CD
.
.
The:DT 20.00 20.00 20.00 20.00 20.00 20.50 10.00
US:NNP 10.50 15.50 15.46 15.50 10.46 24.96 20.50
invasion:NN 10.00 12.69 14.96 10.06   9.96 20.46 20.00
Afghanistan:NNP 10.50 15.50 15.46 15.50 10.46 24.96 20.50
prevented:VBD 15.00   9.29 19.96   6.37 14.96 20.37 18.96
Al-Qaida:NNP   0.00 15.00 14.96 15.00   9.96 20.46 20.00
from:IN 20.00 20.00 20.00 20.00 20.00 20.50 20.00
attacking:VBG 15.00   7.62 19.96   0.50 14.96 20.26 19.81
Ryad:NNP 10.46 15.46 15.46 15.46   0.50 24.96 20.50
2002:CD 20.46 20.46 20.46 20.44 20.46   0.00 20.50
.:. 20.00 17.99 20.00 20.00 20.00 20.50   0.00
NO_WORD 10.00   1.00   9.00 10.00 10.00 10.00 10.00

Response: dontknow (INCORRECT)
Justification:
Alignment score: -15.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/2002; Polarity.hypNegMarker: "attack": neg; NullPunisher.other: not; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "prevented" vs. hyp "." <-punct-- "attack", which aligned to text "attacking" args have different parents, different relations: text "Al-Qaida" <-dobj-- "prevented" vs. hyp "Al-Qaida" <-nsubj-- "attack", which aligned to text "attacking"
Hand-tuned score: -7.0500
Threshold: -3.3437


Inference ID: 68

Txt: The administration managed to track down the perpetrators.

Hyp: The administration tracked down the perpetrators. (Yes.)

The
DT
administration
NN
tracked_down
VBD
the
DT
perpetrators
NNS
.
.
The:DT   0.00 20.00 20.00   0.00 20.00 10.00
administration:NN 20.00   0.00 13.86 20.00   9.76 19.88
managed:VBD 20.00 13.37 10.00 20.00 15.00 20.00
to:TO 10.00 20.00 20.00 10.00 20.00 10.00
track_down:VB 20.00 13.86   0.00 20.00 14.02 20.00
the:DT   0.00 20.00 20.00   0.00 20.00 10.00
perpetrators:NNS 20.00   9.76 14.02 20.00   0.00 19.30
.:. 10.00 19.88 20.00 10.00 19.30   0.00
NO_WORD   1.00 10.00 10.00   1.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -4.0000
Features matched: Structure.argsMismatch: args have different parents but same relations: text "administration" <-nsubj-- "managed" vs. hyp "administration" <-nsubj-- "tracked_down", which aligned to text "track_down" args have different parents but same relations: text "." <-punct-- "managed" vs. hyp "." <-punct-- "tracked_down", which aligned to text "track_down"
Hand-tuned score: -2.0000
Threshold: -3.3437


Inference ID: -68

Txt: The administration managed to track down the perpetrators.

Hyp: The administration did not track down the perpetrators. (Don't know.)

The
DT
administration
NN
did
VBD
not
RB
track_down
VB
the
DT
perpetrators
NNS
.
.
The:DT   0.00 20.00 20.00 20.00 20.00   0.00 20.00 10.00
administration:NN 20.00   0.00 12.69 14.96 13.86 20.00   9.76 19.88
managed:VBD 20.00 13.37   1.52 19.96 10.00 20.00 15.00 20.00
to:TO 10.00 20.00 20.00 20.00 20.00 10.00 20.00 10.00
track_down:VB 20.00 13.86   7.72 19.96   0.00 20.00 14.02 20.00
the:DT   0.00 20.00 20.00 20.00 20.00   0.00 20.00 10.00
perpetrators:NNS 20.00   9.76 14.73 14.96 14.02 20.00   0.00 19.30
.:. 10.00 19.88 17.99 20.00 20.00 10.00 19.30   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00   1.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -14.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "track_down": neg; NullPunisher.other: not; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "administration" <-nsubj-- "managed" vs. hyp "administration" <-nsubj-- "track_down", which aligned to text "track_down" args have different parents but same relations: text "." <-punct-- "managed" vs. hyp "." <-punct-- "track_down", which aligned to text "track_down"
Hand-tuned score: -8.0500
Threshold: -3.3437


Inference ID: 69

Txt: The administration didn't manage to track down the perpetrators.

Hyp: The administration tracked down the perpetrators. (Don't know.)

The
DT
administration
NN
tracked_down
VBD
the
DT
perpetrators
NNS
.
.
The:DT   0.00 20.00 20.00   0.00 20.00 10.00
administration:NN 20.00   0.00 13.86 20.00   9.76 19.88
did:VBD 20.00 12.69   7.72 20.00 14.73 17.99
n't:RB 20.00 14.96 19.96 20.00 13.39 17.90
manage:VB 20.00 15.00 10.00 20.00 14.77 19.98
to:TO 10.00 20.00 20.00 10.00 20.00 10.00
track_down:VB 20.00 13.86   0.00 20.00 14.02 20.00
the:DT   0.00 20.00 20.00   0.00 20.00 10.00
perpetrators:NNS 20.00   9.76 14.02 20.00   0.00 19.30
.:. 10.00 19.88 20.00 10.00 19.30   0.00
NO_WORD   1.00 10.00 10.00   1.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -4.0000
Features matched: Structure.argsMismatch: args have different parents but same relations: text "administration" <-nsubj-- "manage" vs. hyp "administration" <-nsubj-- "tracked_down", which aligned to text "track_down" args have different parents but same relations: text "." <-punct-- "manage" vs. hyp "." <-punct-- "tracked_down", which aligned to text "track_down"
Hand-tuned score: -2.0000
Threshold: -3.3437


Inference ID: -69

Txt: The administration didn't manage to track down the perpetrators.

Hyp: The administration did not track down the perpetrators. (Yes.)

The
DT
administration
NN
did
VBD
not
RB
track_down
VB
the
DT
perpetrators
NNS
.
.
The:DT   0.00 20.00 20.00 20.00 20.00   0.00 20.00 10.00
administration:NN 20.00   0.00 12.69 14.96 13.86 20.00   9.76 19.88
did:VBD 20.00 12.69   0.00 19.96   7.72 20.00 14.73 17.99
n't:RB 20.00 14.96 13.27   0.50 19.96 20.00 13.39 17.90
manage:VB 20.00 15.00   1.52 19.96 10.00 20.00 14.77 19.98
to:TO 10.00 20.00 20.00 20.00 20.00 10.00 20.00 10.00
track_down:VB 20.00 13.86   7.72 19.96   0.00 20.00 14.02 20.00
the:DT   0.00 20.00 20.00 20.00 20.00   0.00 20.00 10.00
perpetrators:NNS 20.00   9.76 14.73 14.96 14.02 20.00   0.00 19.30
.:. 10.00 19.88 17.99 20.00 20.00 10.00 19.30   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00   1.00 10.00 10.00

Response: dontknow (INCORRECT)
Justification:
Alignment score: -7.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "track_down": neg; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "administration" <-nsubj-- "manage" vs. hyp "administration" <-nsubj-- "track_down", which aligned to text "track_down" args have different parents but same relations: text "n't" <-neg-- "manage" vs. hyp "not" <-neg-- "track_down", which aligned to text "track_down" args have different parents but same relations: text "." <-punct-- "manage" vs. hyp "." <-punct-- "track_down", which aligned to text "track_down"
Hand-tuned score: -7.0500
Threshold: -3.3437


Inference ID: 70

Txt: Bush didn't have the time to read the report.

Hyp: Bush read the report. (Don't know.)

Bush
NNP
read
VBP
the
DT
report
NN
.
.
Bush:NNP   0.00 13.95 20.00 10.00 20.00
did:VBD 15.00   6.81 20.00 12.69 17.99
n't:RB 14.96 18.98 20.00 14.01 17.90
have:VB 13.05   1.00 20.00 12.80 20.00
the:DT 20.00 20.00   0.00 20.00 10.00
time:NN 10.00 12.32 20.00   6.89 17.52
to:TO 20.00 20.00 10.00 20.00 10.00
read:VB 13.95   0.00 20.00 11.87 18.96
the:DT 20.00 20.00   0.00 20.00 10.00
report:NN 10.00 11.87 20.00   0.00 19.87
.:. 20.00 18.96 10.00 19.87   0.00
NO_WORD 10.00 10.00   1.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -4.0000
Features matched: Factive.inNegatedEmbedding: embedded negative text; Structure.argsMismatch: args have different parents but same relations: text "Bush" <-nsubj-- "have" vs. hyp "Bush" <-nsubj-- "read", which aligned to text "read" args have different parents but same relations: text "." <-punct-- "have" vs. hyp "." <-punct-- "read", which aligned to text "read"
Hand-tuned score: -7.0000
Threshold: -3.3437


Inference ID: -70

Txt: Bush didn't have the time to read the report.

Hyp: Bush did not read the report. (Yes.)

Bush
NNP
did
VBD
not
RB
read
VB
the
DT
report
NN
.
.
Bush:NNP   0.00 15.00 14.96 13.95 20.00 10.00 20.00
did:VBD 15.00   0.00 19.96   6.81 20.00 12.69 17.99
n't:RB 14.96 13.27   0.50 18.98 20.00 14.01 17.90
have:VB 13.05   7.32 19.96   1.00 20.00 12.80 20.00
the:DT 20.00 20.00 20.00 20.00   0.00 20.00 10.00
time:NN 10.00 11.26 14.96 12.32 20.00   6.89 17.52
to:TO 20.00 20.00 20.00 20.00 10.00 20.00 10.00
read:VB 13.95   6.81 19.96   0.00 20.00 11.87 18.96
the:DT 20.00 20.00 20.00 20.00   0.00 20.00 10.00
report:NN 10.00 12.69 14.96 11.87 20.00   0.00 19.87
.:. 20.00 17.99 20.00 18.96 10.00 19.87   0.00
NO_WORD 10.00   1.00   9.00 10.00   1.00 10.00 10.00

Response: dontknow (INCORRECT)
Justification:
Alignment score: -7.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.inNegatedEmbedding: embedded negative text; Polarity.hypNegMarker: "read": neg; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "Bush" <-nsubj-- "have" vs. hyp "Bush" <-nsubj-- "read", which aligned to text "read" args have different parents but same relations: text "n't" <-neg-- "have" vs. hyp "not" <-neg-- "read", which aligned to text "read" args have different parents but same relations: text "." <-punct-- "have" vs. hyp "." <-punct-- "read", which aligned to text "read"
Hand-tuned score: -12.0500
Threshold: -3.3437


Inference ID: 71

Txt: Bush had the time to read the report.

Hyp: Bush read the report. (Yes.)

Bush
NNP
read
VBP
the
DT
report
NN
.
.
Bush:NNP   0.00 13.95 20.00 10.00 20.00
had:VBD 13.05   5.95 20.00 12.80 20.00
the:DT 20.00 20.00   0.00 20.00 10.00
time:NN 10.00 12.32 20.00   6.89 17.52
to:TO 20.00 20.00 10.00 20.00 10.00
read:VB 13.95   0.00 20.00 11.87 18.96
the:DT 20.00 20.00   0.00 20.00 10.00
report:NN 10.00 11.87 20.00   0.00 19.87
.:. 20.00 18.96 10.00 19.87   0.00
NO_WORD 10.00 10.00   1.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -4.0000
Features matched: Factive.inPositiveEmbedding: embedded positive text; Structure.argsMismatch: args have different parents but same relations: text "Bush" <-nsubj-- "had" vs. hyp "Bush" <-nsubj-- "read", which aligned to text "read" args have different parents but same relations: text "." <-punct-- "had" vs. hyp "." <-punct-- "read", which aligned to text "read"
Hand-tuned score: -1.0000
Threshold: -3.3437


Inference ID: -71

Txt: Bush had the time to read the report.

Hyp: Bush did not read the report. (Don't know.)

Bush
NNP
did
VBD
not
RB
read
VB
the
DT
report
NN
.
.
Bush:NNP   0.00 15.00 14.96 13.95 20.00 10.00 20.00
had:VBD 13.05   7.32 19.96   5.95 20.00 12.80 20.00
the:DT 20.00 20.00 20.00 20.00   0.00 20.00 10.00
time:NN 10.00 11.26 14.96 12.32 20.00   6.89 17.52
to:TO 20.00 20.00 20.00 20.00 10.00 20.00 10.00
read:VB 13.95   6.81 19.96   0.00 20.00 11.87 18.96
the:DT 20.00 20.00 20.00 20.00   0.00 20.00 10.00
report:NN 10.00 12.69 14.96 11.87 20.00   0.00 19.87
.:. 20.00 17.99 20.00 18.96 10.00 19.87   0.00
NO_WORD 10.00   1.00   9.00 10.00   1.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -14.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.inPositiveEmbedding: embedded positive text; Polarity.hypNegMarker: "read": neg; NullPunisher.aux: did; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "Bush" <-nsubj-- "had" vs. hyp "Bush" <-nsubj-- "read", which aligned to text "read" args have different parents but same relations: text "." <-punct-- "had" vs. hyp "." <-punct-- "read", which aligned to text "read"
Hand-tuned score: -7.0500
Threshold: -3.3437


Inference ID: 72

Txt: The president wasn't able to attend the meeting.

Hyp: The president attended the meeting. (Don't know.)

The
DT
president
NN
attended
VBD
the
DT
meeting
NN
.
.
The:DT   0.00 20.00 20.00   0.00 20.00 10.00
president:NN 20.00   0.00 14.56 20.00   8.35 20.00
was:VBD 20.00 14.34 10.00 20.00 12.52 20.00
n't:RB 20.00 14.96 19.96 20.00 14.96 17.90
able:JJ 20.00 11.96 11.96 20.00 11.96 17.24
to:TO 10.00 20.00 20.00 10.00 20.00 10.00
attend:VB 20.00 14.16   0.50 20.00 11.65 19.67
the:DT   0.00 20.00 20.00   0.00 20.00 10.00
meeting:NN 20.00   8.35 11.39 20.00   0.00 19.92
.:. 10.00 20.00 19.97 10.00 19.92   0.00
NO_WORD   1.00 10.00 10.00   1.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -4.5000
Features matched: Structure.argsMismatch: args have different parents but same relations: text "president" <-nsubj-- "able" vs. hyp "president" <-nsubj-- "attended", which aligned to text "attend" args have different parents but same relations: text "." <-punct-- "able" vs. hyp "." <-punct-- "attended", which aligned to text "attend"
Hand-tuned score: -2.0000
Threshold: -3.3437


Inference ID: -72

Txt: The president wasn't able to attend the meeting.

Hyp: The president did not attend the meeting. (Yes.)

The
DT
president
NN
did
VBD
not
RB
attend
VB
the
DT
meeting
NN
.
.
The:DT   0.00 20.00 20.00 20.00 20.00   0.00 20.00 10.00
president:NN 20.00   0.00 15.00 14.96 14.16 20.00   8.35 20.00
was:VBD 20.00 14.34 10.00 19.96 10.00 20.00 12.52 20.00
n't:RB 20.00 14.96 13.27   0.50 18.46 20.00 14.96 17.90
able:JJ 20.00 11.96 11.55 11.96 10.38 20.00 11.96 17.24
to:TO 10.00 20.00 20.00 20.00 20.00 10.00 20.00 10.00
attend:VB 20.00 14.16   8.17 19.96   0.00 20.00 11.65 19.67
the:DT   0.00 20.00 20.00 20.00 20.00   0.00 20.00 10.00
meeting:NN 20.00   8.35 12.69 14.96 11.65 20.00   0.00 19.92
.:. 10.00 20.00 17.99 20.00 19.67 10.00 19.92   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00   1.00 10.00 10.00

Response: dontknow (INCORRECT)
Justification:
Alignment score: -7.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "attend": neg; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "president" <-nsubj-- "able" vs. hyp "president" <-nsubj-- "attend", which aligned to text "attend" args have different parents but same relations: text "n't" <-neg-- "able" vs. hyp "not" <-neg-- "attend", which aligned to text "attend" args have different parents but same relations: text "." <-punct-- "able" vs. hyp "." <-punct-- "attend", which aligned to text "attend"
Hand-tuned score: -7.0500
Threshold: -3.3437


Inference ID: 73

Txt: The president was able to attend to meeting.

Hyp: The president attended the meeting. (Yes.)

The
DT
president
NN
attended
VBD
the
DT
meeting
NN
.
.
The:DT   0.00 20.00 20.00   0.00 20.00 10.00
president:NN 20.00   0.00 14.56 20.00   8.35 20.00
was:VBD 20.00 14.34 10.00 20.00 12.52 20.00
able:JJ 20.00 11.96 11.96 20.00 11.96 17.24
to:TO 10.00 20.00 20.00 10.00 20.00 10.00
attend:VB 20.00 14.16   0.50 20.00 11.65 19.67
meeting:NN 20.00   8.35 11.39 20.00   0.00 19.92
.:. 10.00 20.00 19.97 10.00 19.92   0.00
NO_WORD   1.00 10.00 10.00   1.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -7.5000
Features matched: NullPunisher.article: the; Structure.argsMismatch: args have different parents but same relations: text "president" <-nsubj-- "able" vs. hyp "president" <-nsubj-- "attended", which aligned to text "attend" args have different parents but same relations: text "." <-punct-- "able" vs. hyp "." <-punct-- "attended", which aligned to text "attend" text "meeting" is prep_to of "attend" while hyp "meeting" is dobj of "attended" which aligned to text "attend"
Hand-tuned score: -2.1000
Threshold: -3.3437


Inference ID: -73

Txt: The president was able to attend to meeting.

Hyp: The president did not attend the meeting. (Don't know.)

The
DT
president
NN
did
VBD
not
RB
attend
VB
the
DT
meeting
NN
.
.
The:DT   0.00 20.00 20.00 20.00 20.00   0.00 20.00 10.00
president:NN 20.00   0.00 15.00 14.96 14.16 20.00   8.35 20.00
was:VBD 20.00 14.34 10.00 19.96 10.00 20.00 12.52 20.00
able:JJ 20.00 11.96 11.55 11.96 10.38 20.00 11.96 17.24
to:TO 10.00 20.00 20.00 20.00 20.00 10.00 20.00 10.00
attend:VB 20.00 14.16   8.17 19.96   0.00 20.00 11.65 19.67
meeting:NN 20.00   8.35 12.69 14.96 11.65 20.00   0.00 19.92
.:. 10.00 20.00 17.99 20.00 19.67 10.00 19.92   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00   1.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -17.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "attend": neg; NullPunisher.aux: did; NullPunisher.other: not; NullPunisher.article: the; Structure.argsMismatch: args have different parents but same relations: text "president" <-nsubj-- "able" vs. hyp "president" <-nsubj-- "attend", which aligned to text "attend" args have different parents but same relations: text "." <-punct-- "able" vs. hyp "." <-punct-- "attend", which aligned to text "attend" text "meeting" is prep_to of "attend" while hyp "meeting" is dobj of "attend" which aligned to text "attend"
Hand-tuned score: -8.1500
Threshold: -3.3437


Inference ID: 74

Txt: Many soldiers were killed in the ambush.

Hyp: All soldiers were killed in the ambush. (Don't know.)

All
DT
soldiers
NNS
were
VBD
killed
VBN
the
DT
ambush
NN
.
.
Many:JJ 20.00 11.96 11.96 11.96 20.00 11.96 20.00
soldiers:NNS 20.00   0.00 14.34   9.33 20.00   4.63 20.00
were:VBD 20.00 14.34   0.00   8.33 20.00 15.00 20.00
killed:VBN 20.00   9.33   8.33   0.00 20.00   9.69 20.00
the:DT 10.00 20.00 20.00 20.00   0.00 20.00 10.00
ambush:NN 20.00   4.63 15.00   9.69 20.00   0.00 20.00
.:. 10.00 20.00 20.00 20.00 10.00 20.00   0.00
NO_WORD 10.00 10.00   1.00 10.00   1.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -10.0000
Features matched: Adjunct.dropPosCxt: text adjunct "Many" of "soldiers" dropped on aligned hyp word "soldiers"; NullPunisher.other: All; Quant.expand: [many,all]
Hand-tuned score: -6.5000
Threshold: -3.3437
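
Note on reading the similarity rows: each row in the tables above is a txt label of the form word:TAG (or the bare label NO_WORD) followed by one number per hyp word. The following is a minimal Python sketch for loading such a row, not part of the tool that produced this report. The function names parse_similarity_row and best_column are hypothetical, and treating the numbers as costs (lower means more similar) is an assumption based only on identical words scoring 0.00 in this log.

```python
def parse_similarity_row(line):
    """Split one matrix row into (word, tag, values).

    Assumes the layout seen in this report: a 'word:TAG' label
    (or the bare label 'NO_WORD') followed by whitespace-separated numbers.
    """
    tokens = line.split()
    label, values = tokens[0], tokens[1:]
    if ":" in label:
        # Split on the last colon so labels such as '.:.' and '$:$' work.
        word, tag = label.rsplit(":", 1)
    else:
        # NO_WORD carries no part-of-speech tag.
        word, tag = label, None
    return word, tag, [float(v) for v in values]


def best_column(values):
    """Index of the lowest value in a row (assumption: lower = more similar)."""
    return min(range(len(values)), key=values.__getitem__)


row = "soldiers:NNS 20.00   0.00 14.34   9.33 20.00   4.63 20.00"
word, tag, values = parse_similarity_row(row)
print(word, tag, best_column(values))
```

For the row shown, the lowest value falls in the second column, which is the hyp word "soldiers" in Inference ID 74.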


Inference ID: -74

Txt: Many soldiers were killed in the ambush.

Hyp: Not all soldiers were killed in the ambush. (Yes.)

Not
RB
all
DT
soldiers
NNS
were
VBD
killed
VBN
the
DT
ambush
NN
.
.
Many:JJ 11.96 20.00 11.96 11.96 11.96 20.00 11.96 20.00
soldiers:NNS 14.96 20.00   0.00 14.34   9.33 20.00   4.63 20.00
were:VBD 19.96 20.00 14.34   0.00   8.33 20.00 15.00 20.00
killed:VBN 19.96 20.00   9.33   8.33   0.00 20.00   9.69 20.00
the:DT 20.00 10.00 20.00 20.00 20.00   0.00 20.00 10.00
ambush:NN 14.96 20.00   4.63 15.00   9.69 20.00   0.00 20.00
.:. 20.00 10.00 20.00 20.00 20.00 10.00 20.00   0.00
NO_WORD   9.00 10.00 10.00   1.00 10.00   1.00 10.00 10.00

Response: dontknow (INCORRECT)
Justification:
Alignment score: -19.0000
Features matched: Adjunct.addPosCxt: hyp added Not[Not-RB]; Adjunct.dropPosCxt: text adjunct "Many" of "soldiers" dropped on aligned hyp word "soldiers"; NullPunisher.other: Not; NullPunisher.other: all; Quant.expand: [many,all]
Hand-tuned score: -8.5000
Threshold: -3.3437


Inference ID: 75

Txt: The man had $20 in his pocket.

Hyp: The man had $40 in his pocket. (Don't know.)

The
DT
man
NN
had
VBD
$
$
40
CD
his
PRP$
pocket
NN
.
.
The:DT   0.00 20.00 20.00 10.50 20.50 20.00 20.00 10.00
man:NN 20.00   0.00 13.05 18.93 20.50 12.00   6.78 19.76
had:VBD 20.00 13.05   0.00 20.50 20.50 15.00 13.95 20.00
$:$ 10.50 18.93 20.50   0.00 20.00 20.50 17.92   9.91
20:CD 20.50 20.50 20.50 20.00   0.69 20.50 19.19 19.23
his:PRP$ 20.00 12.00 15.00 20.50 20.50   0.00 12.00 20.00
pocket:NN 20.00   6.78 13.95 17.92 19.19 12.00   0.00 18.62
.:. 10.00 19.76 20.00   9.91 18.57 20.00 18.62   0.00
NO_WORD   1.00 10.00 10.00 10.00 10.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -0.6931
Features matched: Numeric.mismatch: MONEY mismatch: '$40.0' vs '$20.0'
Hand-tuned score: -5.0000
Threshold: -3.3437


Inference ID: -75

Txt: The man had $20 in his pocket.

Hyp: The man did not have $40 in his pocket. (Yes.)

The
DT
man
NN
did
VBD
not
RB
have
VB
$
$
40
CD
his
PRP$
pocket
NN
.
.
The:DT   0.00 20.00 20.00 20.00 20.00 10.50 20.50 20.00 20.00 10.00
man:NN 20.00   0.00 12.63 14.96 13.05 18.93 20.50 12.00   6.78 19.76
had:VBD 20.00 13.05   7.32 19.96   0.50 20.50 20.50 15.00 13.95 20.00
$:$ 10.50 18.93 20.50 20.50 20.50   0.00 20.00 20.50 17.92   9.91
20:CD 20.50 20.50 19.19 20.46 20.50 20.00   0.69 20.50 19.19 19.23
his:PRP$ 20.00 12.00 15.00 20.00 15.00 20.50 20.50   0.00 12.00 20.00
pocket:NN 20.00   6.78 12.53 14.96 13.95 17.92 19.19 12.00   0.00 18.62
.:. 10.00 19.76 17.99 20.00 20.00   9.91 18.57 20.00 18.62   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00 10.00 10.00 10.00 10.00 10.00

Response: dontknow (INCORRECT)
Justification:
Alignment score: -11.1931
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "have": neg; NullPunisher.other: not; NullPunisher.aux: did; Numeric.mismatch: MONEY mismatch: '$40.0' vs '$20.0'
Hand-tuned score: -11.0500
Threshold: -3.3437


Inference ID: 76

Txt: The man had $20 in his pocket.

Hyp: The man had $10 in his pocket. (Yes.)

The
DT
man
NN
had
VBD
$
$
10
CD
his
PRP$
pocket
NN
.
.
The:DT   0.00 20.00 20.00 10.50 20.50 20.00 20.00 10.00
man:NN 20.00   0.00 13.05 18.93 20.50 12.00   6.78 19.76
had:VBD 20.00 13.05   0.00 20.50 20.50 15.00 13.95 20.00
$:$ 10.50 18.93 20.50   0.00 20.00 20.50 17.92   9.91
20:CD 20.50 20.50 20.50 20.00   0.69 20.50 19.19 19.23
his:PRP$ 20.00 12.00 15.00 20.50 20.50   0.00 12.00 20.00
pocket:NN 20.00   6.78 13.95 17.92 19.19 12.00   0.00 18.62
.:. 10.00 19.76 20.00   9.91 19.16 20.00 18.62   0.00
NO_WORD   1.00 10.00 10.00 10.00 10.00 10.00 10.00 10.00

Response: dontknow (INCORRECT)
Justification:
Alignment score: -0.6931
Features matched: Numeric.mismatch: MONEY mismatch: '$10.0' vs '$20.0'
Hand-tuned score: -5.0000
Threshold: -3.3437


Inference ID: -76

Txt: The man had $20 in his pocket.

Hyp: The man did not have $10 in his pocket. (Don't know.)

The
DT
man
NN
did
VBD
not
RB
have
VB
$
$
10
CD
his
PRP$
pocket
NN
.
.
The:DT   0.00 20.00 20.00 20.00 20.00 10.50 20.50 20.00 20.00 10.00
man:NN 20.00   0.00 12.63 14.96 13.05 18.93 20.50 12.00   6.78 19.76
had:VBD 20.00 13.05   7.32 19.96   0.50 20.50 20.50 15.00 13.95 20.00
$:$ 10.50 18.93 20.50 20.50 20.50   0.00 20.00 20.50 17.92   9.91
20:CD 20.50 20.50 19.19 20.46 20.50 20.00   0.69 20.50 19.19 19.23
his:PRP$ 20.00 12.00 15.00 20.00 15.00 20.50 20.50   0.00 12.00 20.00
pocket:NN 20.00   6.78 12.53 14.96 13.95 17.92 19.19 12.00   0.00 18.62
.:. 10.00 19.76 17.99 20.00 20.00   9.91 19.16 20.00 18.62   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00 10.00 10.00 10.00 10.00 10.00

Response: dontknow (CORRECT)
Justification:
Alignment score: -11.1931
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "have": neg; NullPunisher.aux: did; NullPunisher.other: not; Numeric.mismatch: MONEY mismatch: '$10.0' vs '$20.0'
Hand-tuned score: -11.0500
Threshold: -3.3437
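
Note on the Response lines: in every entry of this report, the response is "yes" when the hand-tuned score lies above the threshold and "dontknow" otherwise, and the CORRECT/INCORRECT flag matches a comparison of that response with the label in parentheses after the Hyp sentence. The sketch below reproduces that pattern in Python. It is inferred from the logged values only, not taken from the system's source; the function names are hypothetical, and whether the real comparison is strict is an assumption, since no score in the log equals the threshold.

```python
def respond(hand_tuned_score, threshold):
    """Answer 'yes' above the threshold, else 'dontknow'.

    Assumption: strict '>' comparison; the log never shows a tie.
    """
    return "yes" if hand_tuned_score > threshold else "dontknow"


def gold_label(hyp_line):
    """Extract the gold answer from a line such as 'Hyp: ... (Yes.)'."""
    annotation = hyp_line.rsplit("(", 1)[1].strip().rstrip(").")
    return "yes" if annotation.lower() == "yes" else "dontknow"


def verdict(hyp_line, hand_tuned_score, threshold):
    """Reproduce the CORRECT/INCORRECT flag printed after each response."""
    same = respond(hand_tuned_score, threshold) == gold_label(hyp_line)
    return "CORRECT" if same else "INCORRECT"


THRESHOLD = -3.3437  # value printed in every entry of this report

# Inference ID 76: score -5.0000, gold label "Yes."
print(verdict("Hyp: The man had $10 in his pocket. (Yes.)", -5.0, THRESHOLD))
```

Applied to Inference ID 76, the score of -5.0000 falls below the threshold, so the response is "dontknow" against a gold label of "Yes.", matching the INCORRECT flag in the log.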


Word similarity table built on Thu Jul 06 10:25:56 PDT 2006 using command:
java edu.stanford.nlp.rte.WordSimilarityGenerator -info /u/nlp/rte/data/byformat/align/stochastic/parc_predev.pipeline.align.xml -output /u/nlp/rte/data/byformat/wordsim/stochastic/parc_predev.pipeline.wordsim.html -lex.BasicWN off