Txt: Some students came to school by car.
Hyp: Some students came to school. (Yes.)
| Some DT |
students NNS |
came VBD |
school NN |
. . |
|
| Some:DT | 0.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| students:NNS | 20.00 | 0.00 | 15.00 | 0.75 | 20.00 |
| came:VBD | 20.00 | 15.00 | 0.00 | 12.45 | 17.90 |
| school:NN | 20.00 | 0.75 | 12.45 | 0.00 | 19.99 |
| car:NN | 20.00 | 8.95 | 13.49 | 7.06 | 19.69 |
| .:. | 10.00 | 20.00 | 17.90 | 19.99 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "car" of "came" dropped on aligned hyp word "came"; Location.mismatch: no clear info of matching: come(X, prep_to); Quant.contract: [some,some]; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 1.0000
Threshold: -3.3437
Txt: Some students came to school by car.
Hyp: No students came to school. (Don't know.)
| No DT |
students NNS |
came VBD |
school NN |
. . |
|
| Some:DT | 10.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| students:NNS | 20.00 | 0.00 | 15.00 | 0.75 | 20.00 |
| came:VBD | 20.00 | 15.00 | 0.00 | 12.45 | 17.90 |
| school:NN | 20.00 | 0.75 | 12.45 | 0.00 | 19.99 |
| car:NN | 20.00 | 8.95 | 13.49 | 7.06 | 19.69 |
| .:. | 10.00 | 20.00 | 17.90 | 19.99 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -10.0000
Features matched: Adjunct.dropPosCxt: text adjunct "car" of "came" dropped on aligned hyp word "came"; Antonym.samePol: matching polarity with antonyms: No & Some; Location.mismatch: no clear info of matching: come(X, prep_to); Quant.oneNo: [some,no[
Hand-tuned score: -14.5000
Threshold: -3.3437
Txt: No students came to school by car.
Hyp: Some students came to school. (Don't know.)
| Some DT |
students NNS |
came VBD |
school NN |
. . |
|
| No:DT | 10.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| students:NNS | 20.00 | 0.00 | 15.00 | 0.75 | 20.00 |
| came:VBD | 20.00 | 15.00 | 0.00 | 12.45 | 17.90 |
| school:NN | 20.00 | 0.75 | 12.45 | 0.00 | 19.99 |
| car:NN | 20.00 | 8.95 | 13.49 | 7.06 | 19.69 |
| .:. | 10.00 | 20.00 | 17.90 | 19.99 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -10.0000
Features matched: Adjunct.dropPosCxt: text adjunct "car" of "came" dropped on aligned hyp word "came"; Antonym.samePol: matching polarity with antonyms: Some & No; Location.mismatch: no clear info of matching: come(X, prep_to); Quant.oneNo: [no,some[
Hand-tuned score: -14.5000
Threshold: -3.3437
Txt: No students came to school by car.
Hyp: No students came to school. (Don't know.)
| No DT |
students NNS |
came VBD |
school NN |
. . |
|
| No:DT | 0.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| students:NNS | 20.00 | 0.00 | 15.00 | 0.75 | 20.00 |
| came:VBD | 20.00 | 15.00 | 0.00 | 12.45 | 17.90 |
| school:NN | 20.00 | 0.75 | 12.45 | 0.00 | 19.99 |
| car:NN | 20.00 | 8.95 | 13.49 | 7.06 | 19.69 |
| .:. | 10.00 | 20.00 | 17.90 | 19.99 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "car" of "came" dropped on aligned hyp word "came"; Location.mismatch: no clear info of matching: come(X, prep_to); Quant.bothNo: [no,no]; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 1.0000
Threshold: -3.3437
Txt: John drove legally.
Hyp: John drove. (Yes.)
| John NNP |
drove VBD |
. . |
|
| John:NNP | 0.00 | 13.53 | 20.50 |
| drove:VBD | 13.53 | 0.00 | 18.77 |
| legally:RB | 15.46 | 19.96 | 19.03 |
| .:. | 20.50 | 18.77 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "legally" of "drove" dropped on aligned hyp word "drove"; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 2.0000
Threshold: -3.3437
Txt: John drove legally.
Hyp: John did not drive. (Don't know.)
| John NNP |
did VBD |
not RB |
drive VB |
. . |
|
| John:NNP | 0.00 | 13.35 | 15.46 | 13.53 | 20.50 |
| drove:VBD | 13.53 | 8.93 | 19.96 | 0.50 | 18.77 |
| legally:RB | 15.46 | 17.86 | 9.96 | 19.50 | 19.03 |
| .:. | 20.50 | 17.99 | 20.00 | 19.58 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -10.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "drive": neg; NullPunisher.aux: did; NullPunisher.other: not
Hand-tuned score: -5.0500
Threshold: -3.3437
Txt: John drove predictably.
Hyp: John drove. (Yes.)
| John NNP |
drove VBD |
. . |
|
| John:NNP | 0.00 | 13.53 | 20.50 |
| drove:VBD | 13.53 | 0.00 | 18.77 |
| predictably:RB | 15.46 | 19.94 | 20.00 |
| .:. | 20.50 | 18.77 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "predictably" of "drove" dropped on aligned hyp word "drove"; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 2.0000
Threshold: -3.3437
Txt: John drove predictably.
Hyp: John did not drive. (Don't know.)
| John NNP |
did VBD |
not RB |
drive VB |
. . |
|
| John:NNP | 0.00 | 13.35 | 15.46 | 13.53 | 20.50 |
| drove:VBD | 13.53 | 8.93 | 19.96 | 0.50 | 18.77 |
| predictably:RB | 15.46 | 19.17 | 9.96 | 19.68 | 20.00 |
| .:. | 20.50 | 17.99 | 20.00 | 19.58 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -10.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "drive": neg; NullPunisher.aux: did; NullPunisher.other: not
Hand-tuned score: -5.0500
Threshold: -3.3437
Txt: Legally, John could drive.
Hyp: John drove. (Don't know.)
| John NNP |
drove VBD |
. . |
|
| Legally:RB | 15.46 | 19.96 | 20.00 |
| ,:, | 20.50 | 18.13 | 5.73 |
| John:NNP | 0.00 | 13.53 | 20.50 |
| could:MD | 20.46 | 19.96 | 10.00 |
| drive:VB | 13.53 | 0.50 | 19.58 |
| .:. | 20.50 | 18.77 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -0.5000
Features matched: Adjunct.dropPosCxt: text adjunct "Legally" of "drive" dropped on aligned hyp word "drove"; Modal.dontKnow: possible -> actual; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 0.0000
Threshold: -3.3437
Txt: Legally, John could drive.
Hyp: John did not drive. (Don't know.)
| John NNP |
did VBD |
not RB |
drive VB |
. . |
|
| Legally:RB | 15.46 | 19.96 | 9.96 | 19.96 | 20.00 |
| ,:, | 20.50 | 19.80 | 20.00 | 19.25 | 5.73 |
| John:NNP | 0.00 | 13.35 | 15.46 | 13.53 | 20.50 |
| could:MD | 20.46 | 17.84 | 19.96 | 19.96 | 10.00 |
| drive:VB | 13.53 | 6.26 | 19.96 | 0.00 | 19.58 |
| .:. | 20.50 | 17.99 | 20.00 | 19.58 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -10.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Modal.dontKnow: possible -> not actual; Polarity.hypNegMarker: "drive": neg; NullPunisher.other: not; NullPunisher.aux: did
Hand-tuned score: -7.0500
Threshold: -3.3437
Txt: Predictably, John drove.
Hyp: John drove. (Yes.)
| John NNP |
drove VBD |
. . |
|
| Predictably:RB | 15.46 | 19.96 | 20.00 |
| ,:, | 20.50 | 18.13 | 5.73 |
| John:NNP | 0.00 | 13.53 | 20.50 |
| drove:VBD | 13.53 | 0.00 | 18.77 |
| .:. | 20.50 | 18.77 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "Predictably" of "drove" dropped on aligned hyp word "drove"; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 2.0000
Threshold: -3.3437
Txt: Predictably, John drove.
Hyp: John did not drive. (Don't know.)
| John NNP |
did VBD |
not RB |
drive VB |
. . |
|
| Predictably:RB | 15.46 | 19.96 | 9.96 | 19.96 | 20.00 |
| ,:, | 20.50 | 19.80 | 20.00 | 19.25 | 5.73 |
| John:NNP | 0.00 | 13.35 | 15.46 | 13.53 | 20.50 |
| drove:VBD | 13.53 | 8.93 | 19.96 | 0.50 | 18.77 |
| .:. | 20.50 | 17.99 | 20.00 | 19.58 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -10.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "drive": neg; NullPunisher.other: not; NullPunisher.aux: did
Hand-tuned score: -5.0500
Threshold: -3.3437
Txt: The technician cooled the room.
Hyp: The technician lowered the temperature of the room. (Yes.)
| The DT |
technician NN |
lowered VBD |
the DT |
temperature NN |
the DT |
room NN |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 0.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| technician:NN | 20.00 | 0.00 | 13.95 | 20.00 | 8.15 | 20.00 | 8.71 | 20.00 |
| cooled:VBD | 20.00 | 13.08 | 7.62 | 20.00 | 9.59 | 20.00 | 13.15 | 19.82 |
| the:DT | 0.00 | 20.00 | 20.00 | 0.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| room:NN | 20.00 | 8.71 | 12.93 | 20.00 | 6.97 | 20.00 | 0.00 | 19.15 |
| .:. | 10.00 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 | 19.15 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 1.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -16.5893
Features matched: RootEntailment.poorlyAlignedRoot: "lowered" aligned badly to "cooled"
Hand-tuned score: -1.0000
Threshold: -3.3437
Txt: The technician cooled the room.
Hyp: The technician did not lower the temperature of the room. (Don't know.)
| The DT |
technician NN |
did VBD |
not RB |
lower VB |
the DT |
temperature NN |
the DT |
room NN |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| technician:NN | 20.00 | 0.00 | 12.84 | 14.96 | 13.95 | 20.00 | 8.15 | 20.00 | 8.71 | 20.00 |
| cooled:VBD | 20.00 | 13.08 | 7.53 | 19.96 | 7.62 | 20.00 | 9.59 | 20.00 | 13.15 | 19.82 |
| the:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| room:NN | 20.00 | 8.71 | 13.03 | 14.96 | 11.31 | 20.00 | 6.97 | 20.00 | 0.00 | 19.15 |
| .:. | 10.00 | 20.00 | 17.99 | 20.00 | 19.18 | 10.00 | 20.00 | 10.00 | 19.15 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 1.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -26.5893
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "lower": neg; NullPunisher.aux: did; NullPunisher.other: not; RootEntailment.poorlyAlignedRoot: "lower" aligned badly to "cooled"
Hand-tuned score: -7.0500
Threshold: -3.3437
Txt: The technician raised the temperature of the room.
Hyp: The technician cooled the room. (Don't know.)
| The DT |
technician NN |
cooled VBD |
the DT |
room NN |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| technician:NN | 20.00 | 0.00 | 13.08 | 20.00 | 8.71 | 20.00 |
| raised:VBD | 20.00 | 13.95 | 6.95 | 20.00 | 13.69 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| temperature:NN | 20.00 | 8.15 | 9.59 | 20.00 | 6.97 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| room:NN | 20.00 | 8.71 | 13.15 | 20.00 | 0.00 | 19.15 |
| .:. | 10.00 | 20.00 | 19.82 | 10.00 | 19.15 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -8.9537
Features matched: RootEntailment.poorlyAlignedRoot: "cooled" aligned badly to "raised"; Structure.parentsMismatch: args have different parents, different relations: text "room" <-prep_of-- "temperature" vs. hyp "room" <-dobj-- "cooled", which aligned to text "raised"
Hand-tuned score: -4.0000
Threshold: -3.3437
Txt: The technician raised the temperature of the room.
Hyp: The technician did not cool the room. (Yes.)
| The DT |
technician NN |
did VBD |
not RB |
cool VB |
the DT |
room NN |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| technician:NN | 20.00 | 0.00 | 12.84 | 14.96 | 13.08 | 20.00 | 8.71 | 20.00 |
| raised:VBD | 20.00 | 13.95 | 5.44 | 19.96 | 6.95 | 20.00 | 13.69 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| temperature:NN | 20.00 | 8.15 | 12.53 | 14.96 | 9.59 | 20.00 | 6.97 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| room:NN | 20.00 | 8.71 | 13.03 | 14.96 | 11.00 | 20.00 | 0.00 | 19.15 |
| .:. | 10.00 | 20.00 | 17.99 | 20.00 | 20.00 | 10.00 | 19.15 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (INCORRECT)
Justification:
Alignment score: -18.9537
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "cool": neg; NullPunisher.aux: did; NullPunisher.other: not; RootEntailment.poorlyAlignedRoot: "cool" aligned badly to "raised"; Structure.parentsMismatch: args have different parents, different relations: text "room" <-prep_of-- "temperature" vs. hyp "room" <-dobj-- "cool", which aligned to text "raised"
Hand-tuned score: -10.0500
Threshold: -3.3437
Txt: The president visited Iraq in September.
Hyp: The president has gone to Iraq. (Yes.)
| The DT |
president NN |
has VBZ |
gone VBN |
Iraq NNP |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| president:NN | 20.00 | 0.00 | 14.34 | 12.72 | 9.84 | 20.00 |
| visited:VBD | 20.00 | 13.07 | 10.00 | 7.43 | 15.50 | 19.78 |
| Iraq:NNP | 20.50 | 9.84 | 13.02 | 14.84 | 0.00 | 20.50 |
| September:NNP | 20.50 | 10.50 | 14.19 | 12.73 | 15.00 | 20.50 |
| .:. | 10.00 | 20.00 | 20.00 | 19.35 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -10.4338
Features matched: Adjunct.dropPosCxt: text adjunct "September" of "Iraq" dropped on aligned hyp word "Iraq"; NullPunisher.aux: has; RootEntailment.poorlyAlignedRoot: "gone" aligned badly to "visited"; Structure.relMismatch: text "Iraq" is dobj of "visited" while hyp "Iraq" is prep_to of "gone" which aligned to text "visited"
Hand-tuned score: -1.5500
Threshold: -3.3437
Txt: The president visited Iraq in September.
Hyp: The president has not gone to Iraq. (Don't know.)
| The DT |
president NN |
has VBZ |
not RB |
gone VBN |
Iraq NNP |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| president:NN | 20.00 | 0.00 | 14.34 | 14.96 | 12.72 | 9.84 | 20.00 |
| visited:VBD | 20.00 | 13.07 | 10.00 | 19.96 | 7.43 | 15.50 | 19.78 |
| Iraq:NNP | 20.50 | 9.84 | 13.02 | 15.46 | 14.84 | 0.00 | 20.50 |
| September:NNP | 20.50 | 10.50 | 14.19 | 15.46 | 12.73 | 15.00 | 20.50 |
| .:. | 10.00 | 20.00 | 20.00 | 20.00 | 19.35 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -19.4338
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "gone": neg; NullPunisher.other: not; NullPunisher.aux: has; RootEntailment.poorlyAlignedRoot: "gone" aligned badly to "visited"; Structure.relMismatch: text "Iraq" is dobj of "visited" while hyp "Iraq" is prep_to of "gone" which aligned to text "visited"
Hand-tuned score: -8.0500
Threshold: -3.3437
Txt: Jones has visited Iraq.
Hyp: Jones visited Iraq in September. (Don't know.)
| Jones NNP |
visited VBD |
Iraq NNP |
September NNP |
. . |
|
| Jones:NNP | 0.00 | 15.50 | 14.34 | 15.00 | 20.50 |
| has:VBZ | 14.84 | 10.00 | 13.02 | 14.19 | 20.00 |
| visited:VBN | 15.50 | 0.00 | 15.50 | 15.50 | 19.78 |
| Iraq:NNP | 14.34 | 15.50 | 0.00 | 15.00 | 20.50 |
| .:. | 20.50 | 19.78 | 20.50 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -10.0000
Features matched: Adjunct.addPosCxt: hyp added September[September-NNP]; Date.hypDateIns: hypothesis date insertion: September; NullPunisher.other: September
Hand-tuned score: -3.0000
Threshold: -3.3437
Txt: Jones has visited Iraq.
Hyp: Jones did not visit Iraq in September. (Don't know.)
| Jones NNP |
did VBD |
not RB |
visit VB |
Iraq NNP |
September NNP |
. . |
|
| Jones:NNP | 0.00 | 15.50 | 15.46 | 15.50 | 14.34 | 15.00 | 20.50 |
| has:VBZ | 14.84 | 7.53 | 19.96 | 10.00 | 13.02 | 14.19 | 20.00 |
| visited:VBN | 15.50 | 7.62 | 19.96 | 0.31 | 15.50 | 15.50 | 19.78 |
| Iraq:NNP | 14.34 | 15.50 | 15.46 | 15.50 | 0.00 | 15.00 | 20.50 |
| .:. | 20.50 | 17.99 | 20.00 | 20.00 | 20.50 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -20.3094
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Date.hypDateIns: hypothesis date insertion: September; Polarity.hypNegMarker: "visit": neg; NullPunisher.other: not; NullPunisher.aux: did; NullPunisher.other: September
Hand-tuned score: -8.0500
Threshold: -3.3437
Txt: Jones arrived in Paris in September last year.
Hyp: Jones arrived in Paris last year. (Yes.)
| Jones NNP |
arrived VBD |
Paris NNP |
last JJ |
year NN |
. . |
|
| Jones:NNP | 0.00 | 15.00 | 9.84 | 11.45 | 10.50 | 20.00 |
| arrived:VBD | 15.00 | 0.00 | 15.50 | 12.50 | 15.28 | 20.00 |
| Paris:NNP | 9.84 | 15.50 | 0.00 | 16.34 | 13.13 | 20.50 |
| September:NNP | 10.50 | 15.50 | 15.00 | 9.84 | 7.23 | 20.50 |
| last:JJ | 11.45 | 12.50 | 16.34 | 0.00 | 9.84 | 20.50 |
| year:NN | 10.50 | 15.28 | 13.13 | 9.84 | 0.00 | 17.60 |
| .:. | 20.00 | 20.00 | 20.50 | 20.50 | 17.60 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 9.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "September" of "Paris" dropped on aligned hyp word "Paris"; Date.matchDatesByGraph: hyp/txt matching, by graph: year and children; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 3.0000
Threshold: -3.3437
Txt: Jones arrived in Paris in September last year.
Hyp: Jones did not arrive in Paris last year. (Don't know.)
| Jones NNP |
did VBD |
not RB |
arrive VB |
Paris NNP |
last JJ |
year NN |
. . |
|
| Jones:NNP | 0.50 | 15.00 | 14.96 | 15.00 | 9.84 | 11.45 | 10.50 | 20.00 |
| arrived:VBD | 15.50 | 7.47 | 19.96 | 0.50 | 15.50 | 12.50 | 15.28 | 20.00 |
| Paris:NNP | 14.34 | 15.50 | 15.46 | 15.50 | 0.00 | 16.34 | 13.13 | 20.50 |
| September:NNP | 15.00 | 14.19 | 15.46 | 15.50 | 15.00 | 9.84 | 7.23 | 20.50 |
| last:JJ | 15.95 | 10.19 | 12.46 | 12.50 | 16.34 | 0.00 | 9.84 | 20.50 |
| year:NN | 15.00 | 14.19 | 15.46 | 15.50 | 13.13 | 9.84 | 0.00 | 17.60 |
| .:. | 20.50 | 17.99 | 20.00 | 20.00 | 20.50 | 20.50 | 17.60 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 9.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Date.matchDatesByGraph: hyp/txt matching, by graph: year and children; Polarity.hypNegMarker: "arrive": neg; NullPunisher.aux: did; NullPunisher.other: not
Hand-tuned score: -4.0500
Threshold: -3.3437
Txt: Jones arrived in Paris in September last year.
Hyp: Jones arrived in Paris in September. (Don't know.)
| Jones NNP |
arrived VBD |
Paris NNP |
September NNP |
. . |
|
| Jones:NNP | 0.00 | 15.00 | 9.84 | 10.50 | 20.00 |
| arrived:VBD | 15.00 | 0.00 | 15.50 | 15.50 | 20.00 |
| Paris:NNP | 9.84 | 15.50 | 0.00 | 15.00 | 20.50 |
| September:NNP | 10.50 | 15.50 | 15.00 | 0.00 | 20.50 |
| last:JJ | 11.45 | 12.50 | 16.34 | 9.84 | 20.50 |
| year:NN | 10.50 | 15.28 | 13.13 | 7.23 | 17.60 |
| .:. | 20.00 | 20.00 | 20.50 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -2.0000
Features matched: Adjunct.dropPosCxt: text adjunct "year" of "arrived" dropped on aligned hyp word "arrived"; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 09/01/1000; Structure.argsMismatch: args have different parents but same relations: text "September" <-prep_in-- "Paris vs. hyp "September" <-prep_in-- "arrived", which aligned to text "arrived"
Hand-tuned score: -0.5000
Threshold: -3.3437
Txt: Jones arrived in Paris in September last year.
Hyp: Jones did not arrive in Paris in September. (Don't know.)
| Jones NNP |
did VBD |
not RB |
arrive VB |
Paris NNP |
September NNP |
. . |
|
| Jones:NNP | 0.50 | 15.00 | 14.96 | 15.00 | 9.84 | 10.50 | 20.00 |
| arrived:VBD | 15.50 | 7.47 | 19.96 | 0.50 | 15.50 | 15.50 | 20.00 |
| Paris:NNP | 14.34 | 15.50 | 15.46 | 15.50 | 0.00 | 15.00 | 20.50 |
| September:NNP | 15.00 | 14.19 | 15.46 | 15.50 | 15.00 | 0.00 | 20.50 |
| last:JJ | 15.95 | 10.19 | 12.46 | 12.50 | 16.34 | 9.84 | 20.50 |
| year:NN | 15.00 | 14.19 | 15.46 | 15.50 | 13.13 | 7.23 | 17.60 |
| .:. | 20.50 | 17.99 | 20.00 | 20.00 | 20.50 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 09/01/1000; Polarity.hypNegMarker: "arrive": neg; NullPunisher.aux: did; NullPunisher.other: not
Hand-tuned score: -4.0500
Threshold: -3.3437
Txt: Jones arrived on a Sunday in September.
Hyp: Jones arrived on a Sunday. (Yes.)
| Jones NNP |
arrived VBD |
a DT |
Sunday NNP |
. . |
|
| Jones:NNP | 0.00 | 15.00 | 20.00 | 10.50 | 20.00 |
| arrived:VBD | 15.00 | 0.00 | 20.00 | 15.50 | 20.00 |
| a:DT | 20.00 | 20.00 | 0.00 | 20.50 | 10.00 |
| Sunday:NNP | 10.50 | 15.50 | 20.50 | 0.00 | 20.50 |
| September:NNP | 10.50 | 15.50 | 20.50 | 7.23 | 20.50 |
| .:. | 20.00 | 20.00 | 10.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "September" of "arrived" dropped on aligned hyp word "arrived"; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Quant.contract: [a,a]; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 4.0000
Threshold: -3.3437
Txt: Jones arrived on a Sunday in September.
Hyp: Jones did not arrive on a Sunday. (Don't know.)
| Jones NNP |
did VBD |
not RB |
arrive VB |
a DT |
Sunday NNP |
. . |
|
| Jones:NNP | 0.50 | 15.00 | 14.96 | 15.00 | 20.00 | 10.50 | 20.00 |
| arrived:VBD | 15.50 | 7.47 | 19.96 | 0.50 | 20.00 | 15.50 | 20.00 |
| a:DT | 20.50 | 20.00 | 20.00 | 20.00 | 0.00 | 20.50 | 10.00 |
| Sunday:NNP | 15.00 | 10.20 | 15.46 | 15.50 | 20.50 | 0.00 | 20.50 |
| September:NNP | 15.00 | 14.19 | 15.46 | 15.50 | 20.50 | 7.23 | 20.50 |
| .:. | 20.50 | 17.99 | 20.00 | 20.00 | 10.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Polarity.hypNegMarker: "arrive": neg; NullPunisher.other: not; NullPunisher.aux: did; Quant.contract: [a,a]
Hand-tuned score: -3.0500
Threshold: -3.3437
Txt: Jones arrived on a Sunday in September.
Hyp: Jones arrived in September. (Yes.)
| Jones NNP |
arrived VBD |
September NNP |
. . |
|
| Jones:NNP | 0.00 | 15.00 | 10.50 | 20.00 |
| arrived:VBD | 15.00 | 0.00 | 15.50 | 20.00 |
| a:DT | 20.00 | 20.00 | 20.50 | 10.00 |
| Sunday:NNP | 10.50 | 15.50 | 7.23 | 20.50 |
| September:NNP | 10.50 | 15.50 | 0.00 | 20.50 |
| .:. | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "Sunday" of "arrived" dropped on aligned hyp word "arrived"; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 09/01/1000; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 3.0000
Threshold: -3.3437
Txt: Jones arrived on a Sunday in September.
Hyp: Jones did not arrive in September. (Don't know.)
| Jones NNP |
did VBD |
not RB |
arrive VB |
September NNP |
. . |
|
| Jones:NNP | 0.50 | 15.00 | 14.96 | 15.00 | 10.50 | 20.00 |
| arrived:VBD | 15.50 | 7.47 | 19.96 | 0.50 | 15.50 | 20.00 |
| a:DT | 20.50 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| Sunday:NNP | 15.00 | 10.20 | 15.46 | 15.50 | 7.23 | 20.50 |
| September:NNP | 15.00 | 14.19 | 15.46 | 15.50 | 0.00 | 20.50 |
| .:. | 20.50 | 17.99 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 09/01/1000; Polarity.hypNegMarker: "arrive": neg; NullPunisher.aux: did; NullPunisher.other: not
Hand-tuned score: -4.0500
Threshold: -3.3437
Txt: The president left after the diplomat arrived.
Hyp: The diplomat arrived before the president left. (Yes.)
| The DT |
diplomat NN |
arrived VBD |
before IN |
the DT |
president NN |
left VBD |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 10.00 |
| president:NN | 20.00 | 3.94 | 13.72 | 20.00 | 20.00 | 0.00 | 13.35 | 20.00 |
| left:VBD | 20.00 | 14.34 | 6.03 | 20.00 | 20.00 | 13.35 | 0.00 | 19.46 |
| after:IN | 20.00 | 20.00 | 20.00 | 0.00 | 17.88 | 20.00 | 20.00 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 10.00 |
| diplomat:NN | 20.00 | 0.00 | 13.41 | 20.00 | 20.00 | 3.94 | 14.34 | 19.87 |
| arrived:VBD | 20.00 | 13.41 | 0.00 | 20.00 | 20.00 | 13.72 | 6.03 | 20.00 |
| .:. | 10.00 | 19.87 | 20.00 | 20.00 | 10.00 | 20.00 | 19.46 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -6.0000
Features matched: Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "left vs. hyp "." <-punct-- "arrived", which aligned to text "arrived"
Hand-tuned score: -2.0000
Threshold: -3.3437
Txt: The president left after the diplomat arrived.
Hyp: The diplomat did not arrive before the president left. (Don't know.)
| The DT |
diplomat NN |
did VBD |
not RB |
arrive VB |
before IN |
the DT |
president NN |
left VBD |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 10.00 |
| president:NN | 20.00 | 3.94 | 15.00 | 14.96 | 14.47 | 20.00 | 20.00 | 0.00 | 13.35 | 20.00 |
| left:VBD | 20.00 | 14.34 | 7.37 | 19.96 | 9.62 | 20.00 | 20.00 | 13.35 | 0.00 | 19.46 |
| after:IN | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 17.88 | 20.00 | 20.00 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 10.00 |
| diplomat:NN | 20.00 | 0.00 | 15.00 | 14.96 | 13.56 | 20.00 | 20.00 | 3.94 | 14.34 | 19.87 |
| arrived:VBD | 20.00 | 13.41 | 7.47 | 19.96 | 0.50 | 20.00 | 20.00 | 13.72 | 6.03 | 20.00 |
| .:. | 10.00 | 19.87 | 17.99 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 19.46 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -16.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "arrive": neg; NullPunisher.other: not; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "left vs. hyp "." <-punct-- "arrive", which aligned to text "arrived"
Hand-tuned score: -8.0500
Threshold: -3.3437
Txt: No US congressman has visited Iraq since the war ended.
Hyp: Jones, a US Congressman, has visited Iraq after the war ended. (Don't know.)
| Jones NNP |
, , |
a DT |
US_Congressman NNP |
, , |
has VBZ |
visited VBN |
Iraq NNP |
after IN |
the DT |
war NN |
ended VBD |
. . |
|
| No:DT | 20.50 | 10.00 | 10.00 | 20.50 | 10.00 | 20.00 | 20.00 | 20.50 | 20.00 | 10.00 | 20.00 | 20.00 | 10.00 |
| US:NNP | 14.34 | 20.50 | 20.50 | 5.00 | 20.50 | 13.02 | 15.50 | 5.34 | 20.50 | 20.50 | 10.50 | 13.02 | 20.50 |
| congressman:NN | 10.19 | 19.74 | 20.50 | 5.00 | 19.74 | 14.84 | 15.32 | 14.34 | 20.50 | 20.50 | 9.08 | 13.55 | 20.50 |
| has:VBZ | 14.84 | 20.00 | 20.00 | 14.26 | 20.00 | 0.00 | 10.00 | 13.02 | 20.00 | 20.00 | 15.00 | 7.52 | 20.00 |
| visited:VBN | 15.50 | 19.44 | 20.00 | 15.46 | 19.44 | 10.00 | 0.00 | 15.50 | 20.00 | 20.00 | 12.10 | 7.62 | 19.78 |
| Iraq:NNP | 14.34 | 20.50 | 20.50 | 12.67 | 20.50 | 13.02 | 15.50 | 0.00 | 20.50 | 20.50 | 10.50 | 13.02 | 20.50 |
| since:IN | 20.50 | 20.00 | 20.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.50 | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| the:DT | 20.50 | 10.00 | 10.00 | 20.50 | 10.00 | 20.00 | 20.00 | 20.50 | 17.88 | 0.00 | 20.00 | 20.00 | 10.00 |
| war:NN | 10.50 | 20.00 | 20.00 | 10.46 | 20.00 | 15.00 | 12.10 | 10.50 | 20.00 | 20.00 | 0.00 | 12.44 | 19.45 |
| ended:VBD | 9.59 | 18.30 | 20.00 | 14.26 | 18.30 | 7.52 | 7.62 | 13.02 | 20.00 | 20.00 | 12.44 | 0.00 | 20.00 |
| .:. | 20.50 | 5.73 | 10.00 | 20.50 | 5.73 | 20.00 | 19.78 | 20.50 | 20.00 | 10.00 | 19.45 | 20.00 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -33.4591
Features matched: Adjunct.dropPosCxt: text adjunct "US" of "congressman" dropped on aligned hyp word "US_Congressman"; NullPunisher.other: Jones; NullPunisher.article: a; Quant.oneNo: [no,a[
Hand-tuned score: -5.6000
Threshold: -3.3437
Txt: No US congressman has visited Iraq since the war ended.
Hyp: Jones, a US Congressman, has not visited Iraq after the war ended. (Yes.)
| Jones NNP |
, , |
a DT |
US_Congressman NNP |
, , |
has VBZ |
not RB |
visited VBN |
Iraq NNP |
after IN |
the DT |
war NN |
ended VBD |
. . |
|
| No:DT | 20.50 | 10.00 | 10.00 | 20.50 | 10.00 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 | 10.00 | 20.00 | 20.00 | 10.00 |
| US:NNP | 14.34 | 20.50 | 20.50 | 5.00 | 20.50 | 13.02 | 15.46 | 15.50 | 5.34 | 20.50 | 20.50 | 10.50 | 13.02 | 20.50 |
| congressman:NN | 10.19 | 19.74 | 20.50 | 5.00 | 19.74 | 14.84 | 15.46 | 15.32 | 14.34 | 20.50 | 20.50 | 9.08 | 13.55 | 20.50 |
| has:VBZ | 14.84 | 20.00 | 20.00 | 14.26 | 20.00 | 0.00 | 19.96 | 10.00 | 13.02 | 20.00 | 20.00 | 15.00 | 7.52 | 20.00 |
| visited:VBN | 15.50 | 19.44 | 20.00 | 15.46 | 19.44 | 10.00 | 19.96 | 0.00 | 15.50 | 20.00 | 20.00 | 12.10 | 7.62 | 19.78 |
| Iraq:NNP | 14.34 | 20.50 | 20.50 | 12.67 | 20.50 | 13.02 | 15.46 | 15.50 | 0.00 | 20.50 | 20.50 | 10.50 | 13.02 | 20.50 |
| since:IN | 20.50 | 20.00 | 20.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| the:DT | 20.50 | 10.00 | 10.00 | 20.50 | 10.00 | 20.00 | 20.00 | 20.00 | 20.50 | 17.88 | 0.00 | 20.00 | 20.00 | 10.00 |
| war:NN | 10.50 | 20.00 | 20.00 | 10.46 | 20.00 | 15.00 | 14.96 | 12.10 | 10.50 | 20.00 | 20.00 | 0.00 | 12.44 | 19.45 |
| ended:VBD | 9.59 | 18.30 | 20.00 | 14.26 | 18.30 | 7.52 | 19.96 | 7.62 | 13.02 | 20.00 | 20.00 | 12.44 | 0.00 | 20.00 |
| .:. | 20.50 | 5.73 | 10.00 | 20.50 | 5.73 | 20.00 | 20.00 | 19.78 | 20.50 | 20.00 | 10.00 | 19.45 | 20.00 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (INCORRECT)
Justification:
Alignment score: -42.4591
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "visited": neg; NullPunisher.other: not; NullPunisher.other: Jones; NullPunisher.article: a; Quant.oneNo: [no,a[
Hand-tuned score: -12.1000
Threshold: -3.3437
Txt: No US congressman has visited Iraq since the war.
Hyp: Jones, a US Congressman, visited Iraq before the war. (Don't know.)
| Jones NNP |
, , |
a DT |
US_Congressman NNP |
, , |
visited VBD |
Iraq NNP |
the DT |
war NN |
. . |
|
| No:DT | 20.50 | 10.00 | 10.00 | 20.50 | 10.00 | 20.00 | 20.50 | 10.00 | 20.00 | 10.00 |
| US:NNP | 14.34 | 20.50 | 20.50 | 5.00 | 20.50 | 15.50 | 5.34 | 20.50 | 10.50 | 20.50 |
| congressman:NN | 10.19 | 19.74 | 20.50 | 5.00 | 19.74 | 15.32 | 14.34 | 20.50 | 9.08 | 20.50 |
| has:VBZ | 14.84 | 20.00 | 20.00 | 14.26 | 20.00 | 10.00 | 13.02 | 20.00 | 15.00 | 20.00 |
| visited:VBN | 15.50 | 19.44 | 20.00 | 15.46 | 19.44 | 0.00 | 15.50 | 20.00 | 12.10 | 19.78 |
| Iraq:NNP | 14.34 | 20.50 | 20.50 | 12.67 | 20.50 | 15.50 | 0.00 | 20.50 | 10.50 | 20.50 |
| the:DT | 20.50 | 10.00 | 10.00 | 20.50 | 10.00 | 20.00 | 20.50 | 0.00 | 20.00 | 10.00 |
| war:NN | 10.50 | 20.00 | 20.00 | 10.46 | 20.00 | 12.10 | 10.50 | 20.00 | 0.00 | 19.45 |
| .:. | 20.50 | 5.73 | 10.00 | 20.50 | 5.73 | 19.78 | 20.50 | 10.00 | 19.45 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -35.4591
Features matched: Adjunct.dropPosCxt: text adjunct "US" of "congressman" dropped on aligned hyp word "US_Congressman"; NullPunisher.article: a; NullPunisher.other: Jones; Quant.oneNo: [no,a[; Structure.parentsMismatch: args have different parents, different relations: text "war" <-prep_since-- "Iraq" vs. hyp "war" <-prep_before-- "visited", which aligned to text "visited"
Hand-tuned score: -8.6000
Threshold: -3.3437
Txt: No US congressman has visited Iraq since the war.
Hyp: Jones, a US Congressman, did not visit Iraq before the war. (Don't know.)
| Jones NNP |
, , |
a DT |
US_Congressman NNP |
, , |
did VBD |
not RB |
visit VB |
Iraq NNP |
the DT |
war NN |
. . |
|
| No:DT | 20.50 | 10.00 | 10.00 | 20.50 | 10.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 | 20.00 | 10.00 |
| US:NNP | 14.34 | 20.50 | 20.50 | 5.00 | 20.50 | 15.50 | 15.46 | 15.50 | 5.34 | 20.50 | 10.50 | 20.50 |
| congressman:NN | 10.19 | 19.74 | 20.50 | 5.00 | 19.74 | 14.41 | 15.46 | 15.50 | 14.34 | 20.50 | 9.08 | 20.50 |
| has:VBZ | 14.84 | 20.00 | 20.00 | 14.26 | 20.00 | 7.53 | 19.96 | 10.00 | 13.02 | 20.00 | 15.00 | 20.00 |
| visited:VBN | 15.50 | 19.44 | 20.00 | 15.46 | 19.44 | 7.62 | 19.96 | 0.31 | 15.50 | 20.00 | 12.10 | 19.78 |
| Iraq:NNP | 14.34 | 20.50 | 20.50 | 12.67 | 20.50 | 15.50 | 15.46 | 15.50 | 0.00 | 20.50 | 10.50 | 20.50 |
| the:DT | 20.50 | 10.00 | 10.00 | 20.50 | 10.00 | 20.00 | 20.00 | 20.00 | 20.50 | 0.00 | 20.00 | 10.00 |
| war:NN | 10.50 | 20.00 | 20.00 | 10.46 | 20.00 | 12.69 | 14.96 | 12.10 | 10.50 | 20.00 | 0.00 | 19.45 |
| .:. | 20.50 | 5.73 | 10.00 | 20.50 | 5.73 | 17.99 | 20.00 | 20.00 | 20.50 | 10.00 | 19.45 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -45.7685
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "visit": neg; NullPunisher.article: a; NullPunisher.other: not; NullPunisher.aux: did; NullPunisher.other: Jones; Quant.oneNo: [no,a[; Structure.parentsMismatch: args have different parents, different relations: text "war" <-prep_since-- "Iraq" vs. hyp "war" <-prep_before-- "visit", which aligned to text "visited"
Hand-tuned score: -15.1500
Threshold: -3.3437
Txt: No US congressman visited Iraq until the war.
Hyp: Some US congressman visited Iraq before the war. (Don't know.)
| Some DT |
US NNP |
congressman NN |
visited VBD |
Iraq NNP |
the DT |
war NN |
. . |
|
| No:DT | 10.00 | 20.50 | 20.00 | 20.00 | 20.50 | 10.00 | 20.00 | 10.00 |
| US:NNP | 20.50 | 0.00 | 9.84 | 15.50 | 5.34 | 20.50 | 10.50 | 20.50 |
| congressman:NN | 20.00 | 9.84 | 0.00 | 14.82 | 9.84 | 20.00 | 8.58 | 20.00 |
| visited:VBD | 20.00 | 15.50 | 14.82 | 0.00 | 15.50 | 20.00 | 12.10 | 19.78 |
| Iraq:NNP | 20.50 | 5.34 | 9.84 | 15.50 | 0.00 | 20.50 | 10.50 | 20.50 |
| the:DT | 10.00 | 20.50 | 20.00 | 20.00 | 20.50 | 0.00 | 20.00 | 10.00 |
| war:NN | 20.00 | 10.50 | 8.58 | 12.10 | 10.50 | 20.00 | 0.00 | 19.45 |
| .:. | 10.00 | 20.50 | 20.00 | 19.78 | 20.50 | 10.00 | 19.45 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Antonym.samePol: matching polarity with antonyms: Some & No; Quant.oneNo: [no,some[; Structure.relMismatch: text "war" is prep_until of "visited" while hyp "war" is prep_before of "visited" which aligned to text "visited"
Hand-tuned score: -14.0000
Threshold: -3.3437
Txt: No US congressman visited Iraq until the war.
Hyp: No US congressman visited Iraq before the war. (Yes.)
| No DT |
US NNP |
congressman NN |
visited VBD |
Iraq NNP |
the DT |
war NN |
. . |
|
| No:DT | 0.00 | 20.50 | 20.00 | 20.00 | 20.50 | 10.00 | 20.00 | 10.00 |
| US:NNP | 20.50 | 0.00 | 9.84 | 15.50 | 5.34 | 20.50 | 10.50 | 20.50 |
| congressman:NN | 20.00 | 9.84 | 0.00 | 14.82 | 9.84 | 20.00 | 8.58 | 20.00 |
| visited:VBD | 20.00 | 15.50 | 14.82 | 0.00 | 15.50 | 20.00 | 12.10 | 19.78 |
| Iraq:NNP | 20.50 | 5.34 | 9.84 | 15.50 | 0.00 | 20.50 | 10.50 | 20.50 |
| the:DT | 10.00 | 20.50 | 20.00 | 20.00 | 20.50 | 0.00 | 20.00 | 10.00 |
| war:NN | 20.00 | 10.50 | 8.58 | 12.10 | 10.50 | 20.00 | 0.00 | 19.45 |
| .:. | 10.00 | 20.50 | 20.00 | 19.78 | 20.50 | 10.00 | 19.45 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -1.0000
Features matched: Quant.bothNo: [no,no]; Structure.relMismatch: text "war" is prep_until of "visited" while hyp "war" is prep_before of "visited" which aligned to text "visited"
Hand-tuned score: 1.0000
Threshold: -3.3437
Txt: Some students arrived at the school on Sunday.
Hyp: There were some students at the school on Sunday. (Yes.)
| There EX |
were VBD |
some DT |
students NNS |
the DT |
school NN |
Sunday NNP |
. . |
|
| Some:DT | 10.00 | 20.00 | 0.00 | 20.00 | 10.00 | 20.00 | 20.50 | 10.00 |
| students:NNS | 20.00 | 14.34 | 20.00 | 0.00 | 20.00 | 0.75 | 10.50 | 20.00 |
| arrived:VBD | 20.00 | 10.00 | 20.00 | 14.29 | 20.00 | 13.50 | 15.50 | 20.00 |
| the:DT | 10.00 | 20.00 | 10.00 | 20.00 | 0.00 | 20.00 | 20.50 | 10.00 |
| school:NN | 20.00 | 14.34 | 20.00 | 0.75 | 20.00 | 0.00 | 7.73 | 19.99 |
| Sunday:NNP | 20.50 | 15.50 | 20.50 | 10.50 | 20.50 | 7.73 | 0.00 | 20.50 |
| .:. | 10.00 | 20.00 | 10.00 | 20.00 | 10.00 | 19.99 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -15.0000
Features matched: Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; NullPunisher.functionWord: There; Quant.contract: [some,some]; RootEntailment.poorlyAlignedRoot: "were" aligned badly to "arrived"
Hand-tuned score: 0.9000
Threshold: -3.3437
Txt: Some students arrived at the school on Sunday.
Hyp: There were no students at the school on Sunday. (Don't know.)
| There EX |
were VBD |
no DT |
students NNS |
the DT |
school NN |
Sunday NNP |
. . |
|
| Some:DT | 10.00 | 20.00 | 10.00 | 20.00 | 10.00 | 20.00 | 20.50 | 10.00 |
| students:NNS | 20.00 | 14.34 | 20.00 | 0.00 | 20.00 | 0.75 | 10.50 | 20.00 |
| arrived:VBD | 20.00 | 10.00 | 20.00 | 14.29 | 20.00 | 13.50 | 15.50 | 20.00 |
| the:DT | 10.00 | 20.00 | 10.00 | 20.00 | 0.00 | 20.00 | 20.50 | 10.00 |
| school:NN | 20.00 | 14.34 | 20.00 | 0.75 | 20.00 | 0.00 | 7.73 | 19.99 |
| Sunday:NNP | 20.50 | 15.50 | 20.50 | 10.50 | 20.50 | 7.73 | 0.00 | 20.50 |
| .:. | 10.00 | 20.00 | 10.00 | 20.00 | 10.00 | 19.99 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -25.0000
Features matched: Antonym.samePol: matching polarity with antonyms: no & Some; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Polarity.hypNegMarker: "no": no; NullPunisher.functionWord: There; Quant.oneNo: [some,no[; RootEntailment.poorlyAlignedRoot: "were" aligned badly to "arrived"
Hand-tuned score: -14.1000
Threshold: -3.3437
Txt: No students arrived at the school on Sunday.
Hyp: There were some students at the school on Sunday. (Don't know.)
| There EX |
were VBD |
some DT |
students NNS |
the DT |
school NN |
Sunday NNP |
. . |
|
| No:DT | 10.00 | 20.00 | 10.00 | 20.00 | 10.00 | 20.00 | 20.50 | 10.00 |
| students:NNS | 20.00 | 14.34 | 20.00 | 0.00 | 20.00 | 0.75 | 10.50 | 20.00 |
| arrived:VBD | 20.00 | 10.00 | 20.00 | 14.29 | 20.00 | 13.50 | 15.50 | 20.00 |
| the:DT | 10.00 | 20.00 | 10.00 | 20.00 | 0.00 | 20.00 | 20.50 | 10.00 |
| school:NN | 20.00 | 14.34 | 20.00 | 0.75 | 20.00 | 0.00 | 7.73 | 19.99 |
| Sunday:NNP | 20.50 | 15.50 | 20.50 | 10.50 | 20.50 | 7.73 | 0.00 | 20.50 |
| .:. | 10.00 | 20.00 | 10.00 | 20.00 | 10.00 | 19.99 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -25.0000
Features matched: Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; NullPunisher.functionWord: There; NullPunisher.other: some; Quant.oneNo: [no,some[; RootEntailment.poorlyAlignedRoot: "were" aligned badly to "arrived"
Hand-tuned score: -7.1000
Threshold: -3.3437
Txt: No students arrived at the school on Sunday.
Hyp: There were no students at the school on Sunday. (Don't know.)
| There EX |
were VBD |
no DT |
students NNS |
the DT |
school NN |
Sunday NNP |
. . |
|
| No:DT | 10.00 | 20.00 | 0.00 | 20.00 | 10.00 | 20.00 | 20.50 | 10.00 |
| students:NNS | 20.00 | 14.34 | 20.00 | 0.00 | 20.00 | 0.75 | 10.50 | 20.00 |
| arrived:VBD | 20.00 | 10.00 | 20.00 | 14.29 | 20.00 | 13.50 | 15.50 | 20.00 |
| the:DT | 10.00 | 20.00 | 10.00 | 20.00 | 0.00 | 20.00 | 20.50 | 10.00 |
| school:NN | 20.00 | 14.34 | 20.00 | 0.75 | 20.00 | 0.00 | 7.73 | 19.99 |
| Sunday:NNP | 20.50 | 15.50 | 20.50 | 10.50 | 20.50 | 7.73 | 0.00 | 20.50 |
| .:. | 10.00 | 20.00 | 10.00 | 20.00 | 10.00 | 19.99 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -15.0000
Features matched: Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Polarity.hypNegMarker: "no": no; NullPunisher.functionWord: There; Quant.bothNo: [no,no]; RootEntailment.poorlyAlignedRoot: "were" aligned badly to "arrived"
Hand-tuned score: 0.9000
Threshold: -3.3437
Txt: There were no students at the school on Sunday.
Hyp: Some students arrived at the school on Sunday. (Don't know.)
| Some DT |
students NNS |
arrived VBD |
the DT |
school NN |
Sunday NNP |
. . |
|
| There:EX | 10.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.50 | 10.00 |
| were:VBD | 20.00 | 14.34 | 10.00 | 20.00 | 14.34 | 15.50 | 20.00 |
| no:DT | 10.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.50 | 10.00 |
| students:NNS | 20.00 | 0.00 | 14.29 | 20.00 | 0.75 | 10.50 | 20.00 |
| the:DT | 10.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.50 | 10.00 |
| school:NN | 20.00 | 0.75 | 13.50 | 20.00 | 0.00 | 7.73 | 19.99 |
| Sunday:NNP | 20.50 | 10.50 | 15.50 | 20.50 | 7.73 | 0.00 | 20.50 |
| .:. | 10.00 | 20.00 | 20.00 | 10.00 | 19.99 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -24.0000
Features matched: Antonym.samePol: matching polarity with antonyms: Some & no; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Polarity.txtNegMarker: "no": no; Quant.oneNo: [no,some[; RootEntailment.poorlyAlignedRoot: "arrived" aligned badly to "were"; Structure.argsMismatch: args have different parents but same relations: text "school" <-prep_at-- "students vs. hyp "school" <-prep_at-- "arrived", which aligned to text "were" args have different parents but same relations: text "Sunday" <-prep_on-- "school vs. hyp "Sunday" <-prep_on-- "arrived", which aligned to text "were"
Hand-tuned score: -17.0000
Threshold: -3.3437
Txt: There were no students at the school on Sunday.
Hyp: No students arrived at the school on Sunday. (Yes.)
| No DT |
students NNS |
arrived VBD |
the DT |
school NN |
Sunday NNP |
. . |
|
| There:EX | 10.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.50 | 10.00 |
| were:VBD | 20.00 | 14.34 | 10.00 | 20.00 | 14.34 | 15.50 | 20.00 |
| no:DT | 0.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.50 | 10.00 |
| students:NNS | 20.00 | 0.00 | 14.29 | 20.00 | 0.75 | 10.50 | 20.00 |
| the:DT | 10.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.50 | 10.00 |
| school:NN | 20.00 | 0.75 | 13.50 | 20.00 | 0.00 | 7.73 | 19.99 |
| Sunday:NNP | 20.50 | 10.50 | 15.50 | 20.50 | 7.73 | 0.00 | 20.50 |
| .:. | 10.00 | 20.00 | 20.00 | 10.00 | 19.99 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -14.0000
Features matched: Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Polarity.txtNegMarker: "no": no; Quant.bothNo: [no,no]; RootEntailment.poorlyAlignedRoot: "arrived" aligned badly to "were"; Structure.argsMismatch: args have different parents but same relations: text "school" <-prep_at-- "students vs. hyp "school" <-prep_at-- "arrived", which aligned to text "were" args have different parents but same relations: text "Sunday" <-prep_on-- "school vs. hyp "Sunday" <-prep_on-- "arrived", which aligned to text "were"
Hand-tuned score: -2.0000
Threshold: -3.3437
Txt: The diplomat left Baghdad last week.
Hyp: The diplomat has been to Baghdad. (Yes.)
| The DT |
diplomat NN |
has VBZ |
been VBN |
Baghdad NNP |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| diplomat:NN | 20.00 | 0.00 | 14.34 | 14.34 | 9.84 | 19.87 |
| left:VBD | 20.00 | 14.34 | 7.52 | 9.34 | 11.79 | 19.46 |
| Baghdad:NNP | 20.50 | 9.84 | 13.02 | 14.84 | 0.00 | 20.50 |
| last:JJ | 20.50 | 11.45 | 11.19 | 10.83 | 16.34 | 20.50 |
| week:NN | 20.50 | 10.50 | 14.19 | 15.50 | 15.00 | 17.43 |
| .:. | 10.00 | 19.87 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -12.3420
Features matched: Adjunct.dropPosCxt: text adjunct "week" of "left" dropped on aligned hyp word "been"; NullPunisher.aux: has; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "left"; Structure.relMismatch: text "Baghdad" is dobj of "left" while hyp "Baghdad" is prep_to of "been" which aligned to text "left"
Hand-tuned score: -1.5500
Threshold: -3.3437
Txt: The diplomat left Baghdad last week.
Hyp: The diplomat has not been to Baghdad. (Don't know.)
| The DT |
diplomat NN |
has VBZ |
not RB |
been VBN |
Baghdad NNP |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| diplomat:NN | 20.00 | 0.00 | 14.34 | 14.96 | 14.34 | 9.84 | 19.87 |
| left:VBD | 20.00 | 14.34 | 7.52 | 19.96 | 9.34 | 11.79 | 19.46 |
| Baghdad:NNP | 20.50 | 9.84 | 13.02 | 15.46 | 14.84 | 0.00 | 20.50 |
| last:JJ | 20.50 | 11.45 | 11.19 | 12.46 | 10.83 | 16.34 | 20.50 |
| week:NN | 20.50 | 10.50 | 14.19 | 15.46 | 15.50 | 15.00 | 17.43 |
| .:. | 10.00 | 19.87 | 20.00 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -21.3420
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "been": neg; NullPunisher.other: not; NullPunisher.aux: has; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "left"; Structure.relMismatch: text "Baghdad" is dobj of "left" while hyp "Baghdad" is prep_to of "been" which aligned to text "left"
Hand-tuned score: -8.0500
Threshold: -3.3437
Txt: The diplomat will arrive in Baghdad next week.
Hyp: The diplomat has been to Baghdad. (Don't know.)
| The DT |
diplomat NN |
has VBZ |
been VBN |
Baghdad NNP |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| diplomat:NN | 20.00 | 0.00 | 14.34 | 14.34 | 9.84 | 19.87 |
| will:MD | 10.00 | 20.00 | 18.69 | 20.00 | 20.50 | 10.00 |
| arrive:VB | 20.00 | 13.56 | 10.00 | 10.00 | 15.50 | 20.00 |
| Baghdad:NNP | 20.50 | 9.84 | 13.02 | 14.84 | 0.00 | 20.50 |
| next:JJ | 20.00 | 11.96 | 11.96 | 11.96 | 12.46 | 20.00 |
| week:NN | 20.00 | 10.00 | 13.69 | 15.00 | 10.50 | 16.93 |
| .:. | 10.00 | 19.87 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -12.0000
Features matched: Adjunct.dropPosCxt: text adjunct "week" of "arrive" dropped on aligned hyp word "been"; NullPunisher.aux: has; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "arrive"; Structure.relMismatch: text "Baghdad" is prep_in of "arrive" while hyp "Baghdad" is prep_to of "been" which aligned to text "arrive"
Hand-tuned score: -1.5500
Threshold: -3.3437
Txt: The diplomat will arrive in Baghdad next week.
Hyp: The diplomat has not been to Baghdad. (Don't know.)
| The DT |
diplomat NN |
has VBZ |
not RB |
been VBN |
Baghdad NNP |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| diplomat:NN | 20.00 | 0.00 | 14.34 | 14.96 | 14.34 | 9.84 | 19.87 |
| will:MD | 10.00 | 20.00 | 18.69 | 19.96 | 20.00 | 20.50 | 10.00 |
| arrive:VB | 20.00 | 13.56 | 10.00 | 19.96 | 10.00 | 15.50 | 20.00 |
| Baghdad:NNP | 20.50 | 9.84 | 13.02 | 15.46 | 14.84 | 0.00 | 20.50 |
| next:JJ | 20.00 | 11.96 | 11.96 | 11.96 | 11.96 | 12.46 | 20.00 |
| week:NN | 20.00 | 10.00 | 13.69 | 14.96 | 15.00 | 10.50 | 16.93 |
| .:. | 10.00 | 19.87 | 20.00 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -21.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "been": neg; NullPunisher.aux: has; NullPunisher.other: not; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "arrive"; Structure.relMismatch: text "Baghdad" is prep_in of "arrive" while hyp "Baghdad" is prep_to of "been" which aligned to text "arrive"
Hand-tuned score: -8.0500
Threshold: -3.3437
Txt: The president knows that the diplomat left Baghdad.
Hyp: The diplomat has been to Baghdad. (Yes.)
| The DT |
diplomat NN |
has VBZ |
been VBN |
Baghdad NNP |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| president:NN | 20.00 | 3.94 | 14.34 | 14.34 | 9.84 | 20.00 |
| knows:VBZ | 20.00 | 13.88 | 10.00 | 8.07 | 15.50 | 20.00 |
| that:IN | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| diplomat:NN | 20.00 | 0.00 | 14.34 | 14.34 | 9.84 | 19.87 |
| left:VBD | 20.00 | 14.34 | 7.52 | 9.34 | 11.79 | 19.46 |
| Baghdad:NNP | 20.50 | 9.84 | 13.02 | 14.84 | 0.00 | 20.50 |
| .:. | 10.00 | 19.87 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (INCORRECT)
Justification:
Alignment score: -13.0669
Features matched: NullPunisher.aux: has; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "knows"; Structure.argsMismatch: args have different parents but same relations: text "diplomat" <-nsubj-- "left vs. hyp "diplomat" <-nsubj-- "been", which aligned to text "knows" args have different parents, different relations: text "Baghdad" <-dobj-- "left" vs. hyp "Baghdad" <-prep_to-- "been", which aligned to text "knows"
Hand-tuned score: -4.0500
Threshold: -3.3437
Txt: The president knows that the diplomat left Baghdad.
Hyp: The diplomat has not been to Baghdad. (Don't know.)
| The DT |
diplomat NN |
has VBZ |
not RB |
been VBN |
Baghdad NNP |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| president:NN | 20.00 | 3.94 | 14.34 | 14.96 | 14.34 | 9.84 | 20.00 |
| knows:VBZ | 20.00 | 13.88 | 10.00 | 19.96 | 8.07 | 15.50 | 20.00 |
| that:IN | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| diplomat:NN | 20.00 | 0.00 | 14.34 | 14.96 | 14.34 | 9.84 | 19.87 |
| left:VBD | 20.00 | 14.34 | 7.52 | 19.96 | 9.34 | 11.79 | 19.46 |
| Baghdad:NNP | 20.50 | 9.84 | 13.02 | 15.46 | 14.84 | 0.00 | 20.50 |
| .:. | 10.00 | 19.87 | 20.00 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -22.0669
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "been": neg; NullPunisher.aux: has; NullPunisher.other: not; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "knows"; Structure.argsMismatch: args have different parents but same relations: text "diplomat" <-nsubj-- "left vs. hyp "diplomat" <-nsubj-- "been", which aligned to text "knows" args have different parents, different relations: text "Baghdad" <-dobj-- "left" vs. hyp "Baghdad" <-prep_to-- "been", which aligned to text "knows"
Hand-tuned score: -10.0500
Threshold: -3.3437
Txt: The president hasn't gone to Iraq since the diplomat left Baghdad.
Hyp: The diplomat has been to Baghdad. (Yes.)
| The DT |
diplomat NN |
has VBZ |
been VBN |
Baghdad NNP |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| president:NN | 20.00 | 3.94 | 14.34 | 14.34 | 9.84 | 20.00 |
| has:VBZ | 20.00 | 14.34 | 0.00 | 9.34 | 13.02 | 20.00 |
| n't:RB | 20.00 | 14.17 | 19.96 | 19.96 | 15.46 | 17.90 |
| gone:VBN | 20.00 | 13.08 | 8.69 | 6.07 | 14.84 | 19.35 |
| Iraq:NNP | 20.50 | 9.84 | 13.02 | 14.84 | 2.00 | 20.50 |
| since:IN | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| diplomat:NN | 20.00 | 0.00 | 14.34 | 14.34 | 9.84 | 19.87 |
| left:VBD | 20.00 | 14.34 | 7.52 | 9.34 | 11.79 | 19.46 |
| Baghdad:NNP | 20.50 | 9.84 | 13.02 | 14.84 | 0.00 | 20.50 |
| .:. | 10.00 | 19.87 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (INCORRECT)
Justification:
Alignment score: -10.0708
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.txtNegMarker: "gone": neg; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "gone"; Structure.argsMismatch: args have different parents but same relations: text "diplomat" <-nsubj-- "left vs. hyp "diplomat" <-nsubj-- "been", which aligned to text "gone" args have different parents, different relations: text "Baghdad" <-dobj-- "left" vs. hyp "Baghdad" <-prep_to-- "been", which aligned to text "gone"
Hand-tuned score: -9.0000
Threshold: -3.3437
Txt: The president hasn't gone to Iraq since the diplomat left Baghdad.
Hyp: The diplomat has not been to Baghdad. (Don't know.)
| The DT |
diplomat NN |
has VBZ |
not RB |
been VBN |
Baghdad NNP |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| president:NN | 20.00 | 3.94 | 14.34 | 14.96 | 14.34 | 9.84 | 20.00 |
| has:VBZ | 20.00 | 14.34 | 0.00 | 19.96 | 9.34 | 13.02 | 20.00 |
| n't:RB | 20.00 | 14.17 | 19.96 | 0.50 | 19.96 | 15.46 | 17.90 |
| gone:VBN | 20.00 | 13.08 | 8.69 | 19.96 | 6.07 | 14.84 | 19.35 |
| Iraq:NNP | 20.50 | 9.84 | 13.02 | 15.46 | 14.84 | 2.00 | 20.50 |
| since:IN | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| diplomat:NN | 20.00 | 0.00 | 14.34 | 14.96 | 14.34 | 9.84 | 19.87 |
| left:VBD | 20.00 | 14.34 | 7.52 | 19.96 | 9.34 | 11.79 | 19.46 |
| Baghdad:NNP | 20.50 | 9.84 | 13.02 | 15.46 | 14.84 | 0.00 | 20.50 |
| .:. | 10.00 | 19.87 | 20.00 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -10.5708
Features matched: Adjunct.dropNegCxt: text adjunct "Iraq" of "gone" dropped on aligned hyp word "been"; Polarity.hypNegMarker: "been": neg; Polarity.txtNegMarker: "gone": neg; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "gone"; Structure.argsMismatch: args have different parents but same relations: text "diplomat" <-nsubj-- "left vs. hyp "diplomat" <-nsubj-- "been", which aligned to text "gone" args have different parents, different relations: text "Baghdad" <-dobj-- "left" vs. hyp "Baghdad" <-prep_to-- "been", which aligned to text "gone" ; Polarity.txtNegMarker&PolarityhypNegMarker:
Hand-tuned score: -7.0000
Threshold: -3.3437
Txt: The president hasn't gone to Iraq since the diplomat left Baghdad.
Hyp: The president has been to Iraq. (Don't know.)
| The DT |
president NN |
has VBZ |
been VBN |
Iraq NNP |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| president:NN | 20.00 | 0.00 | 14.34 | 14.34 | 9.84 | 20.00 |
| has:VBZ | 20.00 | 14.34 | 0.00 | 9.34 | 13.02 | 20.00 |
| n't:RB | 20.00 | 14.96 | 19.96 | 19.96 | 15.46 | 17.90 |
| gone:VBN | 20.00 | 12.72 | 8.69 | 6.07 | 14.84 | 19.35 |
| Iraq:NNP | 20.50 | 9.84 | 13.02 | 14.84 | 0.00 | 20.50 |
| since:IN | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| diplomat:NN | 20.00 | 3.94 | 14.34 | 14.34 | 9.84 | 19.87 |
| left:VBD | 20.00 | 13.35 | 7.52 | 9.34 | 12.61 | 19.46 |
| Baghdad:NNP | 20.50 | 9.84 | 13.02 | 14.84 | 2.00 | 20.50 |
| .:. | 10.00 | 20.00 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -6.0708
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.txtNegMarker: "gone": neg; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "gone"
Hand-tuned score: -6.0000
Threshold: -3.3437
Txt: The president hasn't gone to Iraq since the diplomat left Baghdad.
Hyp: The president has not been to Iraq. (Don't know.)
| The DT |
president NN |
has VBZ |
not RB |
been VBN |
Iraq NNP |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| president:NN | 20.00 | 0.00 | 14.34 | 14.96 | 14.34 | 9.84 | 20.00 |
| has:VBZ | 20.00 | 14.34 | 0.00 | 19.96 | 9.34 | 13.02 | 20.00 |
| n't:RB | 20.00 | 14.96 | 19.96 | 0.50 | 19.96 | 15.46 | 17.90 |
| gone:VBN | 20.00 | 12.72 | 8.69 | 19.96 | 6.07 | 14.84 | 19.35 |
| Iraq:NNP | 20.50 | 9.84 | 13.02 | 15.46 | 14.84 | 0.00 | 20.50 |
| since:IN | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| diplomat:NN | 20.00 | 3.94 | 14.34 | 14.96 | 14.34 | 9.84 | 19.87 |
| left:VBD | 20.00 | 13.35 | 7.52 | 19.96 | 9.34 | 12.61 | 19.46 |
| Baghdad:NNP | 20.50 | 9.84 | 13.02 | 15.46 | 14.84 | 2.00 | 20.50 |
| .:. | 10.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -6.5708
Features matched: Polarity.hypNegMarker: "been": neg; Polarity.txtNegMarker: "gone": neg; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "gone"; Polarity.txtNegMarker&PolarityhypNegMarker:
Hand-tuned score: -1.0000
Threshold: -3.3437
Txt: The diplomat didn't manage to leave Baghdad.
Hyp: The diplomat has been to Baghdad. (Yes.)
| The DT |
diplomat NN |
has VBZ |
been VBN |
Baghdad NNP |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| diplomat:NN | 20.00 | 0.00 | 14.34 | 14.34 | 9.84 | 19.87 |
| did:VBD | 20.00 | 15.00 | 7.53 | 6.07 | 15.50 | 17.99 |
| n't:RB | 20.00 | 14.17 | 19.96 | 19.96 | 15.46 | 17.90 |
| manage:VB | 20.00 | 15.00 | 10.00 | 8.07 | 15.50 | 19.98 |
| to:TO | 10.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| leave:VB | 20.00 | 15.00 | 8.69 | 7.74 | 15.50 | 19.32 |
| Baghdad:NNP | 20.50 | 9.84 | 13.02 | 14.84 | 0.00 | 20.50 |
| .:. | 10.00 | 19.87 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (INCORRECT)
Justification:
Alignment score: -11.0669
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.txtNegMarker: "manage": neg; NullPunisher.aux: has; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "manage"; Structure.parentsMismatch: args have different parents, different relations: text "Baghdad" <-dobj-- "leave" vs. hyp "Baghdad" <-prep_to-- "been", which aligned to text "manage"
Hand-tuned score: -9.0500
Threshold: -3.3437
Txt: The diplomat didn't manage to leave Baghdad.
Hyp: The diplomat has not been to Baghdad. (Don't know.)
| The DT |
diplomat NN |
has VBZ |
not RB |
been VBN |
Baghdad NNP |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| diplomat:NN | 20.00 | 0.00 | 14.34 | 14.96 | 14.34 | 9.84 | 19.87 |
| did:VBD | 20.00 | 15.00 | 7.53 | 19.96 | 6.07 | 15.50 | 17.99 |
| n't:RB | 20.00 | 14.17 | 19.96 | 0.50 | 19.96 | 15.46 | 17.90 |
| manage:VB | 20.00 | 15.00 | 10.00 | 19.96 | 8.07 | 15.50 | 19.98 |
| to:TO | 10.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| leave:VB | 20.00 | 15.00 | 8.69 | 19.96 | 7.74 | 15.50 | 19.32 |
| Baghdad:NNP | 20.50 | 9.84 | 13.02 | 15.46 | 14.84 | 0.00 | 20.50 |
| .:. | 10.00 | 19.87 | 20.00 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -11.5669
Features matched: Polarity.hypNegMarker: "been": neg; Polarity.txtNegMarker: "manage": neg; NullPunisher.aux: has; RootEntailment.poorlyAlignedRoot: "been" aligned badly to "manage"; Structure.parentsMismatch: args have different parents, different relations: text "Baghdad" <-dobj-- "leave" vs. hyp "Baghdad" <-prep_to-- "been", which aligned to text "manage" ; Polarity.txtNegMarker&PolarityhypNegMarker:
Hand-tuned score: -4.0500
Threshold: -3.3437
Txt: The diplomat hasn't managed to leave Baghdad.
Hyp: The diplomat is in Baghdad now. (Yes.)
| The DT |
diplomat NN |
is VBZ |
Baghdad NNP |
now RB |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.50 | 20.00 | 10.00 |
| diplomat:NN | 20.00 | 0.00 | 14.34 | 9.84 | 15.00 | 19.87 |
| has:VBZ | 20.00 | 14.34 | 8.64 | 13.02 | 18.69 | 20.00 |
| n't:RB | 20.00 | 14.17 | 19.96 | 15.46 | 9.96 | 17.90 |
| managed:VBN | 20.00 | 15.00 | 8.07 | 15.50 | 20.00 | 20.00 |
| to:TO | 10.00 | 20.00 | 20.00 | 20.50 | 20.00 | 10.00 |
| leave:VB | 20.00 | 15.00 | 7.74 | 15.50 | 18.69 | 19.32 |
| Baghdad:NNP | 20.50 | 9.84 | 14.84 | 0.00 | 15.50 | 20.50 |
| .:. | 10.00 | 19.87 | 20.00 | 20.50 | 20.00 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 10.00 | 9.00 | 10.00 |
Response: dontknow (INCORRECT)
Justification:
Alignment score: -19.0669
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.txtNegMarker: "managed": neg; NullPunisher.other: now; RootEntailment.poorlyAlignedRoot: "is" aligned badly to "managed"; Structure.parentsMismatch: args have different parents, different relations: text "Baghdad" <-dobj-- "leave" vs. hyp "Baghdad" <-prep_in-- "is", which aligned to text "managed"
Hand-tuned score: -10.0000
Threshold: -3.3437
Txt: The diplomat hasn't managed to leave Baghdad.
Hyp: The diplomat is not in Baghdad now. (Don't know.)
| The DT |
diplomat NN |
is VBZ |
not RB |
Baghdad NNP |
now RB |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 | 10.00 |
| diplomat:NN | 20.00 | 0.00 | 14.34 | 14.96 | 9.84 | 15.00 | 19.87 |
| has:VBZ | 20.00 | 14.34 | 8.64 | 19.96 | 13.02 | 18.69 | 20.00 |
| n't:RB | 20.00 | 14.17 | 19.96 | 0.50 | 15.46 | 9.96 | 17.90 |
| managed:VBN | 20.00 | 15.00 | 8.07 | 19.96 | 15.50 | 20.00 | 20.00 |
| to:TO | 10.00 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 | 10.00 |
| leave:VB | 20.00 | 15.00 | 7.74 | 19.96 | 15.50 | 18.69 | 19.32 |
| Baghdad:NNP | 20.50 | 9.84 | 14.84 | 15.46 | 0.00 | 15.50 | 20.50 |
| .:. | 10.00 | 19.87 | 20.00 | 20.00 | 20.50 | 20.00 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 9.00 | 10.00 | 9.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -19.5669
Features matched: Adjunct.addNegCxt: hyp added now[now-RB]; Polarity.hypNegMarker: "is": neg; Polarity.txtNegMarker: "managed": neg; NullPunisher.other: now; RootEntailment.poorlyAlignedRoot: "is" aligned badly to "managed"; Structure.parentsMismatch: args have different parents, different relations: text "Baghdad" <-dobj-- "leave" vs. hyp "Baghdad" <-prep_in-- "is", which aligned to text "managed" ; Polarity.txtNegMarker&PolarityhypNegMarker:
Hand-tuned score: -4.0000
Threshold: -3.3437
Txt: The room was full of intelligent women.
Hyp: The room was full of women. (Yes.)
| The DT |
room NN |
was VBD |
full JJ |
women NNS |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| room:NN | 20.00 | 0.00 | 14.34 | 10.93 | 8.13 | 19.15 |
| was:VBD | 20.00 | 14.34 | 0.00 | 12.00 | 14.34 | 20.00 |
| full:JJ | 20.00 | 10.93 | 12.00 | 0.00 | 11.15 | 17.26 |
| intelligent:JJ | 20.00 | 11.96 | 11.96 | 9.96 | 9.83 | 19.37 |
| women:NNS | 20.00 | 8.13 | 14.34 | 11.15 | 0.00 | 19.64 |
| .:. | 10.00 | 19.15 | 20.00 | 17.26 | 19.64 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "intelligent" of "women" dropped on aligned hyp word "women"; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 2.0000
Threshold: -3.3437
Txt: The room was full of intelligent women.
Hyp: The room was not full of women. (Don't know.)
| The DT |
room NN |
was VBD |
not RB |
full JJ |
women NNS |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| room:NN | 20.00 | 0.00 | 14.34 | 14.96 | 10.93 | 8.13 | 19.15 |
| was:VBD | 20.00 | 14.34 | 0.00 | 19.96 | 12.00 | 14.34 | 20.00 |
| full:JJ | 20.00 | 10.93 | 12.00 | 11.96 | 0.00 | 11.15 | 17.26 |
| intelligent:JJ | 20.00 | 11.96 | 11.96 | 11.96 | 9.96 | 9.83 | 19.37 |
| women:NNS | 20.00 | 8.13 | 14.34 | 14.96 | 11.15 | 0.00 | 19.64 |
| .:. | 10.00 | 19.15 | 20.00 | 20.00 | 17.26 | 19.64 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 9.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -9.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "full": neg; NullPunisher.other: not
Hand-tuned score: -5.0000
Threshold: -3.3437
Txt: The room was full of women.
Hyp: The room was full of intelligent women. (Don't know.)
| The DT |
room NN |
was VBD |
full JJ |
intelligent JJ |
women NNS |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| room:NN | 20.00 | 0.00 | 14.34 | 10.93 | 11.96 | 8.13 | 19.15 |
| was:VBD | 20.00 | 14.34 | 0.00 | 12.00 | 11.96 | 14.34 | 20.00 |
| full:JJ | 20.00 | 10.93 | 12.00 | 0.00 | 9.96 | 11.15 | 17.26 |
| women:NNS | 20.00 | 8.13 | 14.34 | 11.15 | 9.83 | 0.00 | 19.64 |
| .:. | 10.00 | 19.15 | 20.00 | 17.26 | 19.37 | 19.64 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 9.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -9.0000
Features matched: Adjunct.addPosCxt: hyp added intelligent[intelligent-JJ]; NullPunisher.other: intelligent
Hand-tuned score: -1.0000
Threshold: -3.3437
Txt: The room was full of women.
Hyp: The room was not full of intelligent women. (Don't know.)
| The DT |
room NN |
was VBD |
not RB |
full JJ |
intelligent JJ |
women NNS |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| room:NN | 20.00 | 0.00 | 14.34 | 14.96 | 10.93 | 11.96 | 8.13 | 19.15 |
| was:VBD | 20.00 | 14.34 | 0.00 | 19.96 | 12.00 | 11.96 | 14.34 | 20.00 |
| full:JJ | 20.00 | 10.93 | 12.00 | 11.96 | 0.00 | 9.96 | 11.15 | 17.26 |
| women:NNS | 20.00 | 8.13 | 14.34 | 14.96 | 11.15 | 9.83 | 0.00 | 19.64 |
| .:. | 10.00 | 19.15 | 20.00 | 20.00 | 17.26 | 19.37 | 19.64 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 9.00 | 9.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -18.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "full": neg; NullPunisher.other: not; NullPunisher.other: intelligent
Hand-tuned score: -6.0000
Threshold: -3.3437
Txt: Children are not admitted to the theatre.
Hyp: Small children are admitted to the theatre. (Don't know.)
| Small JJ |
children NNS |
are VBP |
admitted VBN |
the DT |
theater NN |
. . |
|
| Children:NNP | 11.34 | 0.00 | 15.00 | 15.00 | 20.00 | 8.95 | 20.00 |
| are:VBP | 10.69 | 15.00 | 0.00 | 10.00 | 20.00 | 15.00 | 20.00 |
| not:RB | 11.96 | 14.96 | 19.96 | 19.96 | 20.00 | 14.96 | 20.00 |
| admitted:VBN | 12.00 | 12.85 | 10.00 | 0.00 | 20.00 | 15.00 | 19.33 |
| the:DT | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| theater:NN | 11.34 | 8.95 | 15.00 | 15.00 | 20.00 | 0.00 | 20.00 |
| .:. | 20.00 | 19.49 | 20.00 | 19.33 | 10.00 | 20.00 | 0.00 |
| NO_WORD | 9.00 | 10.00 | 1.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -9.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.txtNegMarker: "admitted": neg; NullPunisher.other: Small
Hand-tuned score: -5.0000
Threshold: -3.3437
Txt: Children are not admitted to the theatre.
Hyp: Small children are not admitted to the theatre. (Yes.)
| Small JJ |
children NNS |
are VBP |
not RB |
admitted VBN |
the DT |
theater NN |
. . |
|
| Children:NNP | 11.34 | 0.00 | 15.00 | 14.96 | 15.00 | 20.00 | 8.95 | 20.00 |
| are:VBP | 10.69 | 15.00 | 0.00 | 19.96 | 10.00 | 20.00 | 15.00 | 20.00 |
| not:RB | 11.96 | 14.96 | 19.96 | 0.00 | 19.96 | 20.00 | 14.96 | 20.00 |
| admitted:VBN | 12.00 | 12.85 | 10.00 | 19.96 | 0.00 | 20.00 | 15.00 | 19.33 |
| the:DT | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| theater:NN | 11.34 | 8.95 | 15.00 | 14.96 | 15.00 | 20.00 | 0.00 | 20.00 |
| .:. | 20.00 | 19.49 | 20.00 | 20.00 | 19.33 | 10.00 | 20.00 | 0.00 |
| NO_WORD | 9.00 | 10.00 | 1.00 | 9.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -9.0000
Features matched: Adjunct.addNegCxt: hyp added Small[Small-JJ]; Polarity.hypNegMarker: "admitted": neg; Polarity.txtNegMarker: "admitted": neg; NullPunisher.other: Small; Polarity.txtNegMarker&PolarityhypNegMarker:
Hand-tuned score: 1.0000
Threshold: -3.3437
Txt: Small children are not admitted to the theatre.
Hyp: Children are admitted to the theatre. (Don't know.)
| Children NNP |
are VBP |
admitted VBN |
the DT |
theater NN |
. . |
|
| Small:JJ | 11.34 | 10.69 | 12.00 | 20.00 | 11.34 | 20.00 |
| children:NNS | 0.00 | 15.00 | 12.85 | 20.00 | 8.95 | 19.49 |
| are:VBP | 15.00 | 0.00 | 10.00 | 20.00 | 15.00 | 20.00 |
| not:RB | 14.96 | 19.96 | 19.96 | 20.00 | 14.96 | 20.00 |
| admitted:VBN | 15.00 | 10.00 | 0.00 | 20.00 | 15.00 | 19.33 |
| the:DT | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| theater:NN | 8.95 | 15.00 | 15.00 | 20.00 | 0.00 | 20.00 |
| .:. | 20.00 | 20.00 | 19.33 | 10.00 | 20.00 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.txtNegMarker: "admitted": neg
Hand-tuned score: -4.0000
Threshold: -3.3437
Txt: Small children are not admitted to the theatre.
Hyp: Children are not admitted to the theatre. (Don't know.)
| Children NNP |
are VBP |
not RB |
admitted VBN |
the DT |
theater NN |
. . |
|
| Small:JJ | 11.34 | 10.69 | 11.96 | 12.00 | 20.00 | 11.34 | 20.00 |
| children:NNS | 0.00 | 15.00 | 14.96 | 12.85 | 20.00 | 8.95 | 19.49 |
| are:VBP | 15.00 | 0.00 | 19.96 | 10.00 | 20.00 | 15.00 | 20.00 |
| not:RB | 14.96 | 19.96 | 0.00 | 19.96 | 20.00 | 14.96 | 20.00 |
| admitted:VBN | 15.00 | 10.00 | 19.96 | 0.00 | 20.00 | 15.00 | 19.33 |
| the:DT | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| theater:NN | 8.95 | 15.00 | 14.96 | 15.00 | 20.00 | 0.00 | 20.00 |
| .:. | 20.00 | 20.00 | 20.00 | 19.33 | 10.00 | 20.00 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropNegCxt: text adjunct "Small" of "children" dropped on aligned hyp word "Children"; Polarity.hypNegMarker: "admitted": neg; Polarity.txtNegMarker: "admitted": neg; Polarity.txtNegMarker&PolarityhypNegMarker:
Hand-tuned score: -2.0000
Threshold: -3.3437
Txt: All companies have to file annual reports.
Hyp: All Fortune 500 companies have to file annual reports. (Yes.)
| All DT |
Fortune JJ |
500 CD |
companies NNS |
have VBP |
to TO |
file VB |
annual JJ |
reports NNS |
. . |
|
| All:DT | 0.00 | 20.00 | 20.50 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| companies:NNS | 20.00 | 9.44 | 19.02 | 0.00 | 12.80 | 20.00 | 13.13 | 10.11 | 7.80 | 19.68 |
| have:VBP | 20.00 | 12.00 | 20.50 | 12.80 | 0.00 | 20.00 | 7.02 | 10.11 | 12.80 | 20.00 |
| to:TO | 10.00 | 20.00 | 20.50 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| file:VB | 20.00 | 12.00 | 19.19 | 13.13 | 7.02 | 20.00 | 0.00 | 10.03 | 12.37 | 20.00 |
| annual:JJ | 20.00 | 10.00 | 19.41 | 10.11 | 10.11 | 20.00 | 10.03 | 0.00 | 10.23 | 20.00 |
| reports:NNS | 20.00 | 12.00 | 19.19 | 7.80 | 12.80 | 20.00 | 12.37 | 10.23 | 0.00 | 19.70 |
| .:. | 10.00 | 20.00 | 19.65 | 19.68 | 20.00 | 10.00 | 20.00 | 20.00 | 19.70 | 0.00 |
| NO_WORD | 10.00 | 9.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 9.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -19.0000
Features matched: Adjunct.addAllCxt: all cxt -- hyp adds 500[500-CD];, hyp added 500[500-CD]; Modal.weakYes: necessary -> necessary; NullPunisher.other: 500; NullPunisher.other: Fortune; Quant.contract: [all,all]
Hand-tuned score: 4.0000
Threshold: -3.3437
Txt: All companies have to file annual reports.
Hyp: Not all Fortune 500 companies have to file annual reports. (Don't know.)
| Not RB |
all DT |
Fortune JJ |
500 CD |
companies NNS |
have VBP |
to TO |
file VB |
annual JJ |
reports NNS |
. . |
|
| All:DT | 20.00 | 0.00 | 20.00 | 20.50 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| companies:NNS | 14.96 | 20.00 | 9.44 | 19.02 | 0.00 | 12.80 | 20.00 | 13.13 | 10.11 | 7.80 | 19.68 |
| have:VBP | 19.96 | 20.00 | 12.00 | 20.50 | 12.80 | 0.00 | 20.00 | 7.02 | 10.11 | 12.80 | 20.00 |
| to:TO | 20.00 | 10.00 | 20.00 | 20.50 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| file:VB | 19.96 | 20.00 | 12.00 | 19.19 | 13.13 | 7.02 | 20.00 | 0.00 | 10.03 | 12.37 | 20.00 |
| annual:JJ | 11.96 | 20.00 | 10.00 | 19.41 | 10.11 | 10.11 | 20.00 | 10.03 | 0.00 | 10.23 | 20.00 |
| reports:NNS | 14.96 | 20.00 | 12.00 | 19.19 | 7.80 | 12.80 | 20.00 | 12.37 | 10.23 | 0.00 | 19.70 |
| .:. | 20.00 | 10.00 | 20.00 | 19.65 | 19.68 | 20.00 | 10.00 | 20.00 | 20.00 | 19.70 | 0.00 |
| NO_WORD | 9.00 | 10.00 | 9.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 9.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -28.0000
Features matched: Adjunct.addAllCxt: all cxt -- hyp adds 500[500-CD];, hyp added 500[500-CD]; Modal.weakYes: necessary -> necessary; NullPunisher.other: Fortune; NullPunisher.other: Not; NullPunisher.other: 500; Quant.contract: [all,all]
Hand-tuned score: 3.0000
Threshold: -3.3437
Txt: All Fortune 500 companies have to file annual reports.
Hyp: All companies have to file annual reports. (Don't know.)
| All DT |
companies NNS |
have VBP |
to TO |
file VB |
annual JJ |
reports NNS |
. . |
|
| All:DT | 0.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| Fortune:JJ | 20.00 | 9.44 | 12.00 | 20.00 | 12.00 | 10.00 | 12.00 | 20.00 |
| 500:CD | 20.50 | 19.02 | 20.50 | 20.50 | 19.19 | 19.41 | 19.19 | 19.65 |
| companies:NNS | 20.00 | 0.00 | 12.80 | 20.00 | 13.13 | 10.11 | 7.80 | 19.68 |
| have:VBP | 20.00 | 12.80 | 0.00 | 20.00 | 7.02 | 10.11 | 12.80 | 20.00 |
| to:TO | 10.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| file:VB | 20.00 | 13.13 | 7.02 | 20.00 | 0.00 | 10.03 | 12.37 | 20.00 |
| annual:JJ | 20.00 | 10.11 | 10.11 | 20.00 | 10.03 | 0.00 | 10.23 | 20.00 |
| reports:NNS | 20.00 | 7.80 | 12.80 | 20.00 | 12.37 | 10.23 | 0.00 | 19.70 |
| .:. | 10.00 | 19.68 | 20.00 | 10.00 | 20.00 | 20.00 | 19.70 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 9.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropAllCxt: all cxt -- hyp drops adjunct 500 of companies aligned to hyp companies, text adjunct "500" of "companies" dropped on aligned hyp word "companies"; Modal.weakYes: necessary -> necessary; Quant.contract: [all,all]
Hand-tuned score: 0.0000
Threshold: -3.3437
Txt: All Fortune 500 companies have to file annual reports.
Hyp: Not all companies have to file annual reports. (Don't know.)
| Not RB |
all DT |
companies NNS |
have VBP |
to TO |
file VB |
annual JJ |
reports NNS |
. . |
|
| All:DT | 20.00 | 0.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| Fortune:JJ | 11.96 | 20.00 | 9.44 | 12.00 | 20.00 | 12.00 | 10.00 | 12.00 | 20.00 |
| 500:CD | 20.46 | 20.50 | 19.02 | 20.50 | 20.50 | 19.19 | 19.41 | 19.19 | 19.65 |
| companies:NNS | 14.96 | 20.00 | 0.00 | 12.80 | 20.00 | 13.13 | 10.11 | 7.80 | 19.68 |
| have:VBP | 19.96 | 20.00 | 12.80 | 0.00 | 20.00 | 7.02 | 10.11 | 12.80 | 20.00 |
| to:TO | 20.00 | 10.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| file:VB | 19.96 | 20.00 | 13.13 | 7.02 | 20.00 | 0.00 | 10.03 | 12.37 | 20.00 |
| annual:JJ | 11.96 | 20.00 | 10.11 | 10.11 | 20.00 | 10.03 | 0.00 | 10.23 | 20.00 |
| reports:NNS | 14.96 | 20.00 | 7.80 | 12.80 | 20.00 | 12.37 | 10.23 | 0.00 | 19.70 |
| .:. | 20.00 | 10.00 | 19.68 | 20.00 | 10.00 | 20.00 | 20.00 | 19.70 | 0.00 |
| NO_WORD | 9.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 9.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -9.0000
Features matched: Adjunct.dropAllCxt: all cxt -- hyp drops adjunct 500 of companies aligned to hyp companies, text adjunct "500" of "companies" dropped on aligned hyp word "companies", hyp added Not[Not-RB]; Modal.weakYes: necessary -> necessary; NullPunisher.other: Not; Quant.contract: [all,all]
Hand-tuned score: -1.0000
Threshold: -3.3437
Txt: All companies have to file annual reports to the sec.
Hyp: All companies have to file annual reports. (Yes.)
| All DT |
companies NNS |
have VBP |
to TO |
file VB |
annual JJ |
reports NNS |
. . |
|
| All:DT | 0.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| companies:NNS | 20.00 | 0.00 | 12.80 | 20.00 | 13.13 | 10.11 | 7.80 | 19.68 |
| have:VBP | 20.00 | 12.80 | 0.00 | 20.00 | 7.02 | 10.11 | 12.80 | 20.00 |
| to:TO | 10.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| file:VB | 20.00 | 13.13 | 7.02 | 20.00 | 0.00 | 10.03 | 12.37 | 20.00 |
| annual:JJ | 20.00 | 10.11 | 10.11 | 20.00 | 10.03 | 0.00 | 10.23 | 20.00 |
| reports:NNS | 20.00 | 7.80 | 12.80 | 20.00 | 12.37 | 10.23 | 0.00 | 19.70 |
| the:DT | 10.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| sec:NN | 20.00 | 6.69 | 15.00 | 20.00 | 12.11 | 12.00 | 7.76 | 19.27 |
| .:. | 10.00 | 19.68 | 20.00 | 10.00 | 20.00 | 20.00 | 19.70 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 9.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: 0.0000
Features matched: Adjunct.dropPosCxt: text adjunct "sec" of "file" dropped on aligned hyp word "file"; Modal.weakYes: necessary -> necessary; Quant.contract: [all,all]; Adjunct.dropPosCxt&Align.veryGood:
Hand-tuned score: 4.0000
Threshold: -3.3437
Txt: All companies have to file annual reports to the sec.
Hyp: Not all companies have to file annual reports. (Don't know.)
| Not RB |
all DT |
companies NNS |
have VBP |
to TO |
file VB |
annual JJ |
reports NNS |
. . |
|
| All:DT | 20.00 | 0.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| companies:NNS | 14.96 | 20.00 | 0.00 | 12.80 | 20.00 | 13.13 | 10.11 | 7.80 | 19.68 |
| have:VBP | 19.96 | 20.00 | 12.80 | 0.00 | 20.00 | 7.02 | 10.11 | 12.80 | 20.00 |
| to:TO | 20.00 | 10.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| file:VB | 19.96 | 20.00 | 13.13 | 7.02 | 20.00 | 0.00 | 10.03 | 12.37 | 20.00 |
| annual:JJ | 11.96 | 20.00 | 10.11 | 10.11 | 20.00 | 10.03 | 0.00 | 10.23 | 20.00 |
| reports:NNS | 14.96 | 20.00 | 7.80 | 12.80 | 20.00 | 12.37 | 10.23 | 0.00 | 19.70 |
| the:DT | 20.00 | 10.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| sec:NN | 14.96 | 20.00 | 6.69 | 15.00 | 20.00 | 12.11 | 12.00 | 7.76 | 19.27 |
| .:. | 20.00 | 10.00 | 19.68 | 20.00 | 10.00 | 20.00 | 20.00 | 19.70 | 0.00 |
| NO_WORD | 9.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 9.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -9.0000
Features matched: Adjunct.addAllCxt: all cxt -- hyp adds Not[Not-RB];, text adjunct "sec" of "file" dropped on aligned hyp word "file", hyp added Not[Not-RB]; Modal.weakYes: necessary -> necessary; NullPunisher.other: Not; Quant.contract: [all,all]
Hand-tuned score: 5.0000
Threshold: -3.3437
Txt: All companies have to file annual reports.
Hyp: All companies have to file annual reports to the sec. (Don't know.)
| All DT |
companies NNS |
have VBP |
to TO |
file VB |
annual JJ |
reports NNS |
the DT |
sec NN |
. . |
|
| All:DT | 0.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
| companies:NNS | 20.00 | 0.00 | 12.80 | 20.00 | 13.13 | 10.11 | 7.80 | 20.00 | 6.69 | 19.68 |
| have:VBP | 20.00 | 12.80 | 0.00 | 20.00 | 7.02 | 10.11 | 12.80 | 20.00 | 15.00 | 20.00 |
| to:TO | 10.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
| file:VB | 20.00 | 13.13 | 7.02 | 20.00 | 0.00 | 10.03 | 12.37 | 20.00 | 12.11 | 20.00 |
| annual:JJ | 20.00 | 10.11 | 10.11 | 20.00 | 10.03 | 0.00 | 10.23 | 20.00 | 12.00 | 20.00 |
| reports:NNS | 20.00 | 7.80 | 12.80 | 20.00 | 12.37 | 10.23 | 0.00 | 20.00 | 7.76 | 19.70 |
| .:. | 10.00 | 19.68 | 20.00 | 10.00 | 20.00 | 20.00 | 19.70 | 10.00 | 19.27 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 9.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -9.6929
Features matched: Modal.weakYes: necessary -> necessary; NullPunisher.article: the; Quant.contract: [all,all]; Quant.contract: [all,the]
Hand-tuned score: 3.9000
Threshold: -3.3437
Txt: All companies have to file annual reports.
Hyp: Not all companies have to file annual reports to the sec. (Don't know.)
| Not RB |
all DT |
companies NNS |
have VBP |
to TO |
file VB |
annual JJ |
reports NNS |
the DT |
sec NN |
. . |
|
| All:DT | 20.00 | 0.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
| companies:NNS | 14.96 | 20.00 | 0.00 | 12.80 | 20.00 | 13.13 | 10.11 | 7.80 | 20.00 | 6.69 | 19.68 |
| have:VBP | 19.96 | 20.00 | 12.80 | 0.00 | 20.00 | 7.02 | 10.11 | 12.80 | 20.00 | 15.00 | 20.00 |
| to:TO | 20.00 | 10.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
| file:VB | 19.96 | 20.00 | 13.13 | 7.02 | 20.00 | 0.00 | 10.03 | 12.37 | 20.00 | 12.11 | 20.00 |
| annual:JJ | 11.96 | 20.00 | 10.11 | 10.11 | 20.00 | 10.03 | 0.00 | 10.23 | 20.00 | 12.00 | 20.00 |
| reports:NNS | 14.96 | 20.00 | 7.80 | 12.80 | 20.00 | 12.37 | 10.23 | 0.00 | 20.00 | 7.76 | 19.70 |
| .:. | 20.00 | 10.00 | 19.68 | 20.00 | 10.00 | 20.00 | 20.00 | 19.70 | 10.00 | 19.27 | 0.00 |
| NO_WORD | 9.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 9.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -18.6929
Features matched: Adjunct.addAllCxt: all cxt -- hyp adds Not[Not-RB];, hyp added Not[Not-RB]; Modal.weakYes: necessary -> necessary; NullPunisher.article: the; NullPunisher.other: Not; Quant.contract: [all,all]; Quant.contract: [all,the]
Hand-tuned score: 5.9000
Threshold: -3.3437
Txt: No delegates finished the report.
Hyp: Some delegates finished the report on time. (Don't know.)
| Some DT |
delegates NNS |
finished VBD |
the DT |
report NN |
on_time IN |
. . |
|
| No:DT | 10.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 10.00 |
| delegates:NNS | 20.00 | 0.00 | 14.34 | 20.00 | 9.57 | 20.00 | 20.00 |
| finished:VBD | 20.00 | 14.34 | 0.00 | 20.00 | 11.89 | 20.00 | 18.93 |
| the:DT | 10.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 10.00 |
| report:NN | 20.00 | 9.57 | 11.89 | 20.00 | 0.00 | 20.00 | 19.87 |
| .:. | 10.00 | 20.00 | 18.93 | 10.00 | 19.87 | 20.00 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -20.0000
Features matched: NullPunisher.other: Some; NullPunisher.other: on_time; Quant.oneNo: [no,some[
Hand-tuned score: -7.0000
Threshold: -3.3437
Txt: No delegates finished the report.
Hyp: No delegates finish the report on time. (Yes.)
| No DT |
delegates NNS |
finish VBP |
the DT |
report NN |
on_time IN |
. . |
|
| No:DT | 0.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 10.00 |
| delegates:NNS | 20.00 | 0.00 | 13.85 | 20.00 | 9.57 | 20.00 | 20.00 |
| finished:VBD | 20.00 | 14.34 | 0.50 | 20.00 | 11.89 | 20.00 | 18.93 |
| the:DT | 10.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 10.00 |
| report:NN | 20.00 | 9.57 | 11.89 | 20.00 | 0.00 | 20.00 | 19.87 |
| .:. | 10.00 | 20.00 | 18.26 | 10.00 | 19.87 | 20.00 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -10.5000
Features matched: NullPunisher.other: on_time; Quant.bothNo: [no,no]
Hand-tuned score: 1.0000
Threshold: -3.3437
Txt: The US troops stayed in Iraq although the war was over.
Hyp: The war was over. (Yes.)
| The DT |
war NN |
was VBD |
over RP |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 10.00 | 10.00 |
| US:NNP | 20.50 | 10.50 | 11.83 | 20.50 | 20.50 |
| troops:NNS | 20.00 | 5.07 | 15.00 | 20.00 | 20.00 |
| stayed:VBD | 20.00 | 12.44 | 9.34 | 18.69 | 18.18 |
| Iraq:NNP | 20.50 | 10.50 | 11.83 | 20.50 | 20.50 |
| although:IN | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 10.00 | 10.00 |
| war:NN | 20.00 | 0.00 | 15.00 | 20.00 | 19.45 |
| was:VBD | 20.00 | 15.00 | 0.00 | 20.00 | 20.00 |
| over:RP | 10.00 | 20.00 | 20.00 | 0.00 | 10.00 |
| .:. | 10.00 | 19.45 | 20.00 | 10.00 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "stayed vs. hyp "." <-punct-- "over", which aligned to text "over"
Hand-tuned score: -2.0000
Threshold: -3.3437
Txt: The US troops stayed in Iraq although the war was over.
Hyp: The war was not over. (Don't know.)
| The DT |
war NN |
was VBD |
not RB |
over RP |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 10.00 | 10.00 |
| US:NNP | 20.50 | 10.50 | 11.83 | 15.46 | 20.50 | 20.50 |
| troops:NNS | 20.00 | 5.07 | 15.00 | 14.96 | 20.00 | 20.00 |
| stayed:VBD | 20.00 | 12.44 | 9.34 | 19.96 | 18.69 | 18.18 |
| Iraq:NNP | 20.50 | 10.50 | 11.83 | 15.46 | 20.50 | 20.50 |
| although:IN | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 20.00 | 10.00 | 10.00 |
| war:NN | 20.00 | 0.00 | 15.00 | 14.96 | 20.00 | 19.45 |
| was:VBD | 20.00 | 15.00 | 0.00 | 19.96 | 20.00 | 20.00 |
| over:RP | 10.00 | 20.00 | 20.00 | 19.96 | 0.00 | 10.00 |
| .:. | 10.00 | 19.45 | 20.00 | 20.00 | 10.00 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "over": neg; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "stayed vs. hyp "." <-punct-- "over", which aligned to text "over"
Hand-tuned score: -8.0000
Threshold: -3.3437
Txt: Since it was cold, he closed the window.
Hyp: It was cold. (Yes.)
| It PRP |
was VBD |
cold JJ |
. . |
|
| Since:IN | 20.00 | 20.00 | 20.00 | 20.00 |
| it:PRP | 0.00 | 15.00 | 15.00 | 20.00 |
| was:VBD | 15.00 | 0.00 | 11.34 | 20.00 |
| cold:JJ | 15.00 | 11.34 | 0.00 | 19.61 |
| ,:, | 20.00 | 20.00 | 20.00 | 5.73 |
| he:PRP | 10.00 | 15.00 | 15.00 | 20.00 |
| closed:VBD | 15.00 | 10.00 | 9.84 | 19.49 |
| the:DT | 20.00 | 20.00 | 20.00 | 10.00 |
| window:NN | 12.00 | 12.52 | 9.28 | 19.62 |
| .:. | 20.00 | 20.00 | 19.61 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "closed vs. hyp "." <-punct-- "cold", which aligned to text "cold"
Hand-tuned score: -2.0000
Threshold: -3.3437
Txt: Since it was cold, he closed the window.
Hyp: It was not cold. (Don't know.)
| It PRP |
was VBD |
not RB |
cold JJ |
. . |
|
| Since:IN | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| it:PRP | 0.00 | 15.00 | 20.00 | 15.00 | 20.00 |
| was:VBD | 15.00 | 0.00 | 19.96 | 11.34 | 20.00 |
| cold:JJ | 15.00 | 11.34 | 11.96 | 0.00 | 19.61 |
| ,:, | 20.00 | 20.00 | 20.00 | 20.00 | 5.73 |
| he:PRP | 10.00 | 15.00 | 20.00 | 15.00 | 20.00 |
| closed:VBD | 15.00 | 10.00 | 19.96 | 9.84 | 19.49 |
| the:DT | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| window:NN | 12.00 | 12.52 | 14.96 | 9.28 | 19.62 |
| .:. | 20.00 | 20.00 | 20.00 | 19.61 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 9.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "cold": neg; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "closed vs. hyp "." <-punct-- "cold", which aligned to text "cold"
Hand-tuned score: -8.0000
Threshold: -3.3437
Txt: John didn't visit us after he returned from Spain.
Hyp: John returned from Spain. (Yes.)
| John NNP |
returned VBD |
Spain NNP |
. . |
|
| John:NNP | 0.00 | 12.27 | 14.34 | 20.50 |
| did:VBD | 13.35 | 6.77 | 15.50 | 17.99 |
| n't:RB | 15.46 | 19.96 | 15.46 | 17.90 |
| visit:VB | 15.50 | 7.10 | 15.50 | 20.00 |
| us:PRP | 12.50 | 15.00 | 12.50 | 20.00 |
| after:IN | 20.50 | 20.00 | 20.50 | 20.00 |
| he:PRP | 12.50 | 15.00 | 12.50 | 20.00 |
| returned:VBD | 12.27 | 0.00 | 14.84 | 19.41 |
| Spain:NNP | 14.34 | 14.84 | 0.00 | 20.50 |
| .:. | 20.50 | 19.41 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -4.0000
Features matched: Structure.argsMismatch: args have different parents but same relations: text "John" <-nsubj-- "visit vs. hyp "John" <-nsubj-- "returned", which aligned to text "returned" args have different parents but same relations: text "." <-punct-- "visit vs. hyp "." <-punct-- "returned", which aligned to text "returned"
Hand-tuned score: -2.0000
Threshold: -3.3437
Txt: John didn't visit us after he returned from Spain.
Hyp: John did not return from Spain. (Don't know.)
| John NNP |
did VBD |
not RB |
return VB |
Spain NNP |
. . |
|
| John:NNP | 0.00 | 13.35 | 15.46 | 12.27 | 14.34 | 20.50 |
| did:VBD | 13.35 | 0.00 | 19.96 | 5.85 | 15.50 | 17.99 |
| n't:RB | 15.46 | 13.27 | 0.50 | 17.74 | 15.46 | 17.90 |
| visit:VB | 15.50 | 7.62 | 19.96 | 7.10 | 15.50 | 20.00 |
| us:PRP | 12.50 | 15.00 | 20.00 | 15.00 | 12.50 | 20.00 |
| after:IN | 20.50 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| he:PRP | 12.50 | 15.00 | 20.00 | 15.00 | 12.50 | 20.00 |
| returned:VBD | 12.27 | 6.77 | 19.96 | 0.31 | 14.84 | 19.41 |
| Spain:NNP | 14.34 | 15.50 | 15.46 | 14.84 | 0.00 | 20.50 |
| .:. | 20.50 | 17.99 | 20.00 | 19.01 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -7.8094
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "return": neg; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "John" <-nsubj-- "visit vs. hyp "John" <-nsubj-- "return", which aligned to text "returned" args have different parents but same relations: text "n't" <-neg-- "visit vs. hyp "not" <-neg-- "return", which aligned to text "returned" args have different parents but same relations: text "." <-punct-- "visit vs. hyp "." <-punct-- "return", which aligned to text "returned"
Hand-tuned score: -7.0500
Threshold: -3.3437
Txt: Hanssen, who sold FBI secrets to the Russians, could face the death penalty.
Hyp: Hanssen sold FBI secrets to the Russians. (Yes.)
| Hanssen NNP |
sold VBD |
FBI NNP |
secrets NNS |
the DT |
Russians NNPS |
. . |
|
| Hanssen:NNP | 0.00 | 15.46 | 14.96 | 10.46 | 20.50 | 14.96 | 20.50 |
| ,:, | 20.50 | 19.81 | 20.50 | 20.00 | 10.00 | 20.50 | 5.73 |
| who:WP | 12.50 | 15.00 | 12.50 | 12.00 | 20.00 | 12.50 | 20.00 |
| sold:VBD | 15.46 | 0.00 | 15.50 | 13.55 | 20.00 | 15.50 | 19.42 |
| FBI:NNP | 14.96 | 15.50 | 0.00 | 10.50 | 20.50 | 15.00 | 20.50 |
| secrets:NNS | 10.46 | 13.55 | 10.50 | 0.00 | 20.00 | 8.35 | 20.00 |
| the:DT | 20.50 | 20.00 | 20.50 | 20.00 | 0.00 | 20.50 | 10.00 |
| Russians:NNPS | 14.96 | 15.50 | 15.00 | 8.35 | 20.50 | 0.00 | 20.50 |
| ,:, | 20.50 | 19.81 | 20.50 | 20.00 | 10.00 | 20.50 | 5.73 |
| could:MD | 20.46 | 19.96 | 20.46 | 19.96 | 10.00 | 20.46 | 10.00 |
| face:VB | 15.46 | 8.07 | 15.50 | 12.44 | 20.00 | 13.35 | 17.99 |
| the:DT | 20.50 | 20.00 | 20.50 | 20.00 | 0.00 | 20.50 | 10.00 |
| death_penalty:NN | 10.46 | 12.10 | 10.50 | 8.72 | 20.00 | 9.84 | 20.00 |
| .:. | 20.50 | 19.42 | 20.50 | 20.00 | 10.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Structure.argsMismatch: args have different parents but same relations: text "Hanssen" <-nsubj-- "face vs. hyp "Hanssen" <-nsubj-- "sold", which aligned to text "sold" args have different parents but same relations: text "." <-punct-- "face vs. hyp "." <-punct-- "sold", which aligned to text "sold"
Hand-tuned score: -2.0000
Threshold: -3.3437
Txt: Hanssen, who sold FBI secrets to the Russians, could face the death penalty.
Hyp: Hanssen did not sell FBI secrets to the Russians. (Don't know.)
| Hanssen NNP |
did VBD |
not RB |
sell VB |
FBI NNP |
secrets NNS |
the DT |
Russians NNPS |
. . |
|
| Hanssen:NNP | 0.00 | 15.46 | 15.46 | 15.46 | 14.96 | 10.46 | 20.50 | 14.96 | 20.50 |
| ,:, | 20.50 | 19.80 | 20.00 | 19.98 | 20.50 | 20.00 | 10.00 | 20.50 | 5.73 |
| who:WP | 12.50 | 15.00 | 20.00 | 15.00 | 12.50 | 12.00 | 20.00 | 12.50 | 20.00 |
| sold:VBD | 15.46 | 7.69 | 19.96 | 0.50 | 15.50 | 13.55 | 20.00 | 15.50 | 19.42 |
| FBI:NNP | 14.96 | 15.50 | 15.46 | 15.50 | 0.00 | 10.50 | 20.50 | 15.00 | 20.50 |
| secrets:NNS | 10.46 | 12.85 | 14.96 | 14.18 | 10.50 | 0.00 | 20.00 | 8.35 | 20.00 |
| the:DT | 20.50 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 | 0.00 | 20.50 | 10.00 |
| Russians:NNPS | 14.96 | 13.35 | 15.46 | 15.50 | 15.00 | 8.35 | 20.50 | 0.00 | 20.50 |
| ,:, | 20.50 | 19.80 | 20.00 | 19.98 | 20.50 | 20.00 | 10.00 | 20.50 | 5.73 |
| could:MD | 20.46 | 17.84 | 19.96 | 19.96 | 20.46 | 19.96 | 10.00 | 20.46 | 10.00 |
| face:VB | 15.46 | 4.55 | 19.96 | 8.07 | 15.50 | 12.44 | 20.00 | 13.35 | 17.99 |
| the:DT | 20.50 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 | 0.00 | 20.50 | 10.00 |
| death_penalty:NN | 10.46 | 12.69 | 14.96 | 12.10 | 10.50 | 8.72 | 20.00 | 9.84 | 20.00 |
| .:. | 20.50 | 17.99 | 20.00 | 19.05 | 20.50 | 20.00 | 10.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -12.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "sell": neg; NullPunisher.aux: did; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "Hanssen" <-nsubj-- "face vs. hyp "Hanssen" <-nsubj-- "sell", which aligned to text "sold" args have different parents but same relations: text "." <-punct-- "face vs. hyp "." <-punct-- "sell", which aligned to text "sold"
Hand-tuned score: -8.0500
Threshold: -3.3437
Txt: The New York Times reported that Hanssen, who sold fbi secrets to the Russians, could face the death penalty.
Hyp: Hanssen sold fbi secrets to the Russians. (Yes.)
| Hanssen NNP |
sold VBD |
fbi NN |
secrets NNS |
the DT |
Russians NNPS |
. . |
|
| The:DT | 20.50 | 20.00 | 20.00 | 20.00 | 0.00 | 20.50 | 10.00 |
| New_York_Times:NNPS | 14.96 | 14.68 | 9.41 | 9.84 | 20.50 | 14.34 | 20.50 |
| reported:VBD | 15.46 | 7.69 | 12.96 | 11.05 | 20.00 | 13.35 | 19.71 |
| that:IN | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| Hanssen:NNP | 0.00 | 15.46 | 10.46 | 10.46 | 20.50 | 14.96 | 20.50 |
| ,:, | 20.50 | 19.81 | 20.00 | 20.00 | 10.00 | 20.50 | 5.73 |
| who:WP | 12.50 | 15.00 | 12.00 | 12.00 | 20.00 | 12.50 | 20.00 |
| sold:VBD | 15.46 | 0.00 | 14.77 | 13.55 | 20.00 | 15.50 | 19.42 |
| fbi:NN | 10.46 | 14.77 | 0.00 | 7.33 | 20.00 | 10.50 | 19.70 |
| secrets:NNS | 10.46 | 13.55 | 7.33 | 0.00 | 20.00 | 8.35 | 20.00 |
| the:DT | 20.50 | 20.00 | 20.00 | 20.00 | 0.00 | 20.50 | 10.00 |
| Russians:NNPS | 14.96 | 15.50 | 10.50 | 8.35 | 20.50 | 0.00 | 20.50 |
| ,:, | 20.50 | 19.81 | 20.00 | 20.00 | 10.00 | 20.50 | 5.73 |
| could:MD | 20.46 | 19.96 | 19.96 | 19.96 | 10.00 | 20.46 | 10.00 |
| face:VB | 15.46 | 8.07 | 15.00 | 12.44 | 20.00 | 13.35 | 17.99 |
| the:DT | 20.50 | 20.00 | 20.00 | 20.00 | 0.00 | 20.50 | 10.00 |
| death_penalty:NN | 10.46 | 12.10 | 10.00 | 8.72 | 20.00 | 9.84 | 20.00 |
| .:. | 20.50 | 19.42 | 19.70 | 20.00 | 10.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Structure.argsMismatch: args have different parents but same relations: text "Hanssen" <-nsubj-- "face vs. hyp "Hanssen" <-nsubj-- "sold", which aligned to text "sold" args have different parents but same relations: text "." <-punct-- "reported vs. hyp "." <-punct-- "sold", which aligned to text "sold"
Hand-tuned score: -2.0000
Threshold: -3.3437
Txt: The New York Times reported that Hanssen, who sold fbi secrets to the Russians, could face the death penalty.
Hyp: Hanssen did not sell fbi secrets to the Russians. (Don't know.)
| Hanssen NNP |
did VBD |
not RB |
sell VB |
fbi NN |
secrets NNS |
the DT |
Russians NNPS |
. . |
|
| The:DT | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.50 | 10.00 |
| New_York_Times:NNPS | 14.96 | 14.84 | 15.46 | 14.68 | 9.41 | 9.84 | 20.50 | 14.34 | 20.50 |
| reported:VBD | 15.46 | 7.69 | 19.96 | 7.69 | 12.96 | 11.05 | 20.00 | 13.35 | 19.71 |
| that:IN | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| Hanssen:NNP | 0.00 | 15.46 | 15.46 | 15.46 | 10.46 | 10.46 | 20.50 | 14.96 | 20.50 |
| ,:, | 20.50 | 19.80 | 20.00 | 19.98 | 20.00 | 20.00 | 10.00 | 20.50 | 5.73 |
| who:WP | 12.50 | 15.00 | 20.00 | 15.00 | 12.00 | 12.00 | 20.00 | 12.50 | 20.00 |
| sold:VBD | 15.46 | 7.69 | 19.96 | 0.50 | 14.77 | 13.55 | 20.00 | 15.50 | 19.42 |
| fbi:NN | 10.46 | 13.20 | 14.96 | 15.00 | 0.00 | 7.33 | 20.00 | 10.50 | 19.70 |
| secrets:NNS | 10.46 | 12.85 | 14.96 | 14.18 | 7.33 | 0.00 | 20.00 | 8.35 | 20.00 |
| the:DT | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.50 | 10.00 |
| Russians:NNPS | 14.96 | 13.35 | 15.46 | 15.50 | 10.50 | 8.35 | 20.50 | 0.00 | 20.50 |
| ,:, | 20.50 | 19.80 | 20.00 | 19.98 | 20.00 | 20.00 | 10.00 | 20.50 | 5.73 |
| could:MD | 20.46 | 17.84 | 19.96 | 19.96 | 19.96 | 19.96 | 10.00 | 20.46 | 10.00 |
| face:VB | 15.46 | 4.55 | 19.96 | 8.07 | 15.00 | 12.44 | 20.00 | 13.35 | 17.99 |
| the:DT | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.50 | 10.00 |
| death_penalty:NN | 10.46 | 12.69 | 14.96 | 12.10 | 10.00 | 8.72 | 20.00 | 9.84 | 20.00 |
| .:. | 20.50 | 17.99 | 20.00 | 19.05 | 19.70 | 20.00 | 10.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -12.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "sell": neg; NullPunisher.other: not; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "Hanssen" <-nsubj-- "face vs. hyp "Hanssen" <-nsubj-- "sell", which aligned to text "sold" args have different parents but same relations: text "." <-punct-- "reported vs. hyp "." <-punct-- "sell", which aligned to text "sold"
Hand-tuned score: -8.0500
Threshold: -3.3437
Txt: The New York Times reported that Hanssen sold fbi secrets to the Russians and could face the death penalty.
Hyp: Hanssen sold fbi secrets to the Russians. (Don't know.)
| Hanssen NNP |
sold VBD |
fbi NN |
secrets NNS |
the DT |
Russians NNPS |
. . |
|
| The:DT | 20.50 | 20.00 | 20.00 | 20.00 | 0.00 | 20.50 | 10.00 |
| New_York_Times:NNPS | 14.96 | 14.68 | 9.41 | 9.84 | 20.50 | 14.34 | 20.50 |
| reported:VBD | 15.46 | 7.69 | 12.96 | 11.05 | 20.00 | 13.35 | 19.71 |
| that:IN | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| Hanssen:NNP | 0.00 | 15.46 | 10.46 | 10.46 | 20.50 | 14.96 | 20.50 |
| sold:VBD | 15.46 | 0.00 | 14.77 | 13.55 | 20.00 | 15.50 | 19.42 |
| fbi:NN | 10.46 | 14.77 | 0.00 | 7.33 | 20.00 | 10.50 | 19.70 |
| secrets:NNS | 10.46 | 13.55 | 7.33 | 0.00 | 20.00 | 8.35 | 20.00 |
| the:DT | 20.50 | 20.00 | 20.00 | 20.00 | 0.00 | 20.50 | 10.00 |
| Russians:NNPS | 14.96 | 15.50 | 10.50 | 8.35 | 20.50 | 0.00 | 20.50 |
| could:MD | 20.46 | 19.96 | 19.96 | 19.96 | 10.00 | 20.46 | 10.00 |
| face:VB | 15.46 | 8.07 | 15.00 | 12.44 | 20.00 | 13.35 | 17.99 |
| the:DT | 20.50 | 20.00 | 20.00 | 20.00 | 0.00 | 20.50 | 10.00 |
| death_penalty:NN | 10.46 | 12.10 | 10.00 | 8.72 | 20.00 | 9.84 | 20.00 |
| .:. | 20.50 | 19.42 | 19.70 | 20.00 | 10.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.unknownPassage: non factive text -- unknown: reported-VBD; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "reported vs. hyp "." <-punct-- "sold", which aligned to text "sold"
Hand-tuned score: -1.5000
Threshold: -3.3437
Txt: The New York Times reported that Hanssen sold fbi secrets to the Russians and could face the death penalty.
Hyp: Hanssen did not sell fbi secrets to the Russians. (Don't know.)
| Hanssen NNP |
did VBD |
not RB |
sell VB |
fbi NN |
secrets NNS |
the DT |
Russians NNPS |
. . |
|
| The:DT | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.50 | 10.00 |
| New_York_Times:NNPS | 14.96 | 14.84 | 15.46 | 14.68 | 9.41 | 9.84 | 20.50 | 14.34 | 20.50 |
| reported:VBD | 15.46 | 7.69 | 19.96 | 7.69 | 12.96 | 11.05 | 20.00 | 13.35 | 19.71 |
| that:IN | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| Hanssen:NNP | 0.00 | 15.46 | 15.46 | 15.46 | 10.46 | 10.46 | 20.50 | 14.96 | 20.50 |
| sold:VBD | 15.46 | 7.69 | 19.96 | 0.50 | 14.77 | 13.55 | 20.00 | 15.50 | 19.42 |
| fbi:NN | 10.46 | 13.20 | 14.96 | 15.00 | 0.00 | 7.33 | 20.00 | 10.50 | 19.70 |
| secrets:NNS | 10.46 | 12.85 | 14.96 | 14.18 | 7.33 | 0.00 | 20.00 | 8.35 | 20.00 |
| the:DT | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.50 | 10.00 |
| Russians:NNPS | 14.96 | 13.35 | 15.46 | 15.50 | 10.50 | 8.35 | 20.50 | 0.00 | 20.50 |
| could:MD | 20.46 | 17.84 | 19.96 | 19.96 | 19.96 | 19.96 | 10.00 | 20.46 | 10.00 |
| face:VB | 15.46 | 4.55 | 19.96 | 8.07 | 15.00 | 12.44 | 20.00 | 13.35 | 17.99 |
| the:DT | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.50 | 10.00 |
| death_penalty:NN | 10.46 | 12.69 | 14.96 | 12.10 | 10.00 | 8.72 | 20.00 | 9.84 | 20.00 |
| .:. | 20.50 | 17.99 | 20.00 | 19.05 | 19.70 | 20.00 | 10.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -12.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.unknownPassage: non factive text -- unknown: reported-VBD; Polarity.hypNegMarker: "sell": neg; NullPunisher.other: not; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "reported vs. hyp "." <-punct-- "sell", which aligned to text "sold"
Hand-tuned score: -7.5500
Threshold: -3.3437
Txt: Bush said that it was Khan who sold centrifuges to North Korea.
Hyp: Centrifuges were sold to North Korea. (Yes.)
| Centrifuges NNS |
were VBD |
sold VBN |
North_Korea NNP |
. . |
|
| Bush:NNP | 9.45 | 14.84 | 12.74 | 12.11 | 20.50 |
| said:VBD | 15.00 | 6.24 | 7.80 | 15.50 | 18.58 |
| that:IN | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| it:PRP | 12.00 | 15.00 | 15.00 | 12.50 | 20.00 |
| was:VBD | 14.34 | 0.50 | 10.00 | 11.83 | 20.00 |
| Khan:NNP | 8.53 | 14.84 | 15.50 | 14.02 | 20.50 |
| who:WP | 12.00 | 15.00 | 15.00 | 12.50 | 20.00 |
| sold:VBD | 15.00 | 7.80 | 0.00 | 15.50 | 19.42 |
| centrifuges:NNS | 0.00 | 14.34 | 14.23 | 9.84 | 19.93 |
| North_Korea:NNP | 9.84 | 14.84 | 15.50 | 0.00 | 20.50 |
| .:. | 20.00 | 20.00 | 19.42 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -3.0000
Features matched: NullPunisher.aux: were; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "said vs. hyp "." <-punct-- "sold", which aligned to text "sold"
Hand-tuned score: -2.0500
Threshold: -3.3437
Txt: Bush said that it was Khan who sold centrifuges to North Korea.
Hyp: Centrifuges were not sold to North Korea. (Don't know.)
| Centrifuges NNS |
were VBD |
not RB |
sold VBN |
North_Korea NNP |
. . |
|
| Bush:NNP | 9.45 | 14.84 | 15.46 | 12.74 | 12.11 | 20.50 |
| said:VBD | 15.00 | 6.24 | 19.96 | 7.80 | 15.50 | 18.58 |
| that:IN | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| it:PRP | 12.00 | 15.00 | 20.00 | 15.00 | 12.50 | 20.00 |
| was:VBD | 14.34 | 0.50 | 19.96 | 10.00 | 11.83 | 20.00 |
| Khan:NNP | 8.53 | 14.84 | 15.46 | 15.50 | 14.02 | 20.50 |
| who:WP | 12.00 | 15.00 | 20.00 | 15.00 | 12.50 | 20.00 |
| sold:VBD | 15.00 | 7.80 | 19.96 | 0.00 | 15.50 | 19.42 |
| centrifuges:NNS | 0.00 | 14.34 | 14.96 | 14.23 | 9.84 | 19.93 |
| North_Korea:NNP | 9.84 | 14.84 | 15.46 | 15.50 | 0.00 | 20.50 |
| .:. | 20.00 | 20.00 | 20.00 | 19.42 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -12.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "sold": neg; NullPunisher.aux: were; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "said vs. hyp "." <-punct-- "sold", which aligned to text "sold"
Hand-tuned score: -8.0500
Threshold: -3.3437
Txt: Bush said that Khan sold centrifuges to North Korea.
Hyp: Centrifuges were sold to North Korea. (Don't know.)
| Centrifuges NNS |
were VBD |
sold VBN |
North_Korea NNP |
. . |
|
| Bush:NNP | 9.45 | 14.84 | 12.74 | 12.11 | 20.50 |
| said:VBD | 15.00 | 6.24 | 7.80 | 15.50 | 18.58 |
| that:IN | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| Khan:NNP | 8.53 | 14.84 | 15.50 | 14.02 | 20.50 |
| sold:VBD | 15.00 | 7.80 | 0.00 | 15.50 | 19.42 |
| centrifuges:NNS | 0.00 | 14.34 | 14.23 | 9.84 | 19.93 |
| North_Korea:NNP | 9.84 | 14.84 | 15.50 | 0.00 | 20.50 |
| .:. | 20.00 | 20.00 | 19.42 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -3.0000
Features matched: Factive.unknownPassage: non factive text -- unknown: said-VBD; NullPunisher.aux: were; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "said vs. hyp "." <-punct-- "sold", which aligned to text "sold"
Hand-tuned score: -1.5500
Threshold: -3.3437
Txt: Bush said that Khan sold centrifuges to North Korea.
Hyp: Centrifuges were not sold to North Korea. (Don't know.)
| Centrifuges NNS |
were VBD |
not RB |
sold VBN |
North_Korea NNP |
. . |
|
| Bush:NNP | 9.45 | 14.84 | 15.46 | 12.74 | 12.11 | 20.50 |
| said:VBD | 15.00 | 6.24 | 19.96 | 7.80 | 15.50 | 18.58 |
| that:IN | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| Khan:NNP | 8.53 | 14.84 | 15.46 | 15.50 | 14.02 | 20.50 |
| sold:VBD | 15.00 | 7.80 | 19.96 | 0.00 | 15.50 | 19.42 |
| centrifuges:NNS | 0.00 | 14.34 | 14.96 | 14.23 | 9.84 | 19.93 |
| North_Korea:NNP | 9.84 | 14.84 | 15.46 | 15.50 | 0.00 | 20.50 |
| .:. | 20.00 | 20.00 | 20.00 | 19.42 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -12.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.unknownPassage: non factive text -- unknown: said-VBD; Polarity.hypNegMarker: "sold": neg; NullPunisher.aux: were; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "said vs. hyp "." <-punct-- "sold", which aligned to text "sold"
Hand-tuned score: -7.5500
Threshold: -3.3437
Txt: What we found in Iraq was rusted shrapnel.
Hyp: We found something in Iraq. (Yes.)
| We PRP |
found VBD |
something NN |
Iraq NNP |
. . |
|
| What:WP | 10.00 | 15.00 | 12.00 | 12.50 | 20.00 |
| we:PRP | 0.00 | 15.00 | 12.00 | 12.50 | 20.00 |
| found:VBD | 15.00 | 0.00 | 15.00 | 15.50 | 19.57 |
| Iraq:NNP | 12.50 | 15.50 | 9.84 | 0.00 | 20.50 |
| was:VBD | 15.00 | 10.00 | 14.34 | 11.83 | 20.00 |
| rusted:VBN | 15.00 | 9.02 | 14.34 | 14.84 | 20.00 |
| shrapnel:JJ | 15.00 | 8.64 | 11.34 | 11.84 | 20.00 |
| .:. | 20.00 | 19.57 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -12.0000
Features matched: NullPunisher.other: something; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "rusted vs. hyp "." <-punct-- "found", which aligned to text "found"
Hand-tuned score: -3.0000
Threshold: -3.3437
Txt: What we found in Iraq was rusted shrapnel.
Hyp: We found nothing in Iraq. (Don't know.)
| We PRP |
found VBD |
nothing NN |
Iraq NNP |
. . |
|
| What:WP | 10.00 | 15.00 | 12.00 | 12.50 | 20.00 |
| we:PRP | 0.00 | 15.00 | 12.00 | 12.50 | 20.00 |
| found:VBD | 15.00 | 0.00 | 15.00 | 15.50 | 19.57 |
| Iraq:NNP | 12.50 | 15.50 | 9.84 | 0.00 | 20.50 |
| was:VBD | 15.00 | 10.00 | 14.34 | 11.83 | 20.00 |
| rusted:VBN | 15.00 | 9.02 | 14.34 | 14.84 | 20.00 |
| shrapnel:JJ | 15.00 | 8.64 | 11.34 | 11.84 | 20.00 |
| .:. | 20.00 | 19.57 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -12.0000
Features matched: NullPunisher.other: nothing; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "rusted vs. hyp "." <-punct-- "found", which aligned to text "found"
Hand-tuned score: -3.0000
Threshold: -3.3437
Txt: The fact that Bin Laden was in Tora Bora lead to the suspicion that the Afghan campaign was mismanaged.
Hyp: Bin Laden was in Tora Bora. (Yes.)
| Bin_Laden NNP |
was VBD |
Tora_Bora NNP |
. . |
|
| The:DT | 20.50 | 20.00 | 20.50 | 10.00 |
| fact:NN | 9.42 | 15.00 | 10.46 | 17.53 |
| that:IN | 20.50 | 20.00 | 20.50 | 20.00 |
| Bin_Laden:NNP | 0.00 | 14.84 | 14.96 | 20.50 |
| was:VBD | 14.84 | 0.00 | 15.46 | 20.00 |
| Tora_Bora:NNP | 14.96 | 15.46 | 0.00 | 20.50 |
| lead:VBP | 13.55 | 7.52 | 15.46 | 19.02 |
| the:DT | 20.50 | 20.00 | 20.50 | 10.00 |
| suspicion:NN | 9.84 | 15.00 | 10.46 | 19.99 |
| that:IN | 20.50 | 20.00 | 20.50 | 20.00 |
| the:DT | 20.50 | 20.00 | 20.50 | 10.00 |
| Afghan:JJ | 15.05 | 11.84 | 16.96 | 20.50 |
| campaign:NN | 10.50 | 15.00 | 10.46 | 19.85 |
| was:VBD | 14.84 | 0.00 | 15.46 | 20.00 |
| mismanaged:VBN | 15.50 | 10.00 | 15.46 | 20.00 |
| .:. | 20.50 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.factivePassage: factive entails : fact-NN; Location.mismatch: no clear info of matching: be(X, prep_in); Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "lead vs. hyp "." <-punct-- "was", which aligned to text "was"
Hand-tuned score: -3.0000
Threshold: -3.3437
Txt: The fact that Bin Laden was in Tora Bora lead to the suspicion that the Afghan campaign was mismanaged.
Hyp: Bin Laden was not in Tora Bora. (Don't know.)
| Bin_Laden NNP |
was VBD |
not RB |
Tora_Bora NNP |
. . |
|
| The:DT | 20.50 | 20.00 | 20.00 | 20.50 | 10.00 |
| fact:NN | 9.42 | 15.00 | 14.96 | 10.46 | 17.53 |
| that:IN | 20.50 | 20.00 | 20.00 | 20.50 | 20.00 |
| Bin_Laden:NNP | 0.00 | 14.84 | 15.46 | 14.96 | 20.50 |
| was:VBD | 14.84 | 0.00 | 19.96 | 15.46 | 20.00 |
| Tora_Bora:NNP | 14.96 | 15.46 | 15.46 | 0.00 | 20.50 |
| lead:VBP | 13.55 | 7.52 | 19.96 | 15.46 | 19.02 |
| the:DT | 20.50 | 20.00 | 20.00 | 20.50 | 10.00 |
| suspicion:NN | 9.84 | 15.00 | 14.96 | 10.46 | 19.99 |
| that:IN | 20.50 | 20.00 | 20.00 | 20.50 | 20.00 |
| the:DT | 20.50 | 20.00 | 20.00 | 20.50 | 10.00 |
| Afghan:JJ | 15.05 | 11.84 | 12.46 | 16.96 | 20.50 |
| campaign:NN | 10.50 | 15.00 | 14.96 | 10.46 | 19.85 |
| was:VBD | 14.84 | 0.00 | 19.96 | 15.46 | 20.00 |
| mismanaged:VBN | 15.50 | 10.00 | 19.96 | 15.46 | 20.00 |
| .:. | 20.50 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 9.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.factivePassage: factive entails : fact-NN; Polarity.hypNegMarker: "was": neg; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "lead vs. hyp "." <-punct-- "was", which aligned to text "was"
Hand-tuned score: -7.0000
Threshold: -3.3437
Txt: The fact that Bin Laden was in Tora Bora lead to the suspicion that the Afghan campaign was mismanaged.
Hyp: The Afghan campaign was mismanaged. (Don't know.)
| The DT |
Afghan JJ |
campaign NN |
was VBD |
mismanaged VBN |
. . |
|
| The:DT | 0.00 | 20.50 | 20.00 | 20.00 | 20.00 | 10.00 |
| fact:NN | 20.00 | 10.35 | 8.94 | 15.00 | 14.66 | 17.53 |
| that:IN | 20.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 |
| Bin_Laden:NNP | 20.50 | 15.05 | 10.50 | 14.84 | 15.50 | 20.50 |
| was:VBD | 20.00 | 11.84 | 15.00 | 0.00 | 10.00 | 20.00 |
| Tora_Bora:NNP | 20.50 | 16.96 | 10.46 | 15.46 | 15.46 | 20.50 |
| lead:VBP | 20.00 | 10.35 | 12.72 | 7.52 | 6.73 | 19.02 |
| the:DT | 0.00 | 20.50 | 20.00 | 20.00 | 20.00 | 10.00 |
| suspicion:NN | 20.00 | 11.19 | 8.40 | 15.00 | 12.80 | 19.99 |
| that:IN | 20.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 |
| the:DT | 0.00 | 20.50 | 20.00 | 20.00 | 20.00 | 10.00 |
| Afghan:JJ | 20.50 | 0.00 | 12.50 | 11.84 | 12.50 | 20.50 |
| campaign:NN | 20.00 | 12.50 | 0.00 | 15.00 | 14.12 | 19.85 |
| was:VBD | 20.00 | 11.84 | 15.00 | 0.00 | 10.00 | 20.00 |
| mismanaged:VBN | 20.00 | 12.50 | 14.12 | 10.00 | 0.00 | 20.00 |
| .:. | 10.00 | 20.50 | 19.85 | 20.00 | 20.00 | 0.00 |
| NO_WORD | 1.00 | 9.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.inPositiveEmbedding: embedded positive text; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "lead vs. hyp "." <-punct-- "mismanaged", which aligned to text "mismanaged"
Hand-tuned score: -1.0000
Threshold: -3.3437
Txt: The fact that Bin Laden was in Tora Bora lead to the suspicion that the Afghan campaign was mismanaged.
Hyp: The Afghan campaign was not mismanaged. (Don't know.)
| The DT |
Afghan JJ |
campaign NN |
was VBD |
not RB |
mismanaged VBN |
. . |
|
| The:DT | 0.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| fact:NN | 20.00 | 10.35 | 8.94 | 15.00 | 14.96 | 14.66 | 17.53 |
| that:IN | 20.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| Bin_Laden:NNP | 20.50 | 15.05 | 10.50 | 14.84 | 15.46 | 15.50 | 20.50 |
| was:VBD | 20.00 | 11.84 | 15.00 | 0.00 | 19.96 | 10.00 | 20.00 |
| Tora_Bora:NNP | 20.50 | 16.96 | 10.46 | 15.46 | 15.46 | 15.46 | 20.50 |
| lead:VBP | 20.00 | 10.35 | 12.72 | 7.52 | 19.96 | 6.73 | 19.02 |
| the:DT | 0.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| suspicion:NN | 20.00 | 11.19 | 8.40 | 15.00 | 14.96 | 12.80 | 19.99 |
| that:IN | 20.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| the:DT | 0.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| Afghan:JJ | 20.50 | 0.00 | 12.50 | 11.84 | 12.46 | 12.50 | 20.50 |
| campaign:NN | 20.00 | 12.50 | 0.00 | 15.00 | 14.96 | 14.12 | 19.85 |
| was:VBD | 20.00 | 11.84 | 15.00 | 0.00 | 19.96 | 10.00 | 20.00 |
| mismanaged:VBN | 20.00 | 12.50 | 14.12 | 10.00 | 19.96 | 0.00 | 20.00 |
| .:. | 10.00 | 20.50 | 19.85 | 20.00 | 20.00 | 20.00 | 0.00 |
| NO_WORD | 1.00 | 9.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.inPositiveEmbedding: embedded positive text; Polarity.hypNegMarker: "mismanaged": neg; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "lead vs. hyp "." <-punct-- "mismanaged", which aligned to text "mismanaged"
Hand-tuned score: -7.0000
Threshold: -3.3437
Txt: The paper concluded that the election had been rigged.
Hyp: The election was rigged. (Don't know.)
| The DT |
election NN |
was VBD |
rigged VBN |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| paper:NN | 20.00 | 8.35 | 14.34 | 12.12 | 18.64 |
| concluded:VBD | 20.00 | 15.00 | 10.00 | 10.00 | 19.24 |
| that:IN | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| election:NN | 20.00 | 0.00 | 15.00 | 11.41 | 20.00 |
| had:VBD | 20.00 | 15.00 | 9.34 | 5.73 | 20.00 |
| been:VBN | 20.00 | 15.00 | 0.50 | 7.80 | 20.00 |
| rigged:VBN | 20.00 | 11.41 | 9.34 | 0.00 | 20.00 |
| .:. | 10.00 | 20.00 | 20.00 | 20.00 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -2.5000
Features matched: Factive.unknownPassage: non factive text -- unknown: concluded-VBD; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "concluded vs. hyp "." <-punct-- "rigged", which aligned to text "rigged"
Hand-tuned score: -1.5000
Threshold: -3.3437
Txt: The paper concluded that the election had been rigged.
Hyp: The election was not rigged. (Don't know.)
| The DT |
election NN |
was VBD |
not RB |
rigged VBN |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| paper:NN | 20.00 | 8.35 | 14.34 | 14.96 | 12.12 | 18.64 |
| concluded:VBD | 20.00 | 15.00 | 10.00 | 19.96 | 10.00 | 19.24 |
| that:IN | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| election:NN | 20.00 | 0.00 | 15.00 | 14.96 | 11.41 | 20.00 |
| had:VBD | 20.00 | 15.00 | 9.34 | 19.96 | 5.73 | 20.00 |
| been:VBN | 20.00 | 15.00 | 0.50 | 19.96 | 7.80 | 20.00 |
| rigged:VBN | 20.00 | 11.41 | 9.34 | 19.96 | 0.00 | 20.00 |
| .:. | 10.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -11.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.unknownPassage: non factive text -- unknown: concluded-VBD; Polarity.hypNegMarker: "rigged": neg; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "concluded vs. hyp "." <-punct-- "rigged", which aligned to text "rigged"
Hand-tuned score: -7.5000
Threshold: -3.3437
Txt: Ames was, as the press reported, a successful spy.
Hyp: Ames was a successful spy. (Yes.)
| Ames NNP |
was VBD |
a DT |
successful JJ |
spy NN |
. . |
|
| Ames:NNP | 0.00 | 15.46 | 20.50 | 12.46 | 10.46 | 20.50 |
| was:VBD | 15.46 | 0.00 | 20.00 | 11.96 | 14.34 | 20.00 |
| ,:, | 20.50 | 20.00 | 10.00 | 19.58 | 20.00 | 5.73 |
| the:DT | 20.50 | 20.00 | 10.00 | 20.00 | 20.00 | 10.00 |
| press:NN | 10.46 | 14.34 | 20.00 | 11.96 | 8.50 | 19.26 |
| reported:VBN | 15.46 | 10.00 | 20.00 | 11.96 | 14.05 | 19.71 |
| ,:, | 20.50 | 20.00 | 10.00 | 19.58 | 20.00 | 5.73 |
| a:DT | 20.50 | 20.00 | 0.00 | 20.00 | 20.00 | 10.00 |
| successful:JJ | 12.46 | 11.96 | 20.00 | 0.00 | 11.78 | 18.38 |
| spy:NN | 10.46 | 14.34 | 20.00 | 11.78 | 0.00 | 20.00 |
| .:. | 20.50 | 20.00 | 10.00 | 18.38 | 20.00 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 1.00 | 9.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -5.0000
Features matched: Factive.unknownPassage: non factive text -- unknown: reported-VBN; NullPunisher.aux: was; Quant.contract: [a,a]; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "reported vs. hyp "." <-punct-- "spy", which aligned to text "spy" args have different parents, different relations: text "Ames" <-nsubjpass-- "reported" vs. hyp "Ames" <-nsubj-- "spy", which aligned to text "spy"
Hand-tuned score: -0.5500
Threshold: -3.3437
Txt: Ames was, as the press reported, a successful spy.
Hyp: Ames was not a successful spy. (Don't know.)
| Ames NNP |
was VBD |
not RB |
a DT |
successful JJ |
spy NN |
. . |
|
| Ames:NNP | 0.00 | 15.46 | 15.46 | 20.50 | 12.46 | 10.46 | 20.50 |
| was:VBD | 15.46 | 0.00 | 19.96 | 20.00 | 11.96 | 14.34 | 20.00 |
| ,:, | 20.50 | 20.00 | 20.00 | 10.00 | 19.58 | 20.00 | 5.73 |
| the:DT | 20.50 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 10.00 |
| press:NN | 10.46 | 14.34 | 14.96 | 20.00 | 11.96 | 8.50 | 19.26 |
| reported:VBN | 15.46 | 10.00 | 19.96 | 20.00 | 11.96 | 14.05 | 19.71 |
| ,:, | 20.50 | 20.00 | 20.00 | 10.00 | 19.58 | 20.00 | 5.73 |
| a:DT | 20.50 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 10.00 |
| successful:JJ | 12.46 | 11.96 | 11.96 | 20.00 | 0.00 | 11.78 | 18.38 |
| spy:NN | 10.46 | 14.34 | 14.96 | 20.00 | 11.78 | 0.00 | 20.00 |
| .:. | 20.50 | 20.00 | 20.00 | 10.00 | 18.38 | 20.00 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 1.00 | 9.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -14.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.unknownPassage: non factive text -- unknown: reported-VBN; Polarity.hypNegMarker: "spy": neg; NullPunisher.aux: was; NullPunisher.other: not; Quant.contract: [a,a]; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "reported vs. hyp "." <-punct-- "spy", which aligned to text "spy" args have different parents, different relations: text "Ames" <-nsubjpass-- "reported" vs. hyp "Ames" <-nsubj-- "spy", which aligned to text "spy"
Hand-tuned score: -6.5500
Threshold: -3.3437
Txt: The press reported that Ames was a successful spy.
Hyp: Ames was a successful spy. (Don't know.)
| Ames NNP |
was VBD |
a DT |
successful JJ |
spy NN |
. . |
|
| The:DT | 20.50 | 20.00 | 10.00 | 20.00 | 20.00 | 10.00 |
| press:NN | 10.46 | 14.34 | 20.00 | 11.96 | 8.50 | 19.26 |
| reported:VBD | 15.46 | 10.00 | 20.00 | 11.96 | 14.05 | 19.71 |
| that:IN | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| Ames:NNP | 0.00 | 15.46 | 20.50 | 12.46 | 10.46 | 20.50 |
| was:VBD | 15.46 | 0.00 | 20.00 | 11.96 | 14.34 | 20.00 |
| a:DT | 20.50 | 20.00 | 0.00 | 20.00 | 20.00 | 10.00 |
| successful:JJ | 12.46 | 11.96 | 20.00 | 0.00 | 11.78 | 18.38 |
| spy:NN | 10.46 | 14.34 | 20.00 | 11.78 | 0.00 | 20.00 |
| .:. | 20.50 | 20.00 | 10.00 | 18.38 | 20.00 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 1.00 | 9.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.unknownPassage: non factive text -- unknown: reported-VBD; Quant.contract: [a,a]; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "reported vs. hyp "." <-punct-- "spy", which aligned to text "spy"
Hand-tuned score: -0.5000
Threshold: -3.3437
Txt: The press reported that Ames was a successful spy.
Hyp: Ames was not a successful spy. (Don't know.)
| Ames NNP |
was VBD |
not RB |
a DT |
successful JJ |
spy NN |
. . |
|
| The:DT | 20.50 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 10.00 |
| press:NN | 10.46 | 14.34 | 14.96 | 20.00 | 11.96 | 8.50 | 19.26 |
| reported:VBD | 15.46 | 10.00 | 19.96 | 20.00 | 11.96 | 14.05 | 19.71 |
| that:IN | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| Ames:NNP | 0.00 | 15.46 | 15.46 | 20.50 | 12.46 | 10.46 | 20.50 |
| was:VBD | 15.46 | 0.00 | 19.96 | 20.00 | 11.96 | 14.34 | 20.00 |
| a:DT | 20.50 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 10.00 |
| successful:JJ | 12.46 | 11.96 | 11.96 | 20.00 | 0.00 | 11.78 | 18.38 |
| spy:NN | 10.46 | 14.34 | 14.96 | 20.00 | 11.78 | 0.00 | 20.00 |
| .:. | 20.50 | 20.00 | 20.00 | 10.00 | 18.38 | 20.00 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 1.00 | 9.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.unknownPassage: non factive text -- unknown: reported-VBD; Polarity.hypNegMarker: "spy": neg; NullPunisher.other: not; Quant.contract: [a,a]; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "reported vs. hyp "." <-punct-- "spy", which aligned to text "spy"
Hand-tuned score: -6.5000
Threshold: -3.3437
Txt: The US forgot that the Afghans speak several different languages.
Hyp: The Afghans speak several different languages. (Yes.)
| The DT |
Afghans NNPS |
speak VBP |
several JJ |
different JJ |
languages NNS |
. . |
|
| The:DT | 0.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| US:NNP | 20.50 | 14.34 | 15.50 | 12.46 | 12.46 | 10.50 | 20.50 |
| forgot:VBD | 20.00 | 15.50 | 9.43 | 11.96 | 11.96 | 15.00 | 19.56 |
| that:IN | 20.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| the:DT | 0.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| Afghans:NNPS | 20.50 | 0.00 | 15.50 | 12.46 | 12.46 | 5.87 | 20.50 |
| speak:VBP | 20.00 | 15.50 | 0.00 | 11.96 | 11.39 | 12.82 | 18.41 |
| several:JJ | 20.00 | 12.46 | 11.96 | 0.00 | 9.96 | 11.96 | 20.00 |
| different:JJ | 20.00 | 12.46 | 11.39 | 9.96 | 0.00 | 8.76 | 17.27 |
| languages:NNS | 20.00 | 5.87 | 12.82 | 11.96 | 8.76 | 0.00 | 19.86 |
| .:. | 10.00 | 20.50 | 18.41 | 20.00 | 17.27 | 19.86 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 9.00 | 9.00 | 10.00 | 10.00 |
Response: dontknow (INCORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.negativeStatement: non factive text : forgot-VBD; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "forgot vs. hyp "." <-punct-- "speak", which aligned to text "speak"
Hand-tuned score: -7.0000
Threshold: -3.3437
Txt: The US forgot that the Afghans speak several different languages.
Hyp: The Afghans do not speak several different languages. (Don't know.)
| The DT |
Afghans NNPS |
do VBP |
not RB |
speak VB |
several JJ |
different JJ |
languages NNS |
. . |
|
| The:DT | 0.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| US:NNP | 20.50 | 14.34 | 15.50 | 15.46 | 15.50 | 12.46 | 12.46 | 10.50 | 20.50 |
| forgot:VBD | 20.00 | 15.50 | 8.48 | 19.96 | 9.43 | 11.96 | 11.96 | 15.00 | 19.56 |
| that:IN | 20.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| the:DT | 0.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| Afghans:NNPS | 20.50 | 0.00 | 13.35 | 15.46 | 15.50 | 12.46 | 12.46 | 5.87 | 20.50 |
| speak:VBP | 20.00 | 15.50 | 5.23 | 19.96 | 0.00 | 11.96 | 11.39 | 12.82 | 18.41 |
| several:JJ | 20.00 | 12.46 | 11.96 | 11.96 | 11.96 | 0.00 | 9.96 | 11.96 | 20.00 |
| different:JJ | 20.00 | 12.46 | 10.27 | 11.96 | 11.39 | 9.96 | 0.00 | 8.76 | 17.27 |
| languages:NNS | 20.00 | 5.87 | 11.24 | 14.96 | 12.82 | 11.96 | 8.76 | 0.00 | 19.86 |
| .:. | 10.00 | 20.50 | 18.81 | 20.00 | 18.41 | 20.00 | 17.27 | 19.86 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 9.00 | 9.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -12.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.negativeStatement: non factive text : forgot-VBD; Polarity.hypNegMarker: "speak": neg; NullPunisher.aux: do; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "forgot vs. hyp "." <-punct-- "speak", which aligned to text "speak"
Hand-tuned score: -13.0500
Threshold: -3.3437
Txt: Bush realized that the US Army had to be transformed to meet new threats.
Hyp: The US Army had to be transformed to meet new threats. (Yes.)
| The DT |
US_Army NNP |
had VBD |
to TO |
be VB |
transformed VBN |
to TO |
meet VB |
new JJ |
threats NNS |
. . |
|
| Bush:NNP | 20.00 | 8.63 | 13.05 | 20.00 | 14.34 | 15.00 | 20.00 | 12.02 | 11.96 | 8.05 | 20.00 |
| realized:VBD | 20.00 | 15.50 | 6.84 | 20.00 | 10.00 | 7.20 | 20.00 | 7.24 | 11.96 | 14.16 | 17.47 |
| that:IN | 20.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| the:DT | 0.00 | 20.50 | 20.00 | 10.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| US_Army:NNP | 20.50 | 0.00 | 15.17 | 20.50 | 15.17 | 15.50 | 20.50 | 15.50 | 12.46 | 10.17 | 20.50 |
| had:VBD | 20.00 | 15.17 | 0.00 | 20.00 | 7.80 | 7.61 | 20.00 | 3.72 | 11.96 | 13.05 | 20.00 |
| to:TO | 10.00 | 20.50 | 20.00 | 0.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| be:VB | 20.00 | 15.17 | 7.80 | 20.00 | 0.00 | 10.00 | 20.00 | 2.00 | 11.96 | 14.34 | 20.00 |
| transformed:VBN | 20.00 | 15.50 | 7.61 | 20.00 | 10.00 | 0.00 | 20.00 | 7.61 | 10.33 | 14.61 | 20.00 |
| to:TO | 10.00 | 20.50 | 20.00 | 0.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| meet:VB | 20.00 | 15.50 | 3.72 | 20.00 | 6.07 | 7.61 | 20.00 | 0.00 | 11.24 | 14.50 | 19.52 |
| new:JJ | 20.00 | 12.46 | 11.96 | 20.00 | 11.96 | 10.33 | 20.00 | 11.24 | 0.00 | 11.96 | 20.00 |
| threats:NNS | 20.00 | 10.17 | 13.05 | 20.00 | 14.34 | 14.61 | 20.00 | 14.50 | 11.96 | 0.00 | 19.03 |
| .:. | 10.00 | 20.50 | 20.00 | 10.00 | 20.00 | 20.00 | 10.00 | 19.52 | 20.00 | 19.03 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 | 9.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.factivePassage: factive entails : realized-VBD; Modal.weakYes: necessary -> necessary; Structure.argsMismatch: args have different parents but same relations: text "US_Army" <-xsubj-- "transformed vs. hyp "US_Army" <-nsubj-- "had", which aligned to text "had" args have different parents but same relations: text "." <-punct-- "realized vs. hyp "." <-punct-- "had", which aligned to text "had"
Hand-tuned score: 0.0000
Threshold: -3.3437
Txt: Bush realized that the US Army had to be transformed to meet new threats.
Hyp: The US Army did not have to be transformed to meet new threats. (Don't know.)
| The DT |
US_Army NNP |
did VBD |
not RB |
have VB |
to TO |
be VB |
transformed VBN |
to TO |
meet VB |
new JJ |
threats NNS |
. . |
|
| Bush:NNP | 20.00 | 8.63 | 15.00 | 14.96 | 13.05 | 20.00 | 14.34 | 15.00 | 20.00 | 12.02 | 11.96 | 8.05 | 20.00 |
| realized:VBD | 20.00 | 15.50 | 7.32 | 19.96 | 6.84 | 20.00 | 10.00 | 7.20 | 20.00 | 7.24 | 11.96 | 14.16 | 17.47 |
| that:IN | 20.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| the:DT | 0.00 | 20.50 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| US_Army:NNP | 20.50 | 0.00 | 15.50 | 15.46 | 15.17 | 20.50 | 15.17 | 15.50 | 20.50 | 15.50 | 12.46 | 10.17 | 20.50 |
| had:VBD | 20.00 | 15.17 | 7.32 | 19.96 | 0.50 | 20.00 | 7.80 | 7.61 | 20.00 | 3.72 | 11.96 | 13.05 | 20.00 |
| to:TO | 10.00 | 20.50 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| be:VB | 20.00 | 15.17 | 6.07 | 19.96 | 7.80 | 20.00 | 0.00 | 10.00 | 20.00 | 2.00 | 11.96 | 14.34 | 20.00 |
| transformed:VBN | 20.00 | 15.50 | 7.62 | 19.96 | 7.61 | 20.00 | 10.00 | 0.00 | 20.00 | 7.61 | 10.33 | 14.61 | 20.00 |
| to:TO | 10.00 | 20.50 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 0.00 | 20.00 | 20.00 | 20.00 | 10.00 |
| meet:VB | 20.00 | 15.50 | 5.11 | 19.96 | 3.72 | 20.00 | 6.07 | 7.61 | 20.00 | 0.00 | 11.24 | 14.50 | 19.52 |
| new:JJ | 20.00 | 12.46 | 11.96 | 11.96 | 11.96 | 20.00 | 11.96 | 10.33 | 20.00 | 11.24 | 0.00 | 11.96 | 20.00 |
| threats:NNS | 20.00 | 10.17 | 12.85 | 14.96 | 13.05 | 20.00 | 14.34 | 14.61 | 20.00 | 14.50 | 11.96 | 0.00 | 19.03 |
| .:. | 10.00 | 20.50 | 17.99 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 10.00 | 19.52 | 20.00 | 19.03 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 | 9.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -12.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.factivePassage: factive entails : realized-VBD; Modal.no: necessary -> not necessary; Polarity.hypNegMarker: "have": neg; NullPunisher.other: not; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "US_Army" <-xsubj-- "transformed vs. hyp "US_Army" <-nsubj-- "have", which aligned to text "had" args have different parents but same relations: text "." <-punct-- "realized vs. hyp "." <-punct-- "have", which aligned to text "had"
Hand-tuned score: -13.0500
Threshold: -3.3437
Txt: Bush didn't realize that Afghanistan is land-locked.
Hyp: Afghanistan is land-locked. (Yes.)
| Afghanistan NNP |
is VBZ |
land NN |
- : |
locked VBN |
. . |
|
| Bush:NNP | 7.61 | 14.34 | 7.11 | 20.00 | 8.87 | 20.00 |
| did:VBD | 15.50 | 6.07 | 12.62 | 20.00 | 7.32 | 17.99 |
| n't:RB | 15.46 | 19.96 | 14.96 | 20.00 | 19.96 | 17.90 |
| realize:VB | 15.50 | 10.00 | 14.42 | 20.00 | 7.32 | 17.34 |
| that:IN | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| Afghanistan:NNP | 0.00 | 14.84 | 5.84 | 20.50 | 14.84 | 20.50 |
| is:VBZ | 14.84 | 0.00 | 14.34 | 20.00 | 9.34 | 20.00 |
| land:NN | 2.50 | 14.34 | 0.00 | 20.00 | 12.65 | 18.77 |
| -:: | 20.50 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| locked:VBN | 14.84 | 9.34 | 12.65 | 20.00 | 0.00 | 19.17 |
| .:. | 20.50 | 20.00 | 18.77 | 10.00 | 19.17 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.factivePassage: factive entails : realize-VB; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "realize vs. hyp "." <-punct-- "land", which aligned to text "land"
Hand-tuned score: -1.0000
Threshold: -3.3437
Txt: Bush didn't realize that Afghanistan is land-locked.
Hyp: Afghanistan is not land-locked. (Don't know.)
| Afghanistan NNP |
is VBZ |
not RB |
land NN |
- : |
locked VBN |
. . |
|
| Bush:NNP | 7.61 | 14.34 | 14.96 | 7.11 | 20.00 | 8.87 | 20.00 |
| did:VBD | 15.50 | 6.07 | 19.96 | 12.62 | 20.00 | 7.32 | 17.99 |
| n't:RB | 15.46 | 19.96 | 0.50 | 14.96 | 20.00 | 19.96 | 17.90 |
| realize:VB | 15.50 | 10.00 | 19.96 | 14.42 | 20.00 | 7.32 | 17.34 |
| that:IN | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| Afghanistan:NNP | 0.00 | 14.84 | 15.46 | 5.84 | 20.50 | 14.84 | 20.50 |
| is:VBZ | 14.84 | 0.00 | 19.96 | 14.34 | 20.00 | 9.34 | 20.00 |
| land:NN | 2.50 | 14.34 | 14.96 | 0.00 | 20.00 | 12.65 | 18.77 |
| -:: | 20.50 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| locked:VBN | 14.84 | 9.34 | 19.96 | 12.65 | 20.00 | 0.00 | 19.17 |
| .:. | 20.50 | 20.00 | 20.00 | 18.77 | 10.00 | 19.17 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -8.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.factivePassage: factive entails : realize-VB; Polarity.hypNegMarker: "land": neg; Structure.argsMismatch: args have different parents but same relations: text "n't" <-neg-- "realize vs. hyp "not" <-neg-- "land", which aligned to text "land" args have different parents but same relations: text "." <-punct-- "realize vs. hyp "." <-punct-- "land", which aligned to text "land" text "locked" is amod of "land" while hyp "locked" is partmod of "land" which aligned to text "land"
Hand-tuned score: -6.0000
Threshold: -3.3437
Txt: There is a belief that the US will invade Syria.
Hyp: The US will invade Syria. (Don't know.)
| The DT |
US NNP |
will MD |
invade VB |
Syria NNP |
. . |
|
| There:EX | 10.00 | 20.50 | 10.00 | 20.00 | 20.50 | 10.00 |
| is:VBZ | 20.00 | 14.84 | 20.00 | 6.70 | 14.84 | 20.00 |
| a:DT | 10.00 | 20.50 | 10.00 | 20.00 | 20.50 | 10.00 |
| belief:NN | 20.00 | 10.50 | 17.47 | 14.83 | 10.50 | 18.82 |
| that:IN | 20.00 | 20.50 | 20.00 | 20.00 | 20.50 | 20.00 |
| the:DT | 0.00 | 20.50 | 10.00 | 20.00 | 20.50 | 10.00 |
| US:NNP | 20.50 | 0.00 | 20.50 | 15.50 | 5.34 | 20.50 |
| will:MD | 10.00 | 20.50 | 0.00 | 20.00 | 20.50 | 10.00 |
| invade:VB | 20.00 | 15.50 | 20.00 | 0.00 | 15.50 | 20.00 |
| Syria:NNP | 20.50 | 5.34 | 20.50 | 15.50 | 0.00 | 20.50 |
| .:. | 10.00 | 20.50 | 10.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.unknownPassage: non factive text -- unknown: belief-NN; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "is vs. hyp "." <-punct-- "invade", which aligned to text "invade"
Hand-tuned score: -1.5000
Threshold: -3.3437
Txt: There is a belief that the US will invade Syria.
Hyp: The US will not invade Syria. (Don't know.)
| The DT |
US NNP |
will MD |
not RB |
invade VB |
Syria NNP |
. . |
|
| There:EX | 10.00 | 20.50 | 10.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| is:VBZ | 20.00 | 14.84 | 20.00 | 19.96 | 6.70 | 14.84 | 20.00 |
| a:DT | 10.00 | 20.50 | 10.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| belief:NN | 20.00 | 10.50 | 17.47 | 14.96 | 14.83 | 10.50 | 18.82 |
| that:IN | 20.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| the:DT | 0.00 | 20.50 | 10.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| US:NNP | 20.50 | 0.00 | 20.50 | 15.46 | 15.50 | 5.34 | 20.50 |
| will:MD | 10.00 | 20.50 | 0.00 | 19.96 | 20.00 | 20.50 | 10.00 |
| invade:VB | 20.00 | 15.50 | 20.00 | 19.96 | 0.00 | 15.50 | 20.00 |
| Syria:NNP | 20.50 | 5.34 | 20.50 | 15.46 | 15.50 | 0.00 | 20.50 |
| .:. | 10.00 | 20.50 | 10.00 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 9.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.unknownPassage: non factive text -- unknown: belief-NN; Polarity.hypNegMarker: "invade": neg; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "is vs. hyp "." <-punct-- "invade", which aligned to text "invade"
Hand-tuned score: -7.5000
Threshold: -3.3437
Txt: It is not surprising that Bush has the lead in Ohio.
Hyp: Bush has the lead in Ohio. (Yes.)
| Bush NNP |
has VBZ |
the DT |
lead NN |
Ohio NNP |
. . |
|
| It:PRP | 12.50 | 15.00 | 20.00 | 12.00 | 12.50 | 20.00 |
| is:VBZ | 14.84 | 8.64 | 20.00 | 9.39 | 14.84 | 20.00 |
| not:RB | 15.46 | 19.96 | 20.00 | 14.96 | 15.46 | 20.00 |
| surprising:JJ | 12.50 | 12.00 | 20.00 | 9.70 | 12.50 | 19.84 |
| that:IN | 20.50 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| Bush:NNP | 0.00 | 13.02 | 20.50 | 8.02 | 12.11 | 20.50 |
| has:VBZ | 13.02 | 0.00 | 20.00 | 7.65 | 13.02 | 20.00 |
| the:DT | 20.50 | 20.00 | 0.00 | 20.00 | 20.50 | 10.00 |
| lead:NN | 8.02 | 7.65 | 20.00 | 0.00 | 8.02 | 19.02 |
| Ohio:NNP | 12.11 | 13.02 | 20.50 | 8.02 | 0.00 | 20.50 |
| .:. | 20.50 | 20.00 | 10.00 | 19.02 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.factivePassage: factive entails : surprising-JJ; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "surprising vs. hyp "." <-punct-- "has", which aligned to text "has"
Hand-tuned score: -1.0000
Threshold: -3.3437
Txt: It is not surprising that Bush has the lead in Ohio.
Hyp: Bush does not have the lead in Ohio. (Don't know.)
| Bush NNP |
does VBZ |
not RB |
have VB |
the DT |
lead NN |
Ohio NNP |
. . |
|
| It:PRP | 12.50 | 15.00 | 20.00 | 15.00 | 20.00 | 12.00 | 12.50 | 20.00 |
| is:VBZ | 14.84 | 9.34 | 19.96 | 7.80 | 20.00 | 9.39 | 14.84 | 20.00 |
| not:RB | 15.46 | 19.96 | 0.00 | 19.96 | 20.00 | 14.96 | 15.46 | 20.00 |
| surprising:JJ | 12.50 | 11.87 | 11.96 | 10.07 | 20.00 | 9.70 | 12.50 | 19.84 |
| that:IN | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| Bush:NNP | 0.00 | 13.61 | 15.46 | 13.55 | 20.50 | 8.02 | 12.11 | 20.50 |
| has:VBZ | 13.02 | 9.34 | 19.96 | 0.50 | 20.00 | 7.65 | 13.02 | 20.00 |
| the:DT | 20.50 | 18.65 | 20.00 | 20.00 | 0.00 | 20.00 | 20.50 | 10.00 |
| lead:NN | 8.02 | 13.11 | 14.96 | 10.68 | 20.00 | 0.00 | 8.02 | 19.02 |
| Ohio:NNP | 12.11 | 14.84 | 15.46 | 14.84 | 20.50 | 8.02 | 0.00 | 20.50 |
| .:. | 20.50 | 20.00 | 20.00 | 20.00 | 10.00 | 19.02 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -5.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.factivePassage: factive entails : surprising-JJ; Polarity.hypNegMarker: "have": neg; NullPunisher.aux: does; Structure.argsMismatch: args have different parents but same relations: text "not" <-neg-- "surprising vs. hyp "not" <-neg-- "have", which aligned to text "has" args have different parents but same relations: text "." <-punct-- "surprising vs. hyp "." <-punct-- "have", which aligned to text "has"
Hand-tuned score: -6.0500
Threshold: -3.3437
Txt: It is not likely that Bush has the lead in Ohio.
Hyp: Bush has the lead in Ohio. (Don't know.)
| Bush NNP |
has VBZ |
the DT |
lead NN |
Ohio NNP |
. . |
|
| It:PRP | 12.50 | 15.00 | 20.00 | 12.00 | 12.50 | 20.00 |
| is:VBZ | 14.84 | 8.64 | 20.00 | 9.39 | 14.84 | 20.00 |
| not:RB | 15.46 | 19.96 | 20.00 | 14.96 | 15.46 | 20.00 |
| likely:JJ | 12.46 | 11.96 | 20.00 | 10.90 | 12.46 | 19.92 |
| that:IN | 20.50 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| Bush:NNP | 0.00 | 13.02 | 20.50 | 8.02 | 12.11 | 20.50 |
| has:VBZ | 13.02 | 0.00 | 20.00 | 7.65 | 13.02 | 20.00 |
| the:DT | 20.50 | 20.00 | 0.00 | 20.00 | 20.50 | 10.00 |
| lead:NN | 8.02 | 7.65 | 20.00 | 0.00 | 8.02 | 19.02 |
| Ohio:NNP | 12.11 | 13.02 | 20.50 | 8.02 | 0.00 | 20.50 |
| .:. | 20.50 | 20.00 | 10.00 | 19.02 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.unknownPassage: non factive text -- unknown: likely-JJ; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "likely vs. hyp "." <-punct-- "has", which aligned to text "has"
Hand-tuned score: -1.5000
Threshold: -3.3437
Txt: It is not likely that Bush has the lead in Ohio.
Hyp: Bush does not have the lead in Ohio. (Don't know.)
| Bush NNP |
does VBZ |
not RB |
have VB |
the DT |
lead NN |
Ohio NNP |
. . |
|
| It:PRP | 12.50 | 15.00 | 20.00 | 15.00 | 20.00 | 12.00 | 12.50 | 20.00 |
| is:VBZ | 14.84 | 9.34 | 19.96 | 7.80 | 20.00 | 9.39 | 14.84 | 20.00 |
| not:RB | 15.46 | 19.96 | 0.00 | 19.96 | 20.00 | 14.96 | 15.46 | 20.00 |
| likely:JJ | 12.46 | 9.43 | 11.96 | 11.96 | 20.00 | 10.90 | 12.46 | 19.92 |
| that:IN | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| Bush:NNP | 0.00 | 13.61 | 15.46 | 13.55 | 20.50 | 8.02 | 12.11 | 20.50 |
| has:VBZ | 13.02 | 9.34 | 19.96 | 0.50 | 20.00 | 7.65 | 13.02 | 20.00 |
| the:DT | 20.50 | 18.65 | 20.00 | 20.00 | 0.00 | 20.00 | 20.50 | 10.00 |
| lead:NN | 8.02 | 13.11 | 14.96 | 10.68 | 20.00 | 0.00 | 8.02 | 19.02 |
| Ohio:NNP | 12.11 | 14.84 | 15.46 | 14.84 | 20.50 | 8.02 | 0.00 | 20.50 |
| .:. | 20.50 | 20.00 | 20.00 | 20.00 | 10.00 | 19.02 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -5.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.unknownPassage: non factive text -- unknown: likely-JJ; Polarity.hypNegMarker: "have": neg; NullPunisher.aux: does; Structure.argsMismatch: args have different parents but same relations: text "not" <-neg-- "likely vs. hyp "not" <-neg-- "have", which aligned to text "has" args have different parents but same relations: text "." <-punct-- "likely vs. hyp "." <-punct-- "have", which aligned to text "has"
Hand-tuned score: -6.5500
Threshold: -3.3437
Txt: Kerry knew that Edwards would accept the nomination.
Hyp: Kerry knew whether Edwards would accept the nomination. (Yes.)
| Kerry NNP |
knew VBD |
whether IN |
Edwards NNP |
would MD |
accept VB |
the DT |
nomination NN |
. . |
|
| Kerry:NNP | 0.00 | 15.46 | 20.50 | 9.07 | 20.46 | 15.46 | 20.50 | 10.46 | 20.50 |
| knew:VBD | 15.46 | 0.00 | 20.00 | 15.50 | 19.96 | 5.35 | 20.00 | 14.92 | 17.19 |
| that:IN | 20.50 | 20.00 | 10.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| Edwards:NNP | 9.07 | 15.50 | 20.50 | 0.00 | 20.46 | 15.50 | 20.50 | 10.50 | 20.50 |
| would:MD | 20.46 | 19.96 | 20.00 | 20.46 | 0.00 | 19.96 | 10.00 | 19.96 | 10.00 |
| accept:VB | 15.46 | 5.35 | 20.00 | 15.50 | 19.96 | 0.00 | 20.00 | 15.00 | 18.61 |
| the:DT | 20.50 | 20.00 | 20.00 | 20.50 | 10.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| nomination:NN | 10.46 | 14.92 | 20.00 | 10.50 | 19.96 | 15.00 | 20.00 | 0.00 | 19.99 |
| .:. | 20.50 | 17.19 | 20.00 | 20.50 | 10.00 | 18.61 | 10.00 | 19.99 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -1.0000
Features matched: NullPunisher.functionWord: whether
Hand-tuned score: 0.9000
Threshold: -3.3437
Txt: Kerry knew that Edwards would accept the nomination.
Hyp: Kerry did not know whether Edwards would accept the nomination. (Don't know.)
| Kerry NNP |
did VBD |
not RB |
know VB |
whether IN |
Edwards NNP |
would MD |
accept VB |
the DT |
nomination NN |
. . |
|
| Kerry:NNP | 0.00 | 15.46 | 15.46 | 15.46 | 20.50 | 9.07 | 20.46 | 15.46 | 20.50 | 10.46 | 20.50 |
| knew:VBD | 15.46 | 4.04 | 19.96 | 0.50 | 20.00 | 15.50 | 19.96 | 5.35 | 20.00 | 14.92 | 17.19 |
| that:IN | 20.50 | 20.00 | 20.00 | 20.00 | 10.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 |
| Edwards:NNP | 9.07 | 15.50 | 15.46 | 15.50 | 20.50 | 0.00 | 20.46 | 15.50 | 20.50 | 10.50 | 20.50 |
| would:MD | 20.46 | 18.57 | 19.96 | 19.96 | 20.00 | 20.46 | 0.00 | 19.96 | 10.00 | 19.96 | 10.00 |
| accept:VB | 15.46 | 7.47 | 19.96 | 1.00 | 20.00 | 15.50 | 19.96 | 0.00 | 20.00 | 15.00 | 18.61 |
| the:DT | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| nomination:NN | 10.46 | 14.36 | 14.96 | 14.84 | 20.00 | 10.50 | 19.96 | 15.00 | 20.00 | 0.00 | 19.99 |
| .:. | 20.50 | 17.99 | 20.00 | 17.93 | 20.00 | 20.50 | 10.00 | 18.61 | 10.00 | 19.99 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -11.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "know": neg; NullPunisher.other: not; NullPunisher.functionWord: whether; NullPunisher.aux: did
Hand-tuned score: -5.1500
Threshold: -3.3437
Txt: Tom knows that Naples is in Campania.
Hyp: Tom knows where Naples is. (Yes.)
| Tom NNP |
knows VBZ |
where WRB |
Naples NNP |
is VBZ |
. . |
|
| Tom:NNP | 0.00 | 15.00 | 19.96 | 9.84 | 14.34 | 20.00 |
| knows:VBZ | 15.00 | 0.00 | 19.96 | 15.50 | 8.07 | 20.00 |
| that:IN | 20.00 | 20.00 | 18.69 | 20.50 | 20.00 | 20.00 |
| Naples:NNP | 9.84 | 15.50 | 20.46 | 0.00 | 14.84 | 20.50 |
| is:VBZ | 14.34 | 8.07 | 19.96 | 14.84 | 0.00 | 20.00 |
| Campania:NNP | 9.84 | 15.50 | 20.46 | 2.00 | 14.84 | 20.50 |
| .:. | 20.00 | 20.00 | 10.00 | 20.50 | 20.00 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -10.0000
Features matched: Adjunct.addPosCxt: hyp added where[where-WRB]; Adjunct.dropPosCxt: text adjunct "Campania" of "is" dropped on aligned hyp word "is"; NullPunisher.other: where
Hand-tuned score: -0.5000
Threshold: -3.3437
Txt: Tom knows that Naples is in Campania.
Hyp: Tom does not know where Naples is. (Don't know.)
| Tom NNP |
does VBZ |
not RB |
know VB |
where WRB |
Naples NNP |
is VBZ |
. . |
|
| Tom:NNP | 0.00 | 10.17 | 14.96 | 15.00 | 19.96 | 9.84 | 14.34 | 20.00 |
| knows:VBZ | 15.00 | 2.16 | 19.96 | 0.50 | 19.96 | 15.50 | 8.07 | 20.00 |
| that:IN | 20.00 | 20.00 | 20.00 | 20.00 | 18.69 | 20.50 | 20.00 | 20.00 |
| Naples:NNP | 9.84 | 14.84 | 15.46 | 15.50 | 20.46 | 0.00 | 14.84 | 20.50 |
| is:VBZ | 14.34 | 9.34 | 19.96 | 8.07 | 19.96 | 14.84 | 0.00 | 20.00 |
| Campania:NNP | 9.84 | 14.84 | 15.46 | 15.50 | 20.46 | 2.00 | 14.84 | 20.50 |
| .:. | 20.00 | 20.00 | 20.00 | 17.93 | 10.00 | 20.50 | 20.00 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -20.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "know": neg; NullPunisher.aux: does; NullPunisher.other: not; NullPunisher.other: where
Hand-tuned score: -6.0500
Threshold: -3.3437
Txt: We met in September during the feast.
Hyp: The feast took place in September. (Yes.)
| The DT |
feast NN |
took_place VBD |
September NNP |
. . |
|
| We:PRP | 20.00 | 12.00 | 15.00 | 12.50 | 20.00 |
| met:VBD | 20.00 | 10.15 | 8.51 | 15.50 | 19.15 |
| September:NNP | 20.50 | 10.50 | 14.84 | 0.00 | 20.50 |
| the:DT | 0.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| feast:NN | 20.00 | 0.00 | 13.28 | 10.50 | 20.00 |
| .:. | 10.00 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -10.5097
Features matched: Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 09/01/1000; RootEntailment.poorlyAlignedRoot: "took_place" aligned badly to "met"; Structure.relMismatch: text "feast" is prep_during of "met" while hyp "feast" is nsubj of "took_place" which aligned to text "met"
Hand-tuned score: -1.0000
Threshold: -3.3437
Txt: We met in September during the feast.
Hyp: The feast did not take place in September. (Don't know.)
| The DT |
feast NN |
did VBD |
not RB |
take_place VB |
September NNP |
. . |
|
| We:PRP | 20.00 | 12.00 | 15.00 | 20.00 | 15.00 | 12.50 | 20.00 |
| met:VBD | 20.00 | 10.15 | 5.11 | 19.96 | 8.51 | 15.50 | 19.15 |
| September:NNP | 20.50 | 10.50 | 14.19 | 15.46 | 14.84 | 0.00 | 20.50 |
| the:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| feast:NN | 20.00 | 0.00 | 9.07 | 14.96 | 13.28 | 10.50 | 20.00 |
| .:. | 10.00 | 20.00 | 17.99 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -20.5097
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 09/01/1000; Polarity.hypNegMarker: "take_place": neg; NullPunisher.other: not; NullPunisher.aux: did; RootEntailment.poorlyAlignedRoot: "take_place" aligned badly to "met"; Structure.relMismatch: text "feast" is prep_during of "met" while hyp "feast" is nsubj of "take_place" which aligned to text "met"
Hand-tuned score: -7.0500
Threshold: -3.3437
Txt: It is false that Bin Laden was seen in Tora Bora.
Hyp: Bin Laden was seen in Tora Bora. (Don't know.)
| Bin_Laden NNP |
was VBD |
seen VBN |
Tora_Bora NNP |
. . |
|
| It:PRP | 12.50 | 15.00 | 15.00 | 12.50 | 20.00 |
| is:VBZ | 14.84 | 0.50 | 7.74 | 15.46 | 20.00 |
| false:JJ | 12.46 | 11.96 | 11.96 | 12.46 | 18.76 |
| that:IN | 20.50 | 20.00 | 20.00 | 20.50 | 20.00 |
| Bin_Laden:NNP | 0.00 | 14.84 | 14.84 | 14.96 | 20.50 |
| was:VBD | 14.84 | 0.00 | 7.11 | 15.46 | 20.00 |
| seen:VBN | 14.84 | 7.11 | 0.00 | 15.46 | 18.17 |
| Tora_Bora:NNP | 14.96 | 15.46 | 15.46 | 0.00 | 20.50 |
| .:. | 20.50 | 20.00 | 18.17 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.negativeStatement: non factive text : false-JJ; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "false vs. hyp "." <-punct-- "seen", which aligned to text "seen"
Hand-tuned score: -7.0000
Threshold: -3.3437
Txt: It is false that Bin Laden was seen in Tora Bora.
Hyp: Bin Laden was not seen in Tora Bora. (Yes.)
| Bin_Laden NNP |
was VBD |
not RB |
seen VBN |
Tora_Bora NNP |
. . |
|
| It:PRP | 12.50 | 15.00 | 20.00 | 15.00 | 12.50 | 20.00 |
| is:VBZ | 14.84 | 0.50 | 19.96 | 7.74 | 15.46 | 20.00 |
| false:JJ | 12.46 | 11.96 | 11.96 | 11.96 | 12.46 | 18.76 |
| that:IN | 20.50 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| Bin_Laden:NNP | 0.00 | 14.84 | 15.46 | 14.84 | 14.96 | 20.50 |
| was:VBD | 14.84 | 0.00 | 19.96 | 7.11 | 15.46 | 20.00 |
| seen:VBN | 14.84 | 7.11 | 19.96 | 0.00 | 15.46 | 18.17 |
| Tora_Bora:NNP | 14.96 | 15.46 | 15.46 | 15.46 | 0.00 | 20.50 |
| .:. | 20.50 | 20.00 | 20.00 | 18.17 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (INCORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.negativeStatement: non factive text : false-JJ; Polarity.hypNegMarker: "seen": neg; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "false vs. hyp "." <-punct-- "seen", which aligned to text "seen"
Hand-tuned score: -13.0000
Threshold: -3.3437
Txt: It follows that Bin Laden was in Tora Bora.
Hyp: Bin Laden was in Tora Bora. (Yes.)
| Bin_Laden NNP |
was VBD |
Tora_Bora NNP |
. . |
|
| It:PRP | 12.50 | 15.00 | 12.50 | 20.00 |
| follows:VBZ | 15.50 | 10.00 | 15.46 | 17.41 |
| that:IN | 20.50 | 20.00 | 20.50 | 20.00 |
| Bin_Laden:NNP | 0.00 | 14.84 | 14.96 | 20.50 |
| was:VBD | 14.84 | 0.00 | 15.46 | 20.00 |
| Tora_Bora:NNP | 14.96 | 15.46 | 0.00 | 20.50 |
| .:. | 20.50 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.factivePassage: factive entails : follows-VBZ; Location.mismatch: no clear info of matching: be(X, prep_in); Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "follows vs. hyp "." <-punct-- "was", which aligned to text "was"
Hand-tuned score: -3.0000
Threshold: -3.3437
Txt: It follows that Bin Laden was in Tora Bora.
Hyp: Bin Laden was not in Tora Bora. (Don't know.)
| Bin_Laden NNP |
was VBD |
not RB |
Tora_Bora NNP |
. . |
|
| It:PRP | 12.50 | 15.00 | 20.00 | 12.50 | 20.00 |
| follows:VBZ | 15.50 | 10.00 | 19.96 | 15.46 | 17.41 |
| that:IN | 20.50 | 20.00 | 20.00 | 20.50 | 20.00 |
| Bin_Laden:NNP | 0.00 | 14.84 | 15.46 | 14.96 | 20.50 |
| was:VBD | 14.84 | 0.00 | 19.96 | 15.46 | 20.00 |
| Tora_Bora:NNP | 14.96 | 15.46 | 15.46 | 0.00 | 20.50 |
| .:. | 20.50 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 9.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.factivePassage: factive entails : follows-VBZ; Polarity.hypNegMarker: "was": neg; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "follows vs. hyp "." <-punct-- "was", which aligned to text "was"
Hand-tuned score: -7.0000
Threshold: -3.3437
Txt: It is likely that Bin Laden was in Tora Bora.
Hyp: Bin Laden was in Tora Bora. (Don't know.)
| Bin_Laden NNP |
was VBD |
Tora_Bora NNP |
. . |
|
| It:PRP | 12.50 | 15.00 | 12.50 | 20.00 |
| is:VBZ | 14.84 | 0.50 | 15.46 | 20.00 |
| likely:JJ | 12.46 | 11.96 | 12.46 | 19.92 |
| that:IN | 20.50 | 20.00 | 20.50 | 20.00 |
| Bin_Laden:NNP | 0.00 | 14.84 | 14.96 | 20.50 |
| was:VBD | 14.84 | 0.00 | 15.46 | 20.00 |
| Tora_Bora:NNP | 14.96 | 15.46 | 0.00 | 20.50 |
| .:. | 20.50 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -2.0000
Features matched: Factive.unknownPassage: non factive text -- unknown: likely-JJ; Location.mismatch: no clear info of matching: be(X, prep_in); Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "likely vs. hyp "." <-punct-- "was", which aligned to text "was"
Hand-tuned score: -3.5000
Threshold: -3.3437
Txt: It is likely that Bin Laden was in Tora Bora.
Hyp: Bin Laden was not in Tora Bora. (Don't know.)
| Bin_Laden NNP |
was VBD |
not RB |
Tora_Bora NNP |
. . |
|
| It:PRP | 12.50 | 15.00 | 20.00 | 12.50 | 20.00 |
| is:VBZ | 14.84 | 0.50 | 19.96 | 15.46 | 20.00 |
| likely:JJ | 12.46 | 11.96 | 11.96 | 12.46 | 19.92 |
| that:IN | 20.50 | 20.00 | 20.00 | 20.50 | 20.00 |
| Bin_Laden:NNP | 0.00 | 14.84 | 15.46 | 14.96 | 20.50 |
| was:VBD | 14.84 | 0.00 | 19.96 | 15.46 | 20.00 |
| Tora_Bora:NNP | 14.96 | 15.46 | 15.46 | 0.00 | 20.50 |
| .:. | 20.50 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 9.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -11.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.unknownPassage: non factive text -- unknown: likely-JJ; Polarity.hypNegMarker: "was": neg; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "likely vs. hyp "." <-punct-- "was", which aligned to text "was"
Hand-tuned score: -7.5000
Threshold: -3.3437
Txt: Tony Hall left Amman on Sunday.
Hyp: Tony Hall was in Amman on Sunday. (Yes.)
| Tony_Hall NNP |
was VBD |
Amman NNP |
Sunday NNP |
. . |
|
| Tony_Hall:NNP | 0.00 | 15.17 | 14.67 | 14.96 | 20.50 |
| left:VBD | 15.17 | 7.11 | 11.79 | 15.50 | 19.46 |
| Amman:NNP | 14.67 | 11.83 | 0.00 | 15.00 | 20.50 |
| Sunday:NNP | 14.96 | 15.50 | 15.00 | 0.00 | 20.50 |
| .:. | 20.50 | 20.00 | 20.50 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -9.1114
Features matched: Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Location.mismatch: no clear info of matching: be(X, prep_in); RootEntailment.poorlyAlignedRoot: "was" aligned badly to "left"; Structure.relMismatch: text "Amman" is dobj of "left" while hyp "Amman" is prep_in of "was" which aligned to text "left"
Hand-tuned score: -3.0000
Threshold: -3.3437
Txt: Tony Hall left Amman on Sunday.
Hyp: Tony Hall was not in Amman on Sunday. (Don't know.)
| Tony_Hall NNP |
was VBD |
not RB |
Amman NNP |
Sunday NNP |
. . |
|
| Tony_Hall:NNP | 0.00 | 15.17 | 15.46 | 14.67 | 14.96 | 20.50 |
| left:VBD | 15.17 | 7.11 | 19.96 | 11.79 | 15.50 | 19.46 |
| Amman:NNP | 14.67 | 11.83 | 15.46 | 0.00 | 15.00 | 20.50 |
| Sunday:NNP | 14.96 | 15.50 | 15.46 | 15.00 | 0.00 | 20.50 |
| .:. | 20.50 | 20.00 | 20.00 | 20.50 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 9.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -18.1114
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Polarity.hypNegMarker: "was": neg; NullPunisher.other: not; RootEntailment.poorlyAlignedRoot: "was" aligned badly to "left"; Structure.relMismatch: text "Amman" is dobj of "left" while hyp "Amman" is prep_in of "was" which aligned to text "left"
Hand-tuned score: -7.0000
Threshold: -3.3437
Txt: Tony Hall left Amman on Sunday.
Hyp: Tony Hall was in Amman on Saturday. (Don't know.)
| Tony_Hall NNP |
was VBD |
Amman NNP |
Saturday NNP |
. . |
|
| Tony_Hall:NNP | 0.00 | 15.17 | 14.67 | 14.96 | 20.50 |
| left:VBD | 15.17 | 7.11 | 11.79 | 15.50 | 19.46 |
| Amman:NNP | 14.67 | 11.83 | 0.00 | 15.00 | 20.50 |
| Sunday:NNP | 14.96 | 15.50 | 15.00 | 4.13 | 20.50 |
| .:. | 20.50 | 20.00 | 20.50 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -13.2437
Features matched: Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Location.mismatch: no clear info of matching: be(X, prep_in); Numeric.mismatch: date Saturday != Sunday; RootEntailment.poorlyAlignedRoot: "was" aligned badly to "left"; Structure.relMismatch: text "Amman" is dobj of "left" while hyp "Amman" is prep_in of "was" which aligned to text "left"
Hand-tuned score: -9.0000
Threshold: -3.3437
Txt: Tony Hall left Amman on Sunday.
Hyp: Tony Hall was not in Amman on Saturday. (Don't know.)
| Tony_Hall NNP |
was VBD |
not RB |
Amman NNP |
Saturday NNP |
. . |
|
| Tony_Hall:NNP | 0.00 | 15.17 | 15.46 | 14.67 | 14.96 | 20.50 |
| left:VBD | 15.17 | 7.11 | 19.96 | 11.79 | 15.50 | 19.46 |
| Amman:NNP | 14.67 | 11.83 | 15.46 | 0.00 | 15.00 | 20.50 |
| Sunday:NNP | 14.96 | 15.50 | 15.46 | 15.00 | 4.13 | 20.50 |
| .:. | 20.50 | 20.00 | 20.00 | 20.50 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 9.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -22.2437
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; Polarity.hypNegMarker: "was": neg; NullPunisher.other: not; Numeric.mismatch: date Saturday != Sunday; RootEntailment.poorlyAlignedRoot: "was" aligned badly to "left"; Structure.relMismatch: text "Amman" is dobj of "left" while hyp "Amman" is prep_in of "was" which aligned to text "left"
Hand-tuned score: -13.0000
Threshold: -3.3437
Txt: Khan sold 10 centrifuges to North Korea.
Hyp: North Korea bought 10 centrifuges. (Yes.)
| North_Korea NNP |
bought VBD |
10 CD |
centrifuges NNS |
. . |
|
| Khan:NNP | 14.02 | 15.50 | 25.00 | 8.53 | 20.50 |
| sold:VBD | 15.50 | 2.96 | 20.00 | 14.23 | 19.42 |
| 10:CD | 24.34 | 19.52 | 0.00 | 20.50 | 19.16 |
| centrifuges:NNS | 9.84 | 15.00 | 20.50 | 0.00 | 19.93 |
| North_Korea:NNP | 0.00 | 15.50 | 24.34 | 9.84 | 20.50 |
| .:. | 20.50 | 19.32 | 19.16 | 19.93 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -4.9562
Features matched: Structure.relMismatch: text "North_Korea" is prep_to of "sold" while hyp "North_Korea" is nsubj of "bought" which aligned to text "sold"
Hand-tuned score: 0.0000
Threshold: -3.3437
Txt: Khan sold 10 centrifuges to North Korea.
Hyp: North Korea did not buy 10 centrifuges. (Don't know.)
| North_Korea NNP |
did VBD |
not RB |
buy VB |
10 CD |
centrifuges NNS |
. . |
|
| Khan:NNP | 14.02 | 15.50 | 15.46 | 15.50 | 25.00 | 8.53 | 20.50 |
| sold:VBD | 15.50 | 7.69 | 19.96 | 6.51 | 20.00 | 14.23 | 19.42 |
| 10:CD | 24.34 | 19.19 | 20.46 | 20.03 | 0.00 | 20.50 | 19.16 |
| centrifuges:NNS | 9.84 | 15.00 | 14.96 | 14.88 | 20.50 | 0.00 | 19.93 |
| North_Korea:NNP | 0.00 | 14.52 | 15.46 | 15.50 | 24.34 | 9.84 | 20.50 |
| .:. | 20.50 | 17.99 | 20.00 | 20.00 | 19.16 | 19.93 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -18.5110
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "buy": neg; NullPunisher.other: not; NullPunisher.aux: did; RootEntailment.poorlyAlignedRoot: "buy" aligned badly to "sold"; Structure.relMismatch: text "North_Korea" is prep_to of "sold" while hyp "North_Korea" is nsubj of "buy" which aligned to text "sold"
Hand-tuned score: -8.0500
Threshold: -3.3437
Txt: The US invasion of Afghanistan prevented Al-Qaida from attacking Ryad in 2002.
Hyp: Al-Qaida attacked Ryad in 2002. (Don't know.)
| Al-Qaida NNP |
attacked VBD |
Ryad NNP |
2002 CD |
. . |
|
| The:DT | 20.50 | 20.00 | 20.00 | 20.50 | 10.00 |
| US:NNP | 10.00 | 15.50 | 10.46 | 24.96 | 20.50 |
| invasion:NN | 10.50 | 10.39 | 9.96 | 20.46 | 20.00 |
| Afghanistan:NNP | 10.00 | 15.50 | 10.46 | 24.96 | 20.50 |
| prevented:VBD | 15.50 | 6.64 | 14.96 | 20.37 | 18.96 |
| Al-Qaida:NNP | 0.50 | 15.00 | 9.96 | 20.46 | 20.00 |
| from:IN | 20.50 | 20.00 | 20.00 | 20.50 | 20.00 |
| attacking:VBG | 15.50 | 0.50 | 14.96 | 20.26 | 19.81 |
| Ryad:NNP | 9.96 | 15.46 | 0.50 | 24.96 | 20.50 |
| 2002:CD | 24.96 | 19.46 | 20.46 | 0.00 | 20.50 |
| .:. | 20.50 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -5.5000
Features matched: Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/2002; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "prevented vs. hyp "." <-punct-- "attacked", which aligned to text "attacking" args have different parents, different relations: text "Al-Qaida" <-dobj-- "prevented" vs. hyp "Al-Qaida" <-nsubj-- "attacked", which aligned to text "attacking"
Hand-tuned score: -1.0000
Threshold: -3.3437
Txt: The US invasion of Afghanistan prevented Al-Qaida from attacking Ryad in 2002.
Hyp: Al-Qaida did not attack Ryad in 2002. (Yes.)
| Al-Qaida NNP |
did VBD |
not RB |
attack VB |
Ryad NNP |
2002 CD |
. . |
|
| The:DT | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 10.00 |
| US:NNP | 10.50 | 15.50 | 15.46 | 15.50 | 10.46 | 24.96 | 20.50 |
| invasion:NN | 10.00 | 12.69 | 14.96 | 10.06 | 9.96 | 20.46 | 20.00 |
| Afghanistan:NNP | 10.50 | 15.50 | 15.46 | 15.50 | 10.46 | 24.96 | 20.50 |
| prevented:VBD | 15.00 | 9.29 | 19.96 | 6.37 | 14.96 | 20.37 | 18.96 |
| Al-Qaida:NNP | 0.00 | 15.00 | 14.96 | 15.00 | 9.96 | 20.46 | 20.00 |
| from:IN | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.50 | 20.00 |
| attacking:VBG | 15.00 | 7.62 | 19.96 | 0.50 | 14.96 | 20.26 | 19.81 |
| Ryad:NNP | 10.46 | 15.46 | 15.46 | 15.46 | 0.50 | 24.96 | 20.50 |
| 2002:CD | 20.46 | 20.46 | 20.46 | 20.44 | 20.46 | 0.00 | 20.50 |
| .:. | 20.00 | 17.99 | 20.00 | 20.00 | 20.00 | 20.50 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (INCORRECT)
Justification:
Alignment score: -15.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/2002; Polarity.hypNegMarker: "attack": neg; NullPunisher.other: not; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "prevented vs. hyp "." <-punct-- "attack", which aligned to text "attacking" args have different parents, different relations: text "Al-Qaida" <-dobj-- "prevented" vs. hyp "Al-Qaida" <-nsubj-- "attack", which aligned to text "attacking"
Hand-tuned score: -7.0500
Threshold: -3.3437
Txt: The administration managed to track down the perpetrators.
Hyp: The administration tracked down the perpetrators. (Yes.)
| The DT |
administration NN |
tracked_down VBD |
the DT |
perpetrators NNS |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| administration:NN | 20.00 | 0.00 | 13.86 | 20.00 | 9.76 | 19.88 |
| managed:VBD | 20.00 | 13.37 | 10.00 | 20.00 | 15.00 | 20.00 |
| to:TO | 10.00 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
| track_down:VB | 20.00 | 13.86 | 0.00 | 20.00 | 14.02 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| perpetrators:NNS | 20.00 | 9.76 | 14.02 | 20.00 | 0.00 | 19.30 |
| .:. | 10.00 | 19.88 | 20.00 | 10.00 | 19.30 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -4.0000
Features matched: Structure.argsMismatch: args have different parents but same relations: text "administration" <-nsubj-- "managed vs. hyp "administration" <-nsubj-- "tracked_down", which aligned to text "track_down" args have different parents but same relations: text "." <-punct-- "managed vs. hyp "." <-punct-- "tracked_down", which aligned to text "track_down"
Hand-tuned score: -2.0000
Threshold: -3.3437
Txt: The administration managed to track down the perpetrators.
Hyp: The administration did not track down the perpetrators. (Don't know.)
| The DT |
administration NN |
did VBD |
not RB |
track_down VB |
the DT |
perpetrators NNS |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| administration:NN | 20.00 | 0.00 | 12.69 | 14.96 | 13.86 | 20.00 | 9.76 | 19.88 |
| managed:VBD | 20.00 | 13.37 | 1.52 | 19.96 | 10.00 | 20.00 | 15.00 | 20.00 |
| to:TO | 10.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
| track_down:VB | 20.00 | 13.86 | 7.72 | 19.96 | 0.00 | 20.00 | 14.02 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| perpetrators:NNS | 20.00 | 9.76 | 14.73 | 14.96 | 14.02 | 20.00 | 0.00 | 19.30 |
| .:. | 10.00 | 19.88 | 17.99 | 20.00 | 20.00 | 10.00 | 19.30 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -14.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "track_down": neg; NullPunisher.other: not; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "administration" <-nsubj-- "managed vs. hyp "administration" <-nsubj-- "track_down", which aligned to text "track_down" args have different parents but same relations: text "." <-punct-- "managed vs. hyp "." <-punct-- "track_down", which aligned to text "track_down"
Hand-tuned score: -8.0500
Threshold: -3.3437
Txt: The administration didn't manage to track down the perpetrators.
Hyp: The administration tracked down the perpetrators. (Don't know.)
| The DT |
administration NN |
tracked_down VBD |
the DT |
perpetrators NNS |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| administration:NN | 20.00 | 0.00 | 13.86 | 20.00 | 9.76 | 19.88 |
| did:VBD | 20.00 | 12.69 | 7.72 | 20.00 | 14.73 | 17.99 |
| n't:RB | 20.00 | 14.96 | 19.96 | 20.00 | 13.39 | 17.90 |
| manage:VB | 20.00 | 15.00 | 10.00 | 20.00 | 14.77 | 19.98 |
| to:TO | 10.00 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
| track_down:VB | 20.00 | 13.86 | 0.00 | 20.00 | 14.02 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| perpetrators:NNS | 20.00 | 9.76 | 14.02 | 20.00 | 0.00 | 19.30 |
| .:. | 10.00 | 19.88 | 20.00 | 10.00 | 19.30 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -4.0000
Features matched: Structure.argsMismatch: args have different parents but same relations: text "administration" <-nsubj-- "manage vs. hyp "administration" <-nsubj-- "tracked_down", which aligned to text "track_down" args have different parents but same relations: text "." <-punct-- "manage vs. hyp "." <-punct-- "tracked_down", which aligned to text "track_down"
Hand-tuned score: -2.0000
Threshold: -3.3437
Txt: The administration didn't manage to track down the perpetrators.
Hyp: The administration did not track down the perpetrators. (Yes.)
| The DT |
administration NN |
did VBD |
not RB |
track_down VB |
the DT |
perpetrators NNS |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| administration:NN | 20.00 | 0.00 | 12.69 | 14.96 | 13.86 | 20.00 | 9.76 | 19.88 |
| did:VBD | 20.00 | 12.69 | 0.00 | 19.96 | 7.72 | 20.00 | 14.73 | 17.99 |
| n't:RB | 20.00 | 14.96 | 13.27 | 0.50 | 19.96 | 20.00 | 13.39 | 17.90 |
| manage:VB | 20.00 | 15.00 | 1.52 | 19.96 | 10.00 | 20.00 | 14.77 | 19.98 |
| to:TO | 10.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
| track_down:VB | 20.00 | 13.86 | 7.72 | 19.96 | 0.00 | 20.00 | 14.02 | 20.00 |
| the:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| perpetrators:NNS | 20.00 | 9.76 | 14.73 | 14.96 | 14.02 | 20.00 | 0.00 | 19.30 |
| .:. | 10.00 | 19.88 | 17.99 | 20.00 | 20.00 | 10.00 | 19.30 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (INCORRECT)
Justification:
Alignment score: -7.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "track_down": neg; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "administration" <-nsubj-- "manage vs. hyp "administration" <-nsubj-- "track_down", which aligned to text "track_down" args have different parents but same relations: text "n't" <-neg-- "manage vs. hyp "not" <-neg-- "track_down", which aligned to text "track_down" args have different parents but same relations: text "." <-punct-- "manage vs. hyp "." <-punct-- "track_down", which aligned to text "track_down"
Hand-tuned score: -7.0500
Threshold: -3.3437
Txt: Bush didn't have the time to read the report.
Hyp: Bush read the report. (Don't know.)
| Bush NNP |
read VBP |
the DT |
report NN |
. . |
|
| Bush:NNP | 0.00 | 13.95 | 20.00 | 10.00 | 20.00 |
| did:VBD | 15.00 | 6.81 | 20.00 | 12.69 | 17.99 |
| n't:RB | 14.96 | 18.98 | 20.00 | 14.01 | 17.90 |
| have:VB | 13.05 | 1.00 | 20.00 | 12.80 | 20.00 |
| the:DT | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| time:NN | 10.00 | 12.32 | 20.00 | 6.89 | 17.52 |
| to:TO | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
| read:VB | 13.95 | 0.00 | 20.00 | 11.87 | 18.96 |
| the:DT | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| report:NN | 10.00 | 11.87 | 20.00 | 0.00 | 19.87 |
| .:. | 20.00 | 18.96 | 10.00 | 19.87 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -4.0000
Features matched: Factive.inNegatedEmbedding: embedded negative text; Structure.argsMismatch: args have different parents but same relations: text "Bush" <-nsubj-- "have vs. hyp "Bush" <-nsubj-- "read", which aligned to text "read" args have different parents but same relations: text "." <-punct-- "have vs. hyp "." <-punct-- "read", which aligned to text "read"
Hand-tuned score: -7.0000
Threshold: -3.3437
Txt: Bush didn't have the time to read the report.
Hyp: Bush did not read the report. (Yes.)
| Bush NNP |
did VBD |
not RB |
read VB |
the DT |
report NN |
. . |
|
| Bush:NNP | 0.00 | 15.00 | 14.96 | 13.95 | 20.00 | 10.00 | 20.00 |
| did:VBD | 15.00 | 0.00 | 19.96 | 6.81 | 20.00 | 12.69 | 17.99 |
| n't:RB | 14.96 | 13.27 | 0.50 | 18.98 | 20.00 | 14.01 | 17.90 |
| have:VB | 13.05 | 7.32 | 19.96 | 1.00 | 20.00 | 12.80 | 20.00 |
| the:DT | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| time:NN | 10.00 | 11.26 | 14.96 | 12.32 | 20.00 | 6.89 | 17.52 |
| to:TO | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
| read:VB | 13.95 | 6.81 | 19.96 | 0.00 | 20.00 | 11.87 | 18.96 |
| the:DT | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| report:NN | 10.00 | 12.69 | 14.96 | 11.87 | 20.00 | 0.00 | 19.87 |
| .:. | 20.00 | 17.99 | 20.00 | 18.96 | 10.00 | 19.87 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (INCORRECT)
Justification:
Alignment score: -7.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.inNegatedEmbedding: embedded negative text; Polarity.hypNegMarker: "read": neg; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "Bush" <-nsubj-- "have vs. hyp "Bush" <-nsubj-- "read", which aligned to text "read" args have different parents but same relations: text "n't" <-neg-- "have vs. hyp "not" <-neg-- "read", which aligned to text "read" args have different parents but same relations: text "." <-punct-- "have vs. hyp "." <-punct-- "read", which aligned to text "read"
Hand-tuned score: -12.0500
Threshold: -3.3437
Txt: Bush had the time to read the report.
Hyp: Bush read the report. (Yes.)
| Bush NNP |
read VBP |
the DT |
report NN |
. . |
|
| Bush:NNP | 0.00 | 13.95 | 20.00 | 10.00 | 20.00 |
| had:VBD | 13.05 | 5.95 | 20.00 | 12.80 | 20.00 |
| the:DT | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| time:NN | 10.00 | 12.32 | 20.00 | 6.89 | 17.52 |
| to:TO | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
| read:VB | 13.95 | 0.00 | 20.00 | 11.87 | 18.96 |
| the:DT | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| report:NN | 10.00 | 11.87 | 20.00 | 0.00 | 19.87 |
| .:. | 20.00 | 18.96 | 10.00 | 19.87 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -4.0000
Features matched: Factive.inPositiveEmbedding: embedded positive text; Structure.argsMismatch: args have different parents but same relations: text "Bush" <-nsubj-- "had vs. hyp "Bush" <-nsubj-- "read", which aligned to text "read" args have different parents but same relations: text "." <-punct-- "had vs. hyp "." <-punct-- "read", which aligned to text "read"
Hand-tuned score: -1.0000
Threshold: -3.3437
Txt: Bush had the time to read the report.
Hyp: Bush did not read the report. (Don't know.)
| Bush NNP |
did VBD |
not RB |
read VB |
the DT |
report NN |
. . |
|
| Bush:NNP | 0.00 | 15.00 | 14.96 | 13.95 | 20.00 | 10.00 | 20.00 |
| had:VBD | 13.05 | 7.32 | 19.96 | 5.95 | 20.00 | 12.80 | 20.00 |
| the:DT | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| time:NN | 10.00 | 11.26 | 14.96 | 12.32 | 20.00 | 6.89 | 17.52 |
| to:TO | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
| read:VB | 13.95 | 6.81 | 19.96 | 0.00 | 20.00 | 11.87 | 18.96 |
| the:DT | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| report:NN | 10.00 | 12.69 | 14.96 | 11.87 | 20.00 | 0.00 | 19.87 |
| .:. | 20.00 | 17.99 | 20.00 | 18.96 | 10.00 | 19.87 | 0.00 |
| NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -14.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Factive.inPositiveEmbedding: embedded positive text; Polarity.hypNegMarker: "read": neg; NullPunisher.aux: did; NullPunisher.other: not; Structure.argsMismatch: args have different parents but same relations: text "Bush" <-nsubj-- "had vs. hyp "Bush" <-nsubj-- "read", which aligned to text "read" args have different parents but same relations: text "." <-punct-- "had vs. hyp "." <-punct-- "read", which aligned to text "read"
Hand-tuned score: -7.0500
Threshold: -3.3437
Txt: The president wasn't able to attend the meeting.
Hyp: The president attended the meeting. (Don't know.)
| The DT |
president NN |
attended VBD |
the DT |
meeting NN |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| president:NN | 20.00 | 0.00 | 14.56 | 20.00 | 8.35 | 20.00 |
| was:VBD | 20.00 | 14.34 | 10.00 | 20.00 | 12.52 | 20.00 |
| n't:RB | 20.00 | 14.96 | 19.96 | 20.00 | 14.96 | 17.90 |
| able:JJ | 20.00 | 11.96 | 11.96 | 20.00 | 11.96 | 17.24 |
| to:TO | 10.00 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
| attend:VB | 20.00 | 14.16 | 0.50 | 20.00 | 11.65 | 19.67 |
| the:DT | 0.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| meeting:NN | 20.00 | 8.35 | 11.39 | 20.00 | 0.00 | 19.92 |
| .:. | 10.00 | 20.00 | 19.97 | 10.00 | 19.92 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -4.5000
Features matched: Structure.argsMismatch: args have different parents but same relations: text "president" <-nsubj-- "able vs. hyp "president" <-nsubj-- "attended", which aligned to text "attend" args have different parents but same relations: text "." <-punct-- "able vs. hyp "." <-punct-- "attended", which aligned to text "attend"
Hand-tuned score: -2.0000
Threshold: -3.3437
Txt: The president wasn't able to attend the meeting.
Hyp: The president did not attend the meeting. (Yes.)
| The DT |
president NN |
did VBD |
not RB |
attend VB |
the DT |
meeting NN |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| president:NN | 20.00 | 0.00 | 15.00 | 14.96 | 14.16 | 20.00 | 8.35 | 20.00 |
| was:VBD | 20.00 | 14.34 | 10.00 | 19.96 | 10.00 | 20.00 | 12.52 | 20.00 |
| n't:RB | 20.00 | 14.96 | 13.27 | 0.50 | 18.46 | 20.00 | 14.96 | 17.90 |
| able:JJ | 20.00 | 11.96 | 11.55 | 11.96 | 10.38 | 20.00 | 11.96 | 17.24 |
| to:TO | 10.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
| attend:VB | 20.00 | 14.16 | 8.17 | 19.96 | 0.00 | 20.00 | 11.65 | 19.67 |
| the:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| meeting:NN | 20.00 | 8.35 | 12.69 | 14.96 | 11.65 | 20.00 | 0.00 | 19.92 |
| .:. | 10.00 | 20.00 | 17.99 | 20.00 | 19.67 | 10.00 | 19.92 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (INCORRECT)
Justification:
Alignment score: -7.5000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "attend": neg; NullPunisher.aux: did; Structure.argsMismatch: args have different parents but same relations: text "president" <-nsubj-- "able vs. hyp "president" <-nsubj-- "attend", which aligned to text "attend" args have different parents but same relations: text "n't" <-neg-- "able vs. hyp "not" <-neg-- "attend", which aligned to text "attend" args have different parents but same relations: text "." <-punct-- "able vs. hyp "." <-punct-- "attend", which aligned to text "attend"
Hand-tuned score: -7.0500
Threshold: -3.3437
Txt: The president was able to attend to meeting.
Hyp: The president attended the meeting. (Yes.)
| The DT |
president NN |
attended VBD |
the DT |
meeting NN |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| president:NN | 20.00 | 0.00 | 14.56 | 20.00 | 8.35 | 20.00 |
| was:VBD | 20.00 | 14.34 | 10.00 | 20.00 | 12.52 | 20.00 |
| able:JJ | 20.00 | 11.96 | 11.96 | 20.00 | 11.96 | 17.24 |
| to:TO | 10.00 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
| attend:VB | 20.00 | 14.16 | 0.50 | 20.00 | 11.65 | 19.67 |
| meeting:NN | 20.00 | 8.35 | 11.39 | 20.00 | 0.00 | 19.92 |
| .:. | 10.00 | 20.00 | 19.97 | 10.00 | 19.92 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -7.5000
Features matched: NullPunisher.article: the; Structure.argsMismatch: args have different parents but same relations: text "president" <-nsubj-- "able vs. hyp "president" <-nsubj-- "attended", which aligned to text "attend" args have different parents but same relations: text "." <-punct-- "able vs. hyp "." <-punct-- "attended", which aligned to text "attend" text "meeting" is prep_to of "attend" while hyp "meeting" is dobj of "attended" which aligned to text "attend"
Hand-tuned score: -2.1000
Threshold: -3.3437
Txt: The president was able to attend to meeting.
Hyp: The president did not attend the meeting. (Don't know.)
| The DT |
president NN |
did VBD |
not RB |
attend VB |
the DT |
meeting NN |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| president:NN | 20.00 | 0.00 | 15.00 | 14.96 | 14.16 | 20.00 | 8.35 | 20.00 |
| was:VBD | 20.00 | 14.34 | 10.00 | 19.96 | 10.00 | 20.00 | 12.52 | 20.00 |
| able:JJ | 20.00 | 11.96 | 11.55 | 11.96 | 10.38 | 20.00 | 11.96 | 17.24 |
| to:TO | 10.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
| attend:VB | 20.00 | 14.16 | 8.17 | 19.96 | 0.00 | 20.00 | 11.65 | 19.67 |
| meeting:NN | 20.00 | 8.35 | 12.69 | 14.96 | 11.65 | 20.00 | 0.00 | 19.92 |
| .:. | 10.00 | 20.00 | 17.99 | 20.00 | 19.67 | 10.00 | 19.92 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -17.0000
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "attend": neg; NullPunisher.aux: did; NullPunisher.other: not; NullPunisher.article: the; Structure.argsMismatch: args have different parents but same relations: text "president" <-nsubj-- "able vs. hyp "president" <-nsubj-- "attend", which aligned to text "attend" args have different parents but same relations: text "." <-punct-- "able vs. hyp "." <-punct-- "attend", which aligned to text "attend" text "meeting" is prep_to of "attend" while hyp "meeting" is dobj of "attend" which aligned to text "attend"
Hand-tuned score: -8.1500
Threshold: -3.3437
Txt: Many soldiers were killed in the ambush.
Hyp: All soldiers were killed in the ambush. (Don't know.)
| All DT |
soldiers NNS |
were VBD |
killed VBN |
the DT |
ambush NN |
. . |
|
| Many:JJ | 20.00 | 11.96 | 11.96 | 11.96 | 20.00 | 11.96 | 20.00 |
| soldiers:NNS | 20.00 | 0.00 | 14.34 | 9.33 | 20.00 | 4.63 | 20.00 |
| were:VBD | 20.00 | 14.34 | 0.00 | 8.33 | 20.00 | 15.00 | 20.00 |
| killed:VBN | 20.00 | 9.33 | 8.33 | 0.00 | 20.00 | 9.69 | 20.00 |
| the:DT | 10.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| ambush:NN | 20.00 | 4.63 | 15.00 | 9.69 | 20.00 | 0.00 | 20.00 |
| .:. | 10.00 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 0.00 |
| NO_WORD | 10.00 | 10.00 | 1.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -10.0000
Features matched: Adjunct.dropPosCxt: text adjunct "Many" of "soldiers" dropped on aligned hyp word "soldiers"; NullPunisher.other: All; Quant.expand: [many,all]
Hand-tuned score: -6.5000
Threshold: -3.3437
Txt: Many soldiers were killed in the ambush.
Hyp: Not all soldiers were killed in the ambush. (Yes.)
| Not RB |
all DT |
soldiers NNS |
were VBD |
killed VBN |
the DT |
ambush NN |
. . |
|
| Many:JJ | 11.96 | 20.00 | 11.96 | 11.96 | 11.96 | 20.00 | 11.96 | 20.00 |
| soldiers:NNS | 14.96 | 20.00 | 0.00 | 14.34 | 9.33 | 20.00 | 4.63 | 20.00 |
| were:VBD | 19.96 | 20.00 | 14.34 | 0.00 | 8.33 | 20.00 | 15.00 | 20.00 |
| killed:VBN | 19.96 | 20.00 | 9.33 | 8.33 | 0.00 | 20.00 | 9.69 | 20.00 |
| the:DT | 20.00 | 10.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
| ambush:NN | 14.96 | 20.00 | 4.63 | 15.00 | 9.69 | 20.00 | 0.00 | 20.00 |
| .:. | 20.00 | 10.00 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 0.00 |
| NO_WORD | 9.00 | 10.00 | 10.00 | 1.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: dontknow (INCORRECT)
Justification:
Alignment score: -19.0000
Features matched: Adjunct.addPosCxt: hyp added Not[Not-RB]; Adjunct.dropPosCxt: text adjunct "Many" of "soldiers" dropped on aligned hyp word "soldiers"; NullPunisher.other: Not; NullPunisher.other: all; Quant.expand: [many,all]
Hand-tuned score: -8.5000
Threshold: -3.3437
Txt: The man had $20 in his pocket.
Hyp: The man had $40 in his pocket. (Don't know.)
| The DT |
man NN |
had VBD |
$ $ |
40 CD |
his PRP$ |
pocket NN |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 10.50 | 20.50 | 20.00 | 20.00 | 10.00 |
| man:NN | 20.00 | 0.00 | 13.05 | 18.93 | 20.50 | 12.00 | 6.78 | 19.76 |
| had:VBD | 20.00 | 13.05 | 0.00 | 20.50 | 20.50 | 15.00 | 13.95 | 20.00 |
| $:$ | 10.50 | 18.93 | 20.50 | 0.00 | 20.00 | 20.50 | 17.92 | 9.91 |
| 20:CD | 20.50 | 20.50 | 20.50 | 20.00 | 0.69 | 20.50 | 19.19 | 19.23 |
| his:PRP$ | 20.00 | 12.00 | 15.00 | 20.50 | 20.50 | 0.00 | 12.00 | 20.00 |
| pocket:NN | 20.00 | 6.78 | 13.95 | 17.92 | 19.19 | 12.00 | 0.00 | 18.62 |
| .:. | 10.00 | 19.76 | 20.00 | 9.91 | 18.57 | 20.00 | 18.62 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -0.6931
Features matched: Numeric.mismatch: MONEY mismatch: '$40.0' vs '$20.0'
Hand-tuned score: -5.0000
Threshold: -3.3437
Txt: The man had $20 in his pocket.
Hyp: The man did not have $40 in his pocket. (Yes.)
| The DT |
man NN |
did VBD |
not RB |
have VB |
$ $ |
40 CD |
his PRP$ |
pocket NN |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.50 | 20.50 | 20.00 | 20.00 | 10.00 |
| man:NN | 20.00 | 0.00 | 12.63 | 14.96 | 13.05 | 18.93 | 20.50 | 12.00 | 6.78 | 19.76 |
| had:VBD | 20.00 | 13.05 | 7.32 | 19.96 | 0.50 | 20.50 | 20.50 | 15.00 | 13.95 | 20.00 |
| $:$ | 10.50 | 18.93 | 20.50 | 20.50 | 20.50 | 0.00 | 20.00 | 20.50 | 17.92 | 9.91 |
| 20:CD | 20.50 | 20.50 | 19.19 | 20.46 | 20.50 | 20.00 | 0.69 | 20.50 | 19.19 | 19.23 |
| his:PRP$ | 20.00 | 12.00 | 15.00 | 20.00 | 15.00 | 20.50 | 20.50 | 0.00 | 12.00 | 20.00 |
| pocket:NN | 20.00 | 6.78 | 12.53 | 14.96 | 13.95 | 17.92 | 19.19 | 12.00 | 0.00 | 18.62 |
| .:. | 10.00 | 19.76 | 17.99 | 20.00 | 20.00 | 9.91 | 18.57 | 20.00 | 18.62 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (INCORRECT)
Justification:
Alignment score: -11.1931
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "have": neg; NullPunisher.other: not; NullPunisher.aux: did; Numeric.mismatch: MONEY mismatch: '$40.0' vs '$20.0'
Hand-tuned score: -11.0500
Threshold: -3.3437
Txt: The man had $20 in his pocket.
Hyp: The man had $10 in his pocket. (Yes.)
| The DT |
man NN |
had VBD |
$ $ |
10 CD |
his PRP$ |
pocket NN |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 10.50 | 20.50 | 20.00 | 20.00 | 10.00 |
| man:NN | 20.00 | 0.00 | 13.05 | 18.93 | 20.50 | 12.00 | 6.78 | 19.76 |
| had:VBD | 20.00 | 13.05 | 0.00 | 20.50 | 20.50 | 15.00 | 13.95 | 20.00 |
| $:$ | 10.50 | 18.93 | 20.50 | 0.00 | 20.00 | 20.50 | 17.92 | 9.91 |
| 20:CD | 20.50 | 20.50 | 20.50 | 20.00 | 0.69 | 20.50 | 19.19 | 19.23 |
| his:PRP$ | 20.00 | 12.00 | 15.00 | 20.50 | 20.50 | 0.00 | 12.00 | 20.00 |
| pocket:NN | 20.00 | 6.78 | 13.95 | 17.92 | 19.19 | 12.00 | 0.00 | 18.62 |
| .:. | 10.00 | 19.76 | 20.00 | 9.91 | 19.16 | 20.00 | 18.62 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (INCORRECT)
Justification:
Alignment score: -0.6931
Features matched: Numeric.mismatch: MONEY mismatch: '$10.0' vs '$20.0'
Hand-tuned score: -5.0000
Threshold: -3.3437
Txt: The man had $20 in his pocket.
Hyp: The man did not have $10 in his pocket. (Don't know.)
| The DT |
man NN |
did VBD |
not RB |
have VB |
$ $ |
10 CD |
his PRP$ |
pocket NN |
. . |
|
| The:DT | 0.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.50 | 20.50 | 20.00 | 20.00 | 10.00 |
| man:NN | 20.00 | 0.00 | 12.63 | 14.96 | 13.05 | 18.93 | 20.50 | 12.00 | 6.78 | 19.76 |
| had:VBD | 20.00 | 13.05 | 7.32 | 19.96 | 0.50 | 20.50 | 20.50 | 15.00 | 13.95 | 20.00 |
| $:$ | 10.50 | 18.93 | 20.50 | 20.50 | 20.50 | 0.00 | 20.00 | 20.50 | 17.92 | 9.91 |
| 20:CD | 20.50 | 20.50 | 19.19 | 20.46 | 20.50 | 20.00 | 0.69 | 20.50 | 19.19 | 19.23 |
| his:PRP$ | 20.00 | 12.00 | 15.00 | 20.00 | 15.00 | 20.50 | 20.50 | 0.00 | 12.00 | 20.00 |
| pocket:NN | 20.00 | 6.78 | 12.53 | 14.96 | 13.95 | 17.92 | 19.19 | 12.00 | 0.00 | 18.62 |
| .:. | 10.00 | 19.76 | 17.99 | 20.00 | 20.00 | 9.91 | 19.16 | 20.00 | 18.62 | 0.00 |
| NO_WORD | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: dontknow (CORRECT)
Justification:
Alignment score: -11.1931
Features matched: Adjunct.diffPol: hyp and txt have different polarity; Polarity.hypNegMarker: "have": neg; NullPunisher.aux: did; NullPunisher.other: not; Numeric.mismatch: MONEY mismatch: '$10.0' vs '$20.0'
Hand-tuned score: -11.0500
Threshold: -3.3437
java edu.stanford.nlp.rte.WordSimilarityGenerator -info /u/nlp/rte/data/byformat/align/stochastic/parc_predev.pipeline.align.xml -output /u/nlp/rte/data/byformat/wordsim/stochastic/parc_predev.pipeline.wordsim.html -lex.BasicWN off