Txt/Hyp term similarities

The rows are the txt words. The columns are hyp words.
Resource summary:
Acronym: AcronymLexicalResource
BasicWN: BasicWNLexicalResource
Country: CountryLexicalResource
Cyc: null
DekangLin: DekangLinLexicalResource
Google: null
InfoMap: InfoMapLexicalResource
NomBank: NomBankLexicalResource
Number: NumberLexicalResource
Ordinal: OrdinalLexicalResource
Preposition: PrepositionLexicalResource
Ravichandran: RavichandranLexicalResource
ResnikWN: ResnikWNLexicalResource
StringSim: StringSimLexicalResource



Inference ID: ATM1-1

Txt: John took the plane from Paris to Baghdad. On the way the plane stopped in Rome. Steve, who was sitting next to John, got down in Rome.

Hyp: John is in Baghdad . (yes)

John
NNP
is
VBZ
Baghdad
NNP
.
.
John:NNP   0.00 14.84 14.34 20.50
took:VBD 15.50   4.35 15.50 18.92
the:DT 20.50 20.00 20.50 10.00
plane:NN   8.53 14.34   9.84 19.75
Paris:NNP 14.34 14.84   3.98 20.50
Baghdad:NNP 14.34 14.84   0.00 20.50
.:. 20.50 20.00 20.50   0.00
the:DT 20.50 20.00 20.50 10.00
way:NN   7.97 14.34   8.02 16.60
the:DT 20.50 20.00 20.50 10.00
plane:NN   8.53 14.34   9.84 19.75
stopped:VBD 12.56   9.34 13.02 18.36
Rome:NNP 14.34 14.84   3.98 20.50
.:. 20.50 20.00 20.50   0.00
Steve:NNP   8.58 15.46 14.96 20.50
,:, 20.50 20.00 20.50   5.73
who:WP 12.50 15.00 12.50 20.00
was:VBD 14.84   0.50 11.83 20.00
sitting:VBG 11.18 10.00 15.50 19.32
John:NNP   0.00 14.84 14.34 20.50
,:, 20.50 20.00 20.50   5.73
got_down:VBD 14.52   7.74 15.17 20.00
Rome:NNP 14.34 14.84   3.98 20.50
.:. 20.50 20.00 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -5.3499
Features matched: Adjunct.dropPosCxt: text adjunct "Paris" of "took" dropped on aligned hyp word "is"; Location.mismatch: no clear info of matching: be(X, prep_in); RootEntailment.poorlyAlignedRoot: "is" aligned badly to "took"; Structure.relMismatch: text "Baghdad" is prep_to of "took" while hyp "Baghdad" is prep_in of "is" which aligned to text "took"
Hand-tuned score: -3.5000
Threshold: -11.4590


Inference ID: ATM1-2

Txt: John took the plane from Paris to Baghdad. On the way the plane stopped in Rome. Steve, who was sitting next to John, got down in Rome.

Hyp: John is in Paris . (no)

John
NNP
is
VBZ
Paris
NNP
.
.
John:NNP   0.00 14.84 14.34 20.50
took:VBD 15.50   4.35 15.50 18.92
the:DT 20.50 20.00 20.50 10.00
plane:NN   8.53 14.34   9.84 19.75
Paris:NNP 14.34 14.84   0.00 20.50
Baghdad:NNP 14.34 14.84   3.98 20.50
.:. 20.50 20.00 20.50   0.00
the:DT 20.50 20.00 20.50 10.00
way:NN   7.97 14.34   8.02 16.60
the:DT 20.50 20.00 20.50 10.00
plane:NN   8.53 14.34   9.84 19.75
stopped:VBD 12.56   9.34 13.02 18.36
Rome:NNP 14.34 14.84   3.98 20.50
.:. 20.50 20.00 20.50   0.00
Steve:NNP   8.58 15.46 14.96 20.50
,:, 20.50 20.00 20.50   5.73
who:WP 12.50 15.00 12.50 20.00
was:VBD 14.84   0.50 11.83 20.00
sitting:VBG 11.18 10.00 13.63 19.32
John:NNP   0.00 14.84 14.34 20.50
,:, 20.50 20.00 20.50   5.73
got_down:VBD 14.52   7.74 15.17 20.00
Rome:NNP 14.34 14.84   3.98 20.50
.:. 20.50 20.00 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -5.3499
Features matched: Adjunct.dropPosCxt: text adjunct "Baghdad" of "took" dropped on aligned hyp word "is"; Location.mismatch: no clear info of matching: be(X, prep_in); RootEntailment.poorlyAlignedRoot: "is" aligned badly to "took"; Structure.relMismatch: text "Paris" is prep_from of "took" while hyp "Paris" is prep_in of "is" which aligned to text "took"
Hand-tuned score: -3.5000
Threshold: -11.4590


Inference ID: ATM1-3

Txt: John took the plane from Paris to Baghdad. On the way the plane stopped in Rome. Steve, who was sitting next to John, got down in Rome.

Hyp: John was in Paris . (yes)

John
NNP
was
VBD
Paris
NNP
.
.
John:NNP   0.00 14.84 14.34 20.50
took:VBD 15.50 10.00 15.50 18.92
the:DT 20.50 20.00 20.50 10.00
plane:NN   8.53 14.34   9.84 19.75
Paris:NNP 14.34 11.83   0.00 20.50
Baghdad:NNP 14.34 11.83   3.98 20.50
.:. 20.50 20.00 20.50   0.00
the:DT 20.50 20.00 20.50 10.00
way:NN   7.97 12.52   8.02 16.60
the:DT 20.50 20.00 20.50 10.00
plane:NN   8.53 14.34   9.84 19.75
stopped:VBD 12.56   7.52 13.02 18.36
Rome:NNP 14.34 11.83   3.98 20.50
.:. 20.50 20.00 20.50   0.00
Steve:NNP   8.58 15.46 14.96 20.50
,:, 20.50 20.00 20.50   5.73
who:WP 12.50 15.00 12.50 20.00
was:VBD 14.84   0.00 11.83 20.00
sitting:VBG 11.18 10.00 13.63 19.32
John:NNP   0.00 14.84 14.34 20.50
,:, 20.50 20.00 20.50   5.73
got_down:VBD 14.52   9.67 15.17 20.00
Rome:NNP 14.34 11.83   3.98 20.50
.:. 20.50 20.00 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -6.0000
Features matched: Location.mismatch: no clear info of matching: be(X, prep_in); Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "got_down vs. hyp "." <-punct-- "was", which aligned to text "was" args have different parents, different relations: text "John" <-prep_next_to-- "sitting" vs. hyp "John" <-nsubj-- "was", which aligned to text "was" args have different parents, different relations: text "Paris" <-prep_from-- "took" vs. hyp "Paris" <-prep_in-- "was", which aligned to text "was"
Hand-tuned score: -4.0000
Threshold: -11.4590


Inference ID: ATM1-4

Txt: John took the plane from Paris to Baghdad. On the way the plane stopped in Rome. Steve, who was sitting next to John, got down in Rome.

Hyp: John was in Rome . (yes)

John
NNP
was
VBD
Rome
NNP
.
.
John:NNP   0.00 14.84 14.34 20.50
took:VBD 15.50 10.00 15.50 18.92
the:DT 20.50 20.00 20.50 10.00
plane:NN   8.53 14.34   9.84 19.75
Paris:NNP 14.34 11.83   3.98 20.50
Baghdad:NNP 14.34 11.83   3.98 20.50
.:. 20.50 20.00 20.50   0.00
the:DT 20.50 20.00 20.50 10.00
way:NN   7.97 12.52   8.02 16.60
the:DT 20.50 20.00 20.50 10.00
plane:NN   8.53 14.34   9.84 19.75
stopped:VBD 12.56   7.52 13.02 18.36
Rome:NNP 14.34 11.83   0.00 20.50
.:. 20.50 20.00 20.50   0.00
Steve:NNP   8.58 15.46 14.96 20.50
,:, 20.50 20.00 20.50   5.73
who:WP 12.50 15.00 12.50 20.00
was:VBD 14.84   0.00 11.83 20.00
sitting:VBG 11.18 10.00 12.02 19.32
John:NNP   0.00 14.84 14.34 20.50
,:, 20.50 20.00 20.50   5.73
got_down:VBD 14.52   9.67 15.17 20.00
Rome:NNP 14.34 11.83   0.00 20.50
.:. 20.50 20.00 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -6.0000
Features matched: Location.mismatch: no clear info of matching: be(X, prep_in); Structure.argsMismatch: args have different parents but same relations: text "Rome" <-prep_in-- "stopped vs. hyp "Rome" <-prep_in-- "was", which aligned to text "was" args have different parents but same relations: text "." <-punct-- "got_down vs. hyp "." <-punct-- "was", which aligned to text "was" args have different parents, different relations: text "John" <-prep_next_to-- "sitting" vs. hyp "John" <-nsubj-- "was", which aligned to text "was"
Hand-tuned score: -4.0000
Threshold: -11.4590


Inference ID: ATM1-5

Txt: John took the plane from Paris to Baghdad. On the way the plane stopped in Rome. Steve, who was sitting next to John, got down in Rome.

Hyp: John in Rome was WH[LOCATION] . (airport)

John
NNP
Rome
NNP
was
VBD
WH[LOCATION]
JJ
.
.
John:NNP   0.00 14.34 14.84   0.00 20.50
took:VBD 15.50 15.50 10.00 17.50 18.92
the:DT 20.50 20.50 20.00 17.50 10.00
plane:NN   8.53   9.84 14.34   0.00 19.75
Paris:NNP 14.34   3.98 11.83   0.00 20.50
Baghdad:NNP 14.34   3.98 11.83   0.00 20.50
.:. 20.50 20.50 20.00 17.50   0.00
the:DT 20.50 20.50 20.00 17.50 10.00
way:NN   7.97   8.02 12.52   0.00 16.60
the:DT 20.50 20.50 20.00 17.50 10.00
plane:NN   8.53   9.84 14.34   0.00 19.75
stopped:VBD 12.56 13.02   7.52 17.50 18.36
Rome:NNP 14.34   0.00 11.83   0.00 20.50
.:. 20.50 20.50 20.00 17.50   0.00
Steve:NNP   8.58 14.96 15.46   0.00 20.50
,:, 20.50 20.50 20.00 17.50   5.73
who:WP 12.50 12.50 15.00 17.50 20.00
was:VBD 14.84 11.83   0.00 17.50 20.00
sitting:VBG 11.18 12.02 10.00 17.50 19.32
John:NNP   0.00 14.34 14.84   0.00 20.50
,:, 20.50 20.50 20.00 17.50   5.73
got_down:VBD 14.52 15.17   9.67 17.50 20.00
Rome:NNP 14.34   0.00 11.83   0.00 20.50
.:. 20.50 20.50 20.00 17.50   0.00
NO_WORD 10.00 10.00   1.00   9.00 10.00

Response: plane (INCORRECT)
Justification:
Alignment score: -7.0000
Features matched: NullPunisher.aux: was; Structure.argsMismatch: args have different parents but same relations: text "John" <-nsubj-- "took vs. hyp "John" <-nsubj-- "WH[LOCATION]", which aligned to text "plane" args have different parents but same relations: text "." <-punct-- "took vs. hyp "." <-punct-- "WH[LOCATION]", which aligned to text "plane"
Hand-tuned score: -2.0500
Threshold: -11.4590


Inference ID: ATM1-6

Txt: John took the plane from Paris to Baghdad. On the way the plane stopped in Rome. Steve, who was sitting next to John, got down in Rome.

Hyp: John did pass through the Rome airport customs area . (no)

John
NNP
did
VBD
pass
VB
the
DT
Rome
NNP
airport
NN
customs
NNS
area
NN
.
.
John:NNP   0.00 13.35 12.27 20.50 14.34   8.53 10.50   6.11 20.50
took:VBD 15.50   6.90   4.98 20.00 15.50 15.00 11.96 15.00 18.92
the:DT 20.50 20.00 20.00   0.00 20.50 20.00 20.00 20.00 10.00
plane:NN   8.53 12.45 12.44 20.00   9.84   4.07   7.02   7.44 19.75
Paris:NNP 14.34 15.50 13.02 20.50   3.98   9.61 10.50   6.57 20.50
Baghdad:NNP 14.34 15.50 13.02 20.50   3.98   9.84 10.50   6.57 20.50
.:. 20.50 17.99 18.93 10.00 20.50 20.00 20.00 18.66   0.00
the:DT 20.50 20.00 20.00   0.00 20.50 20.00 20.00 20.00 10.00
way:NN   7.97 12.53 10.78 20.00   8.02   8.03   6.96   6.95 16.60
the:DT 20.50 20.00 20.00   0.00 20.50 20.00 20.00 20.00 10.00
plane:NN   8.53 12.45 12.44 20.00   9.84   4.07   7.02   7.44 19.75
stopped:VBD 12.56   4.55   6.89 20.00 13.02 12.72 13.72 12.06 18.36
Rome:NNP 14.34 15.50 13.02 20.50   0.00   9.70 10.50   6.57 20.50
.:. 20.50 17.99 18.93 10.00 20.50 20.00 20.00 18.66   0.00
Steve:NNP   8.58 15.46 15.46 20.50 14.96 10.46 10.46 10.46 20.50
,:, 20.50 19.80 20.00 10.00 20.50 20.00 19.94 18.60   5.73
who:WP 12.50 15.00 15.00 20.00 12.50 12.00 12.00 12.00 20.00
was:VBD 14.84 10.00   7.52 20.00 11.83 14.34 15.00 12.11 20.00
sitting:VBG 11.18   7.85   6.90 20.00 12.02 13.96 15.00 13.69 19.32
John:NNP   0.00 13.35 12.27 20.50 14.34   8.53 10.50   6.11 20.50
,:, 20.50 19.80 20.00 10.00 20.50 20.00 19.94 18.60   5.73
got_down:VBD 14.52   7.45   7.45 20.00 15.17 14.47 15.00 14.47 20.00
Rome:NNP 14.34 15.50 13.02 20.50   0.00   9.70 10.50   6.57 20.50
.:. 20.50 17.99 18.93 10.00 20.50 20.00 20.00 18.66   0.00
NO_WORD 10.00   1.00 10.00   1.00 10.00 10.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -31.5797
Features matched: Adjunct.dropPosCxt: text adjunct "Paris" of "took" dropped on aligned hyp word "pass"; NullPunisher.article: the; NullPunisher.aux: did; RootEntailment.poorlyAlignedRoot: "pass" aligned badly to "took"; Structure.relMismatch: text "Baghdad" is prep_to of "took" while hyp "area" is prep_through of "pass" which aligned to text "took"
Hand-tuned score: -1.6500
Threshold: -11.4590


Inference ID: ATM1-7

Txt: John took the plane from Paris to Baghdad. On the way the plane stopped in Rome. Steve, who was sitting next to John, got down in Rome.

Hyp: Steve did pass through the Rome airport customs area . (yes)

Steve
NNP
did
VBD
pass
VB
the
DT
Rome
NNP
airport
NN
customs
NNS
area
NN
.
.
John:NNP   8.58 13.35 12.27 20.50 14.34   8.53 10.50   6.11 20.50
took:VBD 15.46   6.90   4.98 20.00 15.50 15.00 11.96 15.00 18.92
the:DT 20.50 20.00 20.00   0.00 20.50 20.00 20.00 20.00 10.00
plane:NN 10.46 12.45 12.44 20.00   9.84   4.07   7.02   7.44 19.75
Paris:NNP 14.96 15.50 13.02 20.50   3.98   9.61 10.50   6.57 20.50
Baghdad:NNP 14.96 15.50 13.02 20.50   3.98   9.84 10.50   6.57 20.50
.:. 20.50 17.99 18.93 10.00 20.50 20.00 20.00 18.66   0.00
the:DT 20.50 20.00 20.00   0.00 20.50 20.00 20.00 20.00 10.00
way:NN 10.46 12.53 10.78 20.00   8.02   8.03   6.96   6.95 16.60
the:DT 20.50 20.00 20.00   0.00 20.50 20.00 20.00 20.00 10.00
plane:NN 10.46 12.45 12.44 20.00   9.84   4.07   7.02   7.44 19.75
stopped:VBD 15.46   4.55   6.89 20.00 13.02 12.72 13.72 12.06 18.36
Rome:NNP 14.96 15.50 13.02 20.50   0.00   9.70 10.50   6.57 20.50
.:. 20.50 17.99 18.93 10.00 20.50 20.00 20.00 18.66   0.00
Steve:NNP   0.00 15.46 15.46 20.50 14.96 10.46 10.46 10.46 20.50
,:, 20.50 19.80 20.00 10.00 20.50 20.00 19.94 18.60   5.73
who:WP 12.50 15.00 15.00 20.00 12.50 12.00 12.00 12.00 20.00
was:VBD 15.46 10.00   7.52 20.00 11.83 14.34 15.00 12.11 20.00
sitting:VBG 15.46   7.85   6.90 20.00 12.02 13.96 15.00 13.69 19.32
John:NNP   8.58 13.35 12.27 20.50 14.34   8.53 10.50   6.11 20.50
,:, 20.50 19.80 20.00 10.00 20.50 20.00 19.94 18.60   5.73
got_down:VBD 15.46   7.45   7.45 20.00 15.17 14.47 15.00 14.47 20.00
Rome:NNP 14.96 15.50 13.02 20.50   0.00   9.70 10.50   6.57 20.50
.:. 20.50 17.99 18.93 10.00 20.50 20.00 20.00 18.66   0.00
NO_WORD 10.00   1.00 10.00   1.00 10.00 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -33.5797
Features matched: Adjunct.dropPosCxt: text adjunct "Paris" of "took" dropped on aligned hyp word "pass"; NullPunisher.aux: did; NullPunisher.article: the; RootEntailment.poorlyAlignedRoot: "pass" aligned badly to "took"; Structure.argsMismatch: args have different parents but same relations: text "Steve" <-nsubj-- "sitting vs. hyp "Steve" <-nsubj-- "pass", which aligned to text "took" args have different parents but same relations: text "Steve" <-nsubj-- "got_down vs. hyp "Steve" <-nsubj-- "pass", which aligned to text "took" text "Baghdad" is prep_to of "took" while hyp "area" is prep_through of "pass" which aligned to text "took"
Hand-tuned score: -3.6500
Threshold: -11.4590


Inference ID: ATM2-1

Txt: John took the plane from Paris to Baghdad. On the way the plane stopped in Rome, where John was arrested.

Hyp: John is in Baghdad . (no)

John
NNP
is
VBZ
Baghdad
NNP
.
.
John:NNP   0.00 14.84 14.34 20.50
took:VBD 15.50   4.35 15.50 18.92
the:DT 20.50 20.00 20.50 10.00
plane:NN   8.53 14.34   9.84 19.75
Paris:NNP 14.34 14.84   3.98 20.50
Baghdad:NNP 14.34 14.84   0.00 20.50
.:. 20.50 20.00 20.50   0.00
the:DT 20.50 20.00 20.50 10.00
way:NN   7.97 14.34   8.02 16.60
the:DT 20.50 20.00 20.50 10.00
plane:NN   8.53 14.34   9.84 19.75
stopped:VBD 12.56   9.34 13.02 18.36
Rome:NNP 14.34 14.84   3.98 20.50
,:, 20.50 20.00 20.50   5.73
where:WRB 20.46 19.96 20.46 10.00
John:NNP   0.00 14.84 14.34 20.50
was:VBD 14.84   0.50 11.83 20.00
arrested:VBN 15.50 10.00 15.50 20.00
.:. 20.50 20.00 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -5.3499
Features matched: Adjunct.dropPosCxt: text adjunct "Paris" of "took" dropped on aligned hyp word "is"; Location.mismatch: no clear info of matching: be(X, prep_in); RootEntailment.poorlyAlignedRoot: "is" aligned badly to "took"; Structure.relMismatch: text "Baghdad" is prep_to of "took" while hyp "Baghdad" is prep_in of "is" which aligned to text "took"
Hand-tuned score: -3.5000
Threshold: -11.4590


Inference ID: ATM2-2

Txt: John took the plane from Paris to Baghdad. On the way the plane stopped in Rome, where John was arrested.

Hyp: John is WH[LOCATION] . (Rome)

John
NNP
is
VBZ
WH[LOCATION]
JJ
.
.
John:NNP   0.00 14.84   0.00 20.50
took:VBD 15.50   4.35 17.50 18.92
the:DT 20.50 20.00 17.50 10.00
plane:NN   8.53 14.34   0.00 19.75
Paris:NNP 14.34 14.84   0.00 20.50
Baghdad:NNP 14.34 14.84   0.00 20.50
.:. 20.50 20.00 17.50   0.00
the:DT 20.50 20.00 17.50 10.00
way:NN   7.97 14.34   0.00 16.60
the:DT 20.50 20.00 17.50 10.00
plane:NN   8.53 14.34   0.00 19.75
stopped:VBD 12.56   9.34 17.50 18.36
Rome:NNP 14.34 14.84   0.00 20.50
,:, 20.50 20.00 17.50   5.73
where:WRB 20.46 19.96 17.50 10.00
John:NNP   0.00 14.84   0.00 20.50
was:VBD 14.84   0.50 17.50 20.00
arrested:VBN 15.50 10.00 17.50 20.00
.:. 20.50 20.00 17.50   0.00
NO_WORD 10.00   1.00   9.00 10.00

Response: Paris (INCORRECT)
Justification:
Alignment score: -5.0000
Features matched: NullPunisher.aux: is; Structure.argsMismatch: args have different parents but same relations: text "John" <-nsubj-- "took vs. hyp "John" <-nsubj-- "WH[LOCATION]", which aligned to text "Paris" args have different parents but same relations: text "." <-punct-- "took vs. hyp "." <-punct-- "WH[LOCATION]", which aligned to text "Paris"
Hand-tuned score: -2.0500
Threshold: -11.4590


Inference ID: ATM2-3

Txt: John took the plane from Paris to Baghdad. On the way the plane stopped in Rome, where John was arrested.

Hyp: John did pass through the Rome airport customs . (maybe)

John
NNP
did
VBD
pass
VB
the
DT
Rome
NNP
airport
NN
customs
NNS
.
.
John:NNP   0.00 13.35 12.27 20.50 14.34   8.53 10.50 20.50
took:VBD 15.50   6.90   4.98 20.00 15.50 15.00 11.96 18.92
the:DT 20.50 20.00 20.00   0.00 20.50 20.00 20.00 10.00
plane:NN   8.53 12.45 12.44 20.00   9.84   4.07   7.02 19.75
Paris:NNP 14.34 15.50 13.02 20.50   3.98   9.61 10.50 20.50
Baghdad:NNP 14.34 15.50 13.02 20.50   3.98   9.84 10.50 20.50
.:. 20.50 17.99 18.93 10.00 20.50 20.00 20.00   0.00
the:DT 20.50 20.00 20.00   0.00 20.50 20.00 20.00 10.00
way:NN   7.97 12.53 10.78 20.00   8.02   8.03   6.96 16.60
the:DT 20.50 20.00 20.00   0.00 20.50 20.00 20.00 10.00
plane:NN   8.53 12.45 12.44 20.00   9.84   4.07   7.02 19.75
stopped:VBD 12.56   4.55   6.89 20.00 13.02 12.72 13.72 18.36
Rome:NNP 14.34 15.50 13.02 20.50   0.00   9.70 10.50 20.50
,:, 20.50 19.80 20.00 10.00 20.50 20.00 19.94   5.73
where:WRB 20.46 19.96 19.96 10.00 20.46 19.96 19.96 10.00
John:NNP   0.00 13.35 12.27 20.50 14.34   8.53 10.50 20.50
was:VBD 14.84 10.00   7.52 20.00 11.83 14.34 15.00 20.00
arrested:VBN 15.50   9.94   5.88 20.00 15.50 12.52 11.82 20.00
.:. 20.50 17.99 18.93 10.00 20.50 20.00 20.00   0.00
NO_WORD 10.00   1.00 10.00   1.00 10.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -23.0067
Features matched: Adjunct.dropPosCxt: text adjunct "Baghdad" of "took" dropped on aligned hyp word "pass"; NullPunisher.aux: did; RootEntailment.poorlyAlignedRoot: "pass" aligned badly to "took"; Structure.parentsMismatch: args have different parents, different relations: text "way" <-prep_on-- "stopped" vs. hyp "customs" <-prep_through-- "pass", which aligned to text "took"
Hand-tuned score: -3.5500
Threshold: -11.4590


Inference ID: ATM2-4

Txt: John took the plane from Paris to Baghdad. On the way the plane stopped in Rome, where John was arrested.

Hyp: The plane is in Baghdad . (yes)

The
DT
plane
NN
is
VBZ
Baghdad
NNP
.
.
John:NNP 20.50   8.53 14.84 14.34 20.50
took:VBD 20.00 12.38   4.35 15.50 18.92
the:DT   0.00 20.00 20.00 20.50 10.00
plane:NN 20.00   0.00 14.34   9.84 19.75
Paris:NNP 20.50   9.84 14.84   3.98 20.50
Baghdad:NNP 20.50   9.84 14.84   0.00 20.50
.:. 10.00 19.75 20.00 20.50   0.00
the:DT   0.00 20.00 20.00 20.50 10.00
way:NN 20.00   7.44 14.34   8.02 16.60
the:DT   0.00 20.00 20.00 20.50 10.00
plane:NN 20.00   0.00 14.34   9.84 19.75
stopped:VBD 20.00   9.68   9.34 13.02 18.36
Rome:NNP 20.50   9.84 14.84   3.98 20.50
,:, 10.00 19.91 20.00 20.50   5.73
where:WRB 10.00 19.96 19.96 20.46 10.00
John:NNP 20.50   8.53 14.84 14.34 20.50
was:VBD 20.00 14.34   0.50 11.83 20.00
arrested:VBN 20.00 12.44 10.00 15.50 20.00
.:. 10.00 19.75 20.00 20.50   0.00
NO_WORD   1.00 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -6.5000
Features matched: Factive.inPositiveEmbedding: embedded positive text; Location.mismatch: no clear info of matching: be(X, prep_in); Structure.argsMismatch: args have different parents but same relations: text "plane" <-nsubj-- "stopped vs. hyp "plane" <-nsubj-- "is", which aligned to text "was" args have different parents but same relations: text "." <-punct-- "took vs. hyp "." <-punct-- "is", which aligned to text "was" args have different parents, different relations: text "Baghdad" <-prep_to-- "took" vs. hyp "Baghdad" <-prep_in-- "is", which aligned to text "was"
Hand-tuned score: -3.0000
Threshold: -11.4590


Inference ID: ATM3-1

Txt: John is a US citizen. He is in Boston on Dec 1. That day he applied for the passport.

Hyp: He can go to Paris on Dec. 1 . (no)

He
PRP
can
MD
go
VB
Paris
NNP
Dec.
NNP
1
CD
.
.
John:NNP 12.50 12.58 13.58 14.34 14.96 23.69 20.50
is:VBZ 15.00 19.34   6.07 14.84 15.46 20.50 20.00
a:DT 20.00 10.00 20.00 20.50 20.50 20.50 10.00
US:NNP 12.50 19.84 14.84   6.33 14.96 25.00 20.50
citizen:NN 12.00 18.95 13.08   9.84 10.46 20.50 20.00
.:. 20.00 10.00 18.37 20.50 20.50 18.29   0.00
He:PRP   0.00 20.00 15.00 12.50 12.50 20.50 20.00
is:VBZ 15.00 19.34   6.07 14.84 15.46 20.50 20.00
Boston:NNP 12.50 19.84 14.84   4.34 14.96 25.00 20.50
Dec:NNP 12.00 17.84 12.23   8.02   6.21 18.34 20.00
1:CD 20.50 18.34 18.34 25.00 24.96   5.00 18.29
.:. 20.00 10.00 18.37 20.50 20.50 18.29   0.00
That:DT 20.00 10.00 20.00 20.50 20.50 20.50 10.00
day:NN 12.00 17.84   8.57   9.84 10.46 18.34 18.14
he:PRP   0.00 20.00 15.00 12.50 12.50 20.50 20.00
applied:VBD 15.00 20.00   8.07 15.50 15.46 16.93 19.34
the:DT 20.00 10.00 20.00 20.50 20.50 20.50 10.00
passport:NN 12.00 17.85 13.69 10.50 10.46 19.19 20.00
.:. 20.00 10.00 18.37 20.50 20.50 18.29   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -32.6266
Features matched: Date.hypDateModifierIns: extra modifiers in hypothesis, with not equivalent normalized forms, but matching heads: Dec.; Modal.yes: actual -> possible; NullPunisher.aux: can; Quant.contract: [1,1]; RootEntailment.poorlyAlignedRoot: "go" aligned badly to "is"; Structure.relMismatch: text "Boston" is prep_in of "is" while hyp "Paris" is prep_to of "go" which aligned to text "is"
Hand-tuned score: -0.0500
Threshold: -11.4590


Inference ID: ATM3-2

Txt: John is a US citizen. He is in Boston on Dec 1. That day he applied for the passport.

Hyp: He can go to Paris on Dec 14th . (maybe)

He
PRP
can
MD
go
VB
Paris
NNP
Dec
NNP
14th
CD
.
.
John:NNP 12.50 12.58 13.58 14.34   9.19 24.96 20.50
is:VBZ 15.00 19.34   6.07 14.84 14.34 20.46 20.00
a:DT 20.00 10.00 20.00 20.50 20.00 20.50 10.00
US:NNP 12.50 19.84 14.84   6.33   8.02 24.96 20.50
citizen:NN 12.00 18.95 13.08   9.84   9.34 19.27 20.00
.:. 20.00 10.00 18.37 20.50 20.00 20.45   0.00
He:PRP   0.00 20.00 15.00 12.50 12.00 20.50 20.00
is:VBZ 15.00 19.34   6.07 14.84 14.34 20.46 20.00
Boston:NNP 12.50 19.84 14.84   4.34   8.02 24.96 20.50
Dec:NNP 12.00 17.84 12.23   8.02   0.00 20.46 20.00
1:CD 20.50 18.34 18.34 25.00 18.34   5.00 18.29
.:. 20.00 10.00 18.37 20.50 20.00 20.45   0.00
That:DT 20.00 10.00 20.00 20.50 20.00 20.50 10.00
day:NN 12.00 17.84   8.57   9.84   7.23 20.46 18.14
he:PRP   0.00 20.00 15.00 12.50 12.00 20.50 20.00
applied:VBD 15.00 20.00   8.07 15.50 15.00 19.62 19.34
the:DT 20.00 10.00 20.00 20.50 20.00 20.50 10.00
passport:NN 12.00 17.85 13.69 10.50   7.53 19.24 20.00
.:. 20.00 10.00 18.37 20.50 20.00 20.45   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -26.4123
Features matched: Modal.yes: actual -> possible; NullPunisher.aux: can; Numeric.mismatch: NUMBER mismatch: '14.0' vs '1.0'; RootEntailment.poorlyAlignedRoot: "go" aligned badly to "is"; Structure.relMismatch: text "Boston" is prep_in of "is" while hyp "Paris" is prep_to of "go" which aligned to text "is"
Hand-tuned score: -6.0500
Threshold: -11.4590


Inference ID: ATM3-3

Txt: John is a US citizen. He is in Boston on Dec 1. That day he applied for the passport.

Hyp: He can go to Paris on the Jan 10th . (yes)

He
PRP
can
MD
go
VB
Paris
NNP
the
DT
Jan
NNP
10th
NN
.
.
John:NNP 12.50 12.58 13.58 14.34 20.50   8.69 10.46 20.50
is:VBZ 15.00 19.34   6.07 14.84 20.00 15.50 14.96 20.00
a:DT 20.00 10.00 20.00 20.50 10.00 20.50 20.00 10.00
US:NNP 12.50 19.84 14.84   6.33 20.50 15.00 10.46 20.50
citizen:NN 12.00 18.95 13.08   9.84 20.00 10.50   7.29 20.00
.:. 20.00 10.00 18.37 20.50 10.00 20.50 20.00   0.00
He:PRP   0.00 20.00 15.00 12.50 20.00 12.50 12.00 20.00
is:VBZ 15.00 19.34   6.07 14.84 20.00 15.50 14.96 20.00
Boston:NNP 12.50 19.84 14.84   4.34 20.50 15.00 10.46 20.50
Dec:NNP 12.00 17.84 12.23   8.02 20.00   5.07   9.96 20.00
1:CD 20.50 18.34 18.34 25.00 20.50 22.84 20.46 18.29
.:. 20.00 10.00 18.37 20.50 10.00 20.50 20.00   0.00
That:DT 20.00 10.00 20.00 20.50 10.00 20.50 20.00 10.00
day:NN 12.00 17.84   8.57   9.84 20.00   7.73   7.42 18.14
he:PRP   0.00 20.00 15.00 12.50 18.00 12.50 12.00 20.00
applied:VBD 15.00 20.00   8.07 15.50 20.00 15.50 14.77 19.34
the:DT 20.00 10.00 20.00 20.50   0.00 20.50 20.00 10.00
passport:NN 12.00 17.85 13.69 10.50 20.00   9.19   8.29 20.00
.:. 20.00 10.00 18.37 20.50 10.00 20.50 20.00   0.00
NO_WORD 10.00 10.00 10.00 10.00   1.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -38.7711
Features matched: Adjunct.dropPosCxt: text adjunct "1" of "Dec" dropped on aligned hyp word "Jan"; Modal.yes: actual -> possible; NullPunisher.aux: can; RootEntailment.poorlyAlignedRoot: "go" aligned badly to "is"; Structure.relMismatch: text "Boston" is prep_in of "is" while hyp "Paris" is prep_to of "go" which aligned to text "is"
Hand-tuned score: 0.4500
Threshold: -11.4590


Inference ID: ATM4-1

Txt: John, who always carries his laptop with him, took a flight from Boston to Paris on the morning of Dec 11th.

Hyp: John 's laptop is In WH[city] city on the evening of Dec 10th . (Boston)

John
NNP
laptop
NN
is
VBZ
WH[city]
JJ
city
NN
the
DT
evening
NN
Dec
JJ
10th
NN
.
.
John:NNP   0.00   8.53 14.84   0.00   9.84 20.50   9.19 15.69 10.46 20.50
,:, 20.50 19.79 20.00 20.00 20.00 10.00 19.54 20.50 20.00   5.73
who:WP 12.50 12.00 15.00 20.00 12.00 20.00 12.00 15.50 12.00 20.00
always:RB 15.46 14.96 19.96   0.00 14.96 20.00 14.96 12.46 14.96 20.00
carries:VBZ 15.50 13.35   7.74   0.00 15.00 20.00 14.78 12.50 13.38 20.00
his:PRP$ 12.50 12.00 13.00 20.00 12.00 20.00 12.00 15.50 12.00 20.00
laptop:NN   8.53   0.00 14.34   0.00   9.34 20.00 10.00 11.84   9.02 20.00
him:PRP 12.50 12.00 15.00 20.00 12.00 20.00 12.00 15.50 12.00 20.00
,:, 20.50 19.79 20.00 20.00 20.00 10.00 19.54 20.50 20.00   5.73
took:VBD 15.50 15.00   4.35   0.00 15.00 20.00 12.98 12.50 10.97 18.92
a:DT 20.50 20.00 20.00 20.00 20.00 10.00 20.00 20.50 20.00 10.00
flight:NN   8.32   8.03 14.34   0.00   7.81 20.00   6.55 11.84   8.32 20.00
Boston:NNP 14.34   9.84 14.84   0.00   5.90 20.50 10.50 14.52 10.46 20.50
Paris:NNP 14.34   9.84 14.84   0.00   5.90 20.50 10.50 14.52 10.46 20.50
the:DT 20.50 20.00 20.00 20.00 20.00   0.00 20.00 20.50 20.00 10.00
morning:NN 12.82 10.50 15.50   0.00 10.35 20.50   2.96 14.23 10.15 18.27
Dec:NNP 13.69   9.84 14.84   0.00   8.02 20.50   7.73   0.00 10.46 20.50
11th:JJ 12.46 11.58 11.96   0.00 11.96 20.00 11.01 10.46   3.13 20.00
.:. 20.50 20.00 20.00 20.00 19.66 10.00 19.42 20.50 20.00   0.00
NO_WORD 10.00 10.00 10.00   9.00 10.00   1.00 10.00   9.00 10.00 10.00

Response: John (INCORRECT)
Justification:
Alignment score: -29.3381
Features matched: Adjunct.dropPosCxt: text adjunct "carries" of "John" dropped on aligned hyp word "WH[city]"; Location.mismatch: no clear info of matching: be(X, prep_in); RootEntailment.poorlyAlignedRoot: "is" aligned badly to "took"; Structure.relMismatch: noun args have different parents but same relations: "WH[city]": "carries" vs. "city" text "Paris" is prep_to of "took" while hyp "city" is prep_in of "is" which aligned to text "took"
Hand-tuned score: -3.5000
Threshold: -11.4590


Inference ID: ATM4-2

Txt: John, who always carries his laptop with him, took a flight from Boston to Paris on the morning of Dec 11th.

Hyp: John 's laptop is In WH[city] city on the evening of Dec 11th . (Paris)

John
NNP
laptop
NN
is
VBZ
WH[city]
JJ
city
NN
the
DT
evening
NN
Dec
NNP
11th
JJ
.
.
John:NNP   0.00   8.53 14.84   0.00   9.84 20.50   9.19 13.69 12.46 20.50
,:, 20.50 19.79 20.00 20.00 20.00 10.00 19.54 20.50 19.94   5.73
who:WP 12.50 12.00 15.00 20.00 12.00 20.00 12.00 12.50 15.00 20.00
always:RB 15.46 14.96 19.96   0.00 14.96 20.00 14.96 15.46 11.96 20.00
carries:VBZ 15.50 13.35   7.74   0.00 15.00 20.00 14.78 15.50 10.10 20.00
his:PRP$ 12.50 12.00 13.00 20.00 12.00 20.00 12.00 12.50 15.00 20.00
laptop:NN   8.53   0.00 14.34   0.00   9.34 20.00 10.00   9.84 11.58 20.00
him:PRP 12.50 12.00 15.00 20.00 12.00 20.00 12.00 12.50 15.00 20.00
,:, 20.50 19.79 20.00 20.00 20.00 10.00 19.54 20.50 19.94   5.73
took:VBD 15.50 15.00   4.35   0.00 15.00 20.00 12.98 15.50   8.31 18.92
a:DT 20.50 20.00 20.00 20.00 20.00 10.00 20.00 20.50 20.00 10.00
flight:NN   8.32   8.03 14.34   0.00   7.81 20.00   6.55   9.84 10.85 20.00
Boston:NNP 14.34   9.84 14.84   0.00   5.90 20.50 10.50 12.52 12.46 20.50
Paris:NNP 14.34   9.84 14.84   0.00   5.90 20.50 10.50 12.52 12.46 20.50
the:DT 20.50 20.00 20.00 20.00 20.00   0.00 20.00 20.50 20.00 10.00
morning:NN 12.82 10.50 15.50   0.00 10.35 20.50   2.96 12.23 12.08 18.27
Dec:NNP 13.69   9.84 14.84   0.00   8.02 20.50   7.73   0.00 12.46 20.50
11th:JJ 12.46 11.58 11.96   0.00 11.96 20.00 11.01 12.46   0.00 20.00
.:. 20.50 20.00 20.00 20.00 19.66 10.00 19.42 20.50 20.00   0.00
NO_WORD 10.00 10.00 10.00   9.00 10.00   1.00 10.00 10.00   9.00 10.00

Response: laptop (INCORRECT)
Justification:
Alignment score: -24.2064
Features matched: Adjunct.dropPosCxt: text adjunct "Boston" of "took" dropped on aligned hyp word "is"; RootEntailment.poorlyAlignedRoot: "is" aligned badly to "took"; Structure.argsMismatch: args have different parents but same relations: text "11th" <-advmod-- "Dec vs. hyp "11th" <-advmod-- "is", which aligned to text "took" args have different parents, different relations: text "laptop" <-dobj-- "carries" vs. hyp "laptop" <-nsubj-- "is", which aligned to text "took" noun args have different parents but same relations: "WH[city]": "carries" vs. "city" text "Paris" is prep_to of "took" while hyp "city" is prep_in of "is" which aligned to text "took"
Hand-tuned score: -3.5000
Threshold: -11.4590


Inference ID: ATM5-1

Txt: John, who travels abroad often, is at home in Boston and receives a call that he must immediately go to Paris.

Hyp: He just can get on a plane and fly to Paris . (No)

He
PRP
just
RB
can
MD
get
VB
a
DT
plane
NN
fly
VB
Paris
NNP
.
.
John:NNP 12.50 15.46 12.58 15.50 20.50   8.53 13.53 14.34 20.50
,:, 20.00 17.47 10.00 19.64 10.00 19.91 19.45 20.50   5.73
who:WP 10.00 20.00 18.57 15.00 20.00 12.00 15.00 12.50 20.00
travels:VBZ 15.00 19.96 20.00 10.00 20.00 12.45   3.21 15.50 20.00
abroad:RB 15.00   9.96 19.96 19.74 20.00 14.96 18.35 15.46 19.14
often:RB 15.00   9.96 19.96 19.96 20.00 14.96 19.96 15.46 20.00
,:, 20.00 17.47 10.00 19.64 10.00 19.91 19.45 20.50   5.73
is:VBZ 15.00 19.96 19.34   5.51 20.00 14.34   9.34 14.84 20.00
at_home:IN 20.00 20.00 20.00 20.00 20.00 20.00 20.00 20.50 20.00
Boston:NNP 12.50 15.46 19.84 15.50 20.50   9.84 14.84   4.34 20.50
receives:VBZ 15.00 19.96 20.00   4.73 20.00 12.45   7.45 15.50 20.00
a:DT 20.00 20.00 10.00 20.00   0.00 20.00 20.00 20.50 10.00
call:NN 12.00 14.79 15.67 12.62 20.00   7.12 12.12   9.84 19.11
that:IN 20.00 20.00 20.00 20.00 20.00 20.00 20.00 20.50 20.00
he:PRP   0.00 20.00 20.00 15.00 20.00 12.00 15.00 12.50 20.00
must:MD 20.00 15.00   6.88 20.00 10.00 17.53 19.34 19.84 10.00
immediately:RB 15.00   9.96 19.96 19.92 20.00 11.35 18.86 15.46 16.49
go:VB 15.00 14.23 17.84   0.00 20.00 12.45   1.00 14.84 18.37
Paris:NNP 12.50 15.46 19.84 15.50 20.50   9.84 14.84   0.00 20.50
.:. 20.00 16.65 10.00 17.78 10.00 19.75 19.77 20.50   0.00
NO_WORD 10.00   9.00 10.00 10.00   1.00 10.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -36.6734
Features matched: Adjunct.addPosCxt: hyp added just[just-RB]; Adjunct.dropPosCxt: text adjunct "often" of "travels" dropped on aligned hyp word "fly"; Factive.inPositiveEmbedding: embedded positive text; Modal.yes: necessary -> possible; NullPunisher.other: just; Quant.contract: [a,a]; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "is vs. hyp "." <-punct-- "get", which aligned to text "go" args have different parents, different relations: text "call" <-dobj-- "receives" vs. hyp "plane" <-prep_on-- "get", which aligned to text "go" args have different parents, different relations: text "travels" <-rcmod-- "John" vs. hyp "fly" <-conj_and-- "get", which aligned to text "go"
Hand-tuned score: 0.5000
Threshold: -11.4590


Inference ID: ATM5-2

Txt: John, who travels abroad often, is at home in Boston and receives a call that he must immediately go to Paris.

Hyp: He does need to do to be in Paris WH[] . (buy a ticket; get to the airport; get on the plane)

He
PRP
does
VBZ
need
VB
to
TO
do
VB
to
TO
be
VB
Paris
NNP
WH[]
NNP
.
.
John:NNP 12.50 13.61 13.40 20.50 13.35 20.50 14.84 14.34   0.00 20.50
,:, 20.00 19.26 19.63 10.00 19.52 10.00 20.00 20.50 20.00   5.73
who:WP 10.00 15.00 15.00 20.00 15.00 20.00 15.00 12.50 20.00 20.00
travels:VBZ 15.00   9.01 10.00 20.00   7.45 20.00 10.00 15.50   0.00 20.00
abroad:RB 15.00 19.96 19.08 20.00 19.96 20.00 19.96 15.46   0.00 19.14
often:RB 15.00 19.96 19.96 20.00 19.96 20.00 19.96 15.46   0.00 20.00
,:, 20.00 19.26 19.63 10.00 19.52 10.00 20.00 20.50 20.00   5.73
is:VBZ 15.00   9.34   8.33 20.00   6.07 20.00   0.31 14.84   0.00 20.00
at_home:IN 20.00 20.00 20.00 20.00 20.00 20.00 20.00 20.50 20.00 20.00
Boston:NNP 12.50 14.84 14.84 20.50 15.50 20.50 14.84   4.34   0.00 20.50
receives:VBZ 15.00   7.26   8.66 20.00   7.45 20.00 10.00 15.50   0.00 20.00
a:DT 20.00 20.00 20.00 10.00 20.00 10.00 20.00 20.50 20.00 10.00
call:NN 12.00 13.95 12.90 20.00 10.67 20.00 12.74   9.84   0.00 19.11
that:IN 20.00 20.00 20.00 20.00 20.00 20.00 20.00 20.50 20.00 20.00
he:PRP   0.00 15.00 15.00 20.00 15.00 20.00 15.00 12.50 20.00 20.00
must:MD 20.00 19.34 13.41 10.00 17.53 10.00 17.34 19.84   0.00 10.00
immediately:RB 15.00 19.96 19.96 20.00 19.96 20.00 19.96 15.46   0.00 16.49
go:VB 15.00   8.25   5.56 20.00   1.00 20.00   6.07 14.84   0.00 18.37
Paris:NNP 12.50 13.63 14.84 20.50 15.50 20.50 14.84   0.00   0.00 20.50
.:. 20.00 20.00 18.42 10.00 18.81 10.00 20.00 20.50 20.00   0.00
NO_WORD 10.00   1.00 10.00 10.00 10.00 10.00 10.00 10.00 10.00 10.00

Response: Boston (INCORRECT)
Justification:
Alignment score: -41.1957
Features matched: Adjunct.dropPosCxt: text adjunct "at_home" of "is" dropped on aligned hyp word "be"; Factive.inPositiveEmbedding: embedded positive text; Modal.yes: necessary -> actual; NullPunisher.functionWord: to; NullPunisher.aux: does; RootEntailment.poorlyAlignedRoot: "need" aligned badly to "go"; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "is vs. hyp "." <-punct-- "need", which aligned to text "go" args have different parents, different relations: text "go" <-ccomp-- "receives" vs. hyp "do" <-xcomp-- "need", which aligned to text "go" args have different parents, different relations: text "go" <-rcmod-- "call" vs. hyp "do" <-xcomp-- "need", which aligned to text "go"
Hand-tuned score: -0.6500
Threshold: -11.4590


Inference ID: ATM6-1

Txt: John spent December 10 in Paris and took a plane to Baghdad the next morning. He was planning to meet Bob who was waiting for him there.

Hyp: John was in the Middle_East in mid-December . (yes)

John
NNP
was
VBD
the
DT
Middle_East
NNP
mid-December
NN
.
.
John:NNP   0.00 14.84 20.50   9.23 13.69 20.50
spent:VBD 15.50 10.00 20.00 15.00 15.50 20.00
December:NNP 13.69 15.50 20.50   9.42   2.00 20.50
10:CD 23.69 20.50 20.50 19.42 17.84 19.16
Paris:NNP 14.34 11.83 20.50   6.79 15.00 20.50
took:VBD 15.50 10.00 20.00 13.26 15.50 18.92
a:DT 20.50 20.00 10.00 20.00 20.50 10.00
plane:NN   8.53 14.34 20.00   8.82   9.19 19.75
Baghdad:NNP 14.34 11.83 20.50   6.79 15.00 20.50
the:DT 20.50 20.00   0.00 20.00 20.50 10.00
next:JJ 12.46 11.96 20.00 11.96 12.46 20.00
morning:NN   8.32 15.00 20.00   8.91   7.73 17.77
.:. 20.50 20.00 10.00 20.00 20.50   0.00
He:PRP 12.50 15.00 20.00 12.00 12.50 20.00
was:VBD 14.84   0.00 20.00 12.11 15.50 20.00
planning:VBG 13.32 10.00 20.00 13.91 15.50 19.49
to:TO 20.50 20.00 10.00 20.00 20.50 10.00
meet:VB 15.50 10.00 20.00 15.00 15.50 19.52
Bob:NNP   8.03 14.84 20.50   8.84 12.84 20.50
who:WP 12.50 15.00 20.00 12.00 12.50 20.00
was:VBD 14.84   0.00 20.00 12.11 15.50 20.00
waiting:VBG 15.50 10.00 20.00 15.00 15.50 18.98
him:PRP 12.50 15.00 20.00 12.00 12.50 20.00
there:RB 14.84 17.52 20.00 12.52 15.50 20.00
.:. 20.50 20.00 10.00 20.00 20.50   0.00
NO_WORD 10.00 10.00   1.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -20.7913
Features matched: Adjunct.dropPosCxt: text adjunct "10" of "December" dropped on aligned hyp word "mid-December"; Date.dateHeadMismatch: mid-December vs. December; Location.mismatch: no clear info of matching: be(X, prep_in); NullPunisher.article: the; Structure.argsMismatch: args have different parents but same relations: text "John" <-nsubj-- "spent vs. hyp "John" <-nsubj-- "was", which aligned to text "was" args have different parents but same relations: text "." <-punct-- "planning vs. hyp "." <-punct-- "was", which aligned to text "was" args have different parents, different relations: text "Baghdad" <-prep_to-- "took" vs. hyp "Middle_East" <-prep_in-- "was", which aligned to text "was"
Hand-tuned score: -6.6000
Threshold: -11.4590


Inference ID: ATM6-2

Txt: John spent December 10 in Paris and took a plane to Baghdad the next morning. He was planning to meet Bob who was waiting for him there.

Hyp: He did meet Bob in the Middle_East in mid-December . (yes)

He
PRP
did
VBD
meet
VB
Bob
NNP
the
DT
Middle_East
NNP
mid-December
NN
.
.
John:NNP 12.50 13.35 15.50   8.03 20.50   9.23 13.69 20.50
spent:VBD 15.00   4.70   7.02 15.50 20.00 15.00 15.50 20.00
December:NNP 12.50 14.19 15.50 12.84 20.50   9.42   2.00 20.50
10:CD 20.50 19.19 20.50 21.46 20.50 19.42 17.84 19.16
Paris:NNP 12.50 15.50 15.50 14.34 20.50   6.79 15.00 20.50
took:VBD 15.00   6.90   7.61 12.33 20.00 13.26 15.50 18.92
a:DT 20.00 20.00 20.00 20.50 10.00 20.00 20.50 10.00
plane:NN 12.00 12.45 12.61   5.53 20.00   8.82   9.19 19.75
Baghdad:NNP 12.50 15.50 15.50 14.34 20.50   6.79 15.00 20.50
the:DT 20.00 20.00 20.00 20.50   0.00 20.00 20.50 10.00
next:JJ 15.00 11.96 11.96 12.46 20.00 11.96 12.46 20.00
morning:NN 12.00 12.69 12.47   8.34 20.00   8.91   7.73 17.77
.:. 20.00 17.99 19.52 20.50 10.00 20.00 20.50   0.00
He:PRP   0.00 15.00 15.00 12.50 20.00 12.00 12.50 20.00
was:VBD 15.00 10.00 10.00 14.84 20.00 12.11 15.50 20.00
planning:VBG 15.00 10.00   6.47 13.85 20.00 13.91 15.50 19.49
to:TO 20.00 20.00 20.00 20.50 10.00 20.00 20.50 10.00
meet:VB 15.00   5.11   0.00 15.50 20.00 15.00 15.50 19.52
Bob:NNP 12.50   7.56 15.50   0.00 20.50   8.84 12.84 20.50
who:WP 10.00 15.00 15.00 12.50 20.00 12.00 12.50 20.00
was:VBD 15.00 10.00 10.00 14.84 20.00 12.11 15.50 20.00
waiting:VBG 15.00 10.00   7.30 13.85 20.00 15.00 15.50 18.98
him:PRP   0.50 15.00 15.00 12.50 20.00 12.00 12.50 20.00
there:RB 15.00 20.00 20.00 14.84 20.00 12.52 15.50 20.00
.:. 20.00 17.99 19.52 20.50 10.00 20.00 20.50   0.00
NO_WORD 10.00   1.00 10.00 10.00   1.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -21.7913
Features matched: Adjunct.dropPosCxt: text adjunct "10" of "December" dropped on aligned hyp word "mid-December"; Date.dateHeadMismatch: mid-December vs. December; NullPunisher.aux: did; NullPunisher.article: the; Structure.argsMismatch: args have different parents but same relations: text "He" <-nsubj-- "planning vs. hyp "He" <-nsubj-- "meet", which aligned to text "meet" args have different parents but same relations: text "." <-punct-- "planning vs. hyp "." <-punct-- "meet", which aligned to text "meet" args have different parents, different relations: text "Bob" <-nsubj-- "waiting" vs. hyp "Bob" <-dobj-- "meet", which aligned to text "meet" args have different parents, different relations: text "December" <-tmod-- "spent" vs. hyp "mid-December" <-prep_in-- "meet", which aligned to text "meet"
Hand-tuned score: -4.6500
Threshold: -11.4590


Inference ID: ATM7-1

Txt: President Bush attended Pope John Paul II's funeral in Rome on the Morning of April 8.

Hyp: President_Bush was in Rome on April 8 . (yes)

President_Bush
NNP
was
VBD
Rome
NNP
April
NNP
8
CD
.
.
President_Bush:NNP   0.00 13.56   8.65 10.50 20.50 20.00
attended:VBD 15.00 10.00 15.50 15.50 20.16 19.97
Pope_John_Paul_II:NNP   7.34 15.17 14.67 13.92 22.39 20.50
funeral:NN 10.00 15.00 10.50 10.50 20.50 19.15
Rome:NNP   8.65 11.83   0.00 15.00 25.00 20.50
the:DT 20.00 20.00 20.50 20.50 20.50 10.00
Morning:NN 10.00 15.00 10.50   7.73 18.34 20.00
April:NNP 10.50 15.50 15.00   0.00 17.84 20.50
8:CD 20.50 20.50 25.00 17.84   0.00 19.96
.:. 20.00 20.00 20.50 20.50 19.96   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -12.0000
Features matched: Adjunct.dropPosCxt: text adjunct "Morning" of "attended" dropped on aligned hyp word "was"; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 04/08/1000; Location.mismatch: no clear info of matching: be(X, prep_in); RootEntailment.poorlyAlignedRoot: "was" aligned badly to "attended"
Hand-tuned score: -1.5000
Threshold: -11.4590


Inference ID: ATM7-2

Txt: President Bush attended Pope John Paul II's funeral in Rome on the Morning of April 8.

Hyp: President_Bush was in Texas on the afternoon of April 8 . . (probably not)

President_Bush
NNP
was
VBD
Texas
NNP
the
DT
afternoon
NN
April
NNP
8
CD
.
.
.
.
President_Bush:NNP   0.00 13.56   9.06 20.00 10.50 10.50 20.50 20.00 20.00
attended:VBD 15.00 10.00 15.50 20.00 13.14 15.50 20.16 19.97 19.97
Pope_John_Paul_II:NNP   7.34 15.17 14.67 20.50 13.92 13.92 22.39 20.50 20.50
funeral:NN 10.00 15.00 10.50 20.00   6.86 10.50 20.50 19.15 19.15
Rome:NNP   8.65 11.83   6.33 20.50 15.00 15.00 25.00 20.50 20.50
the:DT 20.00 20.00 20.50   0.00 20.50 20.50 20.50 10.00 10.00
Morning:NN 10.00 15.00 10.50 20.00   3.03   7.73 18.34 20.00 20.00
April:NNP 10.50 15.50 15.00 20.50 12.23   0.00 17.84 20.50 20.50
8:CD 20.50 20.50 25.00 20.50 22.57 17.84   0.00 19.96 19.96
.:. 20.00 20.00 20.50 10.00 18.61 20.50 19.96   0.00   0.00
NO_WORD 10.00 10.00 10.00   1.00 10.00 10.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -19.3628
Features matched: Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 04/08/1000; RootEntailment.poorlyAlignedRoot: "was" aligned badly to "attended"
Hand-tuned score: 0.0000
Threshold: -11.4590


Inference ID: ATM8-1

Txt: A rainstorm caused cancellation of the important meeting on Thursday. Friday had beautiful weather.

Hyp: The meeting did take_place on Thursday . (no)

The
DT
meeting
NN
did
VBD
take_place
NN
Thursday
NNP
.
.
A:DT 10.00 20.00 20.00 20.00 20.50 10.00
rainstorm:NN 20.00   9.74 14.83 10.00 10.50 20.00
caused:VBD 20.00 11.89   6.26 13.32 14.19 19.20
cancellation:NN 20.00   6.90 12.85   8.92   9.19 20.00
the:DT   0.00 20.00 20.00 20.00 20.50 10.00
important:JJ 20.00 11.96 11.96 11.96 12.46 18.05
meeting:NN 20.00   0.00 12.69   7.73   9.81 19.92
Thursday:NNP 20.50   9.81 14.19   9.84   0.00 20.50
.:. 10.00 19.92 17.99 20.00 20.50   0.00
Friday:NNP 20.50   9.83 14.19   9.84   3.93 20.50
had:VBD 20.00 14.34   7.32 10.92 15.50 20.00
beautiful:JJ 20.00 11.96 11.96 11.96 12.46 18.77
weather:NN 20.00 10.00 12.45   8.72 10.50 20.00
.:. 10.00 19.92 17.99 20.00 20.50   0.00
NO_WORD   1.00 10.00 10.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -17.1834
Features matched: Adjunct.dropPosCxt: text adjunct "important" of "meeting" dropped on aligned hyp word "meeting"; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; RootEntailment.poorlyAlignedRoot: "did" aligned badly to "caused"; Structure.parentsMismatch: args have different parents, different relations: text "meeting" <-prep_of-- "cancellation" vs. hyp "meeting" <-nsubj-- "did", which aligned to text "caused"
Hand-tuned score: -2.5000
Threshold: -11.4590


Inference ID: ATM8-2

Txt: A rainstorm caused cancellation of the important meeting on Thursday. Friday had beautiful weather.

Hyp: The meeting did take_place on Friday . (possibly)

The
DT
meeting
NN
did
VBD
take_place
NN
Friday
NNP
.
.
A:DT 10.00 20.00 20.00 20.00 20.50 10.00
rainstorm:NN 20.00   9.74 14.83 10.00 10.50 20.00
caused:VBD 20.00 11.89   6.26 13.32 14.19 19.20
cancellation:NN 20.00   6.90 12.85   8.92   9.19 20.00
the:DT   0.00 20.00 20.00 20.00 20.50 10.00
important:JJ 20.00 11.96 11.96 11.96 12.46 18.05
meeting:NN 20.00   0.00 12.69   7.73   9.83 19.92
Thursday:NNP 20.50   9.81 14.19   9.84   3.93 20.50
.:. 10.00 19.92 17.99 20.00 20.50   0.00
Friday:NNP 20.50   9.83 14.19   9.84   0.00 20.50
had:VBD 20.00 14.34   7.32 10.92 15.50 20.00
beautiful:JJ 20.00 11.96 11.96 11.96 12.46 18.77
weather:NN 20.00 10.00 12.45   8.72 10.50 20.00
.:. 10.00 19.92 17.99 20.00 20.50   0.00
NO_WORD   1.00 10.00 10.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -19.1834
Features matched: Adjunct.dropPosCxt: text adjunct "Thursday" of "caused" dropped on aligned hyp word "did"; Date.matchDatesByNormForm: hyp/txt matching, by normalized form: 01/01/1000; RootEntailment.poorlyAlignedRoot: "did" aligned badly to "caused"; Structure.parentsMismatch: args have different parents, different relations: text "meeting" <-prep_of-- "cancellation" vs. hyp "meeting" <-nsubj-- "did", which aligned to text "caused" args have different parents, different relations: text "Friday" <-nsubj-- "had" vs. hyp "Friday" <-prep_on-- "did", which aligned to text "caused"
Hand-tuned score: -2.5000
Threshold: -11.4590


Inference ID: ATM9-1

Txt: A CH-47 Chinook, carrying 10 people, crashed near Ghazni while returning to Bagram Air Base near the capital. The pilot perished in the crash.

Hyp: The Chinook did arrive at Bagram Air_Base . (no)

The
DT
Chinook
NNP
did
VBD
arrive
VB
Bagram
NNP
Air_Base
NNP
.
.
A:DT 10.00 20.50 20.00 20.00 20.50 20.00 10.00
CH-47:NNP 20.50   9.96 15.46 15.46 14.96 10.46 20.50
Chinook:NNP 20.50   0.00 13.35 15.50 14.96   7.35 20.50
,:, 10.00 20.50 19.80 20.00 20.50 20.00   5.73
carrying:VBG 20.00 15.50   5.52   7.55 15.46 12.15 18.98
10:CD 20.50 23.69 19.19 19.57 24.96 18.36 19.16
people:NNS 20.00 10.50 12.62 13.85 10.46   8.81 17.46
,:, 10.00 20.50 19.80 20.00 20.50 20.00   5.73
crashed:VBD 20.00 15.50   7.45   7.82 15.46 13.69 20.00
Ghazni:NNP 20.50 14.96 15.46 15.46   9.96 10.46 20.50
while:IN 20.00 20.50 20.00 20.00 20.50 20.00 20.00
returning:VBG 20.00 13.35   7.45   6.32 15.46 12.22 19.78
Bagram_Air_Base:NNP 20.50 11.85 13.62 15.46   0.00   0.50 20.50
the:DT   0.00 20.50 20.00 20.00 20.50 20.00 10.00
capital:NN 20.00   8.35   9.55 15.00 10.46   7.93 20.00
.:. 10.00 20.50 17.99 20.00 20.50 20.00   0.00
The:DT   0.00 20.50 20.00 20.00 20.50 20.00 10.00
pilot:NN 20.00   8.35 11.36 12.37 10.46   6.98 20.00
perished:VBD 20.00 15.50   9.39   7.16 15.46 15.00 20.00
the:DT   0.00 20.50 20.00 20.00 20.50 20.00 10.00
crash:NN 20.00 10.50 12.45 13.04 10.46   8.69 20.00
.:. 10.00 20.50 17.99 20.00 20.50 20.00   0.00
NO_WORD   1.00 10.00   1.00 10.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -14.3201
Features matched: Adjunct.dropPosCxt: text adjunct "capital" of "Bagram_Air_Base" dropped on aligned hyp word "Air_Base"; NullPunisher.aux: did; NullPunisher.article: The; Quant.contract: [a,the]; RootEntailment.poorlyAlignedRoot: "arrive" aligned badly to "crashed"; Structure.parentsMismatch: args have different parents, different relations: text "Bagram_Air_Base" <-prep_to-- "returning" vs. hyp "Air_Base" <-prep_at-- "arrive", which aligned to text "crashed"
Hand-tuned score: -2.6500
Threshold: -11.4590


Inference ID: ATM9-2

Txt: A CH-47 Chinook, carrying 10 people, crashed near Ghazni while returning to Bagram Air Base near the capital. The pilot perished in the crash.

Hyp: The Pilot is alive . (no)

The
DT
Pilot
NN
is
VBZ
alive
JJ
.
.
A:DT 10.00 20.50 20.00 20.00 10.00
CH-47:NNP 20.50 14.96 15.46 12.46 20.50
Chinook:NNP 20.50 12.85 12.84 12.46 20.50
,:, 10.00 20.50 20.00 18.72   5.73
carrying:VBG 20.00 15.50   7.74 11.26 18.98
10:CD 20.50 23.69 20.50 20.30 19.16
people:NNS 20.00 10.50 11.70   8.31 17.46
,:, 10.00 20.50 20.00 18.72   5.73
crashed:VBD 20.00 11.83   6.70   9.44 20.00
Ghazni:NNP 20.50   9.96 15.46 12.46 20.50
while:IN 20.00 20.50 20.00 20.00 20.00
returning:VBG 20.00 12.62   5.51 10.01 19.78
Bagram_Air_Base:NNP 20.50   6.98 14.17 12.46 20.50
the:DT   0.00 20.50 20.00 20.00 10.00
capital:NN 20.00   8.35 14.34 11.96 20.00
.:. 10.00 20.50 20.00 18.91   0.00
The:DT   0.00 20.50 20.00 20.00 10.00
pilot:NN 20.00   0.50 14.34 10.78 20.00
perished:VBD 20.00 15.50 10.00   7.56 20.00
the:DT   0.00 20.50 20.00 20.00 10.00
crash:NN 20.00   6.83 11.70   8.86 20.00
.:. 10.00 20.50 20.00 18.91   0.00
NO_WORD   1.00 10.00   1.00   9.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -9.0598
Features matched: Adjunct.dropPosCxt: text adjunct "crash" of "perished" dropped on aligned hyp word "alive"; NullPunisher.aux: is; RootEntailment.poorlyAlignedRoot: "alive" aligned badly to "perished"
Hand-tuned score: -0.5500
Threshold: -11.4590


Inference ID: ATM9-3

Txt: A CH-47 Chinook, carrying 10 people, crashed near Ghazni while returning to Bagram Air Base near the capital. The pilot perished in the crash.

Hyp: WH[QUANTITY] many people survived . (at most 9)

WH[QUANTITY]
JJ
many
JJ
people
NNS
survived
VBD
.
.
A:DT 20.00 20.00 20.00 20.00 10.00
CH-47:NNP   0.00 12.46 10.46 15.46 20.50
Chinook:NNP   0.00 12.46 10.50 15.50 20.50
,:, 20.00 20.00 18.63 18.97   5.73
carrying:VBG   0.00 11.96 12.62 10.00 18.98
10:CD   0.00 20.46 20.50 20.41 19.16
people:NNS   0.00 11.96   0.00 15.00 17.46
,:, 20.00 20.00 18.63 18.97   5.73
crashed:VBD   0.00 11.96 10.02   8.32 20.00
Ghazni:NNP   0.00 12.46 10.46 15.46 20.50
while:IN 20.00 20.00 20.00 20.00 20.00
returning:VBG   0.00 11.96 15.00   9.94 19.78
Bagram_Air_Base:NNP   0.00 12.46   9.31 15.46 20.50
the:DT 20.00 20.00 20.00 20.00 10.00
capital:NN   0.00 11.96   9.07 15.00 20.00
.:. 20.00 20.00 17.46 20.00   0.00
The:DT 20.00 20.00 20.00 20.00 10.00
pilot:NN   0.00 11.96   8.79 15.00 20.00
perished:VBD   0.00 11.96 11.79   6.18 20.00
the:DT 20.00 20.00 20.00 20.00 10.00
crash:NN   0.00 11.96   5.02 14.59 20.00
.:. 20.00 20.00 17.46 20.00   0.00
NO_WORD   9.00   9.00 10.00 10.00 10.00

Response: Ghazni (INCORRECT)
Justification:
Alignment score: -19.1761
Features matched: Adjunct.addPosCxt: hyp added many[many-JJ]; Adjunct.dropPosCxt: text adjunct "crash" of "perished" dropped on aligned hyp word "survived"; NullPunisher.other: many; RootEntailment.poorlyAlignedRoot: "survived" aligned badly to "perished"; Structure.parentsMismatch: args have different parents, different relations: text "people" <-dobj-- "carrying" vs. hyp "people" <-nsubj-- "survived", which aligned to text "perished"
Hand-tuned score: -5.5000
Threshold: -11.4590


Inference ID: ATM10-1

Txt: A.Q. Khan of Pakistan sold plans for a nuclear bomb to Iran. He also sold the XYZ-11, the key part necessary for the trigger.

Hyp: Iran does have plans for a nuclear bomb . (yes)

Iran
NNP
does
VBZ
have
VB
plans
NNS
a
DT
nuclear
JJ
bomb
NN
.
.
A.Q._Khan:NNP 14.67 14.55 14.52   9.52 20.50 12.46   9.52 20.50
Pakistan:NNP   3.17 14.84 14.84   9.84 20.50 12.46   9.84 20.50
sold:VBD 15.50 10.00   5.68 12.55 20.00 11.45 12.69 19.42
plans:NNS   9.84 12.76 12.32   0.00 20.00 11.17   8.03 20.00
a:DT 20.50 20.00 20.00 20.00   0.00 20.00 20.00 10.00
nuclear:JJ 12.46 10.23 11.96 11.17 20.00   0.00   8.86 19.75
bomb:NN   9.84 13.13 13.95   8.03 20.00   8.86   0.00 20.00
Iran:NNP   0.00 14.84 14.84   9.84 20.50 12.46   9.84 20.50
.:. 20.50 20.00 20.00 20.00 10.00 19.75 20.00   0.00
He:PRP 12.50 15.00 15.00 12.00 20.00 15.00 12.00 20.00
also:RB 15.46 19.96 19.96 14.96 20.00 11.96 14.96 20.00
sold:VBD 15.50 10.00   5.68 12.55 20.00 11.45 12.69 19.42
the:DT 20.50 18.65 20.00 20.00 10.00 20.00 20.00 10.00
XYZ-11:NN 10.46 14.96 14.96   9.96 20.00 11.96   9.96 20.00
,:, 20.50 19.26 20.00 20.00 10.00 19.81 20.00   5.73
the:DT 20.50 18.65 20.00 20.00 10.00 20.00 20.00 10.00
key:JJ   9.61 10.11   9.62   9.82 20.00   9.68   8.40 19.53
part:NN   8.02 13.95 12.61   6.99 20.00 10.78   8.95 17.01
necessary:JJ 11.84   7.48 11.34 10.20 20.00   6.87 11.34 18.22
the:DT 20.50 18.65 20.00 20.00 10.00 20.00 20.00 10.00
trigger:NN   9.84 11.31 12.32   7.32 20.00 11.00   6.40 20.00
.:. 20.50 20.00 20.00 20.00 10.00 19.75 20.00   0.00
NO_WORD 10.00   1.00 10.00 10.00   1.00   9.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -10.6779
Features matched: NullPunisher.aux: does; Quant.contract: [a,a]; RootEntailment.poorlyAlignedRoot: "have" aligned badly to "sold"; Structure.relMismatch: text "Iran" is prep_to of "sold" while hyp "Iran" is nsubj of "have" which aligned to text "sold"
Hand-tuned score: -1.0500
Threshold: -11.4590


Inference ID: ATM10-2

Txt: A.Q. Khan of Pakistan sold plans for a nuclear bomb to Iran. He also sold the XYZ-11, the key part necessary for the trigger.

Hyp: Pakistan does have plans for a nuclear bomb . (yes)

Pakistan
NNP
does
VBZ
have
VB
plans
NNS
a
DT
nuclear
JJ
bomb
NN
.
.
A.Q._Khan:NNP 14.67 14.55 14.52   9.52 20.50 12.46   9.52 20.50
Pakistan:NNP   0.00 14.84 14.84   9.84 20.50 12.46   9.84 20.50
sold:VBD 15.50 10.00   5.68 12.55 20.00 11.45 12.69 19.42
plans:NNS   9.84 12.76 12.32   0.00 20.00 11.17   8.03 20.00
a:DT 20.50 20.00 20.00 20.00   0.00 20.00 20.00 10.00
nuclear:JJ 12.46 10.23 11.96 11.17 20.00   0.00   8.86 19.75
bomb:NN   9.84 13.13 13.95   8.03 20.00   8.86   0.00 20.00
Iran:NNP   3.17 14.84 14.84   9.84 20.50 12.46   9.84 20.50
.:. 20.50 20.00 20.00 20.00 10.00 19.75 20.00   0.00
He:PRP 12.50 15.00 15.00 12.00 20.00 15.00 12.00 20.00
also:RB 15.46 19.96 19.96 14.96 20.00 11.96 14.96 20.00
sold:VBD 15.50 10.00   5.68 12.55 20.00 11.45 12.69 19.42
the:DT 20.50 18.65 20.00 20.00 10.00 20.00 20.00 10.00
XYZ-11:NN 10.46 14.96 14.96   9.96 20.00 11.96   9.96 20.00
,:, 20.50 19.26 20.00 20.00 10.00 19.81 20.00   5.73
the:DT 20.50 18.65 20.00 20.00 10.00 20.00 20.00 10.00
key:JJ   9.61 10.11   9.62   9.82 20.00   9.68   8.40 19.53
part:NN   8.02 13.95 12.61   6.99 20.00 10.78   8.95 17.01
necessary:JJ 11.84   7.48 11.34 10.20 20.00   6.87 11.34 18.22
the:DT 20.50 18.65 20.00 20.00 10.00 20.00 20.00 10.00
trigger:NN   9.84 11.31 12.32   7.32 20.00 11.00   6.40 20.00
.:. 20.50 20.00 20.00 20.00 10.00 19.75 20.00   0.00
NO_WORD 10.00   1.00 10.00 10.00   1.00   9.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -10.6779
Features matched: Adjunct.dropPosCxt: text adjunct "Iran" of "sold" dropped on aligned hyp word "have"; NullPunisher.aux: does; Quant.contract: [a,a]; RootEntailment.poorlyAlignedRoot: "have" aligned badly to "sold"; Structure.parentsMismatch: args have different parents, different relations: text "Pakistan" <-prep_of-- "A.Q._Khan" vs. hyp "Pakistan" <-nsubj-- "have", which aligned to text "sold"
Hand-tuned score: -2.5500
Threshold: -11.4590


Inference ID: ATM10-3

Txt: A.Q. Khan of Pakistan sold plans for a nuclear bomb to Iran. He also sold the XYZ-11, the key part necessary for the trigger.

Hyp: Iran does have the XYZ-11 . (yes)

Iran
NNP
does
VBZ
have
VB
the
DT
XYZ-11
NN
.
.
A.Q._Khan:NNP 14.67 14.55 14.52 20.50 10.46 20.50
Pakistan:NNP   3.17 14.84 14.84 20.50 10.46 20.50
sold:VBD 15.50 10.00   5.68 20.00 14.96 19.42
plans:NNS   9.84 12.76 12.32 20.00   9.96 20.00
a:DT 20.50 20.00 20.00 10.00 20.00 10.00
nuclear:JJ 12.46 10.23 11.96 20.00 11.96 19.75
bomb:NN   9.84 13.13 13.95 20.00   9.96 20.00
Iran:NNP   0.00 14.84 14.84 20.50 10.46 20.50
.:. 20.50 20.00 20.00 10.00 20.00   0.00
He:PRP 12.50 15.00 15.00 20.00 12.00 20.00
also:RB 15.46 19.96 19.96 20.00 14.96 20.00
sold:VBD 15.50 10.00   5.68 20.00 14.96 19.42
the:DT 20.50 18.65 20.00   0.00 20.00 10.00
XYZ-11:NN 10.46 14.96 14.96 20.00   0.00 20.00
,:, 20.50 19.26 20.00 10.00 20.00   5.73
the:DT 20.50 18.65 20.00   0.00 20.00 10.00
key:JJ   9.61 10.11   9.62 20.00 11.96 19.53
part:NN   8.02 13.95 12.61 20.00   9.96 17.01
necessary:JJ 11.84   7.48 11.34 20.00 11.96 18.22
the:DT 20.50 18.65 20.00   0.00 20.00 10.00
trigger:NN   9.84 11.31 12.32 20.00   9.96 20.00
.:. 20.50 20.00 20.00 10.00 20.00   0.00
NO_WORD 10.00   1.00 10.00   1.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -8.6779
Features matched: Adjunct.dropPosCxt: text adjunct "part" of "XYZ-11" dropped on aligned hyp word "XYZ-11"; NullPunisher.aux: does; RootEntailment.poorlyAlignedRoot: "have" aligned badly to "sold"; Structure.relMismatch: text "Iran" is prep_to of "sold" while hyp "Iran" is nsubj of "have" which aligned to text "sold"
Hand-tuned score: -1.5500
Threshold: -11.4590


Inference ID: ATM10-4

Txt: A.Q. Khan of Pakistan sold plans for a nuclear bomb to Iran. He also sold the XYZ-11, the key part necessary for the trigger.

Hyp: Pakistan does have the XYZ-11 . (yes)

Pakistan
NNP
does
VBZ
have
VB
the
DT
XYZ-11
NN
.
.
A.Q._Khan:NNP 14.67 14.55 14.52 20.50 10.46 20.50
Pakistan:NNP   0.00 14.84 14.84 20.50 10.46 20.50
sold:VBD 15.50 10.00   5.68 20.00 14.96 19.42
plans:NNS   9.84 12.76 12.32 20.00   9.96 20.00
a:DT 20.50 20.00 20.00 10.00 20.00 10.00
nuclear:JJ 12.46 10.23 11.96 20.00 11.96 19.75
bomb:NN   9.84 13.13 13.95 20.00   9.96 20.00
Iran:NNP   3.17 14.84 14.84 20.50 10.46 20.50
.:. 20.50 20.00 20.00 10.00 20.00   0.00
He:PRP 12.50 15.00 15.00 20.00 12.00 20.00
also:RB 15.46 19.96 19.96 20.00 14.96 20.00
sold:VBD 15.50 10.00   5.68 20.00 14.96 19.42
the:DT 20.50 18.65 20.00   0.00 20.00 10.00
XYZ-11:NN 10.46 14.96 14.96 20.00   0.00 20.00
,:, 20.50 19.26 20.00 10.00 20.00   5.73
the:DT 20.50 18.65 20.00   0.00 20.00 10.00
key:JJ   9.61 10.11   9.62 20.00 11.96 19.53
part:NN   8.02 13.95 12.61 20.00   9.96 17.01
necessary:JJ 11.84   7.48 11.34 20.00 11.96 18.22
the:DT 20.50 18.65 20.00   0.00 20.00 10.00
trigger:NN   9.84 11.31 12.32 20.00   9.96 20.00
.:. 20.50 20.00 20.00 10.00 20.00   0.00
NO_WORD 10.00   1.00 10.00   1.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -8.6779
Features matched: Adjunct.dropPosCxt: text adjunct "part" of "XYZ-11" dropped on aligned hyp word "XYZ-11"; NullPunisher.aux: does; RootEntailment.poorlyAlignedRoot: "have" aligned badly to "sold"; Structure.parentsMismatch: args have different parents, different relations: text "Pakistan" <-prep_of-- "A.Q._Khan" vs. hyp "Pakistan" <-nsubj-- "have", which aligned to text "sold"
Hand-tuned score: -3.5500
Threshold: -11.4590


Inference ID: ATM11-1

Txt: John was at the station in Washington D.C at 10:00 AM on March 15, 2005. His train to New York was scheduled to leave the station at 10:30.

Hyp: John was in Washington at 10 . (yes)

John
NNP
was
VBD
Washington
NNP
10
CD
.
.
John:NNP   0.00 14.84   8.69 23.69 20.50
was:VBD 14.84   0.00   6.65 20.50 20.00
at:IN 20.50 20.00 20.50 20.50 20.00
the:DT 20.50 20.00 20.50 20.50 10.00
station:NN   8.53 12.52   8.02 19.66 19.88
Washington_D.C:NNP 11.84 11.08   0.00 24.96 20.50
10:00:CD 24.96 20.46 24.96   5.00 20.50
AM:NNP   8.35 14.34   9.84 19.19 20.00
March:NNP 10.03 11.88 11.38 22.84 20.50
15:CD 23.69 20.50 25.00   5.41 19.54
,:, 25.00 20.50 25.00 23.48   6.23
2005:CD 24.96 20.46 24.96 10.00 20.50
.:. 20.50 20.00 20.50 19.16   0.00
His:PRP$ 12.50 15.00 12.50 20.50 20.00
train:NN   8.53 14.34   8.63 19.77 20.00
New_York:NNP 14.34 10.00   4.50 24.96 20.50
was:VBD 14.84   0.00   6.65 20.50 20.00
scheduled:VBN 12.97 10.00 15.50 19.19 20.00
to:TO 20.50 20.00 20.50 20.50 10.00
leave:VB 13.35 10.00 15.50 18.34 19.32
the:DT 20.50 20.00 20.50 20.50 10.00
station:NN   8.53 12.52   8.02 19.66 19.88
10:30:CD 24.96 20.46 24.96   5.00 20.50
.:. 20.50 20.00 20.50 19.16   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -13.0000
Features matched: Location.mismatch: no clear info of matching: be(X, prep_in); Numeric.mismatch: NUMBER mismatch: '10.0' vs ''; Structure.argsMismatch: args have different parents but same relations: text "John" <-nsubj-- "AM vs. hyp "John" <-nsubj-- "was", which aligned to text "was" args have different parents but same relations: text "Washington_D.C" <-prep_in-- "at vs. hyp "Washington" <-prep_in-- "was", which aligned to text "was" args have different parents but same relations: text "." <-punct-- "AM vs. hyp "." <-punct-- "was", which aligned to text "was"
Hand-tuned score: -10.0000
Threshold: -11.4590


Inference ID: ATM11-2

Txt: John was at the station in Washington D.C at 10:00 AM on March 15, 2005. His train to New York was scheduled to leave the station at 10:30.

Hyp: John was in Washington at 10:15 . (yes)

John
NNP
was
VBD
Washington
NNP
10:15
CD
.
.
John:NNP   0.00 14.84   8.69 24.96 20.50
was:VBD 14.84   0.00   6.65 20.46 20.00
at:IN 20.50 20.00 20.50 20.50 20.00
the:DT 20.50 20.00 20.50 20.50 10.00
station:NN   8.53 12.52   8.02 17.11 19.88
Washington_D.C:NNP 11.84 11.08   0.00 24.96 20.50
10:00:CD 24.96 20.46 24.96   4.45 20.50
AM:NNP   8.35 14.34   9.84 20.46 20.00
March:NNP 10.03 11.88 11.38 24.96 20.50
15:CD 23.69 20.50 25.00 10.00 19.54
,:, 25.00 20.50 25.00 25.00   6.23
2005:CD 24.96 20.46 24.96 10.00 20.50
.:. 20.50 20.00 20.50 20.50   0.00
His:PRP$ 12.50 15.00 12.50 20.50 20.00
train:NN   8.53 14.34   8.63 17.49 20.00
New_York:NNP 14.34 10.00   4.50 24.96 20.50
was:VBD 14.84   0.00   6.65 20.46 20.00
scheduled:VBN 12.97 10.00 15.50 16.75 20.00
to:TO 20.50 20.00 20.50 20.50 10.00
leave:VB 13.35 10.00 15.50 18.49 19.32
the:DT 20.50 20.00 20.50 20.50 10.00
station:NN   8.53 12.52   8.02 17.11 19.88
10:30:CD 24.96 20.46 24.96   2.47 20.50
.:. 20.50 20.00 20.50 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -10.4712
Features matched: Location.mismatch: no clear info of matching: be(X, prep_in); Structure.argsMismatch: args have different parents but same relations: text "John" <-nsubj-- "AM vs. hyp "John" <-nsubj-- "was", which aligned to text "was" args have different parents but same relations: text "Washington_D.C" <-prep_in-- "at vs. hyp "Washington" <-prep_in-- "was", which aligned to text "was" args have different parents but same relations: text "." <-punct-- "scheduled vs. hyp "." <-punct-- "was", which aligned to text "was"
Hand-tuned score: -4.0000
Threshold: -11.4590


Inference ID: ATM11-3

Txt: John was at the station in Washington D.C at 10:00 AM on March 15, 2005. His train to New York was scheduled to leave the station at 10:30.

Hyp: John was in Washington at 10:30 . (yes)

John
NNP
was
VBD
Washington
NNP
10:30
CD
.
.
John:NNP   0.00 14.84   8.69 24.96 20.50
was:VBD 14.84   0.00   6.65 20.46 20.00
at:IN 20.50 20.00 20.50 20.50 20.00
the:DT 20.50 20.00 20.50 20.50 10.00
station:NN   8.53 12.52   8.02 18.51 19.88
Washington_D.C:NNP 11.84 11.08   0.00 24.96 20.50
10:00:CD 24.96 20.46 24.96   4.96 20.50
AM:NNP   8.35 14.34   9.84 20.46 20.00
March:NNP 10.03 11.88 11.38 24.96 20.50
15:CD 23.69 20.50 25.00 10.00 19.54
,:, 25.00 20.50 25.00 25.00   6.23
2005:CD 24.96 20.46 24.96 10.00 20.50
.:. 20.50 20.00 20.50 20.50   0.00
His:PRP$ 12.50 15.00 12.50 20.50 20.00
train:NN   8.53 14.34   8.63 17.89 20.00
New_York:NNP 14.34 10.00   4.50 24.96 20.50
was:VBD 14.84   0.00   6.65 20.46 20.00
scheduled:VBN 12.97 10.00 15.50 15.18 20.00
to:TO 20.50 20.00 20.50 20.50 10.00
leave:VB 13.35 10.00 15.50 18.29 19.32
the:DT 20.50 20.00 20.50 20.50 10.00
station:NN   8.53 12.52   8.02 18.51 19.88
10:30:CD 24.96 20.46 24.96   0.00 20.50
.:. 20.50 20.00 20.50 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -8.0000
Features matched: Location.mismatch: no clear info of matching: be(X, prep_in); Structure.argsMismatch: args have different parents but same relations: text "John" <-nsubj-- "AM vs. hyp "John" <-nsubj-- "was", which aligned to text "was" args have different parents but same relations: text "Washington_D.C" <-prep_in-- "at vs. hyp "Washington" <-prep_in-- "was", which aligned to text "was" args have different parents but same relations: text "." <-punct-- "AM vs. hyp "." <-punct-- "was", which aligned to text "was"
Hand-tuned score: -4.0000
Threshold: -11.4590


Inference ID: ATM11-4

Txt: John was at the station in Washington D.C at 10:00 AM on March 15, 2005. His train to New York was scheduled to leave the station at 10:30.

Hyp: John was in Washington on March 15 . (MULTIPLE ANSWERS)

John
NNP
was
VBD
Washington
NNP
March
NNP
15
CD
.
.
John:NNP   0.00 14.84   8.69 10.03 23.69 20.50
was:VBD 14.84   0.00   6.65 11.88 20.50 20.00
at:IN 20.50 20.00 20.50 20.50 20.50 20.00
the:DT 20.50 20.00 20.50 20.50 20.50 10.00
station:NN   8.53 12.52   8.02   8.02 19.85 19.88
Washington_D.C:NNP 11.84 11.08   0.00 13.19 24.96 20.50
10:00:CD 24.96 20.46 24.96 24.96 10.00 20.50
AM:NNP   8.35 14.34   9.84   1.65 19.19 20.00
March:NNP 10.03 11.88 11.38   0.00 17.84 20.50
15:CD 23.69 20.50 25.00 17.84   0.00 19.54
,:, 25.00 20.50 25.00 20.00 18.42   6.23
2005:CD 24.96 20.46 24.96 19.96   4.90 20.50
.:. 20.50 20.00 20.50 20.50 19.54   0.00
His:PRP$ 12.50 15.00 12.50 12.50 20.50 20.00
train:NN   8.53 14.34   8.63   2.41 19.82 20.00
New_York:NNP 14.34 10.00   4.50 11.38 24.96 20.50
was:VBD 14.84   0.00   6.65 11.88 20.50 20.00
scheduled:VBN 12.97 10.00 15.50 13.10 19.19 20.00
to:TO 20.50 20.00 20.50 20.50 20.50 10.00
leave:VB 13.35 10.00 15.50 12.73 18.34 19.32
the:DT 20.50 20.00 20.50 20.50 20.50 10.00
station:NN   8.53 12.52   8.02   8.02 19.85 19.88
10:30:CD 24.96 20.46 24.96 24.96 10.00 20.50
.:. 20.50 20.00 20.50 20.50 19.54   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -8.0000
Features matched: Adjunct.dropPosCxt: text adjunct "2005" of "March" dropped on aligned hyp word "March"; Date.matchDatesByGraph: hyp/txt matching, by graph: March and children; Location.mismatch: no clear info of matching: be(X, prep_in); Structure.argsMismatch: args have different parents but same relations: text "John" <-nsubj-- "AM vs. hyp "John" <-nsubj-- "was", which aligned to text "was" args have different parents but same relations: text "Washington_D.C" <-prep_in-- "at vs. hyp "Washington" <-prep_in-- "was", which aligned to text "was" args have different parents but same relations: text "." <-punct-- "AM vs. hyp "." <-punct-- "was", which aligned to text "was"
Hand-tuned score: -2.5000
Threshold: -11.4590


Inference ID: ATM11-5

Txt: John was at the station in Washington D.C at 10:00 AM on March 15, 2005. His train to New York was scheduled to leave the station at 10:30.

Hyp: John was in New_York on March 15th . (yes)

John
NNP
was
VBD
New_York
NNP
March
NNP
15th
NNP
.
.
John:NNP   0.00 14.84   9.84 10.03 14.96 20.50
was:VBD 14.84   0.00   9.50 11.88 15.46 20.00
at:IN 20.50 20.00 20.00 20.50 20.50 20.00
the:DT 20.50 20.00 20.00 20.50 20.50 10.00
station:NN   8.53 12.52   6.03   8.02 10.46 19.88
Washington_D.C:NNP 11.84 11.08   9.41 13.19 14.96 20.50
10:00:CD 24.96 20.46 20.46 24.96 24.38 20.50
AM:NNP   8.35 14.34   9.34   1.65 10.46 20.00
March:NNP 10.03 11.88   6.88   0.00   9.96 20.50
15:CD 23.69 20.50 20.46 17.84 19.96 19.54
,:, 25.00 20.50 20.50 20.00 19.76   6.23
2005:CD 24.96 20.46 20.46 19.96 19.20 20.50
.:. 20.50 20.00 20.00 20.50 20.50   0.00
His:PRP$ 12.50 15.00 12.00 12.50 12.50 20.00
train:NN   8.53 14.34   9.06   2.41   8.85 20.00
New_York:NNP 14.34 10.00   0.50 11.38 14.96 20.50
was:VBD 14.84   0.00   9.50 11.88 15.46 20.00
scheduled:VBN 12.97 10.00 14.96 13.10 12.94 20.00
to:TO 20.50 20.00 20.00 20.50 20.50 10.00
leave:VB 13.35 10.00 14.96 12.73 14.63 19.32
the:DT 20.50 20.00 20.00 20.50 20.50 10.00
station:NN   8.53 12.52   6.03   8.02 10.46 19.88
10:30:CD 24.96 20.46 20.46 24.96 23.36 20.50
.:. 20.50 20.00 20.00 20.50 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00 10.00

Response: yes (CORRECT)
Justification:
Alignment score: -18.5000
Features matched: Adjunct.addPosCxt: hyp added 15th[15th-NNP]; Adjunct.dropPosCxt: text adjunct "2005" of "March" dropped on aligned hyp word "March"; Date.hypDateIns: hypothesis date insertion: 15th; Location.mismatch: no clear info of matching: be(X, prep_in); NullPunisher.other: 15th; Structure.argsMismatch: args have different parents but same relations: text "John" <-nsubj-- "AM vs. hyp "John" <-nsubj-- "was", which aligned to text "was" args have different parents but same relations: text "." <-punct-- "scheduled vs. hyp "." <-punct-- "was", which aligned to text "was" args have different parents, different relations: text "New_York" <-prep_to-- "train" vs. hyp "New_York" <-prep_in-- "was", which aligned to text "was"
Hand-tuned score: -7.5000
Threshold: -11.4590


Inference ID: ATM11-6

Txt: John was at the station in Washington D.C at 10:00 AM on March 15, 2005. His train to New York was scheduled to leave the station at 10:30.

Hyp: John was in California on March 15th . (no)

John
NNP
was
VBD
California
NNP
March
NNP
15th
NNP
.
.
John:NNP   0.00 14.84 14.34 10.03 14.96 20.50
was:VBD 14.84   0.00 10.00 11.88 15.46 20.00
at:IN 20.50 20.00 20.50 20.50 20.50 20.00
the:DT 20.50 20.00 20.50 20.50 20.50 10.00
station:NN   8.53 12.52   8.02   8.02 10.46 19.88
Washington_D.C:NNP 11.84 11.08   7.25 13.19 14.96 20.50
10:00:CD 24.96 20.46 24.96 24.96 24.38 20.50
AM:NNP   8.35 14.34   9.84   1.65 10.46 20.00
March:NNP 10.03 11.88 11.38   0.00   9.96 20.50
15:CD 23.69 20.50 25.00 17.84 19.96 19.54
,:, 25.00 20.50 25.00 20.00 19.76   6.23
2005:CD 24.96 20.46 24.96 19.96 19.20 20.50
.:. 20.50 20.00 20.50 20.50 20.50   0.00
His:PRP$ 12.50 15.00 12.50 12.50 12.50 20.00
train:NN   8.53 14.34   9.84   2.41   8.85 20.00
New_York:NNP 14.34 10.00   4.50 11.38 14.96 20.50
was:VBD 14.84   0.00 10.00 11.88 15.46 20.00
scheduled:VBN 12.97 10.00 15.50 13.10 12.94 20.00
to:TO 20.50 20.00 20.50 20.50 20.50 10.00
leave:VB 13.35 10.00 15.50 12.73 14.63 19.32
the:DT 20.50 20.00 20.50 20.50 20.50 10.00
station:NN   8.53 12.52   8.02   8.02 10.46 19.88
10:30:CD 24.96 20.46 24.96 24.96 23.36 20.50
.:. 20.50 20.00 20.50 20.50 20.50   0.00
NO_WORD 10.00 10.00 10.00 10.00 10.00 10.00

Response: yes (INCORRECT)
Justification:
Alignment score: -22.4959
Features matched: Adjunct.addPosCxt: hyp added 15th[15th-NNP]; Adjunct.dropPosCxt: text adjunct "2005" of "March" dropped on aligned hyp word "March"; Date.hypDateIns: hypothesis date insertion: 15th; Location.mismatch: no clear info of matching: be(X, prep_in); NullPunisher.other: 15th; Structure.argsMismatch: args have different parents but same relations: text "John" <-nsubj-- "AM vs. hyp "John" <-nsubj-- "was", which aligned to text "was" args have different parents but same relations: text "." <-punct-- "scheduled vs. hyp "." <-punct-- "was", which aligned to text "was" args have different parents, different relations: text "New_York" <-prep_to-- "train" vs. hyp "California" <-prep_in-- "was", which aligned to text "was"
Hand-tuned score: -7.5000
Threshold: -11.4590


Word similarity table built on Thu Jul 06 11:12:42 PDT 2006 using command:
java edu.stanford.nlp.rte.WordSimilarityGenerator -info /u/nlp/rte/data/byformat/align/stochastic/atm_dev.pipeline.align.xml -output /u/nlp/rte/data/byformat/wordsim/stochastic/atm_dev.pipeline.wordsim.html -lex.BasicWN off