Txt: Sandy owns a Golden Retriever, Harley, with whom she has won a Dog World Award.
Hyp: Harley is a herding dog . (No.)
Harley NNP |
is VBZ |
a DT |
herding JJ |
dog NN |
. . |
|
Sandy:NNP | 9.96 | 15.46 | 20.50 | 12.46 | 10.46 | 20.50 |
owns:VBZ | 15.46 | 10.00 | 20.00 | 12.00 | 14.72 | 19.98 |
a:DT | 20.50 | 20.00 | 0.00 | 20.00 | 20.00 | 10.00 |
Golden_Retriever:NNP | 14.96 | 14.84 | 20.50 | 12.46 | 3.88 | 20.50 |
,:, | 20.50 | 20.00 | 10.00 | 20.00 | 18.12 | 5.73 |
Harley:NNP | 0.00 | 15.46 | 20.50 | 12.46 | 10.46 | 20.50 |
,:, | 20.50 | 20.00 | 10.00 | 20.00 | 18.12 | 5.73 |
with:IN | 20.50 | 20.00 | 18.45 | 20.00 | 20.00 | 20.00 |
she:PRP | 12.50 | 15.00 | 20.00 | 15.00 | 12.00 | 20.00 |
has:VBZ | 15.46 | 8.64 | 20.00 | 12.00 | 14.34 | 20.00 |
won:VBN | 15.46 | 10.00 | 20.00 | 11.04 | 15.00 | 19.64 |
a:DT | 20.50 | 20.00 | 0.00 | 20.00 | 20.00 | 10.00 |
Dog:NNP | 14.96 | 12.84 | 20.50 | 12.50 | 0.50 | 20.50 |
World:NNP | 14.96 | 14.84 | 20.50 | 10.63 | 9.45 | 20.50 |
Award:NNP | 14.96 | 15.50 | 20.50 | 12.50 | 10.50 | 20.50 |
.:. | 20.50 | 20.00 | 10.00 | 19.45 | 19.56 | 0.00 |
NO_WORD | 10.00 | 1.00 | 1.00 | 9.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -15.5000
Features matched: Adjunct.addPosCxt: hyp added herding[herding-JJ]; Adjunct.dropPosCxt: text adjunct "won" of "Harley" dropped on aligned hyp word "Harley"; NullPunisher.aux: is; NullPunisher.other: herding; NullPunisher.article: a; Structure.argsMismatch: args have different parents but same relations: text "Harley" <-appos-- "Golden_Retriever vs. hyp "Harley" <-nsubj-- "dog", which aligned to text "Dog" args have different parents but same relations: text "." <-punct-- "owns vs. hyp "." <-punct-- "dog", which aligned to text "Dog" args have different parents, different relations: text "Harley" <-dep-- "with" vs. hyp "Harley" <-nsubj-- "dog", which aligned to text "Dog"
Hand-tuned score: -3.6500
Threshold: -11.4590
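Note on reading the matrix: each row is one text token (or NO_WORD) and each column is one hypothesis token, in the order listed above the matrix. The sketch below is not part of the system's output. It assumes that lower values mean a cheaper alignment, since identical tokens show 0.00, and it uses hypothetical helper names (parse_cost_row, cheapest_per_column) to pick the cheapest row for each hypothesis column from a few rows of the record above.

```python
def parse_cost_row(line):
    """Split 'token:TAG | c1 | c2 | ... |' into (label, [costs])."""
    cells = [c.strip() for c in line.split("|")]
    label = cells[0]
    # Trailing '|' leaves an empty last cell; skip empty cells.
    costs = [float(c) for c in cells[1:] if c]
    return label, costs


def cheapest_per_column(rows):
    """For each hypothesis column, return (label, cost) of the lowest-cost row.

    Assumption: lower cell values are better alignments.
    """
    parsed = [parse_cost_row(r) for r in rows]
    n_cols = len(parsed[0][1])
    best = []
    for j in range(n_cols):
        label, costs = min(parsed, key=lambda p: p[1][j])
        best.append((label, costs[j]))
    return best


# A subset of rows from the Harley record (columns: Harley, is, a, herding, dog, .)
rows = [
    "Harley:NNP | 0.00 | 15.46 | 20.50 | 12.46 | 10.46 | 20.50 |",
    "Dog:NNP | 14.96 | 12.84 | 20.50 | 12.50 | 0.50 | 20.50 |",
    ".:. | 20.50 | 20.00 | 10.00 | 19.45 | 19.56 | 0.00 |",
    "NO_WORD | 10.00 | 1.00 | 1.00 | 9.00 | 10.00 | 10.00 |",
]
best = cheapest_per_column(rows)
for label, cost in best:
    print(label, cost)
```

The per-column minimum is only a reading aid. The logged alignment score is not simply the sum of these minima, and the system's chosen alignment can differ from them: here the feature "NullPunisher.article: a" suggests the hypothesis "a" was left unaligned even though the a:DT row has a 0.00 cell.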
Txt: Many cellphones have built-in digital cameras.
Hyp: Some cellphones can be used to take pictures . (Yes.)
Some DT |
cellphones NNS |
can MD |
be VB |
used VBN |
to TO |
take VB |
pictures NNS |
. . |
|
Many:JJ | 20.00 | 11.96 | 19.96 | 11.96 | 11.96 | 20.00 | 11.96 | 11.96 | 20.00 |
cellphones:NNS | 20.00 | 0.00 | 17.12 | 14.34 | 15.00 | 20.00 | 15.00 | 8.03 | 20.00 |
have:VBP | 20.00 | 13.95 | 17.32 | 7.80 | 6.55 | 20.00 | 1.83 | 12.32 | 20.00 |
built-in:JJ | 20.00 | 11.96 | 19.96 | 11.96 | 11.96 | 20.00 | 11.96 | 11.96 | 20.00 |
digital_cameras:NNS | 20.00 | 5.06 | 17.12 | 14.34 | 14.96 | 20.00 | 14.96 | 8.03 | 20.00 |
.:. | 10.00 | 20.00 | 10.00 | 20.00 | 18.25 | 10.00 | 17.67 | 18.10 | 0.00 |
NO_WORD | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -51.4203
Features matched: Adjunct.dropPosCxt: text adjunct "built-in" of "digital_cameras" dropped on aligned hyp word "pictures"; Modal.yes: actual -> possible; NullPunisher.functionWord: to; NullPunisher.aux: be; NullPunisher.other: Some; NullPunisher.aux: can; Quant.contract: [many,some]; RootEntailment.poorlyAlignedRoot: "used" aligned badly to "have"; Structure.relMismatch: text "cellphones" is nsubj of "have" while hyp "cellphones" is nsubjpass of "used" which aligned to text "have"
Hand-tuned score: 0.3000
Threshold: -11.4590
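Note on the score lines: in every complete record in this log the hand-tuned score lies above the threshold and the response is "yes". The log does not state the decision rule, so the sketch below only extracts the fields and reports whether the score is above the threshold. The function name summarize_record and the dictionary keys are illustrative choices, not names from the system.

```python
import re


def summarize_record(lines):
    """Pull gold label, response, verdict, score and threshold from one record."""
    info = {}
    for line in lines:
        if line.startswith("Hyp:"):
            # Gold label is the trailing parenthesised part, e.g. "(Yes.)"
            m = re.search(r"\(([^)]*)\)\s*$", line)
            info["gold"] = m.group(1).rstrip(".") if m else None
        elif line.startswith("Response:"):
            m = re.match(r"Response:\s*(\w+)\s*\((\w+)\)", line)
            info["response"], info["verdict"] = m.group(1), m.group(2)
        elif line.startswith("Hand-tuned score:"):
            info["score"] = float(line.split(":", 1)[1])
        elif line.startswith("Threshold:"):
            info["threshold"] = float(line.split(":", 1)[1])
    # Observation only: whether this comparison drives the response is an assumption.
    info["above_threshold"] = info["score"] > info["threshold"]
    return info


record = [
    "Hyp: Some cellphones can be used to take pictures . (Yes.)",
    "Response: yes (CORRECT)",
    "Alignment score: -51.4203",
    "Hand-tuned score: 0.3000",
    "Threshold: -11.4590",
]
summary = summarize_record(record)
print(summary)
```

Running this over all records would show whether any "yes" response ever coincides with a score below the threshold, which would contradict the assumed rule.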
Txt: One in four cellphones sold has a camera in it.
Hyp: Do most cell_phones have a lens . (No.)
Do VB |
most JJS |
cell_phones NNS |
have VB |
a DT |
lens NN |
. . |
|
One:CD | 19.19 | 20.46 | 19.84 | 20.50 | 20.50 | 20.50 | 20.50 |
four:CD | 19.19 | 20.46 | 19.84 | 20.50 | 20.50 | 20.50 | 19.35 |
cellphones:NNS | 15.00 | 11.96 | 1.90 | 13.95 | 20.00 | 3.53 | 20.00 |
sold:VBN | 7.69 | 11.96 | 12.80 | 5.68 | 20.00 | 13.35 | 19.42 |
has:VBZ | 7.53 | 11.96 | 14.34 | 0.50 | 20.00 | 14.34 | 20.00 |
a:DT | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
camera:NN | 15.00 | 11.96 | 6.77 | 13.95 | 20.00 | 3.53 | 18.65 |
it:PRP | 15.00 | 15.00 | 12.00 | 15.00 | 20.00 | 12.00 | 20.00 |
.:. | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 0.00 |
NO_WORD | 10.00 | 9.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -26.4626
Features matched: Adjunct.addPosCxt: hyp added most[most-JJS]; Adjunct.dropPosCxt: text adjunct "it" of "camera" dropped on aligned hyp word "lens"; Polarity.hypNegMarker: "most": JJS; NullPunisher.other: most; Quant.contract: [a,a]; RootEntailment.poorlyAlignedRoot: "Do" aligned badly to "has"
Hand-tuned score: -1.5000
Threshold: -11.4590
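Note on the "Features matched" lines: entries appear to be separated by "; " and each starts with a dotted name such as Adjunct.dropPosCxt followed by a colon. Under that assumption, the sketch below lists the feature names in one line and tallies them by group. The regular expression and the helper name feature_names are illustrative and not taken from the system.

```python
import re
from collections import Counter

# A feature name is Group.name followed by ':' at the start or after ';'.
FEATURE_NAME = re.compile(r"(?:^|;\s*)([A-Za-z]+\.[A-Za-z]+):")


def feature_names(line):
    """Return the dotted feature names found in a 'Features matched:' line."""
    prefix = "Features matched:"
    body = line[len(prefix):].strip() if line.startswith(prefix) else line
    return FEATURE_NAME.findall(body)


line = (
    'Features matched: Adjunct.addPosCxt: hyp added most[most-JJS]; '
    'Adjunct.dropPosCxt: text adjunct "it" of "camera" dropped on aligned '
    'hyp word "lens"; Polarity.hypNegMarker: "most": JJS; '
    'NullPunisher.other: most; Quant.contract: [a,a]; '
    'RootEntailment.poorlyAlignedRoot: "Do" aligned badly to "has"'
)
names = feature_names(line)
groups = Counter(name.split(".")[0] for name in names)
print(names)
print(groups)
```

This counts feature names, not individual mismatches: a Structure.argsMismatch entry that lists several mismatches without a separator is counted once.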
Txt: Three in four cellphones sold has a camera in it.
Hyp: Do most cell_phones have a lens . (Yes.)
Do VB |
most JJS |
cell_phones NNS |
have VB |
a DT |
lens NN |
. . |
|
Three:CD | 19.19 | 20.46 | 19.84 | 20.50 | 20.50 | 20.50 | 20.50 |
four:CD | 19.19 | 20.46 | 19.84 | 20.50 | 20.50 | 20.50 | 19.35 |
cellphones:NNS | 15.00 | 11.96 | 1.90 | 13.95 | 20.00 | 3.53 | 20.00 |
sold:VBN | 7.69 | 11.96 | 12.80 | 5.68 | 20.00 | 13.35 | 19.42 |
has:VBZ | 7.53 | 11.96 | 14.34 | 0.50 | 20.00 | 14.34 | 20.00 |
a:DT | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
camera:NN | 15.00 | 11.96 | 6.77 | 13.95 | 20.00 | 3.53 | 18.65 |
it:PRP | 15.00 | 15.00 | 12.00 | 15.00 | 20.00 | 12.00 | 20.00 |
.:. | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 0.00 |
NO_WORD | 10.00 | 9.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -26.4626
Features matched: Adjunct.addPosCxt: hyp added most[most-JJS]; Adjunct.dropPosCxt: text adjunct "it" of "camera" dropped on aligned hyp word "lens"; Polarity.hypNegMarker: "most": JJS; NullPunisher.other: most; Quant.contract: [a,a]; RootEntailment.poorlyAlignedRoot: "Do" aligned badly to "has"
Hand-tuned score: -1.5000
Threshold: -11.4590
Txt: Mr. Radley ordered a 16 ounce slab of slowly roasted Black Angus Prime Rib.
Hyp: Radley is a vegetarian . (No.)
Radley NNP |
is VBZ |
a DT |
vegetarian NN |
. . |
|
Mr._Radley:NNP | 0.00 | 15.46 | 20.50 | 10.46 | 20.50 |
ordered:VBD | 15.46 | 7.74 | 20.00 | 15.00 | 20.00 |
a:DT | 20.50 | 20.00 | 0.00 | 20.00 | 10.00 |
16:CD | 24.96 | 20.50 | 20.50 | 19.80 | 19.82 |
ounce:NN | 10.46 | 14.34 | 20.00 | 8.11 | 19.86 |
slab:NN | 10.46 | 14.34 | 20.00 | 8.03 | 20.00 |
slowly:RB | 15.46 | 19.96 | 20.00 | 14.96 | 18.14 |
roasted:JJ | 12.46 | 9.34 | 20.00 | 7.83 | 20.00 |
Black_Angus_Prime_Rib:NNP | 14.96 | 14.17 | 20.50 | 9.52 | 20.50 |
.:. | 20.50 | 20.00 | 10.00 | 20.00 | 0.00 |
NO_WORD | 10.00 | 1.00 | 1.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -13.0325
Features matched: Adjunct.dropPosCxt: text adjunct "Black_Angus_Prime_Rib" of "slab" dropped on aligned hyp word "vegetarian"; NullPunisher.aux: is; Quant.contract: [a,a]; RootEntailment.poorlyAlignedRoot: "vegetarian" aligned badly to "slab"; Structure.argsMismatch: args have different parents but same relations: text "Mr._Radley" <-nsubj-- "ordered vs. hyp "Radley" <-nsubj-- "vegetarian", which aligned to text "slab" args have different parents but same relations: text "." <-punct-- "ordered vs. hyp "." <-punct-- "vegetarian", which aligned to text "slab"
Hand-tuned score: -2.5500
Threshold: -11.4590
Txt: Sue ran down to McDonald's and got a hamburger happy meal with a large Diet Coke.
Hyp: Sue did get a cup . (Yes.)
Sue NNP |
did VBD |
get VB |
a DT |
cup NN |
. . |
|
Sue:NNP | 0.00 | 15.00 | 12.65 | 20.00 | 7.65 | 20.00 |
ran_down:VBD | 13.82 | 7.45 | 3.85 | 20.00 | 12.62 | 20.00 |
McDonald:NNP | 10.46 | 15.46 | 15.46 | 20.50 | 10.46 | 20.50 |
got:VBD | 12.65 | 5.62 | 0.50 | 20.00 | 12.62 | 17.75 |
a:DT | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
hamburger:RB | 14.34 | 18.41 | 18.31 | 20.00 | 10.84 | 20.00 |
happy:JJ | 11.96 | 11.96 | 6.05 | 20.00 | 11.42 | 17.14 |
meal:NN | 9.34 | 13.69 | 12.37 | 20.00 | 5.84 | 19.61 |
a:DT | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
large:JJ | 12.00 | 9.53 | 12.00 | 20.00 | 8.55 | 18.45 |
Diet_Coke:NN | 9.54 | 15.46 | 13.79 | 20.50 | 7.82 | 20.50 |
.:. | 20.00 | 17.99 | 17.78 | 10.00 | 20.00 | 0.00 |
NO_WORD | 10.00 | 1.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -11.3413
Features matched: Adjunct.dropPosCxt: text adjunct "happy" of "meal" dropped on aligned hyp word "cup"; NullPunisher.aux: did; Quant.contract: [a,a]; Structure.argsMismatch: args have different parents but same relations: text "Sue" <-nsubj-- "ran_down vs. hyp "Sue" <-nsubj-- "get", which aligned to text "got" args have different parents but same relations: text "." <-punct-- "ran_down vs. hyp "." <-punct-- "get", which aligned to text "got"
Hand-tuned score: -0.5500
Threshold: -11.4590
Txt: Angela Drake will be moving to California with her husband, where she has accepted a full-time position with the San Diego Public Library.
Hyp: Angela Drake does live in Los_Angeles . (No.)
Angela_Drake NNP |
does VBZ |
live VB |
Los_Angeles NNP |
. . |
|
Angela_Drake:NNP | 0.00 | 13.23 | 15.46 | 10.46 | 20.50 |
will:MD | 20.46 | 20.00 | 20.00 | 19.96 | 10.00 |
be:VB | 15.17 | 9.34 | 1.00 | 14.34 | 20.00 |
moving:VBG | 15.46 | 10.00 | 4.03 | 14.96 | 17.38 |
California:NNP | 14.67 | 14.84 | 15.50 | 2.50 | 20.50 |
her:PRP$ | 12.50 | 15.00 | 15.00 | 12.00 | 20.00 |
husband:NN | 9.52 | 13.11 | 13.53 | 9.34 | 19.33 |
,:, | 20.50 | 19.26 | 17.82 | 20.00 | 5.73 |
where:WRB | 20.46 | 19.96 | 19.96 | 19.96 | 10.00 |
she:PRP | 12.50 | 15.00 | 15.00 | 12.00 | 20.00 |
has:VBZ | 15.17 | 9.34 | 10.00 | 12.52 | 20.00 |
accepted:VBN | 15.46 | 10.00 | 8.07 | 14.96 | 17.53 |
a:DT | 20.50 | 20.00 | 20.00 | 20.00 | 10.00 |
full-time:JJ | 12.46 | 11.96 | 10.88 | 11.96 | 17.39 |
position:NN | 10.17 | 14.34 | 15.00 | 6.03 | 18.83 |
the:DT | 20.50 | 18.65 | 20.00 | 20.00 | 10.00 |
San_Diego_Public_Library:NNP | 14.47 | 14.41 | 15.46 | 10.46 | 20.50 |
.:. | 20.50 | 20.00 | 18.97 | 20.00 | 0.00 |
NO_WORD | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -12.8582
Features matched: Adjunct.dropPosCxt: text adjunct "husband" of "moving" dropped on aligned hyp word "live"; NullPunisher.aux: does; RootEntailment.poorlyAlignedRoot: "live" aligned badly to "moving"; Structure.relMismatch: text "California" is prep_to of "moving" while hyp "Los_Angeles" is prep_in of "live" which aligned to text "moving"
Hand-tuned score: -1.5500
Threshold: -11.4590
Txt: Ambassador Richard C. Holbrooke will visit Paris, France from Saturday, April 22 until Tuesday, April 25.
Hyp: Ambassador Holbrooke will visit France in April . (Yes.)
Ambassador NNP |
Holbrooke NNP |
will MD |
visit VB |
France NNP |
April NNP |
. . |
|
Ambassador:NNP | 0.00 | 9.29 | 20.00 | 15.00 | 8.55 | 10.50 | 20.00 |
Richard_C._Holbrooke:NNP | 10.46 | 0.00 | 20.46 | 15.46 | 14.96 | 14.96 | 20.50 |
will:MD | 20.00 | 20.46 | 0.00 | 20.00 | 20.50 | 19.19 | 10.00 |
visit:VB | 15.00 | 15.46 | 20.00 | 0.00 | 15.50 | 15.50 | 20.00 |
Paris:NNP | 9.84 | 14.96 | 20.50 | 13.63 | 2.00 | 15.00 | 20.50 |
,:, | 20.50 | 25.00 | 10.50 | 20.50 | 25.00 | 20.00 | 6.23 |
France:NNP | 8.55 | 14.96 | 20.50 | 15.50 | 0.00 | 15.00 | 20.50 |
Saturday:NNP | 10.50 | 14.96 | 19.19 | 15.50 | 15.00 | 7.23 | 20.50 |
,:, | 20.50 | 25.00 | 10.50 | 20.50 | 25.00 | 20.00 | 6.23 |
April:NNP | 10.50 | 14.96 | 19.19 | 15.50 | 15.00 | 0.00 | 20.50 |
22:CD | 20.50 | 24.96 | 19.19 | 20.50 | 25.00 | 17.84 | 19.32 |
Tuesday:NNP | 10.50 | 14.96 | 19.19 | 15.50 | 15.00 | 7.23 | 20.50 |
,:, | 20.50 | 25.00 | 10.50 | 20.50 | 25.00 | 20.00 | 6.23 |
April:NNP | 10.50 | 14.96 | 19.19 | 15.50 | 15.00 | 0.00 | 20.50 |
25:CD | 20.50 | 24.96 | 19.19 | 20.50 | 25.00 | 17.84 | 19.52 |
.:. | 20.00 | 20.50 | 10.00 | 20.00 | 20.50 | 20.50 | 0.00 |
NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -4.0000
Features matched: Adjunct.dropPosCxt: text adjunct "25" of "April" dropped on aligned hyp word "April"; Date.matchDatesByGraph: hyp/txt matching, by graph: April and children; Structure.parentsMismatch: args have different parents, different relations: text "France" <-appos-- "Paris" vs. hyp "France" <-dobj-- "visit", which aligned to text "visit"
Hand-tuned score: -0.5000
Threshold: -11.4590
Txt: A full-time administrative assistant, Janet Smith lives close to Long Beach, where she's worked for the past 12 years.
Hyp: Janet Smith does live in Long_Beach . (No.)
Janet_Smith NNP |
does VBZ |
live VB |
Long_Beach NNP |
. . |
|
A:DT | 20.50 | 20.00 | 20.00 | 20.00 | 10.00 |
full-time:JJ | 12.46 | 11.96 | 10.88 | 11.96 | 17.39 |
administrative:JJ | 12.46 | 10.80 | 11.96 | 11.96 | 19.85 |
assistant:NN | 8.61 | 13.11 | 15.00 | 9.34 | 20.00 |
,:, | 20.50 | 19.26 | 17.82 | 20.00 | 5.73 |
Janet_Smith:NNP | 0.00 | 14.55 | 15.46 | 9.97 | 20.50 |
lives:VBZ | 14.52 | 8.11 | 0.50 | 13.62 | 18.56 |
Long_Beach:NNP | 14.47 | 14.84 | 15.50 | 0.50 | 20.50 |
,:, | 20.50 | 19.26 | 17.82 | 20.00 | 5.73 |
where:WRB | 20.46 | 19.96 | 19.96 | 19.96 | 10.00 |
she:PRP | 12.50 | 15.00 | 15.00 | 12.00 | 20.00 |
's:VBZ | 15.17 | 6.09 | 9.76 | 13.92 | 18.25 |
worked:VBN | 14.97 | 8.95 | 8.07 | 12.52 | 18.80 |
the:DT | 20.50 | 18.65 | 20.00 | 20.00 | 10.00 |
past:JJ | 12.46 | 10.13 | 12.00 | 10.62 | 19.32 |
12:CD | 24.96 | 20.50 | 19.17 | 19.42 | 19.69 |
years:NNS | 10.46 | 15.00 | 14.82 | 6.52 | 19.49 |
.:. | 20.50 | 20.00 | 18.97 | 20.00 | 0.00 |
NO_WORD | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -5.0000
Features matched: NullPunisher.aux: does; Structure.argsMismatch: args have different parents but same relations: text "Janet_Smith" <-appos-- "assistant vs. hyp "Janet_Smith" <-nsubj-- "live", which aligned to text "lives" text "Long_Beach" is prep_close_to of "lives" while hyp "Long_Beach" is prep_in of "live" which aligned to text "lives"
Hand-tuned score: -2.0500
Threshold: -11.4590
Txt: Bob Appleton, 33, lives in Sacramento but commutes to Davis in his silver Honda CRV.
Hyp: Appleton does have a California Driver 's license . (Yes.)
Appleton NNP |
does VBZ |
have VB |
a DT |
California_Driver NNP |
license NN |
. . |
|
Bob_Appleton:NNP | 5.00 | 14.55 | 14.31 | 20.50 | 13.17 | 9.68 | 20.50 |
,:, | 20.50 | 19.26 | 20.00 | 10.00 | 20.50 | 19.79 | 5.73 |
33:CD | 24.96 | 20.46 | 20.46 | 20.50 | 24.96 | 20.46 | 18.52 |
,:, | 20.50 | 19.26 | 20.00 | 10.00 | 20.50 | 19.79 | 5.73 |
lives:VBZ | 13.55 | 8.11 | 8.05 | 20.00 | 14.42 | 12.44 | 18.56 |
Sacramento:NNP | 10.69 | 14.84 | 14.84 | 20.50 | 13.17 | 10.50 | 20.50 |
commutes:NNP | 10.50 | 14.05 | 12.32 | 20.00 | 10.46 | 10.00 | 20.00 |
Davis:NNP | 13.05 | 13.61 | 13.55 | 20.50 | 12.04 | 10.50 | 20.50 |
his:PRP$ | 12.50 | 15.00 | 15.00 | 20.00 | 12.50 | 12.00 | 20.00 |
silver:JJ | 11.45 | 10.95 | 9.61 | 20.00 | 11.42 | 9.85 | 18.95 |
Honda_CRV:NN | 9.96 | 15.46 | 15.46 | 20.50 | 9.96 | 10.46 | 20.50 |
.:. | 20.50 | 20.00 | 20.00 | 10.00 | 20.50 | 19.98 | 0.00 |
NO_WORD | 10.00 | 1.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -35.0496
Features matched: Adjunct.dropPosCxt: text adjunct "Davis" of "lives" dropped on aligned hyp word "have"; NullPunisher.article: a; NullPunisher.aux: does; NullPunisher.other: California_Driver; NullPunisher.other: license; RootEntailment.poorlyAlignedRoot: "have" aligned badly to "lives"
Hand-tuned score: -2.6500
Threshold: -11.4590
Txt: When they remodeled their house, the Kirchners transformed the original kitchen into a game room with dark cherry laminate flooring and cabinets.
Hyp: Does the Krichners ' kitchen have cherry cabinets ? (Unknown.)
Does RB |
the DT |
Krichners NNPS |
kitchen NN |
have VBP |
cherry JJ |
cabinets NNS |
? . |
|
When:WRB | 19.96 | 10.00 | 20.46 | 19.96 | 19.96 | 19.96 | 19.96 | 10.00 |
they:PRP | 20.00 | 15.71 | 12.50 | 12.00 | 15.00 | 15.00 | 12.00 | 20.00 |
remodeled:VBD | 20.00 | 20.00 | 15.46 | 11.05 | 7.62 | 9.51 | 11.30 | 20.00 |
their:PRP$ | 20.00 | 20.00 | 12.50 | 12.00 | 15.00 | 15.00 | 12.00 | 20.00 |
house:NN | 11.69 | 20.00 | 10.46 | 6.19 | 13.95 | 10.29 | 6.66 | 20.00 |
,:, | 20.00 | 10.00 | 20.50 | 18.54 | 20.00 | 17.15 | 18.85 | 10.00 |
the:DT | 20.00 | 0.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
Kirchners:NNPS | 13.61 | 20.50 | 9.44 | 9.45 | 13.55 | 10.61 | 9.45 | 20.50 |
transformed:VBD | 20.00 | 20.00 | 15.46 | 15.00 | 7.61 | 12.00 | 15.00 | 19.73 |
the:DT | 20.00 | 0.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
original:JJ | 10.95 | 20.00 | 12.46 | 10.03 | 10.95 | 8.95 | 10.03 | 19.83 |
kitchen:NN | 13.95 | 20.00 | 10.46 | 0.00 | 13.95 | 7.60 | 3.12 | 19.98 |
a:DT | 20.00 | 10.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
game_room:NN | 12.88 | 20.00 | 10.46 | 5.46 | 13.95 | 9.27 | 5.61 | 20.00 |
dark:JJ | 11.34 | 20.00 | 12.46 | 6.73 | 11.34 | 5.26 | 8.72 | 20.00 |
cherry:JJ | 10.11 | 20.00 | 12.46 | 7.60 | 10.11 | 0.00 | 7.97 | 19.72 |
laminate:JJ | 10.95 | 20.00 | 12.46 | 8.72 | 9.62 | 6.18 | 7.69 | 18.60 |
flooring:NNS | 13.95 | 20.00 | 10.46 | 5.75 | 13.95 | 8.75 | 5.08 | 20.00 |
cabinets:NNS | 10.85 | 20.00 | 10.46 | 3.12 | 13.95 | 7.97 | 0.00 | 19.54 |
.:. | 20.00 | 10.00 | 20.50 | 18.73 | 20.00 | 19.85 | 19.19 | 10.00 |
NO_WORD | 9.00 | 1.00 | 10.00 | 10.00 | 10.00 | 9.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -43.6142
Features matched: Adjunct.addPosCxt: hyp added Does[Does-RB]; Adjunct.dropPosCxt: text adjunct "flooring" of "transformed" dropped on aligned hyp word "have"; NullPunisher.other: Does; NullPunisher.other: ?; NullPunisher.other: Krichners; NullPunisher.article: the; RootEntailment.poorlyAlignedRoot: "have" aligned badly to "transformed"; Structure.relMismatch: text "kitchen" is dobj of "transformed" while hyp "kitchen" is nsubj of "have" which aligned to text "transformed" text "cabinets" is prep_with of "transformed" while hyp "cabinets" is dobj of "have" which aligned to text "transformed"
Hand-tuned score: -5.6000
Threshold: -11.4590
Txt: Mark Jacobs, who lives in Edmonds and works in Seattle, spends 2 hours a day commuting.
Hyp: Jacobs does commute to work during rush_hour . (Yes.)
Jacobs NNP |
does VBZ |
commute NN |
to TO |
work VB |
rush_hour NN |
. . |
|
Mark_Jacobs:NNS | 0.00 | 14.55 | 9.16 | 20.50 | 13.23 | 7.50 | 20.50 |
,:, | 20.50 | 19.26 | 20.00 | 10.00 | 19.61 | 20.00 | 5.73 |
who:WP | 12.50 | 15.00 | 12.00 | 20.00 | 15.00 | 12.00 | 20.00 |
lives:VBZ | 13.55 | 8.11 | 10.16 | 20.00 | 6.27 | 12.82 | 18.56 |
Edmonds:NNP | 14.96 | 15.46 | 10.46 | 20.50 | 15.46 | 10.46 | 20.50 |
works:VBZ | 14.45 | 8.42 | 13.82 | 20.00 | 0.50 | 14.18 | 19.16 |
Seattle:NNP | 14.34 | 14.84 | 10.50 | 20.50 | 10.96 | 10.17 | 20.50 |
,:, | 20.50 | 19.26 | 20.00 | 10.00 | 19.61 | 20.00 | 5.73 |
spends:VBZ | 15.50 | 8.12 | 11.68 | 20.00 | 7.78 | 15.00 | 20.00 |
2:CD | 25.00 | 20.50 | 20.50 | 20.50 | 20.41 | 19.42 | 18.58 |
hours:NNS | 10.50 | 15.00 | 6.02 | 20.00 | 11.60 | 5.00 | 19.95 |
a:DT | 20.50 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 10.00 |
day:NN | 4.79 | 13.11 | 7.09 | 20.00 | 12.57 | 8.01 | 18.14 |
commuting:VBG | 15.50 | 10.00 | 0.50 | 20.00 | 3.90 | 13.04 | 20.00 |
.:. | 20.50 | 20.00 | 19.75 | 10.00 | 18.57 | 20.00 | 0.00 |
NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -30.1176
Features matched: Adjunct.dropPosCxt: text adjunct "2" of "hours" dropped on aligned hyp word "rush_hour"; NullPunisher.functionWord: to; RootEntailment.poorlyAlignedRoot: "does" aligned badly to "spends"; Structure.parentsMismatch: args have different parents, different relations: text "commuting" <-partmod-- "hours" vs. hyp "commute" <-dobj-- "does", which aligned to text "spends" args have different parents, different relations: text "works" <-rcmod-- "Mark_Jacobs" vs. hyp "work" <-xcomp-- "does", which aligned to text "spends"
Hand-tuned score: -3.6000
Threshold: -11.4590
Txt: Terry Parks married Robert Paulson in 1979.
Hyp: Terry Parks is a man . (No.)
Terry_Parks NNS |
is VBZ |
a DT |
man NN |
. . |
|
Terry_Parks:NNS | 0.00 | 15.17 | 20.50 | 9.52 | 20.50 |
married:VBD | 15.46 | 8.07 | 20.00 | 10.61 | 19.71 |
Robert_Paulson:NNP | 9.02 | 15.17 | 20.50 | 9.52 | 20.50 |
1979:CD | 24.96 | 20.46 | 20.50 | 18.79 | 19.57 |
.:. | 20.50 | 20.00 | 10.00 | 19.76 | 0.00 |
NO_WORD | 10.00 | 1.00 | 1.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -12.6122
Features matched: NullPunisher.aux: is; NullPunisher.article: a; RootEntailment.poorlyAlignedRoot: "man" aligned badly to "married"
Hand-tuned score: -1.1500
Threshold: -11.4590
Txt: The Paulsons celebrated their 25th anniversary on June 14, 2004.
Hyp: The Paulsons did get_married on Flag_Day . (MULTIPLE ANSWERS)
The DT |
Paulsons NNPS |
did VBD |
get_married VBN |
Flag_Day NNP |
. . |
|
The:DT | 0.00 | 20.50 | 20.00 | 20.00 | 20.00 | 10.00 |
Paulsons:NNPS | 20.50 | 0.00 | 15.46 | 15.46 | 10.46 | 20.50 |
celebrated:VBD | 20.00 | 15.46 | 9.33 | 10.00 | 15.00 | 20.00 |
their:PRP$ | 20.00 | 12.50 | 15.00 | 15.00 | 12.00 | 20.00 |
25th:JJ | 20.00 | 12.46 | 11.52 | 11.96 | 11.96 | 20.00 |
anniversary:NN | 20.00 | 10.46 | 13.69 | 15.00 | 4.16 | 20.00 |
June_14:NNP | 20.50 | 14.96 | 14.19 | 15.50 | 0.50 | 20.50 |
,:, | 10.50 | 25.00 | 20.30 | 20.50 | 20.50 | 6.23 |
2004:CD | 20.50 | 24.96 | 20.46 | 20.46 | 20.46 | 20.50 |
.:. | 10.00 | 20.50 | 17.99 | 20.00 | 20.00 | 0.00 |
NO_WORD | 1.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -11.8094
Features matched: Adjunct.dropPosCxt: text adjunct "2004" of "June_14" dropped on aligned hyp word "Flag_Day"; NullPunisher.aux: did; RootEntailment.poorlyAlignedRoot: "get_married" aligned badly to "celebrated"
Hand-tuned score: -0.5500
Threshold: -11.4590
Txt: The Island Nut Sampler includes an 8 ounce box of milk chocolate covered macadamia nuts and an 8 ounce box of white chocolate covered macadamia nuts.
Hyp: It does include a pound of macadamia nuts . (Yes.)
It PRP |
does VBZ |
include VB |
a DT |
pound NN |
macadamia NN |
nuts NNS |
. . |
|
The:DT | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 20.00 | 10.00 |
Island:NNP | 12.50 | 14.45 | 15.50 | 20.50 | 9.45 | 9.45 | 9.45 | 20.50 |
Nut:NNP | 12.50 | 13.61 | 15.50 | 20.50 | 8.34 | 8.61 | 0.81 | 20.50 |
Sampler:NNP | 12.50 | 13.61 | 15.50 | 20.50 | 8.53 | 8.61 | 8.53 | 20.50 |
includes:VBZ | 15.00 | 9.55 | 0.50 | 20.00 | 12.61 | 14.87 | 15.00 | 19.56 |
an:DT | 20.00 | 17.89 | 20.00 | 8.73 | 20.00 | 20.00 | 20.00 | 10.00 |
8:CD | 20.50 | 20.50 | 20.50 | 20.50 | 16.96 | 20.50 | 18.34 | 19.96 |
ounce:NN | 12.00 | 10.17 | 15.00 | 20.00 | 0.94 | 8.11 | 7.84 | 19.86 |
box:NN | 12.00 | 13.08 | 12.42 | 20.00 | 3.45 | 4.40 | 7.81 | 19.82 |
milk_chocolate:NN | 12.00 | 14.34 | 12.87 | 20.00 | 8.81 | 8.53 | 8.44 | 20.00 |
covered:VBN | 15.00 | 8.49 | 5.64 | 20.00 | 12.45 | 13.95 | 11.46 | 19.33 |
macadamia:NN | 12.00 | 13.11 | 13.91 | 20.00 | 8.11 | 0.00 | 5.49 | 19.94 |
nuts:NNS | 12.00 | 13.11 | 15.00 | 20.00 | 7.07 | 5.49 | 0.00 | 19.80 |
an:DT | 20.00 | 17.89 | 20.00 | 8.73 | 20.00 | 20.00 | 20.00 | 10.00 |
8:CD | 20.50 | 20.50 | 20.50 | 20.50 | 16.96 | 20.50 | 18.34 | 19.96 |
ounce:NN | 12.00 | 10.17 | 15.00 | 20.00 | 0.94 | 8.11 | 7.84 | 19.86 |
box:NN | 12.00 | 13.08 | 12.42 | 20.00 | 3.45 | 4.40 | 7.81 | 19.82 |
white_chocolate:NN | 12.00 | 14.05 | 13.81 | 20.00 | 7.15 | 9.05 | 8.44 | 20.00 |
covered:VBD | 15.00 | 8.49 | 5.64 | 20.00 | 12.45 | 13.95 | 11.46 | 19.33 |
macadamia:NN | 12.00 | 13.11 | 13.91 | 20.00 | 8.11 | 0.00 | 5.49 | 19.94 |
nuts:NNS | 12.00 | 13.11 | 15.00 | 20.00 | 7.07 | 5.49 | 0.00 | 19.80 |
.:. | 20.00 | 20.00 | 19.86 | 10.00 | 19.64 | 19.94 | 19.80 | 0.00 |
NO_WORD | 10.00 | 1.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -17.4408
Features matched: NullPunisher.aux: does; NullPunisher.other: It; NullPunisher.article: a; Structure.parentsMismatch: args have different parents, different relations: text "ounce" <-nn-- "box" vs. hyp "pound" <-dobj-- "include", which aligned to text "includes"
Hand-tuned score: -3.1500
Threshold: -11.4590
Txt: Nanolab says it can't stay profitable if the demand for nanotubes decreases.
Hyp: Nanolab does manufacture nanotubes . (MULTIPLE ANSWERS)
Nanolab NNP |
does VBZ |
manufacture NN |
nanotubes NNS |
. . |
|
Nanolab:NNP | 0.00 | 14.96 | 9.96 | 9.96 | 20.00 |
says:VBZ | 14.96 | 9.34 | 14.82 | 15.00 | 18.68 |
it:PRP | 12.00 | 15.00 | 12.00 | 12.00 | 20.00 |
ca:MD | 19.96 | 16.22 | 19.83 | 15.05 | 9.11 |
n't:RB | 14.96 | 17.14 | 13.59 | 14.96 | 17.90 |
stay:VB | 14.96 | 8.33 | 13.35 | 14.34 | 18.79 |
profitable:JJ | 11.96 | 11.96 | 9.71 | 11.96 | 19.77 |
if:IN | 20.00 | 16.87 | 20.00 | 20.00 | 20.00 |
the:DT | 20.00 | 18.65 | 20.00 | 20.00 | 10.00 |
demand:NN | 9.96 | 15.00 | 7.10 | 10.00 | 20.00 |
nanotubes:NNS | 9.96 | 14.34 | 10.00 | 0.00 | 20.00 |
decreases:VBZ | 14.96 | 10.00 | 12.69 | 15.00 | 19.25 |
.:. | 20.00 | 20.00 | 19.10 | 20.00 | 0.00 |
NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -20.4384
Features matched: RootEntailment.poorlyAlignedRoot: "does" aligned badly to "says"; Structure.parentsMismatch: args have different parents, different relations: text "nanotubes" <-prep_for-- "demand" vs. hyp "nanotubes" <-dobj-- "does", which aligned to text "says"
Hand-tuned score: -4.0000
Threshold: -11.4590
Txt: The Swedish Embassy in Bangkok will be closed April 13-17 during the Songkran Festival.
Hyp: The Songkran Festival is a Swedish holiday . (Unknown.)
The DT |
Songkran_Festival NNP |
is VBZ |
a DT |
Swedish JJ |
holiday NN |
. . |
|
The:DT | 0.00 | 20.50 | 20.00 | 10.00 | 20.50 | 20.00 | 10.00 |
Swedish:NNP | 20.50 | 14.34 | 15.50 | 20.50 | 0.00 | 9.19 | 20.50 |
Embassy:NNP | 20.50 | 14.96 | 14.84 | 20.50 | 12.00 | 10.50 | 20.50 |
Bangkok:NNP | 20.50 | 14.96 | 14.84 | 20.50 | 17.00 | 10.50 | 20.50 |
will:MD | 10.00 | 19.84 | 20.00 | 10.00 | 18.35 | 18.69 | 10.00 |
be:VB | 20.00 | 15.46 | 0.31 | 20.00 | 12.50 | 15.00 | 20.00 |
closed:VBN | 20.00 | 14.42 | 8.07 | 20.00 | 10.35 | 12.84 | 19.49 |
April:NNP | 20.50 | 13.62 | 15.50 | 20.50 | 15.69 | 7.73 | 20.50 |
13-17:CD | 20.50 | 24.96 | 20.46 | 20.50 | 24.96 | 20.24 | 19.37 |
the:DT | 0.00 | 20.50 | 20.00 | 10.00 | 20.50 | 20.00 | 10.00 |
Songkran_Festival:NNP | 20.50 | 0.00 | 15.46 | 20.50 | 16.34 | 9.12 | 20.50 |
.:. | 10.00 | 20.50 | 20.00 | 10.00 | 20.50 | 19.99 | 0.00 |
NO_WORD | 1.00 | 10.00 | 1.00 | 1.00 | 9.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -15.7318
Features matched: Adjunct.dropPosCxt: text adjunct "13-17" of "April" dropped on aligned hyp word "holiday"; NullPunisher.aux: is; NullPunisher.article: a; RootEntailment.poorlyAlignedRoot: "holiday" aligned badly to "April"; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "closed vs. hyp "." <-punct-- "holiday", which aligned to text "April" args have different parents, different relations: text "Songkran_Festival" <-prep_during-- "closed" vs. hyp "Songkran_Festival" <-nsubj-- "holiday", which aligned to text "April" args have different parents, different relations: text "Swedish" <-nn-- "Embassy" vs. hyp "Swedish" <-amod-- "holiday", which aligned to text "April" noun args have different parents but same relations: "Swedish": "Embassy" vs. "holiday"
Hand-tuned score: -3.6500
Threshold: -11.4590
Txt: The Nut Sampler set includes two 8 ounce boxes of chocolate covered nuts.
Hyp: This set does contain milk_chocolate covered nuts . (Unknown.)
This DT |
set NN |
does VBZ |
contain VB |
milk_chocolate JJ |
covered VBN |
nuts NNS |
. . |
|
The:DT | 10.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
Nut_Sampler:NNP | 20.50 | 5.65 | 14.55 | 15.46 | 10.94 | 13.73 | 5.65 | 20.50 |
set:NN | 20.00 | 0.00 | 12.81 | 12.79 | 10.81 | 6.83 | 0.31 | 18.94 |
includes:VBZ | 20.00 | 11.46 | 9.55 | 6.26 | 9.87 | 5.64 | 15.00 | 19.56 |
two:CD | 20.50 | 18.34 | 20.50 | 19.56 | 19.84 | 19.92 | 18.34 | 19.42 |
8:CD | 20.50 | 18.34 | 20.50 | 20.17 | 19.84 | 19.42 | 18.34 | 19.96 |
ounce:NN | 20.00 | 7.84 | 10.17 | 14.37 | 11.34 | 13.95 | 7.84 | 19.86 |
boxes:NNS | 20.00 | 7.12 | 13.11 | 12.66 | 10.76 | 9.85 | 7.84 | 18.97 |
chocolate:NN | 20.00 | 8.69 | 14.34 | 14.77 | 2.00 | 11.75 | 3.08 | 19.54 |
covered:VBN | 20.00 | 6.83 | 8.49 | 4.58 | 10.81 | 0.00 | 11.46 | 19.33 |
nuts:NNS | 20.00 | 0.31 | 13.11 | 15.00 | 10.44 | 11.46 | 0.00 | 19.80 |
.:. | 10.00 | 18.94 | 20.00 | 18.50 | 20.00 | 19.33 | 19.80 | 0.00 |
NO_WORD | 10.00 | 10.00 | 1.00 | 10.00 | 9.00 | 10.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -23.2614
Features matched: Adjunct.dropPosCxt: text adjunct "Nut_Sampler" of "set" dropped on aligned hyp word "set"; NullPunisher.aux: does; NullPunisher.other: This; RootEntailment.poorlyAlignedRoot: "contain" aligned badly to "includes"; Structure.parentsMismatch: args have different parents, different relations: text "nuts" <-prep_of-- "boxes" vs. hyp "nuts" <-dobj-- "contain", which aligned to text "includes"
Hand-tuned score: -4.5500
Threshold: -11.4590
Txt: The Mini Mac was introduced by Apple CEO Steve Jobs at his keynote address on January 11.
Hyp: Apple Computer did release a new Macintosh in January . (Yes.)
Apple_Computer NNP |
did VBD |
release VB |
a DT |
new JJ |
Macintosh NNP |
January NNP |
. . |
|
The:DT | 20.50 | 20.00 | 20.00 | 10.00 | 20.00 | 20.50 | 20.50 | 10.00 |
Mini_Mac:NNP | 9.02 | 15.46 | 13.47 | 20.50 | 12.46 | 5.42 | 14.96 | 20.50 |
was:VBD | 15.17 | 10.00 | 9.34 | 20.00 | 11.96 | 14.84 | 15.50 | 20.00 |
introduced:VBN | 15.46 | 6.26 | 6.87 | 20.00 | 10.51 | 15.50 | 15.50 | 20.00 |
Apple:NNP | 0.00 | 15.50 | 14.45 | 20.50 | 12.46 | 8.95 | 15.00 | 20.50 |
CEO:NNP | 9.52 | 15.00 | 13.95 | 20.00 | 11.96 | 9.45 | 10.50 | 20.00 |
Steve_Jobs:NNP | 14.02 | 14.34 | 13.89 | 20.50 | 12.46 | 14.02 | 14.34 | 20.50 |
his:PRP$ | 12.50 | 15.00 | 15.00 | 20.00 | 15.00 | 12.50 | 12.50 | 20.00 |
keynote_address:NN | 10.17 | 13.38 | 13.35 | 20.00 | 11.96 | 10.17 | 9.84 | 20.00 |
January:NNP | 14.96 | 14.19 | 14.19 | 20.50 | 12.46 | 15.00 | 0.00 | 20.50 |
11:CD | 24.96 | 19.19 | 19.19 | 20.50 | 20.30 | 25.00 | 17.84 | 19.37 |
.:. | 20.50 | 17.99 | 19.68 | 10.00 | 20.00 | 20.50 | 20.50 | 0.00 |
NO_WORD | 10.00 | 1.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -26.2923
Features matched: Adjunct.addPosCxt: hyp added new[new-JJ]; Adjunct.dropPosCxt: text adjunct "11" of "January" dropped on aligned hyp word "January"; Date.matchDatesByGraph: hyp/txt matching, by graph: January and children; NullPunisher.article: a; NullPunisher.other: new; NullPunisher.aux: did; Quant.contract: [the,a]; RootEntailment.poorlyAlignedRoot: "release" aligned badly to "introduced"; Structure.relMismatch: text "January" is prep_on of "introduced" while hyp "January" is prep_in of "release" which aligned to text "introduced"
Hand-tuned score: -1.6500
Threshold: -11.4590
Txt: The Donald criticized President Bush over his decision to go to war with Iraq.
Hyp: Trump does support W 's decision to go_to_war . (No.)
Trump NNP |
does VBZ |
support VB |
W NNP |
decision NN |
go_to_war NN |
. . |
|
The:DT | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
Donald:NNP | 14.96 | 15.46 | 15.46 | 10.46 | 10.46 | 10.46 | 20.50 |
criticized:VBD | 15.50 | 10.00 | 6.01 | 15.00 | 13.36 | 14.96 | 19.96 |
President_Bush:NNP | 9.45 | 13.11 | 13.51 | 9.34 | 9.18 | 8.86 | 20.00 |
his:PRP$ | 12.50 | 15.00 | 15.00 | 12.00 | 12.00 | 12.00 | 20.00 |
decision:NN | 10.50 | 14.46 | 12.38 | 8.69 | 0.00 | 8.84 | 18.75 |
to:TO | 20.50 | 17.95 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
go_to_go_to_war:VB | 11.79 | 9.67 | 7.80 | 13.92 | 13.84 | 5.00 | 20.00 |
Iraq:NNP | 14.34 | 14.84 | 14.84 | 9.84 | 10.50 | 10.17 | 20.50 |
.:. | 20.50 | 20.00 | 18.74 | 20.00 | 18.75 | 20.00 | 0.00 |
NO_WORD | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -36.0073
Features matched: Adjunct.dropPosCxt: text adjunct "Iraq" of "go_to_go_to_war" dropped on aligned hyp word "go_to_war"; NullPunisher.other: Trump; NullPunisher.other: W; NullPunisher.aux: does; RootEntailment.poorlyAlignedRoot: "support" aligned badly to "criticized"; Structure.parentsMismatch: args have different parents, different relations: text "decision" <-prep_over-- "President_Bush" vs. hyp "decision" <-dobj-- "support", which aligned to text "criticized" args have different parents, different relations: text "go_to_go_to_war" <-infmod-- "decision" vs. hyp "go_to_war" <-prep_to-- "support", which aligned to text "criticized"
Hand-tuned score: -5.5500
Threshold: -11.4590
Txt: LEDs can last ten years, whereas an incandescent bulb typically lasts 5000 hours
Hyp: Incandescent bulbs do last longer than LEDs . (No.)
Incandescent JJ |
bulbs NNS |
do VBP |
last RB |
longer RB |
LEDs NNP |
. . |
|
LEDs:NNS | 11.96 | 6.40 | 15.00 | 11.40 | 13.95 | 0.00 | 20.00 |
can:MD | 19.96 | 16.69 | 15.67 | 17.12 | 18.95 | 17.12 | 10.00 |
last:VB | 11.96 | 11.40 | 7.63 | 0.00 | 18.95 | 11.40 | 20.00 |
ten:NN | 11.96 | 9.68 | 13.69 | 11.46 | 12.31 | 10.00 | 19.42 |
years:NNS | 11.96 | 8.69 | 13.69 | 12.84 | 14.54 | 10.00 | 19.49 |
,:, | 20.00 | 20.00 | 19.52 | 20.00 | 19.51 | 20.00 | 5.73 |
whereas:IN | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 |
an:DT | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
incandescent:NN | 2.00 | 6.14 | 14.96 | 14.96 | 14.35 | 9.96 | 19.79 |
bulb:NN | 11.96 | 0.50 | 13.66 | 11.40 | 13.95 | 6.40 | 20.00 |
typically:RB | 11.96 | 14.28 | 19.58 | 9.96 | 5.29 | 14.96 | 19.79 |
lasts:VBZ | 11.96 | 11.40 | 7.69 | 10.31 | 14.40 | 11.40 | 19.71 |
5000:JJ | 9.96 | 11.40 | 11.46 | 11.96 | 11.96 | 11.96 | 20.00 |
hours:NNS | 11.96 | 7.89 | 13.69 | 12.84 | 12.46 | 10.00 | 19.95 |
NO_WORD | 9.00 | 10.00 | 10.00 | 9.00 | 9.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -29.2083
Features matched: RootEntailment.poorlyAlignedRoot: "do" aligned badly to "lasts"; Structure.relMismatch: text "LEDs" is nsubj of "last" while hyp "LEDs" is prep_than of "do" which aligned to text "lasts"
Hand-tuned score: -2.0000
Threshold: -11.4590
Txt: The "Just Picture It" workshop, sponsored by Oceanside Photo and Video, Inc. and taught by portrait photographer Gale Carlson, has received rave reviews.
Hyp: Is `` Just Picture It '' a photography workshop ? (Yes.)
Is VBZ |
`` `` |
Just RB |
Picture VBG |
It PRP |
'' '' |
a DT |
photography NN |
workshop NN |
? . |
|
The:DT | 20.00 | 10.00 | 20.00 | 20.00 | 20.00 | 10.00 | 10.00 | 20.00 | 20.00 | 10.00 |
``:`` | 20.00 | 0.00 | 20.00 | 20.00 | 20.00 | 1.06 | 10.00 | 20.00 | 20.00 | 10.00 |
Just:RB | 19.96 | 20.00 | 0.00 | 19.96 | 15.00 | 20.00 | 20.00 | 14.96 | 14.96 | 20.00 |
Picture:VBG | 9.34 | 20.00 | 19.96 | 0.00 | 15.00 | 20.00 | 20.00 | 15.00 | 14.34 | 20.00 |
It:PRP | 15.00 | 20.00 | 20.00 | 15.00 | 0.00 | 20.00 | 20.00 | 12.00 | 12.00 | 20.00 |
'':'' | 20.00 | 1.06 | 20.00 | 20.00 | 20.00 | 0.00 | 10.00 | 20.00 | 19.89 | 9.33 |
workshop:NN | 14.34 | 20.00 | 14.96 | 14.34 | 12.00 | 19.89 | 20.00 | 7.71 | 0.00 | 20.00 |
,:, | 20.00 | 6.83 | 20.00 | 20.00 | 20.00 | 6.68 | 10.00 | 19.37 | 19.76 | 10.00 |
sponsored:VBN | 9.34 | 20.00 | 19.96 | 8.95 | 15.00 | 19.76 | 20.00 | 14.25 | 10.75 | 19.75 |
Oceanside_Photo_and_Video_,_Inc._and:NNP | 15.17 | 20.50 | 15.46 | 11.19 | 12.50 | 20.50 | 20.50 | 10.46 | 10.17 | 20.50 |
taught:VBN | 7.74 | 19.47 | 19.96 | 8.95 | 15.00 | 20.00 | 20.00 | 14.25 | 10.96 | 20.00 |
portrait:NN | 14.34 | 20.00 | 14.96 | 7.08 | 12.00 | 20.00 | 20.00 | 7.62 | 7.79 | 20.00 |
photographer:NN | 14.34 | 18.38 | 14.96 | 13.95 | 12.00 | 18.09 | 20.00 | 5.00 | 7.78 | 20.00 |
Gale_Carlson:NNP | 15.46 | 20.50 | 15.46 | 15.46 | 12.50 | 20.50 | 20.50 | 8.97 | 10.46 | 20.50 |
,:, | 20.50 | 7.33 | 20.50 | 20.50 | 20.50 | 7.18 | 10.50 | 19.87 | 20.26 | 10.50 |
has:VBZ | 9.34 | 20.00 | 19.96 | 8.69 | 15.00 | 20.00 | 20.00 | 15.00 | 12.52 | 20.00 |
received:VBN | 10.00 | 20.00 | 19.96 | 10.00 | 15.00 | 20.00 | 20.00 | 15.00 | 14.47 | 20.00 |
rave:JJ | 9.74 | 18.71 | 11.96 | 8.36 | 15.00 | 19.62 | 20.00 | 10.71 | 7.13 | 19.37 |
reviews:NNS | 14.34 | 18.81 | 14.96 | 10.45 | 12.00 | 18.20 | 20.00 | 6.36 | 7.29 | 20.00 |
.:. | 20.00 | 7.32 | 20.00 | 20.00 | 20.00 | 6.80 | 10.00 | 20.00 | 20.00 | 10.00 |
NO_WORD | 1.00 | 10.00 | 9.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -25.0000
Features matched: Adjunct.dropPosCxt: text adjunct "sponsored" of "workshop" dropped on aligned hyp word "workshop"; NullPunisher.article: a; NullPunisher.other: ?; NullPunisher.other: Is; Quant.contract: [the,a]; Structure.argsMismatch: args have different parents but same relations: text "``" <-punct-- "workshop" vs. hyp "``" <-punct-- "Picture", which aligned to text "Picture" args have different parents but same relations: text "''" <-punct-- "workshop" vs. hyp "''" <-punct-- "Picture", which aligned to text "Picture" args have different parents but same relations: text "workshop" <-nsubjpass-- "sponsored" vs. hyp "workshop" <-dobj-- "Picture", which aligned to text "Picture" args have different parents, different relations: text "workshop" <-nsubj-- "received" vs. hyp "workshop" <-dobj-- "Picture", which aligned to text "Picture"
Hand-tuned score: -2.6000
Threshold: -11.4590
Txt: The equatorial diameter of Saturn is 10% larger than its polar diameter.
Hyp: Saturn is an ellipsoid . (Yes.)
Saturn NNP |
is VBZ |
an DT |
ellipsoid NN |
. . |
|
The:DT | 20.50 | 20.00 | 10.00 | 20.00 | 10.00 |
equatorial:JJ | 12.46 | 11.96 | 20.00 | 11.96 | 20.00 |
diameter:NN | 10.50 | 15.00 | 20.00 | 5.41 | 19.61 |
Saturn:NNP | 0.00 | 14.84 | 20.50 | 10.50 | 20.50 |
is:VBZ | 14.84 | 0.00 | 20.00 | 15.00 | 20.00 |
10:CD | 25.00 | 20.50 | 20.50 | 19.19 | 19.16 |
%:NN | 15.00 | 15.50 | 20.50 | 10.50 | 20.50 |
larger:JJR | 12.46 | 11.96 | 20.00 | 11.96 | 17.97 |
its:PRP$ | 12.50 | 13.00 | 20.00 | 12.00 | 20.00 |
polar:JJ | 12.46 | 11.96 | 20.00 | 11.96 | 20.00 |
diameter:NN | 10.50 | 15.00 | 20.00 | 5.41 | 19.61 |
.:. | 20.50 | 20.00 | 10.00 | 20.00 | 0.00 |
NO_WORD | 10.00 | 1.00 | 1.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -11.4140
Features matched: Adjunct.dropPosCxt: text adjunct "polar" of "diameter" dropped on aligned hyp word "ellipsoid"; NullPunisher.aux: is; NullPunisher.article: an; RootEntailment.poorlyAlignedRoot: "ellipsoid" aligned badly to "diameter"; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "larger" vs. hyp "." <-punct-- "ellipsoid", which aligned to text "diameter" text "Saturn" is prep_of of "diameter" while hyp "Saturn" is nsubj of "ellipsoid" which aligned to text "diameter"
Hand-tuned score: -3.6500
Threshold: -11.4590
Txt: The equatorial diameter of Saturn is 10% larger than its polar diameter.
Hyp: Saturn is a sphere . (No.)
Saturn NNP |
is VBZ |
a DT |
sphere NN |
. . |
|
The:DT | 20.50 | 20.00 | 10.00 | 20.00 | 10.00 |
equatorial:JJ | 12.46 | 11.96 | 20.00 | 11.22 | 20.00 |
diameter:NN | 10.50 | 15.00 | 20.00 | 5.41 | 19.61 |
Saturn:NNP | 0.00 | 14.84 | 20.50 | 7.97 | 20.50 |
is:VBZ | 14.84 | 0.00 | 20.00 | 14.34 | 20.00 |
10:CD | 25.00 | 20.50 | 20.50 | 19.19 | 19.16 |
%:NN | 15.00 | 15.50 | 20.50 | 10.08 | 20.50 |
larger:JJR | 12.46 | 11.96 | 20.00 | 11.96 | 17.97 |
its:PRP$ | 12.50 | 13.00 | 20.00 | 12.00 | 20.00 |
polar:JJ | 12.46 | 11.96 | 20.00 | 9.25 | 20.00 |
diameter:NN | 10.50 | 15.00 | 20.00 | 5.41 | 19.61 |
.:. | 20.50 | 20.00 | 10.00 | 20.00 | 0.00 |
NO_WORD | 10.00 | 1.00 | 1.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -11.4140
Features matched: Adjunct.dropPosCxt: text adjunct "equatorial" of "diameter" dropped on aligned hyp word "sphere"; NullPunisher.article: a; NullPunisher.aux: is; Quant.contract: [the,a]; RootEntailment.poorlyAlignedRoot: "sphere" aligned badly to "diameter"; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "larger" vs. hyp "." <-punct-- "sphere", which aligned to text "diameter" text "Saturn" is prep_of of "diameter" while hyp "Saturn" is nsubj of "sphere" which aligned to text "diameter"
Hand-tuned score: -2.6500
Threshold: -11.4590
Txt: Charlotte Jones gave birth to a healthy baby boy, Johnathan Daniel, on March 14th, 2004.
Hyp: Charlotte Jones was pregnant on December 14 , 2003 . (Yes.)
Charlotte_Jones NNS |
was VBD |
pregnant JJ |
December NNP |
14 CD |
, , |
2003 CD |
. . |
|
Charlotte_Jones:NNS | 5.00 | 13.67 | 12.46 | 14.96 | 24.96 | 25.00 | 24.96 | 20.50 |
gave_birth:VBD | 15.46 | 10.00 | 11.96 | 14.42 | 19.42 | 20.50 | 20.46 | 20.00 |
a:DT | 20.50 | 20.00 | 20.00 | 20.50 | 20.50 | 10.50 | 20.50 | 10.00 |
healthy:JJ | 12.46 | 11.96 | 6.37 | 12.46 | 20.37 | 20.50 | 20.46 | 19.46 |
baby:NN | 9.52 | 14.34 | 5.63 | 10.50 | 20.50 | 19.20 | 20.46 | 19.63 |
boy:NN | 9.52 | 14.34 | 6.78 | 10.50 | 20.50 | 19.17 | 20.46 | 19.66 |
,:, | 25.00 | 20.50 | 19.69 | 20.00 | 18.78 | 0.00 | 19.45 | 6.23 |
Johnathan_Daniel:NNP | 12.99 | 15.17 | 12.46 | 14.34 | 24.34 | 25.00 | 24.96 | 20.50 |
,:, | 25.00 | 20.50 | 19.69 | 20.00 | 18.78 | 0.00 | 19.45 | 6.23 |
March:NNP | 13.19 | 11.88 | 12.46 | 4.57 | 17.84 | 20.00 | 19.96 | 20.50 |
14th:CD | 24.96 | 20.46 | 20.46 | 19.96 | 5.00 | 20.00 | 5.00 | 20.45 |
,:, | 25.00 | 20.50 | 19.69 | 20.00 | 18.78 | 0.00 | 19.45 | 6.23 |
2004:CD | 24.96 | 20.46 | 20.15 | 19.96 | 4.96 | 19.10 | 0.00 | 20.50 |
.:. | 20.50 | 20.00 | 19.79 | 20.50 | 19.34 | 6.23 | 20.50 | 0.00 |
NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -27.1610
Features matched: Adjunct.dropPosCxt: text adjunct "14th" of "March" dropped on aligned hyp word "December"; Date.dateHeadMismatch: December vs. March; NullPunisher.aux: was; RootEntailment.poorlyAlignedRoot: "pregnant" aligned badly to "baby"; Structure.argsMismatch: args have different parents but same relations: text "Charlotte_Jones" <-nsubj-- "gave_birth" vs. hyp "Charlotte_Jones" <-nsubj-- "pregnant", which aligned to text "baby" args have different parents but same relations: text "March" <-prep_on-- "gave_birth" vs. hyp "December" <-prep_on-- "pregnant", which aligned to text "baby" args have different parents but same relations: text "." <-punct-- "gave_birth" vs. hyp "." <-punct-- "pregnant", which aligned to text "baby"
Hand-tuned score: -6.5500
Threshold: -11.4590
Txt: Keikaimalu, a wholphin (whale-dolphin hybrid), has given birth to female calf.
Hyp: Keikaimalu did gave_birth to a ruminant . (No.)
Keikaimalu NNP |
did VBD |
gave_birth NN |
a DT |
ruminant NN |
. . |
|
Keikaimalu:NNP | 0.00 | 15.46 | 10.46 | 20.50 | 10.46 | 20.50 |
,:, | 20.50 | 19.80 | 20.00 | 10.00 | 20.00 | 5.73 |
a:DT | 20.50 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
wholphin:NN | 10.46 | 14.96 | 9.96 | 20.00 | 9.96 | 20.00 |
-LRB-:-LRB- | 20.50 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
whale-dolphin:JJ | 12.46 | 11.51 | 11.96 | 20.00 | 11.96 | 20.00 |
hybrid:NN | 10.46 | 11.24 | 9.02 | 20.00 | 8.11 | 19.87 |
-RRB-:-RRB- | 20.50 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
,:, | 20.50 | 19.80 | 20.00 | 10.00 | 20.00 | 5.73 |
has:VBZ | 15.46 | 7.53 | 13.76 | 20.00 | 14.34 | 20.00 |
given_birth:VBN | 15.46 | 7.32 | 5.00 | 20.00 | 15.00 | 20.00 |
female:JJ | 12.46 | 12.00 | 12.00 | 20.00 | 7.76 | 20.00 |
calf:NN | 10.46 | 13.64 | 10.00 | 20.00 | 5.76 | 20.00 |
.:. | 20.50 | 17.99 | 20.00 | 10.00 | 20.00 | 0.00 |
NO_WORD | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -21.0762
Features matched: Adjunct.dropPosCxt: text adjunct "female" of "calf" dropped on aligned hyp word "ruminant"; NullPunisher.article: a; RootEntailment.poorlyAlignedRoot: "did" aligned badly to "given_birth"
Hand-tuned score: -0.6000
Threshold: -11.4590
Txt: A wholphin is a whale-dolphin hybrid.
Hyp: The term does `` wholphin '' refer to a type of hybrid . (Yes.)
The DT |
term NN |
does VBZ |
`` `` |
wholphin NN |
'' '' |
refer VBP |
a DT |
type NN |
hybrid NN |
. . |
|
A:DT | 10.00 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 | 20.00 | 0.00 | 20.00 | 20.00 | 10.00 |
wholphin:NN | 20.00 | 9.96 | 14.96 | 20.00 | 0.00 | 20.00 | 14.96 | 20.00 | 9.96 | 9.96 | 20.00 |
is:VBZ | 20.00 | 14.34 | 9.34 | 20.00 | 14.96 | 20.00 | 5.51 | 20.00 | 12.74 | 14.34 | 20.00 |
a:DT | 10.00 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 | 20.00 | 0.00 | 20.00 | 20.00 | 10.00 |
whale-dolphin:JJ | 20.00 | 11.96 | 11.21 | 20.00 | 7.00 | 19.85 | 11.13 | 20.00 | 11.34 | 9.67 | 20.00 |
hybrid:NN | 20.00 | 5.91 | 13.05 | 20.00 | 9.96 | 20.00 | 14.48 | 20.00 | 6.67 | 0.00 | 19.87 |
.:. | 10.00 | 19.66 | 20.00 | 7.32 | 20.00 | 6.80 | 19.97 | 10.00 | 17.87 | 19.87 | 0.00 |
NO_WORD | 1.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -58.5544
Features matched: Adjunct.dropPosCxt: text adjunct "whale-dolphin" of "hybrid" dropped on aligned hyp word "hybrid"; NullPunisher.article: The; Quant.contract: [a,the]; Quant.contract: [a,a]; RootEntailment.poorlyAlignedRoot: "does" aligned badly to "is"; Structure.argsMismatch: args have different parents but same relations: text "." <-punct-- "hybrid" vs. hyp "." <-punct-- "does", which aligned to text "is" args have different parents, different relations: text "is" <-cop-- "hybrid" vs. hyp "refer" <-ccomp-- "does", which aligned to text "is"
Hand-tuned score: -1.6000
Threshold: -11.4590
Txt: Marilyn Connors had a very difficult pregnancy and died in childbirth yesterday.
Hyp: Marilyn Connors is pregnant . (No.)
Marilyn_Connors NNS |
is VBZ |
pregnant JJ |
. . |
|
Marilyn_Connors:NNS | 0.00 | 15.17 | 12.46 | 20.50 |
had:VBD | 14.52 | 7.80 | 11.96 | 20.00 |
a:DT | 20.50 | 20.00 | 20.00 | 10.00 |
very:RB | 15.46 | 19.96 | 11.96 | 20.00 |
difficult:JJ | 12.46 | 11.96 | 9.96 | 17.00 |
pregnancy:NN | 10.46 | 15.00 | 3.75 | 19.77 |
died:VBD | 14.97 | 8.07 | 7.91 | 20.00 |
childbirth:NN | 10.46 | 15.00 | 5.16 | 20.00 |
yesterday:NN | 10.46 | 15.00 | 11.96 | 18.11 |
.:. | 20.50 | 20.00 | 19.79 | 0.00 |
NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -8.7539
Features matched: Adjunct.dropPosCxt: text adjunct "difficult" of "pregnancy" dropped on aligned hyp word "pregnant"; NullPunisher.aux: is; Structure.argsMismatch: args have different parents but same relations: text "Marilyn_Connors" <-nsubj-- "had" vs. hyp "Marilyn_Connors" <-nsubj-- "pregnant", which aligned to text "pregnancy" args have different parents but same relations: text "." <-punct-- "had" vs. hyp "." <-punct-- "pregnant", which aligned to text "pregnancy"
Hand-tuned score: -1.5500
Threshold: -11.4590
Txt: Serena Williams is a great tennis player, perhaps one of the best ever.
Hyp: Serena Williams does play tennis well . (Yes.)
Serena_Williams NNS |
does VBZ |
play NN |
tennis NN |
well RB |
. . |
|
Serena_Williams:NNS | 5.00 | 14.55 | 10.46 | 10.46 | 14.97 | 20.50 |
is:VBZ | 15.17 | 9.34 | 13.07 | 15.00 | 19.34 | 20.00 |
a:DT | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
great:JJ | 12.46 | 11.01 | 8.17 | 10.79 | 11.96 | 17.92 |
tennis_player:NN | 7.17 | 13.11 | 5.00 | 0.00 | 13.95 | 20.00 |
,:, | 20.50 | 19.26 | 19.21 | 20.00 | 20.00 | 5.73 |
perhaps:RB | 15.46 | 19.96 | 14.96 | 14.96 | 9.96 | 20.00 |
one:CD | 24.96 | 20.50 | 17.49 | 20.50 | 19.19 | 20.50 |
the:DT | 20.50 | 18.65 | 20.00 | 20.00 | 20.00 | 10.00 |
best:JJS | 11.52 | 9.70 | 7.41 | 8.49 | 10.95 | 18.78 |
ever:RB | 15.46 | 17.36 | 14.96 | 14.96 | 9.96 | 20.00 |
.:. | 20.50 | 20.00 | 18.13 | 20.00 | 20.00 | 0.00 |
NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 9.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -36.1080
Features matched: Adjunct.addPosCxt: hyp added well[well-RB]; Adjunct.dropPosCxt: text adjunct "ever" of "tennis_player" dropped on aligned hyp word "tennis"; Modal.dontKnow: possible -> actual; NullPunisher.other: well; RootEntailment.poorlyAlignedRoot: "does" aligned badly to "tennis_player"
Hand-tuned score: -4.5000
Threshold: -11.4590
Txt: War and Peace is a very long novel.
Hyp: War and Peace does contain more pages than the average novel . (Yes.)
War NNP |
Peace NNP |
does VBZ |
contain VB |
more JJR |
pages NNS |
the DT |
average JJ |
novel NN |
. . |
|
War:NNP | 0.00 | 7.94 | 15.00 | 15.00 | 12.00 | 10.00 | 20.00 | 12.00 | 10.00 | 20.00 |
Peace:NNP | 7.94 | 0.00 | 15.50 | 15.50 | 12.50 | 10.50 | 20.50 | 10.40 | 7.27 | 20.50 |
is:VBZ | 15.00 | 15.50 | 9.34 | 8.33 | 11.34 | 12.34 | 20.00 | 10.33 | 14.34 | 20.00 |
a:DT | 20.00 | 20.50 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 10.00 |
very:RB | 14.96 | 15.46 | 19.96 | 19.96 | 11.96 | 14.96 | 20.00 | 11.96 | 14.96 | 20.00 |
long:JJ | 12.00 | 11.19 | 11.33 | 12.00 | 8.25 | 12.00 | 20.00 | 10.00 | 10.69 | 17.94 |
novel:NN | 10.00 | 7.27 | 13.95 | 13.75 | 10.95 | 6.58 | 20.00 | 12.00 | 0.00 | 19.66 |
.:. | 20.00 | 20.50 | 20.00 | 18.50 | 20.00 | 20.00 | 10.00 | 19.31 | 19.66 | 0.00 |
NO_WORD | 10.00 | 10.00 | 1.00 | 10.00 | 9.00 | 10.00 | 1.00 | 9.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -42.1560
Features matched: Adjunct.addPosCxt: hyp added average[average-JJ]; Adjunct.dropPosCxt: text adjunct "very" of "long" dropped on aligned hyp word "more"; NullPunisher.aux: does; NullPunisher.article: the; NullPunisher.other: average; Quant.contract: [a,the]; RootEntailment.poorlyAlignedRoot: "contain" aligned badly to "is"; Structure.argsMismatch: args have different parents but same relations: text "War" <-nsubj-- "novel" vs. hyp "War" <-nsubj-- "contain", which aligned to text "is" args have different parents but same relations: text "." <-punct-- "novel" vs. hyp "." <-punct-- "contain", which aligned to text "is"
Hand-tuned score: -4.6500
Threshold: -11.4590
Txt: Ambassador Richard C. Holbrooke will visit Paris, France from Saturday, April 22 until Tuesday, April 25.
Hyp: Ambassador Holbrooke will be in Paris on April 24th . (Yes.)
Ambassador NNP |
Holbrooke NNP |
will MD |
be VB |
Paris NNP |
April NNP |
24th JJ |
. . |
|
Ambassador:NNP | 0.00 | 9.29 | 20.00 | 14.34 | 9.84 | 10.50 | 12.46 | 20.00 |
Richard_C._Holbrooke:NNP | 10.46 | 0.00 | 20.46 | 15.46 | 14.96 | 14.96 | 16.96 | 20.50 |
will:MD | 20.00 | 20.46 | 0.00 | 20.00 | 20.50 | 19.19 | 20.46 | 10.00 |
visit:VB | 15.00 | 15.46 | 20.00 | 7.74 | 13.63 | 15.50 | 11.05 | 20.00 |
Paris:NNP | 9.84 | 14.96 | 20.50 | 14.84 | 0.00 | 15.00 | 16.96 | 20.50 |
,:, | 20.50 | 25.00 | 10.50 | 20.50 | 25.00 | 20.00 | 20.00 | 6.23 |
France:NNP | 8.55 | 14.96 | 20.50 | 14.84 | 2.00 | 15.00 | 16.96 | 20.50 |
Saturday:NNP | 10.50 | 14.96 | 19.19 | 15.50 | 15.00 | 7.23 | 11.96 | 20.50 |
,:, | 20.50 | 25.00 | 10.50 | 20.50 | 25.00 | 20.00 | 20.00 | 6.23 |
April:NNP | 10.50 | 14.96 | 19.19 | 15.50 | 15.00 | 0.00 | 11.96 | 20.50 |
22:CD | 20.50 | 24.96 | 19.19 | 20.50 | 25.00 | 17.84 | 19.56 | 19.32 |
Tuesday:NNP | 10.50 | 14.96 | 19.19 | 15.50 | 15.00 | 7.23 | 11.96 | 20.50 |
,:, | 20.50 | 25.00 | 10.50 | 20.50 | 25.00 | 20.00 | 20.00 | 6.23 |
April:NNP | 10.50 | 14.96 | 19.19 | 15.50 | 15.00 | 0.00 | 11.96 | 20.50 |
25:CD | 20.50 | 24.96 | 19.19 | 20.50 | 25.00 | 17.84 | 19.96 | 19.52 |
.:. | 20.00 | 20.50 | 10.00 | 20.00 | 20.50 | 20.50 | 20.50 | 0.00 |
NO_WORD | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 10.00 | 9.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -20.7439
Features matched: Adjunct.dropPosCxt: text adjunct "22" of "April" dropped on aligned hyp word "April"; Date.matchDatesByGraph: hyp/txt matching, by graph: April and children; Date.hypDateIns: hypothesis date insertion: 24th; NullPunisher.other: 24th; RootEntailment.poorlyAlignedRoot: "be" aligned badly to "visit"; Structure.relMismatch: text "Paris" is dobj of "visit" while hyp "Paris" is prep_in of "be" which aligned to text "visit"
Hand-tuned score: -3.5000
Threshold: -11.4590
Txt: The Island Nut Sampler includes an 8 ounce box of milk chocolate covered macadamia nuts and an 8 ounce box of white chocolate covered macadamia nuts.
Hyp: It does include some dark-chocolate covered macadamia nuts . (No.)
It PRP |
does VBZ |
include VB |
some DT |
dark-chocolate JJ |
covered JJ |
macadamia NN |
nuts NNS |
. . |
|
The:DT | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
Island:NNP | 12.50 | 14.45 | 15.50 | 20.50 | 12.46 | 11.45 | 9.45 | 9.45 | 20.50 |
Nut:NNP | 12.50 | 13.61 | 15.50 | 20.50 | 12.46 | 8.96 | 8.61 | 0.81 | 20.50 |
Sampler:NNP | 12.50 | 13.61 | 15.50 | 20.50 | 12.46 | 10.53 | 8.61 | 8.53 | 20.50 |
includes:VBZ | 15.00 | 9.55 | 0.50 | 20.00 | 11.96 | 7.64 | 14.87 | 15.00 | 19.56 |
an:DT | 20.00 | 17.89 | 20.00 | 10.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
8:CD | 20.50 | 20.50 | 20.50 | 20.50 | 19.13 | 19.42 | 20.50 | 18.34 | 19.96 |
ounce:NN | 12.00 | 10.17 | 15.00 | 20.00 | 11.66 | 10.95 | 8.11 | 7.84 | 19.86 |
box:NN | 12.00 | 13.08 | 12.42 | 20.00 | 8.60 | 6.85 | 4.40 | 7.81 | 19.82 |
milk_chocolate:NN | 12.00 | 14.34 | 12.87 | 20.00 | 7.00 | 10.81 | 8.53 | 8.44 | 20.00 |
covered:VBN | 15.00 | 8.49 | 5.64 | 20.00 | 8.32 | 0.00 | 13.95 | 11.46 | 19.33 |
macadamia:NN | 12.00 | 13.11 | 13.91 | 20.00 | 7.79 | 10.95 | 0.00 | 5.49 | 19.94 |
nuts:NNS | 12.00 | 13.11 | 15.00 | 20.00 | 6.73 | 8.46 | 5.49 | 0.00 | 19.80 |
an:DT | 20.00 | 17.89 | 20.00 | 10.00 | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 |
8:CD | 20.50 | 20.50 | 20.50 | 20.50 | 19.13 | 19.42 | 20.50 | 18.34 | 19.96 |
ounce:NN | 12.00 | 10.17 | 15.00 | 20.00 | 11.66 | 10.95 | 8.11 | 7.84 | 19.86 |
box:NN | 12.00 | 13.08 | 12.42 | 20.00 | 8.60 | 6.85 | 4.40 | 7.81 | 19.82 |
white_chocolate:NN | 12.00 | 14.05 | 13.81 | 20.00 | 7.00 | 9.89 | 9.05 | 8.44 | 20.00 |
covered:VBD | 15.00 | 8.49 | 5.64 | 20.00 | 8.32 | 0.00 | 13.95 | 11.46 | 19.33 |
macadamia:NN | 12.00 | 13.11 | 13.91 | 20.00 | 7.79 | 10.95 | 0.00 | 5.49 | 19.94 |
nuts:NNS | 12.00 | 13.11 | 15.00 | 20.00 | 6.73 | 8.46 | 5.49 | 0.00 | 19.80 |
.:. | 20.00 | 20.00 | 19.86 | 10.00 | 19.36 | 19.33 | 19.94 | 19.80 | 0.00 |
NO_WORD | 10.00 | 1.00 | 10.00 | 10.00 | 9.00 | 9.00 | 10.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -34.2319
Features matched: NullPunisher.other: It; NullPunisher.other: some; NullPunisher.aux: does; Structure.argsMismatch: args have different parents but same relations: text "nuts" <-dobj-- "covered" vs. hyp "nuts" <-dobj-- "include", which aligned to text "includes"
Hand-tuned score: -4.0500
Threshold: -11.4590
Txt: A quarter of all cellphones have built-in cameras.
Hyp: Do most cell_phones have a lens . (No.)
Do VB |
most JJS |
cell_phones NNS |
have VB |
a DT |
lens NN |
. . |
|
A:DT | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
quarter:NN | 10.67 | 11.96 | 9.02 | 12.61 | 20.00 | 7.90 | 19.74 |
all:DT | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
cellphones:NNS | 15.00 | 11.96 | 1.90 | 13.95 | 20.00 | 3.53 | 20.00 |
have:VBP | 7.32 | 11.96 | 12.80 | 0.00 | 20.00 | 13.95 | 20.00 |
built-in:JJ | 11.96 | 9.96 | 11.96 | 11.96 | 20.00 | 11.96 | 20.00 |
cameras:NNS | 15.00 | 11.96 | 6.77 | 13.95 | 20.00 | 3.53 | 19.47 |
.:. | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 0.00 |
NO_WORD | 10.00 | 9.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (INCORRECT)
Justification:
Alignment score: -26.7570
Features matched: Adjunct.addPosCxt: hyp added most[most-JJS]; Adjunct.dropPosCxt: text adjunct "built-in" of "cameras" dropped on aligned hyp word "lens"; Polarity.hypNegMarker: "most": JJS; NullPunisher.article: a; NullPunisher.other: most; Quant.contract: [all,most]; RootEntailment.poorlyAlignedRoot: "Do" aligned badly to "have"
Hand-tuned score: -1.6000
Threshold: -11.4590
Txt: Three quarters of all cellphones have a built-in camera.
Hyp: Do most cell_phones have a lens . (Yes.)
Do VB |
most JJS |
cell_phones NNS |
have VB |
a DT |
lens NN |
. . |
|
Three:CD | 19.19 | 20.46 | 19.84 | 20.50 | 20.50 | 20.50 | 20.50 |
quarters:NNS | 15.00 | 11.96 | 8.53 | 13.95 | 20.00 | 8.03 | 19.46 |
all:DT | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 10.00 |
cellphones:NNS | 15.00 | 11.96 | 1.90 | 13.95 | 20.00 | 3.53 | 20.00 |
have:VBP | 7.32 | 11.96 | 12.80 | 0.00 | 20.00 | 13.95 | 20.00 |
a:DT | 20.00 | 20.00 | 20.00 | 20.00 | 0.00 | 20.00 | 10.00 |
built-in:JJ | 11.96 | 9.96 | 11.96 | 11.96 | 20.00 | 11.96 | 20.00 |
camera:NN | 15.00 | 11.96 | 6.77 | 13.95 | 20.00 | 3.53 | 18.65 |
.:. | 20.00 | 20.00 | 20.00 | 20.00 | 10.00 | 20.00 | 0.00 |
NO_WORD | 10.00 | 9.00 | 10.00 | 10.00 | 1.00 | 10.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -25.7570
Features matched: Adjunct.addPosCxt: hyp added most[most-JJS]; Adjunct.dropPosCxt: text adjunct "built-in" of "camera" dropped on aligned hyp word "lens"; Polarity.hypNegMarker: "most": JJS; NullPunisher.other: most; Quant.contract: [all,most]; Quant.contract: [a,a]; RootEntailment.poorlyAlignedRoot: "Do" aligned badly to "have"
Hand-tuned score: -0.5000
Threshold: -11.4590
Txt: Marilyn Connors had a very difficult pregnancy and died in childbirth yesterday.
Hyp: Marilyn Connors was pregnant . (Yes.)
Marilyn_Connors NNS |
was VBD |
pregnant JJ |
. . |
|
Marilyn_Connors:NNS | 0.00 | 15.17 | 12.46 | 20.50 |
had:VBD | 14.52 | 9.34 | 11.96 | 20.00 |
a:DT | 20.50 | 20.00 | 20.00 | 10.00 |
very:RB | 15.46 | 19.96 | 11.96 | 20.00 |
difficult:JJ | 12.46 | 11.96 | 9.96 | 17.00 |
pregnancy:NN | 10.46 | 15.00 | 3.75 | 19.77 |
died:VBD | 14.97 | 9.34 | 7.91 | 20.00 |
childbirth:NN | 10.46 | 15.00 | 5.16 | 20.00 |
yesterday:NN | 10.46 | 15.00 | 11.96 | 18.11 |
.:. | 20.50 | 20.00 | 19.79 | 0.00 |
NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -8.7539
Features matched: Adjunct.dropPosCxt: text adjunct "difficult" of "pregnancy" dropped on aligned hyp word "pregnant"; NullPunisher.aux: was; Structure.argsMismatch: args have different parents but same relations: text "Marilyn_Connors" <-nsubj-- "had" vs. hyp "Marilyn_Connors" <-nsubj-- "pregnant", which aligned to text "pregnancy" args have different parents but same relations: text "." <-punct-- "had" vs. hyp "." <-punct-- "pregnant", which aligned to text "pregnancy"
Hand-tuned score: -1.5500
Threshold: -11.4590
Txt: Marilyn Connors had a very difficult pregnancy and died in childbirth yesterday.
Hyp: Marilyn Connors is dead . (Yes.)
Marilyn_Connors NNS |
is VBZ |
dead JJ |
. . |
|
Marilyn_Connors:NNS | 0.00 | 15.17 | 12.46 | 20.50 |
had:VBD | 14.52 | 7.80 | 12.00 | 20.00 |
a:DT | 20.50 | 20.00 | 20.00 | 10.00 |
very:RB | 15.46 | 19.96 | 11.96 | 20.00 |
difficult:JJ | 12.46 | 11.96 | 9.96 | 17.00 |
pregnancy:NN | 10.46 | 15.00 | 9.39 | 19.77 |
died:VBD | 14.97 | 8.07 | 5.77 | 20.00 |
childbirth:NN | 10.46 | 15.00 | 8.86 | 20.00 |
yesterday:NN | 10.46 | 15.00 | 9.84 | 18.11 |
.:. | 20.50 | 20.00 | 18.65 | 0.00 |
NO_WORD | 10.00 | 1.00 | 9.00 | 10.00 |
Response: yes (CORRECT)
Justification:
Alignment score: -10.7745
Features matched: Adjunct.dropPosCxt: text adjunct "yesterday" of "died" dropped on aligned hyp word "dead"; NullPunisher.aux: is; RootEntailment.poorlyAlignedRoot: "dead" aligned badly to "died"; Structure.argsMismatch: args have different parents but same relations: text "Marilyn_Connors" <-nsubj-- "had" vs. hyp "Marilyn_Connors" <-nsubj-- "dead", which aligned to text "died" args have different parents but same relations: text "." <-punct-- "had" vs. hyp "." <-punct-- "dead", which aligned to text "died"
Hand-tuned score: -3.5500
Threshold: -11.4590
java edu.stanford.nlp.rte.WordSimilarityGenerator -info /u/nlp/rte/data/byformat/align/stochastic/cycorp_dev.pipeline.align.xml -output /u/nlp/rte/data/byformat/wordsim/stochastic/cycorp_dev.pipeline.wordsim.html -lex.BasicWN off