This page lists errata for Introduction to Information Retrieval by Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze.
Entries are of the form: p. x, line y, change "A" to "B"
Page numbers are with respect to the printed book = irbookprint.pdf.
Line counts ignore figures, equations and other displayed material.
Negative line counts are from the bottom of the page.
FRONT MATTER
p.xix: Insert in list of Acknowledgments in the paragraph beginning "We are very grateful ...": Björn Andrist, Miklós Erdélyi, Marek Kowalkiewicz, Karthik Raghunathan
p.xx: Insert in list of Acknowledgments in the paragraph beginning "Many people gave us ...": Valentin I. Spitkovsky

CHAPTER 3
p.48, end of 3rd paragraph: "A walk down the reverse B-tree then enumerates all terms $R$ in the vocabulary with a given prefix." should be "A walk down the reverse B-tree then enumerates all terms $R$ in the vocabulary with a given postfix."
p.48, towards end of 4th paragraph: Immediately before the sentence "Finally, we use the standard ...", insert the following new sentence: We scan and filter out any terms that match the prefix as well as the suffix because these two strings overlap (for instance, the query \query{ba*ba} would result in the term \term{ba} being in $W\cap R$; this would be filtered out). FIXING THIS ALSO CHANGED p.49 OF PRINTED BOOK.
p.55, line 2: "given a set ${\cal S}$ of strings" should be "given a set $V$ of strings"

CHAPTER 6
pp.100-123: Throughout Chapter 6, change all occurrences of "non-relevant" to "nonrelevant".
p.103, line 5: "maps the presence of query terms in a zone to ${0,1}$." should be "maps the presence of query terms in a zone to $\{0,1\}$." i.e., the curly braces should be made to print.
p.109, line 2: "gives an example of idfs" should be "gives an example of idf's"
p.112, Figure 6.12: "and" before "Pride and Prejudice" should not be italicized
p.114, Example 6.4, final sentence: "Invoking (6.9) ..." should be "Invoking (6.12) ..."
p.115, line 8: change "the the" to "the" before "array Scores"
p.115, lines 9-10: "When the scores are finally computed in Step 11, all that remains in Step 12 is to pick off the $K$ documents with the highest scores." should be "When the scores are finally computed in Step 9, all that remains in Step 10 is to pick off the $K$ documents with the highest scores."
p.115, line 17: "it seems appear necessary to store" - delete "appear"

CHAPTER 7
p.125, Figure 7.1, line 5: \mbox{calculate }\mbox{w}_{t,q}\mbox{ and fetch postings list for }t\\ should be \mbox{ fetch postings list for }t\\
p.125, line 5: "with line 8 altered since we take" should be "with line 7 altered since we take"
p.132, line 13: "A common solution to this issue is the user of" should be "A common solution to this issue is the use of"
p.134: "Rank them by vector space ..." should be "Rank the results by vector space ..."

CHAPTER 10
p.191, figure 10.10, lower the starting point of the three arrows (the starting point should be on the lower line of the box, not in the box)
p.191, figure 10.10, lower the end point of the second arrow (its end position should be at about 50% of the second box)
p.191, figure 10.10, lower the end point of the third arrow (its end position should be at about 50% of the third box)

CHAPTER 11
p.208, line 10, change "the the" to "the"

CHAPTER 13
p.244, line 5, change "Bernoulli NB models" to "Bernoulli NB model models"

CHAPTER 14
p.273, line 3, change "Theta(|D|V|)" to "Theta(|D||V|)"
p.280, figure 14.10, continue the non-straight solid line all the way to the left bounding line (i.e., to the vertical line at coordinate 0)

CHAPTER 16
p.324, caption of figure 16.2, line 3, change "Cluster" (in italics) to "cluster" (in roman)
p.337, figure 16.8, the figure should depict 9 connected little circles, but it only shows 8. In other words, add a little circle at the end of the last line above 10
p.338, line 19, change "2MK" (roman) to "2MK" (math mode = italics)

CHAPTER 17
p.357, line 9, change "implimentations" to "implementations"

CHAPTER 20
p.415, line 9, exercise 20.3: change "number back queues" to "number of back queues"

CHAPTER 21
p.427, line 6, change "the its states" to "its states"
p.437, line 9, "inclusion of pages points to" should be "inclusion of pages pointing to"

INDEX
p.471, change page list for "Document-partitioned index" to "68, 70, 415"
p.471, delete entry for "Document!partitioning"
p.473, change page list for "Indexes!document-partitioned" to "68, 70, 415"
p.473, change page list for "Indexes!term-partitioned" to "68, 70, 415"
p.480, change page list for "Term-partitioned index" to "68, 70, 415"
p.480, delete entry for "Terms!partitioning"
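
As an illustration of the filtering step added by the p.48 correction in Chapter 3 above, here is a minimal Python sketch of answering a query of the form prefix*suffix. It is not code from the book: the function name match_wildcard and the sample vocabulary are hypothetical, and plain set comprehensions stand in for the B-tree and reverse B-tree walks that produce $W$ and $R$. The length check is one simple way to drop terms in $W\cap R$ whose prefix and suffix overlap.

```python
def match_wildcard(vocabulary, prefix, suffix):
    """Return the terms matching the wildcard query prefix*suffix, sorted.

    Assumption: linear scans replace the B-tree / reverse B-tree lookups.
    """
    # W: terms that start with the prefix.
    w = {t for t in vocabulary if t.startswith(prefix)}
    # R: terms that end with the suffix.
    r = {t for t in vocabulary if t.endswith(suffix)}
    candidates = w & r
    # A term shorter than prefix + suffix can only be in both sets because
    # the two strings overlap inside it, so it does not match prefix*suffix.
    min_len = len(prefix) + len(suffix)
    return sorted(t for t in candidates if len(t) >= min_len)


# Hypothetical vocabulary for the query ba*ba.
vocab = ["ba", "baba", "bamba", "bab", "aba", "banana"]
result = match_wildcard(vocab, "ba", "ba")
print(result)  # "ba" is in both W and R but is filtered out
```

With this vocabulary the term "ba" lies in both sets yet is removed, while "baba" (where * matches the empty string) is kept.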

FRONT MATTER
p.xix-xx: Insert in list of Acknowledgments in the paragraph beginning "We are very grateful ...": Marek Kowalkiewicz, Juha Makkonen, Dell Zhang

CHAPTER 2
p.36, line -4, insert comma after "For example"

CHAPTER 3
p.54, Figure 3.5: line 8 is missing a space before the closing "fi."
p.60, line 2: "candidate mismatches" should be "candidate matches"

CHAPTER 4
p.64, Table 4.2, last line: add the symbol T in math mode ($T$) in the first column. The last line should then read: T tokens 100,000,000
p.66, line -17: Insert space between "SPIMI." and "SPIMI". (The first SPIMI should be set as "SPIMI\@.")
p.66, line -12: The hyphen in "term-docID" needs to be longer. (That is, set this as "term--docID" as was done for other hyphenated expressions.)
p.68, line -10: Replace "shouldn't" with "should not"
p.72, footnote: Replace "ch. 19" with "Chapter 19"

CHAPTER 5
p.81, Figure 5.1 and p.83, Figure 5.2: "least squares fit" vs "least-squares fit". Change the caption of Figure 5.1 on p.81 to include a hyphen between "least" and "squares"
p.89, Figure 5.8, line 3 of caption: Change "Preprend" to "Prepend"
p.90, section heading of 5.3.2: change "Codes" to "codes"
p.96, line 9: change "Codes" to "codes"

CHAPTER 11
p.204, line 5: Change "the cost of retrieval of a relevant document" to "the cost of not retrieving a relevant document"
p.204, eqn (11.7): on both sides of comparison, swap order of terms and join with minus:
    C_0\cdot P(R=0|d) - C_1\cdot P(R=1|d) \le C_0\cdot P(R=0|d') - C_1\cdot P(R=1|d')
p.209, numbered item 3: Replace the last two lines of text starting after "empirically" so it reads: not observed empirically: as might be expected, $p_t$ is shown to rise with $\docf_t$. Based on his data analysis, a plausible proposal would be to use the estimate $p_t =\frac{1}{3} + \frac{2}{3}\docf_t/N$.

CHAPTER 13
p.237, line -3: add a prime sign (or apostrophe) after d in gamma(d). The line should then read: gamma(d')=China, which ...
p.240, line -14: replace "we're" with "we are"
p.240, line -7: replace "occuring" with "occurring"
p.242, line -9: after "Finally," add "assuming that the length of test documents is bounded,"
p.252, table above footnote: The numbers in two table cells are reversed. In upper right corner of the table, replace the number after equals sign with: 27,652. In lower left corner of the table, replace the number after equals sign with: 141. Set the two occurrences of poultry and the two occurrences of export in the same fonts as in the table in Example 13.4 on page 255. That is, set "poultry" in italics and set "export" in textsf (sans serif).
p.252, footnote: replace Equation (13.17) with Equation (13.18)
p.253, formula below line 1: replace the number after the approx sign with: 0.0001105 (that is, there should be two 1s, not one 1)
p.255, table above line -6: The numbers in two table cells are reversed. In upper right corner of table, replace the number after equal sign with 27,652 and the number after approx sign with 27,694.4. In left lower corner of table, replace the number after equal sign with 141 and the number after approx sign with 183.4
p.258, table in Exercise 13.5: Set the four words brazil, council, producers, and roasted using textsf (the same way all terms are set, e.g., the terms london, british, britain in Table 13.7).
p.264, line -14: replace "the two \chi^2 formulas" with "the two X^2 formulas". Use the same symbol as is displayed as the first symbol of the two equations that the exercise refers to (13.18 and 13.19).

CHAPTER 15
p.298, line 11: Change "will be helpful to" to "will be helpful in"
p.314, line -1: Change "Figure ??" to "Table 15.3"
p.315, line -13: Change "Becuase" to "Because"

CHAPTER 16
p.330, line 12: insert "of" after "maximum". The sentence then should read "reaches its maximum of log N"
p.338, line -2: insert "and covariance is spherical" after "distributed". The sentence should then read "If the noise is normally distributed and covariance is spherical, this procedure will result in clusters of spherical shape."

CHAPTER 17
p.348, Figure 17.1: Align the document title "Backtoschool spending is up" with the other document titles. Remove the period from "Ag trade reform." and also align this title with the other document titles.
p.349, Figure 17.2, line 5: change "collects" to "assembles"
p.352, Figure 17.5: Align the document title "Backtoschool spending is up" with the other document titles. Remove the period from "Ag trade reform." and also align this title with the other document titles.
p.361, line 5: change "over" to "on" (sentence should then read: by induction on the number of clusters K)

CHAPTER 19
p.387, first line of Section 19.2: "growth of the web" - replace "web" by "Web"
p.388, lines 10 and 11: "content creation on the web" and "the web contained truth" - replace "web" by "Web"
p.388, bottom line: "as show in" should be "as shown in"

CHAPTER 20
p.412, line 12: change "the designers of Mercator recommend of the order of five attempts" to "Mercator's designers recommend roughly five attempts"
p.415, line 5 of Section 20.3: "also know as local index" should be "also known as local index"
p.426, fourth line from the bottom: "such the visit frequency converges" should be "the visit frequency converges"
p.426, line -3: Insert "a" before "fixed, steady-state quantity".
p.428, beginning of the page: move the footnote 2 to be at the end of the sentence after "steady state." (so it's not confused with the math material).
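
For readers who want to try the estimate given in the p.209 correction (Chapter 11) above, $p_t =\frac{1}{3} + \frac{2}{3}\docf_t/N$, here is a small Python sketch. The function name estimate_p_t, its parameter names, and the input checks are illustrative assumptions, not part of the book; only the formula itself comes from the erratum.

```python
def estimate_p_t(df_t, n_docs):
    """Estimate p_t = 1/3 + (2/3) * df_t / N.

    df_t   -- document frequency of term t (number of documents containing t)
    n_docs -- N, the total number of documents in the collection
    The range checks below are an added assumption for safety.
    """
    if n_docs <= 0:
        raise ValueError("n_docs must be positive")
    if not 0 <= df_t <= n_docs:
        raise ValueError("df_t must lie between 0 and n_docs")
    return 1.0 / 3.0 + (2.0 / 3.0) * df_t / n_docs


# The estimate rises with document frequency, from 1/3 up to 1.
for df in (0, 250, 500, 1000):
    print(df, round(estimate_p_t(df, 1000), 4))
```

The estimate is linear in the document frequency, so it stays between 1/3 (a term that occurs in no document) and 1 (a term that occurs in every document).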