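We want to find a query vector, denoted as $\vec{q}$, that maximizes
similarity with relevant documents while minimizing similarity with
nonrelevant documents. If $C_r$ is the set of relevant documents and
$C_{nr}$ is the set of nonrelevant documents, then we wish to find:

\[
\vec{q}_{opt} = \operatorname*{arg\,max}_{\vec{q}} \left[ \mathrm{sim}(\vec{q}, C_r) - \mathrm{sim}(\vec{q}, C_{nr}) \right] \tag{47}
\]

where $\mathrm{sim}$ is defined as in Equation 24.
Under cosine similarity, the optimal query vector $\vec{q}_{opt}$ for separating the relevant and nonrelevant documents is:

\[
\vec{q}_{opt} = \frac{1}{|C_r|} \sum_{\vec{d}_j \in C_r} \vec{d}_j - \frac{1}{|C_{nr}|} \sum_{\vec{d}_j \in C_{nr}} \vec{d}_j \tag{48}
\]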
That is, the optimal query is the vector difference between
the centroids of the relevant and nonrelevant documents; see Figure 9.3.
However, this observation is not terribly useful, precisely because the
full set of relevant documents is not known: it is what we want to find.
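
As a minimal sketch of the centroid difference described above, the following Python computes it for a toy collection. The function names `centroid` and `optimal_query` and the two-term document vectors are assumptions made for illustration; they do not come from the text, and the sketch presumes that the complete relevant and nonrelevant sets are known, which is exactly what is unavailable in practice.

```python
def centroid(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(component) / n for component in zip(*vectors)]


def optimal_query(relevant, nonrelevant):
    """Difference between the relevant and nonrelevant centroids."""
    mu_r = centroid(relevant)
    mu_nr = centroid(nonrelevant)
    return [r - nr for r, nr in zip(mu_r, mu_nr)]


# Hypothetical two-term document vectors, for illustration only.
relevant_docs = [[1.0, 3.0], [3.0, 1.0]]
nonrelevant_docs = [[0.0, 1.0], [2.0, 1.0]]

q_opt = optimal_query(relevant_docs, nonrelevant_docs)
print(q_opt)  # [1.0, 1.0]
```

Here the relevant centroid is (2, 2) and the nonrelevant centroid is (1, 1), so the resulting query vector points from the nonrelevant centroid toward the relevant one.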
