I'm attempting to search for a quoted phrase within a crawled document
and having trouble getting Nutch to pass the quotes through. As an
example say I am looking for a song title on a page, So Says I. I
only want to return results where the title was quoted within the
page, "So Says I".
Query.parse("\"So Says I\"", conf)
results in a querystring of
"So Says I".
This will be interpreted as a query for the phrase, not a query for
the quoted phrase.
Looking at Lucene documentation, it looks like to search for quotes
they simply need to be escaped \. So I think my desired querystring is:
\"So Says I\" or "\"So Says I\""
However, I've been unable to get this out of Query.parse
Query.parse("\\\"So Says I\\\"",conf)
results in a query of
"So Says I"
Query.parse("\"\\\"So Says I\\\"\"", conf)
results in
So Says I
Anyone know how to get this done?
Thanks.
Brent