I'm trying to search for a quoted phrase within a crawled document and am having trouble getting Nutch to pass the quotes through. For example, say I am looking for a song title on a page, So Says I. I only want to return results where the title appears in quotes on the page: "So Says I".

Query.parse("\"So Says I\"", conf)

results in a querystring of

"So Says I".

This will be interpreted as a query for the phrase, not a query for the quoted phrase.

Looking at the Lucene documentation, it looks like quotes that should be matched literally simply need to be escaped with a backslash (\). So I think my desired querystring is:

\"So Says I\" or "\"So Says I\""

However, I've been unable to get either of these out of Query.parse:

Query.parse("\\\"So Says I\\\"",conf)

results in a query of

"So Says I"

Query.parse("\"\\\"So Says I\\\"\"", conf)

results in

So Says I
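
To rule out a Java string-escaping mistake on my side, here is a minimal sketch that just prints what each of the three literals above contains before Nutch ever sees it. It does not call Nutch at all, and the class name EscapeCheck and the method literals() are made up for this check.

```java
// Standalone check of the Java string literals used in the Query.parse calls.
// EscapeCheck and literals() are hypothetical names, not part of Nutch.
class EscapeCheck {

    // The three arguments passed to Query.parse, in the order tried above.
    static String[] literals() {
        return new String[] {
            "\"So Says I\"",          // contains: "So Says I"
            "\\\"So Says I\\\"",      // contains: \"So Says I\"
            "\"\\\"So Says I\\\"\""   // contains: "\"So Says I\""
        };
    }

    public static void main(String[] args) {
        // Print each string exactly as the parser would receive it.
        for (String s : literals()) {
            System.out.println(s);
        }
    }
}
```

So the second and third strings do reach Query.parse with the backslashes intact, which suggests (as far as I can tell) that the backslashes are being dropped somewhere inside the Nutch query parsing rather than by Java.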

Anyone know how to get this done?

Thanks.

Brent
