Try doing a single word search, instead of a phrase. I once had a similar problem when I indexed using Field.setOmitTf(true) which removed all the positional information from index, which is required to do phrase searches.

-siraj

Erick Erickson wrote:
The first thing I'd do is get a copy of Luke and look in my index
to see exactly what's there. Nothing in your e-mails indicates that you
*should* get any hits. Although I admin not getting jakarta lucene in
50M pages seems unlikely.

But Ian's suggestion that you start with a smaller index is spot on.

Best
Erick

On Thu, Jul 16, 2009 at 8:42 AM, prashant ullegaddi <
prashullega...@gmail.com> wrote:

50 million HTML pages (part of clueweb09 dataset for TREC) were indexed
using Hadoop into 56 indexes. 56 indexes were merged into a single index.
Analyzer is the StandardAnalyzer.



On Thu, Jul 16, 2009 at 6:07 PM, Anshum <ansh...@gmail.com> wrote:

Hi Prashant,

What did you index? how did you index? what analyzer did you use? without
all of these, perhaps it'd be difficult to figure out the issue.

--
Anshum Gupta
Naukri Labs!
http://ai-cafe.blogspot.com

The facts expressed here belong to everybody, the opinions to me. The
distinction is yours to draw............


On Thu, Jul 16, 2009 at 6:04 PM, prashant ullegaddi <
prashullega...@gmail.com> wrote:

Hi,

I tried searching:
"Apache Jakarta"~10

Nothing was returned. What might be wrong?

Regards,
Prashant.



Reply via email to