Re: Retrieving the term vectors of a document in Nutch

2012-01-10 Thread atcach
Hi House, I had the same problem and tried the same solution, but I am getting an empty termDocs. How did you do it? My code is: TermDocs td = ir.termDocs(); // First I store them in a temporary, because I cannot put them in an array since I do not know the count
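
The code comment in the message above describes buffering results in a temporary collection because the number of matches is not known in advance. A minimal sketch of that pattern follows, in plain Java. `DocCursor`, `cursorOver`, and `collectDocs` are hypothetical names made up for illustration; `DocCursor` only imitates the `next()`/`doc()` shape of a Lucene-style enumerator and is not the real `TermDocs` interface. Separately, and assuming the Lucene 2.x/3.x API, the no-argument `IndexReader.termDocs()` is documented as returning an unpositioned enumerator that has to be positioned with `seek(...)` before iterating, which may be one reason for an empty result.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class TermDocsSketch {
    // Hypothetical stand-in for a Lucene-style cursor; NOT the real TermDocs interface.
    public interface DocCursor {
        boolean next(); // advance to the next entry; false when exhausted
        int doc();      // document number of the current entry
    }

    // In-memory cursor over fixed document numbers, used only for this demo.
    public static DocCursor cursorOver(final int... docs) {
        return new DocCursor() {
            private int pos = -1;
            public boolean next() { return ++pos < docs.length; }
            public int doc() { return docs[pos]; }
        };
    }

    // Buffer the entries in a list first, then copy them into an array
    // once the total count is known.
    public static int[] collectDocs(DocCursor cursor) {
        List<Integer> temp = new ArrayList<Integer>();
        while (cursor.next()) {
            temp.add(cursor.doc());
        }
        int[] out = new int[temp.size()];
        for (int i = 0; i < out.length; i++) {
            out[i] = temp.get(i);
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(Arrays.toString(collectDocs(cursorOver(3, 7, 42))));
        System.out.println(collectDocs(cursorOver()).length);
    }
}
```

An empty cursor simply yields a zero-length array, so the caller can tell "no matches" apart from an error without special handling.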

Re: Retrieving the term vectors of a document in Nutch

2009-06-08 Thread House Less
Hello Grant,

> I'd ask on the nutch-u...@lucene.apache.org mailing list. While Lucene can do all of these things, it is not clear how Nutch exposes, if at all, any of this information. You should be able to get results there.

Thanks, I'll be sure to ask them.

> Note, however, t

Re: Retrieving the term vectors of a document in Nutch

2009-06-08 Thread Grant Ingersoll
I'd ask on the nutch-u...@lucene.apache.org mailing list. While Lucene can do all of these things, it is not clear how Nutch exposes, if at all, any of this information. You should be able to get results there. Note, however, that Term Vecs must be created during indexing by creating th

Re: Retrieving the term vectors of a document in Nutch

2009-06-07 Thread House Less
In retrospect, pardon my stupidity: surely it cannot be right that the term frequency vector for a page is not present within Nutch, for it needs this to compute the score for a page given a query. I would appreciate it if you would tell me where I may find it given a document number. Thank you
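
For readers unfamiliar with the term, a term frequency vector maps each term in a document to the number of times it occurs there. The following is a minimal plain-Java sketch of that idea only. It is not how Nutch or Lucene stores or exposes this information, and the class name, method name, and the simple lowercase-and-split tokenization are assumptions made for illustration.

```java
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

public class TermFrequencySketch {
    // Count how often each term occurs in the text.
    // Tokenization here is a naive assumption: lowercase, split on non-word characters.
    public static Map<String, Integer> termFrequencies(String text) {
        Map<String, Integer> freqs = new TreeMap<String, Integer>();
        for (String token : text.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (token.isEmpty()) {
                continue; // split can yield an empty leading token
            }
            Integer count = freqs.get(token);
            freqs.put(token, count == null ? 1 : count + 1);
        }
        return freqs;
    }

    public static void main(String[] args) {
        System.out.println(termFrequencies("Nutch crawls pages; Lucene indexes pages."));
    }
}
```

Run on the sample sentence, this prints `{crawls=1, indexes=1, lucene=1, nutch=1, pages=2}`; the `TreeMap` keeps the terms in sorted order.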