Piotr,
I will give it a try and see what happens. If it fails, I'll refetch.
Thanks.
-lucas
On Aug 25, 2005, at 12:32 PM, Piotr Kosiorowski wrote:
You can try it out but I think parsing separately expects some
directories in segment have different names than you have after
standard fetch with parsing.
Regards
Piotr
Lucas Rockwell wrote:
Hi Piotr,
Thanks for the response.
So, I can't use:
bin/nutch parse <segment directory>
and then reindex?
-lucas
On Aug 25, 2005, at 11:28 AM, Piotr Kosiorowski wrote:
As I understand if you had parse-pdf disabled you have to reparse
(snd then reindex) segments. There is no standard way to do it (I
think it might be done with some tricks). The easiest way would be
to refetch it with pdf parsing enabled.
Piotr
Lucas Rockwell wrote:
Hi all,
I have enabled the parse-pdf and index-more plugins and reindexed
my segments and then enabled those plus the query-more plugin in my
front-end application and when I do a query I still can not get at
the contents of the PDFs in the index. And even when I search for
"pdf" -- which gets me all PDF files because of the url -- and use
FetchedSegments.getSummary() there is nothing there. Any idea what
I am doing wrong?
Thanks.
-lucas