NutchBean.getContent() and NutchBean.getParseData() do this, but require
a HitDetails instance. In the non-distributed case, the only required
field of the HitDetails for these calls is "url". In the distributed
case, the "segment" field must also be provided, so that the request can
be routed to a node serving that segment. These are implemented by
FetchedSegments.java and DistributedSearch.java.
Doug
Dennis Kubes wrote:
Can somebody direct me on how to get the stored text and parse metadata
for a given url?
Dennis
- Re: How to get Text and Parse data for URL Doug Cutting
-