I have two crawl scripts with different parameter sets (depth, URLs,
etc.) whose output I need to merge. I
have successfully used the SegmentMergeTool to create a combined
segment. However, this new merged segment does not appear in the
directory structure with an index as I expected. Each of the segments
that I am merging has the following directory structure:

crawlDir
    /segments
    /index
    /db

The SegmentMergeTool simply produces another segment directory named
with a timestamp (/20050216...). I see that SegmentMergeTool has an
index flag (-i), but that seems only to create an index inside the
segment directory.

What do I do with this new segment to create a top-level index for it?
Do I even need to do this? I have yet to figure out a way for my code
to access this newly merged segment. What am I missing?

--
Kyle Gabhart
Sr Prin Info Sys Architect
L-3 Communications Link Simulation and Training
