Hi Amna, You could write a filter extending the HtmlParseFilter class for this, you get the contents and check for the presence of the unicode characters and then return the ParseResult accordingly.
I am not sure if there is a command for it as such. Experts here, please correct me if I am wrong. ./Abi On Wed, Feb 9, 2011 at 6:31 PM, Amna Waqar <[email protected]> wrote: > hi all, > I want to delete the fetched pages stored in the segment based on its > content processing like if contains unicode characters(0x600 to 0x6FF) then > it should not be stored in the segement..How can i do this in Fetcher.java > where Content content = output.getContent(); (content of the page has > been > fetched) > I need some command to delete that page before storing it in segment. > > Please help me > Thanks > Amna Waqar >

