hello everyone, i m very new to nutch.i m using nutch 0.7.2 version.Im using nutch to crawl the url's in Unix OS.
its working fine.Im having the crawled data as output. now i need to modify the code for my requirement.. for example i want to write a java program where i need to use the pruneIndexTool in my program.. when i import the statements of the nutch package into my sample java program,say for example "import org.apache.lucene.search.Query;", its showing me compilation errors... i thought importing the packages are similar to that of importing "import java.io.*".. i copied the sample code from nabble and when i tried tocompile,it is throwing errors as "package import org.apache.lucene.search.Query does not exist "(All the nutch packages are not existing..i got 57 errors when i compiled a sample program!!!!) i jus copied the code and compiled as how i compile a normal HelloWorld program.. am i compiling it correctly? if not please tell me the correct way to compile the code... is there any way to compile the code importing the nutch packages or nutch classes...? these are some of the nutch packages used in the program... import org.apache.nutch.io.UTF8; import org.apache.nutch.util.LogFormatter; import org.apache.lucene.search.Query; import org.apache.lucene.store.Directory; import org.apache.lucene.store.FSDirectory; Thanks in Advance, Franklin.S Siddharth Jonathan wrote: > > Do you mean how do you get rid of some of the fields that are indexed by > default? eg. content, anchor text etc. > > Jonathan > On 4/2/07, Ratnesh,V2Solutions India > wrote: >> >> >> Hi, >> I have written a plugin , which finds no. of Object tags in a html and >> corresponding urls. >> I am storing "objects" as fields and page url as values. >> >> And finally interested in seeing the search realted with "objects" >> indexed >> fields not those which is already stored as indexed fields. >> >> So how shall I delete those index fields which is already stored???? >> >> Looking forward towards your reply(Valuable >> inputs)......................... >> >> Thnx to Nutch Community >> -- >> View this message in context: >> http://www.nabble.com/How-to-delete-already-stored-indexed-fields----tf3504164.html#a9786377 >> Sent from the Nutch - User mailing list archive at Nabble.com. >> >> > > -- View this message in context: http://www.nabble.com/How-to-delete-already-stored-indexed-fields----tf3504164.html#a10115413 Sent from the Nutch - User mailing list archive at Nabble.com.
