Hello everyone, I just wanted to know if the following features are present in nutch.. 1) postscript file parsing support 2) MS excel file parsing support 3) whether search based on file type (pdf,ps,xls,ppt,doc..etc) can be given as a query (similar to filetype: in google)..if yes what syntax should be used. 4) whether search within the url like, the word research in urls like www.ibm.com/research or www.research.ibm.com can be given as a query (similar to inurl: in google). If yes what syntax should be used ? 5) whether search within the page title can be done (similar to intitle: or allintitle: in google). If yes what syntax should be used ?
i would highly appreciate your answers to these questions thanks in advance, regards, Rohit
