Hi, I am Apache Nutch newbie and found found it very interesting the way it is designed and implemented. I would like to incorporate nutch within our middleware which moreover caters Grid Computing users. Well Grid Computing allows accessibility of several resources which are dispersed geographically and can be accessed seamlessly. This middleware is web services based, therefore accessing of resources can be achieved by invoking web services operations. We have an immediate requirement of developing information service which gathers all the meta-data about resources and provide this up-to-date information to brokers (a piece of software) via standard interface (i.e. API, WS).
Therefore in the context of information services, is it possible to query apache nutch via any query language (e.g. XQuery most recommended for our scenario) since all the information which is gathered from different resource sites are in XML (can be abide by xyz xml schema) format or can it be extensible? Other question is to impose restriction on the access of information for example if user A searches for the resources in Grid then she will only be allowed to see the information which she is allowed to access? Thanks in advance for any help, Shiraz Memon
