Hi Sir , By Parent URL , i mean the page the PDF document is linked from .
In other words , the name of website where the PDF is present in the site Example : I am crawling multiple pdf from multiple websites . I just wanted to index the respective website name along with each pdf crawled from respective websites. Thanks, Uma -- Sent from: http://lucene.472066.n3.nabble.com/Nutch-User-f603147.html

