I think it's the query string exclusion in files conf/regex-urlfilter.txt or conf/crawl-urlfilter.txt:

FIND:
# skip URLs containing certain characters as probable queries, etc.
-[...@=]

REPLACE:
# skip URLs containing certain characters as probable queries, etc.
# -[...@=]

OR CHANGE:
# -[...@=]
-...@]


Am 24.08.2010 02:50, schrieb Israel:
Hello volley. please help me one more time, i want to crawl this page, but
don't generate nothing...is posible?

http://uc.princeton.edu/main/index.php?option=com_vodcast&view=feed&format=raw
...

Reply via email to