Hi Peter,

This issue with the Virtuoso Crawler has been reproduced and is scheduled to be 
fixed in the next release. A short-term workaround is to query the 
Virtuoso SPARQL endpoint (/sparql) with the "Retrieve remote RDF data for all 
missing source graphs" option (get:soft pragma) set, for example:

        
        http://demo.openlinksw.com/sparql?default-graph-uri=http%3A%2F%2Flod.geospecies.org%2Findex.rdf&should-sponge=soft&query=select+*+where+{%3Fs+%3Fp+%3Fo}&format=text%2Fhtml&debug=on
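
If you would rather build that request from a script than by hand, the following is a minimal Python sketch. It uses only the parameter names that appear in the URL above (default-graph-uri, should-sponge, query, format, debug). The helper name build_sponge_url is hypothetical and only for illustration, and the percent-encoding produced by urlencode is slightly stricter than in the hand-written URL (for example, "*" and the braces are escaped), which the endpoint should decode to the same values.

```python
from urllib.parse import urlencode


def build_sponge_url(endpoint, graph_uri, query, sponge="soft"):
    """Build a SPARQL endpoint URL that asks Virtuoso to fetch missing graphs.

    build_sponge_url is a hypothetical helper; the parameter names are
    copied from the example URL in this message.
    """
    params = [
        ("default-graph-uri", graph_uri),  # graph to query (and sponge if missing)
        ("should-sponge", sponge),         # "soft" = retrieve missing source graphs
        ("query", query),                  # the SPARQL query text
        ("format", "text/html"),           # result format requested in the example
        ("debug", "on"),
    ]
    # urlencode percent-encodes each value and joins the pairs with "&"
    return endpoint + "?" + urlencode(params)


url = build_sponge_url(
    "http://demo.openlinksw.com/sparql",
    "http://lod.geospecies.org/index.rdf",
    "select * where {?s ?p ?o}",
)
print(url)
```

The printed URL can be opened in a browser or fetched with any HTTP client. Pointing the endpoint argument at your own Virtuoso instance's /sparql should work the same way, assuming sponging is permitted for the account the endpoint runs under.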

Alternatively, use other Virtuoso pragma options to tailor your query as required. 
Further details on IRI de-referencing and use of pragmas can be obtained from:

        http://docs.openlinksw.com/virtuoso/rdfiridereferencing.html

Best Regards
Hugh Williams
Professional Services
OpenLink Software
Web: http://www.openlinksw.com
Support: http://support.openlinksw.com
Forums: http://boards.openlinksw.com/support
Twitter: http://twitter.com/OpenLink

On 28 Dec 2009, at 18:16, Peter DeVries wrote:

> Hi!
> 
> I have installed the latest Virtuoso open source and I am having trouble 
> getting it to crawl my data set.
> 
> The crawler downloads the first nine pages but then stops.
> 
> This happens when the target is http://lod.geospecies.org/ or 
> http://lod.geospecies.org/index.rdf.
> 
> The same dataset (rdf pages) can be successfully crawled with Elmo.
> 
> I think that this has something to do with a preference for crawling the 
> .xhtml pages rather than the rdf pages.
> 
> Is there something I should be including in the crawler input screen?
> 
> 
> Thanks!
> 
> - Pete
> 
> 
> 
> ----------------------------------------------------------------
> Pete DeVries
> Department of Entomology
> University of Wisconsin - Madison
> 445 Russell Laboratories
> 1630 Linden Drive
> Madison, WI 53706
> GeoSpecies Knowledge Base 
> About the GeoSpecies Knowledge Base
> ------------------------------------------------------------
> _______________________________________________
> Virtuoso-users mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/virtuoso-users
