Thank you very much, Kingsley. I just checked my cartridges and noticed
that I don't have an "HTML (and variants)" option. Is it called
something else, or is it definitely called "HTML (and variants)"?

Also, in Conductor I navigated to "Linked Data" > "Sponger" and do not
see an HTML cartridge to configure. However, I do see RDFa, RDFa (no
translation), and xHTML in the list. Were you referring to one of these,
or am I somehow missing the HTML cartridge?

If I'm missing it, is there a way to add it?


Also, while in the Sponger section, I clicked on the "Entity URIs" tab
just to see what was there and received the following error page:

Error S0002

SQ200: No table DB.DBA.RDF_ENTITY_URI_CARTRIDGE_MODES

My Virtuoso is a clean install on Debian+Ubuntu from a week ago. I'm
running 07.20.3214.


------------------------------

Message: 3
Date: Thu, 15 Oct 2015 12:45:14 -0400
From: Kingsley Idehen <kide...@openlinksw.com>
Subject: Re: [Virtuoso-users] RDF Mapper Options in Conductor
To: virtuoso-users@lists.sourceforge.net
Message-ID: <561fd81a.7090...@openlinksw.com>
Content-Type: text/plain; charset="windows-1252"

On 10/15/15 12:09 PM, Haag, Jason wrote:
> Hi All,
>
> Rather than wait for someone to tell me the right options based on all
> of the different options that are available for a crawl job targeting
> RDFa I will post which options I selected. Perhaps someone can tell me
> what I'm missing or did wrong. I'm running Version: 07.20.3214.
>
> Here is what I entered under "Web Application Server > Content Imports":
>
> Target Description: ADL Verbs (RDFa/HTML)
> Target URL: http://xapi.vocab.pub/datasets/adl/verbs/index.html

http://xapi.vocab.pub/datasets/adl/verbs/ -- if you want everything.


> Copy to Local DAV collection: DAV/home/dba/rdf_sink
/DAV/home/dba/rdf_sink/

> Number of redirects to follow: 1
> Update interval: 10
> Checked the following:
> X Run Sponger
> X Store Metadata
>
> Cartridges Selected:
> X RDFa

Select HTML (and variants) -- but note that via the "Linked Data" menu's
"Sponger" section you need to go to the "Extractor Cartridges" section to
select and configure the HTML cartridge with the following:

add-html-meta=yes
get-feeds=no
preview-length=512
fallback-mode=no
rdfa=yes
reify_html5md=0
reify_rdfa=0
reify_jsonld=0
reify_all_grddl=0
reify_html=0
passthrough_mode=yes
loose=yes
reify_html_misc=no
reify_turtle=no

I know this seems awkward, but this is the best solution we could come
up with due to the problems posed by text/html content-type overloading
(HTML+Microdata, RDFa, etc.).
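
The option list above is a plain set of key=value pairs, one per line.
As a convenience for checking what has been entered against what is
listed here, a minimal Python sketch follows. It is not part of
Virtuoso or Conductor; the function name parse_cartridge_options is
hypothetical, and it only assumes the one-pair-per-line layout shown
above.

```python
def parse_cartridge_options(text):
    """Parse 'key=value' lines into a dict, skipping blank lines."""
    options = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, sep, value = line.partition("=")
        if not sep:
            raise ValueError(f"not a key=value line: {line!r}")
        options[key.strip()] = value.strip()
    return options


# The options exactly as listed in this message.
HTML_CARTRIDGE_OPTIONS = """
add-html-meta=yes
get-feeds=no
preview-length=512
fallback-mode=no
rdfa=yes
reify_html5md=0
reify_rdfa=0
reify_jsonld=0
reify_all_grddl=0
reify_html=0
passthrough_mode=yes
loose=yes
reify_html_misc=no
reify_turtle=no
"""

options = parse_cartridge_options(HTML_CARTRIDGE_OPTIONS)
```

Comparing this dictionary with one parsed from the text pasted into the
cartridge's options field would show any missing or mistyped keys.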


>
> After I created the crawl job, I went to "import queues" and clicked "run"
>
> I received the following message:
>
> Results for xapi.vocab.pub
> errors while retrieving target. Select "reset" to return initial state
> Total URLs processed : 1
> Download finished
>
> I also checked "retrieved sites" and 0/1 were downloaded.
>
> Where do I find out the error that was encountered while retrieving
> target? Thanks!

Click on the "Edit" button aligned with your crawler job.

Another bit of quirky UI to be fixed.
>
> Also, I'm not sure if this is a bug, but I noticed when I specify the
> Local DAV collection of DAV/home/dba/rdf_sink/ and then go to edit the
> job, it changes to /DAV/DAV/home/dba/rdf_sink/ by adding a new
> directory 'DAV'

Your initial value should have been: /DAV/home/dba/rdf_sink/.

We are going to look into some of these quirks, in due course.
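
Until then, one way to avoid the doubled /DAV/DAV/ prefix is to
normalize the collection path before entering it. The sketch below is
only an illustration: normalize_dav_path is a hypothetical helper, not
a Virtuoso function, and it assumes the expected form is an absolute
path under /DAV/ with a trailing slash, as in the value given above.

```python
def normalize_dav_path(path):
    """Return an absolute DAV collection path: one leading /DAV/, trailing slash."""
    # Split on "/" and drop empty segments caused by leading, trailing,
    # or repeated slashes.
    parts = [p for p in path.strip().split("/") if p]
    # Add the DAV root only when it is not already the first segment.
    if not parts or parts[0] != "DAV":
        parts.insert(0, "DAV")
    return "/" + "/".join(parts) + "/"
```

With this helper, DAV/home/dba/rdf_sink, /home/dba/rdf_sink/ and
/DAV/home/dba/rdf_sink/ all come out as /DAV/home/dba/rdf_sink/.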

>
> If I use /home/dba/rdf_sink/ as the local path when creating the
> crawl job it won't let me without adding /DAV/ path in front of it. So
> it seems it is creating a sub directory /DAV/ under /DAV/ when it
> shouldn't be.

See my comment above.

Kingsley

------------------------------------------------------------------------------
_______________________________________________
Virtuoso-users mailing list
Virtuoso-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/virtuoso-users
