At first I did not pass parameters to the exts:getDatastreamText
I did it now. Still no OCR text content if OUT.txt fields.

Serhiy


On Mon, Nov 21, 2011 at 2:27 AM, Serhiy Polyakov <sp0...@gmail.com> wrote:
> Hello,
>
> I want to use command line to process exported Fedora object using
> foxmlToSolr.xslt stylesheet. I need to see the resulting document that
> will be used by solr/conf/schema.xml to create index.
>
> Object's Foxml includes inline DC datastream and managed (external)
> OCR datastream that contains text/plain. Foxml includes reference to
> OCR datastream on the local server like
> http://localhost:8080/fedora/get/... I pointed browser to the OCR
> datastream reference and I see the text there. My FedoraGSearch
> indexed DC and OCR alright as a part of regular workflow so
> foxmlToSolr.xslt must be correct.
>
> However I need to do transformation from command line for the
> analysts. I downloaded Xalan and run:
>
> java -cp dk/defxws/fedoragsearch/server:path/to/xalan/*:
> org.apache.xalan.xslt.Process -in <SOURCE.xml> -xsl foxmlToSolr.xslt
> -out <OUT.txt>
>
> Here is excerpt from OUT.txt
> <field name=”dc.title”>My Title</field>
> <field name=”dsm.OCR”/>
>
> So it is not grabbing managed content (OCR in my case).
>
> foxmlToSolr.xslt includes external function definition and I believe
> is using it for managed content:
> ======
> …
> xmlns:exts="xalan://dk.defxws.fedoragsearch.server.GenericOperationsImpl"
> …
> xsl:value-of select="exts:getDatastreamText($PID, $REPOSITORYNAME,
> @ID, $FEDORASOAP, $FEDORAUSER, $FEDORAPASS, $TRUSTSTOREPATH,
> $TRUSTSTOREPASS)"/>
> …
> =====
>
> Could somebody suggest me if this is at all possible to get managed
> content into the output when I am doing command line processing.
> Again, managed content is getting to the index as part of regular
> FedoraGSearch workflow with the same foxmlToSolr.xslt.
>
> Thanks,
> Serhiy
>

------------------------------------------------------------------------------
All the data continuously generated in your IT infrastructure 
contains a definitive record of customers, application performance, 
security threats, fraudulent activity, and more. Splunk takes this 
data and makes sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-novd2d
_______________________________________________
Fedora-commons-users mailing list
Fedora-commons-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/fedora-commons-users

Reply via email to