Hi Serhiy,

I think dk.defxws.fedoragsearch.server.GenericOperationsImpl and its related
classes are missing from the classpath when you run from the command line.
Let me know how it goes.
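
As a rough sketch of how the classpath could be checked: a Java classpath entry has to be the directory that contains the top-level dk directory (or a jar holding it), not the package directory dk/defxws/fedoragsearch/server itself. The helper names build_classpath and has_extension_class are made up for illustration, and every path is a placeholder to be replaced with the locations from your own GSearch installation. The lib directory is an assumption too, on the expectation that GenericOperationsImpl needs its own dependencies at run time.

```shell
#!/bin/sh
# Sketch only: all paths are placeholders, not real install locations.

# Join the classes root and the two jar directories into one classpath string.
build_classpath() {
    # $1 = directory that CONTAINS the top-level "dk" package directory
    # $2 = directory holding the jars the GSearch classes depend on (assumed)
    # $3 = directory holding the Xalan jars
    printf '%s:%s/*:%s/*' "$1" "$2" "$3"
}

# Succeed only if the extension class file sits under the given classes root.
has_extension_class() {
    [ -f "$1/dk/defxws/fedoragsearch/server/GenericOperationsImpl.class" ]
}

# Usage (uncomment and adjust the placeholder paths):
# CLASSES=/path/to/gsearch/classes
# has_extension_class "$CLASSES" || echo "GenericOperationsImpl not found under $CLASSES" >&2
# CP=$(build_classpath "$CLASSES" /path/to/gsearch/lib /path/to/xalan)
# java -cp "$CP" org.apache.xalan.xslt.Process -in SOURCE.xml -xsl foxmlToSolr.xslt -out OUT.txt
```

If has_extension_class fails for the directory you pass, the exts: namespace in the stylesheet most likely cannot resolve the class, which would be consistent with an empty dsm.OCR field.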

-Gert


On 21/11/2011, at 10.04, Serhiy Polyakov wrote:

> At first I did not pass parameters to exts:getDatastreamText.
> I have done so now, but there is still no OCR text content in the
> OUT.txt fields.
> 
> Serhiy
> 
> 
> On Mon, Nov 21, 2011 at 2:27 AM, Serhiy Polyakov <sp0...@gmail.com> wrote:
>> Hello,
>> 
>> I want to use the command line to process an exported Fedora object
>> with the foxmlToSolr.xslt stylesheet. I need to see the resulting
>> document that solr/conf/schema.xml will use to create the index.
>> 
>> The object's FOXML includes an inline DC datastream and a managed
>> (external) OCR datastream that contains text/plain. The FOXML includes
>> a reference to the OCR datastream on the local server, like
>> http://localhost:8080/fedora/get/... When I point a browser at the OCR
>> datastream reference, I see the text there. My FedoraGSearch indexed
>> DC and OCR correctly as part of the regular workflow, so
>> foxmlToSolr.xslt must be correct.
>> 
>> However, I need to run the transformation from the command line for
>> the analysts. I downloaded Xalan and ran:
>> 
>> java -cp dk/defxws/fedoragsearch/server:path/to/xalan/*:
>> org.apache.xalan.xslt.Process -in <SOURCE.xml> -xsl foxmlToSolr.xslt
>> -out <OUT.txt>
>> 
>> Here is an excerpt from OUT.txt:
>> <field name="dc.title">My Title</field>
>> <field name="dsm.OCR"/>
>> 
>> So it is not picking up the managed content (OCR in my case).
>> 
>> foxmlToSolr.xslt declares an external function and, I believe, uses
>> it for managed content:
>> ======
>> …
>> xmlns:exts="xalan://dk.defxws.fedoragsearch.server.GenericOperationsImpl"
>> …
>> <xsl:value-of select="exts:getDatastreamText($PID, $REPOSITORYNAME,
>> @ID, $FEDORASOAP, $FEDORAUSER, $FEDORAPASS, $TRUSTSTOREPATH,
>> $TRUSTSTOREPASS)"/>
>> …
>> =====
>> 
>> Could somebody tell me whether it is possible at all to get managed
>> content into the output when processing from the command line?
>> Again, managed content does reach the index as part of the regular
>> FedoraGSearch workflow with the same foxmlToSolr.xslt.
>> 
>> Thanks,
>> Serhiy
>> 
> 
> _______________________________________________
> Fedora-commons-users mailing list
> Fedora-commons-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/fedora-commons-users


