Hi Serhiy,

I think you are missing dk.defxws.fedoragsearch.server.GenericOperationsImpl and related classes from the classpath when you run from the command line. Let me know how it goes.
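For illustration, a classpath along the following lines might work. This is only a sketch under assumptions: GSEARCH_WEBINF and XALAN_HOME are placeholder locations, and it assumes the GSearch classes are unpacked under WEB-INF/classes with their dependency jars in WEB-INF/lib. The point is that a class directory on the classpath has to be the root that contains dk/, not the dk/defxws/fedoragsearch/server directory itself.

```shell
#!/bin/sh
# Placeholder locations (assumptions): point these at your own installation.
GSEARCH_WEBINF="${GSEARCH_WEBINF:-/path/to/fedoragsearch/WEB-INF}"
XALAN_HOME="${XALAN_HOME:-/path/to/xalan}"

# Classpath entries:
#   classes/  root of the package tree, so dk/defxws/... can be resolved
#   lib/*     jars that the GSearch classes depend on
#   xalan/*   Xalan and its companion jars
CP="$GSEARCH_WEBINF/classes:$GSEARCH_WEBINF/lib/*:$XALAN_HOME/*"

# Print the command for review; drop the leading "echo" to actually run it.
echo java -cp "$CP" org.apache.xalan.xslt.Process \
  -in SOURCE.xml -xsl foxmlToSolr.xslt -out OUT.txt
```

The classpath is quoted so that the shell leaves the `*` entries alone and the JVM expands them to the jars in each directory.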
-Gert

On 21/11/2011, at 10.04, Serhiy Polyakov wrote:

> At first I did not pass parameters to exts:getDatastreamText.
> I did it now. Still no OCR text content in the OUT.txt fields.
>
> Serhiy
>
>
> On Mon, Nov 21, 2011 at 2:27 AM, Serhiy Polyakov <sp0...@gmail.com> wrote:
>> Hello,
>>
>> I want to use the command line to process an exported Fedora object using
>> the foxmlToSolr.xslt stylesheet. I need to see the resulting document that
>> will be used by solr/conf/schema.xml to create the index.
>>
>> The object's FOXML includes an inline DC datastream and a managed (external)
>> OCR datastream that contains text/plain. The FOXML includes a reference to
>> the OCR datastream on the local server like
>> http://localhost:8080/fedora/get/... I pointed a browser to the OCR
>> datastream reference and I see the text there. My FedoraGSearch
>> indexed DC and OCR all right as part of the regular workflow, so
>> foxmlToSolr.xslt must be correct.
>>
>> However, I need to do the transformation from the command line for the
>> analysts. I downloaded Xalan and ran:
>>
>> java -cp dk/defxws/fedoragsearch/server:path/to/xalan/*:
>> org.apache.xalan.xslt.Process -in <SOURCE.xml> -xsl foxmlToSolr.xslt
>> -out <OUT.txt>
>>
>> Here is an excerpt from OUT.txt:
>> <field name="dc.title">My Title</field>
>> <field name="dsm.OCR"/>
>>
>> So it is not grabbing the managed content (OCR in my case).
>>
>> foxmlToSolr.xslt includes an external function definition and I believe
>> it is using it for managed content:
>> ======
>> …
>> xmlns:exts="xalan://dk.defxws.fedoragsearch.server.GenericOperationsImpl"
>> …
>> <xsl:value-of select="exts:getDatastreamText($PID, $REPOSITORYNAME,
>> @ID, $FEDORASOAP, $FEDORAUSER, $FEDORAPASS, $TRUSTSTOREPATH,
>> $TRUSTSTOREPASS)"/>
>> …
>> =====
>>
>> Could somebody tell me whether it is at all possible to get managed
>> content into the output when I am doing command line processing?
>> Again, managed content is getting into the index as part of the regular
>> FedoraGSearch workflow with the same foxmlToSolr.xslt.
>>
>> Thanks,
>> Serhiy
>>
>
> _______________________________________________
> Fedora-commons-users mailing list
> Fedora-commons-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/fedora-commons-users