Re: query regarding Indexing xml files -db-data-config.xml

2009-05-18 Thread jayakeerthi s
Hi  Noble,

Thanks for the reply,

As advised I have changed the db-data-config.xml as below. But still the
str name=Indexing completed. Added/Updated: 0 documents. Deleted 0
documents./str

dataConfig
dataSource type=FileDataSource name =xmlindex/
document name=products
 entity name=xmlfile processor=FileListEntityProcessor
fileName=c:\\test\\ipod_other.xml  recursive=true rootEntity=false
dataSource=null baseDir=${dataimporter.request.xmlDataDir}
useSolrAddSchema=true
entity name=data processor=XPathEntityProcessor
url=${xmlfile.fileAbsolutePath}
  field column=manu name=manu/
  /entity
   /entity
   /document
/dataConfig


Got error as below when baseDir is removed

INFO: last commit = 1242683454570
May 18, 2009 2:55:15 PM org.apache.solr.handler.dataimport.DataImporter
doFullImport
SEVERE: Full Import failed
org.apache.solr.handler.dataimport.DataImportHandlerException: 'baseDir' is
a required attribute Pro
cessing Document # 1
at
org.apache.solr.handler.dataimport.FileListEntityProcessor.init(FileListEntityProcessor.j
ava:76)
at
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:299)
at
org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:225)
at
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:167)
at
org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:324)
at
org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:382)
at
org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:363)
May 18, 2009 2:55:15 PM org.apache.solr.update.DirectUpdateHandler2 rollback
INFO: start rollback

Please advise.

Thanks and regards,
Jay

2009/5/17 Noble Paul നോബിള്‍ नोब्ळ् noble.p...@corp.aol.com

 hi ,
 u may not need that enclosing entity , if you only wish to index one file.

 baseDir is not required if you give absolute path in the fileName.

 no need to mention forEach or fields if you set useSolrAddSchema=true

 On Sat, May 16, 2009 at 1:23 AM, jayakeerthi s mail2keer...@gmail.com
 wrote:
  Hi All,
 
  I am trying to index the fileds from the xml files, here is the
  configuration that I am using.
 
 
  db-data-config.xml
 
  dataConfig
 dataSource type=FileDataSource name =xmlindex/
 document name=products
  entity name=xmlfile processor=FileListEntityProcessor
  fileName=c:\test\ipod_other.xml  recursive=true rootEntity=false
  dataSource=null baseDir=${dataimporter.request.xmlDataDir}
  entity name=data processor=XPathEntityProcessor forEach=/record
 |
  /the/record/xpath  url=${xmlfile.fileAbsolutePath}
 field column=manu
  name=manu/
 
  /entity
 /entity
/document
  /dataConfig
 
  Schema.xml has the field manu
 
  The input xml file used to import the field is
 
  doc
   field name=idF8V7067-APL-KIT/field
   field name=nameBelkin Mobile Power Cord for iPod w/ Dock/field
   field name=manuBelkin/field
   field name=catelectronics/field
   field name=catconnector/field
   field name=featurescar power adapter, white/field
   field name=weight4/field
   field name=price19.95/field
   field name=popularity1/field
   field name=inStockfalse/field
  /doc
 
 
  doing the full-import this is the response I am getting
 
  - lst name=statusMessages
   str name=Total Requests made to DataSource0/str
   str name=Total Rows Fetched0/str
   str name=Total Documents Skipped0/str
   str name=Full Dump Started2009-05-15 11:58:00/str
   str name=Indexing completed. Added/Updated: 0 documents. Deleted 0
  documents./str
   str name=Committed2009-05-15 11:58:00/str
   str name=Optimized2009-05-15 11:58:00/str
   str name=Time taken0:0:0.172/str
   /lst
   str name=WARNINGThis response format is experimental. It is likely
 to
  change in the future./str
   /response
 
 
  Do I missing anything here or is there any format on the input xml,??
 please
  help resolving this.
 
  Thanks and regards,
  Jay
 



 --
 -
 Noble Paul | Principal Engineer| AOL | http://aol.com



Re: query regarding Indexing xml files -db-data-config.xml

2009-05-17 Thread Noble Paul നോബിള്‍ नोब्ळ्
hi ,
u may not need that enclosing entity , if you only wish to index one file.

baseDir is not required if you give absolute path in the fileName.

no need to mention forEach or fields if you set useSolrAddSchema=true

On Sat, May 16, 2009 at 1:23 AM, jayakeerthi s mail2keer...@gmail.com wrote:
 Hi All,

 I am trying to index the fileds from the xml files, here is the
 configuration that I am using.


 db-data-config.xml

 dataConfig
    dataSource type=FileDataSource name =xmlindex/
    document name=products
     entity name=xmlfile processor=FileListEntityProcessor
 fileName=c:\test\ipod_other.xml  recursive=true rootEntity=false
 dataSource=null baseDir=${dataimporter.request.xmlDataDir}
     entity name=data processor=XPathEntityProcessor forEach=/record |
 /the/record/xpath  url=${xmlfile.fileAbsolutePath}
            field column=manu
 name=manu/

     /entity
        /entity
       /document
 /dataConfig

 Schema.xml has the field manu

 The input xml file used to import the field is

 doc
  field name=idF8V7067-APL-KIT/field
  field name=nameBelkin Mobile Power Cord for iPod w/ Dock/field
  field name=manuBelkin/field
  field name=catelectronics/field
  field name=catconnector/field
  field name=featurescar power adapter, white/field
  field name=weight4/field
  field name=price19.95/field
  field name=popularity1/field
  field name=inStockfalse/field
 /doc


 doing the full-import this is the response I am getting

 - lst name=statusMessages
  str name=Total Requests made to DataSource0/str
  str name=Total Rows Fetched0/str
  str name=Total Documents Skipped0/str
  str name=Full Dump Started2009-05-15 11:58:00/str
  str name=Indexing completed. Added/Updated: 0 documents. Deleted 0
 documents./str
  str name=Committed2009-05-15 11:58:00/str
  str name=Optimized2009-05-15 11:58:00/str
  str name=Time taken0:0:0.172/str
  /lst
  str name=WARNINGThis response format is experimental. It is likely to
 change in the future./str
  /response


 Do I missing anything here or is there any format on the input xml,?? please
 help resolving this.

 Thanks and regards,
 Jay




-- 
-
Noble Paul | Principal Engineer| AOL | http://aol.com


Re: query regarding Indexing xml files -db-data-config.xml

2009-05-16 Thread Fergus McMenemie
Hmmm, 

I thought that if you were using the XPathEntityProcessor that 
you have to specify an xpath for each of the fields you want
to populate. Unless you are using XPathEntityProcessor's use
useSolrAddSchema mode?

Fergus.

If that is your complete input file then it looks like you are missing the
wrapping add/add element:

add
doc
 field name=idF8V7067-APL-KIT/

 field
  field name=nameBelkin Mobile Power Cord for iPod w/ Dock/field
  field name=manuBelkin/field
  field name=catelectronics/field
  field name=catconnector/field
  field name=featurescar power adapter, white/field
  field name=weight4/field
  field name=price19.95/field
  field name=popularity1/field
  field name=inStockfalse/field
 /doc

/add

Is it possible you just forgot to include the add?

-Jay


On Fri, May 15, 2009 at 12:53 PM, jayakeerthi s mail2keer...@gmail.comwrote:

 Hi All,

 I am trying to index the fileds from the xml files, here is the
 configuration that I am using.


 db-data-config.xml

 dataConfig
dataSource type=FileDataSource name =xmlindex/
document name=products
 entity name=xmlfile processor=FileListEntityProcessor
 fileName=c:\test\ipod_other.xml  recursive=true rootEntity=false
 dataSource=null baseDir=${dataimporter.request.xmlDataDir}
 entity name=data processor=XPathEntityProcessor forEach=/record |
 /the/record/xpath  url=${xmlfile.fileAbsolutePath}
field column=manu
 name=manu/

 /entity
/entity
   /document
 /dataConfig

 Schema.xml has the field manu

 The input xml file used to import the field is

 doc
  field name=idF8V7067-APL-KIT/field
  field name=nameBelkin Mobile Power Cord for iPod w/ Dock/field
  field name=manuBelkin/field
  field name=catelectronics/field
  field name=catconnector/field
  field name=featurescar power adapter, white/field
  field name=weight4/field
  field name=price19.95/field
  field name=popularity1/field
  field name=inStockfalse/field
 /doc


 doing the full-import this is the response I am getting

 - lst name=statusMessages
  str name=Total Requests made to DataSource0/str
  str name=Total Rows Fetched0/str
  str name=Total Documents Skipped0/str
  str name=Full Dump Started2009-05-15 11:58:00/str
  str name=Indexing completed. Added/Updated: 0 documents. Deleted 0
 documents./str
  str name=Committed2009-05-15 11:58:00/str
  str name=Optimized2009-05-15 11:58:00/str
  str name=Time taken0:0:0.172/str
  /lst
  str name=WARNINGThis response format is experimental. It is likely to
 change in the future./str
  /response


 Do I missing anything here or is there any format on the input xml,??
 please
 help resolving this.

 Thanks and regards,
 Jay


-- 

===
Fergus McMenemie   Email:fer...@twig.me.uk
Techmore Ltd   Phone:(UK) 07721 376021

Unix/Mac/Intranets Analyst Programmer
===


Re: query regarding Indexing xml files -db-data-config.xml

2009-05-15 Thread Jay Hill
If that is your complete input file then it looks like you are missing the
wrapping add/add element:

add
doc
 field name=idF8V7067-APL-KIT/

 field
  field name=nameBelkin Mobile Power Cord for iPod w/ Dock/field
  field name=manuBelkin/field
  field name=catelectronics/field
  field name=catconnector/field
  field name=featurescar power adapter, white/field
  field name=weight4/field
  field name=price19.95/field
  field name=popularity1/field
  field name=inStockfalse/field
 /doc

/add

Is it possible you just forgot to include the add?

-Jay


On Fri, May 15, 2009 at 12:53 PM, jayakeerthi s mail2keer...@gmail.comwrote:

 Hi All,

 I am trying to index the fileds from the xml files, here is the
 configuration that I am using.


 db-data-config.xml

 dataConfig
dataSource type=FileDataSource name =xmlindex/
document name=products
 entity name=xmlfile processor=FileListEntityProcessor
 fileName=c:\test\ipod_other.xml  recursive=true rootEntity=false
 dataSource=null baseDir=${dataimporter.request.xmlDataDir}
 entity name=data processor=XPathEntityProcessor forEach=/record |
 /the/record/xpath  url=${xmlfile.fileAbsolutePath}
field column=manu
 name=manu/

 /entity
/entity
   /document
 /dataConfig

 Schema.xml has the field manu

 The input xml file used to import the field is

 doc
  field name=idF8V7067-APL-KIT/field
  field name=nameBelkin Mobile Power Cord for iPod w/ Dock/field
  field name=manuBelkin/field
  field name=catelectronics/field
  field name=catconnector/field
  field name=featurescar power adapter, white/field
  field name=weight4/field
  field name=price19.95/field
  field name=popularity1/field
  field name=inStockfalse/field
 /doc


 doing the full-import this is the response I am getting

 - lst name=statusMessages
  str name=Total Requests made to DataSource0/str
  str name=Total Rows Fetched0/str
  str name=Total Documents Skipped0/str
  str name=Full Dump Started2009-05-15 11:58:00/str
  str name=Indexing completed. Added/Updated: 0 documents. Deleted 0
 documents./str
  str name=Committed2009-05-15 11:58:00/str
  str name=Optimized2009-05-15 11:58:00/str
  str name=Time taken0:0:0.172/str
  /lst
  str name=WARNINGThis response format is experimental. It is likely to
 change in the future./str
  /response


 Do I missing anything here or is there any format on the input xml,??
 please
 help resolving this.

 Thanks and regards,
 Jay



Re: query regarding Indexing xml files -db-data-config.xml

2009-05-15 Thread jayakeerthi s
Many thanks for the reply

The complete input xml file is below I missed to include this earlier.


add
doc
  field name=idF8V7067-APL-KIT/field
  field name=nameBelkin Mobile Power Cord for iPod w/ Dock/field
  field name=manuBelkin/field
  field name=catelectronics/field
  field name=catconnector/field
  field name=featurescar power adapter, white/field
  field name=weight4/field
  field name=price19.95/field
  field name=popularity1/field
  field name=inStockfalse/field
/doc
doc
  field name=idIW-02/field
  field name=nameiPod amp; iPod Mini USB 2.0 Cable/field
  field name=manuBelkin/field
  field name=catelectronics/field
  field name=catconnector/field
  field name=featurescar power adapter for iPod, white/field
  field name=weight2/field
  field name=price11.50/field
  field name=popularity1/field
  field name=inStockfalse/field
/doc

/add

regards,
Jay
On Fri, May 15, 2009 at 1:14 PM, Jay Hill jayallenh...@gmail.com wrote:

 If that is your complete input file then it looks like you are missing the
 wrapping add/add element:

 add
 doc
  field name=idF8V7067-APL-KIT/
 
  field
   field name=nameBelkin Mobile Power Cord for iPod w/ Dock/field
   field name=manuBelkin/field
   field name=catelectronics/field
   field name=catconnector/field
   field name=featurescar power adapter, white/field
   field name=weight4/field
   field name=price19.95/field
   field name=popularity1/field
   field name=inStockfalse/field
  /doc

 /add

 Is it possible you just forgot to include the add?

 -Jay


 On Fri, May 15, 2009 at 12:53 PM, jayakeerthi s mail2keer...@gmail.com
 wrote:

  Hi All,
 
  I am trying to index the fileds from the xml files, here is the
  configuration that I am using.
 
 
  db-data-config.xml
 
  dataConfig
 dataSource type=FileDataSource name =xmlindex/
 document name=products
  entity name=xmlfile processor=FileListEntityProcessor
  fileName=c:\test\ipod_other.xml  recursive=true rootEntity=false
  dataSource=null baseDir=${dataimporter.request.xmlDataDir}
  entity name=data processor=XPathEntityProcessor forEach=/record
 |
  /the/record/xpath  url=${xmlfile.fileAbsolutePath}
 field column=manu
  name=manu/
 
  /entity
 /entity
/document
  /dataConfig
 
  Schema.xml has the field manu
 
  The input xml file used to import the field is
 
  doc
   field name=idF8V7067-APL-KIT/field
   field name=nameBelkin Mobile Power Cord for iPod w/ Dock/field
   field name=manuBelkin/field
   field name=catelectronics/field
   field name=catconnector/field
   field name=featurescar power adapter, white/field
   field name=weight4/field
   field name=price19.95/field
   field name=popularity1/field
   field name=inStockfalse/field
  /doc
 
 
  doing the full-import this is the response I am getting
 
  - lst name=statusMessages
   str name=Total Requests made to DataSource0/str
   str name=Total Rows Fetched0/str
   str name=Total Documents Skipped0/str
   str name=Full Dump Started2009-05-15 11:58:00/str
   str name=Indexing completed. Added/Updated: 0 documents. Deleted 0
  documents./str
   str name=Committed2009-05-15 11:58:00/str
   str name=Optimized2009-05-15 11:58:00/str
   str name=Time taken0:0:0.172/str
   /lst
   str name=WARNINGThis response format is experimental. It is likely
 to
  change in the future./str
   /response
 
 
  Do I missing anything here or is there any format on the input xml,??
  please
  help resolving this.
 
  Thanks and regards,
  Jay