Hi Mike, Just curious to know the use-case here. Why do you want to limit updates to 100 instead of importing all documents?
On Tue, Jun 24, 2008 at 10:23 AM, mike segv <[EMAIL PROTECTED]> wrote: > > That fixed it. > > If I'm inserting millions of documents, how do I control docs/update? E.g. > if there are 50K docs per file, I'm thinking that I should probably code up > my own DataSource that allows me to stipulate docs/update. Like say, 100 > instead of 50K. Does this make sense? > > Mike > > > Noble Paul നോബിള് नोब्ळ् wrote: > > > > hi , > > You have not registered any datasources . the second entity needs a > > datasource. > > Remove the dataSource="null" and add a name for the second entity > > (good practice). No need for baseDir attribute for second entity . > > See the modified xml added below > > --Noble > > > > <dataConfig> > > <dataSource type="FileDataSource"/> > > <document> > > <entity name="f" processor="FileListEntityProcessor" fileName=".*xml" > > newerThan="'NOW-10DAYS'" recursive="true" rootEntity="false" > > dataSource="null" baseDir="/san/tomcat-services/solr-medline"> > > <entity name="x" processor="XPathEntityProcessor" > > forEach="/MedlineCitation" > > url="${f.fileAbsolutePath}" > > > <field column="pmid" xpath="/MedlineCitation/PMID"/> > > </entity> > > </entity> > > </document> > > </dataConfig> > > > > On Tue, Jun 24, 2008 at 6:39 AM, mike segv <[EMAIL PROTECTED]> wrote: > >> > >> I'm trying to use the fileListEntityProcessor to add some xml documents > >> to a > >> solr index. I'm running a nightly version of solr-1.3 with SOLR-469 and > >> SOLR-563. I've been able to successfuly run the slashdot httpDataSource > >> example. My data-config.xml file loads without errors. When I attempt > >> the > >> full-import command I get the exception below. Thanks for any help. > >> > >> Mike > >> > >> WARNING: No lockType configured for > >> /san/tomcat-services/solr-medline/solr/data/index/ assuming 'simple' > >> Jun 23, 2008 7:59:49 PM org.apache.solr.handler.dataimport.DataImporter > >> doFullImport > >> SEVERE: Full Import failed > >> java.lang.RuntimeException: java.lang.NullPointerException > >> at > >> > org.apache.solr.handler.dataimport.XPathRecordReader.streamRecords(XPathRecordReader.java:97) > >> at > >> > org.apache.solr.handler.dataimport.XPathEntityProcessor.initQuery(XPathEntityProcessor.java:212) > >> at > >> > org.apache.solr.handler.dataimport.XPathEntityProcessor.fetchNextRow(XPathEntityProcessor.java:166) > >> at > >> > org.apache.solr.handler.dataimport.XPathEntityProcessor.nextRow(XPathEntityProcessor.java:149) > >> at > >> > org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:286) > >> at > >> > org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:312) > >> at > >> > org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:179) > >> at > >> > org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:140) > >> at > >> > org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:335) > >> at > >> > org.apache.solr.handler.dataimport.DataImporter.rumCmd(DataImporter.java:386) > >> at > >> > org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:377) > >> Caused by: java.lang.NullPointerException > >> at java.io.Reader.<init>(Reader.java:61) > >> at java.io.BufferedReader.<init>(BufferedReader.java:76) > >> at com.bea.xml.stream.MXParser.checkForXMLDecl(MXParser.java:775) > >> at com.bea.xml.stream.MXParser.setInput(MXParser.java:806) > >> at > >> > com.bea.xml.stream.MXParserFactory.createXMLStreamReader(MXParserFactory.java:261) > >> at > >> > org.apache.solr.handler.dataimport.XPathRecordReader.streamRecords(XPathRecordReader.java:93) > >> ... 10 more > >> > >> Here is my data-config: > >> > >> <dataConfig> > >> <document> > >> <entity name="f" processor="FileListEntityProcessor" fileName=".*xml" > >> newerThan="'NOW-10DAYS'" recursive="true" rootEntity="false" > >> dataSource="null" baseDi > >> r="/san/tomcat-services/solr-medline"> > >> <entity processor="XPathEntityProcessor" forEach="/MedlineCitation" > >> url="${f.fileAbsolutePath}" dataSource="null"> > >> <field column="pmid" xpath="/MedlineCitation/PMID"/> > >> </entity> > >> </entity> > >> </document> > >> </dataConfig> > >> > >> And a snippet from an xml file: > >> <MedlineCitation Owner="PIP" Status="MEDLINE"> > >> <PMID>12236137</PMID> > >> <DateCreated> > >> <Year>1980</Year> > >> <Month>01</Month> > >> <Day>03</Day> > >> </DateCreated> > >> > >> > >> -- > >> View this message in context: > >> > http://www.nabble.com/Attempting-dataimport-using-FileListEntityProcessor-tp18081671p18081671.html > >> Sent from the Solr - User mailing list archive at Nabble.com. > >> > >> > > > > > > > > -- > > --Noble Paul > > > > > > -- > View this message in context: > http://www.nabble.com/Attempting-dataimport-using-FileListEntityProcessor-tp18081671p18083747.html > Sent from the Solr - User mailing list archive at Nabble.com. > > -- Regards, Shalin Shekhar Mangar.