Out of Memory

2010-03-23 Thread Neil Chaudhuri
I am using the DataImportHandler to index literally millions of documents in an Oracle database. Not surprisingly, I got the following after a few hours: java.sql.SQLException: ORA-04030: out of process memory when trying to allocate 4032 bytes (kolaGetRfcHeap,kghsseg: kolaslCreateCtx) Has

XPath Processing Applied to Clob

2010-03-17 Thread Neil Chaudhuri
I am using the DataImportHandler to index 3 fields in a table: an id, a date, and the text of a document. This is an Oracle database, and the document is an XML document stored as Oracle's xmltype data type. Since this is nothing more than a fancy CLOB, I am using the ClobTransformer to extract

RE: XPath Processing Applied to Clob

2010-03-17 Thread Neil Chaudhuri
anything. Any insight is appreciated. Thanks. From: Neil Chaudhuri Sent: Wednesday, March 17, 2010 3:24 PM To: solr-user@lucene.apache.org Subject: XPath Processing Applied to Clob I am using the DataImportHandler to index 3 fields in a table: an id, a date, and the text of a document

Moving From Oracle Text Search To Solr

2010-03-16 Thread Neil Chaudhuri
I am working on an application that currently hits a database containing millions of very large documents. I use Oracle Text Search at the moment, and things work fine. However, there is a request for faceting capability, and Solr seems like a technology I should look at. Suffice to say I am

RE: Moving From Oracle Text Search To Solr

2010-03-16 Thread Neil Chaudhuri
://www.packtpub.com/solr-1-4-enterprise-search-server/ On Mar 16, 2010, at 4:13 PM, Neil Chaudhuri wrote: Certainly I could use some basic SQL count(*) queries to achieve faceted results, but I am not sure of the flexibility, extensibility, or scalability of that approach. And from what I have read

Indexing CLOB Column in Oracle

2010-03-16 Thread Neil Chaudhuri
Since my original thread was straying to a new topic, I thought it made sense to create a new thread of discussion. I am using the DataImportHandler to index 3 fields in a table: an id, a date, and the text of a document. This is an Oracle database, and the document is an XML document stored