Index Optimization: Which is Better?

2001-10-11 Thread W. Eliot Kimber
, is there anything we can or need to do to optimize Lucene to handle lots of little Lucene documents? Thanks, Eliot -- . . . . . . . . . . . . . . . . . . . . . . . . W. Eliot Kimber | Lead Brain 1016 La Posada Dr. | Suite 240 | Austin TX 78752 T 512.656.4139 | F 512.419.1860 | [EMAIL PROTECTED

Re: Index Optimization: Which is Better?

2001-10-12 Thread W. Eliot Kimber
hope it performs adequately. Cheers, E.

Another Indexing Question: Case Sensitivity

2001-10-13 Thread W. Eliot Kimber

Indexing XML With Lucene: Some Initial Results

2001-10-14 Thread W. Eliot Kimber
). Cheers, Eliot

Trying To Understand Query Syntax Details

2001-10-16 Thread W. Eliot Kimber
, is there a description of the algorithm ~ uses? Thanks, E.

Re: Trying To Understand Query Syntax Details

2001-10-16 Thread W. Eliot Kimber

XML Indexing Samples

2001-10-16 Thread W. Eliot Kimber
I have put together a hopefully useful package that demonstrates our current experiments with using Lucene for XML indexing. You can get the files by anonymous ftp from che.isogen.com, /outgoing/lucene. There are two zip files:
- lucene_xml_indexing.zip: This is the core indexing code and a

Re: Zones

2002-01-25 Thread W. Eliot Kimber
Ogren, Philip V. wrote: We are indexing a large corpus of XML documents (~10M). One thing that Verity does with XML notes is that it indexes each XML tag as a zone.* What's cool about it is that the zones are nested so that it mirrors the schema of your XML document. You can limit your

XML Indexing With Lucene: New Location For Package

2002-02-01 Thread W. Eliot Kimber
You can now find our package for doing XML indexing with Lucene on the ISOGEN web site: http://www.isogen.com/papers/lucene_xml_indexing.html The package (lucene_xml_indexing.zip) includes all the 3rd-party libraries it depends on (Lucene, Xerces 1.4.4, junit). This package is provided as-is

Re: indexing PDF files

2002-05-03 Thread W. Eliot Kimber
this functionality in order to correlate PDF annotations (links, bookmarks, notes) to the page objects they relate to--it's all done with bounding boxes. Cheers, Eliot -- W. Eliot Kimber, [EMAIL PROTECTED] Consultant, ISOGEN International 1016 La Posada Dr., Suite 240 Austin, TX 78752 Phone

Re: PDF4J Project: Gathering Feature Requests

2002-05-06 Thread W. Eliot Kimber
main writing use case is the rewriting of existing PDFs following some amount of manipulation through our API. A caution: I am still waiting to get approval from my employers to do this work as open source--it may be a while before I can even start on the coding. Cheers, Eliot

XML Lucene Indexing Package Updated

2002-05-15 Thread W. Eliot Kimber
/runLuceneClient.bat script (on Windows) and it should just work. If it doesn't, let me know. Cheers, Eliot