Expanding 25+ GB per day is not ideal. If its possible to index the binary directly, as it sounds like it might, we'll just do that.
I think what I was missing was - I didn't see AbstractField which seems like it has the stuff I need (if indeed Field is used as I assume it is) ./paul Sent from my Verizon Wireless BlackBerry -----Original Message----- From: Shashi Kant <[email protected]> Date: Fri, 30 Jan 2009 09:38:16 To: <[email protected]> Subject: Re: indexing binary files? Unless I am missing something, not sure I see the issue here. You can convert to Base64 purely for indexing purposes and leave the original binary as-is. ----- Original Message ---- From: Paul Feuer <[email protected]> To: Lucene User List <[email protected]>; Shashi Kant <[email protected]> Sent: Friday, January 30, 2009 10:12:33 AM Subject: Re: indexing binary files? The binary events in the file are parsable by both our java server-side processes and the clients of these processes, so we need to keep the data in the binary format. ../paul Sent from my Verizon Wireless BlackBerry -----Original Message----- From: Shashi Kant <[email protected]> Date: Fri, 30 Jan 2009 06:32:19 To: <[email protected]> Subject: Re: indexing binary files? Hi Paul, have you tried persisting the binaries in Base64 format and then indexing them? As you are aware, Base64 is a robust representation used in email attachments for example. Thanks Shashi ----- Original Message ---- From: Paul Feuer <[email protected]> To: [email protected] Sent: Thursday, January 29, 2009 10:43:36 PM Subject: indexing binary files? Hi - I've looked on the FAQ, the Java Docs, and searched a little in google, but haven't been able to figure out if Lucene can index binary files. Our binary files can get up into the 20-30 gigabyte range. If it is possible, anyone have any pointers to what interfaces I should look at? Thanks, .../paul --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
