Use an encrypting filesystem rather than encrypt the index
IMO.

Here's the problem. Any encryption process that you could
use for encoding short tokens that you can then search is
easily broken (ask Adobe about that!).

Wildcards won't work. Consider that you've indexed (encrypted)
running and run. Searching run* won't match running, unless the
encryption is completely trivial.

You _can_ (and I have) create a custom bit of code that will encrypt
the _stored_ data but not the indexed data. That still makes it
possible to pretty much reconstruct the document from the index.
It's painful, but it can be done (Luke does this for instance).

Basically, any scheme I've seen that allows robust searching
doesn't really buy you much in the way of security, you're better
off encrypting at the filesystem level.

Best,
Erick


On Tue, Nov 19, 2013 at 6:35 AM, Vincent Sevel <v.se...@lombardodier.com>wrote:

> Hi,
>
> I am looking for an option to use lucene with an encrypted index on disk.
> I looked for some lucene encrypted directories and found few options:
>
> -          https://issues.apache.org/jira/browse/LUCENE-2228
>
> -          https://code.google.com/p/lucenetransform/
> But they are old, or with no activity.
>
> I am wondering if there are recommended solutions in that space?
> Thanks for helping,
> vince
>
> ************************ DISCLAIMER ************************
> This message is intended only for use by the person to
> whom it is addressed. It may contain information that is
> privileged and confidential. Its content does not constitute
> a formal commitment by Lombard Odier & Cie or any of its
> branches or affiliates. If you are not the intended recipient
> of this message, kindly notify the sender immediately and
> destroy this message. Thank You.
> *****************************************************************
>

Reply via email to