I've got them configured at index and query time, so sounds like I'm
all set.
I'm doing anonymization of social security numbers, converting them to
xxx-xx-xxxx. I don't *think* users can find a way of identifying these
docs if the stopwords-based block works.
Thank you both for the confirmation.
Mike
On Sun 08 Jan 2012 09:32:53 PM PST, Gora Mohanty wrote:
On Mon, Jan 9, 2012 at 5:03 AM, Michael Lissner
<mliss...@michaeljaylissner.com> wrote:
I have a unique use case where I have words in my corpus that users
shouldn't ever be allowed to search for. My theory is that if I add these to
the stopwords list, that should do the trick.
Yes, that should work. Are you including the stop words at index-time,
query-time, or both? Normally, you should do both.
If done at the time of indexing, these terms will not even be in the
index, so I cannot think of any security issues.
Regards,
Gora