On Tue, Jun 25, 2013 at 8:54 AM, Shai Erera <[email protected]> wrote:

> Hi
>
> Today TFIDFSimilarity forces the encoding of norms into a single byte, and
> there's no way to override it. E.g. if I don't want to lose precision, the
> only thing I can do is write a different Similarity while copying most of
> the code from TFIDFSimilarity.
>

But as you said, it's expert enough :)

I'm a little worried about how complex this would make the API. Today
TFIDFSimilarity hides all of this stuff and only provides a simple API with
tf(), idf(), etc. for tuning. I think that's really how they all should
work...
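
To make the precision concern quoted above concrete, here is a rough,
self-contained sketch of what squeezing a float norm into one byte does. It is
only loosely modeled on the kind of small-float packing Lucene uses for norms
and is not the library's code: the class OneByteNormSketch, its encode/decode
methods, and the constants are all hypothetical names chosen for illustration,
and the norm is assumed to be 1/sqrt(field length).

```java
final class OneByteNormSketch {
    // Assumed layout for this sketch: keep only the top bits of the IEEE 754
    // representation and shift the range so that 1.0f is representable.
    private static final int SHIFT = 24 - 3;
    private static final int ZERO_OFFSET = (63 - 15) << 3;

    // Lossy float -> byte: low-order mantissa bits are simply dropped.
    static byte encode(float f) {
        int bits = Float.floatToRawIntBits(f);
        int small = bits >> SHIFT;
        if (small <= ZERO_OFFSET) {
            // Zero or negative maps to 0; tiny positives map to the smallest code.
            return (bits <= 0) ? (byte) 0 : (byte) 1;
        }
        if (small >= ZERO_OFFSET + 0x100) {
            return (byte) -1; // too large: clamp to the biggest code
        }
        return (byte) (small - ZERO_OFFSET);
    }

    // byte -> float: rebuild the float with the dropped bits set to zero.
    static float decode(byte b) {
        if (b == 0) {
            return 0.0f;
        }
        int bits = ((b & 0xff) + ZERO_OFFSET) << SHIFT;
        return Float.intBitsToFloat(bits);
    }

    public static void main(String[] args) {
        // Assumed norm: 1/sqrt(number of terms in the field).
        for (int length = 4; length <= 8; length++) {
            float norm = (float) (1.0 / Math.sqrt(length));
            byte stored = encode(norm);
            System.out.println("length=" + length + " norm=" + norm
                    + " stored=" + (stored & 0xff) + " decoded=" + decode(stored));
        }
    }
}
```

Under this sketch, fields of 6 and 7 terms end up with the same stored byte and
both decode to 0.375, which is the kind of precision loss being discussed. A
wider encoding would keep them apart, at the cost of more storage per document.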
