Just for background, I'll mention that TF-IDF scores are relative to the contents of your database. So if the database changes, the scores for queries may also change. If your database is active, scores will almost certainly change over time. In general this is more noticeable with smaller databases, but it happens with any database.
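To make that concrete, here is a minimal sketch using the textbook TF-IDF formula (term frequency times log(N/df)). It is an assumption for illustration only, not MarkLogic's actual scoring code, and the function names and toy documents are made up. The point is that the score of the same term in the same document moves when other documents are added.

```python
import math

def idf(term, docs):
    """Textbook inverse document frequency, log(N / df).
    Illustrative only; not MarkLogic's scoring formula."""
    n_docs = len(docs)
    df = sum(1 for d in docs if term in d.split())
    return math.log(n_docs / df) if df else 0.0

def tf_idf(term, doc, docs):
    """Raw term count in `doc` times the IDF computed over `docs`."""
    tf = doc.split().count(term)
    return tf * idf(term, docs)

# Hypothetical toy "database" of three documents.
docs = ["cat sat mat", "dog sat log", "cat chased dog"]
target = docs[0]

before = tf_idf("cat", target, docs)

# Add three unrelated documents; the target document itself is unchanged.
docs_grown = docs + ["bird flew away", "fish swam deep", "horse ran far"]
after = tf_idf("cat", target, docs_grown)

print(f"before: {before:.4f}, after: {after:.4f}")
```

In this sketch the score for "cat" goes up because the term became rarer relative to the whole collection, even though the matching document did not change.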
That aside, if I noticed a change of several orders of magnitude, I too would suspect a change in algorithm. Without having any inside information, I suppose that it could be a bug, or the MarkLogic folks decided it would work better with more dynamic range. Scores are xs:integer, and there is a fair amount of math involved, so accuracy could be better using larger scores.

-- Mike

On 28 Oct 2011, at 10:39, John Mulholland wrote:

> I have noticed that version 4.2-4 scores were lower than 1000. As of version
> 4.2-5 through 4.2-7 the scores have gone up dramatically, over 100000. I
> have been unable to find any information on this change. Does anyone have
> any insight?
>
> John

_______________________________________________
General mailing list
[email protected]
http://developer.marklogic.com/mailman/listinfo/general
