On Fri, Mar 02, 2007 at 11:00:56PM +0100, Martin Zobel-Helas wrote: > > Although I think the idea is nice, I don't think the current data is all > > that usable. Some examples of "links" that IMO are completely useless in > > practice, just from the top of the file and not even complete for the > > selected packages: > > 3dchess: kworldclock > > 3ddesktop: module-assistant, devscripts > > 915resolution: linux-image-2.6.18-3-686 > > 9base: xserver-xephyr, libxml2-dev > > IMO some heavy filtering needs to be done for this data to be anything > > more than a toy and publishable. > i agree with you that we need to do some data filtering here. OTOH this > intends to be an "amazon"like feature, where not every package needs to > stand in any relation to the other package. What needs to be done is > some filtering, so not every package lists libfoo or bar-common, but i > guess Enrico has already done so.
So far I've seen two causes for bad suggestions:
1) Suggestions for a package that is too popular tend to be
meaningless: this is because when I query Xapian with, for example,
"please give me 20 typical systems that have 'grep' installed", I
get random systems as all systems have grep installed.
This *might* be detectable looking at the Xapian's relevance
estimate, which I'd expect to be low in cases like this.
2) Packages being partially tagged. I normally filter out all the
libfoobar0 and foo-common packages using this tag expression:
'!role::shared-lib && !role::app-data'
but this filtering fails if a -common package, for example, is not
well tagged and does not have the role::app-data tag.
This is, afaict, only solvable by fixing the tags.
I'd be happy if people could find out more.
> Perhaps you can give some idea on how you would implement a better
> filtering.
> PS: I would really like to see that feature on packages.debian.org :)
Me too. I wouldn't actually mind if we just added it as it is, marked
*experimental*, and see where we can go from there.
Ciao,
Enrico
--
GPG key: 1024D/797EBFAB 2000-12-05 Enrico Zini <[EMAIL PROTECTED]>
signature.asc
Description: Digital signature

