Grant Ingersoll wrote:
Cool, hadn't seen that.
Hi folks. Long time lurker (in RSS), first time mailer. I just
wanted to say that (obviously) I think this is a great idea and we
should try to push it a little further along. I posted a bit more
about it in my blog this morning:
http://blogs.sun.com/searchguy/entry/open_source_trec_trecmentum
The practical upshot: I'd be more than happy to participate in this
and to try to get data sources and queries from Sun or elsewhere. I'd
also be up for trying to find some place to host the collections and
maybe even try to figure out some way that we could get computing
resources to run the evaluations. No guarantees on that (I'm sure a
Sun Lawyer's ears are burning somewhere right now, just for me having
said that!), but I'm willing to tilt at that windmill.
TREC had a huge impact on the academic and commercial IR communities
and I think an OSTREC (see, it's already got a cool acronym!) could
benefit all of us (it would give us bragging rights if nothing else :-)
This is not directly related to Lucene development, so we might want
to take further discussions off-list.
On May 20, 2008, at 1:01 PM, Steven A Rowe wrote:
> On 05/19/2008 at 3:58 PM, Grant Ingersoll wrote:
>> I think it is time the open source search community (and
>> I don=92t mean just Lucene) develop and publish a set of
>> TREC-style relevance judgments for freely available data
>> that is easily obtained from the Internet.
>
> Stephen Green, Minion developer at Sun, whose posts comparing
Minion =20=
> and Lucene were recently mentioned on the solr-user mailing
list[1], =20=
> has similar ideas. =46rom =
<http://blogs.sun.com/searchguy/entry/minion_and_lucene_performance=20
> >:
>
> I think it would be a good idea for all of the open
> source engines to get together, find a nice open document
> collection (the Apache mailing list archives and their
> associated searches?) and build a nice set of regression
> tests and some pooled relevance sets so that we can test
> retrieval performance without having to rely on the TREC
> data.
>
Steve
--
Stephen Green // [EMAIL PROTECTED]
Principal Investigator \\ http://blogs.sun.com/searchguy
Aura Project // Voice: +1 781-442-0926
Sun Microsystems Labs \\ Fax: +1 781-442-1692
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]