Grant Ingersoll wrote:

Cool, hadn't seen that.

Hi folks. Long time lurker (in RSS), first time mailer. I just wanted to say that (obviously) I think this is a great idea and we should try to push it a little further along. I posted a bit more about it in my blog this morning:

http://blogs.sun.com/searchguy/entry/open_source_trec_trecmentum

The practical upshot: I'd be more than happy to participate in this and to try to get data sources and queries from Sun or elsewhere. I'd also be up for trying to find some place to host the collections and maybe even try to figure out some way that we could get computing resources to run the evaluations. No guarantees on that (I'm sure a Sun Lawyer's ears are burning somewhere right now, just for me having said that!), but I'm willing to tilt at that windmill.

TREC had a huge impact on the academic and commercial IR communities and I think an OSTREC (see, it's already got a cool acronym!) could benefit all of us (it would give us bragging rights if nothing else :-)

This is not directly related to Lucene development, so we might want to take further discussions off-list.

On May 20, 2008, at 1:01 PM, Steven A Rowe wrote:

> On 05/19/2008 at 3:58 PM, Grant Ingersoll wrote:
>> I think it is time the open source search community (and
>> I don=92t mean just Lucene) develop and publish a set of
>> TREC-style relevance judgments for freely available data
>> that is easily obtained from the Internet.
>
> Stephen Green, Minion developer at Sun, whose posts comparing Minion =20=

> and Lucene were recently mentioned on the solr-user mailing list[1], =20=

> has similar ideas.  =46rom =
<http://blogs.sun.com/searchguy/entry/minion_and_lucene_performance=20
> >:
>
>   I think it would be a good idea for all of the open
>   source engines to get together, find a nice open document
>   collection (the Apache mailing list archives and their
>   associated searches?) and build a nice set of regression
>   tests and some pooled relevance sets so that we can test
>   retrieval performance without having to rely on the TREC
>   data.
>


Steve
--
Stephen Green                      //   [EMAIL PROTECTED]
Principal Investigator             \\   http://blogs.sun.com/searchguy
Aura Project                       //   Voice: +1 781-442-0926
Sun Microsystems Labs              \\   Fax:   +1 781-442-1692




---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to