Hi,

----- Original Message ----
> From: Stack <[email protected]>

> > Regarding this:
> >> Going  forward I for one was going to try and mine our archives more at
> >>  least for dealing with the repeats.
> >
> > Do you mean manually, or did  you have something more sophisticated in mind?
> 
> No sophistication  other than my use of the snazzy hadoop-search.com tool.
> 
> Do you have  something in mind?  Could we be making better use of the
> sematext  summaries?

Hm... we already index HBase and other Digests on search-hadoop.com.
I was thinking more along the lines of mining the ML archives and doing 
automatic Q&A extraction.
I don't know how difficult it would be.  Maybe the input would be too noisy 
(people don't ask proper questions, answers are not full sentences, quote 
characters prefixing lines from old messages add a layer of complexity...), but 
that's what I thought you might have meant.

Otis
----
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/

Reply via email to