Hi,
----- Original Message ---- > From: Stack <[email protected]> > > Regarding this: > >> Going forward I for one was going to try and mine our archives more at > >> least for dealing with the repeats. > > > > Do you mean manually, or did you have something more sophisticated in mind? > > No sophistication other than my use of the snazzy hadoop-search.com tool. > > Do you have something in mind? Could we be making better use of the > sematext summaries? Hm... we already index HBase and other Digests on search-hadoop.com. I was thinking more along the lines of mining the ML archives and doing automatic Q&A extraction. I don't know how difficult it would be. Maybe the input would be too noisy (people don't ask proper questions, answers are not full sentences, quote characters prefixing lines from old messages add a layer of complexity...), but that's what I thought you might have meant. Otis ---- Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/
