Thanks! And you are right, it's roughly the same as Powerset. It's slower because: * The demo runs on a single machine (not on a cluster). * We need to query search engines through their API, which have a 1 second build-in delay per query. * We parse sentences once we retrieve them from the search engines and parsers are still rather slow. Powerset on the other hand, parses Wikipedia before indexing and indexes the semantic structures. So no parsing needs to be performed when a user asks a query (beside the parsing of that query, I suppose.) * The Lucene index of the complete English Wikipedia we built is 8.3 GB big. On our machine it takes 2 seconds per query to get a result.
You could address these issues with enough money and man power. But it's just a research project, developed by one person. We don't have the resources. (Please drop me an email if you have some ;-) Cheers, Michael -- View this message in context: http://www.nabble.com/Online-Question-Answering-demo-using-Lucene-tp17232494p17236078.html Sent from the Lucene - General mailing list archive at Nabble.com.