[ https://issues.apache.org/jira/browse/LUCENE-848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12491394 ]
Doron Cohen commented on LUCENE-848:
------------------------------------

I haven't tried this patch yet. I hesitated because I suspect it takes very long to download the huge start-up data (is this correct?).

Anyhow, I was wondering about the new jars: should we try to make the xerces and xml-apis jars "ext-jars", i.e. downloaded from somewhere (where?) only when attempting to use this package? Otherwise this adds ~2.5MB to the checkout/dev-pack. Do others consider this an issue at all?

> Add supported for Wikipedia English as a corpus in the benchmarker stuff
> ------------------------------------------------------------------------
>
>                 Key: LUCENE-848
>                 URL: https://issues.apache.org/jira/browse/LUCENE-848
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: contrib/benchmark
>            Reporter: Steven Parkes
>         Assigned To: Grant Ingersoll
>            Priority: Minor
>             Fix For: 2.2
>
>         Attachments: LUCENE-848.txt, LUCENE-848.txt, LUCENE-848.txt, LUCENE-848.txt, WikipediaHarvester.java, xerces.jar, xerces.jar, xml-apis.jar
>
>
> Add support for using Wikipedia for benchmarking.