[
https://issues.apache.org/jira/browse/NUTCH-1843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-1843:
----------------------------------------
Attachment: NUTCH-1843v2.patch
New patch which includes
* License headers for all newly generated files, however we also retain the
Avro header which states no-one should mess around with the files
* adds updates for all gora resources within ivy/ivy.xml e.g. adds new solr
and mongodb artifacts
* adds gora-solr/mongodb-mapping.xml files
* adds conf/gora-solr-host-schema.xml which is a schema to be used when
building the Host solr core
* adds conf/gora-solr-webpage-schema.xml which is a schema to be used when
building the WebPage solr core
* adds the new properties to gora.properties
* sorts out a bug in both ParseChecker and WebTableReader which now presents
the metadata information properly when the tools are invoked.
ACTION: We need people to test out both the Solr and MongoDB set ups as I am
100% they are not correct. In particular the gora-solr-webpage-schema.xml field
types are not correct. I am putting this here with the aim that someone else
can take a look at it and fill in the gaps.
I will also have a look at it tomorrow when I have a bit more energy.
> Upgrade to Gora 0.5
> -------------------
>
> Key: NUTCH-1843
> URL: https://issues.apache.org/jira/browse/NUTCH-1843
> Project: Nutch
> Issue Type: Improvement
> Components: build, storage
> Reporter: Lewis John McGibbney
> Assignee: Talat UYARER
> Fix For: 2.3
>
> Attachments: NUTCH-1843.patch, NUTCH-1843v2.patch
>
>
> We just released Gora 0.5
> http://www.mail-archive.com/dev%40gora.apache.org/msg05236.html
> We should upgrade before releasing Nutch 2.3
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)