Aaron McCurry created BLUR-220:
----------------------------------
Summary: Support for humongous Rows
Key: BLUR-220
URL: https://issues.apache.org/jira/browse/BLUR-220
Project: Apache Blur
Issue Type: Improvement
Components: Blur
Affects Versions: 0.3.0
Reporter: Aaron McCurry
Fix For: 0.3.0
One of the limitations of Blur is size of Rows stored, specifically the number
of Records. The current updates are performed on Lucene is by deleting the
document and re-adding to the index. Unfortunately when any update is perform
on a Row in Blur, the entire Row has to be re-read (if the RowMutationType is
UPDATE_ROW) and then whatever modification needs are made then it is reindexed
in it's entirety.
Due to all of this overhead, there is a realistic limit on the size of a given
Row. It may vary based the kind of hardware that is being used, as the Row
grows in size the indexing (mutations) against that Row will slow.
This issue is being created to discuss techniques on how to deal with this
problem.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira