[ 
https://issues.apache.org/jira/browse/BLUR-290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13831127#comment-13831127
 ] 

Aaron McCurry commented on BLUR-290:
------------------------------------

- I was just trying out a proof of concept. There are many nocommits and 
loose ends, so I haven't really looked at the perf aspect yet. But if you can 
evaluate it, that would be great. I have fixed some bugs and attached a few 
test cases, in case you want to try it.

I will try to give it a go this week.

- +1 for introducing table-types. A less adventurous table-type can even write 
unsorted segments to disk and take a hit during searches, while doing the 
balancing act during merge.

I think the table type will be a property that we read from the table 
properties on open.  That way, if someone else comes up with their own table 
type, they don't have to modify any more code than necessary.
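
As a rough sketch of the "read the type from the table properties on open" 
idea, something like the following could work.  Every name here is 
hypothetical (the TableIndex interface, the blur.table.index.type key, and 
both implementations); none of it is existing Blur API.

```java
import java.util.Properties;

// Hypothetical stand-in for whatever interface a table type would implement.
interface TableIndex {
    String describe();
}

// Hypothetical default table type.
final class DefaultTableIndex implements TableIndex {
    public String describe() { return "default"; }
}

// Hypothetical alternate table type that someone else might plug in.
final class QueueBackedTableIndex implements TableIndex {
    public String describe() { return "queue-backed"; }
}

final class TableIndexFactory {
    // Assumed property key; the real key name is not decided in this thread.
    static final String TYPE_KEY = "blur.table.index.type";

    // Reads the implementation class name from the table properties and
    // instantiates it, falling back to the default when the key is absent.
    static TableIndex open(Properties tableProps) {
        String className =
            tableProps.getProperty(TYPE_KEY, DefaultTableIndex.class.getName());
        try {
            Class<? extends TableIndex> type =
                Class.forName(className).asSubclass(TableIndex.class);
            return type.getDeclaredConstructor().newInstance();
        } catch (ReflectiveOperationException | ClassCastException e) {
            throw new IllegalArgumentException(
                "Cannot load table type: " + className, e);
        }
    }
}
```

With this shape, adding a new table type means shipping a class and setting 
one property on the table; the open path itself stays untouched.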

- It would be great if we can detect that NRT re-opens are going to suffer 
under load and hence adjust accordingly. Shouldn't it be a policy that is 
applicable to all table-types instead of it being a table-type itself?

Hmm, not sure.  I have started creating a simple blur index that will not show 
any uncommitted data.  Thus new data is going to be very slow to become 
visible.  I have also started working on an alternate version that allows the 
inbound data to be written to an HDFS-backed queue for indexing.  That way the 
indexer can take larger and larger chunks of data if it gets behind the inbound 
data.  So perhaps we could combine all of these?  Maybe not.  Not sure at this 
point.  I did notice that I will need to modify the BlurIndex API to allow for 
partial Row updates (Record appending or something).
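
To illustrate the "larger and larger chunks" part, here is a minimal sketch 
in which the batch size grows with the backlog.  It uses an in-memory 
java.util.Queue as a stand-in for the HDFS-backed queue, and the 
AdaptiveBatcher class, its bounds, and the "half the backlog" rule are all 
assumptions made for illustration, not the design being worked on.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Queue;

// Hypothetical helper: picks how many queued records the indexer takes next.
final class AdaptiveBatcher {
    private final int minBatch;
    private final int maxBatch;

    AdaptiveBatcher(int minBatch, int maxBatch) {
        if (minBatch < 1 || maxBatch < minBatch) {
            throw new IllegalArgumentException("need 1 <= minBatch <= maxBatch");
        }
        this.minBatch = minBatch;
        this.maxBatch = maxBatch;
    }

    // Assumed rule: take about half of the backlog, clamped to the bounds,
    // so the chunk size rises as the indexer falls further behind.
    int nextBatchSize(int backlog) {
        return Math.max(minBatch, Math.min(maxBatch, backlog / 2));
    }

    // Removes the next batch from the queue (fewer records if the queue
    // holds less than the chosen batch size).
    <T> List<T> drain(Queue<T> queue) {
        int n = Math.min(queue.size(), nextBatchSize(queue.size()));
        List<T> batch = new ArrayList<>(n);
        for (int i = 0; i < n; i++) {
            batch.add(queue.poll());
        }
        return batch;
    }
}
```

Each drained batch would then be indexed and committed as one unit, so a 
larger backlog means fewer, bigger commits.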

- PS: Sorry for the huge number of files. Is there a place to check them in, 
maybe an experimental branch?

At this point there's nowhere I can give you access.  We should probably have 
infrastructure mirror the Apache git repository to GitHub.  That way it would 
be easier for you to branch and post pull requests, etc.

Aaron

> NRT Updates using RAMDirectory & Swap
> -------------------------------------
>
>                 Key: BLUR-290
>                 URL: https://issues.apache.org/jira/browse/BLUR-290
>             Project: Apache Blur
>          Issue Type: New Feature
>    Affects Versions: experimental-dev
>            Reporter: Ravikumar
>         Attachments: BlurFieldsConsumer.java, BlurFieldsConsumer.java, 
> BlurFieldsConsumer.java, BlurFlushingIndexWriter.java, BlurIndexTracker.java, 
> BlurPostingsConsumer.java, BlurPostingsConsumer.java, 
> BlurPostingsFormat.java, BlurPostingsFormat.java, BlurRealTimeIndex.java, 
> BlurRealTimeIndex.java, BlurRealTimeIndexTest.java, 
> BlurRealTimeIndexWriter.java, BlurRealTimeManager.java, 
> BlurRealTimeManagerReopenThread.java, BlurRowCodec.java, BlurRowCodec.java, 
> BlurSegmentInfoFormat.java, BlurSegmentInfoWriter.java, 
> BlurTermsConsumer.java, BlurTermsConsumer.java, 
> CompressingRowIndexReader.java, CompressingRowIndexWriter.java, 
> CompressingRowReader.java, CompressingRowReader.java, 
> CompressingRowReader.java, CompressingRowWriter.java, 
> CompressingRowWriter.java, CompressingRowWriter.java, 
> GrowableByteArrayDataOutput.java, PrimeDocCache.java, 
> RealTimeTransactionRecorder.java, RealTimeTransactionRecorder.java, 
> RowCache.java, RowDocsCollector.java, RowDocsCollector.java, 
> RowReaderCache.java, RowReaderCache.java, SlabAllocator.java, 
> SlabRAMDirectory.java, SlabRAMFile.java, SlabRAMInputStream.java, 
> SlabRAMOutputStream.java, SortingMultiReader.java, SortingMultiReader.java, 
> TestCompressingRowWriter.java, TestCompressingRowWriter.java
>
>
> We have been discussing how to handle humongous rows in Blur (BLUR-220). 
> Explore the idea of using a RAMDirectory at the front, backed by a 
> persistent index.



--
This message was sent by Atlassian JIRA
(v6.1#6144)
