[ 
https://issues.apache.org/jira/browse/ASTERIXDB-1639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15480237#comment-15480237
 ] 

Taewoo Kim edited comment on ASTERIXDB-1639 at 9/10/16 6:45 PM:
----------------------------------------------------------------

You assigned this to right guy even if he lacks some knowledge. :-) We can 
discuss more details next week. 


was (Author: wangsaeu):
You assigned to this to right guy even if he lacks some knowledge. :-) We can 
discuss more details next week. 

> Need a spatial test case with a high point dup factor
> -----------------------------------------------------
>
>                 Key: ASTERIXDB-1639
>                 URL: https://issues.apache.org/jira/browse/ASTERIXDB-1639
>             Project: Apache AsterixDB
>          Issue Type: Test
>          Components: AsterixDB, Hyracks, Storage
>            Reporter: Michael J. Carey
>            Assignee: Taewoo Kim
>              Labels: soon
>
> We need a few LSM R-tree test cases where we have many leaves worth of data 
> (which could be achieved by making an artificially small NC config?) that 
> have the same key - to make sure that we can handle that case properly.  (I'm 
> wondering after talking with Wail if that's the root of his problems a few 
> weeks ago - he had a high duplicate rate.)  E.g., we should trying to insert 
> a ton of data all at one of the same 2-3 unique spatial points.  It would be 
> good for there to be enough data that multi-level Hilbert sorting is required 
> as well.  This is likely to be a time-consuming test so it should be in our 
> period (not per-checkin) tests.  We should actually do this extreme-dup-case 
> test for all index types, but R trees are suspected of maybe doing this 
> wrong.  Who would be best to write/run this test w/o much effort?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to