Re: [Neo4j] Problems with structure after multiple BatchInserter runs and supernodes

2014-10-28 Thread Joseph Guhlin
I was able to create the database using only one instance of the 
BatchInserter, and am still having the error:

START x=node:main(id = "Medtr2125s0010")
  MATCH (x)-[:EXPRESSED]-(y)
WITH x,y
  MATCH (y)-[:EXPRESSED]-(g)
RETURN x.id,y.id,g.id

Where x is never a g in the second part of the statement. Is there any way 
to regenerate the Relationship Store or Relationship Group Store?

I'll see if I can make this a test case and report it as a bug, unless 
anyone has any other ideas.

--Joseph

On Tuesday, October 21, 2014 1:35:05 PM UTC-5, Joseph Guhlin wrote:
>
> I did have that as an issue before, and it has been fixed (on the Neo4j 
> side), and the index query in the query above gives the proper result. 
>
> It is the relationship that seems to exist only when coming from node (x) 
> in the above example, not when coming from (y) (despite not listing it as a 
> directional relationship). Plus the missing several thousand relationships. 
> I'm not using a relationship index as I haven't had need to. 
>
> Sorry my first message wasn't more clear.
>
> Best,
> --Joseph
>
>
> On Tuesday, October 21, 2014 1:28:29 PM UTC-5, Michael Hunger wrote:
>>
>> Joseph, could it be that you write to the index only in one of the runs?
>> I thought there was an issue once where, when you didn't write to an 
>> index in one run it removed the index-definition? 
>>
>> Could you check that?
>>
>> Michael
>>
>> On Tue, Oct 21, 2014 at 10:25 AM, Joseph Guhlin  
>> wrote:
>>
>>> JDK: 1.8
>>> Neo4j: 2.1.5 -  Embedded and then stand-alone to test the data
>>>
>>> I'm using a program to insert a large amount of data to Neo4j. Because 
>>> of memory limitations and sped limitations I usually have to do this in a 
>>> few batches using BatchInserter (separate commands, long after the database 
>>> has shut down -- not multiple threads). I'm getting things like this:
>>>
>>> START x=node:main(id = "Medtr2125s0010")
>>>   MATCH (x)-[:EXPRESSED]-(y)
>>> WITH x,y
>>>   MATCH (y)-[:EXPRESSED]-(g)
>>> RETURN x.id,y.id,g.id
>>>
>>> Results here (copy and paste from the Web Console wasn't pretty so this 
>>> is by hand):
>>> x.id: Medtr2125s0010
>>> y.id: Nodule
>>> g.id: PAC:26323170
>>> Returned 1 row in 110 ms
>>>
>>> Which doesn't make sense to me, not only should there be over 20,000 
>>> entries, even if it is finding just this one it should find the same 
>>> x.id where g.id is.
>>> I've had no trouble doing multiple BatchInserter calls before, but have 
>>> had lots of trouble with the 2.1.x line, I believe it is related to the 
>>> RelationshipGroupStore, which was causing a massive slowdown (see my 
>>> StackOverflow question here: 
>>> http://stackoverflow.com/questions/26451609/relationshipgroupstore-mapped-memory-setting-for-batchinserter
>>>  
>>> ).
>>>
>>> I plan on trying only 1 BatchInserter incarnation tonight and running it 
>>> to see if it finishes properly and will report back. But this seems like a 
>>> bug.
>>>
>>> Any advice on speeding up when RelationshipGroupStore slows down during 
>>> the insert would also be greatly appreciated.
>>>
>>> Thanks,
>>> --Joseph
>>>
>>> -- 
>>> You received this message because you are subscribed to the Google 
>>> Groups "Neo4j" group.
>>> To unsubscribe from this group and stop receiving emails from it, send 
>>> an email to neo4j+un...@googlegroups.com.
>>> For more options, visit https://groups.google.com/d/optout.
>>>
>>
>>

-- 
You received this message because you are subscribed to the Google Groups 
"Neo4j" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to neo4j+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [Neo4j] Problems with structure after multiple BatchInserter runs and supernodes

2014-10-21 Thread Joseph Guhlin
I did have that as an issue before, and it has been fixed (on the Neo4j 
side), and the index query in the query above gives the proper result. 

It is the relationship that seems to exist only when coming from node (x) 
in the above example, not when coming from (y) (despite not listing it as a 
directional relationship). Plus the missing several thousand relationships. 
I'm not using a relationship index as I haven't had need to. 

Sorry my first message wasn't more clear.

Best,
--Joseph


On Tuesday, October 21, 2014 1:28:29 PM UTC-5, Michael Hunger wrote:
>
> Joseph, could it be that you write to the index only in one of the runs?
> I thought there was an issue once where, when you didn't write to an index 
> in one run it removed the index-definition? 
>
> Could you check that?
>
> Michael
>
> On Tue, Oct 21, 2014 at 10:25 AM, Joseph Guhlin  > wrote:
>
>> JDK: 1.8
>> Neo4j: 2.1.5 -  Embedded and then stand-alone to test the data
>>
>> I'm using a program to insert a large amount of data to Neo4j. Because of 
>> memory limitations and sped limitations I usually have to do this in a few 
>> batches using BatchInserter (separate commands, long after the database has 
>> shut down -- not multiple threads). I'm getting things like this:
>>
>> START x=node:main(id = "Medtr2125s0010")
>>   MATCH (x)-[:EXPRESSED]-(y)
>> WITH x,y
>>   MATCH (y)-[:EXPRESSED]-(g)
>> RETURN x.id,y.id,g.id
>>
>> Results here (copy and paste from the Web Console wasn't pretty so this 
>> is by hand):
>> x.id: Medtr2125s0010
>> y.id: Nodule
>> g.id: PAC:26323170
>> Returned 1 row in 110 ms
>>
>> Which doesn't make sense to me, not only should there be over 20,000 
>> entries, even if it is finding just this one it should find the same x.id 
>> where g.id is.
>> I've had no trouble doing multiple BatchInserter calls before, but have 
>> had lots of trouble with the 2.1.x line, I believe it is related to the 
>> RelationshipGroupStore, which was causing a massive slowdown (see my 
>> StackOverflow question here: 
>> http://stackoverflow.com/questions/26451609/relationshipgroupstore-mapped-memory-setting-for-batchinserter
>>  
>> ).
>>
>> I plan on trying only 1 BatchInserter incarnation tonight and running it 
>> to see if it finishes properly and will report back. But this seems like a 
>> bug.
>>
>> Any advice on speeding up when RelationshipGroupStore slows down during 
>> the insert would also be greatly appreciated.
>>
>> Thanks,
>> --Joseph
>>
>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "Neo4j" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to neo4j+un...@googlegroups.com .
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>

-- 
You received this message because you are subscribed to the Google Groups 
"Neo4j" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to neo4j+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [Neo4j] Problems with structure after multiple BatchInserter runs and supernodes

2014-10-21 Thread Michael Hunger
Joseph, could it be that you write to the index only in one of the runs?
I thought there was an issue once where, when you didn't write to an index
in one run it removed the index-definition?

Could you check that?

Michael

On Tue, Oct 21, 2014 at 10:25 AM, Joseph Guhlin 
wrote:

> JDK: 1.8
> Neo4j: 2.1.5 -  Embedded and then stand-alone to test the data
>
> I'm using a program to insert a large amount of data to Neo4j. Because of
> memory limitations and sped limitations I usually have to do this in a few
> batches using BatchInserter (separate commands, long after the database has
> shut down -- not multiple threads). I'm getting things like this:
>
> START x=node:main(id = "Medtr2125s0010")
>   MATCH (x)-[:EXPRESSED]-(y)
> WITH x,y
>   MATCH (y)-[:EXPRESSED]-(g)
> RETURN x.id,y.id,g.id
>
> Results here (copy and paste from the Web Console wasn't pretty so this is
> by hand):
> x.id: Medtr2125s0010
> y.id: Nodule
> g.id: PAC:26323170
> Returned 1 row in 110 ms
>
> Which doesn't make sense to me, not only should there be over 20,000
> entries, even if it is finding just this one it should find the same x.id
> where g.id is.
> I've had no trouble doing multiple BatchInserter calls before, but have
> had lots of trouble with the 2.1.x line, I believe it is related to the
> RelationshipGroupStore, which was causing a massive slowdown (see my
> StackOverflow question here:
> http://stackoverflow.com/questions/26451609/relationshipgroupstore-mapped-memory-setting-for-batchinserter
> ).
>
> I plan on trying only 1 BatchInserter incarnation tonight and running it
> to see if it finishes properly and will report back. But this seems like a
> bug.
>
> Any advice on speeding up when RelationshipGroupStore slows down during
> the insert would also be greatly appreciated.
>
> Thanks,
> --Joseph
>
> --
> You received this message because you are subscribed to the Google Groups
> "Neo4j" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to neo4j+unsubscr...@googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"Neo4j" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to neo4j+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.