Re: [Neo4j] shortestPath slower than it could be

Mattias Persson Sat, 23 Jul 2011 07:31:16 -0700

Hi John,

the algorithm is written to dodge these kinds of pitfalls. Maybe there's
some issue with the implementation, but in principal it should make no
difference. I'll look at it when I get the time (I wrote that
implementation).


2011/7/23 John cyuczieekc <[email protected]>

> Hey guys, me bugging you again :)
>
> (This whole thing is kind of based on the lack of being able to get the
> number of relationships a node has)
>
>  If I have two nodes, and the first one has 1 million outgoing
> relationships
> of the type X to 1 million unique/different nodes,
> and the second node has 10 incoming relationships of type X (same type) of
> which one is from the first node,
> then using GraphAlgoFactory.shortestPath  (or suggest a better way?)
>  How can I tell neo4j to iterate the search on the second node's incoming
> rels simply because it has 10 relationships instead of 1 million, in order
> to check if each relationship is in the form of firstNode-->secondNode ?
>
> For the case when first node has 100,000 relationships and second node has
> 10,
> it takes *1.7 seconds* for shortestPath to find the only one link between
> them using:
>
> final PathFinder<Path> finder = GraphAlgoFactory.shortestPath(
> Traversal.expanderForTypes( rel, Direction.OUTGOING  ), 1 );
> final Path foundPath = finder.findSinglePath( *one, two* );
>
> I can put Direction.*BOTH *and get the same amount of time
> *Path from one to two: (one)-->(two) timedelta=1,862,726,634 ns*
>
> *BUT*, get this: if I swap the nodes:
> finder.findSinglePath(* two, one*);
> and i use either Direction.INCOMING or Direction.*BOTH  *(which makes sense
> for the second node ,right) then I get *20ms* the time until it finishes...
> *Path from one to two: (two)<--(one) timedelta=20,830,111 ns*
>
> (both cases are without data being priorly cached)
>
> I was expecting it to act like this: (but only when using Direction.BOTH)
>  see which node has the least number of relationships and iterate on those,
> but this would work if findSinglePath would be made for depth 1 (aka
> particular case), but as I read "Tries to find a single path between
> startand
> end nodes." then it makes sense to me why it works like it does... that is,
> iterate on relationships from start node, rather than from end node... but
> I'm not sure if it would *not *make sense to iterate on the end node
> instead
> of start node, when knowing that end node has less relationships, for make
> the search faster (well at least if depth is one) - I didn't look into how
> neo4j actually does stuff yet :D
>
> anyway, it's fairly clear to me that I could make a simple wrapper method
> to
> make this kind of search faster, *IF* I had the ability to know how many
> relationships each node has, so I can call findSinglePath  with the first
> param being the node with the least relationship count :) But as I
> understood it, it's not possible to find how many rels a node has... gimme
> feat! :)) [by not possible I mean, without having to iterate thru all and
> count them, which would make the use case here obsolete]
>
> PS: clearly all the text I wrote here would benefit from being represented
> by a graph, just think about all those grouping with autohiding the ie.
> "[]"
> and all kinds of stuff... heh
> _______________________________________________
> Neo4j mailing list
> [email protected]
> https://lists.neo4j.org/mailman/listinfo/user
>



-- 
Mattias Persson, [[email protected]]
Hacker, Neo Technology
www.neotechnology.com
_______________________________________________
Neo4j mailing list
[email protected]
https://lists.neo4j.org/mailman/listinfo/user

Re: [Neo4j] shortestPath slower than it could be

Reply via email to