Hi Benjamin, interesting problems to say the least. Depending on the data structure and which queries are prioritized it can be modelled differently, each solution being particularly good at specific features/queries.
We've created a wiki page for this model as well at http://wiki.neo4j.org/content/Site_Usage_Analytics / Mattias 2009/11/11 Benjamin Dageroth <[email protected]>: > Hi, > > I am currently thinking about the possible usage of neo4j for a web analytics > solution. Currently we are using an RDBMS (Exasol) which performs quite well, > but there are a few cases, which do not work well: We would like to compute > commonly travelled paths of users going through a website. With our current > solution this is pretty much impossible for any complex website. > > Now I was thinking, whether this is indeed a good use case for neo4j because > I would have to save a path in the correct order, the order in which a > visitor visited the pages (nodes). > E.g. a website which has 5 Pages (A,B,C,D,E). Three Visitors(X,Y,Z) tavel > through the Page. > > X: A, D, E, A > Y: A, B,D,E,A > Z: A, B, D, C,D,C,E > > Now I would like to know how often a path A from E was travelled, with less > than 3 Edges in between. > Or I would like to know the most frequently travelled paths. > > Therefore my question on neo4j is basically, can I easily make these > computations? It seems to me that there would need be to have multiple Edges > between two nodes, basically each with an ID of the Visitor, is that easily > possible? So far I have always only seen one edge between two nodes > describing the relationship between the nodes and this seems to be a somewhat > different use case. > > When it comes to questions like: > What are the most frequent paths of Visitors who bought something (or those > who bought nothing)? > > Then the best solution for this seems to me that I would run two databases: > First I ask our current current database to give me all Visitors, to whom > this criteria applies, and then ask neo4j to look only at the edges of these > visitors. Or would neo4J be powerful enough to deliever a similar performance > as traditional RDBMS Systems when confronted with data that is not really > resembling a graph? Is it usually easy to transform a traditional schema into > a graphDB Schema that performs just as well? > > Thanks for your answers, > > Benjamin > > _______________________________________ > Benjamin Dageroth, Business Development Manager > Webtrekk GmbH > Boxhagener Str. 76-78, 10245 Berlin > fon 030 - 755 415 - 360 > fax 030 - 755 415 - 100 > [email protected] > http://www.webtrekk.com<http://www.webtrekk.de/> > Amtsgericht Berlin, HRB 93435 B > Geschäftsführer Christian Sauer > > > _______________________________________ > > > _______________________________________________ > Neo mailing list > [email protected] > https://lists.neo4j.org/mailman/listinfo/user > -- Mattias Persson, [[email protected]] Neo Technology, www.neotechnology.com _______________________________________________ Neo mailing list [email protected] https://lists.neo4j.org/mailman/listinfo/user

