Re: [Neo] Web Analytics Use Case

Benjamin Dageroth Wed, 11 Nov 2009 06:05:48 -0800

Hi,

thanks for the fast answers and even drawing a domain model! What did you use 
for modeling? I have something similar on paper by now, perhaps I can 
contribute the stuff as well and I'll try to talk my CTO into some 
experimenting.

The road to aggregation is unfortunately blocked for us, as we want our 
customers to be able to drill down to really small groups and even single 
visits - for this we need the raw data. Once aggregated you cannot distinguish 
anymore and not answer all questions depending on the filters that are used on 
the data. E.g. What are the favourite paths of Visitors from Germany. Since 
this capability is one of the major sellingpoints of our solution we'd probably 
try to use both databases. The relational data for all kind of filtering and 
the graph database for path analysis.

Graph Databases are really quite new yet, yes? I am trying to find some 
literature on them, but Oreilly seem to have nothing yet on the topic.

Other than often traveled paths I am by now thinking of using the database for 
recommendation engines. Currently we use pretty much a brute force approach to 
find out which articles have often been purchased together, but with a graph 
database one should be able to implement algorithms which are way more 
efficient and using information stored in the edges should also make it 
possible to increase the quality of recommendations as more factors than just 
the edge can be taken into account. And of course realtime recommendations 
should be possible independent of nightly batch jobs.

Thanks again for the answers,

Benjamin

_______________________________________
Benjamin Dageroth, Business Development Manager
Webtrekk GmbH
Boxhagener Str. 76-78, 10245 Berlin
fon 030 - 755 415 - 360
fax 030 - 755 415 - 100
[email protected]
http://www.webtrekk.com
Amtsgericht Berlin, HRB 93435 B
Geschäftsführer Christian Sauer

_______________________________________

-----Ursprüngliche Nachricht-----
Von: [email protected] [mailto:[email protected]] Im 
Auftrag von Johan Svensson
Gesendet: Mittwoch, 11. November 2009 14:39
An: Neo user discussions
Betreff: Re: [Neo] Web Analytics Use Case

On Wed, Nov 11, 2009 at 11:02 AM, Benjamin Dageroth
<[email protected]> wrote:
> Hi,
> ...
>
> Then the best solution for this seems to me that I would run two databases: 
> First I ask our current current database to give me all Visitors, to whom 
> this criteria applies, and then ask neo4j to look only at the edges of these 
> visitors. Or would neo4J be powerful enough to deliever a similar performance 
> as traditional RDBMS Systems when confronted with data that is not really 
> resembling a graph? Is it usually easy to transform a traditional schema into 
> a graphDB Schema that performs just as well?
>

Hi,

Running two databases (graph db + RDMS) may be good idea in this case.
RDMS are really good at working with set operations on the full
dataset. On the other hand it is often possible to calculate/aggregate
such information on the fly when data is updated in a graph db.

Regarding transforming a traditional schema into a graph I would say
it is easy. I usually recommend not to look at the schema, instead
look at the domain and its data and transform that into a graph.
People are playing with automatic relational schema -> graph db
transformation tools. Have a look at Peters SQL importer here:
http://wiki.neo4j.org/content/SQL_Importer

Regards,
-Johan
_______________________________________________
Neo mailing list
[email protected]
https://lists.neo4j.org/mailman/listinfo/user
_______________________________________________
Neo mailing list
[email protected]
https://lists.neo4j.org/mailman/listinfo/user

Re: [Neo] Web Analytics Use Case

Reply via email to