Re: [Neo4j] Mac Pro Question ...

2011-04-14 Thread Tim McNamara
Rense-

You should explain what you have tried. Which commands have you used? What
error messages have you received?

Good luck

Tim McNamara  |  @timClicks   |
timmcnamara.co.nz


On 15 April 2011 16:19, Rense Lange  wrote:

>
>
>
> Is there anyone who is running neo4j on their Mac? How did you do it? I
> have a Mac pro, and I can't seem to get it going. Right now, the
> installation methods provided on the web don't work. I hope there is a way
> around this. Help!
>
> Rense Lange
>
> ___
> Neo4j mailing list
> User@lists.neo4j.org
> https://lists.neo4j.org/mailman/listinfo/user
>
___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


[Neo4j] Mac Pro Question ...

2011-04-14 Thread Rense Lange



Is there anyone who is running neo4j on their Mac? How did you do it? I have a 
Mac pro, and I can't seem to get it going. Right now, the installation methods 
provided on the web don't work. I hope there is a way around this. Help!

Rense Lange

___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


Re: [Neo4j] 20 billion primitives machine

2011-04-14 Thread Jamie Stephens
Tobias,

Thanks very much for the response.  I'll follow up with you privately,
and then I can summarize here.

--Jamie

On Thu, Apr 14, 2011 at 6:24 PM, Tobias Ivarsson
 wrote:
> Hi Jamie,
>
> Very interesting use case you have there.
>
> If you could just provide a few more bits of information about your data,
> I'd be able to provide a better analysis.
>
> * Is the distribution of relationships uniform across the nodes. If not, how
> much does it vary?
>
> * What kind of operation you want to do in the 50ms latency that you've
> specified. You just mentioned "relatively shallow traversals", approximately
> how deep is that? The key figure to get at is how many relationships you
> need to traverse, and that is a function of the depth of the traversal and
> the distribution of the relationships.
>
> * Is the 50ms latency for each single request? What is the estimated number
> of concurrent requests?
>
> * What kind of properties are those 32 byte values? ASCII strings of length
> 32?
>
> * The bulk loading phase you describe, is it a one time import of initial
> data, or a regularly recurring thing?
>
> * In the bulk load phase, what form does the data have? Would it be possible
> to have that data in a format where each node is uniquely identified by a
> number (in both the node data listing, and the edge-list).
>
>
> Finally, do you have test data for these sizes? If possible I'd love to work
> with you on this, to get an good use case to work on for improving the large
> data story in Neo4j.
>
> Cheers,
> Tobias
>
> On Thu, Apr 14, 2011 at 3:12 PM, Jamie Stephens  wrote:
>
>> Folks,
>>
>> I've got an application that has (will have) about 2 billion vertexes
>> and maybe 8 billion edges (?).  Maybe an avg of 4 properties per
>> vertex -- with maybe an avg of 32 bytes/value.  So I guess that's 16
>> billion primitives.  Let's round to 20 billion.  My edges estimate is
>> a relatively uninformed guess.  Just starting to dig into the data.
>>
>> Traversals will be relatively shallow.  Concurrent access.  Throughput
>> is more important than latency.  But latency should be better than
>> maybe 50ms 99% of the time (allowing for some cache warming and some
>> GC).  I don't know much yet about locality.  I'm not sure yet how
>> sensitive the app will be to long GCs.
>>
>> We will need to do a big batch load, and writes will need to be fast
>> in that phase.  After that, we'll see more reads that writes.  So I
>> imagine a config for the batch load and another config for production.
>>
>> I understand cache sharding, application-level partitioning, and so
>> forth.  I'm wondering what I can do on a single machine -- and what
>> that machine should look like.
>>
>> http://docs.neo4j.org/chunked/stable/configuration-jvm.html and
>> http://wiki.neo4j.org/content/Neo4j_Performance_Guide are encouraging.
>>  And having knobs as documented at
>> http://wiki.neo4j.org/content/Configuration_Settings is great.  Nice
>> work!
>>
>> I'm hoping I might be able to get away with 128GB RAM on 12 cores with
>> data striped over a handful of disks (SSDs if required).  We'll
>> probably also need a cluster for both traffic and availability, but
>> that's another topic.
>>
>> Does anybody have experience with a data set like this on a similar
>> machine?  How much RAM and how much disk -- and what kinds and in what
>> configuration?  Latency, throughput, general experience?  Any
>> production deployments?
>>
>> I'd appreciate any guidance or feedback.  I'm happy to summarize later
>> if that'd be helpful.
>>
>> BTW, my testbed uses Clojure with clojure.contrib.server-socket and
>> https://github.com/wagjo/borneo. Very convenient!
>>
>> --Jamie
>> ___
>> Neo4j mailing list
>> User@lists.neo4j.org
>> https://lists.neo4j.org/mailman/listinfo/user
>>
>
>
>
> --
> Tobias Ivarsson 
> Hacker, Neo Technology
> www.neotechnology.com
> Cellphone: +46 706 534857
> ___
> Neo4j mailing list
> User@lists.neo4j.org
> https://lists.neo4j.org/mailman/listinfo/user
>
___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


[Neo4j] Question about REST interface concurrency

2011-04-14 Thread Stephen Roos
Hello Neo Team!

Congrats on the recent release!  I'm using 1.3 enterprise in my development 
environment.  I noticed that in earlier versions there were some patches to 
allow setting the min/max thread pool size for the REST servlet container.  Are 
there any similar options now?  Under load tests, it seems like I would benefit 
from at least having a higher initial thread pool size.  Are there any other 
configuration changes or strategies that would help with overall throughput 
under heavy load?

Thanks for your help!

Stephen Roos
Software Engineer

CareerArc Group
The Social Exceleration Network
www.careerarcgroup.com

This email and any files transmitted with it are confidential and intended 
solely for the use of the individual or entity to whom they are addressed.  If 
you have received this email in error, please notify the sender and delete this 
email from your system.


___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


Re: [Neo4j] REST indexes questions

2011-04-14 Thread Javier de la Rosa
Thank you very much for the aclarations, Michael.

Regards!

On Thu, Apr 14, 2011 at 18:55, Michael Hunger
 wrote:
> 1. Removing an index is not exposed, possible to add with a server 
> plugin/extension.
> 2. One can easily store the index-names as node properties with the indexing 
> operation - that's also something that could be automated via an server 
> plugin.
> 3. Same goes for auto removal, btw. it is also necessary for relationships, 
> so perhaps an operation removing a node and its relationships (and all of 
> those from their indexes) would be sensible.
> 4. Query exact is looking for the exact match, Advanced Query takes also 
> lucene query strings and partial matches like (Matr*), for advanced-query 
> fulltext matches the values must have been indexed on a fulltext index.
>
> It should be a matter of a few hours of coding to provide a server plugin 
> that does the above.
>
> Cheers
>
> Michael
>
> Am 13.04.2011 um 21:46 schrieb Javier de la Rosa:
>
>> Hi all, at first congratulations for the new GA release of Neo4j.
>> Really great job.
>>
>> I'm now developing the support for indexes and I have changed a bit
>> the compatibility with neo4j.py due to currently it doesn't work with
>> nodes indexes neither relationships indexes.
>> But I have some related questions:
>> 1. How can I remove an index?
>> 2. Would it be possible, when I get the info of a node, also provide
>> the indexes in which the node has been indexed?
>> 3. What it happens if I remove a node that it's already indexed? Is it
>> removed from the index too?
>> 4. What's the real difference between the query exact method and the
>> advanced query? I mean, using only the advanced query, are you able to
>> query in exact mode?
>>
>> --
>> Javier de la Rosa
>> http://versae.es
>> ___
>> Neo4j mailing list
>> User@lists.neo4j.org
>> https://lists.neo4j.org/mailman/listinfo/user
>
> ___
> Neo4j mailing list
> User@lists.neo4j.org
> https://lists.neo4j.org/mailman/listinfo/user
>



-- 
Javier de la Rosa
http://versae.es
___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


Re: [Neo4j] 20 billion primitives machine

2011-04-14 Thread Tobias Ivarsson
Hi Jamie,

Very interesting use case you have there.

If you could just provide a few more bits of information about your data,
I'd be able to provide a better analysis.

* Is the distribution of relationships uniform across the nodes. If not, how
much does it vary?

* What kind of operation you want to do in the 50ms latency that you've
specified. You just mentioned "relatively shallow traversals", approximately
how deep is that? The key figure to get at is how many relationships you
need to traverse, and that is a function of the depth of the traversal and
the distribution of the relationships.

* Is the 50ms latency for each single request? What is the estimated number
of concurrent requests?

* What kind of properties are those 32 byte values? ASCII strings of length
32?

* The bulk loading phase you describe, is it a one time import of initial
data, or a regularly recurring thing?

* In the bulk load phase, what form does the data have? Would it be possible
to have that data in a format where each node is uniquely identified by a
number (in both the node data listing, and the edge-list).


Finally, do you have test data for these sizes? If possible I'd love to work
with you on this, to get an good use case to work on for improving the large
data story in Neo4j.

Cheers,
Tobias

On Thu, Apr 14, 2011 at 3:12 PM, Jamie Stephens  wrote:

> Folks,
>
> I've got an application that has (will have) about 2 billion vertexes
> and maybe 8 billion edges (?).  Maybe an avg of 4 properties per
> vertex -- with maybe an avg of 32 bytes/value.  So I guess that's 16
> billion primitives.  Let's round to 20 billion.  My edges estimate is
> a relatively uninformed guess.  Just starting to dig into the data.
>
> Traversals will be relatively shallow.  Concurrent access.  Throughput
> is more important than latency.  But latency should be better than
> maybe 50ms 99% of the time (allowing for some cache warming and some
> GC).  I don't know much yet about locality.  I'm not sure yet how
> sensitive the app will be to long GCs.
>
> We will need to do a big batch load, and writes will need to be fast
> in that phase.  After that, we'll see more reads that writes.  So I
> imagine a config for the batch load and another config for production.
>
> I understand cache sharding, application-level partitioning, and so
> forth.  I'm wondering what I can do on a single machine -- and what
> that machine should look like.
>
> http://docs.neo4j.org/chunked/stable/configuration-jvm.html and
> http://wiki.neo4j.org/content/Neo4j_Performance_Guide are encouraging.
>  And having knobs as documented at
> http://wiki.neo4j.org/content/Configuration_Settings is great.  Nice
> work!
>
> I'm hoping I might be able to get away with 128GB RAM on 12 cores with
> data striped over a handful of disks (SSDs if required).  We'll
> probably also need a cluster for both traffic and availability, but
> that's another topic.
>
> Does anybody have experience with a data set like this on a similar
> machine?  How much RAM and how much disk -- and what kinds and in what
> configuration?  Latency, throughput, general experience?  Any
> production deployments?
>
> I'd appreciate any guidance or feedback.  I'm happy to summarize later
> if that'd be helpful.
>
> BTW, my testbed uses Clojure with clojure.contrib.server-socket and
> https://github.com/wagjo/borneo. Very convenient!
>
> --Jamie
> ___
> Neo4j mailing list
> User@lists.neo4j.org
> https://lists.neo4j.org/mailman/listinfo/user
>



-- 
Tobias Ivarsson 
Hacker, Neo Technology
www.neotechnology.com
Cellphone: +46 706 534857
___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


Re: [Neo4j] Groovy and Unable to lock store..this is usually a result of..

2011-04-14 Thread Michael Hunger
Hi,

Perhaps you're not shutting down the db after usage and have some other process 
(or visualization like neoclipse in non-ro mode) running at the same datastore?

But as this seems to be blueprints related you should perhaps cross-post this 
question on the tinkerpop/gremlin mailing list.

Cheers

Michael

Am 15.04.2011 um 00:57 schrieb mindthief:

> Hi All,
> Just trying to get started with Neo4j, and I am seeing the issues reported
> above. Initially I was getting the "IllegalStateException: Unable to lock
> store" message, and now I'm getting the other message
> "TransactionFailureException: Could not create data source". The code
> correctly created the db the first time I ran it -- these errors are
> happening on subsequent runs. There is only one application, so no
> possibility of another being open and accessing the db. I am using Groovy
> and Blueprints.
> 
> Any advice would be appreciated, thanks!
> -M
> 
> 
> --
> View this message in context: 
> http://neo4j-user-list.438527.n3.nabble.com/Neo4j-Groovy-and-Unable-to-lock-store-this-is-usually-a-result-of-tp1567247p2822672.html
> Sent from the Neo4J User List mailing list archive at Nabble.com.
> ___
> Neo4j mailing list
> User@lists.neo4j.org
> https://lists.neo4j.org/mailman/listinfo/user

___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


Re: [Neo4j] Groovy and Unable to lock store..this is usually a result of..

2011-04-14 Thread mindthief
Hi All,
Just trying to get started with Neo4j, and I am seeing the issues reported
above. Initially I was getting the "IllegalStateException: Unable to lock
store" message, and now I'm getting the other message
"TransactionFailureException: Could not create data source". The code
correctly created the db the first time I ran it -- these errors are
happening on subsequent runs. There is only one application, so no
possibility of another being open and accessing the db. I am using Groovy
and Blueprints.

Any advice would be appreciated, thanks!
-M


--
View this message in context: 
http://neo4j-user-list.438527.n3.nabble.com/Neo4j-Groovy-and-Unable-to-lock-store-this-is-usually-a-result-of-tp1567247p2822672.html
Sent from the Neo4J User List mailing list archive at Nabble.com.
___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


Re: [Neo4j] REST indexes questions

2011-04-14 Thread Michael Hunger
1. Removing an index is not exposed, possible to add with a server 
plugin/extension.
2. One can easily store the index-names as node properties with the indexing 
operation - that's also something that could be automated via an server plugin.
3. Same goes for auto removal, btw. it is also necessary for relationships, so 
perhaps an operation removing a node and its relationships (and all of those 
from their indexes) would be sensible.
4. Query exact is looking for the exact match, Advanced Query takes also lucene 
query strings and partial matches like (Matr*), for advanced-query fulltext 
matches the values must have been indexed on a fulltext index.

It should be a matter of a few hours of coding to provide a server plugin that 
does the above. 

Cheers

Michael

Am 13.04.2011 um 21:46 schrieb Javier de la Rosa:

> Hi all, at first congratulations for the new GA release of Neo4j.
> Really great job.
> 
> I'm now developing the support for indexes and I have changed a bit
> the compatibility with neo4j.py due to currently it doesn't work with
> nodes indexes neither relationships indexes.
> But I have some related questions:
> 1. How can I remove an index?
> 2. Would it be possible, when I get the info of a node, also provide
> the indexes in which the node has been indexed?
> 3. What it happens if I remove a node that it's already indexed? Is it
> removed from the index too?
> 4. What's the real difference between the query exact method and the
> advanced query? I mean, using only the advanced query, are you able to
> query in exact mode?
> 
> -- 
> Javier de la Rosa
> http://versae.es
> ___
> Neo4j mailing list
> User@lists.neo4j.org
> https://lists.neo4j.org/mailman/listinfo/user

___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


Re: [Neo4j] How to connect Neoclipse remotely

2011-04-14 Thread Michael Hunger
At least for 1.3. webadmin has similar visualization appeal as neoclipse.

Could you try to share the directory of the graph database with a network file 
system like samba or nfs?
I don't know how fast/slow that is but it's worth a try.

I also have another idea but I have to test that first.

Cheers

Michael

Am 15.04.2011 um 00:18 schrieb Bhargav Gunda:

> As far as I know Neoclipse is a stand alone application. For connecting the 
> database file in neoclipse,
> Goto-> Preferences( left top corner) -> Neo4j-> browse the loaction for neo4j 
> database file. For more details follow the link.
> And here is the wiki link which guides you to neoclipse.
> http://wiki.neo4j.org/content/Neoclipse
> 
> Regards, 
> G.
> Sent from my iPhone
> 
> On Apr 14, 2011, at 11:08 PM, sulabh choudhury  wrote:
> 
>> Hi,
>> 
>> I just installed Neoclipse. I am trying to connect it to a graph remotely.
>> I see an option to enter the "Database Resource Uri" but that box is not
>> enabled and hence I do not know how to connect the database.
>> I have both Neo4j-1.2 and Neoclipse-1.2. Also do I need to have Neo4j
>> locally to use Neoclipse ?
>> ___
>> Neo4j mailing list
>> User@lists.neo4j.org
>> https://lists.neo4j.org/mailman/listinfo/user
> ___
> Neo4j mailing list
> User@lists.neo4j.org
> https://lists.neo4j.org/mailman/listinfo/user

___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


Re: [Neo4j] REST indexes questions

2011-04-14 Thread Javier de la Rosa
Thank you, Peter. It's gonna be exciting the future version 1.4 :-)

On Thu, Apr 14, 2011 at 16:33, Peter Neubauer
 wrote:
> Javier,
> great work with the Python bindings! I can't answer for all of the
> questions, but trying inline. I think the core guys might have some
> more insights here.
>
> On Wed, Apr 13, 2011 at 9:46 PM, Javier de la Rosa  wrote:
>> Hi all, at first congratulations for the new GA release of Neo4j.
>> Really great job.
>>
>> I'm now developing the support for indexes and I have changed a bit
>> the compatibility with neo4j.py due to currently it doesn't work with
>> nodes indexes neither relationships indexes.
>> But I have some related questions:
>> 1. How can I remove an index?
> Removing an index is a good catch, I don't think there is a method for
> completely removing an Index right now. We should add that for 1.4
> since this is really a bug in the API unless I am missing something
> here.
>
>> 2. Would it be possible, when I get the info of a node, also provide
>> the indexes in which the node has been indexed?
> Not really, as Indicies are independent XA resources from the kernel.
> You can loop through the existing indexes and check if the node is in
> there, but I imagine that is a costly operation. Mattias, is there any
> easy way to do this? If yes, we could add it as an extension or
> directly into the Node-representation I think.
>
>> 3. What it happens if I remove a node that it's already indexed? Is it
>> removed from the index too?
> Same here. If you don't include a node-removal from the index in the
> same transaction, the index is going to be stale. All of the above
> behaviors are target for an "auto-index" feature that would help you
> with this - also for 1.4 planned.
>
>> 4. What's the real difference between the query exact method and the
>> advanced query? I mean, using only the advanced query, are you able to
>> query in exact mode?
> It is referring to the exposure of the get vs. query interface of the
> Index API, see 
> http://components.neo4j.org/neo4j/1.3/apidocs/org/neo4j/graphdb/index/Index.html
> .
>
>
> Thanks so much for the feedback! This is very valuable input for the
> next cycles. Keep it coming Javier!
>
> /peter
>



-- 
Javier de la Rosa
http://versae.es
___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


Re: [Neo4j] How to connect Neoclipse remotely

2011-04-14 Thread Bhargav Gunda
As far as I know Neoclipse is a stand alone application. For connecting the 
database file in neoclipse,
Goto-> Preferences( left top corner) -> Neo4j-> browse the loaction for neo4j 
database file. For more details follow the link.
And here is the wiki link which guides you to neoclipse.
http://wiki.neo4j.org/content/Neoclipse

Regards, 
G.
Sent from my iPhone

On Apr 14, 2011, at 11:08 PM, sulabh choudhury  wrote:

> Hi,
> 
> I just installed Neoclipse. I am trying to connect it to a graph remotely.
> I see an option to enter the "Database Resource Uri" but that box is not
> enabled and hence I do not know how to connect the database.
> I have both Neo4j-1.2 and Neoclipse-1.2. Also do I need to have Neo4j
> locally to use Neoclipse ?
> ___
> Neo4j mailing list
> User@lists.neo4j.org
> https://lists.neo4j.org/mailman/listinfo/user
___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


[Neo4j] How to connect Neoclipse remotely

2011-04-14 Thread sulabh choudhury
Hi,

I just installed Neoclipse. I am trying to connect it to a graph remotely.
I see an option to enter the "Database Resource Uri" but that box is not
enabled and hence I do not know how to connect the database.
I have both Neo4j-1.2 and Neoclipse-1.2. Also do I need to have Neo4j
locally to use Neoclipse ?
___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


Re: [Neo4j] Washington DC Release Party

2011-04-14 Thread Peter Neubauer
Ha,
you will never beat Swedish Geeks in Stockholm and a LIVE release at
the event! I expect nothing less than awesomeness from the DC meetup!

Cheers,

/peter neubauer

GTalk:      neubauer.peter
Skype       peter.neubauer
Phone       +46 704 106975
LinkedIn   http://www.linkedin.com/in/neubauer
Twitter      http://twitter.com/peterneubauer

http://www.neo4j.org               - Your high performance graph database.
http://startupbootcamp.org/    - Öresund - Innovation happens HERE.
http://www.thoughtmade.com - Scandinavia's coolest Bring-a-Thing party.



On Thu, Apr 14, 2011 at 6:06 PM, Andreas Kollegger
 wrote:
> Graphistas of the greater Washington metro area,
>
> As mentioned in the release blog, we'll be getting together at The Front 
> Page[1] next Tuesday at 7:00pm. The Front Page is right near the Dupont 
> Circle metro stop on the red line. I'll be the guy wearing the black ball cap 
> with a Neo4j logo.
>
> While I'm not saying that the other meet-ups will be lame, they won't have me 
> as a host, which limits how cool they can really be.
>
> Reply to this thread, or contact me directly if you have any questions.
>
> Cheers,
> Andreas
> 443-858-8338
>
> [1] http://www.frontpagerestaurant.com/
> ___
> Neo4j mailing list
> User@lists.neo4j.org
> https://lists.neo4j.org/mailman/listinfo/user
>
___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


Re: [Neo4j] REST indexes questions

2011-04-14 Thread Peter Neubauer
Javier,
great work with the Python bindings! I can't answer for all of the
questions, but trying inline. I think the core guys might have some
more insights here.

On Wed, Apr 13, 2011 at 9:46 PM, Javier de la Rosa  wrote:
> Hi all, at first congratulations for the new GA release of Neo4j.
> Really great job.
>
> I'm now developing the support for indexes and I have changed a bit
> the compatibility with neo4j.py due to currently it doesn't work with
> nodes indexes neither relationships indexes.
> But I have some related questions:
> 1. How can I remove an index?
Removing an index is a good catch, I don't think there is a method for
completely removing an Index right now. We should add that for 1.4
since this is really a bug in the API unless I am missing something
here.

> 2. Would it be possible, when I get the info of a node, also provide
> the indexes in which the node has been indexed?
Not really, as Indicies are independent XA resources from the kernel.
You can loop through the existing indexes and check if the node is in
there, but I imagine that is a costly operation. Mattias, is there any
easy way to do this? If yes, we could add it as an extension or
directly into the Node-representation I think.

> 3. What it happens if I remove a node that it's already indexed? Is it
> removed from the index too?
Same here. If you don't include a node-removal from the index in the
same transaction, the index is going to be stale. All of the above
behaviors are target for an "auto-index" feature that would help you
with this - also for 1.4 planned.

> 4. What's the real difference between the query exact method and the
> advanced query? I mean, using only the advanced query, are you able to
> query in exact mode?
It is referring to the exposure of the get vs. query interface of the
Index API, see 
http://components.neo4j.org/neo4j/1.3/apidocs/org/neo4j/graphdb/index/Index.html
.


Thanks so much for the feedback! This is very valuable input for the
next cycles. Keep it coming Javier!

/peter
___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


Re: [Neo4j] Neo Server slow on frequent requests

2011-04-14 Thread David Montag
Hi Dario,

I just had a peek at the thread dump, and it appear that it was not captured
during the frozen period. Is that correct? If captured when the system
appears frozen, it will show information about what the threads are doing or
waiting for.

Thanks,
David

On Wed, Apr 13, 2011 at 2:50 AM, Dario Rexin  wrote:

> Hi Tobias,
>
> I was already sending requests to the server in the last dump. Here is
> another, hopefully this one is more helpful. The longer I request data, the
> longer it takes for the server to answer. After some time it frequently
> freezes for up to several seconds without answering to any of the requests.
>
> https://gist.github.com/917283
>
> Cheers,
>
> Dario
>
> Am 13.04.11 11:14 schrieb "Tobias Ivarsson" unter
> :
>
> > Hi Dario,
> >
> > This dump looks perfectly fine, the expected threads are there, but they
> are
> > all idle waiting for work.
> >
> > When I asked for a thread dump, I wanted one from when the server was
> under
> > load and you experienced problems. Sorry for not being clear about that.
> >
> > Cheers,
> > Tobias
> >
> > On Wed, Apr 13, 2011 at 10:34 AM, Dario Rexin 
> wrote:
> >
> >> Hey,
> >>
> >> Somehow my attached files always get deleted. Heres the dump:
> >>
> >> https://gist.github.com/917199
> >>
> >>
> >> Cheers,
> >>
> >> Dario
> >>
> >> Am 13.04.11 10:30 schrieb "Dario Rexin" unter :
> >>
> >>> Hi Tobias,
> >>>
> >>> Here's the thread dump you asked for. Thank you for taking a look at
> >> this.
> >>>
> >>>
> >>> Cheers,
> >>>
> >>> Dario
> >>>
> >>>
> >>> Am 12.04.11 22:16 schrieb "Tobias Ivarsson" unter
> >>> :
> >>>
>  Hi Dario,
> 
>  Looking at that picture it is indeed clear that a number of threads
> are
>  waiting for something. What is not shown is the more important
> >> information
>  about *what* they are waiting for. I would love to get information
> like
> >> that
>  in order to investigate the cause of the performance problem you are
> >> seeing.
>  If you could send a thread dump instead of a screenshot that would be
> a
> >> lot
>  more useful, since that would contain information about contention
> that
> >> I
>  could actually analyze. The easiest way to get a thread dump is by
> >> sending
>  the SIGQUIT signal (kill -3) to the JVM process running Neo4j.
> 
>  Cheers,
>  Tobias
> 
>  On Tue, Apr 12, 2011 at 6:35 PM, Dario Rexin 
> >> wrote:
> 
> > Hi all,
> >
> > Due to huge performance issues with some of our neo queries I
> profiled
> >> my
> > calls on the neo server. The profiling shows, that up to 85% of the
> >> time the
> > threads are waiting for other threads. I don¹t understand what¹s
> going
> >> on
> > there. Hopefully someone with a deeper knowledge can help me. Am I
> >> doing
> > something wrong, or is it normal, that most of the time the threads
> are
> > blocking each other? Her is a screenshot, showing the results of my
> > profiling: http://i.imgur.com/eIfam.jpg
> >
> >
> > Thanks in advice,
> >
> > Dario
> > ___
> > Neo4j mailing list
> > User@lists.neo4j.org
> > https://lists.neo4j.org/mailman/listinfo/user
> >
> 
> 
> >>
> >> --
> >> Dario Rexin
> >> Junior Manager Engineering
> >> dario.re...@xing.com
> >>
> >> XING AG
> >> Gaensemarkt 43, 20354 Hamburg, Germany
> >>
> >> Commercial Reg. (Registergericht): Amtsgericht Hamburg, HRB 98807
> >> Exec. Board (Vorstand): Dr. Stefan Groß-Selbeck (Vorsitzender), Ingo
> Chu,
> >> Dr. Helmut Becker, Jens Pape
> >> Chairman of the Supervisory Board (Aufsichtsratsvorsitzender): Dr. Neil
> >> Sunderland
> >>
> >> Please join my network on XING:
> >> https://www.xing.com/profile/Dario_Rexin
> >>
> >> This e-mail may contain confidential and/or privileged information. If
> you
> >> are not the intended recipient (or have received this e-mail in error)
> >> please notify the sender immediately and destroy this e-mail. Any
> >> unauthorised copying, disclosure or distribution of the material in this
> >> e-mail is strictly forbidden and may be unlawful.
> >>
> >> ___
> >> Neo4j mailing list
> >> User@lists.neo4j.org
> >> https://lists.neo4j.org/mailman/listinfo/user
> >>
> >
> >
>
> --
> Dario Rexin
> Junior Manager Engineering
> dario.re...@xing.com
>
> XING AG
> Gaensemarkt 43, 20354 Hamburg, Germany
>
> Commercial Reg. (Registergericht): Amtsgericht Hamburg, HRB 98807
> Exec. Board (Vorstand): Dr. Stefan Groß-Selbeck (Vorsitzender), Ingo Chu,
> Dr. Helmut Becker, Jens Pape
> Chairman of the Supervisory Board (Aufsichtsratsvorsitzender): Dr. Neil
> Sunderland
>
> Please join my network on XING:
> https://www.xing.com/profile/Dario_Rexin
>
> This e-mail may contain confidential and/or privileged information. If you
> are not the intended recipient (or have received this e-mail in error)
> please notify the sender immediately and destroy this

[Neo4j] Washington DC Release Party

2011-04-14 Thread Andreas Kollegger
Graphistas of the greater Washington metro area,

As mentioned in the release blog, we'll be getting together at The Front 
Page[1] next Tuesday at 7:00pm. The Front Page is right near the Dupont Circle 
metro stop on the red line. I'll be the guy wearing the black ball cap with a 
Neo4j logo. 

While I'm not saying that the other meet-ups will be lame, they won't have me 
as a host, which limits how cool they can really be. 

Reply to this thread, or contact me directly if you have any questions. 

Cheers,
Andreas
443-858-8338

[1] http://www.frontpagerestaurant.com/
___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


[Neo4j] How to combine both traversing and index queries?

2011-04-14 Thread Ville Mattila
Hi there,

I am somehow stuck with a problem of combining traversing and queries
to indices efficiently - something like finding all people with a name
starting with "Vil*" two steps away from a reference node.

Traversing all friends within two steps from the reference node is
trivial, but I find it a bit inefficient to apply a return evaluator
in each of the nodes visited during traversal. Or is it so? How about
more complex criteria which may involve more than one property or even
more complex (Lucene) queries?

I was thinking to spice up my Neo4j setup with Elasticsearch
(www.elasticsearch.org) to dedicate Neo4j to keep track of the
relationships and ES to index all the data in them, however it makes
me feel very uncomfortable to keep up the consistency when data gets
updated. However, now I need to keep also Neo4j indices updated. And
not to be said, combining traversal and an external index is yet more
complicated. However I like the idea I don't need to index each
property separately (as it seems to be with Neo4j indices now).

Just to clarify, I use REST API with Neo4j.

Maybe I am completely lost and somehow fixed to only one viewpoint in
this whole case... So, any comments, they are appreciated. =)

Thanks,
Ville
___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


Re: [Neo4j] [neo4j] Exposing Webadmin

2011-04-14 Thread Georg Summer
so in theory if I changed the proxy settings to:

ProxyPass / http://:7474
ProxyPassReverse / http://:7474

and reverse the neo4j-server.properties to the original state should work
out. (To answer that myself: yes it does)

Any idea how I could configure these proxy settings in a nicer way so that I
am still able to access the other sides on the apache? e.g. phpmyadmin

On a side node: I am doing this dance because I like the Browser and it
would be perfect for a specific part of my concept. I do not know the
complete structure, but in theory it should be possible to rib out the
html/java script of the browser and just dump them in /var/www/neo_browser?

Georg

On 14 April 2011 14:56, Jim Webber  wrote:

> Hi guys,
>
> > Jim, what is your RESTy expertise on this?
>
> Our JAX-RS plumbing will dispatch on very specific parts of the URI (that's
> just the way JAX-RS works). So if ultimately you send the server something
> that doesn't match, it's either going to cause a 404 or similar, or where we
> think it's sensible it'll cause a redirect to webadmin.
>
> So the only solution is to preserve the URI structure on the way into the
> Noe4j server, and (potentially) have it mapped to something else on the
> network where your client lives.
>
> Jim
> ___
> Neo4j mailing list
> User@lists.neo4j.org
> https://lists.neo4j.org/mailman/listinfo/user
>
___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


[Neo4j] 20 billion primitives machine

2011-04-14 Thread Jamie Stephens
Folks,

I've got an application that has (will have) about 2 billion vertexes
and maybe 8 billion edges (?).  Maybe an avg of 4 properties per
vertex -- with maybe an avg of 32 bytes/value.  So I guess that's 16
billion primitives.  Let's round to 20 billion.  My edges estimate is
a relatively uninformed guess.  Just starting to dig into the data.

Traversals will be relatively shallow.  Concurrent access.  Throughput
is more important than latency.  But latency should be better than
maybe 50ms 99% of the time (allowing for some cache warming and some
GC).  I don't know much yet about locality.  I'm not sure yet how
sensitive the app will be to long GCs.

We will need to do a big batch load, and writes will need to be fast
in that phase.  After that, we'll see more reads that writes.  So I
imagine a config for the batch load and another config for production.

I understand cache sharding, application-level partitioning, and so
forth.  I'm wondering what I can do on a single machine -- and what
that machine should look like.

http://docs.neo4j.org/chunked/stable/configuration-jvm.html and
http://wiki.neo4j.org/content/Neo4j_Performance_Guide are encouraging.
 And having knobs as documented at
http://wiki.neo4j.org/content/Configuration_Settings is great.  Nice
work!

I'm hoping I might be able to get away with 128GB RAM on 12 cores with
data striped over a handful of disks (SSDs if required).  We'll
probably also need a cluster for both traffic and availability, but
that's another topic.

Does anybody have experience with a data set like this on a similar
machine?  How much RAM and how much disk -- and what kinds and in what
configuration?  Latency, throughput, general experience?  Any
production deployments?

I'd appreciate any guidance or feedback.  I'm happy to summarize later
if that'd be helpful.

BTW, my testbed uses Clojure with clojure.contrib.server-socket and
https://github.com/wagjo/borneo. Very convenient!

--Jamie
___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


Re: [Neo4j] [neo4j] Exposing Webadmin

2011-04-14 Thread Jim Webber
Hi guys,

> Jim, what is your RESTy expertise on this?

Our JAX-RS plumbing will dispatch on very specific parts of the URI (that's 
just the way JAX-RS works). So if ultimately you send the server something that 
doesn't match, it's either going to cause a 404 or similar, or where we think 
it's sensible it'll cause a redirect to webadmin.

So the only solution is to preserve the URI structure on the way into the Noe4j 
server, and (potentially) have it mapped to something else on the network where 
your client lives.

Jim
___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


Re: [Neo4j] [neo4j] Exposing Webadmin

2011-04-14 Thread Peter Neubauer
Mmh!
This might be the problem. The server is issuing these URLs, and has
no way of knowing that you did reverse proxy everything. This is an
interesting case.

Jim, what is your RESTy expertise on this?

Cheers,

/peter neubauer

GTalk:      neubauer.peter
Skype       peter.neubauer
Phone       +46 704 106975
LinkedIn   http://www.linkedin.com/in/neubauer
Twitter      http://twitter.com/peterneubauer

http://www.neo4j.org               - Your high performance graph database.
http://startupbootcamp.org/    - Öresund - Innovation happens HERE.
http://www.thoughtmade.com - Scandinavia's coolest Bring-a-Thing party.



On Thu, Apr 14, 2011 at 1:33 PM, Georg Summer  wrote:
> Yeah Curl works fine; as well as from a Browser;
>
> curl http:///neo4jdb/db/manage/
> {
>  "services" : {
>    "console" : "http:///db/manage/server/console",
>    "jmx" : "http://1/db/manage/server/jmx",
>    "monitor" : "http:///db/manage/server/monitor"
>  }
> }
>
> what I just noticed is that the links here can not work because they are
> missing my /neo4jdb/
> Is that a problem with the proxy settings?
>
>
> Georg
>
> On 14 April 2011 13:08, Peter Neubauer 
> wrote:
>
>> Georg,
>> mmh! I could try setting this up on my local machine, but is anyone
>> else having input here? Seems like we should examine the Webadmin
>> Javascript requests.
>>
>> Can you access the data REST endpoint with curl?
>>
>> Cheers,
>>
>> /peter neubauer
>>
>> GTalk:      neubauer.peter
>> Skype       peter.neubauer
>> Phone       +46 704 106975
>> LinkedIn   http://www.linkedin.com/in/neubauer
>> Twitter      http://twitter.com/peterneubauer
>>
>> http://www.neo4j.org               - Your high performance graph database.
>> http://startupbootcamp.org/    - Öresund - Innovation happens HERE.
>> http://www.thoughtmade.com - Scandinavia's coolest Bring-a-Thing party.
>>
>>
>>
>> On Thu, Apr 14, 2011 at 12:50 PM, Georg Summer 
>> wrote:
>> > Hi there,
>> >
>> > My setup is that I have a server in a subnet protected by relative strict
>> > port policy, so I can not access the 7474 port of the server where by
>> > default neo4j is running.
>> > I have an Apache Server running as well serving some websites.
>> > I am accessing it remotely from a different part of the network.
>> >
>> > Checking the documentation showed that I can use a proxypass to expose
>> the
>> > restful service:
>> > ProxyPass /neo4jdb/db http://localhost:7474/db
>> > ProxyPassReverse /neo4jdb/db http://localhost:7474/db
>> >
>> > which works fine. (at least using it through a browser for querying)
>> >
>> > So my idea was: same thing for the webadmin:
>> > ProxyPass /neo4jdb/webadmin http://localhost:7474/webadmin
>> > ProxyPassReverse /neo4jdb/webadmin http://localhost:7474/webadmin
>> >
>> > I get directed to the webadmin page. The side gets displayed but no
>> values
>> > are retrieved so all status info is undefined. The Browser also only
>>  tells
>> > me: not found.
>> > The Service Info displays: no bean found
>> >
>> > # REST endpoint for the data API
>> > # Note the / in the end is mandatory
>> > #org.neo4j.server.webadmin.data.uri=/db/data/
>> > org.neo4j.server.webadmin.data.uri=http://localhost/neo4jdb/db/data/
>> >
>> >
>> > # REST endpoint of the administration API (used by Webadmin)
>> > #org.neo4j.server.webadmin.management.uri=/db/manage/
>> > org.neo4j.server.webadmin.management.uri=
>> http://localhost/neo4jdb/db/manage/
>> >
>> > That didn't work. I also tried directly the IP address of the machine. as
>> > well as:
>> > org.neo4j.server.webadmin.data.uri=http://localhost:7474/db/data/
>> > org.neo4j.server.webadmin.management.uri=
>> http://localhost:7474/db/manage/
>> >
>> > no luck either.
>> >
>> > Any Idea what I am missing?
>> >
>> > regards,
>> >
>> > Georg
>> > ___
>> > Neo4j mailing list
>> > User@lists.neo4j.org
>> > https://lists.neo4j.org/mailman/listinfo/user
>> >
>> ___
>> Neo4j mailing list
>> User@lists.neo4j.org
>> https://lists.neo4j.org/mailman/listinfo/user
>>
> ___
> Neo4j mailing list
> User@lists.neo4j.org
> https://lists.neo4j.org/mailman/listinfo/user
>
___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


Re: [Neo4j] [neo4j] Exposing Webadmin

2011-04-14 Thread Georg Summer
Yeah Curl works fine; as well as from a Browser;

curl http:///neo4jdb/db/manage/
{
  "services" : {
"console" : "http:///db/manage/server/console",
"jmx" : "http://1/db/manage/server/jmx",
"monitor" : "http:///db/manage/server/monitor"
  }
}

what I just noticed is that the links here can not work because they are
missing my /neo4jdb/
Is that a problem with the proxy settings?


Georg

On 14 April 2011 13:08, Peter Neubauer wrote:

> Georg,
> mmh! I could try setting this up on my local machine, but is anyone
> else having input here? Seems like we should examine the Webadmin
> Javascript requests.
>
> Can you access the data REST endpoint with curl?
>
> Cheers,
>
> /peter neubauer
>
> GTalk:  neubauer.peter
> Skype   peter.neubauer
> Phone   +46 704 106975
> LinkedIn   http://www.linkedin.com/in/neubauer
> Twitter  http://twitter.com/peterneubauer
>
> http://www.neo4j.org   - Your high performance graph database.
> http://startupbootcamp.org/- Öresund - Innovation happens HERE.
> http://www.thoughtmade.com - Scandinavia's coolest Bring-a-Thing party.
>
>
>
> On Thu, Apr 14, 2011 at 12:50 PM, Georg Summer 
> wrote:
> > Hi there,
> >
> > My setup is that I have a server in a subnet protected by relative strict
> > port policy, so I can not access the 7474 port of the server where by
> > default neo4j is running.
> > I have an Apache Server running as well serving some websites.
> > I am accessing it remotely from a different part of the network.
> >
> > Checking the documentation showed that I can use a proxypass to expose
> the
> > restful service:
> > ProxyPass /neo4jdb/db http://localhost:7474/db
> > ProxyPassReverse /neo4jdb/db http://localhost:7474/db
> >
> > which works fine. (at least using it through a browser for querying)
> >
> > So my idea was: same thing for the webadmin:
> > ProxyPass /neo4jdb/webadmin http://localhost:7474/webadmin
> > ProxyPassReverse /neo4jdb/webadmin http://localhost:7474/webadmin
> >
> > I get directed to the webadmin page. The side gets displayed but no
> values
> > are retrieved so all status info is undefined. The Browser also only
>  tells
> > me: not found.
> > The Service Info displays: no bean found
> >
> > # REST endpoint for the data API
> > # Note the / in the end is mandatory
> > #org.neo4j.server.webadmin.data.uri=/db/data/
> > org.neo4j.server.webadmin.data.uri=http://localhost/neo4jdb/db/data/
> >
> >
> > # REST endpoint of the administration API (used by Webadmin)
> > #org.neo4j.server.webadmin.management.uri=/db/manage/
> > org.neo4j.server.webadmin.management.uri=
> http://localhost/neo4jdb/db/manage/
> >
> > That didn't work. I also tried directly the IP address of the machine. as
> > well as:
> > org.neo4j.server.webadmin.data.uri=http://localhost:7474/db/data/
> > org.neo4j.server.webadmin.management.uri=
> http://localhost:7474/db/manage/
> >
> > no luck either.
> >
> > Any Idea what I am missing?
> >
> > regards,
> >
> > Georg
> > ___
> > Neo4j mailing list
> > User@lists.neo4j.org
> > https://lists.neo4j.org/mailman/listinfo/user
> >
> ___
> Neo4j mailing list
> User@lists.neo4j.org
> https://lists.neo4j.org/mailman/listinfo/user
>
___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


Re: [Neo4j] [neo4j] Exposing Webadmin

2011-04-14 Thread Peter Neubauer
Georg,
mmh! I could try setting this up on my local machine, but is anyone
else having input here? Seems like we should examine the Webadmin
Javascript requests.

Can you access the data REST endpoint with curl?

Cheers,

/peter neubauer

GTalk:      neubauer.peter
Skype       peter.neubauer
Phone       +46 704 106975
LinkedIn   http://www.linkedin.com/in/neubauer
Twitter      http://twitter.com/peterneubauer

http://www.neo4j.org               - Your high performance graph database.
http://startupbootcamp.org/    - Öresund - Innovation happens HERE.
http://www.thoughtmade.com - Scandinavia's coolest Bring-a-Thing party.



On Thu, Apr 14, 2011 at 12:50 PM, Georg Summer  wrote:
> Hi there,
>
> My setup is that I have a server in a subnet protected by relative strict
> port policy, so I can not access the 7474 port of the server where by
> default neo4j is running.
> I have an Apache Server running as well serving some websites.
> I am accessing it remotely from a different part of the network.
>
> Checking the documentation showed that I can use a proxypass to expose the
> restful service:
> ProxyPass /neo4jdb/db http://localhost:7474/db
> ProxyPassReverse /neo4jdb/db http://localhost:7474/db
>
> which works fine. (at least using it through a browser for querying)
>
> So my idea was: same thing for the webadmin:
> ProxyPass /neo4jdb/webadmin http://localhost:7474/webadmin
> ProxyPassReverse /neo4jdb/webadmin http://localhost:7474/webadmin
>
> I get directed to the webadmin page. The side gets displayed but no values
> are retrieved so all status info is undefined. The Browser also only  tells
> me: not found.
> The Service Info displays: no bean found
>
> # REST endpoint for the data API
> # Note the / in the end is mandatory
> #org.neo4j.server.webadmin.data.uri=/db/data/
> org.neo4j.server.webadmin.data.uri=http://localhost/neo4jdb/db/data/
>
>
> # REST endpoint of the administration API (used by Webadmin)
> #org.neo4j.server.webadmin.management.uri=/db/manage/
> org.neo4j.server.webadmin.management.uri=http://localhost/neo4jdb/db/manage/
>
> That didn't work. I also tried directly the IP address of the machine. as
> well as:
> org.neo4j.server.webadmin.data.uri=http://localhost:7474/db/data/
> org.neo4j.server.webadmin.management.uri=http://localhost:7474/db/manage/
>
> no luck either.
>
> Any Idea what I am missing?
>
> regards,
>
> Georg
> ___
> Neo4j mailing list
> User@lists.neo4j.org
> https://lists.neo4j.org/mailman/listinfo/user
>
___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user


[Neo4j] [neo4j] Exposing Webadmin

2011-04-14 Thread Georg Summer
Hi there,

My setup is that I have a server in a subnet protected by relative strict
port policy, so I can not access the 7474 port of the server where by
default neo4j is running.
I have an Apache Server running as well serving some websites.
I am accessing it remotely from a different part of the network.

Checking the documentation showed that I can use a proxypass to expose the
restful service:
ProxyPass /neo4jdb/db http://localhost:7474/db
ProxyPassReverse /neo4jdb/db http://localhost:7474/db

which works fine. (at least using it through a browser for querying)

So my idea was: same thing for the webadmin:
ProxyPass /neo4jdb/webadmin http://localhost:7474/webadmin
ProxyPassReverse /neo4jdb/webadmin http://localhost:7474/webadmin

I get directed to the webadmin page. The side gets displayed but no values
are retrieved so all status info is undefined. The Browser also only  tells
me: not found.
The Service Info displays: no bean found

# REST endpoint for the data API
# Note the / in the end is mandatory
#org.neo4j.server.webadmin.data.uri=/db/data/
org.neo4j.server.webadmin.data.uri=http://localhost/neo4jdb/db/data/


# REST endpoint of the administration API (used by Webadmin)
#org.neo4j.server.webadmin.management.uri=/db/manage/
org.neo4j.server.webadmin.management.uri=http://localhost/neo4jdb/db/manage/

That didn't work. I also tried directly the IP address of the machine. as
well as:
org.neo4j.server.webadmin.data.uri=http://localhost:7474/db/data/
org.neo4j.server.webadmin.management.uri=http://localhost:7474/db/manage/

no luck either.

Any Idea what I am missing?

regards,

Georg
___
Neo4j mailing list
User@lists.neo4j.org
https://lists.neo4j.org/mailman/listinfo/user