Hi Colin, Thank you for the prompt response.
I'm a little confused as you say "the US West node will not come online > telling me that the database is not yet online. At that point, I kill the > process and then eventually the database comes online." Do you mean you kill the database process and then restart it and then it > starts communicating? Yes. I kill the database process on the cluster node where the OrientDB is not coming online. Can you see on each machine when Hazelcast 'sees' all the members? Are all > the members showing up? Yes. I see the databases are talking to each other as the IP address of the nodes show up in the log of each database server. I will try setting hotAlignment to false and report my results on this thread. Amir. On Tuesday, March 24, 2015 at 11:25:16 AM UTC-5, Colin wrote: > > Hi Amir, > > Is it consistently a problem between the same machines not seeing each > other? > > I'm a little confused as you say "the US West node will not come online > telling me that the database is not yet online. At that point, I kill the > process and then eventually the database comes online." > > Do you mean you kill the database process and then restart it and then it > starts communicating? > > In your distributed json file, try setting "hotAlignment" to false. > > Can you see on each machine when Hazelcast 'sees' all the members? Are > all the members showing up? > > -Colin > > Orient Technologies > > The Company behind OrientDB > > On Tuesday, March 24, 2015 at 11:19:05 AM UTC-5, Amir Khawaja wrote: >> >> Greetings, everyone. Has anyone had much success running an OrientDB >> 2.0.5 cluster in Azure? I created a cluster in Windows Azure with 4 nodes >> using CentOS 7 and OrientDB Community 2.0.4 -- 2 nodes in US East2 and 2 >> nodes in US West. There is a Site-to-Site VPN connection between the two >> regions in Azure and data is flowing between machines across the network. I >> have three databases that I have currently deployed and testing. I find >> that many times the synchronization between databases does not occur. For >> instance, if I startup the first node in US East2 and once that comes >> online, fire up the second node in US West, the US West node will not come >> online telling me that the database is not yet online. At that point, I >> kill the process and then eventually the database comes online. I even have >> to go so far as to delete the databases in the database path folder. I do >> this a few times and eventually the server may startup. Sometimes, I will >> have three of the four nodes working and the fourth just refuses to come >> online. >> >> The VM size selected for each node in the cluster is a D4 (4 cores, 28GB >> RAM). This should be more than sufficient to handle most loads. Surely, I >> must be missing something as this is not acceptable production behavior. >> For reference, I am pasting the hazelcast.xml and >> default-distributed-db-config.json files here in hopes that someone has >> some pointers for me. >> >> *** hazelcast.xml *** >> >> <?xml version="1.0" encoding="UTF-8"?> >> <!-- ~ Copyright (c) 2008-2012, Hazel Bilisim Ltd. All Rights Reserved. ~ >> ~ Licensed under the Apache License, Version 2.0 (the "License"); ~ you >> may >> not use this file except in compliance with the License. ~ You may obtain >> a copy of the License at ~ ~ http://www.apache.org/licenses/LICENSE-2.0 ~ >> ~ Unless required by applicable law or agreed to in writing, software ~ >> distributed >> under the License is distributed on an "AS IS" BASIS, ~ WITHOUT WARRANTIES >> OR CONDITIONS OF ANY KIND, either express or implied. ~ See the License >> for >> the specific language governing permissions and ~ limitations under the >> License. --> >> >> <hazelcast >> xsi:schemaLocation="http://www.hazelcast.com/schema/config >> hazelcast-config-3.0.xsd" >> xmlns="http://www.hazelcast.com/schema/config" xmlns:xsi=" >> http://www.w3.org/2001/XMLSchema-instance"> >> <group> >> <name>[name]</name> >> <password>[password]</password> >> </group> >> <network> >> <port auto-increment="true">2434</port> >> <join> >> <multicast enabled="false"> >> <multicast-group>235.1.1.1</multicast-group> >> <multicast-port>2434</multicast-port> >> </multicast> >> <tcp-ip enabled="true"> >> <member>10.0.0.4</member> >> <member>10.0.0.5</member> >> <member>10.1.0.4</member> >> <member>10.1.0.5</member> >> </tcp-ip> >> </join> >> </network> >> <executor-service> >> <pool-size>16</pool-size> >> </executor-service> >> </hazelcast> >> >> >> *** default-distributed-db-config.json *** >> >> { >> "autoDeploy": true, >> "hotAlignment": true, >> "executionMode": "synchronous", >> "readQuorum": 1, >> "writeQuorum": 3, >> "failureAvailableNodesLessQuorum": false, >> "readYourWrites": true, >> "clusters": { >> "internal": { >> }, >> "index": { >> }, >> "*": { >> "servers" : [ "<NEW_NODE>" ] >> } >> } >> } >> >> Thank you for any assistance you can offer. >> >> Amir. >> > -- --- You received this message because you are subscribed to the Google Groups "OrientDB" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/d/optout.
