Thanks for the suggestions, Mike. I discovered that DINPUT_QUERY_CACHABLE
wasn't true, so I am trying that now; the process kept failing to retrieve uris, so maybe if we fetch them all up front? I looked at the networking a bit - pings are ~ 0.15 ms and I am seeing sustained transfer rates as high as 84MB/s using scp - I think I'd get more with larger files. Also the servers don't seem busy - I am running xqsync on the destination box, which I suppose might not be ideal, but uses less network anyway - it is maxing out one of the cpus during the initial fetch of all the uris (over 10m of them) now that cachable=true. Maybe there is a problem deep paging into the cts:uris query when it is not cached? I'll report back once the data actually starts transferring -Mike On 03/14/2012 10:14 AM, Michael Blakeley wrote: > I would expect better than that. What is the document rate? > > You may not have enough client threads to keep the servers busy. What does > the utilization look like on both sides? > > You may also be memory-limited in the JVM at some point, especially if the > documents are big. If so, the JVM will spend a lot of time running the > garbage collector. You can check that idea with the '-verbose:gc' option. > > Could there be a network limitation other than bandwidth? You might check > that by exporting to packages instead, and see what that performance looks > like. I have seen some cases where there was a slow hop on the network, or > where a firewall was limiting performance. > > -- Mike > > On 14 Mar 2012, at 13:38 , Mike Sokolov wrote: > > >> I wonder if anyone has a rough guide to what sort of transfer speeds can >> be expected using xqsync to transfer a database from one node to >> another. I have two quite beefy servers on the same LAN (at least >> 100Mb/s ~ 12MB/s), and I'm only getting ~30kB/sec. I was hoping to get >> a few orders of magnitude more, but am I smoking crack? Is there >> something I could be doing or not doing that might be limiting the speed >> somehow? >> >> This is my setup: >> >> java -cp ${BIN}/xqsync.jar:$BIN/xcc.jar:$BIN/xstream.jar:$BIN/xpp3.jar >> -Xmx1024m \ >> -DINPUT_CONNECTION_STRING=$SRCDB \ >> -DOUTPUT_CONNECTION_STRING=$DSTDB \ >> -DSKIP_EXISTING=true \ >> -DCOPY_COLLECTIONS=false \ >> -DCOPY_PERMISSIONS=false \ >> -DCOPY_PROPERTIES=true \ >> -DCOPY_QUALITY=false \ >> -DINPUT_BATCH_SIZE=10 \ >> -DINPUT_QUERY_CACHABLE \ >> -DTHREADS=8 \ >> com.marklogic.ps.xqsync.XQSync >> >> These are the startup messages from the log: >> >> INFO: XQSync starting: version 2009-03-10.1 on 1.6.0_26 (Java(TM) SE >> Runtime Environment) >> Mar 14, 2012 2:33:36 PM com.marklogic.ps.xqsync.XQSync main >> INFO: XCC version = 3.2-7 >> Mar 14, 2012 2:33:36 PM com.marklogic.ps.xqsync.XQSyncManager run >> INFO: starting pool of 8 threads, queue size = 10000 >> Mar 14, 2012 2:33:36 PM com.marklogic.ps.xqsync.Monitor run >> INFO: starting >> Mar 14, 2012 2:33:36 PM com.marklogic.ps.xqsync.XQSyncManager run >> INFO: output version info: client 3.2-7, server 4.1-11 >> Mar 14, 2012 2:33:36 PM com.marklogic.ps.xqsync.XQSyncManager run >> INFO: input version info: client 3.2-7, server 4.1-11 >> Mar 14, 2012 2:33:36 PM com.marklogic.ps.xqsync.XQSyncManager >> queueFromInputConnection >> INFO: buffer size = 0, caching = false >> Mar 14, 2012 2:33:36 PM com.marklogic.ps.xqsync.XQSyncManager getUrisRequest >> INFO: listing all documents (with uri lexicon) >> Mar 14, 2012 2:33:36 PM com.marklogic.ps.xqsync.XQSyncManager >> queueFromInputConnection >> >> The connector is a bit old: Can I expect any substantial improvement >> from updating that? >> >> -- >> Michael Sokolov >> Engineering Director >> www.ifactory.com >> >> _______________________________________________ >> General mailing list >> General@developer.marklogic.com >> http://developer.marklogic.com/mailman/listinfo/general >> >> > _______________________________________________ > General mailing list > General@developer.marklogic.com > http://developer.marklogic.com/mailman/listinfo/general > _______________________________________________ General mailing list General@developer.marklogic.com http://developer.marklogic.com/mailman/listinfo/general