Hello Thomas, 
   thanks for your email, following the details about the STATS during the 
FSSCAN:

## BEST RESULT  with user_acct group_acct disabled / multithreads DB :

2014/06/13 08:08:59 robinhood@robinhood[6946/4] STATS | DB_APPLY      : last: 
61E893EF/144115675168894784, status=processing
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS | ==================== 
Dumping stats at 2014/06/13 08:09:59 =====================
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS | ======== General 
statistics =========
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS | Daemon start time: 
2014/06/13 08:06:58
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS | Started modules: scan
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS | ======== FS scan 
statistics =========
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS | current scan interval = 
6.0h
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS | scan is running:
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS |      started at : 
2014/06/13 08:06:59 (3.0min ago)
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS |      last action: 
2014/06/13 08:09:59 (00s ago)
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS |      progress   : 
2020110 entries scanned (0 errors)
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS |      avg. speed : 2.85 
ms/entry/thread -> 11231.02 entries/sec
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS |      inst. speed: 4.20 
ms/entry/thread -> 7618.37 entries/sec
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS | ==== EntryProcessor 
Pipeline Stats ===
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS | Idle threads: 9 
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS | Id constraints count: 
11 (hash min=0/max=1/avg=0.0)
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS | Stage              | 
Wait | Curr | Done |     Total | ms/op | 
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS |  0: GET_FID        |    
0 |    0 |    0 |         0 |  0.00 | 
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS |  1: GET_INFO_DB    |    
0 |    1 |    0 |   2021254 |  0.20 | 
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS |  2: GET_INFO_FS    |    
0 |    0 |    0 |   2021254 |  0.01 | 
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS |  3: REPORTING      |    
2 |    0 |    0 |   2021252 |  0.00 | 
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS |  4: PRE_APPLY      |    
0 |    1 |    0 |   2021251 |  0.00 | 
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS |  5: DB_APPLY       |    
0 |    7 |    0 |   2021244 |  0.95 | 
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS |  6: RM_OLD_ENTRIES |    
0 |    0 |    0 |         0 |  0.00 | 
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS | DB ops: 
get=89/ins=2021155/upd=89/rm=0
2014/06/13 08:09:59 robinhood@robinhood[6946/4] STATS | --- Pipeline stage 
details --

Filesystem scan activity:

    Current scan interval:   6.0h

    Last filesystem scan:
            status:          done
            start:           2014/06/13 08:06:59
            end:             2014/06/13 08:10:40
            duration:        03min 41s

         Statistics:
            entries scanned: 2523759
            errors:          0
            timeouts:        0
            # threads:       32
            average speed:   11428.57 entries/sec
##########################################


## RESULT with user_acct group_acct enabled / multithreads DB  :

2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS | ==================== 
Dumping stats at 2014/06/13 08:40:27 =====================
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS | ======== General 
statistics =========
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS | Daemon start time: 
2014/06/13 08:30:27
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS | Started modules: scan
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS | ======== FS scan 
statistics =========
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS | current scan interval 
= 6.0h
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS | scan is running:
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS |      started at : 
2014/06/13 08:30:27 (10.0min ago)
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS |      last action: 
2014/06/13 08:40:27 (00s ago)
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS |      progress   : 
1214687 entries scanned (0 errors)
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS |      avg. speed : 
15.25 ms/entry/thread -> 2098.43 entries/sec
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS |      inst. speed: 
22.94 ms/entry/thread -> 1394.92 entries/sec
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS | ==== EntryProcessor 
Pipeline Stats ===
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS | Idle threads: 16
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS | Id constraints count: 
100000 (hash min=8/max=19/avg=12.6)
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS | Stage              | 
Wait | Curr | Done |     Total | ms/op | 
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS |  0: GET_FID        |   
 0 |    0 |    0 |         0 |  0.00 | 
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS |  1: GET_INFO_DB    |   
 0 |    0 |    0 |   1248125 |  0.21 | 
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS |  2: GET_INFO_FS    |   
 0 |    0 |    0 |   1248125 |  0.01 | 
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS |  3: REPORTING      |   
 0 |    0 |    0 |   1248125 |  0.00 | 
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS |  4: PRE_APPLY      |   
 0 |    0 |    0 |   1248125 |  0.00 | 
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS |  5: DB_APPLY       
|99983 |   16 |    0 |   1148126 |  8.17 | 
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS |  6: RM_OLD_ENTRIES |   
 0 |    0 |    0 |         0 |  0.00 | 
2014/06/13 08:40:27 robinhood@robinhood[23160/4] STATS | DB ops: 
get=89/ins=1148037/upd=89/rm=0


Filesystem scan activity:

    Current scan interval:   6.0h

    Last filesystem scan:
            status:          done
            start:           2014/06/13 08:30:27
            end:             2014/06/13 08:52:28
            duration:        22min 01s

         Statistics:
            entries scanned: 2523760
            errors:          0
            timeouts:        0
            # threads:       32
            average speed:   1923.08 entries/sec
###################################


## RESULT with user_acct group_acct enabled / batch DB :

2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS | ======== FS scan 
statistics =========
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS | current scan interval 
= 6.0h
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS | scan is running:
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS |      started at : 
2014/06/13 09:00:54 (5.0min ago)
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS |      last action: 
2014/06/13 09:05:54 (00s ago)
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS |      progress   : 
811449 entries scanned (0 errors)
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS |      avg. speed : 
11.73 ms/entry/thread -> 2728.74 entries/sec
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS |      inst. speed: 
12.80 ms/entry/thread -> 2499.13 entries/sec
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS | ==== EntryProcessor 
Pipeline Stats ===
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS | Idle threads: 8
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS | Id constraints count: 
99999 (hash min=4/max=24/avg=12.6)
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS | Stage              | 
Wait | Curr | Done |     Total | ms/op |
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS |  0: GET_FID        |  
  0 |    0 |    0 |         0 |  0.00 |
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS |  1: GET_INFO_DB    |  
  0 |    0 |    0 |    817744 |  0.17 |
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS |  2: GET_INFO_FS    |  
  0 |    0 |    0 |    817744 |  0.01 |
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS |  3: REPORTING      |  
  0 |    0 |    0 |    817744 |  0.00 |
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS |  4: PRE_APPLY      |  
  0 |    0 |    0 |    817744 |  0.00 |
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS |  5: DB_APPLY       
|99995 |    4 |    0 |    717745 |  0.36 | 89.96% batched (avg batch size: 9.7)
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS |  6: RM_OLD_ENTRIES |  
  0 |    0 |    0 |         0 |  0.00 |
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS | DB ops: 
get=89/ins=717656/upd=89/rm=0
2014/06/13 09:05:54 robinhood@santisrbh01[7739/4] STATS | --- Pipeline stage 
details ---

Filesystem scan activity:

    Current scan interval:   6.0h

    Last filesystem scan:
            status:          done
            start:           2014/06/13 09:00:54
            end:             2014/06/13 09:15:41
            duration:        14min 47s

         Statistics:
            entries scanned: 2523760
            errors:          0
            timeouts:        0
            # threads:       32
            average speed:   2862.25 entries/sec

#########################


## RESULT with user_acct group_acct disabled / batch DB :

2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS | ======== FS scan 
statistics =========
2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS | current scan interval 
= 6.0h
2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS | scan is running:
2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS |      started at : 
2014/06/13 09:18:40 (1.0min ago)
2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS |      last action: 
2014/06/13 09:19:40 (00s ago)
2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS |      progress   : 
358138 entries scanned (0 errors)
2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS |      avg. speed : 5.25 
ms/entry/thread -> 6098.09 entries/sec
2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS |      inst. speed: 
26.34 ms/entry/thread -> 1214.79 entries/sec
2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS | ==== EntryProcessor 
Pipeline Stats ===
2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS | Idle threads: 1 
2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS | Id constraints count: 
89022 (hash min=6/max=20/avg=11.2)
2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS | Stage              | 
Wait | Curr | Done |     Total | ms/op | 
2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS |  0: GET_FID        |   
 0 |    0 |    0 |         0 |  0.00 | 
2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS |  1: GET_INFO_DB    
|11928 |    2 | 23902 |    347795 |  0.15 | 
2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS |  2: GET_INFO_FS    |   
 0 |    0 |    0 |    323893 |  0.01 | 
2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS |  3: REPORTING      |   
 0 |    0 |    0 |    323893 |  0.00 | 
2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS |  4: PRE_APPLY      |   
 0 |    0 |    0 |    323893 |  0.00 | 
2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS |  5: DB_APPLY       
|53175 |   11 |    0 |    270707 |  0.18 | 89.33% batched (avg batch size: 9.3)
2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS |  6: RM_OLD_ENTRIES |   
 0 |    0 |    0 |         0 |  0.00 | 
2014/06/13 09:19:40 robinhood@robinhood[24169/4] STATS | DB ops: 
get=24/ins=270683/upd=24/rm=0

Filesystem scan activity:

    Current scan interval:   6.0h

    Last filesystem scan:
            status:          done
            start:           2014/06/13 09:18:40
            end:             2014/06/13 09:27:31
            duration:        08min 51s

         Statistics:
            entries scanned: 2523760
            errors:          0
            timeouts:        0
            # threads:       32
            average speed:   4790.42 entries/sec
#########################################

As you can see the result with multithread DB is much higher than batch DB  
11428.57 entries/sec vs 4790.42 entries/sec

Something that could be helpful in the documentation I think would be the fact 
that formatting the file system were the mysql is running as XFS will offer 
better results in terms of scan performance. Indeed with ext4 I reached at 
maximum 3800 entries/sec with user_acct group_acct disabled, instead with XFS I 
reached 11000  entries/sec.

Is it correct to say that the user_acct group_acct are only needed for the 
webgui? Because if I'm not wrong from command line, even if I clean up the DB 
and I scan without user_acc group_acct, I got all the needed data, but I cannot 
use the web gui. 


Cheers
Fabio

--
- Fabio Verzelloni - CSCS - Swiss National Supercomputing Centre
via Trevano 131 - 6900 Lugano, Switzerland
Tel: +41 (0)91 610 82 04


________________________________________
From: LEIBOVICI Thomas [[email protected]]
Sent: Wednesday, June 11, 2014 4:38 PM
To: Verzelloni  Fabio; [email protected]
Subject: Re: [robinhood-support] question about 'best configuration practice' & 
'mysql errors'

Hi Fabio,

Congratulations. I'm impressed of your in-depth tuning of robinhood,
mysql and your system.

Robinhood should dump stats in its logs at regular interval.
It would help identifying bottlenecks if you could send an extract of
it, like this:


2014/05/21 21:08:34 [13362/4] STATS | ======== FS scan statistics =========
2014/05/21 21:08:34 [13362/4] STATS | scan is running:
2014/05/21 21:08:34 [13362/4] STATS |      started at : 2014/05/21
09:08:26 (12.0h ago)
2014/05/21 21:08:34 [13362/4] STATS |      last action: 2014/05/21
21:08:33 (01s ago)
2014/05/21 21:08:34 [13362/4] STATS |      progress   : 6349237 entries
scanned (0 errors)
2014/05/21 21:08:34 [13362/4] STATS |      avg. speed : 103.84
ms/entry/thread -> 154.09 entries/sec
2014/05/21 21:08:34 [13362/4] STATS |      inst. speed: 41.30
ms/entry/thread -> 387.39 entries/sec
2014/05/21 21:08:34 [13362/4] STATS | ==== EntryProcessor Pipeline Stats ===
2014/05/21 21:08:34 [13362/4] STATS | Idle threads: 15
2014/05/21 21:08:34 [13362/4] STATS | Id constraints count: 10000 (hash
min=0/max=7/avg=1.3)
2014/05/21 21:08:34 [13362/4] STATS | Stage | Wait | Curr | Done |
Total | ms/op |
2014/05/21 21:08:34 [13362/4] STATS |  0: GET_FID |    0 |    0 |    0
|         0 |  0.00 |
2014/05/21 21:08:34 [13362/4] STATS |  1: GET_INFO_DB |    0 |    0 |
0 |   6603345 |  0.25 |
2014/05/21 21:08:34 [13362/4] STATS |  2: GET_INFO_FS |    0 |    0 |
0 |   6603345 |  0.03 |
2014/05/21 21:08:34 [13362/4] STATS |  3: REPORTING |    0 |    0 |    0
|   6603345 |  0.00 |
2014/05/21 21:08:34 [13362/4] STATS |  4: PRE_APPLY |    0 |    0 |    0
|   6603345 |  0.00 |
2014/05/21 21:08:34 [13362/4] STATS |  5: DB_APPLY | 9999 |    1 |    0
|   6593345 |  6.54 | 98.35% batched (avg batch size: 85.1)
2014/05/21 21:08:34 [13362/4] STATS |  6: RM_OLD_ENTRIES |    0 |    0
|    0 |         0 |  0.00 |

I noticed that you disabled DB batching to use multithreaded DB
operations. Did you get better results this way ?

Your hardware looks appropriate. The stats dump will give use more
information.

Regards


On 06/06/14 09:29, Verzelloni Fabio wrote:
> Hello folks,
>     I'm doing some test on robinhood in our environment, some details 
> regarding the hardware in use:
>
> ## Robinhood server ##
>
> IBM M3550 X4
> 128Gb RAM
> 256G HD SSD for mysql
> 2* Intel Xeon Processor E5-2650v2 8C 2.6GHz 20MB Cache 1866MHz
> Lustre 2.5.1
> Mysql 5.5
>
> ## Luster Version in production ##
> lustre 2.1
>
> ## Robinhood.conf ##
>
> FS_Scan {
>          nb_threads_scan = 32;
>          nb_prealloc_tasks=10000;
> }
>
> EntryProcessor {
>          nb_threads = 32;
>          STAGE_GET_FID_threads_max = 16;
>          STAGE_GET_INFO_DB_threads_max = 4;
>          STAGE_GET_INFO_FS_threads_max = 4;
>          STAGE_REPORTING_threads_max = 1;
>          STAGE_DB_APPLY_threads_max = 16;
>          STAGE_CHGLOG_CLR_threads_max = 1;
>          STAGE_RM_OLD_ENTRIES_threads_max = 1;
>          max_pending_operations = 1000;
>          max_batch_size=1;
> }
>
> ## My.cnf ##
>
> [mysqld]
> large-pages
> datadir=/var/lib/mysql
> socket=/var/lib/mysql/mysql.sock
> user=mysql
> # Disabling symbolic-links is recommended to prevent assorted security risks
> symbolic-links=0
> innodb_flush_log_at_trx_commit = 0
> # possibly the most important setting
> max_connections= 512
> innodb_buffer_pool_size= 60G
> # ~50% of memory
> innodb_max_dirty_pages_pct= 15
> innodb_thread_concurrency= 32
> innodb_log_file_size= 100M
> innodb_log_buffer_size= 50M
> innodb_data_file_path= ibdata1:1G:autoextend
> # kernel must be configured for support
> table-open-cache= 2000
> sort-buffer-size= 32M
> read-buffer-size= 16M
> read-rnd-buffer-size= 4M
> thread-cache-size= 128
> query-cache-size= 40M
> query-cache-limit= 1M
> tmp-table-size= 16M
>
> [mysqld_safe]
> log-error=/var/log/mysqld.log
> pid-file=/var/run/mysqld/mysqld.pid
>
> ## vm.nr_hugepages ##
>
> vm.nr_hugepages = 50000
> vm.nr_hugepages_mempolicy = 50000
> vm.hugetlb_shm_group = 27
> vm.hugepages_treat_as_movable = 0
> vm.nr_overcommit_hugepages = 0
>
> ## sysctl ##
>
> kernel.shmmax = 118111600640
> kernel.shmall = 118111600640
>
> ## limits.conf ##
>
> mysql hard memlock unlimited
> mysql soft memlock unlimited
> ---
>
> I'm trying to find the best configuration to reach the best "entries/sec", 
> and with this configuration the best number I can get is 2600 ~ entries/sec. 
> Do you think that based on the HW in use is it possible to improve the speed 
> of the scan?
> What's the best practice to better configure the server to perform the best 
> speed of scan?
>
> While I'm running the initial scan I see a lot of the following messages:
>
> ...
> 2014/06/06 08:34:20 [10535/15] ListMgr | Retryable DB error in ListMgr_Insert 
> l.218. Restarting transaction in 1 sec...
> 2014/06/06 08:34:21 [10535/15] ListMgr | DB deadlock detected
> ...
>
> I was hoping to reach 4000 / 5000 entries/sec do you think with the HW I have 
> available I can manage to reach that result? Suggestions or questions are 
> welcome.
>
> Regards
> Fabio
>
>
> --
> - Fabio Verzelloni - CSCS - Swiss National Supercomputing Centre
> via Trevano 131 - 6900 Lugano, Switzerland
> Tel: +41 (0)91 610 82 04
>
>
> ------------------------------------------------------------------------------
> Learn Graph Databases - Download FREE O'Reilly Book
> "Graph Databases" is the definitive new guide to graph databases and their
> applications. Written by three acclaimed leaders in the field,
> this first edition is now available. Download your free book today!
> http://p.sf.net/sfu/NeoTech
> _______________________________________________
> robinhood-support mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/robinhood-support


------------------------------------------------------------------------------
HPCC Systems Open Source Big Data Platform from LexisNexis Risk Solutions
Find What Matters Most in Your Big Data with HPCC Systems
Open Source. Fast. Scalable. Simple. Ideal for Dirty Data.
Leverages Graph Analysis for Fast Processing & Easy Data Exploration
http://p.sf.net/sfu/hpccsystems
_______________________________________________
robinhood-support mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/robinhood-support

Reply via email to