One follow-up note: a FILLFACTOR of 100 sometimes ruins deduplicability for a
few dumps, I guess right after some updates are made.
With a FILLFACTOR of 90 set on each table ("ALTER TABLE table SET (fillfactor =
90);"), I have gotten a deduplication ratio above 90% in all my tests so far.
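
A minimal sketch of applying that setting to several tables at once. The
function name emit_fillfactor_sql and the example table names are placeholders
of mine, not part of any existing Bacula script. It only prints the statements,
so they can be reviewed before being piped to psql.

```shell
#!/bin/sh
# Print one ALTER TABLE statement per table name given as an argument.
# Assumption: table names are plain identifiers that need no quoting.
emit_fillfactor_sql() {
    fillfactor="$1"
    shift
    for table in "$@"; do
        printf 'ALTER TABLE %s SET (fillfactor = %s);\n' "$table" "$fillfactor"
    done
}

# Example (table names are only illustrative):
#   emit_fillfactor_sql 90 file path | su - postgres -c "psql -d bacula"
```

Keep in mind that changing fillfactor does not rewrite pages already on disk;
the new value only takes full effect once the table is rewritten, for example
by the next CLUSTER.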
As for the claimed performance improvement, one user reports that clustering
reduced the execution time of an indexed query from 23 seconds to ~2
milliseconds (source:
https://stackoverflow.com/questions/13234812/improving-query-speed-simple-select-in-big-postgres-table).
 

Regards, 

> From: "Heitor Faria" <hei...@bacula.com.br>
> To: "bacula-devel" <bacula-devel@lists.sourceforge.net>
> Cc: "Kern Sibbald" <k...@sibbald.com>
> Sent: Saturday, June 23, 2018 11:44:07 AM
> Subject: [Bacula-devel] Feature Suggestion: Bacula Pgsql Catalog 
> Clusterization

> Dear Developers,

> By default, PostgreSQL data file storage and dumps are non-deterministic,
> which yields poor deduplication results.
> I ran some tests clustering all (or at least the largest) Bacula Catalog
> tables by their primary key (another index could be used instead), and I got
> a much better dedup ratio.
> The clustering must be set up once for each table:

> select * from pg_indexes where tablename='table';
> CLUSTER table USING table_pkey;

> A BeforeJobScript can then re-cluster every database table that was
> configured this way. E.g.:

> su - postgres -c "psql -d bacula -c 'cluster verbose'"
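
The two steps above could be scripted roughly as follows. This is only a
sketch: emit_cluster_sql and recluster_catalog are hypothetical helper names,
and the <table>_pkey index name is PostgreSQL's default naming for primary key
indexes, which a given catalog may not follow (the pg_indexes query above shows
the real names).

```shell
#!/bin/sh
# One-time setup: print a CLUSTER ... USING statement per table, so that
# PostgreSQL remembers which index each table is clustered on.
# Assumption: each primary key index is named <table>_pkey.
emit_cluster_sql() {
    for table in "$@"; do
        printf 'CLUSTER %s USING %s_pkey;\n' "$table" "$table"
    done
}

# Recurring step: a bare CLUSTER re-clusters every previously clustered table.
# Wrapped in a function so nothing runs until it is called explicitly.
# The database name defaults to "bacula", as in the command above.
recluster_catalog() {
    su - postgres -c "psql -d ${1:-bacula} -c 'CLUSTER VERBOSE'"
}
```

CLUSTER holds an ACCESS EXCLUSIVE lock on each table while it runs, so the
recurring step is best scheduled when no jobs are writing to the catalog.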

> Some literature says clustering might also speed up indexed queries.
> I propose incorporating this technique into the Bacula database creation
> script and the catalog backup dump generation script.

> Regards,
> --
> Heitor Medrado de Faria | CEO Bacula do Brasil & USA | EB-1 Visa | LPIC-III |
> EMC 05-001 | ITIL-F
> • Don't be charged by the size of your backups, get to know Bacula Enterprise
> http://www.bacula.com.br/enterprise/
> • I provide in-company training and implementation for Bacula Community
> http://www.bacula.com.br/in-company/
> • Buy the new Bacula book http://www.bacula.com.br/livro
> • Brazil +55 (61) 98268-4220 | USA +1 (323) 300-5387 | www.bacula.com.br

> I recommend these complementary courses:
> Basic Shell and Shell Programming with Julio Neves | Zabbix with Adail Host.

> ------------------------------------------------------------------------------
> Check out the vibrant tech community on one of the world's most
> engaging tech sites, Slashdot.org! http://sdm.link/slashdot
> _______________________________________________
> Bacula-devel mailing list
> Bacula-devel@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/bacula-devel

-- 
Heitor Medrado de Faria | CEO Bacula do Brasil & USA | EB-1 Visa | LPIC-III |
EMC 05-001 | ITIL-F