[ 
https://issues.apache.org/jira/browse/LUCENE-4599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13558245#comment-13558245
 ] 

Shawn Heisey commented on LUCENE-4599:
--------------------------------------

The new term vector (TV) files are 53.2% of the size of the old ones.
New TV files total 3890809739 bytes.
Old TV files total 7311612548 bytes.

{noformat}
Unmodified Solr 4.1:
total 17154140
drwxr-xr-x 2 ncindex ncindex      45056 Jan 19 21:35 ./
drwxr-xr-x 4 ncindex ncindex       4096 Jan 18 20:15 ../
-rw-r--r-- 1 ncindex ncindex         99 Jan 19 21:28 segments_dt
-rw-r--r-- 1 ncindex ncindex         20 Jan 19 21:28 segments.gen
-rw-r--r-- 1 ncindex ncindex 3220362314 Jan 19 21:11 _uk.fdt
-rw-r--r-- 1 ncindex ncindex    1796091 Jan 19 21:11 _uk.fdx
-rw-r--r-- 1 ncindex ncindex       3291 Jan 19 21:28 _uk.fnm
-rw-r--r-- 1 ncindex ncindex 2712855241 Jan 19 21:23 _uk_Lucene41_0.doc
-rw-r--r-- 1 ncindex ncindex 2641242950 Jan 19 21:23 _uk_Lucene41_0.pos
-rw-r--r-- 1 ncindex ncindex 1605874308 Jan 19 21:23 _uk_Lucene41_0.tim
-rw-r--r-- 1 ncindex ncindex   35091811 Jan 19 21:23 _uk_Lucene41_0.tip
-rw-r--r-- 1 ncindex ncindex        115 Jan 19 21:28 _uk_nrm.cfe
-rw-r--r-- 1 ncindex ncindex   36874222 Jan 19 21:28 _uk_nrm.cfs
-rw-r--r-- 1 ncindex ncindex        473 Jan 19 21:28 _uk.si
-rw-r--r-- 1 ncindex ncindex   24581897 Jan 19 21:28 _uk.tvd
-rw-r--r-- 1 ncindex ncindex 7090368538 Jan 19 21:28 _uk.tvf
-rw-r--r-- 1 ncindex ncindex  196662113 Jan 19 21:28 _uk.tvx

Solr 4.1 with patch:
total 13812100
drwxr-xr-x 2 ncindex ncindex      53248 Jan 20 06:10 ./
drwxr-xr-x 4 ncindex ncindex       4096 Jan 18 20:15 ../
-rw-r--r-- 1 ncindex ncindex 3220492130 Jan 20 05:54 _1oy.fdt
-rw-r--r-- 1 ncindex ncindex    1790533 Jan 20 05:54 _1oy.fdx
-rw-r--r-- 1 ncindex ncindex       3291 Jan 20 06:10 _1oy.fnm
-rw-r--r-- 1 ncindex ncindex 2713448546 Jan 20 06:08 _1oy_Lucene41_0.doc
-rw-r--r-- 1 ncindex ncindex 2640844965 Jan 20 06:08 _1oy_Lucene41_0.pos
-rw-r--r-- 1 ncindex ncindex 1604289094 Jan 20 06:08 _1oy_Lucene41_0.tim
-rw-r--r-- 1 ncindex ncindex   34910618 Jan 20 06:08 _1oy_Lucene41_0.tip
-rw-r--r-- 1 ncindex ncindex        115 Jan 20 06:10 _1oy_nrm.cfe
-rw-r--r-- 1 ncindex ncindex   36874183 Jan 20 06:10 _1oy_nrm.cfs
-rw-r--r-- 1 ncindex ncindex        477 Jan 20 06:10 _1oy.si
-rw-r--r-- 1 ncindex ncindex 3889805695 Jan 20 06:10 _1oy.tvd
-rw-r--r-- 1 ncindex ncindex    1004044 Jan 20 06:10 _1oy.tvx
-rw-r--r-- 1 ncindex ncindex         20 Jan 20 06:10 segments.gen
-rw-r--r-- 1 ncindex ncindex        105 Jan 20 06:10 segments_ul
-rw-r--r-- 1 ncindex ncindex          0 Jan 19 21:39 write.lock
{noformat}
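
As a quick cross-check, the totals and the 53.2% figure can be reproduced from the .tvd, .tvf and .tvx sizes in the two listings above. This is only a minimal sketch: the variable names are made up for illustration, and the byte counts are copied by hand from the listings.

```python
# Term vector file sizes in bytes, copied from the two directory listings.
old_tv = {"_uk.tvd": 24581897, "_uk.tvf": 7090368538, "_uk.tvx": 196662113}
new_tv = {"_1oy.tvd": 3889805695, "_1oy.tvx": 1004044}

old_total = sum(old_tv.values())
new_total = sum(new_tv.values())

# New size expressed as a percentage of the old size, to one decimal place.
ratio_percent = round(100 * new_total / old_total, 1)

print(f"old TV total: {old_total}")
print(f"new TV total: {new_total}")
print(f"new/old:      {ratio_percent}%")
```

Note that the patched listing has no .tvf file, so the new total is the sum of only .tvd and .tvx.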

For the listing below, the _0 and _1 indexes have been swapped; the _1 indexes 
are now live.

{noformat}
ncindex@bigindy5 /index/solr4/data $ du -sc *
492     inc_0
609212  inc_1
24      ncmain
17154980        s0_0
13840212        s0_1
17211000        s1_0
13913260        s1_1
17191660        s2_0
13895536        s2_1
17192320        s3_0
13889920        s3_1
17198940        s4_0
13897380        s4_1
17205112        s5_0
13918936        s5_1
187118984       total
{noformat}
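
The same comparison can be made per shard from the du output. The sketch below assumes that the _0 directories hold the unmodified indexes and the _1 directories hold the patched ones, which the sizes suggest but the listing does not state outright. The numbers are in du's default units (typically 1 KiB blocks on Linux), and the names used are illustrative only.

```python
# du sizes per shard, copied from the listing: (size of _0, size of _1).
# Assumption: _0 is the unmodified index, _1 is the patched index.
shards = {
    "s0": (17154980, 13840212),
    "s1": (17211000, 13913260),
    "s2": (17191660, 13895536),
    "s3": (17192320, 13889920),
    "s4": (17198940, 13897380),
    "s5": (17205112, 13918936),
}

# Per-shard ratio of the _1 size to the _0 size.
for name, (size_0, size_1) in shards.items():
    print(f"{name}: _1 is {100 * size_1 / size_0:.1f}% of _0")

total_0 = sum(size_0 for size_0, _ in shards.values())
total_1 = sum(size_1 for _, size_1 in shards.values())
overall_percent = round(100 * total_1 / total_0, 1)
print(f"all shards: _1 is {overall_percent}% of _0")
```

Under that assumption the whole-index saving is smaller than the term vector saving, because the stored fields, postings and term dictionary files are essentially unchanged by the patch.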

                
> Compressed term vectors
> -----------------------
>
>                 Key: LUCENE-4599
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4599
>             Project: Lucene - Core
>          Issue Type: Task
>          Components: core/codecs, core/termvectors
>            Reporter: Adrien Grand
>            Assignee: Adrien Grand
>            Priority: Minor
>             Fix For: 4.2
>
>         Attachments: 4599-dataimport-fail.log, 4599-zookeer-fail.log, 
> LUCENE-4599.patch, LUCENE-4599.patch, LUCENE-4599.patch, solr.patch
>
>
> We should have codec-compressed term vectors similarly to what we have with 
> stored fields.

