[ https://issues.apache.org/jira/browse/LUCENE-4599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13558245#comment-13558245 ]
Shawn Heisey commented on LUCENE-4599:
--------------------------------------
With the patch, the term vector (TV) files are 53.2% of their old size.
New TV files total 3890809739 bytes.
Old TV files total 7311612548 bytes.
{noformat}
Unmodified Solr 4.1:
total 17154140
drwxr-xr-x 2 ncindex ncindex 45056 Jan 19 21:35 ./
drwxr-xr-x 4 ncindex ncindex 4096 Jan 18 20:15 ../
-rw-r--r-- 1 ncindex ncindex 99 Jan 19 21:28 segments_dt
-rw-r--r-- 1 ncindex ncindex 20 Jan 19 21:28 segments.gen
-rw-r--r-- 1 ncindex ncindex 3220362314 Jan 19 21:11 _uk.fdt
-rw-r--r-- 1 ncindex ncindex 1796091 Jan 19 21:11 _uk.fdx
-rw-r--r-- 1 ncindex ncindex 3291 Jan 19 21:28 _uk.fnm
-rw-r--r-- 1 ncindex ncindex 2712855241 Jan 19 21:23 _uk_Lucene41_0.doc
-rw-r--r-- 1 ncindex ncindex 2641242950 Jan 19 21:23 _uk_Lucene41_0.pos
-rw-r--r-- 1 ncindex ncindex 1605874308 Jan 19 21:23 _uk_Lucene41_0.tim
-rw-r--r-- 1 ncindex ncindex 35091811 Jan 19 21:23 _uk_Lucene41_0.tip
-rw-r--r-- 1 ncindex ncindex 115 Jan 19 21:28 _uk_nrm.cfe
-rw-r--r-- 1 ncindex ncindex 36874222 Jan 19 21:28 _uk_nrm.cfs
-rw-r--r-- 1 ncindex ncindex 473 Jan 19 21:28 _uk.si
-rw-r--r-- 1 ncindex ncindex 24581897 Jan 19 21:28 _uk.tvd
-rw-r--r-- 1 ncindex ncindex 7090368538 Jan 19 21:28 _uk.tvf
-rw-r--r-- 1 ncindex ncindex 196662113 Jan 19 21:28 _uk.tvx
Solr 4.1 with patch:
total 13812100
drwxr-xr-x 2 ncindex ncindex 53248 Jan 20 06:10 ./
drwxr-xr-x 4 ncindex ncindex 4096 Jan 18 20:15 ../
-rw-r--r-- 1 ncindex ncindex 3220492130 Jan 20 05:54 _1oy.fdt
-rw-r--r-- 1 ncindex ncindex 1790533 Jan 20 05:54 _1oy.fdx
-rw-r--r-- 1 ncindex ncindex 3291 Jan 20 06:10 _1oy.fnm
-rw-r--r-- 1 ncindex ncindex 2713448546 Jan 20 06:08 _1oy_Lucene41_0.doc
-rw-r--r-- 1 ncindex ncindex 2640844965 Jan 20 06:08 _1oy_Lucene41_0.pos
-rw-r--r-- 1 ncindex ncindex 1604289094 Jan 20 06:08 _1oy_Lucene41_0.tim
-rw-r--r-- 1 ncindex ncindex 34910618 Jan 20 06:08 _1oy_Lucene41_0.tip
-rw-r--r-- 1 ncindex ncindex 115 Jan 20 06:10 _1oy_nrm.cfe
-rw-r--r-- 1 ncindex ncindex 36874183 Jan 20 06:10 _1oy_nrm.cfs
-rw-r--r-- 1 ncindex ncindex 477 Jan 20 06:10 _1oy.si
-rw-r--r-- 1 ncindex ncindex 3889805695 Jan 20 06:10 _1oy.tvd
-rw-r--r-- 1 ncindex ncindex 1004044 Jan 20 06:10 _1oy.tvx
-rw-r--r-- 1 ncindex ncindex 20 Jan 20 06:10 segments.gen
-rw-r--r-- 1 ncindex ncindex 105 Jan 20 06:10 segments_ul
-rw-r--r-- 1 ncindex ncindex 0 Jan 19 21:39 write.lock
{noformat}
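The TV totals quoted at the top can be re-derived from the two listings. A minimal sketch, assuming the term vector files are the ones with the .tvd, .tvf and .tvx extensions (the patched index has no .tvf file); the dictionary and variable names are only illustrative:

```python
# Term vector file sizes in bytes, copied from the two listings above.
old_tv = {
    "_uk.tvd": 24581897,
    "_uk.tvf": 7090368538,
    "_uk.tvx": 196662113,
}
new_tv = {
    "_1oy.tvd": 3889805695,
    "_1oy.tvx": 1004044,
}

old_total = sum(old_tv.values())
new_total = sum(new_tv.values())
ratio = new_total / old_total

print(f"old TV total: {old_total}")
print(f"new TV total: {new_total}")
print(f"new/old: {ratio:.1%}")
```

This reproduces the 7311612548 and 3890809739 byte totals and the 53.2% figure.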
For the listing below, the _0 and _1 indexes have been swapped; the _1 indexes
are now live.
{noformat}
ncindex@bigindy5 /index/solr4/data $ du -sc *
492 inc_0
609212 inc_1
24 ncmain
17154980 s0_0
13840212 s0_1
17211000 s1_0
13913260 s1_1
17191660 s2_0
13895536 s2_1
17192320 s3_0
13889920 s3_1
17198940 s4_0
13897380 s4_1
17205112 s5_0
13918936 s5_1
187118984 total
{noformat}
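The same comparison can be made per shard from the du output. A rough sketch, assuming the _0 directories hold the unmodified indexes and the _1 directories hold the patched ones (the sizes suggest this, but it is not stated outright), and that du is reporting its default block units rather than bytes:

```python
# du -sc output copied from the listing above (sizes in du's default units).
du_output = """\
492 inc_0
609212 inc_1
24 ncmain
17154980 s0_0
13840212 s0_1
17211000 s1_0
13913260 s1_1
17191660 s2_0
13895536 s2_1
17192320 s3_0
13889920 s3_1
17198940 s4_0
13897380 s4_1
17205112 s5_0
13918936 s5_1
187118984 total
"""

# Map each directory name to its reported size.
sizes = {}
for line in du_output.splitlines():
    size, name = line.split()
    sizes[name] = int(size)

# Shard prefixes s0..s5, taken from the directory names.
shards = sorted({name.rsplit("_", 1)[0] for name in sizes if name.startswith("s")})

for shard in shards:
    before = sizes[f"{shard}_0"]  # assumed: unmodified index
    after = sizes[f"{shard}_1"]   # assumed: patched index
    print(f"{shard}: {after / before:.1%} of the _0 size")

before_total = sum(sizes[f"{s}_0"] for s in shards)
after_total = sum(sizes[f"{s}_1"] for s in shards)
print(f"all shards: {after_total / before_total:.1%} of the _0 size")
```

Under those assumptions, the six _1 shard directories together come to about 80.8% of the _0 ones. The whole-index saving is smaller than the 53.2% seen for the TV files alone because the other index files are essentially unchanged.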
> Compressed term vectors
> -----------------------
>
> Key: LUCENE-4599
> URL: https://issues.apache.org/jira/browse/LUCENE-4599
> Project: Lucene - Core
> Issue Type: Task
> Components: core/codecs, core/termvectors
> Reporter: Adrien Grand
> Assignee: Adrien Grand
> Priority: Minor
> Fix For: 4.2
>
> Attachments: 4599-dataimport-fail.log, 4599-zookeer-fail.log,
> LUCENE-4599.patch, LUCENE-4599.patch, LUCENE-4599.patch, solr.patch
>
>
> We should have codec-compressed term vectors, similar to what we have for
> stored fields.