[ 
https://issues.apache.org/jira/browse/LUCENE-3706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13188828#comment-13188828
 ] 

Robert Muir commented on LUCENE-3706:
-------------------------------------

Here's just a quick comparison of size against term vectors with
the current patch. I can play around and see if we can do better.

{noformat}
EnglishVectors
total 860961
drwx------+ 2 rmuir None     40960 Jan 18 19:03 .
drwx------+ 6 rmuir None         0 Jan 18 18:54 ..
-rwx------+ 1 rmuir None 328731623 Jan 18 19:03 _f.fdt
-rwx------+ 1 rmuir None   1004692 Jan 18 19:03 _f.fdx
-rwx------+ 1 rmuir None        31 Jan 18 19:02 _f.fnm
-rwx------+ 1 rmuir None        63 Jan 18 19:03 _f.per
-rwx------+ 1 rmuir None    251106 Jan 18 19:03 _f.tvd
-rwx------+ 1 rmuir None 417626058 Jan 18 19:03 _f.tvf
-rwx------+ 1 rmuir None   2009380 Jan 18 19:03 _f.tvx
-rwx------+ 1 rmuir None  50348161 Jan 18 19:03 _f_0.frq
-rwx------+ 1 rmuir None  75146262 Jan 18 19:03 _f_0.prx
-rwx------+ 1 rmuir None   6164520 Jan 18 19:03 _f_0.tim
-rwx------+ 1 rmuir None    146233 Jan 18 19:03 _f_0.tip
-rwx------+ 1 rmuir None        31 Jan 18 19:03 _f_nrm.cfe
-rwx------+ 1 rmuir None    125608 Jan 18 19:03 _f_nrm.cfs
-rwx------+ 1 rmuir None        20 Jan 18 19:03 segments.gen
-rwx------+ 1 rmuir None       265 Jan 18 19:03 segments_1

EnglishOffsets
total 552569
drwx------+ 2 rmuir None     28672 Jan 18 19:08 .
drwx------+ 7 rmuir None      4096 Jan 18 19:06 ..
-rwx------+ 1 rmuir None 328731623 Jan 18 19:07 _s.fdt
-rwx------+ 1 rmuir None   1004692 Jan 18 19:07 _s.fdx
-rwx------+ 1 rmuir None        31 Jan 18 19:07 _s.fnm
-rwx------+ 1 rmuir None        63 Jan 18 19:08 _s.per
-rwx------+ 1 rmuir None  52354303 Jan 18 19:08 _s_0.frq
-rwx------+ 1 rmuir None 177235787 Jan 18 19:08 _s_0.prx
-rwx------+ 1 rmuir None   6181626 Jan 18 19:08 _s_0.tim
-rwx------+ 1 rmuir None    146248 Jan 18 19:08 _s_0.tip
-rwx------+ 1 rmuir None        31 Jan 18 19:08 _s_nrm.cfe
-rwx------+ 1 rmuir None    125608 Jan 18 19:08 _s_nrm.cfs
-rwx------+ 1 rmuir None        20 Jan 18 19:08 segments.gen
-rwx------+ 1 rmuir None       265 Jan 18 19:08 segments_1
{noformat}
                
> add offsets into lucene40 postings
> ----------------------------------
>
>                 Key: LUCENE-3706
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3706
>             Project: Lucene - Java
>          Issue Type: New Feature
>    Affects Versions: 4.0
>            Reporter: Robert Muir
>         Attachments: LUCENE-3706.patch
>
>
> LUCENE-3684 added support for 
> IndexOptions.DOCS_AND_FREQS_AND_POSITIONS_AND_OFFSETS, but
> only SimpleText implements it.
> I think we should implement it in the other 4.0 codecs (starting with 
> Lucene40PostingsFormat).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to