[ 
https://issues.apache.org/jira/browse/LUCENE-8764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16890881#comment-16890881
 ] 

Leonardo Menezes commented on LUCENE-8764:
------------------------------------------

Hi,

I have a first running version of this. 

On the UI side, this looks pretty clumsy right now, but as I actually wanted to 
raise the question on where this should actually be located, I left as it is 
for now. The way I see, this requires at least: 
 - field (which right now just uses the table used for top terms)
 - destination file (to make things simpler, we could avoid this and dump to 
current working directory + /export/\{timestamp}_\{field}.out for example)

but potentially could include more options, like include/do not include term 
frequency and so on. Having all of these on existing windows will add a lot of 
noise, so I thought this could simply be exposed through the menubar(like for 
Optimise) and create a dedicated popup for this.

Regarding including doc ids for comparing with external files(as the original 
ticket mentions)... are docids meaningful at all at for this? Perhaps having a 
property read from each document(like, an id) and use that would make more 
sense. Anyway, maybe this comes later.

Thoughts and feedback are welcome  !/jira/images/icons/emoticons/smile.png!  

> Add "export all terms" feature to Luke
> --------------------------------------
>
>                 Key: LUCENE-8764
>                 URL: https://issues.apache.org/jira/browse/LUCENE-8764
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: modules/luke
>            Reporter: Tomoko Uchida
>            Priority: Major
>              Labels: beginner
>         Attachments: LUCENE-8764.patch, Screenshot 2019-07-23 12.29.06.png
>
>
> This is a migrated issue from previous Luke project in GitHub: 
> [https://github.com/DmitryKey/luke/issues/3] (There are users' requests so I 
> moved this from GitHub to Jira)
> You can browse terms in arbitrary field via Luke GUI, but in some cases 
> "exporting all terms (and optionally docids) to a file" feature would be 
> useful for further inspection. It might be similar to Solr's terms component.
> As for the user interface, "Export terms" button should be located in 
> Overview tab and/or Documents tab.
>  



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to