[
https://issues.apache.org/jira/browse/LUCENE-8764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16890881#comment-16890881
]
Leonardo Menezes commented on LUCENE-8764:
------------------------------------------
Hi,
I have a first running version of this.
On the UI side, this looks pretty clumsy right now, but as I actually wanted to
raise the question on where this should actually be located, I left as it is
for now. The way I see, this requires at least:
- field (which right now just uses the table used for top terms)
- destination file (to make things simpler, we could avoid this and dump to
current working directory + /export/\{timestamp}_\{field}.out for example)
but potentially could include more options, like include/do not include term
frequency and so on. Having all of these on existing windows will add a lot of
noise, so I thought this could simply be exposed through the menubar(like for
Optimise) and create a dedicated popup for this.
Regarding including doc ids for comparing with external files(as the original
ticket mentions)... are docids meaningful at all at for this? Perhaps having a
property read from each document(like, an id) and use that would make more
sense. Anyway, maybe this comes later.
Thoughts and feedback are welcome !/jira/images/icons/emoticons/smile.png!
> Add "export all terms" feature to Luke
> --------------------------------------
>
> Key: LUCENE-8764
> URL: https://issues.apache.org/jira/browse/LUCENE-8764
> Project: Lucene - Core
> Issue Type: Improvement
> Components: modules/luke
> Reporter: Tomoko Uchida
> Priority: Major
> Labels: beginner
> Attachments: LUCENE-8764.patch, Screenshot 2019-07-23 12.29.06.png
>
>
> This is a migrated issue from previous Luke project in GitHub:
> [https://github.com/DmitryKey/luke/issues/3] (There are users' requests so I
> moved this from GitHub to Jira)
> You can browse terms in arbitrary field via Luke GUI, but in some cases
> "exporting all terms (and optionally docids) to a file" feature would be
> useful for further inspection. It might be similar to Solr's terms component.
> As for the user interface, "Export terms" button should be located in
> Overview tab and/or Documents tab.
>
--
This message was sent by Atlassian JIRA
(v7.6.14#76016)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]