[jira] [Commented] (PDFBOX-4101) Word ordering / line detection failures in text extraction

2018-02-06 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16354990#comment-16354990 ] Tilman Hausherr commented on PDFBOX-4101: There is no fixed rule that the sort mode is better

[jira] [Commented] (PDFBOX-4101) Word ordering / line detection failures in text extraction

2018-02-06 Thread Alexandre (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16354619#comment-16354619 ] Alexandre commented on PDFBOX-4101: I understand what you said! Well, yes I used the unsorted mode.

[jira] [Commented] (PDFBOX-4101) Word ordering / line detection failures in text extraction

2018-02-06 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16354611#comment-16354611 ] Tilman Hausherr commented on PDFBOX-4101: I assume this was created in the unsorted mode. That

[jira] [Commented] (PDFBOX-4101) Word ordering / line detection failures in text extraction

2018-02-06 Thread Alexandre (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16354601#comment-16354601 ] Alexandre commented on PDFBOX-4101: It does recognize columns but I don't have a clue which algorithm

[jira] [Commented] (PDFBOX-4101) Word ordering / line detection failures in text extraction

2018-02-06 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16354597#comment-16354597 ] Tilman Hausherr commented on PDFBOX-4101: Try the sort option... however you still won't be