[Bug 485745]

2016-04-14 Thread Jason Crain
*** This bug has been marked as a duplicate of bug 93299 ***

-- 
You received this bug notification because you are a member of Ubuntu
Desktop Bugs, which is subscribed to evince in Ubuntu.
https://bugs.launchpad.net/bugs/485745

Title:
  Evince presents PDF with mis-rendered Hebrew

To manage notifications about this bug go to:
https://bugs.launchpad.net/evince/+bug/485745/+subscriptions

-- 
desktop-bugs mailing list
desktop-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/desktop-bugs


[Bug 485745]

2016-03-23 Thread Jason Crain
(In reply to Thomas Freitag from comment #5)
> May You have a look at bug 43488 and bug 41104 if Your patch of bug 93299
> fixes them, too? And mark them as duplicate if so?

No, my patch does not fix either of those.  The files in both of those
bugs use glyph IDs directly to look up glyphs, which is completely
insane for a non-embedded font because glyph IDs are not in any way
portable between fonts.  To get them to work you could install the exact
same Arial font: not just a similar or compatible font like Liberation
Sans, but Microsoft's own Arial font.  Installing msttcorefonts should
work.  You might also have to add a fontconfig rule to match ArialMT to
Arial.



[Bug 485745]

2016-03-23 Thread Jason Crain
This file would be fixed by my patch in bug 93299.  The patch gives
fontconfig some hints about the character encoding so it can do a better
job of choosing an appropriate font.



[Bug 667752]

2016-02-21 Thread Jason Crain
The "narrow text" part of this should be fixed with bug 94054.

-- 
You received this bug notification because you are a member of Ubuntu
Desktop Bugs, which is subscribed to evince in Ubuntu.
https://bugs.launchpad.net/bugs/667752

Title:
  Non-embedded standard fonts in PDF files are not displayed/rendered
  correctly

To manage notifications about this bug go to:
https://bugs.launchpad.net/evince/+bug/667752/+subscriptions



[Bug 33288]

2016-02-19 Thread Jason Crain
Created attachment 121848
Cache result of inner loop in visitDepthFirst

This is an alternative to Brian's patch in comment 65.  This speeds up
the visitDepthFirst function by caching the result in the inner loop.
This provides a similar speedup without changing the output of
pdftotext.

-- 
You received this bug notification because you are a member of Ubuntu
Desktop Bugs, which is subscribed to the bug report.
https://bugs.launchpad.net/bugs/33288

Title:
  Evince doesn't handle columns properly

To manage notifications about this bug go to:
https://bugs.launchpad.net/poppler/+bug/33288/+subscriptions



[Bug 1456616]

2015-08-04 Thread Jason Crain
*** This bug has been marked as a duplicate of bug 66229 ***

-- 
You received this bug notification because you are a member of Ubuntu
Desktop Bugs, which is subscribed to evince in Ubuntu.
https://bugs.launchpad.net/bugs/1456616

Title:
  unable to rotate a pdf coming from Firefox (Google maps)

To manage notifications about this bug go to:
https://bugs.launchpad.net/evince/+bug/1456616/+subscriptions



[Bug 116453]

2015-04-22 Thread Jason Crain
*** Bug 66569 has been marked as a duplicate of this bug. ***

-- 
You received this bug notification because you are a member of Ubuntu
Desktop Bugs, which is subscribed to poppler in Ubuntu.
https://bugs.launchpad.net/bugs/116453

Title:
  evince can not find ü in attached PDF

To manage notifications about this bug go to:
https://bugs.launchpad.net/poppler/+bug/116453/+subscriptions


[Bug 39890]

2015-04-19 Thread Jason Crain
*** Bug 13441 has been marked as a duplicate of this bug. ***

-- 
You received this bug notification because you are a member of Ubuntu
Desktop Bugs, which is subscribed to the bug report.
https://bugs.launchpad.net/bugs/39890

Title:
  characters change when selecting text

To manage notifications about this bug go to:
https://bugs.launchpad.net/poppler/+bug/39890/+subscriptions



[Bug 39890]

2015-04-19 Thread Jason Crain
*** Bug 9608 has been marked as a duplicate of this bug. ***



[Bug 116453]

2015-04-15 Thread Jason Crain
(In reply to Nelson Benitez from comment #13)
> Hi Jason, thank you very much for the patch, btw, today I was reading this
> pdf:
> 
> http://www.compsci.hunter.cuny.edu/~sweiss/course_materials/csci493.70/lecture_notes/GTK_textview.pdf
> 
> and noticed that lot of words with double f,  like 'buffer', are not
> found[1] when searching for it, also when copied to gedit it shows the
> unicode not found glyph inplace of the 'ff' in the word.
> 
> So, is your patch covering this double f case? 

No, it does not fix that.  That file has a different problem and I don't
see a way of fixing it.  The PDF creator would need to add some extra
information before we could guess that character code 27 should be a
double f.
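
To illustrate why the character code alone is not enough, here is a
minimal Python sketch.  It assumes the "extra information" would be a
code-to-Unicode table supplied with the font (in PDF terms, something
like a ToUnicode map).  The decode helper, the ASCII fallback table and
the code sequence for 'buffer' are hypothetical and only for
illustration; none of this is poppler code.

```python
import unicodedata

REPLACEMENT = "\ufffd"  # what a viewer might show for an unmappable code


def decode(codes, extra_map=None):
    """Map font-specific character codes to Unicode text.

    Assumption for this sketch: codes 32..126 happen to coincide with
    ASCII, and anything else is unknown unless extra_map supplies it.
    """
    table = {c: chr(c) for c in range(32, 127)}
    table.update(extra_map or {})
    return "".join(table.get(c, REPLACEMENT) for c in codes)


# Hypothetical code sequence for "buffer" where the font uses code 27
# for its "ff" ligature glyph.
codes = [98, 117, 27, 101, 114]

# Without extra information, code 27 cannot be mapped to anything.
without_map = decode(codes)

# With a mapping from code 27 to U+FB00 (LATIN SMALL LIGATURE FF), the
# text is recoverable, and NFKC normalization expands the ligature.
with_map = unicodedata.normalize("NFKC", decode(codes, {27: "\ufb00"}))
```

In the first case a search for 'buffer' has nothing to match against;
in the second the extracted text is plain 'buffer'.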


[Bug 116453]

2015-03-24 Thread Jason Crain
Created attachment 114485
Combine base characters and diacritical marks

My attempt to improve this.

When you make a diacriticized character with LaTeX, ü for example, it
will make a PDF with separate u and ¨ characters and draw them over each
other.  This patch detects when this happens and converts it to a
combining character sequence so that pdftotext and the search function
will see a ü and not separate characters.  Also refactors some code
(TextWord::ensureCapacity and TextWord::setInitialBounds) to avoid
duplication.
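
The combining step can be sketched roughly as follows.  This is a
simplified Python illustration, not the attached patch (which is C++
inside poppler).  The glyph representation as (character, x_min, x_max)
tuples, the overlap test and the small table of diacritics are all
assumptions made for the example.

```python
import unicodedata

# Spacing diacritics that may show up as separate glyphs, mapped to the
# corresponding Unicode combining marks.  Illustrative subset only.
SPACING_TO_COMBINING = {
    "\u00a8": "\u0308",  # DIAERESIS -> COMBINING DIAERESIS
    "\u00b4": "\u0301",  # ACUTE ACCENT -> COMBINING ACUTE ACCENT
    "\u0060": "\u0300",  # GRAVE ACCENT -> COMBINING GRAVE ACCENT
    "\u02c6": "\u0302",  # MODIFIER CIRCUMFLEX -> COMBINING CIRCUMFLEX
    "\u02dc": "\u0303",  # SMALL TILDE -> COMBINING TILDE
}


def combine_diacritics(glyphs):
    """glyphs: list of (char, x_min, x_max) tuples in drawing order.

    A spacing diacritic whose horizontal center falls inside the extent
    of a neighbouring base glyph is turned into a combining mark placed
    after that base.  Anything else is passed through unchanged.
    """
    marks = {}        # index of base glyph -> combining marks to append
    consumed = set()  # indices of diacritic glyphs merged into a base
    for i, (ch, x0, x1) in enumerate(glyphs):
        if ch not in SPACING_TO_COMBINING:
            continue
        center = (x0 + x1) / 2
        # The diacritic may be drawn before or after its base glyph.
        for j in (i + 1, i - 1):
            if not 0 <= j < len(glyphs):
                continue
            base, bx0, bx1 = glyphs[j]
            if base not in SPACING_TO_COMBINING and bx0 <= center <= bx1:
                marks.setdefault(j, []).append(SPACING_TO_COMBINING[ch])
                consumed.add(i)
                break
    out = []
    for i, (ch, _, _) in enumerate(glyphs):
        if i not in consumed:
            out.append(ch + "".join(marks.get(i, [])))
    return "".join(out)


# A diaeresis drawn over a 'U', followed by "ber".
word = combine_diacritics([
    ("\u00a8", 1.0, 5.0), ("U", 0.0, 6.0),
    ("b", 6.0, 10.0), ("e", 10.0, 14.0), ("r", 14.0, 17.0),
])
```

Here word is the combining sequence U + U+0308 followed by "ber";
unicodedata.normalize("NFC", word) composes it into the precomposed
form of 'Über'.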

Limitations:

It doesn't handle some of LaTeX's diacritic commands, such as \b for bar
under letter or \d for dot under letter, because they are positioned
differently and \d would be easy to confuse with a period.  They don't
seem to be used very often though.

If the base character is unusual, such as a math symbol or number,
adding a combining character can make the result of pdftotext look a bit
odd.  I think this is because a font or rendering engine that doesn't
know how to draw the character sequence will place the diacritic in a
strange position, such as to the right of the letter.  In these cases
the output of pdftotext is technically correct; it just looks odd when
drawn on screen.

When selecting text in evince, you can select the character and its
diacritic separately.  If that's a problem, I think I could fix it by
adding clustering support so that a group of glyphs and characters is
treated as a single unit.  It would make this a much more invasive
change, but maybe I should try it anyway.  It would be nice to also fix
the assumption that one glyph always corresponds to exactly one
character.


[Bug 116453]

2015-02-03 Thread Jason Crain
Created attachment 113036
[draft] combine characters

I might be able to fix this in a better way by combining letters with
nearby diacritic marks so that this document *would* contain ü.  It
seems to be a nice improvement for some LaTeX documents.  The attached
patch can give you a rough idea of what I mean.  It still needs a lot of
work though.


[Bug 116453]

2015-01-27 Thread Jason Crain
I suppose if I add an option to findText, I should also add a flag
(POPPLER_FIND_IGNORE_COMBINING?) to PopplerFindFlags, for the glib front
end's poppler_page_find_text_with_options().  It would be nice if
someone could confirm that evince would actually use this option.


[Bug 116453] Re: evince can not find ü in attached PDF

2015-01-21 Thread Jason Crain
hdante: the problem is that, despite appearances, the PDF in the bug
description does not contain the word 'Über'.  It contains the word
'Uber', without a diaeresis.  You can see this if you copy and paste from
the document using any PDF reader, including Adobe Reader, Google
Chrome, Foxit, etc.  There is a diaeresis, but it is not really attached
to the 'U'.

Even so, Adobe Reader and Chrome can still find something if you search
the document for 'über'.  What they seem to be doing is ignoring any
diacritic marks, so if you search for 'über' (or even 'ubér') it will
find 'Uber'.  I was proposing similar behavior for poppler.


[Bug 116453]

2015-01-13 Thread Jason Crain
Created attachment 112107
Remove combining characters from normalized text

This patch changes normalization so that combining characters are
removed from the normalized text.  This makes searching through
TextPage::findText insensitive to these characters.
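
As a rough illustration of this kind of normalization, here is a
minimal Python sketch using the standard unicodedata module.  It is not
the patch itself: the function names are made up for the example, and
the case folding is an added assumption so that 'über' can match 'Uber'.

```python
import unicodedata


def normalize_for_search(text):
    """Compatibility-decompose text and drop combining characters."""
    decomposed = unicodedata.normalize("NFKD", text)
    # Keep only characters whose canonical combining class is 0,
    # i.e. drop the diacritics that the decomposition split off.
    base = "".join(c for c in decomposed if unicodedata.combining(c) == 0)
    # Assumption for this sketch: searches are case-insensitive.
    return base.casefold()


def find(haystack, needle):
    """True if needle occurs in haystack, ignoring diacritics and case."""
    return normalize_for_search(needle) in normalize_for_search(haystack)


found_umlaut = find("Uber die Grenzen", "über")
found_acute = find("Uber die Grenzen", "ubér")
```

Both searches succeed because the needle and the page text normalize to
the same base characters.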

Also, renames unicodeNormalizeNFKC to unicodeNormalizeSearch to make it
clear it's no longer doing a regular NFKC normalization.

Renames decomp_compat to decomp_compat_base because it now strips
combining characters, leaving only base characters, in addition to
compatibility decomposition.

Removes UnicodeCompTables.h and some compose functions.  They're no
longer needed since we're not recomposing the characters.

I'm not sure if UnicodeTypeTable.h and UnicodeCompTables.h are
considered part of the public interface.  They're included in the xpdf
headers.  Albert, is it OK to change these files in this way?


[Bug 808894]

2014-12-19 Thread Jason Crain
*** This bug has been marked as a duplicate of bug 46603 ***

-- 
You received this bug notification because you are a member of Ubuntu
Desktop Bugs, which is subscribed to poppler in Ubuntu.
https://bugs.launchpad.net/bugs/808894

Title:
  Certain characters are not rendered correctly when selected
  (highlighted)

To manage notifications about this bug go to:
https://bugs.launchpad.net/poppler/+bug/808894/+subscriptions
