I found a tool that can convert a PDF textbook to epub, with minimal 
loss/corruption. Do we have an epub import tool lying around? I think Steve 
Schneider did something with epub in the spring.

On Friday, September 1, 2017 at 6:21:26 AM UTC-4, Jeremy Ruston wrote:
>
> The trouble with PDF is that, contrary to expectations, it is actually an 
> image file format, rather than a text document format. In other words, it 
> doesn’t know anything about paragraphs, or headers, or footers; all it 
> knows about are simple instructions to draw a given letter at given 
> coordinates. (Worse than that, some PDFs are actually just embedded 
> bitmaps). 
>
> That means that converting a PDF into a conventional document is more akin 
> to “optical character recognition” than ordinary file format conversion. It 
> takes machine learning or sophisticated heuristics for software to figure 
> out the structural relationships behind the document image. There is some 
> effective software available to do this conversion, but it tends to be 
> expensive because it’s such a hard problem and the capability is so 
> valuable. 
>
> Best wishes 
>
> Jeremy. 
>
>
> > On 1 Sep 2017, at 05:02, TonyM <anthony...@gmail.com <javascript:>> 
> wrote: 
> > 
> > Such a tool would be helpful. 
> > 
> > Personally I would look into tools to turn pdfs into text then import 
> that, because there a many issues going from a highly formatted document 
> type to plain text. Not to mention text inside images where some OCR is 
> needed. 
> > 
> > Foxit reader and Pro is great for pdf work but not sure it will help 
> you. 
> > 
> > Regards 
> > Tony 
> > 
> > -- 
> > You received this message because you are subscribed to the Google 
> Groups "TiddlyWiki" group. 
> > To unsubscribe from this group and stop receiving emails from it, send 
> an email to tiddlywiki+...@googlegroups.com <javascript:>. 
> > To post to this group, send email to tiddl...@googlegroups.com 
> <javascript:>. 
> > Visit this group at https://groups.google.com/group/tiddlywiki. 
> > To view this discussion on the web visit 
> https://groups.google.com/d/msgid/tiddlywiki/4b702ecd-3fbd-4e09-b6db-dd4092ca4000%40googlegroups.com.
>  
>
> > For more options, visit https://groups.google.com/d/optout. 
>
>

-- 
You received this message because you are subscribed to the Google Groups 
"TiddlyWiki" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tiddlywiki+unsubscr...@googlegroups.com.
To post to this group, send email to tiddlywiki@googlegroups.com.
Visit this group at https://groups.google.com/group/tiddlywiki.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tiddlywiki/37505e27-6cfc-4cd3-bb1d-e9bd966ddd15%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to