Perhaps all of you already know this, but I only recently discovered that pdf2djvu converts a *searchable PDF* into a *searchable DjVu* (i.e. it carries over whatever is in the PDF, active links and metadata too), and I'd like to share my discovery. Conversion is extremely simple. Unfortunately, we use only a small part of the DjVu text data: usually just the whole, unmapped text, the only exception being the hOCR tool by Phe, which outputs mapped text.
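For anyone who wants to try it, here is a minimal sketch of the conversion and of the difference between unmapped and mapped text. It assumes that pdf2djvu and DjVuLibre's djvutxt are installed and on the PATH; the function names and the file name `book.pdf` are only illustrative, not part of any existing tool, and this is not necessarily how Phe's hOCR tool obtains its mapped text.

```python
import shutil
import subprocess
from pathlib import Path


def pdf2djvu_command(pdf_path, djvu_path):
    """Build the pdf2djvu call that writes one bundled DjVu file (-o)."""
    return ["pdf2djvu", "-o", str(djvu_path), str(pdf_path)]


def djvutxt_command(djvu_path, detail=None):
    """Build a djvutxt call.

    Without detail, djvutxt prints the plain (unmapped) text.
    With detail (e.g. "word"), it prints the text together with
    bounding-box coordinates, i.e. the mapped text.
    """
    cmd = ["djvutxt"]
    if detail is not None:
        cmd.append(f"--detail={detail}")
    cmd.append(str(djvu_path))
    return cmd


def convert_and_dump(pdf_path, detail=None):
    """Convert a searchable PDF to DjVu and return its text layer."""
    # Fail early with a clear message if a tool is missing (assumption:
    # both tools are expected to be reachable through PATH).
    for tool in ("pdf2djvu", "djvutxt"):
        if shutil.which(tool) is None:
            raise RuntimeError(f"{tool} not found on PATH")
    djvu_path = Path(pdf_path).with_suffix(".djvu")
    subprocess.run(pdf2djvu_command(pdf_path, djvu_path), check=True)
    result = subprocess.run(
        djvutxt_command(djvu_path, detail),
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout
```

Calling `convert_and_dump("book.pdf")` should give the whole unmapped text, while `convert_and_dump("book.pdf", detail="word")` should give one entry per word with its position on the page.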
Alex

2015-10-15 13:45 GMT+02:00 billinghurst <[email protected]>:

> Also to note that User:Dominic was a wikimedian in residence with NARA in
> the States and had a large number of files uploaded, and components of
> transcription project for those uploads.. They have their own template at
> Commons, so you should be able to dig them up.
>
> Regards, Billinghurst
>
> On Thu, Oct 15, 2015 at 9:18 PM Arne Wossink <[email protected]> wrote:
>
>> Hi all,
>>
>> Wikimedia Nederland has recently approached by several institutions that
>> would like to do uploads of source material. Wikisource would be the
>> preferred platform for this as the material would be searchable (which it
>> wouldn't be if it was only uploaded as pdf to Commons).
>>
>> I would like to know if there have been previous projects involving large
>> uploads by institutions, and if there's any documentation on how to proceed
>> with these.
>>
>> Thanks!
>>
>> Arne Wossink
>>
>> Projectleider / Project Lead Wikimedia Nederland
>>
>> Tel. +31 (0)6 11000505
>>
>> *Postadres*: Postbus 167, 3500 AD Utrecht
>> *Bezoekadres*: Mariaplaats 3, Utrecht
_______________________________________________
Wikisource-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikisource-l
