Scan quality is excellent.

Yes, is a very promising way - my suggestion is, to get always scans in
TIFF (if possible; they are large but USB are large too ...), tro transform
them into an image-only pdf (which is the simpler tool to do this?)  and to
load a copy into Internet Archive specifyng both the library where the book
has been scanned AND the wikisource contribution in scansion/merging
TIFFs/uploading into IA.

Then the excellent OCR -> divu produced by IA can be downloaded and
uploaded into Commons. A good way to share anything, IMHO.

In the meantime: IA produces too an extremely interesting ABBYY.gz output;
it's a xml where a incredible set of interesting data is recorded for any
scanned character. Here an example for a random character of a random IA
book:

<charParams l="1356" t="680" r="1544" b="884" wordStart="false"
wordFromDictionary="true" wordNormal="true" wordNumeric="false"
wordIdentifier="false" charConfidence="25" serifProbability="100"
wordPenalty="0" meanStrokeWidth="347">G</charParams>

Something to explore deeply  IMHO; I presume that less than 1% of usable
ABBYY scan data are wrapped into djvu as OCR layer.

Alex




2013/6/13 Lars Aronsson <[email protected]>

> Some research libraries in Stockholm (at archives and
> museums) have put up book scanners that the public
> can use. They have the same function as a public
> copier, but you get your copies on a USB stick rather
> than on paper.
>
> This opens an interesting opportunity for Wikisource and
> similar volunteer book scanning projects. Instead of
> buying expensive equipment, experimenting with
> cameras and lighting, or building your own scanner,
> you can just visit such a library. I guess you can even
> bring your own book and scan it there, instead of just
> using the library's books. (Of course you still need to
> consider copyright. That goes without saying.)
>
> Wikimedia Sverige, the Swedish chapter of the WMF,
> started a wiki page to document some experience
> from this kind of use, in Swedish of course,
> https://se.wikimedia.org/wiki/**Allm%C3%A4nhetens_bokscanner<https://se.wikimedia.org/wiki/Allm%C3%A4nhetens_bokscanner>
>
> Here is an example of a book that was scanned this way,
> http://runeberg.org/**nordmuseet/1897/0001.html<http://runeberg.org/nordmuseet/1897/0001.html>
> (Ironically, it is the annual report for 1897 of the museum
> where it was scanned. They have the scanner standing in
> their own library, but they have not scanned their own
> reports.)
>
> Are you familiar with anyting similar? Any other pages
> that we should link to?
>
>
> --
>   Lars Aronsson ([email protected])
>
>   Wikimedia Sverige - stöd fri kunskap - http://wikimedia.se/
>
>   Project Runeberg - free Nordic literature - http://runeberg.org/
>
>
>
> ______________________________**_________________
> Wikisource-l mailing list
> [email protected].**org <[email protected]>
> https://lists.wikimedia.org/**mailman/listinfo/wikisource-l<https://lists.wikimedia.org/mailman/listinfo/wikisource-l>
>
_______________________________________________
Wikisource-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikisource-l

Reply via email to