utf-8 seems to be the right format. so further books will be utf-8
Am Freitag, 12. Mai 2017 15:09:24 UTC+2 schrieb Andi: > > The upload with ANSI did not work. > Now all this files are unicode. > > What about German books? > > > > Am Freitag, 12. Mai 2017 15:05:32 UTC+2 schrieb Andi: >> >> >> >> Am Freitag, 12. Mai 2017 15:03:05 UTC+2 schrieb Andi: >>> >>> This I did by hand. >>> You can have more of them... >>> >>> of course the Cogus should do all of this by his own - and he will! >>> but today :)..... >>> >>> this is ANSI txt. O.K. or do you prefer utf-8 or something else? >>> >>> >>> >>> >>> >>> >>> Am Freitag, 12. Mai 2017 00:28:31 UTC+2 schrieb linas: >>>> >>>> Hi Andi, >>>> >>>> Yeah, that's ideal. Did you do this with a script, or by hand? in my >>>> ideal world, there's some script that downloads a bunch of these from >>>> project gutenberg, strips out the license boilerplate, and puts them into >>>> some directory. Busting them up into chapters would be nice, too, so that >>>> if cogserver chokes and dies, or I have to kill it, it can pick up where >>>> it >>>> left off, more or less. >>>> >>>> >>>> On Thu, May 11, 2017 at 4:48 AM, Andi <[email protected]> wrote: >>>> >>>>> does something like this help? >>>>> >>>>> >>>>>> It would really really help if someone could find & prepare some >>>>>> clean text of some kind of adventure novels or young-adult lit, or any >>>>>> kind >>>>>> of narrative literature. Maybe from project gutenberg. I've discovered >>>>>> that wikipedia has 3 major faults: >>>>>> >>>>>> >>>> -- You received this message because you are subscribed to the Google Groups "opencog" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/opencog. To view this discussion on the web visit https://groups.google.com/d/msgid/opencog/f3f54eb9-7247-4cb2-b96a-9e8d55b5cfbc%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.
