Hello friendly folks,
i would appreciate any help on the following:
say we have a folder with thousands of html files. Since the file browser
crashes, i am looking at making a script that would do the following:
Distribute all html files in folders, say 001, 002, 003, etc, sorted by the
html file
ipts and use the classifier
> to find hybrid cases. I have quite satisfactory results with this approach
> in a slightly different use case.
>
>
>
>
>
> *From:* George Milten [mailto:george.mil...@gmail.com]
> *Sent:* dinsdag 10 februari 2015 16:09
>
> *To:* Kool,Wou
. I have not tested
> it, but I suppose /[^\p{Latin}]/ would match any non-latin characters. So you
> find the character class that most characters match and you look for the
> exceptions. Would that help?
>
>
>
>
>
>
>
> *From:* George Milten [mailto:george.mil...@gma
e:
> https://c.na8.content.force.com/servlet/servlet.ImageServer?id=015C00227Uz&oid=00D8000ZRv8&lastMod=140984368]*
> <http://www.oclc.org/>
>
>
>
>
>
>
>
>
>
>
>
> *From:* George Milten [mailto:george.mil...@gmail.com]
> *Sent:* di
Hello friendly folks,
follows what i am trying to do, and i am looking for your help in order to
find the most clever way to achieve this:
We have records, that include typos like this: we have a word say Plato,
where the last o is inputted with the keyboard set to Greek language, so we
need some