On 17 December 2010 21:18, Joseph Reagle <joseph.2...@reagle.org> wrote:
> On Thursday, December 16, 2010, Federico Leva (Nemo) wrote:
>> I have the first 10K edits up reconstructed in their various pages at:
>>    http://cyber.law.harvard.edu/~reagle/wp-redux/
>
> I fixed some of the encoding issues. The DB dump contained different 
> encodings. So, the encoding of each diff in the dump is independently now 
> guessed using Python's CharDet (Universal Encoding Detector) library.
>
> So now you can read up on the few "accented" topics in the early Wikipedia 
> including: Göteborg, Köpenhamn, and Křbenhavn.

Should probably be København and not Křbenhavn

/Martin

_______________________________________________
WikiEN-l mailing list
WikiEN-l@lists.wikimedia.org
To unsubscribe from this mailing list, visit:
https://lists.wikimedia.org/mailman/listinfo/wikien-l

Reply via email to