Re: Removing duplication: Word lists of common words in languages

2015-01-16 Thread Ben Finney
Ben Finney ben+deb...@benfinney.id.au writes: Where is a good authoritative source of such words, by frequency, for various natural languages, suitable for inclusion in Debian as a data package? The package ‘scowl’ URL:https://packages.debian.org/sid/scowl looks like a good candidate already

Re: Removing duplication: Word lists of common words in languages

2014-11-12 Thread Ian Jackson
Ben Finney writes (Re: Removing duplication: Word lists of common words in languages): Ian Jackson ijack...@chiark.greenend.org.uk writes: I had roughly this question in 2013, and found the answer. Here is probably the best starting point: http://www.chiark.greenend.org.uk/ucgi

Re: Removing duplication: Word lists of common words in languages

2014-11-11 Thread Simon McVittie
On 10/11/14 23:16, Ben Finney wrote: To avoid duplicating these “the N most common words, ranked by frequency, for language FOO” For a password generator you ideally want the word-list to be sorted alphabetically, so that it's trivial to verify by eye that there are no duplicates. Duplicate

Re: Removing duplication: Word lists of common words in languages

2014-11-11 Thread Ben Finney
Simon McVittie s...@debian.org writes: On 10/11/14 23:16, Ben Finney wrote: To avoid duplicating these “the N most common words, ranked by frequency, for language FOO” For a password generator you ideally want the word-list to be sorted alphabetically, so that it's trivial to verify by

Re: Removing duplication: Word lists of common words in languages

2014-11-11 Thread Ian Jackson
Ben Finney writes (Re: Removing duplication: Word lists of common words in languages): Where is a good authoritative source of such words, by frequency, for various natural languages, suitable for inclusion in Debian as a data package? I had roughly this question in 2013, and found the answer

Re: Removing duplication: Word lists of common words in languages

2014-11-11 Thread Ben Finney
Ian Jackson ijack...@chiark.greenend.org.uk writes: I had roughly this question in 2013, and found the answer. Here is probably the best starting point: http://www.chiark.greenend.org.uk/ucgi/~ijackson/git?p=evade-mail-usrlocal.git;a=blob;f=lemma.al-permission.mbox Great! That asks for

Removing duplication: Word lists of common words in languages (was: Bug#768772: ITP: xkcdpass …)

2014-11-10 Thread Ben Finney
On 10-Nov-2014, Jonas Smedegaard wrote: Crypt::XkcdPassword by default uses the most commonly used words in film scripts and television shows, and documents examples of adaptations at https://metacpan.org/pod/Crypt::XkcdPassword::Examples. Thank you, it's good to know these exist. I don't