Check out the OpenThreads project[1][2]. In addition to being just generally awesome, it's also a starting point if you want to parse out the topics in an entire mailing list. It was written by a colleague who's very interested in gender diversity in tech, so the project is mostly known for researching what pronouns are used most often on LibTech, and also for posting a list of the most common names mentioned (the top ten are all dudes).
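
For a rough idea of what the pronoun counting could look like, here is a minimal sketch. It is not OpenThreads' own code. It assumes the list archive is available as an mbox file and uses only Python's standard library; the pronoun list and the function names (`strip_quoted`, `count_terms`, `mbox_bodies`) are made up for illustration.

```python
import mailbox
import re
from collections import Counter

# Hypothetical term list for illustration; not taken from OpenThreads.
PRONOUNS = {"he", "she", "they", "him", "her", "them", "his", "hers", "their"}

WORD_RE = re.compile(r"[a-z']+")


def strip_quoted(body):
    """Drop quoted reply lines (starting with '>') so replies don't recount old text."""
    return "\n".join(
        line for line in body.splitlines() if not line.lstrip().startswith(">")
    )


def count_terms(bodies, terms=PRONOUNS):
    """Count how often each term in `terms` appears across message bodies."""
    counts = Counter()
    for body in bodies:
        for word in WORD_RE.findall(strip_quoted(body).lower()):
            if word in terms:
                counts[word] += 1
    return counts


def mbox_bodies(path):
    """Yield the text/plain body of each message in an mbox archive (assumed format)."""
    for msg in mailbox.mbox(path):
        for part in msg.walk():
            if part.get_content_type() == "text/plain":
                payload = part.get_payload(decode=True)
                if payload:
                    charset = part.get_content_charset() or "utf-8"
                    yield payload.decode(charset, errors="replace")
```

Something like `count_terms(mbox_bodies("list-archive.mbox")).most_common()` would then give a ranked list, where `list-archive.mbox` is a placeholder path. Swapping the pronoun set for any other vocabulary gets you "commonly used terms"; real topic extraction needs more than word counts.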
Caution: some assembly required.

happy hacking ;)

Griffin

[1] https://github.com/elationfoundation/openThreads/wiki/A-journey-in-Regular-Expressions-and-ListServ%27s
[2] https://github.com/elationfoundation/openThreads

Jayvan Santos wrote:
>
> Is there an open algorithm that keeps a summarization of what normally
> gets posted here? Pointing out the topics, references, commonly used
> terms?
>
> "Joseph Holsten" <[email protected]> wrote:
>
>     It does beat the spam (which might just be stego, but still). +1
>     to dadaist cpunkery.
>     --
>     ~j
>

-- 
"Cypherpunks write code not flame wars." --Jurre van Bergen
#Foucault / PGP: 0xAE792C97 / OTR: [email protected]
My posts are my own, not my employer's.
