Hello all,
I wrote a script which extracts URLs from the RDF
dump of Open Directory Project (available at http://dmoz.org/rdf.html)
and converts them into urls/domains rules for squidGuard.
Especially, rules extracted from dmoz.org/Adult/ and
dmoz.org/Kids_and_Teens would be useful as they are
quite big and checked by human editors, I hope.
dmozlists/adult/domains 31995 lines
dmozlists/adult/urls 61394 lines
dmozlists/kids_and_teens/domains 5783 lines
dmozlists/kids_and_teens/urls 10003 lines
You can get the script and its output at:
http://www.ingrid.org/~harada/filtering/
Enjoy!
--
Masanori Harada
NTT Network Innovation Laboratories