On Thu, Mar 26, 2009 at 12:54 PM, Andreas Rindler <[email protected]> wrote: > Hi, > we are trying to extract all URLs in wiki articles from our Mediawiki > installation. We have tried Grep, Perl and Sed on mysql dumps, but it > is very difficult to get the URLs only, without some > garbage/text/comments before or after them. > > Does anyone know of a better way to achieve this? > SELECT * FROM externallinks;
on your wiki database. Bryan _______________________________________________ Wikitech-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikitech-l
