Ja Ga wrote:
> I was investigating a better version of Special:Lonelypages (ignore
> disambig pages, exclude the already tagged, identify 1- and 2- link
> orphans as well as no-link orphans, etc.) but when I run the original
> Lonelypages SQL query on nightshade against enwiki_p it times out after
> 10 minutes.
>
> SELECT page_namespace, page_title FROM page LEFT JOIN pagelinks ON
> page_namespace=pl_namespace AND page_title=pl_title WHERE pl_namespace
> IS NULL AND page_namespace=".NS_MAIN." AND page_is_redirect=0;

I think the problem is simply that, no matter how you optimize it, this is a slow query: it has to go through every page in the main namespace and check if it has incoming links or not. Adding a limit might help, but not much: unlinked pages are fairly rare, so it still has to search through a _lot_ of pages just to find a few dozen unlinked ones.

Anyway, I just tried it with "LIMIT 50", and it finished in two minutes. So it certainly works, it just takes time. I'd suggest just running it with /* SLOW_OK */ and session transaction isolation level set to "READ UNCOMMITTED".

-- 
Ilmari Karonen
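
For reference, a minimal sketch of how the suggestion above might look in a MySQL session against enwiki_p. Several details are assumptions rather than anything stated in the thread: the PHP constant NS_MAIN is replaced by its numeric value 0, the SLOW_OK comment is placed right after SELECT (a leading comment may be stripped by the client before the server sees it), and LIMIT 50 is simply the value from the test run mentioned above.

```sql
-- Read without a consistent snapshot, for this session only.
-- Results may include rows from uncommitted transactions.
SET SESSION TRANSACTION ISOLATION LEVEL READ UNCOMMITTED;

SELECT /* SLOW_OK */ page_namespace, page_title
FROM page
LEFT JOIN pagelinks
  ON page_namespace = pl_namespace
 AND page_title = pl_title
WHERE pl_namespace IS NULL      -- no pagelinks row matched: no incoming links
  AND page_namespace = 0        -- assumed numeric value of NS_MAIN
  AND page_is_redirect = 0
LIMIT 50;                       -- stop after the first 50 orphans found
```

The LIMIT only bounds the number of rows returned. Since orphans are rare, the server still scans a large share of the main namespace before it finds 50 of them.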
