Re: [Wikitech-l] Bulk link rewrites for HTTP -> HTTPS migration?

2016-01-15 Thread Chris Adams
On Wed, Jan 13, 2016 at 12:47 PM, Legoktm wrote: > > When that work completes, we'll have somewhere around half a million > links > > which differ only in the URL scheme. What would be the best way to > rewrite > > all of those URLs? I'd like to reduce the window

Re: [Wikitech-l] Bulk link rewrites for HTTP -> HTTPS migration?

2016-01-15 Thread Oliver Keyes
I imagine you would need to go through the process, yep, since it's kind of a lot of edits that'd need clearing up if something went wrong. On 15 January 2016 at 13:32, Chris Adams wrote: > On Wed, Jan 13, 2016 at 12:47 PM, Legoktm > wrote: >

Re: [Wikitech-l] Bulk link rewrites for HTTP -> HTTPS migration?

2016-01-13 Thread Max Semenik
Fix them with a bot, for example AWB . On Wed, Jan 13, 2016 at 9:09 AM, Chris Adams wrote: > I've been working with a number of colleagues getting ready to turn HTTPS > on by default for various loc.gov domains. This

Re: [Wikitech-l] Bulk link rewrites for HTTP -> HTTPS migration?

2016-01-13 Thread Chris Adams
On Wed, Jan 13, 2016 at 1:49 PM, Risker wrote: > Before properly answering this question, it's important to know how many > links we're talking about. If it's 5000, the fallout is probably > manageable; but if it's in the hundreds of thousands on any project (most > likely

Re: [Wikitech-l] Bulk link rewrites for HTTP -> HTTPS migration?

2016-01-13 Thread Risker
Before properly answering this question, it's important to know how many links we're talking about. If it's 5000, the fallout is probably manageable; but if it's in the hundreds of thousands on any project (most likely enwiki) there will be renting of garments and gnashing of teeth. All those

Re: [Wikitech-l] Bulk link rewrites for HTTP -> HTTPS migration?

2016-01-13 Thread Oliver Keyes
Question; are LOC links handled in a standardised way using a template? Because if so this could be one change, not hundreds of thousands. (If it's not I'd really suggest using the same edit sets and opportunity to restructure them that way, if LOC links are consistent enough for it to be done.

Re: [Wikitech-l] Bulk link rewrites for HTTP -> HTTPS migration?

2016-01-13 Thread Chris Adams
On Wed, Jan 13, 2016 at 12:47 PM, Legoktm wrote: > You can use Pywikbot's replace.py[1], which lets you provide regex > find/replace and can get a list of pages from the API equivalent of > Special:LinkSearch. > Thanks - I'll look into that as we get various batches

[Wikitech-l] Bulk link rewrites for HTTP -> HTTPS migration?

2016-01-13 Thread Chris Adams
I've been working with a number of colleagues getting ready to turn HTTPS on by default for various loc.gov domains. This has been fairly successful and we're working through the old legacy apps now. When that work completes, we'll have somewhere around half a million links which differ only in

Re: [Wikitech-l] Bulk link rewrites for HTTP -> HTTPS migration?

2016-01-13 Thread Legoktm
On 01/13/2016 09:09 AM, Chris Adams wrote: > I've been working with a number of colleagues getting ready to turn HTTPS > on by default for various loc.gov domains. This has been fairly successful > and we're working through the old legacy apps now. Awesome! > When that work completes, we'll have

Re: [Wikitech-l] Bulk link rewrites for HTTP -> HTTPS migration?

2016-01-13 Thread P. Josepherum
If you use Apache, a rewrite rule is the simplest approach and instructions can be found by searching for "rewrite http to https Apache". A similar process will work with nginx. On Wed, 13 Jan 2016, 17:09 Chris Adams wrote: > I've been working with a number of colleagues