Re: Wiki migration and clean-up

Allen Wittenauer Thu, 28 Jul 2016 11:08:30 -0700

        I hope you folks are aware that this is much more intensive than just 
moving a bunch of documents.  Lots of wiki pages are referenced in the source 
code, including in user-facing error messages.



> On Jul 28, 2016, at 10:47 AM, Ray Chiang <[email protected]> wrote:
> 
> Thanks Martin.  I did ask on INFRA-12342, and it looks like Confluence Wiki 
> is the recommended "latest and greatest".
> 
> Here's my proposal as it currently stands:
> 
> 1) Move to Confluence Wiki.
> 
> 2) Move all the Industry/meetup to a single page with a small set of external 
> links.  This will be mostly of the form, "if you want to know more you can 
> get started with...".
> 
> 3) Have one other page for users just getting started.  The updated IRC 
> information, mailing lists, and the fact that JIRA isn't for user support 
> will go here.
> 
> 4) Keep and reorganize the more detailed technical information (developers, 
> advanced users, and admins) on the Wiki.  For this, I have no doubt I'll be 
> copying large chunks of the old Wiki, but likely updating any pre-branch-2 
> information.
> 
> 5) Once everything is moved, organized, and gets enough +1's from the 
> community, update the pointers to the new Wiki and obsolete the old one.
> 
> Any further discussion is still welcome.
> 
> -Ray
> 
> 
> On 7/27/16 12:08 PM, Martin Rosse wrote:
>> Hi Ray,
>> 
>> The migration is much needed, and thanks for initiating it.
>> 
>> Regarding approaches to cleaning up the Wiki content--my 2 cents is in
>> favor an approach similar to the Spark cwiki:
>> 
>> https://cwiki.apache.org/confluence/display/SPARK/Wiki+Homepage
>> 
>> My take is that the Hadoop product docs on hadoop.apache.org generally
>> target (or should target) the audiences you describe in 1-4, while the Wiki
>> is (should be) primarily for audience #5 or "Hadoop staff"--internal Hadoop
>> development, product management, QA, etc.
>> 
>> Definitely current Wiki content such as "Overview of Hadoop" and the link
>> to "Single Node Hadoop Cluster" installation is redundant, unnecessary doc
>> maintenance, and annoying to come across as a user because you have to
>> assess its value relative to the same/similar content in the product doc on
>> hadoop.apache.org.
>> 
>> BTW, I did some random testing of ASF project wikis hosted on
>> cwiki.apache.org, and the pages for those sites definitely load much, much
>> faster than ASF wiki pages using MoinMoin. Clearly no surprise.
>> 
>> Best,
>> Martin
>> 
>> 
>> On Wed, Jul 27, 2016 at 10:29 AM, Ray Chiang <[email protected]> wrote:
>> 
>>> Good to know.  It's certainly easier to set up an alternate location in
>>> any case and then do a wholesale migration.  It saves from having that
>>> "under construction" look before it's complete.
>>> 
>>> I'll get on the appropriate infra@ list and ask about recommendations.
>>> 
>>> -Ray
>>> 
>>> 
>>> On 7/26/16 10:49 PM, Andrew Wang wrote:
>>> 
>>>> Hi Ray, if you're going to do a wiki cleanup, fair warning that I filed
>>>> this INFRA JIRA about the wiki being terribly slow, and they closed it as
>>>> WONTFIX:
>>>> 
>>>> https://issues.apache.org/jira/browse/INFRA-12283
>>>> 
>>>> So if you'd actually like to undertake a wiki cleanup, we should also
>>>> consider migrating the content to a wiki that isn't terribly slow.
>>>> 
>>>> I think cwiki.apache.org is better, but maybe we should ask infra what
>>>> the
>>>> preferred option is here. They might be able to help with a content
>>>> migration too.
>>>> 
>>>> On Tue, Jul 26, 2016 at 3:27 PM, Ray Chiang <[email protected]> wrote:
>>>> 
>>>> Coming in late to an old thread.
>>>>> I was looking around at the Hadoop documentation (hadoop.apache.org and
>>>>> wiki.apache.org/hadoop) and I'd sum up the current state of the
>>>>> documentation as follows:
>>>>> 
>>>>> 1. hadoop.apache.org is pretty clearly full of technical information.
>>>>> My only minor nit here is that the wiki pointer and the Git pointer
>>>>>     at the top is really tiny.
>>>>> 2. wiki.apache.org is simultaneously targeted to at least four audiences
>>>>>      1. Industry Users (broadest sense of Big Data Industry)
>>>>>      2. Industry Developers (mostly those adding a layer like Hive does
>>>>>         to MapReduce)
>>>>>      3. Hadoop Users (those who just want to set up a small cluster)
>>>>>      4. Hadoop Developers (e.g. using MapReduce APIs)
>>>>>      5. Hadoop Internal Developers (eventual contributors)
>>>>> 
>>>>> I'd like to initiate some cleanup of the wiki, but before I even start,
>>>>> I'd like to see if anyone has constructive suggestions or other
>>>>> approaches
>>>>> that would make this transition smoother.
>>>>> 
>>>>> 1. Some sections, like Industry Users and Industry Developers is
>>>>>     growing so fast, I'm not sure whether it's worth maintaining in any
>>>>>     meaningful format. I'd be inclined to make suggestions on where to
>>>>>     start and let Google take them forward from there.
>>>>> 2. Organize the developer section based on the pieces a new reader
>>>>>     wants to learn (new to everything, new to Hadoop, all the tools for
>>>>>     Hadoop development, "just check out code and go", etc).
>>>>> 3. Organize the Users section a bit more.  The "Setting up a Hadoop
>>>>>     Cluster" is grouped well, but I'd perhaps rearrange the ordering a
>>>>> bit.
>>>>> 
>>>>> -Ray
>>>>> 
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]
> 


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: Wiki migration and clean-up

Reply via email to