After some tests and usability improvements, I'm going to launch an English
alpha version.

I still need a cool name for the project, any idea?

Stay tunned.

2012/10/23 emijrp <[email protected]>

> Yes, there are some options: (semi)protections, blocks, spam black lists,
> flaggedrevs, abuse filter and some more. All them are well known MediaWiki
> features and extensions.
>
> Thanks for your interest.
>
>
> 2012/10/23 ENWP Pine <[email protected]>
>
>>
>> I agree that this sounds like an interesting experiment. I hope that you
>> get good faith editors. I worry that you’ll get COI editors playing with
>> the search rankings. Do you have a way in mind to deal with that issue?
>>
>> Pine
>>
>>  *From:* emijrp <[email protected]>
>> *Sent:* Monday, 22 October, 2012 08:29
>> *To:* Research into Wikimedia content and 
>> communities<[email protected]>
>> *Subject:* [Wiki-research-l] A wiki search engine
>>
>> Hi all;
>>
>> I'm starting a new project, a wiki search engine. It uses MediaWiki,
>> Semantic MediaWiki and other minor extensions, and some tricky templates
>> and bots.
>>
>> I remember Wikia Search and how it failed. It had the mini-article thingy
>> for the introduction, and then a lot of links compiled by a crawler. Also
>> something similar to a social network.
>>
>> My project idea (which still needs a cool name) is different. Althought
>> it uses an introduction and images copied from Wikipedia, and some links
>> from the "External links" sections, it is only a start. The purpose is that
>> community adds, removes and orders the results for each term, and creates
>> redirects for similar terms to avoid duplicates.
>>
>> Why this? I think that Google PageRank isn't enough. It is frequently
>> abused by farmlinks, SEOs and other people trying to put their websites
>> above.
>>
>> Search "Shakira" in Google for example. You see 1) Official site, 2)
>> Wikipedia 3) Twitter 4) Facebook, then some videos, some news, some images,
>> Myspace. It wastes 3 or more results in obvious nice sites (WP, TW, FB).
>> The wiki search engine puts these sites in the top, and an introduction and
>> related terms, leaving all the space below to not so obvious but
>> interesting websites. Also, if you search for "semantic queries" like
>> "right-wing newspapers" in Google, you won't find real newspapers but
>> "people and sites discussing about ring-wing newspapers". Or latex and
>> LaTeX being shown in the same results pages. These issues can be resolved
>> with disambiguation result pages.
>>
>> How we choose which results are above or below? The rules are not fully
>> designed yet, but we can put official sites in the first place, then .gov
>> or .edu domains which are important ones, and later unofficial websites,
>> blogs, giving priority to local language, etc. And reaching consensus.
>>
>> We can control aggresive spam with spam blacklists, semi-protect or
>> protect highly visible pages, and use bots or tools to check changes.
>>
>> It obviously has a CC BY-SA license and results can be exported. I think
>> that this approach is the opposite to Google today.
>>
>> For weird queries like "Albert Einstein birthplace" we can redirect to
>> the most obvious results page (in this case Albert Einstein) using a
>> hand-made redirect or by software (some little change in MediaWiki).
>>
>> You can check a pretty alpha version here http://www.todogratix.es (only
>> Spanish by now sorry) which I'm feeding with some bots.
>>
>> I think that it is an interesting experiment. I'm open to your questions
>> and feedback.
>>
>> Regards,
>> emijrp
>>
>> --
>> Emilio J. Rodríguez-Posada. E-mail: emijrp AT gmail DOT com
>> Pre-doctoral student at the University of Cádiz (Spain)
>> Projects: AVBOT <http://code.google.com/p/avbot/> | 
>> StatMediaWiki<http://statmediawiki.forja.rediris.es>
>> | WikiEvidens <http://code.google.com/p/wikievidens/> | 
>> WikiPapers<http://wikipapers.referata.com>
>> | WikiTeam <http://code.google.com/p/wikiteam/>
>> Personal website: https://sites.google.com/site/emijrp/
>>
>>  ------------------------------
>> _______________________________________________
>> Wiki-research-l mailing list
>> [email protected]
>> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>>
>>
>> _______________________________________________
>> Wiki-research-l mailing list
>> [email protected]
>> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>>
>>
>
>
> --
> Emilio J. Rodríguez-Posada. E-mail: emijrp AT gmail DOT com
> Pre-doctoral student at the University of Cádiz (Spain)
> Projects: AVBOT <http://code.google.com/p/avbot/> | 
> StatMediaWiki<http://statmediawiki.forja.rediris.es>
> | WikiEvidens <http://code.google.com/p/wikievidens/> | 
> WikiPapers<http://wikipapers.referata.com>
> | WikiTeam <http://code.google.com/p/wikiteam/>
> Personal website: https://sites.google.com/site/emijrp/
>
>


-- 
Emilio J. Rodríguez-Posada. E-mail: emijrp AT gmail DOT com
Pre-doctoral student at the University of Cádiz (Spain)
Projects: AVBOT <http://code.google.com/p/avbot/> |
StatMediaWiki<http://statmediawiki.forja.rediris.es>
| WikiEvidens <http://code.google.com/p/wikievidens/> |
WikiPapers<http://wikipapers.referata.com>
| WikiTeam <http://code.google.com/p/wikiteam/>
Personal website: https://sites.google.com/site/emijrp/
_______________________________________________
Wiki-research-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l

Reply via email to