Re: [Wikitech-l] [Wikimedia-l] IRC office hours: Shared hosting

2015-12-20 Thread Brian Wolff
On 12/20/15, James Salsman  wrote:
> Were there any objections to my request below?
>

Yes. As MaxSem said earlier[1], its basically being ignored as being
totally irrelevant to the topic at hand. (To be clear: Third-party
does not mean people who are doing work on Wikimedia sites that aren't
WMF. Third party = Wikis that have nothing to do with Wikimedia wikis
(e.g. wikia, wikihow, uncyclopedia etc))

If you want to get Dispenser his hard disk space, you should take it
up with the labs people, or at the very least some thread where it
would be on-topic.

> Can we also please hire additional database, system, and if necessary
> network administration support to make sure that the third party spam
> prevention bot infrastructure is supported more robustly in the future?

Then by definition it wouldn't be a third-party spam framework if WMF
was running it.

--
-bawolff

[1] https://lists.wikimedia.org/pipermail/wikitech-l/2015-December/084326.html
[Linking because this thread is super-cross posted, and some people
are going to be confused as to what I'm referring to]

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Re: [Wikitech-l] New Beta Feature: completion suggester

2015-12-20 Thread John Erling Blad
I tried this on a search for "Sør-Aurdal" (a municipality in Norway),
dropped the dash and wrote "sørau" and got a hit on "Søraust-Svalbard
naturreservat" among other things. The topmost hit was "søraurdøl", which
is a denomyn for someone from Sør-Aurdal. It seems to me that a spelling
error is compensated with a fuzzy search for long(est?) words, but that
imply nearly completing the word if there is a spelling error.

What if the topmost entry in the list had a less aggressive fuzzy search,
and used shorter words? I tried several other searches, and somehow "sørau"
seems to be difficult. All searches was on nowiki.

I'm a bit impressed... :D

On Sun, Dec 20, 2015 at 9:55 PM, Sage Ross 
wrote:

> If I'm, say, building a web app that could benefit from that kind of
> search suggestion tool, is there an API I can use?
>
> -Sage
>
> On Thu, Dec 17, 2015 at 5:09 PM, Dan Garry  wrote:
> > Hey all,
> >
> > In the continued quest to make the search bar a better tool, the
> Wikimedia
> > Foundation's Discovery Department
> >  has put a
> completion
> > suggester into Beta Features. The tool functions with search-as-you-type,
> > with a small tolerance for typos and spacing in finding results. Possible
> > matches are then displayed as you type in a drop down menu, hopefully
> > eliminating the need to perform a fulltext search with landing page and
> > all. You can read more details at mediawiki.org
> > <
> https://www.mediawiki.org/wiki/Extension:CirrusSearch/CompletionSuggester>
> > and use the talk page for now for feedback.
> >
> > The tool is now available and will only be enabled for the article
> > namespace for now, and will progress into full production at some point
> > hopefully in early 2016, depending on feedback. It's going to be
> important
> > to get feedback from regular contributors who use search to make sure
> that
> > any of the basic feature requests for searching the main space can at
> least
> > be addressed while in Beta Features.
> >
> > Thanks!
> >
> > Dan
> >
> > --
> > Dan Garry
> > Lead Product Manager, Discovery
> > Wikimedia Foundation
> > ___
> > Wikitech-l mailing list
> > Wikitech-l@lists.wikimedia.org
> > https://lists.wikimedia.org/mailman/listinfo/wikitech-l
>
> ___
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l
>
___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Re: [Wikitech-l] [Wikimedia-l] IRC office hours: Shared hosting

2015-12-20 Thread James Salsman
Were there any objections to my request below?

Can we also please hire additional database, system, and if necessary
network administration support to make sure that the third party spam
prevention bot infrastructure is supported more robustly in the future?

On Monday, December 14, 2015, James Salsman  wrote:

> Hi Giles,
>
> I regret I will probably not be available for the IRC office hours as
> scheduled.
>
> In the discussion of shared hosting, I worry that en:User:Dispenser's
> reflinks project, which requires a 20 TB cache, is being forgotten
> again. He tried to host it himself, but it's offline again. This data
> is essential in maintaining an audit trail of references as long as
> the Internet Archive respects robots.txt retroactively, allowing those
> who inherit domains to censor them, even if they have already been
> used as a reference in Wikipedia. Keeping the cache is absolutely a
> fair use right in the US, in both statutory and case law, and it is
> essential to be able to track down patterns of attempts at deceptive
> editing to address quality concerns around deliberately biased editing
> such as paid editing. Because of the sensitivity of this goal, the
> Foundation should certainly bear the risk of hosting the reflinks
> cache. However, in the past, 20 TB was considered excessive, even
> though the cost was shown to be less than $5000 without whatever Dell
> NSA-enabled hardware you usually buy.
>
> Would you please reach out to en:User:Dispenser and offer them the
> 20TB hosting solution they need for the Foundation to bear the risk of
> the reflinks cache?  Thank you for your kind consideration.
>
> Best regards,
> Jim
>
___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Re: [Wikitech-l] Sharing JS code between NodeJS and browser

2015-12-20 Thread Daniel Friesen
On 2015-12-18 9:31 AM, Yuri Astrakhan wrote:
> Trevor and Daniel, thanks for your reply.
>
> How would you structure the code that is to be shared?  Should it be a
> separate NPM package, referenced from the extension package.json via git
> url, and have a small file in the extension's lib/ dir with a oneliner -
>  "require('...')"  that browserify could pick up? And have a script command
> in package.json to build that file?
Both options are valid.

You can use browserify to make a standalone browser build of a library
and use it via ResourceLoader.
This will allow it to be shared. But it raises the possibility that
there some type of conflict may happen in the global space.

You can use browserify to make a script that'll expose a global
require('...') with the module(s). Same pros and caveats as standalone
builds. Though a mild reduction in the conflicts that could happen.

Or you could write your code in CommonJS/node-like style and browserify
that without declaring any of the dependencies as external. Then all the
things you need will be bundled with your code. You'll eliminate all
possible conflicts. But modules will definitely be duplicated since
nothing is shared.

~Daniel Friesen (Dantman, Nadir-Seen-Fire) [http://danielfriesen.name/]


___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Re: [Wikitech-l] Sharing JS code between NodeJS and browser

2015-12-20 Thread Santhosh Thottingal
I had used returnExports pattern defined in https://github.com/umdjs/umd to
get a module that works in Node, AMD and browser globals.
That repo has several patterns and tools for this and you may find useful
to solve your usecase.

Thanks
Santhosh
___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

[Wikitech-l] Fwd: OCR4wikisource

2015-12-20 Thread Shrinivasan T
-- Forwarded message --
From: Shrinivasan T 
Date: 2015-12-19 4:45 GMT+05:30
Subject: OCR4wikisource
To: "Discussion list on Indian language projects of Wikimedia." <
wikimediaindi...@lists.wikimedia.org>


Hi all,

Released a program to link google OCR and books for wiki source.

Grab the python code from here and run in your GNU/linux machines.

https://github.com/tshrinivasan/OCR4wikisource

It is based on
https://github.com/tshrinivasan/google-ocr-python

Reply here with your suggestions and improvements.

Thanks.



-- 
Regards,
T.Shrinivasan


My Life with GNU/Linux : http://goinggnu.wordpress.com
Free E-Magazine on Free Open Source Software in Tamil : http://kaniyam.com

Get Free Tamil Ebooks for Android, iOS, Kindle, Computer :
http://FreeTamilEbooks.com
___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Re: [Wikitech-l] New Beta Feature: completion suggester

2015-12-20 Thread Sage Ross
If I'm, say, building a web app that could benefit from that kind of
search suggestion tool, is there an API I can use?

-Sage

On Thu, Dec 17, 2015 at 5:09 PM, Dan Garry  wrote:
> Hey all,
>
> In the continued quest to make the search bar a better tool, the Wikimedia
> Foundation's Discovery Department
>  has put a completion
> suggester into Beta Features. The tool functions with search-as-you-type,
> with a small tolerance for typos and spacing in finding results. Possible
> matches are then displayed as you type in a drop down menu, hopefully
> eliminating the need to perform a fulltext search with landing page and
> all. You can read more details at mediawiki.org
> 
> and use the talk page for now for feedback.
>
> The tool is now available and will only be enabled for the article
> namespace for now, and will progress into full production at some point
> hopefully in early 2016, depending on feedback. It's going to be important
> to get feedback from regular contributors who use search to make sure that
> any of the basic feature requests for searching the main space can at least
> be addressed while in Beta Features.
>
> Thanks!
>
> Dan
>
> --
> Dan Garry
> Lead Product Manager, Discovery
> Wikimedia Foundation
> ___
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l