On 8/10/08, Matt Mahoney <[EMAIL PROTECTED]> wrote:
>
> rick the ponderer <[EMAIL PROTECTED]> wrote:
>
>
> >Regarding cempeting to buy information - I'm not suggesting that at
> >all, people would be competing to sell the services of their classifier
> >(and shopping around for the best classifier to consume or build on).
> >It would be like the web services model - like for example at
> strikeiron.com
>
>
> My point is that for most information, free is too expensive. Then how is
> your model funded? You have to collect money from the information providers
> and increase its value up to at least zero by filtering out all but the most
> useful, like for example, Google.
>
> The missing technology is distributed indexing. This has a number of
> problems. First, it is very expensive to compete with Google. Its servers
> make up about 0.1% of the world's computing power. Second, competing web
> services would be inefficient because of the duplication of network traffic
> (spiders) and index storage. A centralized model favors a monopoly. Third,
> Google it is very limited. After a web page update, Google may take days to
> find it and update its index.
>
> Distributed indexing would solve these problems. Nobody would control the
> index. Everyone would have an incentive to contribute computing power
> (storage and bandwidth) and high quality information in exchange for the
> ability to send messages. There would be no distinction between queries and
> updates. You just send a message and it is routed to anyone who cares.
> Imagine if a Google query could initiate a conversation in real time.
>
>
>   -- Matt Mahoney, [EMAIL PROTECTED]
>
>
> -------------------------------------------
> agi
> Archives: https://www.listbox.com/member/archive/303/=now
> RSS Feed: https://www.listbox.com/member/archive/rss/303/
> Modify Your Subscription:
> https://www.listbox.com/member/?&;
> Powered by Listbox: http://www.listbox.com
>
What I'm suggesting is not an information industry in the sense of
websites/blogs etc. It is a software industry, for example you pay to use
amazon s3/ec2, you pay to use salesforce apps. it is more like paying for
electricity and water - metered usage.

My point about blogs/websites was that it's a business that has a very low
to barrier to entry, you can start blog publishing for nothing, or by a
domain and hosting for a few dollars a month.

Most people have some type of knowledge they can blog about, and in the same
way most people have the knowledge for data useful to create a classifier -
just about anyone could label an image of a cat, dog, car, train, aeroplane
etc.

In the same way, if it was made easy for individuals to use classifiers
(hosted training and hosted prediction), their would be a low barrier to
entry for them too,

I might collect 1000 images of cats, send it to a classifier host
that would then apply support vector machine to it for 2 hours, charge me 40
cents (at 20 cents per hour of usage of one cpu), I might then send the
created classifier file to another host (whose business is to host
classifiers) and set a rate of 1000 uses for 1 cent lets say (I might
receive 80 percent of this, and the hoster 20 percent). Then anyone who
wants to identify if there are cats in their image would connect to and use
the cat classifier, paying me a fraction of a cent each time.

For any given image I would probably want to apply thousands of different
classifiers. Somebody might have created a service that aggregates 2000
diffferent animal classifiers and I would send my image there and recieve
results for a fee (each of those classifier creators would receive a sum of
money, as would the aggregator).



-------------------------------------------
agi
Archives: https://www.listbox.com/member/archive/303/=now
RSS Feed: https://www.listbox.com/member/archive/rss/303/
Modify Your Subscription: 
https://www.listbox.com/member/?member_id=8660244&id_secret=108809214-a0d121
Powered by Listbox: http://www.listbox.com

Reply via email to