On 8/10/08, Matt Mahoney <[EMAIL PROTECTED]> wrote: > > rick the ponderer <[EMAIL PROTECTED]> wrote: > > > >Regarding cempeting to buy information - I'm not suggesting that at > >all, people would be competing to sell the services of their classifier > >(and shopping around for the best classifier to consume or build on). > >It would be like the web services model - like for example at > strikeiron.com > > > My point is that for most information, free is too expensive. Then how is > your model funded? You have to collect money from the information providers > and increase its value up to at least zero by filtering out all but the most > useful, like for example, Google. > > The missing technology is distributed indexing. This has a number of > problems. First, it is very expensive to compete with Google. Its servers > make up about 0.1% of the world's computing power. Second, competing web > services would be inefficient because of the duplication of network traffic > (spiders) and index storage. A centralized model favors a monopoly. Third, > Google it is very limited. After a web page update, Google may take days to > find it and update its index. > > Distributed indexing would solve these problems. Nobody would control the > index. Everyone would have an incentive to contribute computing power > (storage and bandwidth) and high quality information in exchange for the > ability to send messages. There would be no distinction between queries and > updates. You just send a message and it is routed to anyone who cares. > Imagine if a Google query could initiate a conversation in real time. > > > -- Matt Mahoney, [EMAIL PROTECTED] > > > ------------------------------------------- > agi > Archives: https://www.listbox.com/member/archive/303/=now > RSS Feed: https://www.listbox.com/member/archive/rss/303/ > Modify Your Subscription: > https://www.listbox.com/member/?& > Powered by Listbox: http://www.listbox.com > What I'm suggesting is not an information industry in the sense of websites/blogs etc. It is a software industry, for example you pay to use amazon s3/ec2, you pay to use salesforce apps. it is more like paying for electricity and water - metered usage.
My point about blogs/websites was that it's a business that has a very low to barrier to entry, you can start blog publishing for nothing, or by a domain and hosting for a few dollars a month. Most people have some type of knowledge they can blog about, and in the same way most people have the knowledge for data useful to create a classifier - just about anyone could label an image of a cat, dog, car, train, aeroplane etc. In the same way, if it was made easy for individuals to use classifiers (hosted training and hosted prediction), their would be a low barrier to entry for them too, I might collect 1000 images of cats, send it to a classifier host that would then apply support vector machine to it for 2 hours, charge me 40 cents (at 20 cents per hour of usage of one cpu), I might then send the created classifier file to another host (whose business is to host classifiers) and set a rate of 1000 uses for 1 cent lets say (I might receive 80 percent of this, and the hoster 20 percent). Then anyone who wants to identify if there are cats in their image would connect to and use the cat classifier, paying me a fraction of a cent each time. For any given image I would probably want to apply thousands of different classifiers. Somebody might have created a service that aggregates 2000 diffferent animal classifiers and I would send my image there and recieve results for a fee (each of those classifier creators would receive a sum of money, as would the aggregator). ------------------------------------------- agi Archives: https://www.listbox.com/member/archive/303/=now RSS Feed: https://www.listbox.com/member/archive/rss/303/ Modify Your Subscription: https://www.listbox.com/member/?member_id=8660244&id_secret=108809214-a0d121 Powered by Listbox: http://www.listbox.com
