On 8/10/08, rick the ponderer <[EMAIL PROTECTED]> wrote: > > > > On 8/10/08, Matt Mahoney <[EMAIL PROTECTED]> wrote: >> >> rick the ponderer <[EMAIL PROTECTED]> wrote: >> >> >> >Regarding cempeting to buy information - I'm not suggesting that at >> >all, people would be competing to sell the services of their classifier >> >(and shopping around for the best classifier to consume or build on). >> >It would be like the web services model - like for example at >> strikeiron.com >> >> >> My point is that for most information, free is too expensive. Then how is >> your model funded? You have to collect money from the information providers >> and increase its value up to at least zero by filtering out all but the most >> useful, like for example, Google. >> >> The missing technology is distributed indexing. This has a number of >> problems. First, it is very expensive to compete with Google. Its servers >> make up about 0.1% of the world's computing power. Second, competing web >> services would be inefficient because of the duplication of network traffic >> (spiders) and index storage. A centralized model favors a monopoly. Third, >> Google it is very limited. After a web page update, Google may take days to >> find it and update its index. >> >> Distributed indexing would solve these problems. Nobody would control the >> index. Everyone would have an incentive to contribute computing power >> (storage and bandwidth) and high quality information in exchange for the >> ability to send messages. There would be no distinction between queries and >> updates. You just send a message and it is routed to anyone who cares. >> Imagine if a Google query could initiate a conversation in real time. >> >> >> -- Matt Mahoney, [EMAIL PROTECTED] >> >> >> ------------------------------------------- >> agi >> Archives: https://www.listbox.com/member/archive/303/=now >> RSS Feed: https://www.listbox.com/member/archive/rss/303/ >> Modify Your Subscription: >> https://www.listbox.com/member/?& >> Powered by Listbox: http://www.listbox.com >> > What I'm suggesting is not an information industry in the sense of > websites/blogs etc. It is a software industry, for example you pay to use > amazon s3/ec2, you pay to use salesforce apps. it is more like paying for > electricity and water - metered usage. > > My point about blogs/websites was that it's a business that has a very low > to barrier to entry, you can start blog publishing for nothing, or by a > domain and hosting for a few dollars a month. > > Most people have some type of knowledge they can blog about, and in the > same way most people have the knowledge for data useful to create a > classifier - just about anyone could label an image of a cat, dog, > car, train, aeroplane etc. > > In the same way, if it was made easy for individuals to use classifiers > (hosted training and hosted prediction), their would be a low barrier to > entry for them too, > > I might collect 1000 images of cats, send it to a classifier host > that would then apply support vector machine to it for 2 hours, charge me 40 > cents (at 20 cents per hour of usage of one cpu), I might then send the > created classifier file to another host (whose business is to host > classifiers) and set a rate of 1000 uses for 1 cent lets say (I might > receive 80 percent of this, and the hoster 20 percent). Then anyone who > wants to identify if there are cats in their image would connect to and use > the cat classifier, paying me a fraction of a cent each time. > > For any given image I would probably want to apply thousands of different > classifiers. Somebody might have created a service that aggregates 2000 > diffferent animal classifiers and I would send my image there and recieve > results for a fee (each of those classifier creators would receive a sum of > money, as would the aggregator). > The question of distribution and search isn't relevant here because the existing infrastructure of the web would be used - dns for location and http for retrieval (or you could substitute email or im), nothing else is needed.
------------------------------------------- agi Archives: https://www.listbox.com/member/archive/303/=now RSS Feed: https://www.listbox.com/member/archive/rss/303/ Modify Your Subscription: https://www.listbox.com/member/?member_id=8660244&id_secret=108809214-a0d121 Powered by Listbox: http://www.listbox.com
