Avinash Lakshman wrote:
That is what we used to load large amounts of data into Cassandra using M/R. So we loaded around 12TB of data from Hadoop into Cassandra before we launched Inbox Search. This way we could do all the heavylifting in Hadoop and load data at practically network bandwidth 100 MB/sec. Going the normal route with the same load chewed up lot of CPU resources on the Cassandra servers because of lot of serialization/deserialization.
Avinash, do you have any sample code that demonstrates importing data this way? -- Eric Evans [email protected]
