Dear all,

After some R n D on schema design in hbase. I am confused how to design corresponding schema of a table in mysql.

CREATE TABLE `page_content` (
`crawled_page_id` bigint(20) NOT NULL DEFAULT '0' 'unique value for each url,
 `link_level` tinyint(4) DEFAULT NULL,
 `isprocessable` tinyint(2) NOT NULL DEFAULT '1',
 `isvalid` tinyint(4) NOT NULL DEFAULT '1',
 `isanalyzed` tinyint(4) NOT NULL DEFAULT '0' COMMENT ,
`islocked` tinyint(4) NOT NULL DEFAULT '0' COMMENT 'set 1 when the records are in analyzing phase',
 `content_language` varchar(10) DEFAULT NULL,
 `url_id` bigint(20) NOT NULL,
 `publishing_date` varchar(40) DEFAULT NULL,
 `heading` varchar(150) DEFAULT NULL,
 `category` varchar(150) DEFAULT NULL,
 `crawled_page_url` varchar(500) NOT NULL,
 `keywords` varchar(500) DEFAULT NULL,
 `dt_stamp` timestamp NOT NULL DEFAULT CURRENT_TIMESTAMP,
 `content` mediumtext,
 PRIMARY KEY (`crawled_page_id`),
 KEY `idx_url` (`crawled_page_url`),
 KEY `idx_head` (`heading`),
 KEY `idx_dtstamp` (`dt_stamp`)
) ENGINE=InnoDB DEFAULT CHARSET=latin1

In all the examples , I find the reverse url is the row key in hbase but in mysql i create an auto increment column that uniquly locate a document.
Can anyone suggest what is the corresponding table in hbase.

Thanks

Reply via email to