Hi Laurent, 1. Are you saying that _top.sql did not exist in the home directory. Or that, _top.sql existed, but hive was not able to read it after loading
2. I don¹t think reserved words are documented somewhere. Can you file a jira for this ? 3. The bad row is printed in the task log. 1. 2011-01-27 11:11:07,046 INFO org.apache.hadoop.fs.FSInputChecker: Found checksum error: b[1024, 1536]=7374796c653d22666f6e742d73697a653a20313270743b223e3c623e266e6273703b2 66e6273703b266e6273703b202a202838302920416d69656e733a3c2f623e3c2f7370616e3e 3c2f7370616e3e5c6e20203c2f703e5c6e20203c703e5c6e202020203c7370616e207374796 c653d22666f66742d66616d696c793a2068656c7665746963613b223e3c7370616e20737479 6c653d22666f6e742d73697a653a20313270743b223e3c623e266e6273703b266e6273703b2 66e6273703b266e6273703b266e6273703b266e6273703b266e6273703b266e6273703b266e 6273703b206f203132682c2050697175652d6e6971756520646576616e74206c65205265637 46f7261742e3c2f623e3c2f7370616e3e3c2f7370616e3e5c6e20203c2f703e5c6e20203c70 3e5c6e202020203c7370616e207374796c653d22666f6e742d66616d696c793a2068656c766 5746963613b223e3c7370616e207374796c653d22666f6e742d73697a653a20313270743b22 3e3c623e266e6273703b266e6273703b266e6273703b266e6273703b266e6273703b266e627 3703b266e6273703b266e6273703b266e6273703b206f2031346833302c204d6169736f6e20 6465206c612063756c747572652e3c2f623e3c2f7370616e3e3c2f7370616e3e5c6e20203c2 f703e5c6e20203c703e5c6e202020203c7370616e207374796c653d 2. org.apache.hadoop.fs.ChecksumException: Checksum error: /blk_2466764552666222475:of:/user/hive/warehouse/article/article.copy at 23446528 3. at org.apache.hadoop.fs.FSInputChecker.verifySum(FSInputChecker.java:27 Thanks, -namit On 1/27/11 7:50 AM, "Laurent Laborde" <kerdez...@gmail.com> wrote: >Friendly greetings ! >I am very frustrated about hive. i like the program but i find a >problem avery day i try to use it and i can't find any support on irc. >so here i am ! > >1st problem : >hive> create table top_test like top; >OK > >hive> load data local inpath >'/home/ker2x/over-blog/archive-20101213/_top.sql' into table top_test; >Copying data from file:/home/ker2x/over-blog/archive-20101213/_top.sql >Loading data to table top_test >OK >Time taken: 0.832 seconds > >hive> select count(1) from top_test; >... >OK >0 > >if the file loaded being with a "_" there is no error but it's >impossible to do any query on it. > >of course if i do : >hive> load data local inpath >'/home/ker2x/over-blog/archive-20101213/top.sql' into table top_test; >Copying data from file:/home/ker2x/over-blog/archive-20101213/top.sql >Loading data to table top_test >OK >Time taken: 0.908 seconds > >hive> select count(1) from top_test; >... >OK >1460112 >Time taken: 34.779 seconds > >it works without problem ... > >The first time i got this problem i lost many hours (loading + query) >because the imported file was tens of GB :( > > >2nd problem : >hive> create table testing (data string); >FAILED: Parse Error: line 1:22 mismatched input 'data' expecting >Identifier in column specification > >can't create a column named "data". is it documented anywhere ? >same problem with a column named "date". >same problem with a column named "comment". >same problem with a column named "timestamp". > >3rd problem : >i'm loading a big file, no problem. >now i run *any* query on this table and here is the result : >http://pastebin.com/mE8Gzdbq >Look like a line or a column doesn't have the expected format. But... >which line ? which column ? >i'm on this problem since days :((( > >Can you help please ? >thank you. > >-- >Laurent "ker2x" Laborde >Sysadmin & DBA at http://www.over-blog.com/