Hey everyone, we're facing a problem while reading AVRO files written with FLUME using the AVRO Java API 1.5.4 into a HADOOP cluster. The Avro Data Store complains about missing sync marker. Investigating the problem shows us, that's perfectly right. The sync marker is missing. Thus we have a block of the double size.
Our software packets: rpm -qa | grep hadoop hadoop-0.20-namenode-0.20.2+923.142-1 hadoop-0.20-0.20.2+923.142-1 hadoop-0.20-native-0.20.2+923.142-1 hadoop-hive-0.7.1+42.27-2 hadoop-pig-0.8.1+28.18-1 This is pretty much all a basic cloudera CDH3 Update 2 Packaging installation with a patched PIG version which is CDH3 Update 3. Did anyone had a similar issue? Does this ring a bell? Thanks Markus
