Hi Lewis, We crawl one of our library websites and the table dump was always presentable without any issues but i am not sure if we have any special characters within our content. I can check and tell you more on monday when i go back to work.
I use Nutch-2.x with Hbase. Kiran. On Sat, Feb 16, 2013 at 3:01 PM, Lewis John Mcgibbney < [email protected]> wrote: > Hi, > I wonder if someone using 2.x can tell me the following. Regardless of > which backend you use for storing the webdb or hostdb, can you please > confirm what your db dump looks like e.g. can it be read, is it presentable > visually, etc. > I am struggling to open a dump of my webdb in my gedit text editor as there > are lots of non UTF-8 chars in there. > I wonder if this behaviour is consistent across all gora backends or if it > is specific to gora-cassandra. > Someone using HBase or Accumulo would be great... or of course any of the > SQL db's. > Thank you very much. > Lewis > > -- > *Lewis* > -- Kiran Chitturi

