Hi Морозов,

It's a directory containing Hadoop map file(s) that store key/value pairs. Hadoop's Text class is the key and Nutch's Content class is the value. You would need Hadoop to easily process the files.
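
If you only want to confirm what a given file holds before setting up Hadoop, a rough sketch like the one below may help. It reads just the header of the `data` file inside a map file directory, which is a Hadoop SequenceFile, and reports the key and value class names. The function names are made up for illustration, and it assumes a SequenceFile version that stores class names as Text-encoded strings (a variable-length integer followed by UTF-8 bytes). It does not decode the records themselves; for that, Hadoop and the Content class are still the practical route.

```python
import io
import struct


def read_vint(stream):
    """Decode a Hadoop variable-length integer (same scheme as WritableUtils.readVInt)."""
    first = struct.unpack("b", stream.read(1))[0]  # signed byte
    if first >= -112:
        return first  # small values fit in the first byte
    negative = first < -120
    size = (-120 - first) if negative else (-112 - first)  # bytes that follow
    value = 0
    for byte in stream.read(size):
        value = (value << 8) | byte
    return ~value if negative else value


def read_string(stream):
    """Read a Text-encoded string: vint byte length, then UTF-8 bytes."""
    length = read_vint(stream)
    return stream.read(length).decode("utf-8")


def read_header(stream):
    """Return version, key class and value class from a SequenceFile header."""
    if stream.read(3) != b"SEQ":
        raise ValueError("not a Hadoop SequenceFile")
    version = stream.read(1)[0]
    return {
        "version": version,
        "key_class": read_string(stream),
        "value_class": read_string(stream),
    }


def build_sample_header():
    """Hypothetical in-memory header, only for trying the parser without a segment."""
    key = b"org.apache.hadoop.io.Text"
    value = b"org.apache.nutch.protocol.Content"
    return b"SEQ\x06" + bytes([len(key)]) + key + bytes([len(value)]) + value


if __name__ == "__main__":
    print(read_header(io.BytesIO(build_sample_header())))
```

Against a real segment you would open the file in binary mode and pass it to `read_header`, for example `open("content/part-00000/data", "rb")` relative to the segment directory. That path is an assumption about the usual layout, so check it against your own segment.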

http://svn.apache.org/viewvc/nutch/trunk/src/java/org/apache/nutch/protocol/Content.java?view=markup

Cheers,
Markus

-----Original message-----
> From: Морозов Евгений <[email protected]>
> Sent: Sat 27-Oct-2012 18:32
> To: [email protected]
> Subject: Format of "content" file in segments?
>
> Where can I find the format of the content file in a segment directory?
> Either source code or documentation. I'm looking at reading it with a
> program external to nutch.
>
> regards, keanta

