Colin,
  Would you be able to share some of the code with us?
   
  thx,
  Prasan

Colin Evans <[EMAIL PROTECTED]> wrote:
  We ended up subclassing TextInputFormat and adding a custom RecordReader
that begins and ends each record read at the XML tags that delimit a record.
The StreamXmlRecordReader class is a good reference for this.
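
For readers following the thread, here is a minimal sketch of the tag-boundary scanning that such a RecordReader might perform. It is not Colin's code and does not use the Hadoop API. The class name TagRecordScanner and its constructor parameters are hypothetical, the tags are matched as literal byte strings (so a start tag with attributes would not match), and the stream is assumed to be already positioned at the split's start offset. A record belongs to the split in which its start tag begins, so the scanner may read past the split end to finish a record.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

/** Hypothetical helper: extracts records delimited by literal start/end tags. */
class TagRecordScanner {
    private final InputStream in;
    private final byte[] startTag;
    private final byte[] endTag;
    private final long splitEnd; // first byte offset that belongs to the next split
    private long pos;            // absolute offset of the next byte to read

    /** Assumes {@code in} is already positioned at {@code splitStart}. */
    TagRecordScanner(InputStream in, String startTag, String endTag,
                     long splitStart, long splitEnd) {
        this.in = in;
        this.startTag = startTag.getBytes(StandardCharsets.UTF_8);
        this.endTag = endTag.getBytes(StandardCharsets.UTF_8);
        this.pos = splitStart;
        this.splitEnd = splitEnd;
    }

    /** Returns the next record, or null if no record starts within this split. */
    String next() throws IOException {
        if (!readUntil(startTag, null, true)) {
            return null;
        }
        ByteArrayOutputStream record = new ByteArrayOutputStream();
        record.write(startTag, 0, startTag.length);
        // Keep reading past the split end if needed so the record is complete.
        if (!readUntil(endTag, record, false)) {
            return null; // truncated record at end of input
        }
        return new String(record.toByteArray(), StandardCharsets.UTF_8);
    }

    /**
     * Consumes bytes until {@code tag} has been read in full. Copies consumed
     * bytes to {@code out} when it is non-null. When {@code withinSplit} is
     * true, gives up once a match could only begin at or after the split end.
     * The matching is naive and assumes the tag's first byte does not recur
     * inside the tag, which holds for tags such as "<rec>".
     */
    private boolean readUntil(byte[] tag, ByteArrayOutputStream out,
                              boolean withinSplit) throws IOException {
        int matched = 0;
        while (true) {
            // pos - matched is where the current (partial) match began.
            if (withinSplit && pos - matched >= splitEnd) {
                return false;
            }
            int b = in.read();
            if (b == -1) {
                return false;
            }
            pos++;
            if (out != null) {
                out.write(b);
            }
            if (b == tag[matched]) {
                matched++;
                if (matched == tag.length) {
                    return true;
                }
            } else {
                matched = (b == tag[0]) ? 1 : 0;
            }
        }
    }
}
```

In a real job, a custom RecordReader would wrap logic like this, seek the file to the split's start offset before scanning, and hand each returned record to the map function as the value.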



Prasan Ary wrote:
> Hi All,
> I am writing a Java implementation of my map/reduce function on Hadoop.
> The input is an XML file, and the map function has to process well-formed
> XML records. So far I have been unable to split the XML file at XML
> record boundaries to feed into my map function.
> Can anybody point me to resources that explain how to force file splits at
> the desired boundaries?
> 
> thx,
> Pra.
>



       
