For reading PDF in java, you may refer to this link: http://stackoverflow.com/questions/4784825/how-to-read-pdf-files-using-java
in mapreduce, you can use the same code; except that each map() function processes one file; Regards, *Stanley Shi,* On Wed, Mar 12, 2014 at 4:53 PM, Ranjini Rathinam <ranjinibe...@gmail.com>wrote: > Hi, > > How to read a PDF file in mapreduce. > > Please provide sample code or sample link for refernce. > > > thanks in advance. > > Ranjini > >