Can you provide more information? How are you passing your input: are you passing raw PDF files? If so, are you using your own record reader? The default record reader won't read PDF files, and you won't get the text out of them as is.

Thanks,
Lohit
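
One way to answer the "raw PDF files?" question is to look at the bytes a job actually receives. The sketch below is only an illustration, not part of Hadoop: the helper names `looks_like_raw_pdf` and `printable_ratio` are hypothetical. It relies on the fact that a PDF file begins with a `%PDF-` header and that its page content is usually stored in compressed streams, so a line-oriented reader sees mostly non-printable bytes rather than searchable text.

```python
# Hypothetical diagnostic helpers (not Hadoop APIs): they inspect raw bytes
# to judge whether an input is an unextracted PDF.

PDF_MAGIC = b"%PDF-"


def looks_like_raw_pdf(data: bytes) -> bool:
    """Return True if the PDF header marker appears near the start of the data."""
    # The header normally sits at offset 0, but some files carry a few
    # leading bytes, so scan the first 1024 bytes instead of only offset 0.
    return PDF_MAGIC in data[:1024]


def printable_ratio(line: bytes) -> float:
    """Return the fraction of bytes that are printable ASCII or whitespace."""
    if not line:
        return 1.0
    printable = sum(1 for b in line if 32 <= b < 127 or b in (9, 10, 13))
    return printable / len(line)


if __name__ == "__main__":
    sample = b"%PDF-1.4\n1 0 obj\n<< /Type /Catalog >>\nendobj\n"
    print(looks_like_raw_pdf(sample))        # header marker is present
    print(printable_ratio(b"\x00\x01ab"))    # half of the bytes are printable
```

If the first check is true, or many lines have a low printable ratio, the input is most likely raw PDF, and the text would have to be extracted with a PDF parser (inside a custom record reader or in a preprocessing step) before any text search can work.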
----- Original Message ----
From: GaneshG <[EMAIL PROTECTED]>
To: [email protected]
Sent: Wednesday, July 23, 2008 1:51:52 AM
Subject: Text search on a PDF file using hadoop

When I search for text in a PDF file using Hadoop, the results do not come out properly. I tried to debug my program, and I could see that the lines read from the PDF file are not formatted. Please help me resolve this.

--
View this message in context: http://www.nabble.com/Text-search-on-a-PDF-file-using-hadoop-tp18606475p18606475.html
Sent from the Hadoop core-user mailing list archive at Nabble.com.
