Can you provide more information? How are you passing your input? Are you 
passing raw PDF files? If so, are you using your own record reader? The default 
record reader won't read PDF files, and you won't get the text out of them as is. 
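
To illustrate why a line-oriented reader returns unformatted lines, here is a
minimal sketch in plain Java. It does not use any Hadoop API. It assumes the
text inside the PDF is stored in a Flate-compressed stream, which is common but
not guaranteed for every file. The class and method names (PdfLineDemo,
deflate, anyLineContains) are made up for this example.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import java.util.zip.Deflater;

public class PdfLineDemo {

    // Compress bytes with zlib/deflate, standing in for a compressed PDF stream.
    public static byte[] deflate(byte[] input) {
        Deflater deflater = new Deflater();
        deflater.setInput(input);
        deflater.finish();
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buf = new byte[1024];
        while (!deflater.finished()) {
            int n = deflater.deflate(buf);
            out.write(buf, 0, n);
        }
        deflater.end();
        return out.toByteArray();
    }

    // Mimic a line-oriented reader: treat the bytes as text, split on
    // newlines, and look for the search term in each resulting line.
    public static boolean anyLineContains(byte[] data, String term) {
        String text = new String(data, StandardCharsets.ISO_8859_1);
        for (String line : text.split("\n")) {
            if (line.contains(term)) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < 20; i++) {
            sb.append("searching for hadoop in a pdf file\n");
        }
        byte[] plain = sb.toString().getBytes(StandardCharsets.ISO_8859_1);
        byte[] compressed = deflate(plain);

        System.out.println("plain text match:      " + anyLineContains(plain, "hadoop"));
        System.out.println("compressed text match: " + anyLineContains(compressed, "hadoop"));
    }
}
```

The same search that works on the plain bytes has nothing readable to match in
the compressed bytes. That is why the text has to be extracted first, for
example in a custom record reader that hands the file to a PDF parsing library
before the search runs.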
Thanks,
Lohit



----- Original Message ----
From: GaneshG <[EMAIL PROTECTED]>
To: [email protected]
Sent: Wednesday, July 23, 2008 1:51:52 AM
Subject: Text search on a PDF file using hadoop


When I search for text in a PDF file using Hadoop, the results do not come out
properly. I tried to debug my program, and I could see that the lines read from
the PDF file are not formatted. Please help me resolve this.
-- 
View this message in context: 
http://www.nabble.com/Text-search-on-a-PDF-file-using-hadoop-tp18606475p18606475.html
Sent from the Hadoop core-user mailing list archive at Nabble.com.
