Hi All,

I have a set of rich documents. Some of them are scanned PDF files. When I send a scanned PDF to the extraction request handler, the icon below appears in my Dock.
http://tinypic.com/r/2mpmo7o/6
http://tinypic.com/r/28ukxhj/6

Does anyone know what this is?

This is the request I send:

curl "http://localhost:8983/solr/documents/update/extract?literal.ID=ticaret_sicil_gazetesi&literal.URL=ticaret_sicil_gazetesi&commit=true" -F "myfile=@ticaret_sicil_gazetesi.pdf"

No exception appears in the Solr logs. The document is indexed, and its content field is:

xmpTPg:NPages 4 Creation-Date 2011-08-24T13:03:16Z stream_source_info myfile created Wed Aug 24 16:03:16 EEST 2011 stream_content_type application/octet-stream stream_size 2302337 producer Image Recognition Integrated Systems, Autoformat5,0,0,229 stream_name ticaret_sicil_gazetesi.pdf Content-Type application/pdf creator I.R.I.S. page page page page

Environment: solr-trunk, Mac OS X Version 10.7.4, Java HotSpot(TM) 64-Bit Server VM (build 20.8-b03-424, mixed mode), Jetty. The same thing happens with Solr 4.0-beta and Tomcat too.

Thanks,