Re: [ccp4bb] pdf to text

2010-09-13 Thread Oganesyan, Vaheh
Thanks go to all who took their time and answered and in some cases did file manipulations. Extremely helpful! Not only for this particular case but many times CCP4BB has proven to be the best. Below are received answers in chronological order: Albert Gluskov: I converted it in adobe acrobat pro,

Re: [ccp4bb] pdf to text

2010-09-13 Thread Mark Brooks
For OCR without installing software, "Free OCR" http://www.free-ocr.com/ works quite well for me, but beware that you may need to do corrections afterwards. Just upload your file to this web site, as long as it isn't secret! The OCR in Adobe Acrobat works better for me though, and is worth the mo

Re: [ccp4bb] pdf to text

2010-09-13 Thread Tomas Malinauskas
Hi, On Mon, Sep 13, 2010 at 2:30 PM, Oganesyan, Vaheh wrote: > The molecule is polymyxin B and doesn’t exist in databases like CSD, Hic-up > or PDB. If some of you happen to have it please share. Thank you. PDB file: http://129.128.185.122/drugbank2/drugs/DB00781/pdb/download More information:

Re: [ccp4bb] pdf to text

2010-09-13 Thread Vellieux Frederic
What you could try to do is print out the pdf file, then locate a scanner with a suitable scanning software. Several scanning software have the possibility of generating word processing program output or ASCII format. Since the pdf file is text only (no figures etc) then it should be OK. You ju

Re: [ccp4bb] pdf to text

2010-09-13 Thread Tim Gruene
Dear Vaheh, the PDF-file is not a "proper" PDF-file. It is an image encapsulated into PDF, which is why it does not contain any text! You could try a text recognition software. In the Linux-/Unix-world, gocr is probably to most popular one. Since it is a courier font, your chances should be prett