With linux and xpdf-tools its as easy as

pdftotext xyz.pdf
wc -w xyz.txt

Best,
Raphael

On 08.04.2014 20:44, Eric Dodge wrote:
> Seems like there are 2 steps here, getting the text into a more usable
> format and then getting the word counts. There are programs that let you
> dump pdf into text (http://pdf2txt.software.informer.com/3.2/ for
> example) in batches. Then paste the text into a tool like this
> (http://www.textfixer.com/tools/online-word-counter.php) to get the word
> counts.
> 
> Eric
> 
> 
> On Tue, Apr 8, 2014 at 12:54 AM, Suren Makkar <[email protected]
> <mailto:[email protected]>> wrote:
> 
>     Hey guys,
> 
>     Quick Rookie question, I'm trying to get total word counts for all
>     occurring words in a bunch of PDFs, and I am lost. Help?
> 
> 
> 
>     -- 
>     For more details about this list
>     http://datameet.org/discussions/
>     ---
>     You received this message because you are subscribed to the Google
>     Groups "datameet" group.
>     To unsubscribe from this group and stop receiving emails from it,
>     send an email to [email protected]
>     <mailto:[email protected]>.
>     For more options, visit https://groups.google.com/d/optout.
> 
> 
> -- 
> For more details about this list
> http://datameet.org/discussions/
> ---
> You received this message because you are subscribed to the Google
> Groups "datameet" group.
> To unsubscribe from this group and stop receiving emails from it, send
> an email to [email protected]
> <mailto:[email protected]>.
> For more options, visit https://groups.google.com/d/optout.

-- 
Raphael Susewind | BGHS Bielefeld University, CSASP University of Oxford
      Snail Mail | Melanchthonstr. 4a, 33615 Bielefeld, Germany
   Papers & Blog | http://www.raphael-susewind.de

Please do consider http://www.gnupg.org for encryption (key id A5ED49AE)

-- 
For more details about this list
http://datameet.org/discussions/
--- 
You received this message because you are subscribed to the Google Groups 
"datameet" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

Reply via email to