On 3/8/07, Tim Golden <[EMAIL PROTECTED]> wrote:
> Simplest thing's probably antiword (http://www.winfield.demon.nl/)
> and then whatever text-scanning approach you want.

I've gone for:

#!/usr/bin/env python

import glob, os

url = "/home/cherp/prddoc"
searchstring = "dxpolbl.p"
worddocs = []

# Collect every .doc file under the starting directory.
for (dirpath, dirnames, filenames) in os.walk(url):
    for f in filenames:
        if f.endswith(".doc"):
            worddocs.append(os.path.join(dirpath, f))

# Report each document whose raw contents contain the search string.
for d in worddocs:
    for i in glob.glob(d):
        if searchstring in open(i, "r").read():
            print "Found it in: ", i.split('/')[-1]

Now... I want to convert this to a CGI script. How do I grab
$QUERY_STRING in Python?

S.
_______________________________________________
Tutor maillist - Tutor@python.org
http://mail.python.org/mailman/listinfo/tutor
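
On the $QUERY_STRING question: a CGI server normally passes the query string to the script as an environment variable, so one way to read it is through os.environ. The following is only a minimal sketch, not a finished CGI script. It is written for Python 3 (so print is a function), the helper name get_search_string and the parameter name "q" are made up for illustration, and the fallback value is simply the search string from the script above.

```python
import os
from urllib.parse import parse_qs


def get_search_string(environ=os.environ, default="dxpolbl.p"):
    """Return the first value of the hypothetical 'q' parameter.

    Falls back to `default` when QUERY_STRING is missing or has no 'q'.
    """
    # The web server sets QUERY_STRING to everything after the '?' in the URL.
    query = environ.get("QUERY_STRING", "")
    # parse_qs decodes %-escapes and returns a dict mapping names to lists.
    params = parse_qs(query)
    values = params.get("q")
    return values[0] if values else default


if __name__ == "__main__":
    # A CGI response needs a header block, then a blank line, then the body.
    print("Content-Type: text/plain")
    print()
    print("Searching for:", get_search_string())
```

With this, a request such as script.py?q=dxpolbl.p would yield "dxpolbl.p" as the search string, which could then replace the hard-coded searchstring. Taking the environment as an argument is just a convenience so the function can be tried with a plain dict outside a web server.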