Knacktus wrote:
Am 02.12.2010 02:51, schrieb Dana:
Hello,
I'm using Python to extract words from plain text files. I have a list
of words. Now I would like to convert that list to a dictionary of
features where the key is the word and the value the number of
occurrences in a group of files based on the filename (different files
correspond to different categories). What is the best way to represent
this data? When I finish I expect to have about 70 unique dictionaries
with values I plan to use in frequency distributions, etc. Should I use
globally defined dictionaries?
Depends on what else you want to do with the group of files. If you're
expecting some operations on the group's data you should create a class
to be able to add some more methods to the data. I would probably go
with a class.
Unless you're planning to have multiple "file groups" at once, or
intending to re-use this code for other modules, using a class is
probably overkill. This isn't Java where everything has to be a class :)
But also I think dictionaries can be fine. If you really only need the
dicts. You could create a function to create those.
Agreed.
One way or the other, a dict {word: count} is the natural data structure
to use for a concordance. Whether you store those dicts in a class, or
just operate directly on the dicts, is relatively unimportant.
--
Steven
_______________________________________________
Tutor maillist - Tutor@python.org
To unsubscribe or change subscription options:
http://mail.python.org/mailman/listinfo/tutor