Chien Nguyen wrote:
Hi All,
I am a newbie to Python. I just did some readings on the web
and got some basic understanding about the language. I'd like
to learn the language by writing some simple programs rather than
keep reading books. My first program will convert certain uni-code characters
(let's say UTF-8) in an RTF file format based on a certain mapping
in another RTF file that is called a "RTF Control file". On each line
of the Control file, there are 2 tokens separate by a TAB or a space.
The first token contains the character that needs to be converted from,
and the second character contains the character that needs to be converted to.

The program will write to a new file that contains a new set of mapped characters. If a character form the original file is not found in the Control file, then the program
just write the same character to the new file.
For an example: The RTF Control file may contain the following lines.

â     í
ơ     ă
ư     ổ

The original RTF file may have something like
tâc mơm thư
and will be converted to a new RTF file as follows.
tíc     măm     thổ

Before I start to go into the coding, I would like to get some advice from
experienced users/mentors about a quick way to do it.

Quick - do you mean time to code, or execution time?

For each line in the control file add an item to a dictionary, with old value as key and new value as value.
For each line in the data file
 For each character in the line
   if in dictionary replace with corresponding value
 write line to output.

--
Bob Gailer
919-636-4239 Chapel Hill, NC

_______________________________________________
Tutor maillist  -  [email protected]
http://mail.python.org/mailman/listinfo/tutor

Reply via email to