Hello crew,

I subscribed to tp-devel some time ago but haven't introduced myself yet. My 
name is peres, and I am working (slacking) on the random planet names task. You 
can randomly find me on #tp :)

Here is a brief explanation of how my generator should work. My design involves 
3 stages, as follows:

1 - generation of pronounceable sequences, based on valid syllables for a 
selected language [currently English].

2 - filtering, so that bad sequences are discarded. This is based on 
sequence-level features like repetition of sounds and the like.

3 - rendering: here the surviving sequences are transformed into words, by 
mapping the sounds they represent into a spelling [this should be done 
according to rules for a language pronounciation].

At the moment I have an implementation for stage 1, I'm working on #2, and I 
have very rough ideas about #3.

If you look at the attached files then please consider that stage 3 is so 
trivial right now, that it renders words using intermediate representations of 
their sound (and we all know that English is not exactly a phonetic language).

So, while part of the weirdness in those lists can be attributed to a missing 
rendering stage, the main problem is that the filters in stage 2 are not good 
enough. In fact, most of the words you see there should be discarded instead of 
being rendered. Rejection ratio is now about 20%, but I am expecting to get 
85-90% in order of obtain good results.

My kind request to the mailing list is the following: I would love if some of 
you could look into the lists, and delete anything but the names they deem 
usable or with some potential (please consider the rendering problem here). 
Your feedback will be useful to add further filtering rules.

Thank you :)
peres

<<attachment: test.zip>>

_______________________________________________
tp-devel mailing list
[email protected]
http://www.thousandparsec.net/tp/mailman.php/listinfo/tp-devel

Reply via email to