Oh, sorry for my question...
Is the TextParser generator possibly what I am looking for?
Could this parser also handle "unstructured text" as follows:
"Take out a piece of data from the input, replace it with
something else, and finally turn the whole thing into valid
XML output..."
regards,
hussayn
SAXESS - Hussayn Dabbous wrote:
Hi,
I have been struggling with the following problem and wonder whether
it is relevant and has already been solved within Cocoon:
Assume you have some content that is plain text, e.g. log reports.
Now you want to use this text with Cocoon. Naturally you have to
convert the text to XML. This could of course be done by writing a
new generator, which would be specific to the data it has to
convert.
Now assume you have many different sources that have to be
transformed into XML.
Wouldn't it be nice to have a generator at hand that could be
controlled via configuration? That way I could use one generator,
configure the conversion rules as needed, get the XML data
out of it, and then proceed within Cocoon pipelines ...
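To make the idea of configurable conversion rules a bit more concrete, here is a minimal sketch in Python. It is not a Cocoon generator and does not use any Cocoon API; the rule format, the element names and the function name text_to_xml are all assumptions made up for illustration. Each rule pairs an element name with a regular expression, and the named groups of the first matching rule become child elements.

```python
import re
from xml.sax.saxutils import escape

# Hypothetical rule set (an assumption, not an existing configuration format):
# each rule is (element name, regex with named groups).
RULES = [
    ("entry", re.compile(
        r"^(?P<date>\d{4}-\d{2}-\d{2}) (?P<level>[A-Z]+) (?P<message>.*)$")),
]

def text_to_xml(text, rules=RULES, root="log"):
    """Convert plain text to an XML string, one line at a time.

    The first rule whose regex matches a line decides its element name;
    lines that match no rule are kept inside an <unparsed> element.
    """
    out = ["<%s>" % root]
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        for name, pattern in rules:
            match = pattern.match(line)
            if match:
                # Each named group becomes a child element; values are escaped.
                fields = "".join(
                    "<%s>%s</%s>" % (key, escape(value), key)
                    for key, value in match.groupdict().items()
                )
                out.append("<%s>%s</%s>" % (name, fields, name))
                break
        else:
            out.append("<unparsed>%s</unparsed>" % escape(line))
    out.append("</%s>" % root)
    return "".join(out)
```

With rules like these kept in a configuration file, supporting a new text source would mean adding a rule rather than writing a new generator.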
One possible use case (it sounds like a JTidy task, but it isn't):
I have several servers that produce very dirty HTML, intermixed with
JavaScript. My generator shall gather data from these sites and
not only convert HTML to XHTML, but also make some necessary
modifications within the JavaScript, which is certainly not a suitable
task for XSLT processing, nor for JTidy. I could think of regexp
processing here...
Rather than creating dedicated generators for every site, I want one
generator that can be configured to convert data depending on the
URL, or whatever... I think this is just another step towards
real content syndication ...
What do you think?
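The URL-dependent regexp processing described above could look roughly like the following Python sketch. Everything in it is hypothetical: the site URLs, the substitution patterns, and the names SITE_RULES and rewrite are invented for illustration, and cleaning the result up into well-formed XHTML would still be a separate step.

```python
import re

# Hypothetical per-site configuration (an assumption for illustration):
# URL pattern -> list of (regex, replacement) applied in order.
SITE_RULES = [
    (re.compile(r"^http://site-a\.example/"),
     [(re.compile(r"window\.location\s*=\s*'([^']*)'"), r"goTo('\1')")]),
    (re.compile(r"^http://site-b\.example/"),
     [(re.compile(r"<font[^>]*>|</font>", re.IGNORECASE), "")]),
]

def rewrite(url, text, site_rules=SITE_RULES):
    """Apply the substitutions of the first rule set whose URL pattern matches.

    Text from URLs that match no configured pattern is returned unchanged.
    """
    for url_pattern, substitutions in site_rules:
        if url_pattern.match(url):
            for pattern, replacement in substitutions:
                text = pattern.sub(replacement, text)
            return text
    return text
```

The point of the design is that the rule table, not the code, is what changes from site to site.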
Any thoughts are welcome ...
regards, hussayn
--
Dr. Hussayn Dabbous
SAXESS Software Design GmbH
Neuenhöfer Allee 125
50935 Köln
Telefon: +49-221-56011-0
Fax: +49-221-56011-20
E-Mail: [EMAIL PROTECTED]