I have a similar conundrum. I'm writing a suite of preprocessors for
Rozenshtein delta functions (aka Encoded Characteristic functions). The
basic idea is really simple, and I imagine embedded quite deeply in the
Perl psyche. The idea is that the delta function δ[x⊜y], where ⊜ is any
comparison
On second thought... Text::Filter::NoPunctuation is probably better than
::Unpunctuate.
However, a more general solution might be Text::Filter::Transliterate
(using tr with from and to mappings passed to the filter) and
Text::Filter::Delete (deleting characters specified according to a
On Thu, Dec 20, 2012 at 11:09 AM, Brian Katzung bri...@kappacs.com wrote:
On second thought... Text::Filter::NoPunctuation is probably better than
::Unpunctuate.
::StripPunctuation would be even more descriptive.
--
Darren Chamberlain d...@sevenroot.org
Hello everyone,
How about creating Text::Filter::LowerCase and
Text::Filter::Unpunctuate as derived classes of Text::Filter?
I had peeked at Text::Filter before, and had deemed it great as a
transport mechanism (handling various in- and outputs, which my module
deliberately would *not*
Hello Paul,
I have a similar conundrum. I'm writing a suite of preprocessors for
Rozenshtein delta functions (aka Encoded Characteristic functions).
The basic idea is really simple […]
Right now, I'm considering the Text::Rewrite::DeltaExpression::(.*)
namespace where $1 is the target
Hello,
I'm writing a small module to apply lossy filters to text, to enable
better subsequent lossless compression. For example, Hello, World!
would become hello, world! with the lowercase filter, or Hello
World with the punctuation removal filter. This does not apply the
actual compression,
Ben,
How about creating Text::Filter::LowerCase and Text::Filter::Unpunctuate
as derived classes of Text::Filter?
- Brian
On 2012-12-19 13:56, Ben Deutsch wrote:
Hello,
I'm writing a small module to apply lossy filters to text, to enable
better subsequent lossless compression. For