Dear Flexwikers,

Below is a Flexwiki dialect perl module for the Perl program html2wiki
(get that program, documentation, and other dialects at cpan.org --
search for HTML-wikiconverter.)

Its not complete but is already useful - I guess I may have some work
to do with images among other things.

To use this you'll need a Perl package (e.g. ActiveState's) and you'll
need to know how to run cpan or ppm to install html2wiki.

You'll need to put the following code in a file "flexwiki.pm" which
must be in the perl library path (e.g.
perl/site/lib/HTML/wikiconverter).

An example of how to run it:

html2wiki --dialect flexwiki somefile.html > somefile.wikitext


-----------------------------------------------------do not include
this delimiting line or the one at the end
-------------------------------------------------------------
package HTML::WikiConverter::flexwiki;

use warnings;
use strict;

use base 'HTML::WikiConverter';

our $VERSION = '0.01';

=head1 NAME

HTML::WikiConverter::flexwiki - Convert HTML to flexwiki markup

=head1 SYNOPSIS

  use HTML::WikiConverter;
  my $wc = new HTML::WikiConverter( dialect => 'flexwiki' );
  print $wc->html2wiki( $html );

=head1 DESCRIPTION

This module contains rules for converting HTML into flexwiki markup. See
L<HTML::WikiConverter> for additional usage details.

=cut

sub rules {
  my %rules = (
    hr => { replace => "\n----\n" },
    br => { replace => \&_br },

    h1 => { start => '!',      block => 1, trim => 'both', line_format
=> 'single' },
    h2 => { start => '!!',     block => 1, trim => 'both', line_format
=> 'single' },
    h3 => { start => '!!!',    block => 1, trim => 'both', line_format
=> 'single' },
    h4 => { start => '!!!!',   block => 1, trim => 'both', line_format
=> 'single' },
    h5 => { start => '!!!!!',  block => 1, trim => 'both', line_format
=> 'single' },
    h6 => { start => '!!!!!!', block => 1, trim => 'both', line_format
=> 'single' },

    blockquote => { start => \&_blockquote_start, trim => 'both',
block => 1, line_format => 'multi' },
    pre        => { line_prefix => ' ', block => 1 },
    p          => { block => 1, trim => 'both', line_format => 'multi' },

    b      => { start => "'''", end => "'''", line_format => 'single' },
    strong => { alias => 'b' },
    i      => { start => "''", end => "''", line_format => 'single' },
    em     => { alias => 'i' },
    tt     => { start => ' ', end => ' ', trim => 'both', line_format
=> 'single' },
    code   => { alias => 'tt' },

    big   => { start => "'+", end => "+'", line_format => 'single' },
    small => { start => "'-", end => "-'", line_format => 'single' },
    sup   => { start => "'^", end => "^'", line_format => 'single' },
    sub   => { start => "~", end => "~", line_format => 'single' },
    ins   => { start => '{+', end => '+}', line_format => 'single' },
    del   => { start => '-', end => '-', line_format => 'single' },

    ul => { line_format => 'multi', block => 1 },
    ol => { alias => 'ul' },
    li => { start => \&_li_start, trim => 'leading' },

    dl => { alias => 'ul' },
    dt => { start => \&_li_start, line_format => 'single', trim => 'both' },
    dd => { start => ': ' },

    a   => { replace => \&_link },
    img => { replace => \&_image },

    table => { start => \&_table_start, block => 1 },
    tr    => { start => "\n||", line_format => 'single' },
    td    => { start => \&_td_start, end => \&_td_end, trim => 'both' },
    th    => { alias => 'td' }
  );

  return \%rules;
}

sub _br {
  my( $self, $node, $rules ) = @_;
  return " [[<<]] " if $node->look_up( _tag => 'table' );
  return " \\\\\n";
}

sub _table_start {
  my( $self, $node, $rules ) = @_;
  my @attrs = qw/ border cellpadding cellspacing width bgcolor align /;
  return '|| '.$self->get_attr_str( $node, @attrs );
}

sub _td_start {
  my( $self, $node, $rules ) = @_;
  my $prefix = $node->tag eq 'th' ? '!' : '';

  my $align = $node->attr('align') || 'left';
  $prefix .= ' ' if $align eq 'center' or $align eq 'right';

  return $prefix;
}

sub _td_end {
  my( $self, $node, $rules ) = @_;
  my $colspan = $node->attr('colspan') || 1;
  my $suffix = ( '||' ) x $colspan;

  my $align = $node->attr('align') || 'left';
  $suffix = ' '.$suffix if $align eq 'center' or $align eq 'left';

  return $suffix;
}

sub _blockquote_start {
  my( $self, $node, $rules ) = @_;
  my @parent_bqs = $node->look_up( _tag => 'blockquote' );
  my $depth = @parent_bqs;

  my $start = ( '-' ) x $depth;
  return "\n".$start.'>';
}

sub _li_start {
  my( $self, $node, $rules ) = @_;
  my @parent_lists = $node->look_up( _tag => qr/ul|ol|dl/ );
  my $depth = @parent_lists;
  my $prefix = "        ";
  my $bullet = '';
  $bullet = '*' if $node->parent->tag eq 'ul';
  $bullet = '#' if $node->parent->tag eq 'ol';
  $bullet = ':' if $node->parent->tag eq 'dl';

  my $fix = ( $prefix ) x $depth;
  return "\n".$fix.$bullet;
}

sub _link {
  my( $self, $node, $rules ) = @_;
  return $self->_anchor($node, $rules) if $node->attr('name');

  my $url = $node->attr('href') || '';
  my $text = $self->get_elem_contents($node) || '';

  $url =~ s/%20/ /g;
  $url =~  s/[.,\+ ]/_/g; #encode (destructive)
  $url =~  s/#/#A/;     #anchors must start with capital letter
  $url =~  s/%..//;    #encode (destructive)
  return $url if $text eq $url;
  return "\"$text\":[$url]";
}

sub _anchor {
  my( $self, $node, $rules ) = @_;
  my $name = $node->attr('name') || '';
  $name =~ s/[.,\+ ]/_/g;          #encode (destructive)
  return ":A$name: "; #\n's are ineffective; \r's are effective but
may cause problems or regex'g underlying \n's  #pts was"[[#$name]]";
}

sub _image {
  my( $self, $node, $rules ) = @_;
  return $node->attr('src') || '';
}

sub preprocess_node {
  my( $self, $node ) = @_;
  my $tag = $node->tag || '';
  $self->_move_aname($node) if $tag eq 'a' and $node->attr('name');
  $self->caption2para($node) if $tag eq 'caption';
  if( $tag eq '~text' and $node->left and $node->left->tag and
$node->left->tag eq 'br' and !$node->look_up(_tag => 'pre') ) {
    ( my $text = $node->attr('text') ) =~ s/^\s+//;
    $node->attr( text => $text );
  }
}

sub postprocess_output{
        my ($self, $outref) [EMAIL PROTECTED];
        $$outref =~  s/(:\[)#A([^\]\n]*)]/$1PlayPage]#A$2/g;  # Anchor
reference not inside the [link]
        $$outref =~ s/\n(!+)('*)(:[^:\n]*: )([^\n]*)\n/$3\n$1$2$4/g; #anchor
definition PRECEDES any <Hn>
}

sub _move_aname {
  my( $self, $node ) = @_;

  my $name = $node->attr('name') || '';
  $node->attr( name => undef );

  my $aname = new HTML::Element( 'a', name => $name );
  $node->preinsert($aname);

  # Keep 'a href's around
  $node->replace_with_content->delete unless $node->attr('href');
}

=head1 AUTHOR

David J. Iberri, C<< <diberri at cpan.org> >>

=head1 BUGS

Please report any bugs or feature requests to
C<bug-html-wikiconverter-flexwiki at rt.cpan.org>, or through the web
interface at
L<http://rt.cpan.org/NoAuth/ReportBug.html?Queue=HTML-WikiConverter-flexwiki>.
I will be notified, and then you'll automatically be notified of
progress on your bug as I make changes.

=head1 SUPPORT

You can find documentation for this module with the perldoc command.

    perldoc HTML::WikiConverter::flexwiki

You can also look for information at:

=over 4

=item * AnnoCPAN: Annotated CPAN documentation

L<http://annocpan.org/dist/HTML-WikiConverter-flexwiki>

=item * CPAN Ratings

L<http://cpanratings.perl.org/d/HTML-WikiConverter-flexwiki>

=item * RT: CPAN's request tracker

L<http://rt.cpan.org/NoAuth/Bugs.html?Dist=HTML-WikiConverter-flexwiki>

=item * Search CPAN

L<http://search.cpan.org/dist/HTML-WikiConverter-flexwiki>

=back

=head1 COPYRIGHT & LICENSE

Copyright 2006 David J. Iberri, all rights reserved.

This program is free software; you can redistribute it and/or modify
it under the same terms as Perl itself.

=cut

1;
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

-------------------------------------------------------------------------
This SF.net email is sponsored by the 2008 JavaOne(SM) Conference 
Don't miss this year's exciting event. There's still time to save $100. 
Use priority code J8TL2D2. 
http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone
_______________________________________________
Flexwiki-users mailing list
Flexwiki-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/flexwiki-users

Reply via email to