OpenOffice hyphenation in Java - YESSSSSS

2005-03-10 Thread Jeremias Maerki
Look what I've just found:

http://linux.org.mt/projects/jtextcheck/jtextcheck-ooohyph-plugin/index.html

It's LGPL (and so are OO's hyphenation patterns if I remember correctly)
but if we provide a plug-in mechanism and host the actual adapter under
the LGPL at http://offo.sourceforge.net/ we could provide the same
hyphenation functionality as OO.

Info on LGPL use by Apache software:
http://wiki.apache.org/jakarta/Using_20LGPL_27d_20code
http://wiki.apache.org/jakarta/LicenceIssues

Now, can we please get rid of all hyphenation patterns in FOP? Please,
please. Just joking, but I'd feel better.

Jeremias Maerki



Re: Project offo distributes hyphenation pattern files for FOP

2004-09-13 Thread Clay Leeds
Thanks for setting this up, Simon. Please let me know if there is 
anything I can do to help.

Web Maestro Clay
Simon Pepping wrote:
Dear FOP users,
In February 2004 a large number of hyphenation pattern files were
removed from FOP's CVS repository due to licensing issues. These
hyphenation patterns were contributed to FOP under licenses which
allowed their free distribution, but under conditions which were felt
to be in contradiction with the Apache license, under which FOP is
distributed. Most files are licensed under the LaTeX Project Public
License (http://www.latex-project.org/lppl/index.html), which requires
that no modified version be published under the name of the original
source file. For details see the FOP wiki page
(http://nagoya.apache.org/wiki/apachewiki.cgi?FOPAudits/March2003).
I am please to announce that the hyphenation pattern files for FOP are
now made available by the project `Objects for Formatting Objects'
(offo) (http://offo.sourceforge.net/). They can be downloaded from
offo's project page (http://sourceforge.net/projects/offo/). At this
moment the homepage of the project is not yet ready. Therefore the
overview of the hyphenation pattern files and their licenses is
available from my web site,
http://www.leverkruid.nl/FOP/hyphenation.html. It is also contained in
the package file.
Regards, Simon
--
Peter B. West http://cv.pbw.id.au/



Re: Project offo distributes hyphenation pattern files for FOP

2004-09-13 Thread Simon Pepping
On Mon, Sep 13, 2004 at 08:03:08AM -0700, Clay Leeds wrote:
 Thanks for setting this up, Simon. Please let me know if there is 
 anything I can do to help.

Thanks for the offer. It was some work to set this up, although I
could build mostly on Jeremias' earlier work. I expect that from now
on the project will not give me much work. The only thing that needs
to happen, is a nicer design for the home page and the page
hyphenation.html. Perhaps a challenge for the web master?

It might be a good idea if you or someone else of the FOP team would
also be a project administrator, to make it less dependent on one
person.

Regards, Simon

-- 
Simon Pepping
home page: http://www.leverkruid.nl



Re: Project offo distributes hyphenation pattern files for FOP

2004-09-13 Thread Simon Pepping
On Sun, Sep 12, 2004 at 11:35:42AM +1000, Peter B. West wrote:
 Moved from fop-user.
 
 Simon,
 
 Under which license have you set up the SourceForge project?

I stated that contributions with any OSI-approved license or in the
Public Domain were acceptable. The discussion to get the idea
explained and this license statement accepted was lengthy. I took care
to state each license carefully in the covering page
http://sourceforge.net/offo/hyphenation.html. For a large part thanks
to Jeremias earlier work.

Simon 

-- 
Simon Pepping
home page: http://www.leverkruid.nl



Re: Project offo distributes hyphenation pattern files for FOP

2004-09-13 Thread Clay Leeds
On Sep 13, 2004, at 11:51 AM, Simon Pepping wrote:
On Mon, Sep 13, 2004 at 08:03:08AM -0700, Clay Leeds wrote:
Thanks for setting this up, Simon. Please let me know if there is
anything I can do to help.
Thanks for the offer. It was some work to set this up, although I
could build mostly on Jeremias' earlier work. I expect that from now
on the project will not give me much work. The only thing that needs
to happen, is a nicer design for the home page and the page
hyphenation.html. Perhaps a challenge for the web master?
I'd be happy to spend some time on this. I've got a couple ideas and 
will see if I can't come up with something soon.

It might be a good idea if you or someone else of the FOP team would
also be a project administrator, to make it less dependent on one
person.
Regards, Simon
Again, I'd be happy to oblige. I can also be another project 
administrator if you'd like.

Web Maestro Clay


Re: Project offo distributes hyphenation pattern files for FOP

2004-09-12 Thread Peter B. West
Moved from fop-user.
Simon,
Under which license have you set up the SourceForge project?
Peter
Simon Pepping wrote:
Dear FOP users,
In February 2004 a large number of hyphenation pattern files were
removed from FOP's CVS repository due to licensing issues. These
hyphenation patterns were contributed to FOP under licenses which
allowed their free distribution, but under conditions which were felt
to be in contradiction with the Apache license, under which FOP is
distributed. Most files are licensed under the LaTeX Project Public
License (http://www.latex-project.org/lppl/index.html), which requires
that no modified version be published under the name of the original
source file. For details see the FOP wiki page
(http://nagoya.apache.org/wiki/apachewiki.cgi?FOPAudits/March2003).
I am please to announce that the hyphenation pattern files for FOP are
now made available by the project `Objects for Formatting Objects'
(offo) (http://offo.sourceforge.net/). They can be downloaded from
offo's project page (http://sourceforge.net/projects/offo/). At this
moment the homepage of the project is not yet ready. Therefore the
overview of the hyphenation pattern files and their licenses is
available from my web site,
http://www.leverkruid.nl/FOP/hyphenation.html. It is also contained in
the package file.
Regards, Simon
--
Peter B. West http://cv.pbw.id.au/


DO NOT REPLY [Bug 27773] - [PATCH] Hyphenation

2004-05-29 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=27773.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=27773

[PATCH] Hyphenation

[EMAIL PROTECTED] changed:

   What|Removed |Added

 Status|NEW |RESOLVED
 Resolution||FIXED



--- Additional Comments From [EMAIL PROTECTED]  2004-05-29 08:56 ---
Applied. Thanks.

Simon


DO NOT REPLY [Bug 28431] - [PATCH] Hyphenation of words with punctuation marks

2004-05-29 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=28431.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=28431

[PATCH] Hyphenation of words with punctuation marks

[EMAIL PROTECTED] changed:

   What|Removed |Added

 Status|NEW |RESOLVED
 Resolution||FIXED



--- Additional Comments From [EMAIL PROTECTED]  2004-05-29 09:20 ---
Patch applied. Thanks.
Simon


Re: User configuration for hyphenation

2004-05-11 Thread J.Pietschmann
Glen Mazza wrote:
I believe the same logger reference would just be used with each thread 
instead,
es, and this is exactly the problem. It is conceivable that
different loggers might be used for different processors which
run in concurrent threads. A static logger reference prohibits
this.
J.Pietschmann


Re: User configuration for hyphenation

2004-05-11 Thread Peter B. West
J.Pietschmann wrote:
Glen Mazza wrote:

I believe the same logger reference would just be used with each 
thread instead,


es, and this is exactly the problem. It is conceivable that
different loggers might be used for different processors which
run in concurrent threads. A static logger reference prohibits
this.
However, the logger naming is purely conventional (at least in 1.4 
logging), so if the need for different logging in different threads 
arose, it could be accommodated by appending a thread-based suffix to 
the logger name.  Or was there some other problem you had in mind here?

Peter
--
Peter B. West http://www.powerup.com.au/~pbwest/resume.html


Re: User configuration for hyphenation

2004-05-10 Thread Simon Pepping
On Sun, May 09, 2004 at 11:52:31AM -0400, Glen Mazza wrote:
 
 Joerg I believe just recommended storing the user configuration in 
 FOUserAgent, which is already getting passed around everywhere.  Using 
 my favorite design pattern, Moving Part Reduction, I also think it would 
 be ideal if we could place all of this in FOUserAgent.  (I also suspect 
 that much of what will be in FOUserAgent/User configuration will be 
 variable over time, which may also suggest we keep this all in one 
 class.)  BTW, what other things besides hyphenation needs to go into 
 user configuration/fo user agent, say for 1.0? 

I like Joerg's idea of a Global Context object. It seems to have a
wider scope than FOUserAgent. An FOUserAgent object could be part of
it.

From the maintenance code's configuration it would need to have
baseDir, fontBaseDir, hyphenation-dir and strokeSVGText. Font
information per renderer has already been implemented; only the pdf
renderer really uses it. PDF filter lists have also been implemented
in the pdf renderer.
 
 Two more observations:
 
 1.)  What does the team think of placing all the FOUserAgent information 
 into the abstract apps.InputHandler class by either (a) full 
 incorporation (i.e., getting rid of FOUserAgent and moving everything in 
 it to InputHandler), or (b) including an FOUserAgent instance variable 
 in apps.InputHandler, or (c) having InputHandler extend FOUserAgent? 
 
 Then, we would just pass InputHandler around instead of FOUserAgent.  I 
 mention this because InputHandler includes the file name information 
 already, that is sometimes good to have at various parts of the pipeline 
 (for example, error messages or AWTRenderer, whose refresh capability 
 requires knowing the input source file names.)
 
 I'm not sure of the value of this idea, but am just bringing it up for 
 others to comment.

I would prefer input file information in the Global Context
object. InputHandler has more information than needed elsewhere in the
process.
 
 2.)  For the fop.image package, and future packages which have a high 
 chance of being used for transcoder or other non-FOP purposes, I'd like 
 us to factor out the FOUserAgent.  We had a small problem a few months 
 back when Thomas DeWeese was updating our fop.image package for the 
 transcoder and had trouble trying to figure out how to create an 
 FOUserAgent in order to supply a method parameter.  (Turned out only a 
 logger within FOUserAgent was being used, so I changed the method 
 signature to just require that.) 
 
 More generally, explicitly send in the needed values from FOUserAgent 
 into the called package (eliminating the need for other callees to 
 create an FOUserAgent instance), or, for those packages which require 
 lots of variables in order to be properly initialized, create the 
 collection class (i.e., like a ImageUserAgent) in that package, not in 
 the callee's package.  Then *any* callee can just import that collection 
 class, fill it up, and send it back to the called package.

Would this not be modelled by a set of interfaces? Packages or other
parts of FOP could define interfaces providing the information they
need. A Global Context would probably implement them all. Users
addressing only part of FOP may create their own implementation of the
relevant interface.

Regards, Simon

-- 
Simon Pepping
home page: http://www.leverkruid.nl



Re: User configuration for hyphenation

2004-05-10 Thread Clay Leeds
Simon Pepping wrote:
I like Joerg's idea of a Global Context object. It seems to have a
wider scope than FOUserAgent. An FOUserAgent object could be part of
it.
From the maintenance code's configuration it would need to have
baseDir, fontBaseDir, hyphenation-dir and strokeSVGText. Font
information per renderer has already been implemented; only the pdf
renderer really uses it. PDF filter lists have also been implemented
in the pdf renderer.
Sounds good to me. While we're on the subject of '*Dir' configuration, 
now may be a good time to rename the hyphenation-dir object to something 
that is consistent with the other directory-related items:

  hyphenationBaseDir

We could also shorten the name to something like 'hyphBaseDir', but I 
don't think it's worth it. Better to have clear names (and naming 
conventions).

On Sun, May 09, 2004 at 11:52:31AM -0400, Glen Mazza wrote:
snip what=FoUserAgent stuff
Two more observations:

1.)  What does the team think of placing all the FOUserAgent information 
into the abstract apps.InputHandler class by either (a) full 
incorporation (i.e., getting rid of FOUserAgent and moving everything in 
it to InputHandler), or (b) including an FOUserAgent instance variable 
in apps.InputHandler, or (c) having InputHandler extend FOUserAgent? 

Then, we would just pass InputHandler around instead of FOUserAgent.  I 
mention this because InputHandler includes the file name information 
already, that is sometimes good to have at various parts of the pipeline 
(for example, error messages or AWTRenderer, whose refresh capability 
requires knowing the input source file names.)

I'm not sure of the value of this idea, but am just bringing it up for 
others to comment.


I would prefer input file information in the Global Context
object. InputHandler has more information than needed elsewhere in the
process.
It's not clear to me how these changes would affect things, but I would 
hope the following remains true:

If FOP is 'compartmentalized' (modularized, etc.) to enable embedded or 
external 'non-FOP' usage, I would hope that any of the changes to 
FOUserAgent being discussed, do not impact integration via external 
usage (be it, cocoon, forrest or some other un-dreamed-of tool). In 
other words, my hope is that these types of changes to FOP make it even 
more desirable as a product to 'build on' than it already is.

Web Maestro Clay


Re: User configuration for hyphenation

2004-05-10 Thread Glen Mazza
J.Pietschmann wrote:

Mutable static data, like a static logger reference, interferes
severely with using FOP in an MT environment, because this means
one thread rendering globally rather than one thread per FO
processor rendering, perhaps using multiple processors, each one
in a separate thread.


I believe the same logger reference would just be used with each thread 
instead, written to by multiple threads (which is another problem, but 
not this one.)  Further, commons loggers are instantiated with a log 
name, usually the class name where it appears, with the same logger used 
only if the log name is the same.

We probably still have some unnecessary get/set logger references, such 
as in FOUserAgent, no longer needed due to the architecture of Commons 
Logging (i.e., any class doing logging can now call a LogFactory to get 
a logging instance.)  Getting rid of them would possibly alleviate some 
of these issues.  Also, I'm not certain of the need for logging, at 
least to the degree that we are doing it--many other applications at our 
level or below--Xerces, Xalan, Batik, compilers--don't bother with it, 
relying on System.out/err.println() or glorified versions thereof. 

Glen



Re: User configuration for hyphenation

2004-05-09 Thread J.Pietschmann
Simon Pepping wrote:
But
hyphenation is done deep down in the process, where any reference to
the start-up objects is lost. How can I configure it?
The idea is to use an object which represents some global context
and is reachable from nearly anywhere in the FO tree and LM tree.
I'd think the user agent should qualify.
Static objects are bad because of the usual MT issues (yeah,
even for logging).
J.Pietschmann


Re: User configuration for hyphenation

2004-05-09 Thread Glen Mazza
J.Pietschmann wrote:

Static objects are bad because of the usual MT issues (yeah,
even for logging).
Should FOP itself multithread, or would it be better to let whatever 
would call FOP do the multithreading?  I don't know what Xalan does with 
their translets, I suspect they just rely on the caller to implement it.

Also, would implementation of FOP MT cause the caller's MT to become 
inefficient or otherwise messed up?  I think I can see more use cases of 
letting FOP's caller handle the MT (e.g., servlet implementing MT to 
return PDF's simultaneously to hundreds of users, one PDF generation on 
each thread) than of FOP doing it itself, at the same time the caller 
has implemented it.

Glen


Re: User configuration for hyphenation

2004-05-09 Thread Glen Mazza
Jeremias Maerki wrote:

The first two frames are in static context. The LMs share a reference
to a user agent object, of type
layoutmgr.AbstractLayoutManager.userAgent. Would this be the right
object to hold a reference to the user configuration?
   

On first thought, yes. But I feel the scope of the FOUserAgent is still
somewhat undefined. Maybe we need to finally settle this issue. Until
this is done I consider it safe to hold the user configuration in there.
 

Joerg I believe just recommended storing the user configuration in 
FOUserAgent, which is already getting passed around everywhere.  Using 
my favorite design pattern, Moving Part Reduction, I also think it would 
be ideal if we could place all of this in FOUserAgent.  (I also suspect 
that much of what will be in FOUserAgent/User configuration will be 
variable over time, which may also suggest we keep this all in one 
class.)  BTW, what other things besides hyphenation needs to go into 
user configuration/fo user agent, say for 1.0? 

Two more observations:

1.)  What does the team think of placing all the FOUserAgent information 
into the abstract apps.InputHandler class by either (a) full 
incorporation (i.e., getting rid of FOUserAgent and moving everything in 
it to InputHandler), or (b) including an FOUserAgent instance variable 
in apps.InputHandler, or (c) having InputHandler extend FOUserAgent? 

Then, we would just pass InputHandler around instead of FOUserAgent.  I 
mention this because InputHandler includes the file name information 
already, that is sometimes good to have at various parts of the pipeline 
(for example, error messages or AWTRenderer, whose refresh capability 
requires knowing the input source file names.)

I'm not sure of the value of this idea, but am just bringing it up for 
others to comment.

2.)  For the fop.image package, and future packages which have a high 
chance of being used for transcoder or other non-FOP purposes, I'd like 
us to factor out the FOUserAgent.  We had a small problem a few months 
back when Thomas DeWeese was updating our fop.image package for the 
transcoder and had trouble trying to figure out how to create an 
FOUserAgent in order to supply a method parameter.  (Turned out only a 
logger within FOUserAgent was being used, so I changed the method 
signature to just require that.) 

More generally, explicitly send in the needed values from FOUserAgent 
into the called package (eliminating the need for other callees to 
create an FOUserAgent instance), or, for those packages which require 
lots of variables in order to be properly initialized, create the 
collection class (i.e., like a ImageUserAgent) in that package, not in 
the callee's package.  Then *any* callee can just import that collection 
class, fill it up, and send it back to the called package.

Glen


Re: User configuration for hyphenation

2004-05-09 Thread Glen Mazza
Glen Mazza wrote:

I don't know what Xalan does with their translets, I suspect they just 
rely on the caller to implement it.

Oops--my error--the translets just generate Java code.  My comment was 
in general for the need of XML processors such as Xalan and FOP to 
implement MT internally.

Glen


Re: User configuration for hyphenation

2004-05-09 Thread J.Pietschmann
Glen Mazza wrote:
Should FOP itself multithread, or would it be better to let whatever 
would call FOP do the multithreading?
I don't understand this. Even if the main processing methods
of the FO processing object are synchronizend, which is probaly
what you understand by FOP isn't MT safe itself, the user
can create multiple processors and use them concurrently.
Mutable static data, like a static logger reference, interferes
severely with using FOP in an MT environment, because this means
one thread rendering globally rather than one thread per FO
processor rendering, perhaps using multiple processors, each one
in a separate thread. That's going to raise complaints, check
the posts complaining about the global options in the maintenance
branch.
I'd say we may use static data only in a few cases:
- Immutable data, like name mappings and fallback options
- Global font and perhaps image caches
- Object pools (although they are said to decrease performance
 for modern JREs)
J.Pietschmann


Re: User configuration for hyphenation

2004-05-09 Thread J.Pietschmann
Glen Mazza wrote:
BTW, what other things besides hyphenation needs to go into 
user configuration/fo user agent, say for 1.0?
Various strategy parameters, once they are implemented, like
line breaking strategy; furthermore callbacks for redrawing
pages in a GUI renderer, font management, base URL for images
etc.
J.Pietschmann


Re: User configuration for hyphenation

2004-05-09 Thread Jeremias Maerki

On 08.05.2004 23:04:40 Simon Pepping wrote:
 I am trying to enable user configuration of the directory of
 hyphenation files, like in the maintenance code. Configuration of the
 renderers is easy, because they are instantiated in Fop.main(). But
 hyphenation is done deep down in the process, where any reference to
 the start-up objects is lost. How can I configure it?
 
 If I am correct, in the maintenance code the user configuration could
 be accessed by a static method, anywhere in the process, just like a
 log file in commons logging. I suppose that is not the preferred
 way of accessing configuration in HEAD?

Indeed. Logging is a special case where static lookup is in order. For
the other cases lookup via references is preferred (IMO).

 This is the stack frame:
 
   [1] org.apache.fop.layout.hyphenation.Hyphenator.getHyphenationTree 
 (Hyphenator.java:67)
   [2] org.apache.fop.layout.hyphenation.Hyphenator.hyphenate (Hyphenator.java:232)
   [3] org.apache.fop.layoutmgr.LineLayoutManager.getHyphenContext 
 (LineLayoutManager.java:479)
   [4] org.apache.fop.layoutmgr.LineLayoutManager.getNextBreakPoss 
 (LineLayoutManager.java:228)
   [5] org.apache.fop.layoutmgr.BlockLayoutManager.getNextBreakPoss 
 (BlockLayoutManager.java:204)
   [6] org.apache.fop.layoutmgr.FlowLayoutManager.getNextBreakPoss 
 (FlowLayoutManager.java:79)
   [7] org.apache.fop.layoutmgr.PageLayoutManager.getNextBreakPoss 
 (PageLayoutManager.java:229)
   [8] org.apache.fop.layoutmgr.PageLayoutManager.doLayout 
 (PageLayoutManager.java:194)
 
 The first two frames are in static context. The LMs share a reference
 to a user agent object, of type
 layoutmgr.AbstractLayoutManager.userAgent. Would this be the right
 object to hold a reference to the user configuration?

On first thought, yes. But I feel the scope of the FOUserAgent is still
somewhat undefined. Maybe we need to finally settle this issue. Until
this is done I consider it safe to hold the user configuration in there.
We can change that easily later.


Jeremias Maerki



User configuration for hyphenation

2004-05-08 Thread Simon Pepping
Hi,

I am trying to enable user configuration of the directory of
hyphenation files, like in the maintenance code. Configuration of the
renderers is easy, because they are instantiated in Fop.main(). But
hyphenation is done deep down in the process, where any reference to
the start-up objects is lost. How can I configure it?

If I am correct, in the maintenance code the user configuration could
be accessed by a static method, anywhere in the process, just like a
log file in commons logging. I suppose that is not the preferred
way of accessing configuration in HEAD?

This is the stack frame:

  [1] org.apache.fop.layout.hyphenation.Hyphenator.getHyphenationTree 
(Hyphenator.java:67)
  [2] org.apache.fop.layout.hyphenation.Hyphenator.hyphenate (Hyphenator.java:232)
  [3] org.apache.fop.layoutmgr.LineLayoutManager.getHyphenContext 
(LineLayoutManager.java:479)
  [4] org.apache.fop.layoutmgr.LineLayoutManager.getNextBreakPoss 
(LineLayoutManager.java:228)
  [5] org.apache.fop.layoutmgr.BlockLayoutManager.getNextBreakPoss 
(BlockLayoutManager.java:204)
  [6] org.apache.fop.layoutmgr.FlowLayoutManager.getNextBreakPoss 
(FlowLayoutManager.java:79)
  [7] org.apache.fop.layoutmgr.PageLayoutManager.getNextBreakPoss 
(PageLayoutManager.java:229)
  [8] org.apache.fop.layoutmgr.PageLayoutManager.doLayout (PageLayoutManager.java:194)

The first two frames are in static context. The LMs share a reference
to a user agent object, of type
layoutmgr.AbstractLayoutManager.userAgent. Would this be the right
object to hold a reference to the user configuration?

Your advice would be appreciated.

Regards, Simon

-- 
Simon Pepping
home page: http://www.leverkruid.nl



DO NOT REPLY [Bug 28431] - [PATCH] Hyphenation of words with punctuation marks

2004-04-26 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=28431.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=28431

[PATCH] Hyphenation of words with punctuation marks





--- Additional Comments From [EMAIL PROTECTED]  2004-04-26 19:35 ---
Luca,

I agree with item 1. Although the spaces before the hyphenated word
are often incorrect, the word is hyphenated correctly, and the patch
solves the problem it set out to solve. 

I also agree with item 2. The break up of the word is incorrect. Also
the spaces before the hyphenated word are often incorrect. However,
this problem is not caused by the patch, it is just revealed by it.

The patch can be applied. The break-up and spacing problems should be
solved in a separate effort.

Simon


DO NOT REPLY [Bug 28431] - Hyphenation of words with punctuation marks

2004-04-19 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=28431.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=28431

Hyphenation of words with punctuation marks





--- Additional Comments From [EMAIL PROTECTED]  2004-04-19 10:30 ---
Simon,

concernig names, unnecessary if, etc. , I agree with you.

It seems to me that your change concerning hyphenation exceptions works, 
otherwise the hyphenation points would appear in the wrong place because of 
the punctuation marks.

The strange pdf generated is due, IMO, to a couple of problems:

-1-
In the last test case the text is (quite oddly) divided among 3 TextLM
 **[...]** (philanthrop
 ic).
  
Specifying the property linefeed-treatment=ignore, the text is all in a TLM.
Removing from the test file the linefeed after (philanthropic)., the text is 
still split in two parts:
 ***
 *** (philanthropic).
So, it seems there is an irksome bug affecting text splitting.

-2-
The last line in a justified paragraph is sometimes justified too (bug 28314).
The phantom linefeed is by default treated as a space, and so it is adjusted.
Anyway, I was pleased to notice that, although shattered, the word is 
correctly collected and hyphenated.

Regards

Luca


Re: [Bug 27773] - [PATCH] Hyphenation

2004-04-16 Thread Luca Furini

 I think it would be better to report this item in a patch of
 its own. It really is a new issue.

Ok, sorry.
I'm going to do as you suggest.

Luca




DO NOT REPLY [Bug 28431] New: - Hyphenation of words with punctuation marks

2004-04-16 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=28431.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=28431

Hyphenation of words with punctuation marks

   Summary: Hyphenation of words with punctuation marks
   Product: Fop
   Version: 1.0dev
  Platform: PC
OS/Version: Windows XP
Status: NEW
  Severity: Normal
  Priority: Other
 Component: page-master/layout
AssignedTo: [EMAIL PROTECTED]
ReportedBy: [EMAIL PROTECTED]


I have found a small bug concerning hyphenation in the 
HyphenationTree.hyphenate() method.
Before checking the exception list or using the algorithm, the 
function normalizes the word: during this phase, if a non-letter character 
is found null is returned.
// normalize word
char[] c = new char[2];
for (i = 1; i = len; i++) {
c[0] = w[offset + i - 1];
int nc = classmap.find(c, 0);
if (nc  0) {// found a non-letter character, abort
return null;
}
word[i] = (char)nc;
}
I think the condition (nc  0) is too strong: at the moment words followed by 
punctuation marks, or in parenthesis, are not hyphenated.
So, for example, the word suggestion can be hyphenated, but suggestion. 
and (suggestion), cannot.

This is how I tried to fix this problem:
- non-letter characters at the beginning are not copied into word[]
- if a non-letter character is found which is not at the beginning, it is not 
copied into word[] and a boolean variable becomes true
- if a letter-character is found when the variable is true, null is returned; 
otherwise, word[] is used to find hyphenation points

I have also added a little optimization: if, after the normalization and the 
non-letter character removal, the word size is less than (remainCharCount + 
pushCharCount), null is returned, without checking the exception list and 
performing the algorithm.

I'm going to attach the proposed patch and a test fo file which shows a few 
examples.

Regards

Luca


DO NOT REPLY [Bug 28431] - Hyphenation of words with punctuation marks

2004-04-16 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=28431.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=28431

Hyphenation of words with punctuation marks





--- Additional Comments From [EMAIL PROTECTED]  2004-04-16 13:29 ---
Created an attachment (id=11258)
proposed patch to HyphenationTree


DO NOT REPLY [Bug 28431] - Hyphenation of words with punctuation marks

2004-04-16 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=28431.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=28431

Hyphenation of words with punctuation marks





--- Additional Comments From [EMAIL PROTECTED]  2004-04-16 13:30 ---
Created an attachment (id=11259)
test fo file: words with punctuation marks and parenthesis


DO NOT REPLY [Bug 28431] - Hyphenation of words with punctuation marks

2004-04-16 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=28431.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=28431

Hyphenation of words with punctuation marks





--- Additional Comments From [EMAIL PROTECTED]  2004-04-16 19:44 ---
Luca,

The patch works well.

I do not find the name bAfterLetter very clear. It really is
bNonLetterAfterLetters, but that is too long. I find bEndOfLetters a
reasonable choice.

The 'else if (!bAfterLetter)' might as well be just 'else'.

The venom is in the tail. I do not know the details of this part of
hyphenation. Your addition of 'iIgnoreAtBeginning' seems OK. I think
you should also add 'iIgnoreAtBeginning' in the if branch (hyphenation
exceptions), but the results of a test fo are not quite in
favour. Perhaps you can have a look into this.

I added a long comment explaining various features, perhaps most to
myself.

I added cases to the test fo showing a word that is too short (when
one adds debug logging, one sees the effect), and 4 cases with a
hyphenation exception word.

Regards, Simon


DO NOT REPLY [Bug 28431] - Hyphenation of words with punctuation marks

2004-04-16 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=28431.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=28431

Hyphenation of words with punctuation marks





--- Additional Comments From [EMAIL PROTECTED]  2004-04-16 19:45 ---
Created an attachment (id=11264)
An expanded test fo file


DO NOT REPLY [Bug 28431] - Hyphenation of words with punctuation marks

2004-04-16 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=28431.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=28431

Hyphenation of words with punctuation marks





--- Additional Comments From [EMAIL PROTECTED]  2004-04-16 19:46 ---
Created an attachment (id=11265)
A slightly modified patch


DO NOT REPLY [Bug 28431] - Hyphenation of words with punctuation marks

2004-04-16 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=28431.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=28431

Hyphenation of words with punctuation marks





--- Additional Comments From [EMAIL PROTECTED]  2004-04-17 05:17 ---
Your assumptions appear correct, I checked the Washington Post newspaper and saw
that hyphenation does indeed occur with words that have a period or comma at the
end of them.

Glen


Re: [Bug 27773] - [PATCH] Hyphenation

2004-04-15 Thread Simon Pepping
Luca,

I have noticed this problem, and it is nice to see that you are taking
it on. I think it would be better to report this item in a patch of
its own. It really is a new issue.

Simon

On Wed, Apr 14, 2004 at 07:00:10PM -, [EMAIL PROTECTED] wrote:
 --- Additional Comments From [EMAIL PROTECTED]  2004-04-14 19:00 ---
 Yes, I was quite surprised to see that all the information stored in the 
 BreakPoss was thrown away before adding areas; I chose to duplicate the 
 needed values because this involved fewer and less radical changes.
 
 I have found another small bug concerning hyphenation in the 
 HyphenationTree.hyphenate() method.
 Before checking the exception list or using the algorithm, the 
 function normalizes the word: during this phase, if a non-letter character 
 is found null is returned.
 // normalize word
 char[] c = new char[2];
 for (i = 1; i = len; i++) {
 c[0] = w[offset + i - 1];
 int nc = classmap.find(c, 0);
 if (nc  0) {// found a non-letter character, abort
 return null;
 }
 word[i] = (char)nc;
 }
 I think the condition (nc  0) is too strong: at the moment words followed by 
 punctuation marks, or in parenthesis, are not hyphenated.
 
 This is how I tried to fix this problem:
 - non-letter characters at the beginning are not copied into word[]
 - if a non-letter character is found which is not at the beginning, it is not 
 copied into word[] and a boolean variable becomes true
 - if a letter-character is found when the variable is true, null is returned; 
 otherwise, word[] is used to find hyphenation points
 
 I have also added a little optimization: if, after the normalization and the 
 non-letter character removal, the word size is less than (remainCharCount + 
 pushCharCount), null is returned, without checking the exception list and 
 performing the algorithm.
 
 I'm going to attach the proposed patch and a test fo file which shows a few 
 examples.
 
 Regards
 
 Luca

-- 
Simon Pepping
home page: http://www.leverkruid.nl



DO NOT REPLY [Bug 27773] - [PATCH] Hyphenation

2004-04-14 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=27773.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=27773

[PATCH] Hyphenation





--- Additional Comments From [EMAIL PROTECTED]  2004-04-14 19:00 ---
Yes, I was quite surprised to see that all the information stored in the 
BreakPoss was thrown away before adding areas; I chose to duplicate the 
needed values because this involved fewer and less radical changes.

I have found another small bug concerning hyphenation in the 
HyphenationTree.hyphenate() method.
Before checking the exception list or using the algorithm, the 
function normalizes the word: during this phase, if a non-letter character 
is found null is returned.
// normalize word
char[] c = new char[2];
for (i = 1; i = len; i++) {
c[0] = w[offset + i - 1];
int nc = classmap.find(c, 0);
if (nc  0) {// found a non-letter character, abort
return null;
}
word[i] = (char)nc;
}
I think the condition (nc  0) is too strong: at the moment words followed by 
punctuation marks, or in parenthesis, are not hyphenated.

This is how I tried to fix this problem:
- non-letter characters at the beginning are not copied into word[]
- if a non-letter character is found which is not at the beginning, it is not 
copied into word[] and a boolean variable becomes true
- if a letter-character is found when the variable is true, null is returned; 
otherwise, word[] is used to find hyphenation points

I have also added a little optimization: if, after the normalization and the 
non-letter character removal, the word size is less than (remainCharCount + 
pushCharCount), null is returned, without checking the exception list and 
performing the algorithm.

I'm going to attach the proposed patch and a test fo file which shows a few 
examples.

Regards

Luca


DO NOT REPLY [Bug 27773] - [PATCH] Hyphenation

2004-04-14 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=27773.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=27773

[PATCH] Hyphenation





--- Additional Comments From [EMAIL PROTECTED]  2004-04-14 19:03 ---
Created an attachment (id=11238)
test fo file: words with punctuation marks and parenthesis


DO NOT REPLY [Bug 27773] - [PATCH] Hyphenation

2004-04-09 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=27773.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=27773

[PATCH] Hyphenation





--- Additional Comments From [EMAIL PROTECTED]  2004-04-09 09:28 ---
Hi Luca,

Your patch looks good. It works fine on your test fo, and it also
works fine on a test fo I had lying around.

There are a few things to note:

AreaInfo.bHyphenated is identical to (flags  BreakPoss.HYPHENATED);
ai.ipdArea = MinOptMax.add(ai.ipdArea, new MinOptMax(hyphIPD)) is
equivalent to bp.setStackingSize(MinOptMax.add(ipd, new
MinOptMax(hyphIPD))). In other words, the same information is present
in the BreakPoss, which is held by the LineLM. Unfortunately, this
information is not accessible in the addAreas method of TextLM. This
is so because posIter.next() returns the position member of the
BreakPoss, not the BreakPoss itself. I believe that is an error in
PositionIterator which eventually needs correction. For now your
solution is a good one.

Your proposed TextInfo.hypChar is also known as
LineLM.hyphProps.hyphenationChar. This information is not passed on to
childLMs and thus not known to TextLM. Storing it in TextInfo seems a
good idea; perhaps it is not even needed in hyphProps.

It is not completely clear that 

+if (ai.bHyphenated) {
+str += foText.textInfo.hyphChar;
+ai.ipdArea = MinOptMax.add(ai.ipdArea, new MinOptMax(hyphIPD));
+}

is always correct. This happens on the last AI returned by
posIter.next(), which is not necessarily the last one in the line. But
it is hard to conceive of an AI which is last in its LM and also
hyphenated; therefore it will work. Ideally, the LineLM indicates
which BP is the one ending the line; perhaps this needs correction at
some time.

Regards, Simon


DO NOT REPLY [Bug 27773] - [PATCH] Hyphenation

2004-04-05 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=27773.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=27773

[PATCH] Hyphenation





--- Additional Comments From [EMAIL PROTECTED]  2004-04-05 09:22 ---
Hi all

I apologize for having posted some proposed changes without a line explaining 
what I was changing and why.
Really sorry, I'm going to try and do it now.

First of all, the condition now tested in the 
LineLayoutManager.getNextBreakPoss() to decide whether or not to try 
hyphenation is:
  if (bTextAlignment == TextAlign.JUSTIFY || prevBP == null) {
  ...
  }
So hyphenation is tried for *every* justified block of text; only if 
the language attribute is not set the getHyphenContext() method returns null 
and hyphenation is not actually done.
My suggestion is to change it so that the fo property hyphenate (7.9.4 in the 
recommendation, and already implemented in the CommonHyphenation class) is 
tested instead:
  if (hyphProps.hyphenate == Constants.TRUE) {
  ...
  }
In the TextInfo class there is a boolean attribute called bCanHyphenate, but 
it is given a default value of true, and it is no more modified. It seems to 
me that it is quite useless.
Is is used in the TextLayoutManager.getHyphenIPD() method:
  if (textArray.length  iStopIndex || foText.textInfo.bCanHyphenate == false) 
{
  ...
  }
but this method is called only if context.tryHyphenate() is true; with my 
suggested change, context.tryHyphenate() returns true only if the fo property 
hyphenate is true, and so it is no more necessary to test 
foText.textInfo.bCanHyphenate.

The second problem is that the hyphenation character is not shown: to fix this 
I added in the AreaInfo class (which stores information about the area that 
will be generated) a boolean attribute bHyphenated.
Is is set according to the value of the flags of the corresponding BreakPoss, 
and it is tested in the TextLayoutManager.addAreas() method to decide whether 
to add the hyphen.

Last (and least): I tried to implement the fo property hyphenation-character 
(7.9.5 in the recommendation).
As it applies to blocks, I thought to store it as a character variable in the 
TextInfo class, with initial value -.
It is set in the PropertyManager.getTextLayoutProps() method, together with 
the other TextInfo attribute.
Is is used to calculate the hyphIPD in the TextLayoutManager constructor, and 
in the addAreas method.

Bye
Luca


DO NOT REPLY [Bug 27773] - [PATCH] Hyphenation

2004-04-05 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=27773.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=27773

[PATCH] Hyphenation





--- Additional Comments From [EMAIL PROTECTED]  2004-04-05 16:40 ---
Sorry, I'm here again (hope I am not making too much a mess!) :-)

I'm going to attach an updated proposed patch, as I noticed there was an error 
in the one I posted before.
In the TextLayoutManager.addAreas() method, the hyphen size must be added to 
the total inline progression dimension of the line *before* the adjustment 
calculation; this involves also moving a few lines of code.

Bye
Luca


DO NOT REPLY [Bug 27773] - [PATCH] Hyphenation

2004-04-05 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=27773.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=27773

[PATCH] Hyphenation





--- Additional Comments From [EMAIL PROTECTED]  2004-04-05 16:41 ---
Created an attachment (id=11138)
updated diff


DO NOT REPLY [Bug 27773] - [PATCH] Hyphenation

2004-03-19 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=27773.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=27773

[PATCH] Hyphenation





--- Additional Comments From [EMAIL PROTECTED]  2004-03-19 11:38 ---
Created an attachment (id=10868)
patch created using cvs diff


DO NOT REPLY [Bug 27773] - [PATCH] Hyphenation

2004-03-19 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=27773.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=27773

[PATCH] Hyphenation





--- Additional Comments From [EMAIL PROTECTED]  2004-03-19 11:39 ---
Created an attachment (id=10869)
FO sample file to test the behavior of fop before and after the patch


DO NOT REPLY [Bug 27773] New: - [PATCH] Hyphenation

2004-03-18 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://issues.apache.org/bugzilla/show_bug.cgi?id=27773.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=27773

[PATCH] Hyphenation

   Summary: [PATCH] Hyphenation
   Product: Fop
   Version: 1.0dev
  Platform: PC
OS/Version: Windows XP
Status: NEW
  Severity: Enhancement
  Priority: Other
 Component: page-master/layout
AssignedTo: [EMAIL PROTECTED]
ReportedBy: [EMAIL PROTECTED]


I have tried to solve a few problems concerning hyphenation:
- show the '-' at the end of the hyphenated lines
- use the fo:hyphenate property to enable hyphenation, instead of the alignment
- specify the hyphenation character using the fo:hyphenation-character property

With this simple .fo file it is possible to see that before changes the result 
is the opposite of what expected: hyphenation is done when it shoud not, and 
not when it shoud.

?xml version=1.0 encoding=iso-8859-1?
fo:root xmlns:fo=http://www.w3.org/1999/XSL/Format;
  fo:layout-master-set
fo:simple-page-master master-name=simple margin=0.5in
  fo:region-body margin=1in/
/fo:simple-page-master
  /fo:layout-master-set
  fo:page-sequence master-reference=simple
fo:flow flow-name=xsl-region-body
  fo:blockThe following block has language=it, hyphenate=false and 
text-align=justify; it should NOT be hyphenated./fo:block
  fo:block start-indent=0.2in end-indent=0.2in language=it 
hyphenate=false text-align=justifyLunghissime parolone esagerevoli 
costringerebbero sillabazione auspicabilmente: precipitevolissimevolmente 
transatlantico catastrofico./fo:block
  fo:blockThe following block has language=it, hyphenate=true and 
text-align=start; it should be hyphenated./fo:block
  fo:block start-indent=0.2in end-indent=0.2in language=it 
hyphenate=true text-align=startLunghissime parolone esagerevoli 
costringerebbero sillabazione auspicabilmente: precipitevolissimevolmente 
transatlantico catastrofico./fo:block
  fo:blockThe following block has language=it, hyphenate=true, text-
align=justify and hyphenation-character=_./fo:block
  fo:block start-indent=0.2in end-indent=0.2in language=it 
hyphenate=true text-align=justify hyphenation-character=_Lunghissime 
parolone esagerevoli costringerebbero sillabazione auspicabilmente: 
precipitevolissimevolmente transatlantico catastrofico./fo:block
/fo:flow
  /fo:page-sequence
/fo:root


This is the diff:

Index: xml-fop/src/java/org/apache/fop/fo/PropertyManager.java
===
RCS file: /home/cvspublic/xml-
fop/src/java/org/apache/fop/fo/PropertyManager.java,v
retrieving revision 1.24
diff -w -u -r1.24 PropertyManager.java
--- xml-fop/src/java/org/apache/fop/fo/PropertyManager.java 27 Feb 2004 
17:57:40 -  1.24
+++ xml-fop/src/java/org/apache/fop/fo/PropertyManager.java 18 Mar 2004 
12:43:55 -
@@ -494,6 +494,9 @@
 textInfo.textTransform
 = this.propertyList.get(PR_TEXT_TRANSFORM).getEnum();
 
+textInfo.hyphChar = this.propertyList.get(
+  PR_HYPHENATION_CHARACTER).getCharacter();
+
 }
 return textInfo;
 }
Index: xml-fop/src/java/org/apache/fop/fo/TextInfo.java
===
RCS file: /home/cvspublic/xml-fop/src/java/org/apache/fop/fo/TextInfo.java,v
retrieving revision 1.6
diff -w -u -r1.6 TextInfo.java
--- xml-fop/src/java/org/apache/fop/fo/TextInfo.java27 Feb 2004 17:57:40 -
1.6
+++ xml-fop/src/java/org/apache/fop/fo/TextInfo.java18 Mar 2004 12:43:55 -

@@ -50,8 +50,8 @@
 /** fo:letter-spacing property */
 public SpaceVal letterSpacing;
 
-/** can this text be hyphenated? */
-public boolean bCanHyphenate = true;
+/* the hyphenation character to be used */
+public char hyphChar = '-';
 
 /** fo:text-decoration property: is text underlined? */
 public boolean underlined = false;
Index: xml-fop/src/java/org/apache/fop/layoutmgr/LineLayoutManager.java
===
RCS file: /home/cvspublic/xml-
fop/src/java/org/apache/fop/layoutmgr/LineLayoutManager.java,v
retrieving revision 1.15
diff -w -u -r1.15 LineLayoutManager.java
--- xml-fop/src/java/org/apache/fop/layoutmgr/LineLayoutManager.java18 Mar 
2004 00:22:40 - 1.15
+++ xml-fop/src/java/org/apache/fop/layoutmgr/LineLayoutManager.java18 Mar 
2004 12:44:00 -
@@ -20,6 +20,7 @@
 
 import org.apache.fop.datatypes.Length;
 import org.apache.fop.fo.PropertyManager;
+import org.apache.fop.fo.Constants;
 import org.apache.fop.fo.properties.CommonMarginBlock;
 import org.apache.fop.fo.properties.CommonHyphenation

Re: Hyphenation

2004-03-17 Thread Chris Bowditch
Luca Furini wrote:

Hi all!

I am an italian student of the University of Bologna.

I have tried to solve a few problems concerning hyphenation, in
particular:
- show the '-' at the end of the hyphenated lines
- use the fo:hyphenate property to enable hyphenation, instead of the
  alignment
- specify the hyphenation character using the fo:hyphenation-character
  property
Thanks for trying. Unfortunately we need a bit more information for your 
suggested change to make it to the code base.

Most importantly, which version of the code are you trying to fix? 
Maintenance or HEAD?

Secondly, could you please follow the procedure for submitting patches, 
described here:

http://xml.apache.org/fop/dev/index.html#patches

Please include sample FO files that demonstrate before and after effects 
of your changes.

Thanks very much,

Chris




Implicit grants (FOP hyphenation)

2004-03-05 Thread Jeremias Maerki
(ccing fop-dev and XML PMC)

Last year I've invested hours after hours doing a license audit on the
XML FOP project mainly because of problems with the hyphenation pattern
files. These files have been converted to XML by FOP contributors but
almost all files are coming from TeX software repositories where
licenses are diverse or non-existent. Some problems were resolved, more
than half of the files removed in the process.

For one of the files I have been able to get the original author of the
hyphenation pattern file to submit a grant form by fax to the ASF. That
grant has never been confirmed although I've pinged Jim twice (in April
last year). Following that I gave up hunting for grants. I understand
that dealing with this is annoying but it brings up the question about
on one side requiring grant forms and on the other side having a good
infrastructure not depending on one person alone to handle it all.

But that's not the reason I write this. I've done the relicensing on the
XML FOP project and was again confronted with our hyphenation files. Two
of them now have the ALv2 header because for these two files all legal
problems have been dealt with (although we don't have any copyright
grants on file for them. See big question below). The other files carry
copyright statements (or just a file history with names) but appear to
be distributable by the ASF. I still have to figure out what to do with
them. Most of them appear to be in the public domain (although this is
usually not explicitly state as such) so I will probably simply add the
ALv2 header if nobody sees any reason why I couldn't do so. I'm real
tired of the problem.

My big question: The ALv2 specifies implicit grants, right? (Section 5)
Can I apply this section to contributions done before the ALv2 was here?

Example: For the file (da.xml, Dansk hyphenation patterns, currently not
in CVS) mentioned above, I know that a grant was sent to the ASF by the
original author of the file (although there's no confirmation of the
receipt). And the person who converted the file to XML intentionally
submitted it to the FOP mailing list for inclusion in the project. It
would appear that I am now able add the ALv2 header to the file and add
it to CVS again. Any reason I can't do that?

As a related question:
The implicit grant works for original works only. Derived works still
need a written permission by the original author(s). I'd imagine a
written permission is enough. No grant form necessary. Correct? But then,
where's the border between not needing a grant and needing one (for
example for the new projects coming to the ASF)? Just wondering.

A slightly out-of-date log of the FOP audit is here:
http://nagoya.apache.org/wiki/apachewiki.cgi?FOPAudits/March2003

I apologize if this post may carry a somewhat bitter undertone. It took
a lot of time already and I'm still not finished. I wanted to do it
right but that's very difficult. So thanks in advance for any insight
and good ideas.

Jeremias Maerki
FOP committer, XML PMC representative for FOP



Re: Implicit grants (FOP hyphenation)

2004-03-05 Thread J.Pietschmann
Jeremias Maerki wrote:
But that's not the reason I write this. I've done the relicensing on the
XML FOP project and was again confronted with our hyphenation files. Two
of them now have the ALv2 header because for these two files all legal
problems have been dealt with
I don't think it is necessary to put *all* files under APL. If we can
assume the content had been granted, and there is no infformation to
the contrary and no incompatible license already in the file, we can
just leave it as it is, or perhaps add something to the effect
 ... has been contributed to FOP and is assumed to be licensed
 for all purposes FOP can be used. Contact the authors stated above
 for further details.
(unless license@ says otherwise, of course)
Tracking down the original committer from CVS might help too, but
in general I wouldn't loose much further sleep on the whole issue.
Of course, newly contributed files should be put under APL which
means all issues have to be resolved before the file is committed to
CVS.
J.Pietschmann



Re: Implicit grants (FOP hyphenation)

2004-03-05 Thread Jeremias Maerki
I agree. But if we don't put them under the ALv2 we need to clearly mark
the files as such. Like we do for the JARs in the lib directory.

But before we do anything I'd like to see if there's any answer to my
questions.

On 05.03.2004 21:29:39 J.Pietschmann wrote:
 Jeremias Maerki wrote:
  But that's not the reason I write this. I've done the relicensing on the
  XML FOP project and was again confronted with our hyphenation files. Two
  of them now have the ALv2 header because for these two files all legal
  problems have been dealt with
 
 I don't think it is necessary to put *all* files under APL. If we can
 assume the content had been granted, and there is no infformation to
 the contrary and no incompatible license already in the file, we can
 just leave it as it is, or perhaps add something to the effect
   ... has been contributed to FOP and is assumed to be licensed
   for all purposes FOP can be used. Contact the authors stated above
   for further details.
 (unless license@ says otherwise, of course)
 Tracking down the original committer from CVS might help too, but
 in general I wouldn't loose much further sleep on the whole issue.
 
 Of course, newly contributed files should be put under APL which
 means all issues have to be resolved before the file is committed to
 CVS.


Jeremias Maerki



cvs commit: xml-fop/src/java/org/apache/fop/layout/hyphenation ByteVector.java CharVector.java Hyphen.java Hyphenation.java HyphenationException.java HyphenationTree.java Hyphenator.java PatternConsumer.java PatternParser.java TernaryTree.java

2004-02-27 Thread jeremias
jeremias2004/02/27 09:48:09

  Modified:src/java/org/apache/fop/layout/hyphenation ByteVector.java
CharVector.java Hyphen.java Hyphenation.java
HyphenationException.java HyphenationTree.java
Hyphenator.java PatternConsumer.java
PatternParser.java TernaryTree.java
  Log:
  Applied Apache License Version 2.0 by following the instructions at 
http://www.apache.org/dev/apply-license.html.
  
  Revision  ChangesPath
  1.2   +16 -48
xml-fop/src/java/org/apache/fop/layout/hyphenation/ByteVector.java
  
  Index: ByteVector.java
  ===
  RCS file: 
/home/cvs/xml-fop/src/java/org/apache/fop/layout/hyphenation/ByteVector.java,v
  retrieving revision 1.1
  retrieving revision 1.2
  diff -u -r1.1 -r1.2
  --- ByteVector.java   11 Mar 2003 13:05:33 -  1.1
  +++ ByteVector.java   27 Feb 2004 17:48:08 -  1.2
  @@ -1,53 +1,21 @@
   /*
  - * $Id$
  - * 
  - *The Apache Software License, Version 1.1
  - * 
  + * Copyright 1999-2004 The Apache Software Foundation.
* 
  - * Copyright (C) 1999-2003 The Apache Software Foundation. All rights reserved.
  - * 
  - * Redistribution and use in source and binary forms, with or without modifica-
  - * tion, are permitted provided that the following conditions are met:
  - * 
  - * 1. Redistributions of source code must retain the above copyright notice,
  - *this list of conditions and the following disclaimer.
  - * 
  - * 2. Redistributions in binary form must reproduce the above copyright notice,
  - *this list of conditions and the following disclaimer in the documentation
  - *and/or other materials provided with the distribution.
  - * 
  - * 3. The end-user documentation included with the redistribution, if any, must
  - *include the following acknowledgment: This product includes software
  - *developed by the Apache Software Foundation (http://www.apache.org/).
  - *Alternately, this acknowledgment may appear in the software itself, if
  - *and wherever such third-party acknowledgments normally appear.
  - * 
  - * 4. The names FOP and Apache Software Foundation must not be used to
  - *endorse or promote products derived from this software without prior
  - *written permission. For written permission, please contact
  - *[EMAIL PROTECTED]
  - * 
  - * 5. Products derived from this software may not be called Apache, nor may
  - *Apache appear in their name, without prior written permission of the
  - *Apache Software Foundation.
  - * 
  - * THIS SOFTWARE IS PROVIDED ``AS IS'' AND ANY EXPRESSED OR IMPLIED WARRANTIES,
  - * INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND
  - * FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE
  - * APACHE SOFTWARE FOUNDATION OR ITS CONTRIBUTORS BE LIABLE FOR ANY DIRECT,
  - * INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLU-
  - * DING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS
  - * OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON
  - * ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
  - * (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF
  - * THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
  - * 
  - * 
  - * This software consists of voluntary contributions made by many individuals
  - * on behalf of the Apache Software Foundation and was originally created by
  - * James Tauber [EMAIL PROTECTED]. For more information on the Apache
  - * Software Foundation, please see http://www.apache.org/.
  - */ 
  + * Licensed under the Apache License, Version 2.0 (the License);
  + * you may not use this file except in compliance with the License.
  + * You may obtain a copy of the License at
  + * 
  + *  http://www.apache.org/licenses/LICENSE-2.0
  + * 
  + * Unless required by applicable law or agreed to in writing, software
  + * distributed under the License is distributed on an AS IS BASIS,
  + * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  + * See the License for the specific language governing permissions and
  + * limitations under the License.
  + */
  +
  +/* $Id$ */
  + 
   package org.apache.fop.layout.hyphenation;
   
   import java.io.Serializable;
  
  
  
  1.2   +16 -48
xml-fop/src/java/org/apache/fop/layout/hyphenation/CharVector.java
  
  Index: CharVector.java
  ===
  RCS file: 
/home/cvs/xml-fop/src/java/org/apache/fop/layout/hyphenation

cvs commit: xml-fop/src/java/org/apache/fop/layout/hyphenation HyphenationTree.java

2004-01-02 Thread pbwest
pbwest  2004/01/02 04:18:45

  Modified:src/java/org/apache/fop/layoutmgr PageLayoutManager.java
   src/java/org/apache/fop/pdf PDFEncryptionJCE.java
   src/java/org/apache/fop/render/pdf PDFRenderer.java
   src/java/org/apache/fop/image FopImageConsumer.java
   src/java/org/apache/fop/layout/hyphenation
HyphenationTree.java
  Log:
  Removed unnecessary semi-colon.
  
  Revision  ChangesPath
  1.27  +1 -1  xml-fop/src/java/org/apache/fop/layoutmgr/PageLayoutManager.java
  
  Index: PageLayoutManager.java
  ===
  RCS file: 
/home/cvs/xml-fop/src/java/org/apache/fop/layoutmgr/PageLayoutManager.java,v
  retrieving revision 1.26
  retrieving revision 1.27
  diff -u -r1.26 -r1.27
  --- PageLayoutManager.java29 Dec 2003 23:28:47 -  1.26
  +++ PageLayoutManager.java2 Jan 2004 12:18:44 -   1.27
  @@ -401,7 +401,7 @@
   if (childArea.getAreaClass() == Area.CLASS_NORMAL) {
   placeFlowRefArea(childArea);
   } else {
  -; // todo: all the others!
  + // todo: all the others!
   }
   }
   
  
  
  
  1.3   +1 -1  xml-fop/src/java/org/apache/fop/pdf/PDFEncryptionJCE.java
  
  Index: PDFEncryptionJCE.java
  ===
  RCS file: /home/cvs/xml-fop/src/java/org/apache/fop/pdf/PDFEncryptionJCE.java,v
  retrieving revision 1.2
  retrieving revision 1.3
  diff -u -r1.2 -r1.3
  --- PDFEncryptionJCE.java 27 Mar 2003 11:10:33 -  1.2
  +++ PDFEncryptionJCE.java 2 Jan 2004 12:18:45 -   1.3
  @@ -414,7 +414,7 @@
   hash[i++] = (byte) (number  8);
   hash[i++] = (byte) (number  16);
   hash[i++] = (byte) (generation  0);
  -hash[i++] = (byte) (generation  8);;
  +hash[i++] = (byte) (generation  8);
   return hash;
   }
   
  
  
  
  1.26  +1 -1  xml-fop/src/java/org/apache/fop/render/pdf/PDFRenderer.java
  
  Index: PDFRenderer.java
  ===
  RCS file: /home/cvs/xml-fop/src/java/org/apache/fop/render/pdf/PDFRenderer.java,v
  retrieving revision 1.25
  retrieving revision 1.26
  diff -u -r1.25 -r1.26
  --- PDFRenderer.java  12 Dec 2003 22:37:39 -  1.25
  +++ PDFRenderer.java  2 Jan 2004 12:18:45 -   1.26
  @@ -1245,7 +1245,7 @@
   endTextObject();
   
   if (viewport.getClip()) {
  -saveGraphicsState();;
  +saveGraphicsState();
   
   clip(x, y, width, height);
   }
  
  
  
  1.2   +1 -1  xml-fop/src/java/org/apache/fop/image/FopImageConsumer.java
  
  Index: FopImageConsumer.java
  ===
  RCS file: /home/cvs/xml-fop/src/java/org/apache/fop/image/FopImageConsumer.java,v
  retrieving revision 1.1
  retrieving revision 1.2
  diff -u -r1.1 -r1.2
  --- FopImageConsumer.java 11 Mar 2003 13:05:25 -  1.1
  +++ FopImageConsumer.java 2 Jan 2004 12:18:45 -   1.2
  @@ -55,7 +55,7 @@
   import java.awt.image.ColorModel;
   import java.awt.image.ImageConsumer;
   import java.awt.image.ImageProducer;
  -import java.awt.image.PixelGrabber;;
  +import java.awt.image.PixelGrabber;
   
   /**
* ImageConsumer implementation for FopImage classes.
  
  
  
  1.2   +0 -1  
xml-fop/src/java/org/apache/fop/layout/hyphenation/HyphenationTree.java
  
  Index: HyphenationTree.java
  ===
  RCS file: 
/home/cvs/xml-fop/src/java/org/apache/fop/layout/hyphenation/HyphenationTree.java,v
  retrieving revision 1.1
  retrieving revision 1.2
  diff -u -r1.1 -r1.2
  --- HyphenationTree.java  11 Mar 2003 13:05:33 -  1.1
  +++ HyphenationTree.java  2 Jan 2004 12:18:45 -   1.2
  @@ -530,7 +530,6 @@
   token = in.readLine().trim();
   long starttime = 0;
   int counter = 0;
  -;
   try {
   BufferedReader reader =
   new BufferedReader(new FileReader(token));
  
  
  

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



DO NOT REPLY [Bug 13734] - Hyphenation does not work correctly on long string with many '-'

2003-12-21 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=13734.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=13734

Hyphenation does not work correctly on long string with many '-'





--- Additional Comments From [EMAIL PROTECTED]  2003-12-21 18:58 ---
The first case works OK in the development version. See attachment.

The behaviour of the second case is to be expected, because there is no possible
  breakpoint. ',' is not a possible breakpoint. See attachment.

How is case 3 different from case 2?


DO NOT REPLY [Bug 16713] - Hyphenation error in tables

2003-12-21 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=16713.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=16713

Hyphenation error in tables





--- Additional Comments From [EMAIL PROTECTED]  2003-12-21 19:01 ---
The attachments require other files which are not attached: three xsl files and
an external graphic.


DO NOT REPLY [Bug 21641] - Infinit loop within too small areas using hyphenation

2003-12-21 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=21641.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=21641

Infinit loop within too small areas using hyphenation





--- Additional Comments From [EMAIL PROTECTED]  2003-12-21 19:01 ---
OK in the development version.


DO NOT REPLY [Bug 16626] - Hyphenation causing java.io.IOException error

2003-12-15 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=16626.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=16626

Hyphenation causing java.io.IOException error

[EMAIL PROTECTED] changed:

   What|Removed |Added

 Status|NEW |RESOLVED
 Resolution||DUPLICATE



--- Additional Comments From [EMAIL PROTECTED]  2003-12-15 22:41 ---
We will not get this fixed for the maintenance branch, however a fix on this 
problem was applied to development so this should not occur in our next release.

Thanks,
Glen

*** This bug has been marked as a duplicate of 25512 ***


DO NOT REPLY [Bug 23985] - Hyphenation Problems with filenames, numbers

2003-11-06 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=23985.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=23985

Hyphenation Problems with filenames, numbers

[EMAIL PROTECTED] changed:

   What|Removed |Added

 Status|NEW |RESOLVED
 Resolution||FIXED



--- Additional Comments From [EMAIL PROTECTED]  2003-11-07 05:52 ---
Fixed in 1.0.

Due to maintenance code complexity vs. benefit gained, I will skip making the 
changes in 0.20.x.  (Also, I could not duplicate the problem in the nightly 
version of maintenance using Acrobat Reader 6.0, as I could in the release 
version w/AR 5.5.  So other factors may make this issue less likely to occur 
with 0.20.5 anyway.)


DO NOT REPLY [Bug 22048] New: - PDF renderer and hyphenation break character order

2003-08-01 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=22048.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=22048

PDF renderer and hyphenation break character order

   Summary: PDF renderer and hyphenation break character order
   Product: Fop
   Version: 0.20.4
  Platform: PC
   URL: http://staff.csc.fi/aniemela/fopbug2.pdf
OS/Version: Linux
Status: NEW
  Severity: Major
  Priority: Other
 Component: page-master/layout
AssignedTo: [EMAIL PROTECTED]
ReportedBy: [EMAIL PROTECTED]


The PDF renderer/page layout component/hyphenation break character order with
inline elements:
The following construct

fo:block
  Otsakerivin vapaamuotoisuuden takia ainoat jarkevasti hyodynnettavat
  arvot ovat kentan nimi ( 
  fo:inline font-family=monospacelt;melting.pointgt;/fo:inline 
  ) ja DT-numero. 
/fo:block
Is rendered as
Otsakerivin [] hyodynnettavat arvot ovat kentan nimi ( mel-
ting.point ) ja DT-numero.

Below is a cleaned-up version of Docbook-XSLT-produced FO data, which explains
the block-happiness. URL points to a FOP 0.20.4 rendered PDF which illustrates
the bug.

Full XSL-FO data follows:
?xml version=1.0 encoding=utf-8 ? 
  fo:root xmlns:fo=http://www.w3.org/1999/XSL/Format; font-family=serif
font-size=10pt text-align=left line-height=normal
font-selection-strategy=character-by-character language=en
fo:layout-master-set
fo:simple-page-master master-name=blank page-width=210mm
page-height=297mm margin-top=0.5in margin-bottom=0.5in   
   margin-left=1in margin-right=1in
  fo:region-body display-align=center margin-bottom=0.5in
margin-top=0.5in region-name=blank-body / 
/fo:simple-page-master

fo:simple-page-master master-name=body-first page-width=210mm
page-height=297mm margin-top=0.5in 
   margin-bottom=0.5in margin-left=1in
margin-right=1in
  fo:region-body margin-bottom=0.5in margin-top=0.5in column-count=1 / 
/fo:simple-page-master
fo:simple-page-master master-name=body-odd page-width=210mm
page-height=297mm margin-top=0.5in 
   margin-bottom=0.5in margin-left=1in
margin-right=1in
  fo:region-body margin-bottom=0.5in margin-top=0.5in column-count=1 / 
/fo:simple-page-master
fo:simple-page-master master-name=body-even page-width=210mm
page-height=297mm margin-top=0.5in
   margin-bottom=0.5in margin-right=1in
margin-left=1in
  fo:region-body margin-bottom=0.5in margin-top=0.5in column-count=1 / 
/fo:simple-page-master
fo:page-sequence-master master-name=body
  fo:repeatable-page-master-alternatives
fo:conditional-page-master-reference master-reference=blank
blank-or-not-blank=blank / 
fo:conditional-page-master-reference master-reference=body-first
page-position=first / 
fo:conditional-page-master-reference master-reference=body-odd
odd-or-even=odd / 
fo:conditional-page-master-reference master-reference=body-even
odd-or-even=even / 
  /fo:repeatable-page-master-alternatives
/fo:page-sequence-master
  /fo:layout-master-set
  fo:page-sequence xmlns:axf=http://www.antennahouse.com/names/XSL/Extensions;
hyphenate=true master-reference=body 
language=fi format=1 initial-page-number=1
hyphenation-character=- 
hyphenation-push-character-count=2
hyphenation-remain-character-count=2
fo:flow flow-name=xsl-region-body
  fo:block id=d0e14
fo:block
  fo:block
fo:block keep-together=always margin-left=-4pc
font-family=sans-serif
  fo:block keep-with-next.within-column=always
fo:block font-family=sans-serif font-weight=bold
keep-with-next.within-column=always space-before.minimum=0.8em
space-before.optimum=1.0em space-before.maximum=1.2em
   fo:block font-size=14.399pt1.1.1.
Datamerkinnan otsakerivi/fo:block 
/fo:block
  /fo:block
/fo:block
  /fo:block
  fo:block / 
/fo:block
fo:block space-before.optimum=1em space-before.minimum=0.8em
space-before.maximum=1.2em
  Otsakerivin vapaamuotoisuuden takia ainoat jarkevasti hyodynnettavat
arvot ovat kentan nimi ( 
  fo:inline font-family=monospacelt;melting.pointgt;/fo:inline 
  ) ja DT-numero. 
/fo:block
  /fo:block
/fo:flow
  /fo:page-sequence
/fo:root

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



DO NOT REPLY [Bug 22048] - PDF renderer and hyphenation break character order

2003-08-01 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=22048.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=22048

PDF renderer and hyphenation break character order

[EMAIL PROTECTED] changed:

   What|Removed |Added

 Status|NEW |RESOLVED
 Resolution||FIXED



--- Additional Comments From [EMAIL PROTECTED]  2003-08-01 12:40 ---
This has been fixed already. Get the latest release.

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



DO NOT REPLY [Bug 21641] New: - Infinit loop within too small areas using hyphenation

2003-07-16 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=21641.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=21641

Infinit loop within too small areas using hyphenation

   Summary: Infinit loop within too small areas using hyphenation
   Product: Fop
   Version: 0.20.5
  Platform: PC
OS/Version: Other
Status: NEW
  Severity: Critical
  Priority: Other
 Component: general
AssignedTo: [EMAIL PROTECTED]
ReportedBy: [EMAIL PROTECTED]


Using hyphenate=true rendering text within a too small area (e.g. a small 
table column) results in an inifite loop. The loop appeares in FOText.java 
line 280 (while ( start != -1)) which never ends, because 
LineAread.addText(...) never returns -1. I think The bug should be fixed in 
LineAread.addText(...). Example document:

?xml version=1.0 encoding=utf-8 standalone=yes?
fo:root xmlns:fo=http://www.w3.org/1999/XSL/Format;
fo:layout-master-set
fo:simple-page-master margin=2cm master-name=simple 
page-height=29.7cm page-width=21cm
fo:region-body/
fo:region-before extent=3cm/
fo:region-after extent=1.5cm/
/fo:simple-page-master
/fo:layout-master-set
fo:page-sequence master-reference=simple hyphenate=true 
language=en
fo:flow flow-name=xsl-region-body
fo:table table-layout=fixed
fo:table-column column-width=4mm/
fo:table-body
fo:table-row
fo:table-cell 
border-color=black border-style=solid border-width=thin
fo:block 
font-size=8pt font-family=Times margin=1mmverylongword/fo:block
/fo:table-cell
/fo:table-row
/fo:table-body
/fo:table
/fo:flow
/fo:page-sequence
/fo:root

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



DO NOT REPLY [Bug 21641] - Infinit loop within too small areas using hyphenation

2003-07-16 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=21641.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=21641

Infinit loop within too small areas using hyphenation

[EMAIL PROTECTED] changed:

   What|Removed |Added

   Severity|Critical|Normal



--- Additional Comments From [EMAIL PROTECTED]  2003-07-16 17:40 ---
Well, doing unpleasant things on invalid input isn't really a critical bug.

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



hyphenation doc has moved

2003-07-10 Thread Victor Mote
I did some rearranging in our doc to get the configuration page pared down
to only configuration information, and to move some configuration content
there that was lurking in other places (attempting to keep them
cross-referenced). One of the side effects is that the hyphenation doc has
been moved out of configuration and into its own document, which has an
entry on the menu. FYI in case you can't find it where you are used to
finding it.

Victor Mote


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



Re: Status update: hyphenation (no, not the pattern file licenses)

2003-07-05 Thread J.Pietschmann
Jeremias Maerki wrote:
Well, separate the reusable stuff from the FOP-specific stuff
(package-wise). Other Apache projects have started sub-(sub-)projects.
Maybe we could do something similar if only to encourage decoupling of
utility components that could be used elsewhere. Additional candidates:
PDF library, Transcoders, a free image library etc. etc.
Humm. Unfortunately I don't see how we get the manpower to
manage subsubprojects and attract developers. I'd rather think
of asking jakarta-commons-lang or the sandbox.
Another question: How does your work relate to the general possibility
to make use of OpenOffice.org's hyphenation stuff.
Umm, zilch. But then, th OOo hyphenation files I saw are just
the FOP hyphenation files sans XML tags and with character
encoding issues instead.
J.Pietschmann



-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]


Status update: hyphenation (no, not the pattern file licenses)

2003-07-04 Thread J.Pietschmann
Hi,
as I wrote quite some weeks ago I took an extended look at
hyphenation, with the possible target of creating APL
licensed patterns from our own dictionaries. I also looked
at the current hyphenator.
I finally seem to have grasped how hyphenation and
hyphenation pattern generation from a marked up dictionary
(patgen.web) works. I implemented my own qd hyphenator
which lead to the detection of a few well hidden bugs in
the current hyphenator. Interestingly, nobody seems to care
all that much. Do people proof read their output?
Other problems with the current hyphenator are that it
contains a lot of crufty code, has a few easy to cure
inefficiencies, and it can't handle hyphenations with
consonant shifts and other spelling changes, like in the
old german spelling backen - bak-ken and Schiffahrt -
Schiff-fahrt, despite it seems to claim so. (the new
german spelling does no longer have these features, but
I think there are still languages where this is an issue).
Furthermore, the interaction between the main code and
the hyphenator is somewhat ineffective too, and the main
code does not necessarily have the same understanding on
what consitutes a word than the hyphenator.
In parallel to working on the unit test concept and preparing
for some refactoring in HEAD, I'm working now on providing
a patgen.web rewrite in Java which does not have some of the
restricting features of the original and also on a german
lexicon which has currently roughly 12k stem forms of words
with hyphenation and other information. Help with comleting
the lexicon or with other languages is welcome. Unfortunately
tools for lexicon maintenance, word inflection and
plausibility checks on hyphenated forms are still alpha, at
best. I'll publish them on my Apache home page in a few days.
RT: I believe the lexicon(s), pattern generator, the hyphenator
and perhaps other tools could be useful for other projects.
What would be a sensible approach to encourage reuse?
J.Pietschmann

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]


Re: Status update: hyphenation (no, not the pattern file licenses)

2003-07-04 Thread Jeremias Maerki

On 04.07.2003 22:59:27 J.Pietschmann wrote:
 Interestingly, nobody seems to care
 all that much. Do people proof read their output?

LOL

snip what=a lot of interesting-sounding stuff/

 RT: I believe the lexicon(s), pattern generator, the hyphenator
 and perhaps other tools could be useful for other projects.
 What would be a sensible approach to encourage reuse?

Well, separate the reusable stuff from the FOP-specific stuff
(package-wise). Other Apache projects have started sub-(sub-)projects.
Maybe we could do something similar if only to encourage decoupling of
utility components that could be used elsewhere. Additional candidates:
PDF library, Transcoders, a free image library etc. etc.

Another question: How does your work relate to the general possibility
to make use of OpenOffice.org's hyphenation stuff.

Jeremias Maerki


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



DO NOT REPLY [Bug 20411] - German hyphenation is ^H^H^Hwas missing

2003-06-03 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=20411.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=20411

German hyphenation is ^H^H^Hwas missing

[EMAIL PROTECTED] changed:

   What|Removed |Added

 Status|NEW |RESOLVED
 Resolution||WONTFIX



--- Additional Comments From [EMAIL PROTECTED]  2003-06-02 13:20 ---
We've had german hyphenation files in FOP earlier but had to remove them 
because of license issues. See Release Notes 
(http://xml.apache.org/fop/relnotes.html).

Unfortunately, your hyphenation file is based on a file released under the 
LPPL which is considered incompatible with the ASL. Therefore, we can't accept 
your contribution. I'm sorry. I've recently updated the documentation on this 
topic (http://xml.apache.org/fop/configuration.html). If the LPPL works for 
your environment, fine. You can even publish the file yourself as long as you 
respect the license of the original file. But Apache can't redistribute that 
file.

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



DO NOT REPLY [Bug 20411] New: - German hyphenation is ^H^H^Hwas missing

2003-06-02 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=20411.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=20411

German hyphenation is ^H^H^Hwas missing

   Summary: German hyphenation is ^H^H^Hwas missing
   Product: Fop
   Version: 0.20.5
  Platform: All
OS/Version: All
Status: NEW
  Severity: Normal
  Priority: Other
 Component: general
AssignedTo: [EMAIL PROTECTED]
ReportedBy: [EMAIL PROTECTED]


I have converted a TeX hyphenation file.  The conversion was done by Perl, by vi
and by hand, be prepared for errors.  I have tested it with a document of 50
pages, and haven't found any obvious bugs, however.

The file is (hopefully) attached to the bug report.

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



DO NOT REPLY [Bug 20411] - German hyphenation is ^H^H^Hwas missing

2003-06-02 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=20411.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=20411

German hyphenation is ^H^H^Hwas missing





--- Additional Comments From [EMAIL PROTECTED]  2003-06-02 11:08 ---
Created an attachment (id=6595)
Hyphenation file for (new) German orthography

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



Hyphenation: new russian hyphenation patterns added

2003-05-30 Thread Jeremias Maerki
With the kind help of Alex V. Alishevskikh I've been able to commit a
new pattern file for russian. The file should be pretty safe from a
legal point of view. Interested parties will find the details in the
comments provided in ru.xml.

Yipee! :-)

Jeremias Maerki


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



Re: XML FOP: Licensing issues with hyphenation pattern files

2003-03-17 Thread Dirk-Willem van Gulik

  Before we start here's a little background on the hyphenation stuff:
  Our hyphenation file are XML files that are derived from TeX hyphenation

Under what license where the original TeX files ?

  2. If the former (of [1]) is true, we need a grant from the copyright
  holder of the original file, right? What if the original file is unclear

This depends on what the license was you got it under.

If there is no (explicit) license; you'll have to indeed get permission of
the authors as the default generally is 'No' (depending on age of the
file and country).

  3. (Question is somewhat general) What's the threshold for the necessity
  of a grant? Does a non-committer have to submit a grant on a single new
  file?

If code written by someone else under a non-apache license is imported; it
will need to a grant to become part of the apache proper (as to allow us
to remove the original and cut-and-paste the asf license into it).

OR

it needs to be labeled as NOT apache; and have the right license next to
it.  See for example in cvs

apache-1.3/src/regex
apache-1.3/src/regex/COPYRIGHT

Also note that it is in its own directory.

  4. Some of the hyphenation pattern files are licensed under the LPPL
  (LaTeX project public license, http://www.latex-project.org/lppl.html).
  We'd like to have clearance to use, modify and distribute files under
  this license in the FOP project.

Ack - will post on that in a moment.

  5. Can we modify and relicense under the ASL hyphenation pattern files
  clearly stated as being in the public domain without having a grant but

Propably - but depending on what country that('this is public domain') was
'stated' in and at what day; as the Berne convention was not signed by all
countries at the same time. (The Berne convention effectively killed the
concept of 'Public Domain' or -non- copyrighted material in most
countries - and makes the statement 'this is public domain' somethings
ineffective if not backed up by a license, grant and/or copyright.).

  6. We can't use files containing a restriction like Can be used freely
  for non-commercial purposes., except if we can positively identify the
  copyright holder and get a grant, right?

Correct - as that would mean that you can download something from
apache.org which has _MORE_ restrictive elements in it than the ASF
license.  Anything we include under 3rd party licenses should be as
restrictive or -less- restrictive than our own license. So that when you
download something from *.apache.org it can always be used in with the ASF
level of freedom; and perhaps some bits may give you even -more- leeway.
Never less.

Dw


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



Re: XML FOP: Licensing issues with hyphenation pattern files

2003-03-17 Thread Dirk-Willem van Gulik

Folks,

See below the Latex License.

http://www.latex-project.org/lppl.html

It seems to me that clause '8.B makes this license more restrictive than
the ASF license - and we thus should not allow code(fragements) which are
under this license in downloads from ASF infrastructure.

Or am I misreading things - the license is long and weildy.

Dw


LaTeX  Project Public License

LPPL Version 1.2 1999-09-03


Copyright 1999 LaTeX 3 Project
Everyone is allowed to distribute verbatim copies of this license
document, but modification of it is not allowed.



PREAMBLE
 The LaTeX  Project Public License (LPPL) is the license under which the
base LaTeX  distribution is distributed.

You may use this license for any program that you have written and wish to
distribute.  This license may be particularly suitable if your program is
TeX -related (such as a LaTeX  package), but you may use it even if your
program is unrelated to TeX .  The section `WHETHER AND HOW TO DISTRIBUTE
PROGRAMS UNDER THIS LICENSE', below, gives instructions, examples, and
recommendations for authors who are considering distributing their
programs under this license.

In this license document, `The Program' refers to any program distributed
under this license.

This license gives conditions under which The Program may be distributed
and conditions under which modified versions of The Program may be
distributed.  Individual files of The Program may bear supplementary
and/or superseding conditions on modification of themselves and on the
distribution of modified versions of themselves, but *no* file of The
Program may bear supplementary or superseding conditions on the
distribution of an unmodified copy of the file.  A distributor wishing to
distribute a complete, unmodified copy of The Program therefore needs to
check the conditions only in this license and nowhere else.

Activities other than distribution and/or modification of The Program are
not covered by this license; they are outside its scope.  In particular,
the act of running The Program is not restricted.

We, the LaTeX 3 Project, believe that the conditions below give you the
freedom to make and distribute modified versions of The Program that
conform with whatever technical specifications you wish while maintaining
the availability, integrity, and reliability of The Program.  If you do
not see how to achieve your goal while  meeting these conditions, then
read the document `cfgguide.tex' in the base LaTeX  distribution for
suggestions.

CONDITIONS ON DISTRIBUTION AND MODIFICATION
 You may distribute a complete, unmodified copy of The Program.
Distribution of only part of The Program is not allowed.

You may not modify in any way a file of The Program that bears a legal
notice forbidding modification of that file.

You may distribute a modified file of The Program if, and only if, the
following eight conditions are met:
You must meet any additional conditions borne by the file on the
distribution of a modified version of the file as described below in   the
subsection `Additional Conditions on Individual Files of The   Program'.


If the file is a LaTeX  software file, then you must meet any   applicable
additional conditions on the distribution of a modified   version of the
file that are described below in the subsection   `Additional Conditions
on LaTeX  Software Files'.


You must not distribute the modified file with the filename of   the
original file.


In the modified file, you must acknowledge the authorship and   name of
the original file, and the name (if any) of the program   which contains
it.


You must change any identification string in the file to   indicate
clearly that the modified file is not part of The Program.


You must change any addresses in the modified file for the   reporting of
errors in the file or in The Program generally to   ensure that reports
for files no longer maintained by the original   maintainers will be
directed to the maintainers of the modified   files.


You must distribute the modified file under a license that   forbids
distribution both of the modified file and of any files   derived from the
modified file with the filename of the original   file.


You must do either (A) or (B):
(A)


distribute a copy of The Program (that is, a complete, unmodified copy
of The Program) together with the modified file; if your distribution
of the modified file is made by offering access to copy the modified
file from a designated place, then offering equivalent access to copy
The Program from the same place meets this condition, even though
third parties are not compelled to copy The Program along with the
modified file;


(B)


provide to those who receive the modified file information that is
sufficient for them to obtain a copy of The Program; for example, you
may provide a Uniform Resource Locator (URL) for a site that you
expect will provide them with a copy of The Program free of charge

Re: XML FOP: Licensing issues with hyphenation pattern files

2003-03-17 Thread Jeremias Maerki
Thanks, Steven and Dirk for responding!

On 17.03.2003 14:28:44 Dirk-Willem van Gulik wrote:
 
   Before we start here's a little background on the hyphenation stuff:
   Our hyphenation file are XML files that are derived from TeX hyphenation
 
 Under what license where the original TeX files ?

http://cvs.apache.org/viewcvs.cgi/xml-fop/src/hyph/

Under many different licenses (GPL, LGPL, LPPL, public domain, no
license...) That's what makes this clean-up work such a PITA.

And then there's the files from the original TeX distribution by Donald
Knuth which are assumed to be in the public domain but with certain
restrictions (modified file shall not contain the same filename as the
original). Problem there is that there is not really something you could
call a license associated with the distribution, at least non that I
could find. This is all more or less based on common knowledge, rather
than hard facts on file.

   2. If the former (of [1]) is true, we need a grant from the copyright
   holder of the original file, right? What if the original file is unclear
 
 This depends on what the license was you got it under.
 
 If there is no (explicit) license; you'll have to indeed get permission of
 the authors as the default generally is 'No' (depending on age of the
 file and country).

Ok

   3. (Question is somewhat general) What's the threshold for the necessity
   of a grant? Does a non-committer have to submit a grant on a single new
   file?
 
 If code written by someone else under a non-apache license is imported; it
 will need to a grant to become part of the apache proper (as to allow us
 to remove the original and cut-and-paste the asf license into it).

That's pretty clear. I was rather thinking about people sending a new
source file to a mailing list of an Apache project. If a grant is
necessary for each and every new file that is submitted, we're going to
scare away potential contributors. From reading the mails about the new
license 2.0 this seems to be adressed there. Anyway, I get the
impression no grant is filed in most cases in most projects at Apache
today. Or do I misunderstand?

I've received an email a few minutes ago from a FOP contributor telling
me that I will receive a confirmation mail from the original author of
one of the problematic hyphenation files, that he allows us to use his
file. But strictly following the rules I have to ask him to file a grant.
I hope he won't mind.

 OR
 
 it needs to be labeled as NOT apache; and have the right license next to
 it.  See for example in cvs
 
   apache-1.3/src/regex
   apache-1.3/src/regex/COPYRIGHT
 
 Also note that it is in its own directory.

Is that necessary? Isn't putting an xy.xml.LICENSE file alongside an
xy.xml file enough to mark it as NOT Apache?

snip/

   5. Can we modify and relicense under the ASL hyphenation pattern files
   clearly stated as being in the public domain without having a grant but
 
 Propably - but depending on what country that('this is public domain') was
 'stated' in and at what day; as the Berne convention was not signed by all
 countries at the same time. (The Berne convention effectively killed the
 concept of 'Public Domain' or -non- copyrighted material in most
 countries - and makes the statement 'this is public domain' somethings
 ineffective if not backed up by a license, grant and/or copyright.).

Aargh! The world would be so boring if everything were simple. :-)

snip/

Jeremias Maerki


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



Re: XML FOP: Licensing issues with hyphenation pattern files

2003-03-17 Thread Jeremias Maerki
There's another IMO: Clause 7 expects a restriction that the ASL can't
provide.

On 17.03.2003 14:32:16 Dirk-Willem van Gulik wrote:
 See below the Latex License.
 
   http://www.latex-project.org/lppl.html
 
 It seems to me that clause '8.B makes this license more restrictive than
 the ASF license - and we thus should not allow code(fragements) which are
 under this license in downloads from ASF infrastructure.
 
 Or am I misreading things - the license is long and weildy.


Jeremias Maerki


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



Re: XML FOP: Licensing issues with hyphenation pattern files

2003-03-17 Thread Jeremias Maerki
I've just received that email. The original author is willing to fax in
a grant. Question now: Where can I find the form? I've only found the
one for committers. Thanks a lot!

On 17.03.2003 15:23:55 Jeremias Maerki wrote:
 I've received an email a few minutes ago from a FOP contributor telling
 me that I will receive a confirmation mail from the original author of
 one of the problematic hyphenation files, that he allows us to use his
 file. But strictly following the rules I have to ask him to file a grant.
 I hope he won't mind.


Jeremias Maerki


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



RE: XML FOP: Licensing issues with hyphenation pattern files

2003-03-17 Thread Victor Mote
Jeremias Maerki wrote:

 And then there's the files from the original TeX distribution by Donald
 Knuth which are assumed to be in the public domain but with certain
 restrictions (modified file shall not contain the same filename as the
 original). Problem there is that there is not really something you could
 call a license associated with the distribution, at least non that I
 could find. This is all more or less based on common knowledge, rather
 than hard facts on file.

Here are some excerpts from Chapter 30, The Future of TeX and METAFONT
from Knuth's Digital Typography, 1999, Center for the Study of Language
and Information, ISBN 1-57589-010-4:

---Start---
My work on developing TeX, METAFONT, and Computer Modern has come to an end.
I will make no further changes except to correct extremely serious bugs.

I have put these systems into the public domain so that people everywhere
can use the ideas freely if they wish

As stated on the copyright pages of Volumes B, D, and E, [referring here to
the 5-volume Computers and Typesetting series] anybody can make use of my
programs in whatever way they wish, as long as they do not use the names
TeX, METAFONT, or Computer Modern. In particular, any person or group who
wants to produce a program superior to mine is free to do so. However,
nobody is allowed to call a system TeX or METAFONT unless that system
conforms 100% to my own programs, as I have specified in the manuals for the
TRIP and TRAP tests

Of course I do not claim to have found the best solution to every problem. I
simply claim that it is a great advantage to have a fixed point as a
building block I welcome continued research that will lead to
alternative systems that can typeset documents better than TeX is able to
do. But the authors of such systems must think of another name

---End---

BTW, nothing in the excised portions of the above are contradictory, only
explanatory.

The copyright page for Volume B of Computers and Typesetting, which volume
is entitled TeX: The Program says the following: The program for TeX is
in the public domain, and readers my freely incorporate the algorithms of
this book into their own programs. However, use of the name 'TeX' is
restricted to software systems that agree exactly with the program presented
here.

Now, I suppose that an argument could be made that the hyphenation patterns
are not part of the algorithm, but I think an equally good argument can be
made that they are. They are certainly part of the TeX distribution, and
Knuth's license documented above extends to the frozen version of TeX. If
the name is TeX, and the files are a part of it, Knuth's license applies.

I appreciate the concern for IP rights, and the desire to be conservative on
such matters. I am extremely conservative about such things myself, having
made my living off of such rights for years. I just don't know what Knuth
could have done to be more clear that these files are in the public domain.
I suppose a file in the distribution would have been nice, but ... Actually,
there might be one, but I haven't been able to find it either.

I think the correct solution here is to install the TeX (not LaTeX)
hyphenation pattern files, credit Knuth, make a comment that Knuth's files
are in the public domain, add the Apache License (to apply to any
additions), and turn our users loose looking for things that might need to
be added. A post to the user mailing list and a release note (saying that
the hyphenation files are different, and perhaps less comprehensive) would
be good also.

BTW, I don't see a restriction on the /file/ names being the same, only on
the product itself being the same.

I apologize that some of the above is redundant from previous postings. I
did not cite the authorities previously, and perhaps that will help us get
moving here again.

Victor Mote



-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



Re: XML FOP: Licensing issues with hyphenation pattern files

2003-03-17 Thread Jeremias Maerki
Thanks, Greg and Nicola, for your answers. I've already sent the form to
the person I mentioned. Now that most of my questions have been resolved
I'm off to distribute quite a bunch of these forms to all the various
copyright holders.

After it has been suggested that every single file submitted by a
non-committer must be accompanied by a license grant it would really
make sense to publish the form somewhere under http://www.apache.org/dev/,
for example. That's one step towards making all committers aware that
these steps should be followed. Maybe this process could be described on
a separate page with just a few sentences. It should also be mentioned
that the above also applies if a file with someone else's name has been
modified and included in a project codebase as is the case with FOP's
hypenation patterns.

I'm going to document all the findings from this thread within the next
two or three days at
http://nagoya.apache.org/wiki/apachewiki.cgi?Licensing. Help is welcome.
I hope someone with write access will then transfer some of the contents
to the Development Information pages.

Thanks again to all who have helped resolve our licensing issues so far.

On 17.03.2003 21:00:50 Greg Stein wrote:
 On Mon, Mar 17, 2003 at 04:17:12PM +0100, Jeremias Maerki wrote:
  I've just received that email. The original author is willing to fax in
  a grant. Question now: Where can I find the form? I've only found the
  one for committers. Thanks a lot!
  
  On 17.03.2003 15:23:55 Jeremias Maerki wrote:
   I've received an email a few minutes ago from a FOP contributor telling
   me that I will receive a confirmation mail from the original author of
   one of the problematic hyphenation files, that he allows us to use his
   file. But strictly following the rules I have to ask him to file a grant.
   I hope he won't mind.
 
 I've attached(*) the software grant form which they can print, sign, and fax
 into the ASF (+1-410-803-2258) or mail to:
 
 The Apache Software Foundation
 1901 Munsey Drive
 Forest Hill, MD 21050-2747, U.S.A.
 
 (*) no, I don't know where it might be publicly accessible; this is from our
 Foundation CVS repos



Jeremias Maerki


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



Re: XML FOP: Licensing issues with hyphenation pattern files

2003-03-16 Thread Jeremias Maerki
I haven't received any reactions to my post. We still have our pending
release on hold until the issues are resolved. I wonder if I have taken
the wrong approach. Can anybody advise? Thanks!

On 06.03.2003 11:10:28 Jeremias Maerki wrote:
 Hello all (licensing specialists, XML PMC people, fop-devs)
 
 (I don't know where might be the best place to discuss this. fop-dev is
 currently very low-traffic.)
 
 The FOP team needs help. In February we realized that we had problematic
 hyphenation pattern files in our codebase. For example, some of them
 were derived from GPL-licensed files.
 
 Some action has already been taken. I've documented the whole auditing
 process and status here:
 http://nagoya.apache.org/wiki/apachewiki.cgi?FOPAudits/March2003
 
 I'm at a point where I need some advice on how to proceed from here.
 Here are the questions, I hope are the right ones and help us to resolve
 these issues quickly.
 
 Before we start here's a little background on the hyphenation stuff:
 Our hyphenation file are XML files that are derived from TeX hyphenation
 files (ASCII format). These XML files reside in CVS (xml-fop/src/hyph,
 formerly xml-fop/hyph). During build they are parsed into Java objects
 and serialized using Java serialization. These serialized objects are
 included in the JAR (fop.jar) in the binary distribution.
 
 1. Do our hyphenation files have to be licensed under the ASL? Or is it
 possible to do something similar we do with Java libraries (external
 dependencies) such as JUnit and JDOM (add the file to CVS and have an
 accompanying file containing its license)?
 
 2. If the former (of [1]) is true, we need a grant from the copyright
 holder of the original file, right? What if the original file is unclear
 about the copyright holder (multiple names, for example) and about the
 license (no explicit license, for example)? Please see the Wiki page for
 examples.
 
 3. (Question is somewhat general) What's the threshold for the necessity
 of a grant? Does a non-committer have to submit a grant on a single new
 file?
 
 4. Some of the hyphenation pattern files are licensed under the LPPL
 (LaTeX project public license, http://www.latex-project.org/lppl.html).
 We'd like to have clearance to use, modify and distribute files under
 this license in the FOP project.
 
 5. Can we modify and relicense under the ASL hyphenation pattern files
 clearly stated as being in the public domain without having a grant but
 giving credit where possible?
 
 6. We can't use files containing a restriction like Can be used freely
 for non-commercial purposes., except if we can positively identify the
 copyright holder and get a grant, right?
 
 Thank you in advance for your help!
 
 Jeremias Maerki
 FOP committer and XML PMC member


Jeremias Maerki


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



cvs commit: xml-fop/src/java/org/apache/fop/layout/hyphenation - New directory

2003-03-11 Thread jeremias
jeremias2003/03/11 04:52:22

  xml-fop/src/java/org/apache/fop/layout/hyphenation - New directory

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: Japanese Hyphenation was: Re: hyphenation patterns

2003-03-10 Thread Satoshi Ishigami

Konnichiwa.

On Fri, 07 Mar 2003 20:22:36 +0100 , J.Pietschmann wrote:

 Hm. I don't read japanese :-/

JIS X 4051 illustrates line-breaking, justification, writing-mode,
letter-spacing, ruby, etc. for Japanese text processing.

CSS3 module:text is useful to understand these features in English.
http://www.w3.org/TR/css3-text/

This document is probably same as JIS X 4051. Following section is
espeically useful for line-breaking.

6. Line breaking
11.2.  Hanging punctuation: the 'hanging-punctuation' property

Another useful document is following book.

http://www.oreilly.com/catalog/cjkvinfo/index.html
CJKV Information Processing
Chinese, Japanese, Korean  Vietnamese Computing
By Ken Lunde
1st Edition December 1998
1-56592-224-7, Order Number: 2247
1125 pages



  Certainly, many japanese people wish that FOP will implement it,
  but the Japanese Tex hypenation file does not work with current 
  FOP.

 What's the reason for this? I got the impression both the Japanese
 and the Chinese TeX versions patched also the TeX source in order
 to adapt to their respective line breaking rules. I'm not sure
 how relevant this is to hyphenation.

Current FOP can not control any line breaking restrictions.

The Asian languages line-breaking strategy has different
controls from those of western text. In Japanese, this
restriction is called 'kinsoku'.

A set of kinsoku character is Open Punctuation, Close Punctuation
and Ambiguous Quotation defined in UAX#14.

For example, you must not layout U+300C (LEFT CORNER BRACKET)
categorized in OP at the end of line and U+3002 (IDEOGRAPHIC
FULL STOP) categorized in CP at the head of line.

These restriction is estimated at each end of line where is same
point as the western soft-hyphenate estimation (i.e. break opportunity
estimation).

Can FOP currently control these restrictions without any modification?
If can, it is my misunderstanding and Japanese Tex hypenation file
can use it. But if can not, FOP must implements this feature to use
Japanese Tex hypenation file.

I think that the cost to implement JIS X 4051 line breaking
algorithm is almost equivalent to implement TR14. So I suggested
to implement TR14.


 This is planned for HEAD. The TR14 rules for CJK hyphenation seems
 to be easy: in absence of any more complicated requirements,
 hyphenate after every full character. Does the above mentioned
 standard add such more complicated rules which TR14 does not
 care too much about?

There is no more complicated rules for line-breaking.

CSS3 module:text says following :-)

http://www.w3.org/TR/css3-text/#line-break-prop
| The rules described by JIS X-4051 have been superseded by
| the Unicode Technical Report #14.

JIS X 4051 line-breaking and TR14 is almost equivalent.
In addition, TR14 can use for CJKV and any language with single
Unicode Line-Break-Properties file!

---
Satoshi Ishigami   VIC TOKAI CORPORATION

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



Re: hyphenation patterns

2003-03-10 Thread Keiron Liddle
 Keiron, I assume it was you who wrote two of the mails and put the
 notifications on the Wiki page? With only the IP address it's difficult
 to tell (you can register your name in Preferences. Nudge, nudge). Was
 it Togan, you contacted or one of the other two? Not that we write to
 the same people twice. Thanks

Sorry about that, still trying to sort out how this wiki stuff works.
Yes, I sent mails to the email for those who submitted the files.

Keiron.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



DO NOT REPLY [Bug 4316] - Problem of hyphenation in inline-areas

2003-03-09 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=4316.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=4316

Problem of hyphenation in inline-areas

[EMAIL PROTECTED] changed:

   What|Removed |Added

 Status|ASSIGNED|RESOLVED
 Resolution||INVALID



--- Additional Comments From [EMAIL PROTECTED]  2003-03-09 17:54 ---
The hyphenation property applies to fo:block and fo:character only according to
section 7.9.4 hyphenate (Not sure what it should cause for the latter as the
spec says Hyphenation may be used .. for the text contained in this object.
Odd.) This means you can't use fo:inline or whatever inline FO to temporarily
turn off hyphenation. I'd guess this is what keep-together.within-line is for.

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



Re: hyphenation patterns

2003-03-07 Thread Togan Muftuoglu
Hi,

* Jeremias Maerki; [EMAIL PROTECTED] on 06 Mar, 2003 wrote:
Let me know if you need further information ,
It would be good if we could find the location where the trhyph.tex in
the SuSE ditribution comes from. I don't want to download the whole
thing just for verifying one little file.
I have asked the maintainer and the reply is a follows

answer
In the .spec file it isn't listed separately.  Thus I assume it's part
of the regular teTeX distribution.  Hope this helps!
/answer

So I would asssume all the teTeX distributions have trhypen.tex file and
Debian is also strict with licenses so if it's in Debian then it means
at least it has the TeX licence
Hope this helps. Do you want me to contact Turgut Uyar or have you done
it already
--

Togan Muftuoglu

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]


Re: hyphenation patterns

2003-03-07 Thread Jeremias Maerki

On 07.03.2003 14:36:49 Togan Muftuoglu wrote:
 Hi,
 
 * Jeremias Maerki; [EMAIL PROTECTED] on 06 Mar, 2003 wrote:
  Let me know if you need further information ,
 
 It would be good if we could find the location where the trhyph.tex in
 the SuSE ditribution comes from. I don't want to download the whole
 thing just for verifying one little file.
 
 I have asked the maintainer and the reply is a follows
 
 answer
 In the .spec file it isn't listed separately.  Thus I assume it's part
 of the regular teTeX distribution.  Hope this helps!
 
 /answer
 
 So I would asssume all the teTeX distributions have trhypen.tex file and
 Debian is also strict with licenses so if it's in Debian then it means
 at least it has the TeX licence

Too many assumptions for me. :-)

 Hope this helps. Do you want me to contact Turgut Uyar or have you done
 it already

I can do that. I've also found an email address of Pierre MacKay.

Keiron, I assume it was you who wrote two of the mails and put the
notifications on the Wiki page? With only the IP address it's difficult
to tell (you can register your name in Preferences. Nudge, nudge). Was
it Togan, you contacted or one of the other two? Not that we write to
the same people twice. Thanks



Jeremias Maerki


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



Re: Japanese Hyphenation was: Re: hyphenation patterns

2003-03-07 Thread J.Pietschmann
Satoshi Ishigami wrote:
The JIS X 4051 spec is written in Japanese. I don't know whether
there is English version spec, or not.
Hm. I don't read japanese :-/

Certainly, many japanese people wish that FOP will implement it,
but the Japanese Tex hypenation file does not work with current 
FOP.
What's the reason for this? I got the impression both the Japanese
and the Chinese TeX versions patched also the TeX source in order
to adapt to their respective line breaking rules. I'm not sure
how relevant this is to hyphenation.
I think that FOP should implements UAX#14(TR14) if possible.
For example, AntennaHouse's XSLFormatter implements UAX#14.
This is planned for HEAD. The TR14 rules for CJK hyphenation seems
to be easy: in absence of any more complicated requirements,
hyphenate after every full character. Does the above mentioned
standard add such more complicated rules which TR14 does not
care too much about?
J.Pietschmann

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]


XML FOP: Licensing issues with hyphenation pattern files

2003-03-06 Thread Jeremias Maerki
Hello all (licensing specialists, XML PMC people, fop-devs)

(I don't know where might be the best place to discuss this. fop-dev is
currently very low-traffic.)

The FOP team needs help. In February we realized that we had problematic
hyphenation pattern files in our codebase. For example, some of them
were derived from GPL-licensed files.

Some action has already been taken. I've documented the whole auditing
process and status here:
http://nagoya.apache.org/wiki/apachewiki.cgi?FOPAudits/March2003

I'm at a point where I need some advice on how to proceed from here.
Here are the questions, I hope are the right ones and help us to resolve
these issues quickly.

Before we start here's a little background on the hyphenation stuff:
Our hyphenation file are XML files that are derived from TeX hyphenation
files (ASCII format). These XML files reside in CVS (xml-fop/src/hyph,
formerly xml-fop/hyph). During build they are parsed into Java objects
and serialized using Java serialization. These serialized objects are
included in the JAR (fop.jar) in the binary distribution.

1. Do our hyphenation files have to be licensed under the ASL? Or is it
possible to do something similar we do with Java libraries (external
dependencies) such as JUnit and JDOM (add the file to CVS and have an
accompanying file containing its license)?

2. If the former (of [1]) is true, we need a grant from the copyright
holder of the original file, right? What if the original file is unclear
about the copyright holder (multiple names, for example) and about the
license (no explicit license, for example)? Please see the Wiki page for
examples.

3. (Question is somewhat general) What's the threshold for the necessity
of a grant? Does a non-committer have to submit a grant on a single new
file?

4. Some of the hyphenation pattern files are licensed under the LPPL
(LaTeX project public license, http://www.latex-project.org/lppl.html).
We'd like to have clearance to use, modify and distribute files under
this license in the FOP project.

5. Can we modify and relicense under the ASL hyphenation pattern files
clearly stated as being in the public domain without having a grant but
giving credit where possible?

6. We can't use files containing a restriction like Can be used freely
for non-commercial purposes., except if we can positively identify the
copyright holder and get a grant, right?

Thank you in advance for your help!

Jeremias Maerki
FOP committer and XML PMC member


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



Re: hyphenation patterns

2003-03-06 Thread jaccoud

 other issues:
 -pt.xml iseems ok and can be put back?
[Jeremias Maaerki]
No, it isn't IMO. It forbids commercial usage which is not forbidden by
FOP's license. We cannot guarantee that the file is indirectly used for
commercial purposes. I could be wrong. I'll ask on on licensing.

I already discussed this with Herr Pietschmann and agreed to donate the
file to the ASF, provided the credits and bilingual comments are preserved.
As I said the previous license was there because of my laziness, I just
copied the old one...
However, because no one seemed to agree how to insert the licence in the
file (short or long). It seems it is already in the codebase, and you my
apply the appropriate licence in the way you find appropriate. I can send
another copy if needed.
I am happy to provide Portuguese-parliant users with a appropriate file,
even though this is a very small contribution.


By the way, is there a way to turn off the
  [ERROR] Couldn't find hyphenation pattern  pt_br
  using general language pattern pt instead.
message? First, I don't see it as an error, but a warning. Second, it is
not relevant to Portuguese: spelling may change from pt-PT to pt-BR, but
that does not affect hyphenation. (I know it does for other languages.)
Better, is there a way to tell FOP that the same hyphenation rules (file)
can be used for any pt[-XX] language values?


=
Marcelo Jaccoud Amaral
PetrobrĂ¡s (http://www.petrobras.com.br)
mailto:[EMAIL PROTECTED]
=
The way to get things done is not to mind who gets the credit of doing
them.--Benjamin Jowett




Sorry, I've been busy last week getting my barcode package a new home.
Almost forgot about the issues here.

On 04.03.2003 01:51:27 Christian Geisert wrote:
 if I understand it right we are allowed to distribute the LPPL
 hyphenation patterns (both source and binary) together with FOP
 if we add the LPPL LICENSE for the patterns (for example in the
 in the root dir) to the distribution ?
 (And to be safe I'll ask this on licensing@)

I want to ask at licensing first. I've finally received my confirmation
for admission to this mailing list. I'll do that today.









-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



Re: hyphenation patterns

2003-03-06 Thread Jeremias Maerki
Thanks for bringing it up again. It forgot that in the pile of work
these license problems generate. I'm sorry. I've updated the Wiki page
accordingly.
I've submitted some questions today concerning license issues. As soon
as I get the answers I'm going to contact you so we can finally readd
your file. Thanks for your patience!


On 06.03.2003 15:54:08 jaccoud wrote:
 
  other issues:
  -pt.xml iseems ok and can be put back?
 [Jeremias Maaerki]
 No, it isn't IMO. It forbids commercial usage which is not forbidden by
 FOP's license. We cannot guarantee that the file is indirectly used for
 commercial purposes. I could be wrong. I'll ask on on licensing.
 
 I already discussed this with Herr Pietschmann and agreed to donate the
 file to the ASF, provided the credits and bilingual comments are preserved.
 As I said the previous license was there because of my laziness, I just
 copied the old one...
 However, because no one seemed to agree how to insert the licence in the
 file (short or long). It seems it is already in the codebase, and you my
 apply the appropriate licence in the way you find appropriate. I can send
 another copy if needed.
 I am happy to provide Portuguese-parliant users with a appropriate file,
 even though this is a very small contribution.

snip/

Jeremias Maerki


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



Re: hyphenation patterns

2003-03-06 Thread Togan Muftuoglu
Hi,

* Jeremias Maerki; [EMAIL PROTECTED] on 06 Mar, 2003 wrote:
Thanks for bringing it up again. It forgot that in the pile of work
these license problems generate. I'm sorry. I've updated the Wiki page
accordingly.
I know you have tons of stuff to be sorted our with these licenses. Just
to make sure that as the submitter of the tr.xml let me know if tehre is
anything I can do to assist  

* Keiron Liddle; [EMAIL PROTECTED] on 05 Mar, 2003 wrote:
Hello Togan,

In the FOP project we a trying to clear up some license issues.
You donated a turkish hyphenation patterns file.
What is the license for that? Did you modify the original tex hyphenations and 
what was the license for that file?
Well I had send an email on Feb 17th 2003 to fop-dev mailinglist about
this issue and Jeremias said he would contact Turgut Uyar as there is no
license on the header. My modification is only to make FOP use the file
and I do not think that is rocket science :-) Nevertheless if you also
need grant for my work FOP project may use it

Let me know if you need further information ,

* Jeremias Maerki; [EMAIL PROTECTED] on 14 Feb, 2003 wrote:
tr.xml
Can't find original file.
No licence. Check with author.
Well, since I sent out the Turkish hyphenation file I should know where
it comes right. The trhyphen.tex is installed from the SuSE 8.1 distro 

[EMAIL PROTECTED]:~/hangar rpm -qf /usr/share/texmf/tex/generic/hyphen/trhyph.tex 
tetex-beta.20020207-254

the trhyph.tex file has the following header

% A mechanically generated Turkish Hyphenation table for TeX,
% using the University of Washington diacritical coding
% developed by P. A. MacKay for the Ottoman Texts Project.
% Slightly modified by H. Turgut Uyar.
Turgut Uyar has the following addres based on google search
[EMAIL PROTECTED]
Would you like to contact him directly or do you want me to do so ?
Although I think you can explain the needs and requirements better than
I can  

--
Togan Muftuoglu
--

Togan Muftuoglu

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]


Re: hyphenation patterns

2003-03-06 Thread J.Pietschmann
Jeremias Maerki wrote:
-pt.xml iseems ok and can be put back?
No, it isn't IMO. It forbids commercial usage which is not forbidden by
FOP's license.
This was the old pt.xml. Should I recommit the new one (released
under APL)? I thought I committed it both to HEAD and the maintenance
branch...
J.Pietschmann



-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]


Re: hyphenation patterns

2003-03-06 Thread Togan Muftuoglu
* Jeremias Maerki; [EMAIL PROTECTED] on 06 Mar, 2003 wrote:
related C source file
http://www.ctan.org/tex-archive/language/turkish/hyphen/turk_hyf.c
from Pierre A. MacKay which states that in case of commercial
distribution the author must be contacted. Which is one more indicator
that there could be problems with licensing. The whole TeX thing is a
legal nightmare if you ask me. Well, maybe we (or I) are overdoing
things here but we (or I) want to do it right.
Well shoot I found his papers and the whole thing is about with ottoman
turkish ( not the turkish we use today and his original hypenation is
tkhyphen.tex ) and now I know why there are some silly suggestions when
hyphenating :-)

Let me know if you need further information ,
It would be good if we could find the location where the trhyph.tex in
the SuSE ditribution comes from. I don't want to download the whole
thing just for verifying one little file.
I will ask the tetex maintainer to find I out I'll let you know as soon
as I get a reply from him.
side note why do I have to work the harder way no spellchecker no
hyphenation I started wondering am I the only one using these things
under Linux :-(
Thanks
--
Togan Muftuoglu

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]


Re: hyphenation patterns

2003-03-06 Thread J.Pietschmann
Jeremias Maerki wrote:
Unfortunately, that's not so easy. You've created a derived work, which
can't simply claim full copyright for.
In the realm of legal battles, wording matters.
Claiming copyright is ok as long as there was substantial work put
into creating the Derived Work.
Claiming the right to choose a lincense is quite another matter and
depends on the license of the Original Work.
It turns out that the file mentions multiple authors. Now the question
is: Who is the copyright owner
Each one is a copyright owner, and perhaps *all* would have to agree
on a license change, depending on the license(s) the file had through
its history. It's also a matter of courtesy.
that can give us the right to use, modify
and redistribute that file? Pierre A. MacKay seems to be the original
author. H. Turgut Uyar has modified it. Now, who owns the copyright?
It depends. For a start, Mr. MacKay apparently didn't create his stuff
for the sole purpose of creating hyphenation files. This means the
creator of the TeX file can not only claim copyright but also created
an Original Work rather than a Derived Work, so he can also choose a
license, as long as the original doesn't have a very restrictive
license (which would probably also make creating and using the TeX
hyphenation file illegal, so I suppose this isn't the case).
There is still the problem that the TeX file is explicitely labeled
as Mechanically generated, which can be interpreted in a way that
there wasn't substantial work put into creating the TeX file, which
means it isn't even a Derived work but basically another representation
of the original material and Mr. Uyar can't even claim copyright. The
slight modifications however should be enough to fix this.
Therefore, depending on the interpretation we've either have to ask Mr.
Uyar only (interpretation 1) or both (interpretation 2).
http://www.ctan.org/tex-archive/language/turkish/hyphen/turk_hyf.c
from Pierre A. MacKay which states that in case of commercial
distribution the author must be contacted.
This is not a real problem as long as the license doens't require
that Derived Works must be distributed under a similar condition.
that there could be problems with licensing. The whole TeX thing is a
legal nightmare if you ask me.
Well, the whole Intellectual Property stuff is a nightmare. This
is a direct result of assigning substantial value to intangibles...
J.Pietschmann

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]


Re: hyphenation patterns

2003-03-06 Thread J.Pietschmann
Togan Muftuoglu wrote:
side note why do I have to work the harder way no spellchecker no
hyphenation I started wondering am I the only one using these things
under Linux :-(
Until recently ispell was good enough. It's a bit dated now
but you could check Savannah whether anybody picked it up for
further development. SourceForge has a handful of projects for
spell checkers, most of which should be cross-platform. Which
does not imply they can be already used. Then there is MozSpell,
and OpenOffice has some sort of spell checker too.
Now the really big thing would be a grammar checker. I've still
my old notes from 15 years ago for a grammar checker for german
and english, but no time at all to do any real coding... :-/
J.Pietschmann

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]


Japanese Hyphenation was: Re: hyphenation patterns

2003-03-06 Thread J.Pietschmann
On a related matter: some time ago someone mentioned the japanese
hyphenation standard. I was not able to find the document, probably
all web sites dealing with this are in japanese. Is there anybody
listening who can help out?
J.Pietschmann

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]


Re: hyphenation patterns

2003-03-06 Thread Jeremias Maerki
Did you see my earlier mail to Marcelo Jaccoud Amaral? I'd like to wait
until we get answers from licensing@ (especially because of the grant
thing).

On 06.03.2003 21:45:58 J.Pietschmann wrote:
 Jeremias Maerki wrote:
 -pt.xml iseems ok and can be put back?
  No, it isn't IMO. It forbids commercial usage which is not forbidden by
  FOP's license.
 
 This was the old pt.xml. Should I recommit the new one (released
 under APL)? I thought I committed it both to HEAD and the maintenance
 branch...


Jeremias Maerki


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



Re: Japanese Hyphenation was: Re: hyphenation patterns

2003-03-06 Thread Satoshi Ishigami

Konnichiwa.

On Thu, 06 Mar 2003 22:32:10 +0100 , J.Pietschmann wrote:

 On a related matter: some time ago someone mentioned the japanese
 hyphenation standard. I was not able to find the document, probably
 all web sites dealing with this are in japanese. Is there anybody
 listening who can help out?

I wrote it in the past.
http://marc.theaimsgroup.com/?l=fop-devm=102992807207069w=2

The JIS X 4051 spec is written in Japanese. I don't know whether
there is English version spec, or not.

Certainly, many japanese people wish that FOP will implement it,
but the Japanese Tex hypenation file does not work with current 
FOP.

I think that FOP should implements UAX#14(TR14) if possible.
For example, AntennaHouse's XSLFormatter implements UAX#14.

UAX#14, Line Breaking Properties
http://www.unicode.org/reports/tr14/
Old discussion related with TR14 in fop-dev is
http://marc.theaimsgroup.com/?l=fop-devw=2r=1s=tr14q=b

---
Satoshi Ishigami   VIC TOKAI CORPORATION

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



Re: hyphenation patterns

2003-03-06 Thread Togan Muftuoglu
* J.Pietschmann; [EMAIL PROTECTED] on 06 Mar, 2003 wrote:
Togan Muftuoglu wrote:
side note why do I have to work the harder way no spellchecker no
hyphenation I started wondering am I the only one using these things
under Linux :-(
Until recently ispell was good enough. It's a bit dated now
but you could check Savannah whether anybody picked it up for
further development. SourceForge has a handful of projects for
spell checkers, most of which should be cross-platform. Which
does not imply they can be already used. Then there is MozSpell,
and OpenOffice has some sort of spell checker too.
It gets off topic yet for the record there is none for Turkish ( Ok
there is hyphenation for Turkish under an unknown licence for TeX)
I just did for myself a Turkish dictionary to be used with aspell yet
with a language like Turkish the wordlist can be as conjugated to a
figure like 100.000.000.000 so it is not realistic and possible to have
affixes to be able to create such a big list from a 35.000 root words


--

Togan Muftuoglu

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]


Re: hyphenation patterns

2003-03-04 Thread Jeremias Maerki
Sorry, I've been busy last week getting my barcode package a new home.
Almost forgot about the issues here.

On 04.03.2003 01:51:27 Christian Geisert wrote:
 if I understand it right we are allowed to distribute the LPPL
 hyphenation patterns (both source and binary) together with FOP
 if we add the LPPL LICENSE for the patterns (for example in the
 in the root dir) to the distribution ?
 (And to be safe I'll ask this on licensing@)

I want to ask at licensing first. I've finally received my confirmation
for admission to this mailing list. I'll do that today.

 other issues:
 -pt.xml iseems ok and can be put back?

No, it isn't IMO. It forbids commercial usage which is not forbidden by
FOP's license. We cannot guarantee that the file is indirectly used for
commercial purposes. I could be wrong. I'll ask on on licensing.

 -what about renaming en_GB.xml to en.xml?

+1

 -I remember *something* about hu.xml but couldn't find it

I wrote a mail to the original author but didn't get an answer. The mail
didn't bounce, though.

 -has anybody tried to contact the authors of patterns with unclear licence?

Forgot to do that. Sorry. Will do today.


I'm strongly for investigating the possibility to make use of the
hyphenation packages from OpenOffice. Although they are LGPL we could
support them and tell our users where they can get them. Not a very nice
thing because our users have to do the license evaluations themselves
but at least we are on the safe side. Is anyone out there who would like
to do that?

Jeremias Maerki


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



Re: hyphenation patterns

2003-03-04 Thread Jeremias Maerki
I've added a detailed Wiki page on the licensing audit for FOP where I
intend to track the status of all current issues.

http://nagoya.apache.org/wiki/apachewiki.cgi?FOPAudits

To the XML PMC members listening in: I'll follow up with direct
questions to licensing@ especially about the LPPL issue.

Any help with sorting out the issues is appreciated. Next on my list is
to write to some of the hyphenation pattern authors for clarification of
the license.

Jeremias Maerki


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



Re: hyphenation patterns

2003-03-04 Thread Jeremias Maerki

On 04.03.2003 09:37:51 Jeremias Maerki wrote:
  -has anybody tried to contact the authors of patterns with unclear licence?
 
 Forgot to do that. Sorry. Will do today.

Ok, I wrote to the authors of certain files where I see some probability
of success in finding the original author and getting permission for
further use. Please see the Wiki page for details:
http://nagoya.apache.org/wiki/apachewiki.cgi?FOPAudits/March2003

This was some really unpleasant work today. I'm looking forward to do
some coding again. I hope we can resolve the whole thing soon.

Jeremias Maerki


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



DO NOT REPLY [Bug 15985] - Hyphenation pattern files appear to be invalid

2003-03-03 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=15985.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=15985

Hyphenation pattern files appear to be invalid

[EMAIL PROTECTED] changed:

   What|Removed |Added

 Status|RESOLVED|REOPENED
 Resolution|FIXED   |



--- Additional Comments From [EMAIL PROTECTED]  2003-03-03 14:47 ---
The en.xml file has been removed with the CVS comment: 
 
Revision 1.1.2.2 , Tue Feb 18 04:39:10 2003 UTC (13 days, 9 hours ago) by chrisg  
Branch: fop-0_20_2-maintain  
Changes since 1.1.2.1: +0 -0 lines  
FILE REMOVED  
removed hyphenation patterns with unclear/problematic license 
 
I've checked out the latest revision from branch fop-0_20_2-maintain and the problem 
still exists! The country in the Hyphenator class is empty (null) and therefore it 
tries to 
open the en.xml file and not the en_GB.xml file which is the only one left for english 
hyphenation.

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



DO NOT REPLY [Bug 15985] - Hyphenation pattern files appear to be invalid

2003-03-03 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=15985.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=15985

Hyphenation pattern files appear to be invalid

[EMAIL PROTECTED] changed:

   What|Removed |Added

 Status|REOPENED|RESOLVED
 Resolution||FIXED



--- Additional Comments From [EMAIL PROTECTED]  2003-03-03 15:22 ---
These are two completely different issues.
Set the hyphenation language to en_GB until someone donates a hyphenation
file for en without license problems.

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



hyphenation patterns

2003-03-03 Thread Christian Geisert
Ok,

if I understand it right we are allowed to distribute the LPPL
hyphenation patterns (both source and binary) together with FOP
if we add the LPPL LICENSE for the patterns (for example in the
in the root dir) to the distribution ?
(And to be safe I'll ask this on licensing@)
other issues:
-pt.xml iseems ok and can be put back?
-what about renaming en_GB.xml to en.xml?
-I remember *something* about hu.xml but couldn't find it
-has anybody tried to contact the authors of patterns with unclear licence?
Christian



-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]


DO NOT REPLY [Bug 15985] - Hyphenation pattern files appear to be invalid

2003-02-28 Thread bugzilla
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=15985.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=15985

Hyphenation pattern files appear to be invalid

[EMAIL PROTECTED] changed:

   What|Removed |Added

 Status|NEW |RESOLVED
 Resolution||FIXED



--- Additional Comments From [EMAIL PROTECTED]  2003-03-01 00:30 ---
Mismatch caused by build system change, fixed in 0.20.5

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



en_US hyphenation pattern

2003-02-27 Thread Koes, Derrick

Anyone know why the en_US hyphenation pattern was dropped in 0.20.5rc2?

Thanks,
Derrick







This electronic transmission is strictly confidential to Smith  Nephew and
intended solely for the addressee.  It may contain information which is
covered by legal, professional or other privilege.  If you are not the
intended addressee, or someone authorized by the intended addressee to
receive transmissions on behalf of the addressee, you must not retain,
disclose in any form, copy or take any action in reliance on this
transmission.  If you have received this transmission in error, please
notify the sender as soon as possible and destroy this message.

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]



Re: en_US hyphenation pattern

2003-02-27 Thread Clay Leeds
Licensing issues.

Koes, Derrick wrote:
Anyone know why the en_US hyphenation pattern was dropped in 0.20.5rc2?

Thanks,
Derrick


--
Clay Leeds - [EMAIL PROTECTED]
Web Developer - Medata, Inc. - http://www.medata.com
PGP Public Key: https://mail.medata.com/pgp/cleeds.asc
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]


  1   2   3   >