page redesign for language pages

2014-01-20 Thread Daniel Naber
Hi,

I have modified the German page at https://languagetool.org/de/ to look 
like the homepage. I'd suggest that we do this for all language-specific 
pages, as this design looks much better. There's still enough room to 
cover anything you want (the user needs to scroll down, though).

Let me know if you have any problems with that. Feel free to adapt your 
page to the new design - just copy the PHP code from www/de/index.php. 
If there are no objections, I will change the design of all language 
pages that have not been converted on Wednesday.

Regards
  Daniel



--
CenturyLink Cloud: The Leader in Enterprise Cloud Services.
Learn Why More Businesses Are Choosing CenturyLink Cloud For
Critical Workloads, Development Environments  Everything In Between.
Get a Quote or Start a Free Trial Today. 
http://pubads.g.doubleclick.net/gampad/clk?id=119420431iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: Portuguese sentence splitter

2014-01-20 Thread Bruno P. Kinoshita
Does Portuguese use abbreviations that end in a dot, like a lot of other 
languages do? If so, is there a list of such abbreviations? We could 
then add Portuguese to segments.srx to make the sentence splitter work.


Brazilian Portuguese does, and I think the Portugal's Portuguese does as well. 

I found this list http://pt.wikipedia.org/wiki/Anexo:Lista_de_abreviaturas. One 
abbreviation that is very common is Sr., Sra., Prof. (Mr., Mrs., Prof.).

Hope that helps, 
Bruno


 From: Daniel Naber daniel.na...@languagetool.org
To: LanguageTool Developer List languagetool-devel@lists.sourceforge.net 
Sent: Monday, January 20, 2014 2:45 PM
Subject: Portuguese sentence splitter
 

Hi,

Portuguese uses the SRXSentenceTokenizer, but there's no mapping for 
Portuguese in segments.srx. The result is that sentences in Portuguese 
are not detected, i.e. everything is one long sentence. Thus the 
UppercaseSentenceStartRule never matches except at the very beginning of 
a text.

Does Portuguese use abbreviations that end in a dot, like a lot of other 
languages do? If so, is there a list of such abbreviations? We could 
then add Portuguese to segments.srx to make the sentence splitter work.

Regards
  Daniel


--
CenturyLink Cloud: The Leader in Enterprise Cloud Services.
Learn Why More Businesses Are Choosing CenturyLink Cloud For
Critical Workloads, Development Environments  Everything In Between.
Get a Quote or Start a Free Trial Today. 
http://pubads.g.doubleclick.net/gampad/clk?id=119420431iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


 

--
CenturyLink Cloud: The Leader in Enterprise Cloud Services.
Learn Why More Businesses Are Choosing CenturyLink Cloud For
Critical Workloads, Development Environments  Everything In Between.
Get a Quote or Start a Free Trial Today. 
http://pubads.g.doubleclick.net/gampad/clk?id=119420431iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


CoGrOO vs LanguageTool?

2014-01-20 Thread Richard Eckart de Castilho
Hi,

I recently stumbled across CoGrOO [1]. It appears to offer similar 
functionality as LanguageTool for Brazilian Portuguese only, but being based 
mostly on statistical models rather than rules.

Did anybody here ever try that or does anybody have an opinion on the project?
Or did anybody possibly compare it to LanguageTool?

Cheers,

-- Richard

[1] http://cogroo.sourceforge.net
--
CenturyLink Cloud: The Leader in Enterprise Cloud Services.
Learn Why More Businesses Are Choosing CenturyLink Cloud For
Critical Workloads, Development Environments  Everything In Between.
Get a Quote or Start a Free Trial Today. 
http://pubads.g.doubleclick.net/gampad/clk?id=119420431iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: Portuguese sentence splitter

2014-01-20 Thread Marco A.G.Pinto

  
  
Hello Daniel,
  
  Yes, there are lots of words that end with a dot, such as:
  - vs.
  - exmo.
  - etc.
  - dr.
  - dra.
  - prof.
  
  and so on...
  
  Could you explain what I need to do?
  
  PS-I am now the maintainer of the English dictionaries of
  Apache OpenOffice. I have committed
   (pressed the "PUBLISH" button in my account)
 a new version of the dictionaries for AOO (2014.01.01) but
  it is pending.
   Every month I release a new version of en_GB with around
  200-300 new unique words.
   It seems I will release Mozilla's more often.
   Is there a way of LanguageTool update the .DIC and .AFF of
  the en_GB?
   But please wait until the 1st of February since it will
  have 300+ unique words than the last
version I released.
  
  
  Thanks!
  
  Kind regards,
 Marco A.G.Pinto
   
  
  
  On 20/01/2014 16:45, Daniel Naber wrote:


  Hi,

Portuguese uses the SRXSentenceTokenizer, but there's no mapping for 
Portuguese in segments.srx. The result is that sentences in Portuguese 
are not detected, i.e. everything is one long sentence. Thus the 
UppercaseSentenceStartRule never matches except at the very beginning of 
a text.

Does Portuguese use abbreviations that end in a dot, like a lot of other 
languages do? If so, is there a list of such abbreviations? We could 
then add Portuguese to segments.srx to make the sentence splitter work.

Regards
  Daniel





-- 
  
  

--
CenturyLink Cloud: The Leader in Enterprise Cloud Services.
Learn Why More Businesses Are Choosing CenturyLink Cloud For
Critical Workloads, Development Environments  Everything In Between.
Get a Quote or Start a Free Trial Today. 
http://pubads.g.doubleclick.net/gampad/clk?id=119420431iu=/4140/ostg.clktrk___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: page redesign for language pages

2014-01-20 Thread Daniel Naber
On 2014-01-20 17:57, Marco A.G.Pinto wrote:

  Does that mean that on Wednesday I will be able to download the
 latest Web page checkout and translate directly over the PT files and
 then commit them?

There are not many new strings, as most strings are translated already 
(e.g. the download buttons). So there's probably not much work for you - 
but I see that the Portuguese page doesn't actually have much content 
besides the check form and the download buttons. It would be nice to get 
a bit more content, especially examples for errors that can be detected 
(see the other language pages).

Regards
  Daniel


--
CenturyLink Cloud: The Leader in Enterprise Cloud Services.
Learn Why More Businesses Are Choosing CenturyLink Cloud For
Critical Workloads, Development Environments  Everything In Between.
Get a Quote or Start a Free Trial Today. 
http://pubads.g.doubleclick.net/gampad/clk?id=119420431iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: page redesign for language pages

2014-01-20 Thread Marco A.G.Pinto

  
  
Hello!
  
  There is some text below the download buttons translated by me:
  https://languagetool.org/pt/
  
  I need to check the other language pages.
  
  Unfortunately, today I am very busy coding software and working on
  my university job from home.
  
  Tomorrow I will be out most of the day, with a meeting in the
  afternoon and the PhD seminar after it.
  
  I guess that only on Wednesday I will have some free time.
  
  PS-Daniel, the link to the CHANGES.txt vanished from the
  English page and I only know how to
   access it from the Portuguese page to add notes.
  
  
  Kind regards,
   Marco A.G.Pinto
   ---
  
  
  
  On 20/01/2014 18:25, Daniel Naber wrote:


  On 2014-01-20 17:57, Marco A.G.Pinto wrote:


  
 Does that mean that on Wednesday I will be able to download the
latest Web page checkout and translate directly over the PT files and
then commit them?

  
  
There are not many new strings, as most strings are translated already 
(e.g. the download buttons). So there's probably not much work for you - 
but I see that the Portuguese page doesn't actually have much content 
besides the check form and the download buttons. It would be nice to get 
a bit more content, especially examples for errors that can be detected 
(see the other language pages).

Regards
  Daniel



-- 
  
  

--
CenturyLink Cloud: The Leader in Enterprise Cloud Services.
Learn Why More Businesses Are Choosing CenturyLink Cloud For
Critical Workloads, Development Environments  Everything In Between.
Get a Quote or Start a Free Trial Today. 
http://pubads.g.doubleclick.net/gampad/clk?id=119420431iu=/4140/ostg.clktrk___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: Portuguese sentence splitter

2014-01-20 Thread Daniel Naber
On 2014-01-20 18:06, Marco A.G.Pinto wrote:

  Could you explain what I need to do?

Just collect those words (I hope it won't be thousands?) that end in a 
dot and send them here to the list or mail me directly. I'll then try to 
make the sentence detection work.

  Is there a way of LanguageTool update the .DIC and .AFF of the en_GB?

There's a script (create_dict.sh) for that, I'm not sure how robust it 
really is. It's documented at 
http://wiki.languagetool.org/hunspell-support. If in doubt, ask for an 
update here.

Regards
  Daniel


--
CenturyLink Cloud: The Leader in Enterprise Cloud Services.
Learn Why More Businesses Are Choosing CenturyLink Cloud For
Critical Workloads, Development Environments  Everything In Between.
Get a Quote or Start a Free Trial Today. 
http://pubads.g.doubleclick.net/gampad/clk?id=119420431iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel