Re: Question about rules

2016-06-29 Thread Daniel Naber
On 2016-06-29 18:10, Matheus Poletto wrote:

>  I want know if is ok  i keep writing and improving the rules to the
> pt-BR and doing commit on it to the project?

Let's do the next two or three changes with pull requests - if that 
works well, we'll provide you with direct write access to the LT github 
repo.

Regards
  Daniel


--
Attend Shape: An AT Tech Expo July 15-16. Meet us at AT Park in San
Francisco, CA to explore cutting-edge tech and listen to tech luminaries
present their vision of the future. This family event has something for
everyone, including kids. Get more information and register today.
http://sdm.link/attshape
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Question about rules

2016-06-29 Thread Matheus Poletto
Hi guy, how r u ?
 I just have my first commit accept to pt-BR, thanks for the awesome
support in this!
 I want know if is ok  i keep writing and improving the rules to the pt-BR
and doing commit on it to the project?

Best regards - Matheus Otavio Poletto from Brazil.
--
Attend Shape: An AT Tech Expo July 15-16. Meet us at AT Park in San
Francisco, CA to explore cutting-edge tech and listen to tech luminaries
present their vision of the future. This family event has something for
everyone, including kids. Get more information and register today.
http://sdm.link/attshape___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: I want to be the Galician language maintainer

2016-06-29 Thread Daniel Naber
On 2016-06-29 11:51, Gabriel André wrote:

Hi Gabriel,

> Last week I sent an e-mail to Daniel asking to be the Galician language
> maintainer. I am starting to create new rules and to review the old 
> ones
> now. I am not a programmer myself, but I am a linguist with some NLP
> knowledge.

welcome once more to LanguageTool! Please let us know if you have any 
questions. Once you have created/modified a few rules, please create a 
pull request at github.

Regards
  Daniel


--
Attend Shape: An AT Tech Expo July 15-16. Meet us at AT Park in San
Francisco, CA to explore cutting-edge tech and listen to tech luminaries
present their vision of the future. This family event has something for
everyone, including kids. Get more information and register today.
http://sdm.link/attshape
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: The spell checker issue

2016-06-29 Thread Dominique Pellé
Daniel Naber  wrote:

> Hi,
>
> yesterday I tried to update the English dictionary that LT includes. The
> details are documented at
> https://github.com/languagetool-org/languagetool/issues/329 but in a
> nutshell: our spell checking is so complicated that the dictionary
> update didn't work.
>
> We could really need a process that allows us to use hunspell
> dictionaries directly, without conversion to other formats. The original
> reason we don't use hunspell (or only parts of it) is that it's slow,
> especially when it comes to generating suggestions. Today I ran a test
> with hunspell 1.4.1 and LT, and it turns out LT is about 4-5 times
> faster.
>
> What could be a solution:
>
> A) Improve hunspell to be faster. We'd need someone who can do this and
> then we'd still rely on native code, which isn't what we want in Java
> (but we've lived with it for years now).
>
> B) Finally write a Java-based spell checker that can read hunspell
> dictionaries. The internet is full of spell checkers, but we need one
> with support for advanced features like compound words (important for
> German).
>
> C) I don't know, do you have an idea?
>
> If we cannot find a solution, the current situation will persist so that
> some dictionaries probably won't be updated.


If Hunspell is thread-safe (?), could we search for suggestions of
multiple words in parallel in multiple threads?

Dominique

--
Attend Shape: An AT Tech Expo July 15-16. Meet us at AT Park in San
Francisco, CA to explore cutting-edge tech and listen to tech luminaries
present their vision of the future. This family event has something for
everyone, including kids. Get more information and register today.
http://sdm.link/attshape
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


I want to be the Galician language maintainer

2016-06-29 Thread Gabriel André
Hi,

Last week I sent an e-mail to Daniel asking to be the Galician language
maintainer. I am starting to create new rules and to review the old ones
now. I am not a programmer myself, but I am a linguist with some NLP
knowledge.

Thank you very much,

Gabriel André

--
Attend Shape: An AT Tech Expo July 15-16. Meet us at AT Park in San
Francisco, CA to explore cutting-edge tech and listen to tech luminaries
present their vision of the future. This family event has something for
everyone, including kids. Get more information and register today.
http://sdm.link/attshape
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


The spell checker issue

2016-06-29 Thread Daniel Naber
Hi,

yesterday I tried to update the English dictionary that LT includes. The 
details are documented at 
https://github.com/languagetool-org/languagetool/issues/329 but in a 
nutshell: our spell checking is so complicated that the dictionary 
update didn't work.

We could really need a process that allows us to use hunspell 
dictionaries directly, without conversion to other formats. The original 
reason we don't use hunspell (or only parts of it) is that it's slow, 
especially when it comes to generating suggestions. Today I ran a test 
with hunspell 1.4.1 and LT, and it turns out LT is about 4-5 times 
faster.

What could be a solution:

A) Improve hunspell to be faster. We'd need someone who can do this and 
then we'd still rely on native code, which isn't what we want in Java 
(but we've lived with it for years now).

B) Finally write a Java-based spell checker that can read hunspell 
dictionaries. The internet is full of spell checkers, but we need one 
with support for advanced features like compound words (important for 
German).

C) I don't know, do you have an idea?

If we cannot find a solution, the current situation will persist so that 
some dictionaries probably won't be updated.

Regards
  Daniel

This is the text for testing, full of typos (supposed to be German):
Fgen Siex hxier Ixhren Txext eiwen. Klcken ie nch dr Prüung aug diw 
fatbig
unteelegten Textstellwn. oder notzen Sie desen Teyt alls Beeispiel füür 
eein
Paat Fwhler , diw LanguageTool erkwnnen ksnn: Ih wirde Ankst und banke.


--
Attend Shape: An AT Tech Expo July 15-16. Meet us at AT Park in San
Francisco, CA to explore cutting-edge tech and listen to tech luminaries
present their vision of the future. This family event has something for
everyone, including kids. Get more information and register today.
http://sdm.link/attshape
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel