Re: Help improve rule pt_PT

2016-08-17 Thread Jan Schreiber
I would try it like this:

 
 
   
 há
 
 segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos?
 
   atrás
 
   
   Com o verbo haver não é necessário usar "atrás".
   Há n segundos 
atrás.
 

In other words, replace 'atrás' with an empty string. I am not sure if 
this leaves an undesired space character before the period, though.

Also, IIRC rule ids should not contain accented characters.

I couldn't test my version though, since I don't have Java installed on 
this office machine.

Am 17.08.2016 um 14:05 schrieb Marco A.G.Pinto:
> Thanks, Jan, I use "1" and "3" but TESTRULES PT gives an error in:
> Com o verbo haver não é necessário usar "atrás": \1
> \2 \3.
>
> Can you help?
>
> Thanks!
>
> On 17/08/2016 12:36, Jan Schreiber wrote:
>> Marco,
>>
>> I think you could replace
>> 
>> from your original version with the following:
>> 
>> (Replace 0 with 1 if necessary.)
>>
>> Best,
>> Jan
>>
>> Am 17.08.2016 um 13:25 schrieb Marco A.G.Pinto:
>>> Yakov,
>>>
>>> It didn't work.
>>>
>>> I tried it in the stand-alone tool after using *TESTRULES PT*, with the
>>> sentences:
>>> *Isso aconteceu à quase 20 anos!**
>>> **Isso aconteceu à quase vinte e três anos!**
>>> **Há 10 anos atrás**
>>> **Há quarenta e dois anos atrás*
>>>
>>> Rules I changed:
>>> ***
>>> ****
>>> **  **
>>> **há**
>>> ****
>>> **>> regexp="yes">segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos?**
>>> **atrás**
>>> **  **
>>> **  Com o verbo haver não é necessário usar "atrás":
>>> \1 \2 \3.**
>>> **  Há n segundos
>>> atrás.**
>>> ****
>>> **
>>> **
>>> ****
>>> ****
>>> ****
>>> **  **
>>> ****
>>> **à**
>>> ****
>>> **quase**
>>> ****
>>> **>> regexp="yes">segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos?**
>>> **  **
>>> **  Substituir «à» por há.**
>>> **  Conheço a Ana à quase
>>> 30 anos.**
>>> **   *
>>>
>>>
>>> Thanks!
>>>
>>> Kind regards,
>>>>Marco A.G.Pinto
>>>  ---
>>>
>>>
>>> On 17/08/2016 11:35, Yakov Reztsov wrote:
>>>> Hello!
>>>> One of the options is add *skip="1":*
>>>>
>>>> 
>>>> 
>>>>   
>>>> há
>>>> 
>>>> >>> regexp="yes">segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos?
>>>> atrás
>>>>   
>>>>   Com o verbo haver não é necessário usar "atrás":
>>>> \1 \2 \3.
>>>>   Há n segundos
>>>> atrás.
>>>> 
>>>>
>>>> Среда, 17 августа 2016, 10:59 +03:00 от "Marco A.G.Pinto" <>:
>>>>
>>>> Hello!
>>>>
>>>> I want to improve the following rule:
>>>>
>>>> ***
>>>> ****
>>>> **  **
>>>> **há**
>>>> ****
>>>> **>>> 
>>>> regexp="yes">segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos?**
>>>> **atrás**
>>>> **  **
>>>> **  Com o verbo haver não é necessário usar "atrás":
>>>> \1 \2 \3.**
>>>> **  Há n segundos
>>>> atrás.**
>>>> ***
>>>>
>>>>
>>>> I want the token after "há" to accept up to three words.
>>>>
>>>> For example:
>>>> "há *vinte e três* dias atrás"
>>>>
>>>> What changes shall I make?
>>>>
>>>> Thanks!
>>>>
>>>> Kind regards,
>>>>  >Marco A.G.Pinto
>>>>---
>>>>
>>>> --
>>>>
>>>>
>>>> --
>>>>
>>>> Yakov Reztsov
>>> --
>>>
>>>
>
>
> --
>
>
> --
>
>
>
> ___
> Languagetool-devel mailing list
> Languagetool-devel@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/languagetool-devel
>

--
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: English native speaker help

2016-08-17 Thread Jan Schreiber
Hi list,

What Athena says is exactly what I learned in school, and they were 
supposed to teach us British English there. So it's probably safe to add 
this as a rule for both variants.

See also:
http://www.englishteachermelanie.com/grammar-present-perfect-have-you-ever-been-to/

Best,
Jan

Am 17.08.2016 um 10:10 schrieb Athena Santora:
> Hi there
>
> In American English the sentence is awkward. It's not "wrong" to say "I
> never have been" but a native speaker wouldn't commonly phrase it that way.
>
> The other "error" : "been IN" isn't technically an error either but a
> native speaker wouldn't commonly phrase it this way either (now that I
> live in Spain, i find its super common for non Native speakers when they
> translate).
>
> A native speaker would say ..."been TO London." So
>
> You may say
> I've never been to London
> Or...
> I have never been to London
>
> Hope this helps.!

--
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: Help improve rule pt_PT

2016-08-17 Thread Jan Schreiber
Marco,

I think you could replace

from your original version with the following:

(Replace 0 with 1 if necessary.)

Best,
Jan

Am 17.08.2016 um 13:25 schrieb Marco A.G.Pinto:
> Yakov,
>
> It didn't work.
>
> I tried it in the stand-alone tool after using *TESTRULES PT*, with the
> sentences:
> *Isso aconteceu à quase 20 anos!**
> **Isso aconteceu à quase vinte e três anos!**
> **Há 10 anos atrás**
> **Há quarenta e dois anos atrás*
>
> Rules I changed:
> ***
> ****
> **  **
> **há**
> ****
> ** regexp="yes">segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos?**
> **atrás**
> **  **
> **  Com o verbo haver não é necessário usar "atrás":
> \1 \2 \3.**
> **  Há n segundos
> atrás.**
> ****
> **
> **
> ****
> ****
> ****
> **  **
> ****
> **à**
> ****
> **quase**
> ****
> ** regexp="yes">segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos?**
> **  **
> **  Substituir «à» por há.**
> **  Conheço a Ana à quase
> 30 anos.**
> **   *
>
>
> Thanks!
>
> Kind regards,
>>Marco A.G.Pinto
>  ---
>
>
> On 17/08/2016 11:35, Yakov Reztsov wrote:
>> Hello!
>> One of the options is add *skip="1":*
>>
>> 
>> 
>>   
>> há
>> 
>> > regexp="yes">segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos?
>> atrás
>>   
>>   Com o verbo haver não é necessário usar "atrás":
>> \1 \2 \3.
>>   Há n segundos
>> atrás.
>> 
>>
>> Среда, 17 августа 2016, 10:59 +03:00 от "Marco A.G.Pinto" <>:
>>
>> Hello!
>>
>> I want to improve the following rule:
>>
>> ***
>> ****
>> **  **
>> **há**
>> ****
>> **> 
>> regexp="yes">segundos?|minutos?|horas?|dias?|semanas?|mês|meses|anos?**
>> **atrás**
>> **  **
>> **  Com o verbo haver não é necessário usar "atrás":
>> \1 \2 \3.**
>> **  Há n segundos
>> atrás.**
>> ***
>>
>>
>> I want the token after "há" to accept up to three words.
>>
>> For example:
>> "há *vinte e três* dias atrás"
>>
>> What changes shall I make?
>>
>> Thanks!
>>
>> Kind regards,
>>  >Marco A.G.Pinto
>>---
>>
>> --
>>
>>
>> --
>>
>> Yakov Reztsov
>
> --
>
>
> --
>
>
>
> ___
> Languagetool-devel mailing list
> Languagetool-devel@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/languagetool-devel
>

--
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: I want become a maintainer in PT-BR

2016-06-27 Thread Jan Schreiber
Hi Matheus,

Just in case you are struggling with git (as I do), here is a helpful
basic guide: http://rogerdudler.github.io/git-guide/

My advice, for what it's worth: As a first step, take a look at the
grammar.xml file for European Portuguese or any other language that you
happen to be familiar with. Most of the legwork is done in the
respective grammar.xml for each language.

And don't hesitate to ask if you get stuck. The community will figure it
out somehow.

(Not that this is very relevant, but I'm the one who directed you to
this mailing list on Facebook.)

Best,
Jan

Am 27.06.2016 um 19:55 schrieb Daniel Naber:
> On 2016-06-27 19:01, Matheus Poletto wrote:
> 
> Hi Matheus,
> 
>>> I found some posts saying you need maintainer for my language PT-BR
>>> and i love the ideia of becomming one. What i need do to become
>>> this?
> 
> that's great, welcome to LanguageTool! Most maintainers spend their time 
> writing new error detection rules and improving existing rules. The 
> technical details about this are documented at [1]. But in general, 
> there's no TODO list and maintainers choose their own tasks. This may be 
> XML rule development only or developing with Java, or writing add-ons.
> 
> Maybe Marco, our Maintainer for European Portuguese, can give you some 
> examples of issues that you could work on. Once you have first results, 
> you can create a pull request at github and your changes will become 
> part of LT. Please let us know if you have more questions.
> 
> Regards
>   Daniel
> 
> [1] http://wiki.languagetool.org/development-overview
> 


--
Attend Shape: An AT Tech Expo July 15-16. Meet us at AT Park in San
Francisco, CA to explore cutting-edge tech and listen to tech luminaries
present their vision of the future. This family event has something for
everyone, including kids. Get more information and register today.
http://sdm.link/attshape
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: Link to LT Wiki

2016-03-30 Thread Jan Schreiber
Am 30.03.2016 um 11:03 schrieb Mike Unwalla:
> Development page:  "wiki.languagetool.org contains the complete development
> documentation, including:" If the only purpose of wiki.languagetool.org is
> for development information, then change the name from 'wiki.' to
> 'development.' Reason: 'wiki' tells me it's a wiki, but 'development' tells
> me its purpose, which is much more useful.
>
'development' is very general as well. How about 'dev_wiki' or something 
like that?

--
Transform Data into Opportunity.
Accelerate data analysis in your applications with
Intel Data Analytics Acceleration Library.
Click to learn more.
http://pubads.g.doubleclick.net/gampad/clk?id=278785471=/4140
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: new committer: Mike Unwalla

2016-03-23 Thread Jan Schreiber
Welcome on board, Mike!

Am 23.03.2016 um 17:23 schrieb Daniel Naber:
> Hi,
>
> I'd like to welcome Mike Unwalla as a new committer and maintainer for
> English in LT. Mike has been around on this mailing list for a long time
> and he's very experienced with writing rules for technical English.
> Welcome to the team Mike, we're looking forward to your contributions!
>
> Regards
>Daniel
>

--
Transform Data into Opportunity.
Accelerate data analysis in your applications with
Intel Data Analytics Acceleration Library.
Click to learn more.
http://pubads.g.doubleclick.net/gampad/clk?id=278785351=/4140
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: MS Word add-in for LT

2016-01-29 Thread Jan Schreiber
> There's 
> also a "Tipography" check box (it should rather by Typography, btw) 
> whose meaning is unclear to me. Is it punctuation or spelling?


In German, I use the Typography category for rules that aren't exactly
punctuation errors, but are more or or less of an aesthetic nature. For
example, if somebody mixes straight quotation marks (") and French
guillemets (») in the same text, I wouldn't call it a punctuation error,
because both styles are legit. Similarly, using three individual dots
instead of the ellipsis sign is something that I consider a
"typographical" error.

--
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=267308311=/4140
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: Bug in German rule

2016-01-19 Thread Jan Schreiber
Thanks for the report! That bug is my fault. When I wrote the rule, I
tacitly assumed that SENT_END would always be a punctuation mark, I
wasn't aware that it can ever be a word.

If we revert Daniel's last change and use (something like) the following
to match a period or an exclamation mark at the sentence end in
FRAGE_OHNE_FRAGEZEICHEN:
  \.|!
I think we can keep the suggestion.

We could then have another rule that covers the case when a sentence
ends with no punctuation mark at all. I am thinking of a rule that
matches something like this:
  [^\.!\?]

I'm not sure if the last regex will work, the intention is to match a
SENT_END that isn't a period, exclamation mark or question mark. Maybe
we would have to add a colon and/or an ellipsis.


Am 18.01.2016 18:11, schrieb Rick Genter:
> I'm experimenting with LanguageTool-3.2 and have discovered what I
> believe to be a problem with one of the German rules.
> 
> If I give the text "Wie geht es dir", the LanguageTool UI shows:
> 
> Inline image 1
> 
> While I understand why "dir" is underlined, I feel like the column
> information is wrong and that the error is actually in column 16, where
> the ? is missing.
> 
> I'm trying to use LanguageTool via API for batch processing of text, and
> I would like to be able to apply suggestions for correcting the text
> when LanguageTool detects an error. For most errors that have
> suggestions this works, but for FRAGE_OHNE_FRAGEZEICHEN this doesn't; if
> I apply the suggestion the above becomes "Wie geht es ?" (the "dir" is
> lost).
> 
> I looked at de/grammar.xml but it wasn't clear how I would change it to
> fix this, and if I did change it I think the UI would lose the red
> underline under "dir".
> 
> What's the best way to solve this?
> -- 
> Rick Genter
> rick.gen...@gmail.com 
> 
> 
> --
> Site24x7 APM Insight: Get Deep Visibility into Application Performance
> APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
> Monitor end-to-end web transactions and take corrective actions now
> Troubleshoot faster and improve end-user experience. Signup Now!
> http://pubads.g.doubleclick.net/gampad/clk?id=267308311=/4140
> 
> 
> 
> ___
> Languagetool-devel mailing list
> Languagetool-devel@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/languagetool-devel
> 

--
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=267308311=/4140
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: new syntax available

2015-10-06 Thread Jan Schreiber
That's great news, Daniel! A very powerful new feature.

Am 05.10.2015 14:59, schrieb Daniel Naber:
> Hi,
> 
> there's now a first and limited implementation of the  syntax in 
> master. Instead of
> 
> foo
> 
> you can now use
> 
> foo
> 

--
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: Integration of LanguageTool with VisualEditor

2015-07-28 Thread Jan Schreiber
 Sorry, it's Internal error. I get it every time I visit
 http://tools.wmflabs.org/languageproofing/
Same here.

Am 28.07.2015 16:05, schrieb Daniel Naber:
 On 2015-07-28 15:10, Ankita Kumari wrote:
 
 Hi, how did you get Internal server error? That would still be
 progress because so far I haven't gotten any error.
 
 Sorry, it's Internal error. I get it every time I visit 
 http://tools.wmflabs.org/languageproofing/
 
 Regards
   Daniel



signature.asc
Description: OpenPGP digital signature
--
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: thanking contributors

2015-07-03 Thread Jan Schreiber
Hi Daniel,

The title element of the page reads

titleLanguageTool Contact/title

I guess 'Contact' should be replaced with 'Team'.

Best,
Jan

Am 03.07.2015 17:18, schrieb Daniel Naber:
 Hi,
 
 I've added a new page https://languagetool.org/team/ that lists all the 
 people who have ever contributed to LT. Or rather at least those I could 
 find by searching for by, thanks, and contribute in our change log 
 and the git log. Please let me know if someone is missing, and add 
 people who contribute rules, patches, or good ideas. Being listed there 
 might be a nice motivation to contribute for some.
 
 The new page is linked from the footer of every page on 
 languagetool.org.
 
 Regards
   Daniel
 
 
 --
 Don't Limit Your Business. Reach for the Cloud.
 GigeNET's Cloud Solutions provide you with the tools and support that
 you need to offload your IT needs and focus on growing your business.
 Configured For All Businesses. Start Your Cloud Today.
 https://www.gigenetcloud.com/
 ___
 Languagetool-devel mailing list
 Languagetool-devel@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/languagetool-devel
 

--
Don't Limit Your Business. Reach for the Cloud.
GigeNET's Cloud Solutions provide you with the tools and support that
you need to offload your IT needs and focus on growing your business.
Configured For All Businesses. Start Your Cloud Today.
https://www.gigenetcloud.com/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: confusion probability rule now online

2015-05-29 Thread Jan Schreiber
Another word pair came to my mind for which this method might work:
prove - proof

Best,
Jan

Am 27.05.2015 20:12, schrieb Daniel Naber:
 On 2015-05-27 19:45, Jan Schreiber wrote:
 
 this is really awesome! Is it possible to extend this to abbreviations
 with an apostrophe?
 
 Not yet, but it's on my TODO list.
 
 Other important cases:
 then - than
 effect - affect
 
 I'll try to add these.
 
 Regards
   Daniel
 
 
 --
 ___
 Languagetool-devel mailing list
 Languagetool-devel@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/languagetool-devel
 

--
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Browser preview of grammar.xml broken

2015-05-27 Thread Jan Schreiber
Hi,

I am currently studying the German grammar file in order to get myself
up to speed again, so I can start making some contributions in the near
future.

I noticed that the browser view doesn't show the examples anymore,
apparently because the type attribute was removed. Could somebody fix
the rules.css file to accommodate for that change?

TIA,
Jan

--
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: Help with unmunch and Icelandic + Galician

2014-10-27 Thread Jan Schreiber
You have to use V3.0 build 64. From the menu Dictionary Tools, choose
Extract wordlist. It worked for me.

Am 27.10.2014 16:38, schrieb Daniel Naber:
 On 2014-10-27 13:48, Marco A.G.Pinto wrote:
 
  To unmunch .DIC + .AFF use my tool, Proofing Tool GUI:
  http://marcoagpinto.cidadevirtual.pt/proofingtoolgui.html [3]
 
 How does it work? I couldn't find an unmunch menu item or similar. 
 What does it do differently to unmunch command line program?
 
 Regards
   Daniel
 
 
 --
 ___
 Languagetool-devel mailing list
 Languagetool-devel@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/languagetool-devel
 

--
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: Checkout Failed

2014-10-23 Thread Jan Schreiber
Hi,
The only advice I have to give here is to keep your patience. I have had
this problem and it usually went away when I kept trying. It can be very
annoying though.
--Jan

Am 23.10.2014 16:25, schrieb Marco A.G.Pinto:
 Hello!
 
 After long minutes downloading files with a checkout in Tortoise, I get a:
 The XML response contains invalid XML:
 
 
 Is there a way of fixing it?
 
 Thanks!
 
 Kind regards,
 Marco A.G.Pinto
   ---
 
 
 -- 
 
 
 --
 
 
 
 ___
 Languagetool-devel mailing list
 Languagetool-devel@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/languagetool-devel
 

--
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: British English: -IZE/-ISE

2014-10-18 Thread Jan Schreiber
For German there is WordCoherencyRule that uses a semicolon-separated
list called coherency.txt. This can probably easily be ported to English.
The solution is not ideal, however, since it detects incoherent use of
individual words, such as 'monetize'/'monetise', but it doesn't insist
that users should prefer, say, 'familiarize' if they used 'monetize'.

Am 18.10.2014 01:35, schrieb Marco A.G.Pinto:
 Hello!
 
 Since some British people complain that they prefer -ISE, others prefer
 -IZE and others both, I was wondering if we could add a setting to LT
 regarding that.
 
 This way, one en_GB dictionary would satisfy everyone, and we would rely
 on LT to do the job.
 
 Is it a good idea?
 
 Thanks!
 
 Kind regards,
Marco A.G.Pinto
  --
 
 
 -- 
 
 
 --
 Comprehensive Server Monitoring with Site24x7.
 Monitor 10 servers for $9/Month.
 Get alerted through email, SMS, voice calls or mobile push notifications.
 Take corrective actions from your mobile device.
 http://p.sf.net/sfu/Zoho
 
 
 
 ___
 Languagetool-devel mailing list
 Languagetool-devel@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/languagetool-devel
 

--
Comprehensive Server Monitoring with Site24x7.
Monitor 10 servers for $9/Month.
Get alerted through email, SMS, voice calls or mobile push notifications.
Take corrective actions from your mobile device.
http://p.sf.net/sfu/Zoho
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: languagetool.org improvement

2014-10-17 Thread Jan Schreiber
I tested it with MSIE and it works fine!

Am 17.10.2014 10:04, schrieb Daniel Naber:
 Hi,
 
 I've modified languagetool.org so that it not only selects your language 
 by default, but now also the country variant. ...

--
Comprehensive Server Monitoring with Site24x7.
Monitor 10 servers for $9/Month.
Get alerted through email, SMS, voice calls or mobile push notifications.
Take corrective actions from your mobile device.
http://p.sf.net/sfu/Zoho
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: Yours missing POS tag?

2014-10-17 Thread Jan Schreiber
There is a file called manually_added.txt in the English resources
folder. Would it make sense to add it there, or is this just a kind of
record?

Am 17.10.2014 17:06, schrieb Daniel Naber:
 On 2014-10-17 16:33, Jonathon Churchill wrote:
 
 Hi, when writing a rule I noticed that the word yours does not
 receive the PRP$ tag, rather just has the PRP tag. As shown in the
 following link suggests that yours should in fact have the PRP$ tag
 because it is a possessive pronoun:
 
 Thanks for catching this. The POS data has been collected from several 
 sources, a complete list is at the bottom of our README.txt. I assume it 
 has just been missing from the original data (or maybe it was lost 
 during the conversion). Could you open a bug report for this? I'd like 
 to fix this once we have collected several such cases, so we don't have 
 to touch the binary file just to fix a single word.
 
 Regards
   Daniel

--
Comprehensive Server Monitoring with Site24x7.
Monitor 10 servers for $9/Month.
Get alerted through email, SMS, voice calls or mobile push notifications.
Take corrective actions from your mobile device.
http://p.sf.net/sfu/Zoho
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: svn issue (solved)

2014-10-16 Thread Jan Schreiber
Yeah, this type of issue seems quite common with svn. Most issues I have
had so far went away after a few hours, and the others were usually
resolved by checking out to a new folder.

Am 16.10.2014 16:42, schrieb R.J. Baars:
 Somehow, the issue disappeared spontaneously.
 
 Ruud
 
 Somehow I suddenly could not commit anything to LT anymore.



signature.asc
Description: OpenPGP digital signature
--
Comprehensive Server Monitoring with Site24x7.
Monitor 10 servers for $9/Month.
Get alerted through email, SMS, voice calls or mobile push notifications.
Take corrective actions from your mobile device.
http://p.sf.net/sfu/Zoho___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: users suggesting words for the spell checker

2014-10-13 Thread Jan Schreiber
I love the idea! A user-friendly, hassle-free way to submit false
positives in Hunspell was indeed quite desirable.

Am 13.10.2014 09:42, schrieb Daniel Naber:
 Hi,
 
 I've activated a new feature on languagetool.org for German that lets
 users suggest a word for the spell checker. There's a new menu item on
 potentially misspelled words that directs users to a page on
 community.languagetool.org where they can submit this word. No
 registration is needed for the user. I watch these words via a
 password-protected RSS feed and add the valid ones to the German
 ignore.txt file. There are about 3-6 suggestions per day for German.
 From time to time I will report the added words to the maintainer of the
 German spell checker dictionary.
 

--
Meet PCI DSS 3.0 Compliance Requirements with EventLog Analyzer
Achieve PCI DSS 3.0 Compliant Status with Out-of-the-box PCI DSS Reports
Are you Audit-Ready for PCI DSS 3.0 Compliance? Download White paper
Comply to PCI DSS 3.0 Requirement 10 and 11.5 with EventLog Analyzer
http://p.sf.net/sfu/Zoho
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: switching from Hunspell to Morfologik

2014-10-13 Thread Jan Schreiber
In case anyone's interested in the exported plain text file, it is here:
http://sourceforge.net/projects/germandict/files/Morfologik/de_frequency.7z

I sorted the words by frequency class and additionally sorted the
largest A class of least frequent words by word length.

The frequency distribution for the first 200,000 words looks fairly
plausible, but the vast majority (about 1.4 million word forms) is
lumped together in one huge class.

Ruud, you said you have larger frequency data sets available for most of
the languages. If you happen to have data for German available I would
love to have it, ideally in the gaia format so I don't have to hassle
with converting it. But a tab-separated list or something like that
would also be great.

--Jan

Am 12.10.2014 18:18, schrieb Jan Schreiber:
 I figured out how to dump the dictionary. All I had to do was create a
 hunspell subfolder and move the binary dictionary into it, then the
 exporting process worked as advertised.

--
Meet PCI DSS 3.0 Compliance Requirements with EventLog Analyzer
Achieve PCI DSS 3.0 Compliant Status with Out-of-the-box PCI DSS Reports
Are you Audit-Ready for PCI DSS 3.0 Compliance? Download White paper
Comply to PCI DSS 3.0 Requirement 10 and 11.5 with EventLog Analyzer
http://p.sf.net/sfu/Zoho
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: switching from Hunspell to Morfologik

2014-10-13 Thread Jan Schreiber
Dawid,

thanks a ton for the clarifications. TBH I blindly followed the
instructions on the Wiki page you mentioned and I wasn't really sure
what I was doing. Please let's make sure I don't misunderstand you.

 with SUFFIX encoder (which your .info file implicitly picks)
This encoder expands suffixes, similar to Hunspell, with which I am
slightly more familiar, right? The word list I used is just a plain list
with no affix compression whatsoever, hence this feature should
definitely be turned off. I wasn't even fully aware it existed.

One of the problems here was that I couldn't test the binary dict file
to see if it worked. Is there a way to use Morfologik for non-programmers?

--Jan


Am 13.10.2014 20:29, schrieb Dawid Weiss:
 This is a valid FSA file, but not a valid encoding for the dictionary
 you're trying to dump, Jan. That's why you're getting an exception.
 For example this entry:
 
 AAA+I
 
 with SUFFIX encoder (which your .info file implicitly picks) this
 means to truncate 8 bytes from the sequence, which is clearly wrong.
 It seems to me that you have data that shouldn't be encoded with
 anything (and isn't) -- perhaps the LT colleagues can follow-up with
 this one. The wiki page at:
 
 http://wiki.languagetool.org/hunspell-support
 
 indeed should clarify the encoder property for the associated .info file as:
 
 fsa.dict.encoder=NONE
 
 if you comment out these obsolete properties from your .info file:
 
 #fsa.dict.uses-prefixes=false
 #fsa.dict.uses-infixes=false
 
 and add the above one, the dictionary dumps just fine. In any case,
 you can always dump *any* FSA dictionary without applying the decoding
 routines; just use:
 
 java -jar morfologik-tools-1.10.0-SNAPSHOT-standalone.jar  fsa_dump -d
 dict --raw-data
 
 If you do want to decode the data, pass an additional -x, although
 if the underlying data doesn't make sense, exceptions may occur (no
 runtime checks are done to verify sanity for performance reasons).
 
 Dawid
 
 On Mon, Oct 13, 2014 at 3:59 PM, Jan Schreiber
 jan.schrei...@languagetool.org wrote:
 In case anyone's interested in the exported plain text file, it is here:
 http://sourceforge.net/projects/germandict/files/Morfologik/de_frequency.7z

 I sorted the words by frequency class and additionally sorted the
 largest A class of least frequent words by word length.

 The frequency distribution for the first 200,000 words looks fairly
 plausible, but the vast majority (about 1.4 million word forms) is
 lumped together in one huge class.

 Ruud, you said you have larger frequency data sets available for most of
 the languages. If you happen to have data for German available I would
 love to have it, ideally in the gaia format so I don't have to hassle
 with converting it. But a tab-separated list or something like that
 would also be great.

 --Jan

 Am 12.10.2014 18:18, schrieb Jan Schreiber:
 I figured out how to dump the dictionary. All I had to do was create a
 hunspell subfolder and move the binary dictionary into it, then the
 exporting process worked as advertised.

 --
 Meet PCI DSS 3.0 Compliance Requirements with EventLog Analyzer
 Achieve PCI DSS 3.0 Compliant Status with Out-of-the-box PCI DSS Reports
 Are you Audit-Ready for PCI DSS 3.0 Compliance? Download White paper
 Comply to PCI DSS 3.0 Requirement 10 and 11.5 with EventLog Analyzer
 http://p.sf.net/sfu/Zoho
 ___
 Languagetool-devel mailing list
 Languagetool-devel@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/languagetool-devel
 
 --
 Comprehensive Server Monitoring with Site24x7.
 Monitor 10 servers for $9/Month.
 Get alerted through email, SMS, voice calls or mobile push notifications.
 Take corrective actions from your mobile device.
 http://p.sf.net/sfu/Zoho
 ___
 Languagetool-devel mailing list
 Languagetool-devel@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/languagetool-devel
 

--
Comprehensive Server Monitoring with Site24x7.
Monitor 10 servers for $9/Month.
Get alerted through email, SMS, voice calls or mobile push notifications.
Take corrective actions from your mobile device.
http://p.sf.net/sfu/Zoho
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: switching from Hunspell to Morfologik

2014-10-12 Thread Jan Schreiber
Daniel Naber wrote:
 Can you reproduce that when using Morfologik directly? java -jar 
 morfologik-tools-1.9.0-standalone.jar fsa_build --help will give usage 
 instructions (same for 'fsa_dump).

I haven't got morfologik-tools-1.9.0-standalone.jar anywhere on my hard
disk. The closest match I could find is morfologik-tools.jar in the libs
subfolder of the LT 2.7 distribution. Here's a dump from my command line
window:

C:\Users\J\Desktop\LanguageTool-2.7\libsjava -jar
morfologik-tools-1.9.0-standalone.jar fsa_build --help
Error: Unable to access jarfile morfologik-tools-1.9.0-standalone.jar

C:\Users\J\Desktop\LanguageTool-2.7\libsjava -jar morfologik-tools.jar
fsa_build --help
Tools are unavailable, at least one JAR dependency missing.
Required JARs: morfologik-fsa-1.9.0.jar hppc-0.5.3.jar
morfologik-stemming-1.9.0.jar morfologik-polish-1.9.0.jar
commons-cli-1.2.jar commons-lang-2.6.jar


--
Meet PCI DSS 3.0 Compliance Requirements with EventLog Analyzer
Achieve PCI DSS 3.0 Compliant Status with Out-of-the-box PCI DSS Reports
Are you Audit-Ready for PCI DSS 3.0 Compliance? Download White paper
Comply to PCI DSS 3.0 Requirement 10 and 11.5 with EventLog Analyzer
http://p.sf.net/sfu/Zoho
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: switching from Hunspell to Morfologik

2014-10-11 Thread Jan Schreiber
Hi,

I tried creating an fsa dictionary with frequency information from my
German word list[1] as detailed on the Wiki[2].

The creation process worked alright apparently, but when I tried to dump
the binary dictionary to a list with the command

java -cp languagetool.jar org.languagetool.dev.DictionaryExporter de-DE.dict

I got the following error in LT 2.7:

Unhandled program error occurred.
Invoke with '--help' for help.
java.lang.IndexOutOfBoundsException
at java.nio.Buffer.checkBounds(Buffer.java:559)
at java.nio.HeapByteBuffer.put(HeapByteBuffer.java:181)
at
morfologik.stemming.DictionaryLookup.decodeBaseForm(DictionaryLookup.java:312)
at 
morfologik.stemming.DictionaryIterator.next(DictionaryIterator.java:95)
at 
morfologik.stemming.DictionaryIterator.next(DictionaryIterator.java:15)
at morfologik.tools.FSADumpTool.dump(FSADumpTool.java:171)
at morfologik.tools.FSADumpTool.go(FSADumpTool.java:75)
at morfologik.tools.Tool.go(Tool.java:45)
at morfologik.tools.FSADumpTool.main(FSADumpTool.java:285)
at 
org.languagetool.dev.DictionaryExporter.main(DictionaryExporter.java:41)

The binary dictionary with frequency data that I created can be found at
[3].

Best,
Jan

[1] http://sourceforge.net/projects/germandict/
[2] http://wiki.languagetool.org/hunspell-support
[3] http://sourceforge.net/projects/germandict/files/Morfologik/

Am 11.10.2014 12:00, schrieb Daniel Naber:
 Hi,
 
 to provide LT as a 100% pure Java software, I'd like to switch from 
 Hunspell (native code) to Morfologik (Java-based). For that, I think the 
 following languages are easy to switch:
 
  Asturian
  Galician
  Khmer
  Spanish
  Tagalog
  Esperanto
  Icelandic
 
 Does anybody see a problem with me switching those languages to 
 Morfologik? For Esperanto and Icelandic this would also have the benefit 
 that they can then offer suggestions for typos.
 
 Does anybody see a problem with that? Other languages (fr, de, sv, 
 pt-BR) might not be easy to switch, please see 
 https://github.com/languagetool-org/languagetool/issues/199 for details.
 
 Regards
   Daniel
 
 
 --
 Meet PCI DSS 3.0 Compliance Requirements with EventLog Analyzer
 Achieve PCI DSS 3.0 Compliant Status with Out-of-the-box PCI DSS Reports
 Are you Audit-Ready for PCI DSS 3.0 Compliance? Download White paper
 Comply to PCI DSS 3.0 Requirement 10 and 11.5 with EventLog Analyzer
 http://p.sf.net/sfu/Zoho
 ___
 Languagetool-devel mailing list
 Languagetool-devel@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/languagetool-devel
 

--
Meet PCI DSS 3.0 Compliance Requirements with EventLog Analyzer
Achieve PCI DSS 3.0 Compliant Status with Out-of-the-box PCI DSS Reports
Are you Audit-Ready for PCI DSS 3.0 Compliance? Download White paper
Comply to PCI DSS 3.0 Requirement 10 and 11.5 with EventLog Analyzer
http://p.sf.net/sfu/Zoho
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: switching from Hunspell to Morfologik

2014-10-11 Thread Jan Schreiber
Hi,

I wonder if we could use the switch to Morfologik as an opportunity to
rethink our general approach to dictionaries.

Currently we use two dictionaries for all the fully supported languages
afaik, and those contribute considerably to the large download size of
LanguageTool.

Why not use just one dictionary per language and keep all the necessary
data in one well organised place? This large word database could contain
everything we need: tags and base form for the grammar checking,
frequency information for the spelling suggestions, just about
everything. Even if we want the dictionary to contain incorrect/outdated
spellings for tagging purposes, all we need is a one-bit flag that tells
the spell-checking routine if a word is misspelled.

Cheers,
Jan


Am 11.10.2014 12:00, schrieb Daniel Naber:
 Hi,
 
 to provide LT as a 100% pure Java software, I'd like to switch from 
 Hunspell (native code) to Morfologik (Java-based).

--
Meet PCI DSS 3.0 Compliance Requirements with EventLog Analyzer
Achieve PCI DSS 3.0 Compliant Status with Out-of-the-box PCI DSS Reports
Are you Audit-Ready for PCI DSS 3.0 Compliance? Download White paper
Comply to PCI DSS 3.0 Requirement 10 and 11.5 with EventLog Analyzer
http://p.sf.net/sfu/Zoho
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Some thoughts on grammar Nazis, LanguageTool, and the world in itself

2014-05-19 Thread Jan Schreiber
Marcin wrote:
 This is perfect English and you could probably find Jane Austin or
 Charles Dickens using such constructions.

I think this little discussion reveals a fundamental problem of an Open
Source grammar checker, and perhaps of grammar checkers in general.

In the back of my head, I've thought this for years. The more popular LT
becomes over time, the more time we will probably have to spend fighting
off prescriptivist poppycock[1].

There are a lot of people around who are well-meaning and intelligent
and in some way interested in language and grammar, but are not trained
linguists. This type of person will just take some grammar book (or even
worse, a style manual) from their shelves and try to translate all the
rules they happen to find there into LT rules, and they are unable to
understand what the rule was originally intended to do. Eventually they
end up doing more harm than good.

The Hemingway app[2] is a good example of what I have in mind here. It
complains about adverbs and gives useless advice such as 1 adverbs. Aim
for 0 or fewer. The idea behind the rule is probably something like
don't use verb + adverb if the same thought can be expressed more
clearly by choosing a more descriptive verb, e.g. don't say
She moved quickly.
but rather
She ran.
The second sentence is easier to understand and expresses a fact more
accurately with less words, so it is in some way an improvement over the
first.

But it is by no means trivial to translate this idea to the XML
formalism, or (more generally speaking) to something a computer can
understand. Running around and telling people not to use adverbs is
certainly not the way to go, it is just confusing.

My first thought when Daniel published the new rule editor was, Let's
hope this tool will not bring us a bunch of grammar Nazis and nitpicky
smartasses.

These are just some thoughts I wanted to share. I'm not sure if I said
something useful, or if these are just smartass remarks themselves.

[1] http://languagelog.ldc.upenn.edu/nll/?cat=5
[2] http://languagelog.ldc.upenn.edu/nll/?p=10416

--
Accelerate Dev Cycles with Automated Cross-Browser Testing - For FREE
Instantly run your Selenium tests across 300+ browser/OS combos.
Get unparalleled scalability from the best Selenium testing platform available
Simple to use. Nothing to install. Get started now for free.
http://p.sf.net/sfu/SauceLabs
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Translations

2014-05-14 Thread Jan Schreiber
Here is a question (or several questions) from our Facebook page that I
cannot answer:

 Sébastien Kerguen: I recently finished the French translation of
 LanguageTool (link:
 https://www.transifex.com/pro.../p/languagetool/language/fr/).
 
 When will it be integrated to LanguageTool (talking about the Firefox
 addons page, the UI, community website..)?
 
 And who is in charge of merging translations? Thanks.




signature.asc
Description: OpenPGP digital signature
--
Accelerate Dev Cycles with Automated Cross-Browser Testing - For FREE
Instantly run your Selenium tests across 300+ browser/OS combos.
Get unparalleled scalability from the best Selenium testing platform available
Simple to use. Nothing to install. Get started now for free.
http://p.sf.net/sfu/SauceLabs___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: rule editor screencast - help needed

2014-05-11 Thread Jan Schreiber
I can view it with Chrome here.

Am 11.05.2014 13:09, schrieb Daniel Naber:
 Does this work for anybody with Chrome? I only now noticed that it 
 doesn't play with Chrome on my system (but it does with Firefox).



signature.asc
Description: OpenPGP digital signature
--
Is your legacy SCM system holding you back? Join Perforce May 7 to find out:
#149; 3 signs your SCM is hindering your productivity
#149; Requirements for releasing software faster
#149; Expert tips and advice for migrating your SCM now
http://p.sf.net/sfu/perforce___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: rule editor screencast - help needed

2014-05-10 Thread Jan Schreiber
Great! Could somebody who can use the LanguageTool voice on Facebook
post a link there? My Internet connection is painfully slow atm so I am
virtually unable to use Facebook or YouTube.

Am 10.05.2014 20:09, schrieb Daniel Naber:
 the video is now online at YouTube and linked at 
 http://community.languagetool.org/ruleEditor2/.



signature.asc
Description: OpenPGP digital signature
--
Is your legacy SCM system holding you back? Join Perforce May 7 to find out:
#149; 3 signs your SCM is hindering your productivity
#149; Requirements for releasing software faster
#149; Expert tips and advice for migrating your SCM now
http://p.sf.net/sfu/perforce___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: rule editor screencast - help needed

2014-05-08 Thread Jan Schreiber
Thanks, Ruud. Daniel, did you like the result?

Am 07.05.2014 22:25, schrieb R.J. Baars:
 It is in dropbox, shared with daniel now. Might take some time to shrink.
 



signature.asc
Description: OpenPGP digital signature
--
Is your legacy SCM system holding you back? Join Perforce May 7 to find out:
#149; 3 signs your SCM is hindering your productivity
#149; Requirements for releasing software faster
#149; Expert tips and advice for migrating your SCM now
http://p.sf.net/sfu/perforce___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: rule editor screencast - help needed

2014-05-07 Thread Jan Schreiber
Hi,

I uploaded a version to
https://drive.google.com/file/d/0B2khHHcwKI_hNHExbGtuQThVVGc

Somewhere near the end, video and audio are quite badly out of synch,
that's why I am not exactly happy with the result, but I guess this is
just about as good as it gets.

Also, the result is encoded in WMV (an absolutely useless, typical
Microsoft invention imho), because the tool I used will not save to a
decent format. Let me know if this is a problem.

--Jan


Am 05.05.2014 22:06, schrieb Daniel Naber:
 Hi,
 
 I've created a screencast for the rule editor, that is, I've written a
 script and I've made a screencast video. But I don't have a good
 microphone and with the microphone from my laptop the sound is horrible.
 So who has a proper microphone and would like to record 5 minutes of
 their voice?



signature.asc
Description: OpenPGP digital signature
--
Is your legacy SCM system holding you back? Join Perforce May 7 to find out:
#149; 3 signs your SCM is hindering your productivity
#149; Requirements for releasing software faster
#149; Expert tips and advice for migrating your SCM now
http://p.sf.net/sfu/perforce___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: rule editor screencast - help needed

2014-05-07 Thread Jan Schreiber
 Anybody here who knows how to fix that? The video is just a
 screencast, so it should be okay to slow it down or speed it up a
 bit?

The audio is here:
https://drive.google.com/file/d/0B2khHHcwKI_hcXMxOXNnRjJpYWs/edit?usp=sharing

If anybody here on the list can add this to the existing video in better
resolution, we're pretty much done I hope.
--Jan

Am 07.05.2014 20:36, schrieb Daniel Naber:
 On 2014-05-07 20:03, Jan Schreiber wrote:
 
 Somewhere near the end, video and audio are quite badly out of synch,
 that's why I am not exactly happy with the result, but I guess this is
 just about as good as it gets.
 
 Anybody here who knows how to fix that? The video is just a screencast, 
 so it should be okay to slow it down or speed it up a bit?
 
 Also, the result is encoded in WMV (an absolutely useless, typical
 Microsoft invention imho), because the tool I used will not save to a
 decent format. Let me know if this is a problem.
 
 Not sure if the format is a problem, but the resolution got worse so 
 it's more difficult to read the text on the screen.
 
 Regards
   Daniel
 
 
 --
 Is your legacy SCM system holding you back? Join Perforce May 7 to find out:
 #149; 3 signs your SCM is hindering your productivity
 #149; Requirements for releasing software faster
 #149; Expert tips and advice for migrating your SCM now
 http://p.sf.net/sfu/perforce
 ___
 Languagetool-devel mailing list
 Languagetool-devel@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/languagetool-devel
 



signature.asc
Description: OpenPGP digital signature
--
Is your legacy SCM system holding you back? Join Perforce May 7 to find out:
#149; 3 signs your SCM is hindering your productivity
#149; Requirements for releasing software faster
#149; Expert tips and advice for migrating your SCM now
http://p.sf.net/sfu/perforce___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: rule editor screencast - help needed

2014-05-06 Thread Jan Schreiber
I think I will have a go at that later today, even though I would prefer a 
native speaker for this kind of thing. --Jan

Daniel Naber daniel.na...@languagetool.org schrieb:

Hi,

I've created a screencast for the rule editor, that is, I've written a 
script and I've made a screencast video. But I don't have a good 
microphone and with the microphone from my laptop the sound is horrible. 
So who has a proper microphone and would like to record 5 minutes of 
their voice?

The video is at https://www.youtube.com/watch?v=LLeFr0wO3xU (no voice, 
needs HD to be readable)

I'll attach the script, but it's just a suggestion, you don't have to 
stick to it. Any suggestions on how to improve the script are also 
welcome (I have no experience with writing such stuff).

Regards
  Daniel

--
Is your legacy SCM system holding you back? Join Perforce May 7 to find out:
#149; 3 signs your SCM is hindering your productivity
#149; Requirements for releasing software faster
#149; Expert tips and advice for migrating your SCM now
http://p.sf.net/sfu/perforce
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel
--
Is your legacy SCM system holding you back? Join Perforce May 7 to find out:
#149; 3 signs your SCM is hindering your productivity
#149; Requirements for releasing software faster
#149; Expert tips and advice for migrating your SCM now
http://p.sf.net/sfu/perforce
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: rule editor screencast - help needed

2014-05-06 Thread Jan Schreiber
I uploaded what I have so far to
https://drive.google.com/file/d/0B2khHHcwKI_hcXMxOXNnRjJpYWs/edit?usp=sharing
I am not sure if it is totally terrible. As everybody knows, hearing
your own voice recorded is always a little strange.
If it is okay, I can start work on synching it with the video.
--Jan

Am 06.05.2014 11:58, schrieb Jan Schreiber:
 I think I will have a go at that later today, even though I would prefer a 
 native speaker for this kind of thing. --Jan
 
 Daniel Naber daniel.na...@languagetool.org schrieb:
 
 Hi,

 I've created a screencast for the rule editor, that is, I've written a 
 script and I've made a screencast video. But I don't have a good 
 microphone and with the microphone from my laptop the sound is horrible. 
 So who has a proper microphone and would like to record 5 minutes of 
 their voice?

 The video is at https://www.youtube.com/watch?v=LLeFr0wO3xU (no voice, 
 needs HD to be readable)

 I'll attach the script, but it's just a suggestion, you don't have to 
 stick to it. Any suggestions on how to improve the script are also 
 welcome (I have no experience with writing such stuff).

 Regards
  Daniel

 --
 Is your legacy SCM system holding you back? Join Perforce May 7 to find out:
 #149; 3 signs your SCM is hindering your productivity
 #149; Requirements for releasing software faster
 #149; Expert tips and advice for migrating your SCM now
 http://p.sf.net/sfu/perforce
 ___
 Languagetool-devel mailing list
 Languagetool-devel@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/languagetool-devel
 --
 Is your legacy SCM system holding you back? Join Perforce May 7 to find out:
 #149; 3 signs your SCM is hindering your productivity
 #149; Requirements for releasing software faster
 #149; Expert tips and advice for migrating your SCM now
 http://p.sf.net/sfu/perforce
 ___
 Languagetool-devel mailing list
 Languagetool-devel@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/languagetool-devel
 

--
Is your legacy SCM system holding you back? Join Perforce May 7 to find out:
#149; 3 signs your SCM is hindering your productivity
#149; Requirements for releasing software faster
#149; Expert tips and advice for migrating your SCM now
http://p.sf.net/sfu/perforce
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: rule editor screencast - help needed

2014-05-06 Thread Jan Schreiber
Do you still have the source video around somewhere? If  yes, can you
upload it somewhere so I can grab it?
TBH, my video editing abilities suck, but I'll do what I can. --Jan

Am 06.05.2014 22:36, schrieb Daniel Naber:
 On 2014-05-06 17:23, Jan Schreiber wrote:
 
 I uploaded what I have so far to
 https://drive.google.com/file/d/0B2khHHcwKI_hcXMxOXNnRjJpYWs/edit?usp=sharing
 
 Thanks, this is much better than what I could do. Do you also know how 
 to put the audio track on top of the video again? My first attempts with 
 ffmpeg have failed.
 
 Regards
   Daniel

--
Is your legacy SCM system holding you back? Join Perforce May 7 to find out:
#149; 3 signs your SCM is hindering your productivity
#149; Requirements for releasing software faster
#149; Expert tips and advice for migrating your SCM now
http://p.sf.net/sfu/perforce
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


AW: POS tag help in the new rule editor

2014-05-01 Thread Jan Schreiber
Looks great. Now let's cross fingers and hope that this will bring us a few 
more contributers.

Daniel Naber daniel.na...@languagetool.org schrieb:

Hi,

the new rule editor (http://community.languagetool.org/ruleEditor2/) now 
shows a POS tag help automatically when you enter a field that expects a 
POS tag or a POS tag regex - see attached screenshot.

So far, this works for English and German. To add your language, just 
add a mapping from POS tags (like NN) to readable names (like 
singular noun) to this file, just like it already exists for 'en' and 
'de':

https://github.com/languagetool-org/languagetool-community-website/blob/master/web-app/js/angular/posTagHelperServices.js

To actually test it, you'd need to check out that Grails project and 
make it work, but it's okay if you just change the file, I'll do the 
testing and deploy the changes. You cannot break anything, as the 
changes do not get deployed automatically. Please let me know if you 
have any questions.

Regards
  Daniel

--
Accelerate Dev Cycles with Automated Cross-Browser Testing - For FREE
Instantly run your Selenium tests across 300+ browser/OS combos.  Get 
unparalleled scalability from the best Selenium testing platform available.
Simple to use. Nothing to install. Get started now for free.
http://p.sf.net/sfu/SauceLabs
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel
--
Accelerate Dev Cycles with Automated Cross-Browser Testing - For FREE
Instantly run your Selenium tests across 300+ browser/OS combos.  Get 
unparalleled scalability from the best Selenium testing platform available.
Simple to use. Nothing to install. Get started now for free.
http://p.sf.net/sfu/SauceLabs
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Hunspell false positives [de]

2014-04-19 Thread Jan Schreiber
Dear list,

I have a huge list[1] of 70,000 (potential) false positives in the
latest de_DE_frami dictionary for Hunspell. Among other things, it is
based on the first 10,000 (or so) entries of a machine-generated list
Ruud Baars sent me a few months ago.

I think this is a valuable resource, but its size makes it difficult
to handle. In particular, there is no guarantee that there aren't some
true positives hidden somewhere, and I don't know how to filter out
words that are very rare and would only clutter the dictionary.

Any ideas how this can be made useful for the improvement of German
Hunspell support with reasonable effort?

-- Jan

[1] https://sourceforge.net/p/germandict/code/HEAD/tree/hunspell_words.txt

--
Learn Graph Databases - Download FREE O'Reilly Book
Graph Databases is the definitive new guide to graph databases and their
applications. Written by three acclaimed leaders in the field,
this first edition is now available. Download your free book today!
http://p.sf.net/sfu/NeoTech
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


AW: 2.3.1 SVN folder?

2013-10-11 Thread Jan Schreiber
The current version is always in Trunk. The tags are more like an archive of 
versions.

Marco A.G.Pinto marcoagpi...@mail.telepac.pt schrieb:

Hello!

I haven't been paying much attention to all that is going on.

Just a quick question:
Shall my grammar.xml and compounds.txt be added inside:
 C:\Users\marcoagpinto\Desktop\LanguageTool Repo GITHUB\tags\v2.3.1
and no longer in:
 C:\Users\marcoagpinto\Desktop\LanguageTool Repo GITHUB\trunk
?

Thanks!

Kind regards,
  Marco A.G.Pinto
---


-- 

--
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register 
http://pubads.g.doubleclick.net/gampad/clk?id=60134071iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel
--
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register 
http://pubads.g.doubleclick.net/gampad/clk?id=60134071iu=/4140/ostg.clktrk___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


AW: Show tags

2013-10-10 Thread Jan Schreiber
Syntax highlighting for tags in the GUI would be great. For those working with 
Notepad++, if somebody could come up with a user-defined highlighter scheme for 
it and share that, that would also be great.

Kumara Bhikkhu kumara.bhik...@gmail.com schrieb:


S I[I/PRP,B-NP-singular|E-NP-singular] love[love/VBP,B-VP]
everyone[everyone/NN:U, everyone/PRP,B-NP-singular|E-NP-singular]
I[I/PRP,B-NP-singular|E-NP-singular] talked[talk/VBD, talk/VBN,B-VP]
to[to/IN, to/TO,B-NP-singular|E-NP-singular].[./., /S,O]
Is it just me, or everyone else also finds this difficult to read. 
I'm thinking how can we make this easier to read. One tag analysis per line?

kb

--
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register 
http://pubads.g.doubleclick.net/gampad/clk?id=60134071iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel
--
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register 
http://pubads.g.doubleclick.net/gampad/clk?id=60134071iu=/4140/ostg.clktrk___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


AW: Spell checkers

2013-10-09 Thread Jan Schreiber
I am very interested if data for German is available! --Jan

R.J. Baars r.j.ba...@xs4all.nl schrieb:

Are there any people out there interested in new probable words for their
spell checkers and/or postagging files?

The data that I can create are:
- language
- word
- # of times found within correct words
- indication of likelyness for the language (based on character distribution)
- when desired, word context.

Anyone interested?

It is not for all languages yet, but if you ar interested and not
available yet, we can co-operate getting the data.

Ruud




--
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register 
http://pubads.g.doubleclick.net/gampad/clk?id=60134071iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel
--
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register 
http://pubads.g.doubleclick.net/gampad/clk?id=60134071iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: He tried not to laughs.

2013-10-06 Thread Jan Schreiber
Thank you for the heads-up, I am certainly one of the culprits here. I
hope I manage to get rid of this old habit. --Jan

Am 04.10.2013 16:22, schrieb gulp21:
 Anyway, mails you from always end up as below: all joined up.
 
 Marcin Miłkowski wrote thus at 04:04 PM 04-10-13:
 
 I’d guess that it is a UTF8-related problem with Eudora.
 
 And a general note (not related to this problem): When starting a
 new topic on the mailing list, please do *not* reply to an existing
 message and change the subject, but send a new mail to the list;
 othwerwise, threaded mail views become confusing (see [1]), since
 the reply is always associated with the message which is replied to
 (Reply-To header field), regardless of what the subject is.
 
 Regards Markus
 
 [1] 
 http://www.mail-archive.com/languagetool-devel@lists.sourceforge.net/index.html#02425

  
 --

 
October Webinars: Code for Performance
 Free Intel webinars can help you accelerate application
 performance. Explore tips for MPI, OpenMP, advanced profiling, and
 more. Get the most from the latest Intel processors and
 coprocessors. See abstracts and register  
 http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk

 
___
 Languagetool-devel mailing list 
 Languagetool-devel@lists.sourceforge.net 
 https://lists.sourceforge.net/lists/listinfo/languagetool-devel
 


--
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register 
http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Modules for individual supported languages?

2013-10-03 Thread Jan Schreiber
Somebody by the name of Łukasz Janik posted this to our Facebook wall:

prosze kazdy jezyk jako osobno

I don't speak a single word of Polish, but according to Google
Translator, this is a feature request to release single-language
versions of LT. (Google and I might be wrong here of course.) ;-)

I tend to agree with him. Given the fact that the vast majority of
people probably doesn't actively use more than three languages, we're
imposing a huge overhead on our users.

We've discussed this before, but I'm not sure what the outcome was. I
think the ideal solution would be if the users could configure the
languages they want before downloading. If that is not possible, there
should be a clean way to remove unwanted languages during or after
installation.

Maybe we could have a two-step download: In the first step, you download
the main app, perhaps with English already on board. During install, you
can choose whatever other languages you may need.

--Jan

--
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register 
http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


AW: About LT in Russian

2013-10-02 Thread Jan Schreiber
Very nice. Too bad I can't read Russian. :-/

Yakov Reztsov yakovr...@mail.ru schrieb:


Hi!
I wrote a little article about the Russian module of  LanguageTool in Russian .

http://www.altlinux.ru/fileadmin/news/10-conference-Kaluga-2013.pdf
page 79-82

-- 

Yakov Reztsov
--
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register 
http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel
--
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register 
http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


AW: Re: AW: Re: xml editor

2013-10-02 Thread Jan Schreiber
On-the-fly checking doesn't work me either, but the normal check does.

Kumara Bhikkhu kumara.bhik...@gmail.com schrieb:

Hey, thanks for the tip.

I've tried XML Tools for Notepad++. The auto-check seems to be buggy. 
It complains where there's no fault.

Jan Schreiber wrote thus at 10:44 PM 30-09-13:
My personal recommendation is PSPad with xmllint, a command-line 
validator for XML. Also, if you prefer Notepad++, there is an XML 
plugin for that too. Use the plugin manager to install it.
Running testrules.bat will also validate the XML against the DTD, 
which is quite helpful. --Jan


--
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register 
http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel
--
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register 
http://pubads.g.doubleclick.net/gampad/clk?id=60134791iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


AW: Re: xml editor

2013-09-30 Thread Jan Schreiber
My personal recommendation is PSPad with xmllint, a command-line validator for 
XML. Also, if you prefer Notepad++, there is an XML plugin for that too. Use 
the plugin manager to install it.
Running testrules.bat will also validate the XML against the DTD, which is 
quite helpful. --Jan

Kumara Bhikkhu kumara.bhik...@gmail.com schrieb:

I've tried. Too confusing for me. Thanks anyway.

Dmitri Gabinski wrote thus at 08:03 PM 28-09-13:


28.09.2013 10:04 пользователь 
Kumara Bhikkhu 
mailto:kumara.bhik...@gmail.comkumara.bhik...@gmail.com написал:

 
  Dmitri Gabinski wrote thus at 05:53 PM 27-09-13:
 
  Try jEdit with its XML plugin.
 
  How does this compare with Notepad++?

No idea as I don't use Notepad++. jEdit is made 
by coders for coders, its features are very rich 
and can be expanded by numerous plugins. As for XML specifically,...

  I'm thinking of an editor that can tell me if I missed out something,
  like an end tag.

jEdit with the XML plugin (and a couple of other 
plugins, it'll tell you what to install) can 
validate an XML file against a scheme.

Best regards,

Dmitri Gabinski

--
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register 
http://pubads.g.doubleclick.net/gampad/clk?id=60133471iu=/4140/ostg.clktrk
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel
--
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register 
http://pubads.g.doubleclick.net/gampad/clk?id=60133471iu=/4140/ostg.clktrk___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: Hunspell expansion to words list

2013-05-03 Thread Jan Schreiber
The problem with the compounds in Hunspell that Ruud described exists
for German as well. Just saying.

Am 03.05.2013 13:07, schrieb Ruud Baars:
 Hi.
 
 Finally I have a full keyborad, to elaborate a bit on the expansion issue.
 
 Spell checking is supposed signal any incorrect word. So most correct 
 words should be accepted.
 There are words in between though. Words that are technically correct, 
 but in everyday language use mocht commonly a mistake for a different word.
 
 Example for Dutch: si is one of the notes in do-re-mi-fa-sol-la-si-do. 
 So it is technically correct. But over 80% of the hits in Dutch 
 sentences it is a mistake for is. So it has intentionally been left out 
 of the correct words list, even though it is correct.
 
 When compounding is uses, some compounding parts will accidentally 
 combine into a word that is technically correct, but still most of the 
 time a mistake. Example: a muskaatnoot (nutmeg) is correct, but also 
 muskaatnood could easily be generated, since nood (emergency) is a 
 compounder too.
 
 No matter how carefully compounds have been selected, lots of nonsense 
 words have been reported as Hunspell suggestions since the Hunspell 
 dictionary for Dutch introduced compounding.
 
 Because of that, it is not a good base material for expansion. The one 
 being fabricated now, to be released the end of this year (hopefully, it 
 is 1 year leate then) could be better base material for expansion.
 
 Ruud

--
Get 100% visibility into Java/.NET code with AppDynamics Lite
It's a free troubleshooting tool designed for production
Get down to code-level detail for bottlenecks, with 2% overhead.
Download for free and get started troubleshooting in minutes.
http://p.sf.net/sfu/appdyn_d2d_ap2
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: Javascript-based online check - testing needed

2013-01-08 Thread Jan Schreiber
Am 06.01.2013 23:38, schrieb Daniel Naber:
 Could someone test with MSIE?
I did, seems to work so far. On my netbook, the font size in IE is VERY 
small however. I had to increase it manually to make the text readable. 
On a larger screen this is probably not a problem.
Also, the wavy underlines don't look wavy at all (as they do in Google 
Chrome), but rather like dashed lines.

--Jan

--
Master SQL Server Development, Administration, T-SQL, SSAS, SSIS, SSRS
and more. Get SQL Server skills now (including 2012) with LearnDevNow -
200+ hours of step-by-step video tutorials by Microsoft MVPs and experts.
SALE $99.99 this month only - learn more at:
http://p.sf.net/sfu/learnmore_122512
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] public relations tasks

2013-01-06 Thread Jan Schreiber
Hi everybody,

I wonder if we should link to our Facebook profile
http://www.facebook.com/LanguageTool directly from the landing page
http://www.languagetool.org. Below the Twitter icon on the left would
probably be a good place for that.

As a sort of status report, there is almost no user activity worth
mentioning taking place on our Facebook page. We have twelve likes so
far, most of which are coming from project members.

Best,
Jan

Am 29.07.2012 22:50, schrieb Daniel Naber:
 Hi,
 
 who is interested in doing some public relation tasks for LT? Here are some 
 tasks I can think of (feel free to add your own ideas):
 
 * Maintain a Facebook and Google+ homepage for LT. This includes posting 
 news, answering user questions etc. 
 
 * Create a funny introductory video clip that shows what LT can do for you. 
 Not just a boring screencast but a video that shows embarrassing mistakes 
 and how LT detects them.
 
 * Find a professional homepage designer and plan a re-design of 
 languagetool.org.
 
 Let me know if you would like to work on a task (I think the 
 Facebook/Google+ tasks are most fitting for an experienced LT committer).
 
 Regards
  Daniel
 

--
Master Visual Studio, SharePoint, SQL, ASP.NET, C# 2012, HTML5, CSS,
MVC, Windows 8 Apps, JavaScript and much more. Keep your skills current
with LearnDevNow - 3,200 step-by-step video tutorials by Microsoft
MVPs and experts. ON SALE this month only -- learn more at:
http://p.sf.net/sfu/learnmore_123012
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


New editors page on languagetool.wikidot.com

2012-12-31 Thread Jan Schreiber
Dear list,

I've had a first go at a page[1] on the Wiki that will hopefully evolve 
into a list of recommended editors over time. Feel free to add your 
favorite editor(s) there.

--Jan

[1] http://languagetool.wikidot.com/xml-editors

--
Master Visual Studio, SharePoint, SQL, ASP.NET, C# 2012, HTML5, CSS,
MVC, Windows 8 Apps, JavaScript and much more. Keep your skills current
with LearnDevNow - 3,200 step-by-step video tutorials by Microsoft
MVPs and experts. SALE $99.99 this month only -- learn more at:
http://p.sf.net/sfu/learnmore_122412
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] Performance

2012-11-23 Thread Jan Schreiber
OK, thanks for clearing this up. I've been concerned about performance
issues ever since I started adding large disjunctions (in regular
expressions) to the rules.

I have virtually no background in computer science, so I am often not
aware of the computational costs that the things I do in the rules might
have. I guess I better leave that part to the developers and go on as
usual. --Jan

Am 22.11.2012 16:56, schrieb Daniel Naber:
 Turning off rules by default means that basically nobody will ever turn 
 them on again. As Marcin said: performance is an issue, but it's not a big 
 issue. We need to be aware of it and try to improve it, but if people stop 
 writing powerful rules because they think it costs performance, we're going 
 in the wrong direction.

--
Monitor your physical, virtual and cloud infrastructure from a single
web console. Get in-depth insight into apps, servers, databases, vmware,
SAP, cloud infrastructure, etc. Download 30-day Free Trial.
Pricing starts from $795 for 25 servers or applications!
http://p.sf.net/sfu/zoho_dev2dev_nov
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] Performance

2012-11-21 Thread Jan Schreiber
Ruud Baars wrote:
 Suppose there is a slow/expensive rule, that almost never matches to warn 
 about some low-priority issue; maybe it is worth switching it off then.

That looks like a good idea to me. We could turn off some rules by
default that are very unlikely to return an important true positive.

Viewing things from the German point of view, we could probably default
many of the capitalization rules to off. My experience as a teacher
indicates that hardly anyone has problems with the general rule that
nouns have to be uppercased in German. There are many, many difficult
cases, but I wouldn't expect (e.g.) the rule that detects ich mache mir
sorgen as an error to have a high number of matches.

Maybe we could run a test on Wikipedia data and disable those rules by
default that have virtually no true positives. We could offer something
like a fast mode and a thorough mode in the GUI. – Something that
strikes me as counter-intuitive in that approach, however, is that the
fast mode would overlook the most obvious errors if we take that route.

Just brainstorming, never mind: As a general approach, maybe we could
ask the users for some kind of self-assessment when they start the
application for the first time, like this:

Are you
- a very bad speller,
- a bad speller,
- a good speller, etc.

For the good spellers, we would do the whole thing, and I think they
will accept some delay.

--Jan

--
Monitor your physical, virtual and cloud infrastructure from a single
web console. Get in-depth insight into apps, servers, databases, vmware,
SAP, cloud infrastructure, etc. Download 30-day Free Trial.
Pricing starts from $795 for 25 servers or applications!
http://p.sf.net/sfu/zoho_dev2dev_nov
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] Firefox extension

2012-11-14 Thread Jan Schreiber
 We now have an SSL certificate (i.e. we support https) and I did
 some load testing today which went okay. Assuming the extension
 will not get hundreds of thousands of users in a short
 time, everything should be fine with the server we have today.

Good news. Thanks for testing, Daniel! --Jan


Am 14.11.2012 00:07, schrieb Daniel Naber:
 On 09.11.2012, 17:16:41 Daniel Naber wrote:
 
 What 
 kind of server capacity would be needed behind that.
 I was thinking 'cloud-like'.

 We simply don't know because it hasn't been tested yet. It also depends
 on  how popular the extension will become.
 
 We now have an SSL certificate (i.e. we support https) and I did some load 
 testing today which went okay. Assuming the extension will not get hundreds 
 of thousands of users in a short time, everything should be fine with the 
 server we have today.
 
 If the server process crashes, it will need to be restarted manually. At 
 least I have an automatic notification about the server being down.
 
 Regards
  Daniel
 

--
Monitor your physical, virtual and cloud infrastructure from a single
web console. Get in-depth insight into apps, servers, databases, vmware,
SAP, cloud infrastructure, etc. Download 30-day Free Trial.
Pricing starts from $795 for 25 servers or applications!
http://p.sf.net/sfu/zoho_dev2dev_nov
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] vote about logo proposals

2012-10-05 Thread Jan Schreiber
Am 05.10.2012 20:37, schrieb gulp21:
 I know it's still on the download buttons on our homepage, but
 I will fix that tomorrow or so unless someone beats me to it.

 In my opinion, the blue squiggle does not look that nice on the yellow
 background. Here[1] you can see how it looks like with a blue
 background.

Blue on blue looks like a bad idea to me. Personally, I like the colors, 
but there are many people out there who won't be able to see the 
squiggles, due to impaired color vision. Does the orange/blue version 
really look that bad to everybody? Just asking. Never mind me. --Jan

--
Don't let slow site performance ruin your business. Deploy New Relic APM
Deploy New Relic app performance management and know exactly
what is happening inside your Ruby, Python, PHP, Java, and .NET app
Try New Relic at no cost today and get our sweet Data Nerd shirt too!
http://p.sf.net/sfu/newrelic-dev2dev
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] vote about logo proposals

2012-09-23 Thread Jan Schreiber
I like the new icon. It is similar to the old one and pretty 
straightforward and simple, but still a vast improvement.

Am 23.09.2012 12:34, schrieb Daniel Naber:
 I have already set a new favicon on http://www.languagetool.org.

--
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://ad.doubleclick.net/clk;258768047;13503038;j?
http://info.appdynamics.com/FreeJavaPerformanceDownload.html
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] vote about logo proposals

2012-09-19 Thread Jan Schreiber
Hard to decide. I like all of those.

Am 18.09.2012 23:06, schrieb Daniel Naber:
 As nobody is really happy yet with the new logo, let's take a step back and
 look at others logos. What I posted was only a subset of what I got from
 our designer.

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] logo design

2012-08-24 Thread Jan Schreiber
Am 24.08.2012 18:17, schrieb Daniel Naber:
 I found a professional logo designer who's going to work on a LanguageTool
 logo next week. If you have any ideas how the logo should look like / what
 it should contain, please let me know and I'll forward it to the designer.

What came to my mind first was something vaguely similar to the 
Wikipedia logo: Letters or words from several alphabets (Greek, 
Cyrillic, Latin, Arabic ...), arranged in some way that emphasizes both 
their differences and their similarities.

Maybe the word for grammar or language (or LanguageTool?!) in some of 
the languages we support, arranged on the three visible sides of a cube, 
something along these lines ...

The problem with the above idea of translating the name of the software 
to several languages is probably that it does not translate naturally in 
most languages. Sprachwerkzeug (a natural German translation) sounds 
weird. The good news is that we have an international team of 
contributors here that can help judging on the soundness of suggestions.

I'm just brainstorming here ... --Jan

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] Using Subversion + Ant at checkout

2012-07-21 Thread Jan Schreiber
Am 21.07.2012 17:13, schrieb Ruud Baars:
 On Windows you could try TurtoiseSVN ..
 
I strongly second that. Get it from
http://tortoisesvn.net

It is basically an extension for the Windows Explorer that makes
interaction with the SVN repository much easier and more intuitive,
because most commands are available in the Explorer context menu if
applicable. For example, just right-click a folder and choose update
from repository. --Jan

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] including morfologik-tools?

2012-07-05 Thread Jan Schreiber
I agree that this would be useful.

Am 04.07.2012 21:12, schrieb Daniel Naber:
 Shouldn't we
 just include it in LT SVN (not in the *.zip) - this would make life easier
 for everyone I think.

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] Wordlists for spellers?

2012-06-23 Thread Jan Schreiber
I'm fine with that, if it works fast enough. Hunspell is no doubt the 
best free spell checker around for German.

Am 22.06.2012 22:31, schrieb Daniel Naber:
 For now, I prefer the HunspellNoSuggest solution for German.

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] Wordlists for spellers?

2012-06-22 Thread Jan Schreiber
The wordlist was primarily intended as a replacement aspell dictionary, 
it is about four to five times larger than the official one, hence less 
annoying/confusing false positives.

There are Swiss and Austrian Versions in the binary download here:
http://sourceforge.net/projects/germandict/files/aspell_dict.7z/download
You can dump them with the the aspell dump command.

Since the majority of words is the same, I wonder if we should store the 
files similarly to the way aspell does it, i.e. one base dictionary plus 
three variants.

I don't have regular Internet access at home atm, sorry in advance if it 
takes me two days to respond to future e-mails.

Am 22.06.2012 17:46, schrieb Marcin Miłkowski:
 I already converted aspell-de-de to a wordlist, but your wordlist is
 definitely better suited for this task. However, we need such lists for
 Austrian German and Swiss German as well, and aspell is probably not so
 good...

 I will upload a dictionary based on your dictionary for everybody to
 see. It works quite fast here.

 Marcin

 W dniu 2012-06-22 17:18, Jan Schreiber pisze:
 We could try my wordlist (1.3  million word forms) for German if that
 speeds up the process:
 http://sourceforge.net/projects/germandict/

 --Jan

 Am 22.06.2012 11:09, schrieb Marcin Miłkowski:
 Hi all,

 considering that hunspell is so slow and that for many languages, we
 already have wordlists (for example, there are wordlists for all
 variants of English, and all aspell dictionaries may be converted to
 word lists), we might replace all hunspell dictionaries with
 morfologik-speller dictionaries.

 For English, it should be almost trivial. I don't know about German -
 maybe it would be a satisfactory approximation to use aspell dump
 command (for a simple explanation, see
 http://typethinker.blogspot.com/2008/02/fun-with-aspell-word-lists.html). 
 Note
 that in the current version, the speller won't find good suggestions for
 umlauts if too many were omitted.

 Regards,
 Marcin

 --
 Live Security Virtual Conference
 Exclusive live event will cover all the ways today's security and
 threat landscape has changed and how IT managers can respond. Discussions
 will include endpoint security, mobile security and the latest in malware
 threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
 ___
 Languagetool-devel mailing list
 Languagetool-devel@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/languagetool-devel


 --
 Live Security Virtual Conference
 Exclusive live event will cover all the ways today's security and
 threat landscape has changed and how IT managers can respond. Discussions
 will include endpoint security, mobile security and the latest in malware
 threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
 ___
 Languagetool-devel mailing list
 Languagetool-devel@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/languagetool-devel


 --
 Live Security Virtual Conference
 Exclusive live event will cover all the ways today's security and
 threat landscape has changed and how IT managers can respond. Discussions
 will include endpoint security, mobile security and the latest in malware
 threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
 ___
 Languagetool-devel mailing list
 Languagetool-devel@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/languagetool-devel

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] Spaces in file path

2012-06-13 Thread Jan Schreiber
I confirm this as fixed now.

Am 07.06.2012 11:20, schrieb Marcin Miłkowski:
 W dniu 2012-06-07 09:18, Jan Schreiber pisze:
 Using yesterday's developer build of LanguageTool, I get the following
 error message:
  java.io.FileNotFoundException: The dictionary files [...] could
  not be read
 if the path to LT contains spaces. Without spaces, it works smoothly.
 
 Please try again. The previous version had a stupid hack, now I replaced 
 with something more stable, converting first URL to URI, and then 
 instantiating a File object, which returns a proper absolute path.

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] online rule creator

2012-06-03 Thread Jan Schreiber
Excellent. 5000 articles is enough for the time being. Hopefully this
will encourage some contributors from outside.

Am 03.06.2012 01:15, schrieb Daniel Naber:
 On Mittwoch, 30. Mai 2012, Daniel Naber wrote:
 
 New feature: each rule is now automatically checked against a subset of 
 Wikipedia (about 5000 documents per language). This should help to avoid 
 false alarms. We will hopefully move to a more powerful server soon so a 
 much larger subset can be checked.

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] LibreOffice 3.5.4 released

2012-05-31 Thread Jan Schreiber
LanguageTool works for me as LibO 3.5.3 extension under Win XP SP3.
After experiencing problems at first, I went through the following
(admittedly VERY cumbersome) steps, and finally it worked:

- Uninstall LibO
- Uninstall any existing versions of Java (I happened to have several)
- Delete any leftovers from the old LanguageTool install
  (actually, I deleted everything under %APPDATA%\LibreOffice\3\
  user\extensions)

- Reboot
- Freshly install Java 7
- Freshly install LibO
- Reinstall LanguageTool

The new setup has been running smoothly for ~6 weeks or something now.
YMMV, of course.

Am 31.05.2012 16:18, schrieb gulp21:
 Does anyone know whether the Could not create Java implementation 
 loader problem is solved in 3.5.4? A few days ago, I tried to install 
 LT in LibO 3.5.3 on a computer of a friend of mine (running XP SP3, 
 latest version of Java), and that error message was shown. The possible 
 solution mentioned on http://languagetool.org/issues/ did not work.
 

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] online rule creator

2012-05-28 Thread Jan Schreiber
A very good idea. However, I find it a bit unclear where exactly the
suggestion is taken from: From the corrected sentence, or from the
user's error message.

Error Message with correction in single quotation marks would probably
make it a bit clearer, although it is not exactly elegant.

Am 28.05.2012 22:18, schrieb Daniel Naber:
 Hi,
 
 I have uploaded a rule creator at
 
 http://community.languagetool.org/ruleEditor
 

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] How to use the url feature with LanguageTool from command line?

2012-05-27 Thread Jan Schreiber
  However, when I use LanguageTool in the command line, I don't see
  anything about the URL information:
For all I know, the URL is displayed in LibreOffice only.

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] Configuration redesign

2012-05-23 Thread Jan Schreiber
 By the way, just because collapsible checkboxes are hard to implement in 
 Swing, maybe we should have just a list of categories in the first 
 dialog, and open another dialog with a list of rules in the category chosen?

I agree with Daniel that that sounds suspiciously like too many clicks,
but otoh the 1300+ German rules are very difficult to navigate through
in the GUI without folding. The French ruleset is even larger. Any
improvement in this area is very welcome. --Jan

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] [LanguageTool] SF.net SVN: languagetool:[6896] ...

2012-05-19 Thread Jan Schreiber
Not a bug, but a (major) annoyance for me: When grammar.xml is displayed
in a browser, the new external resources aren't embedded:
http://languagetool.svn.sourceforge.net/viewvc/languagetool/trunk/JLanguageTool/src/rules/de/grammar.xml
(Tippfehler is gone.)

I often use the browser view to make sure things are displayed as
intended, to get an overview etc.

Can this be solved via JavaScript/css ... somehow?

Daniel Naber schrieb:
 On Freitag, 18. Mai 2012, Jan Schreiber wrote:
 
 Am I right that this method won't work out of the box, but requires
 changes to the Java code?
 
 Yes. I have just committed this, moving only the typo rules to their own 
 file as a test. Does everything work for you? Java code will not need to be 
 touched when we add new external files.
 
 Regards
  Daniel
 

-- 
Jan Schreiber, M.A.
Universität Duisburg-Essen
Fachbereich Geisteswissenschaften
Fach Philosophie
D-45117 Essen

mailto:jan.schrei...@uni-due.de
http://www.uni-due.de/~gph120/
OpenPGP-Schlüssel:
http://www.uni-due.de/~gph120/diverses/0x06C970E5%20pub.asc
Fingerprint: 57DD DD97 99C6 09E7 8B41 7DAD 63A3 1F42 06C9 70E5

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] [LanguageTool] SF.net SVN: languagetool:[6896] ...

2012-05-19 Thread Jan Schreiber
I just found this:
http://www.xml.com/pub/a/2001/03/14/trxml10.html

It looks as if it uses an XML-internal method. I'll try it and report
back how it works.

I wrote:
 Not a bug, but a (major) annoyance for me: When grammar.xml is displayed
 in a browser, the new external resources aren't embedded:
 http://languagetool.svn.sourceforge.net/viewvc/languagetool/trunk/JLanguageTool/src/rules/de/grammar.xml
 (Tippfehler is gone.)
 
 I often use the browser view to make sure things are displayed as
 intended, to get an overview etc.
 
 Can this be solved via JavaScript/css ... somehow?
 [...]


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] [LanguageTool] SF.net SVN: languagetool:[6896] ...

2012-05-19 Thread Jan Schreiber
I got fairly close using
!ENTITY sonstiges SYSTEM ./sonstiges.xml
It displays as expected in IE, but not in Firefox, and it doesn't pass
testrules.bat.

Daniel Naber schrieb:
 On Samstag, 19. Mai 2012, Jan Schreiber wrote:
 
 Not a bug, but a (major) annoyance for me: When grammar.xml is displayed
 in a browser, the new external resources aren't embedded:
 http://languagetool.svn.sourceforge.net/viewvc/languagetool/trunk/JLangu
 ageTool/src/rules/de/grammar.xml (Tippfehler is gone.)
 
 Indeed, it's described here:
 http://stackoverflow.com/questions/1512747/will-firefox-do-xslt-on-external-entities
 
 Plus, we need a special classpath: path to make it work under both 
 developer and 
 deployed environment. One workaround is to use an XSLT processor like this:
 
 xsltproc src/rules/print.xsl src/rules/de/grammar.xml
 
 It will expand the external entities and print out HTML (classpath: url 
 needs to be 
 changed before). I'll see if I can find a better solution.
 


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] [LanguageTool] SF.net SVN: languagetool:[6896] ...

2012-05-18 Thread Jan Schreiber
Am I right that this method won't work out of the box, but requires
changes to the Java code? I tried it but the external files failed to load.
The only worry I have is that changes to the categories (like splitting
a category etc.) will be more complicated.

Daniel Naber wrote:
 grammar.xml contains only the existing entity declarations and this:
 
 !ENTITY typos SYSTEM classpath:/rules/de/typos.xml
 !ENTITY typos SYSTEM classpath:/rules/de/grammar.xml
 
 rules
 
  typos;
  grammar;
  (etc)
 
 /rules
 
 That means that the actual rules are included from types.xml etc. The rules 
 can still use the entities, without duplicating them. Does that sound okay 
 to you?

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] [LanguageTool] SF.net SVN: languagetool:[6896] ...

2012-05-17 Thread Jan Schreiber
If I understand this properly, it is exactly what I have been looking
for. It would enable us to farm out those huge and cumbersome regExes to
another file until we have a method in place that allows us to get along
without them entirely.

Daniel Naber schrieb:
 Here's how the file could be split up:
 
 grammar.xml contains only the existing entity declarations and this:
 
 !ENTITY typos SYSTEM classpath:/rules/de/typos.xml
 !ENTITY typos SYSTEM classpath:/rules/de/grammar.xml
 
 rules


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] Any advice?

2012-05-16 Thread Jan Schreiber
 can respond. Discussions 
 will include endpoint security, mobile security and the latest in malware 
 threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
 ___
 Languagetool-devel mailing list
 Languagetool-devel@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/languagetool-devel
 
 

-- 
Jan Schreiber, M.A.
Universität Duisburg-Essen
Fachbereich Geisteswissenschaften
Fach Philosophie
D-45117 Essen

mailto:jan.schrei...@uni-due.de
http://www.uni-due.de/~gph120/
OpenPGP-Schlüssel:
http://www.uni-due.de/~gph120/diverses/0x06C970E5%20pub.asc
Fingerprint: 57DD DD97 99C6 09E7 8B41 7DAD 63A3 1F42 06C9 70E5

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] [LanguageTool] SF.net SVN: languagetool:[6896] ...

2012-05-16 Thread Jan Schreiber
Marcin:
 Classifying some of the words semantically might be really useful for 
 some rules.

Indeed, I could not agree more. The most difficult part would be coming
up with the semantic categories in a way that is not completely ad hoc.
Everyone who has ever used a public library is probably aware of the
fact that categorizing things in a comprehensible way is anything but
trivial. Our own categories in the grammar files are cases in point.

We could use the categories from Daniel's OpenThesaurus as a starting
point for German, but they are quite unsatisfactory. Maybe the tag
clouds used in some public bookmarking systems such as the former
delicio.us can be used as a source, at least as a source of inspiration. --J


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] Suggesting the infinitive of a verb in a rule

2012-05-11 Thread Jan Schreiber
Thanks for the response. Is the method supposed to work for German? I
tried to do it as you described, but no use. --Jan

Marcin Miłkowski schrieb:
 W dniu 2012-05-11 09:41, Jan Schreiber pisze:
 Dear all,

 I'm currently struggling with a replacement rule. In the following example:
  example type=incorrectDas hat ja somarkerkommen
  gemusst/marker./example
 I would like to suggest the infinitive of the verb gemusst, i.e.
 müssen, like so:
  message[Do you mean]suggestionkommen müssensuggestion?

 The trouble is that I have a list of five or so verbs that is difficult
 to impossible to cover with a single regular expression:

 TokenReplacement
 ---
 gemusst  müssen
 gewollt  wollen
 gesehen  sehen
 ...

 Is there a method to do this that I'm unaware of?
 
 Yes, you can replace the POS tag, and the POS tag will specify the form 
 of the verb.
 
 It would be roughly something like this:
 
 suggestionmatch no=1 postag=pattern-for-original-postag 
 postag_regexp=yes postag_replace=pattern-for-infitive//suggestion
 
 You simply do regular expression replacement from the original POS tag 
 to the target one.
 
 Regards,
 Marcin


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] Suggesting the infinitive of a verb in a rule

2012-05-11 Thread Jan Schreiber
@ all: Thanks for the replies.

@ Daniel, Markus:
Now what? So far only one out of some 1,200 German rules needs this
feature. Is it worth the effort? OTOH it doesn't look like much work.

gulp21 schrieb:
 Thanks for the response. Is the method supposed to work for German? I
 tried to do it as you described, but no use. --Jan
 
 According to http://languagetool.wikidot.com/tips-and-tricks, it only 
 works for English, Polish, and Dutch.
 
   For some languages - currently English, Polish, and Dutch - it's 
 possible to inflect the words being suggested.
 
 Regards
 Markus


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] getting rid of mark_from and mark_to

2012-05-11 Thread Jan Schreiber
Looks like an excellent idea, getting this mark_(.+) wrong is one of
my favorite errors.

Daniel Naber wrote:
 I have a local patch here that allow settings markers without the mark_from 
 and mark_to attributes. I find those difficult to understand and it's easy to 
 get them wrong. With this patch, patterns would look like this, using the 
 marker element to replace mark_from/to:
 
 pattern case_sensitive=no
   token postag=SENT_START/
   tokenSehr/token
   marker
   token regexp=yesgeehrte/token
   /marker
   tokenHerr/token
 /pattern
 
 What do you think? The old way of specifying mark_from/to will still work, 
 also because I don't know yet how to migrate rules automatically. Any XSLT 
 wizard around who knows whether this is possible?
 
 Regards
  Daniel


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] Improve the web demo of LanguageTool?

2012-01-27 Thread Jan Schreiber
 btw, is there a reason why padding-left and padding-right are set to 3px 
 for the class errorMarker. imho the extra space between the words looks 
 a bit strange.
 
I agree that the extra space is a bit large. Maybe we should change it
to 1 px and see how that looks?

--
Try before you buy = See our experts in action!
The most comprehensive online learning library for Microsoft developers
is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
Metro Style Apps, more. Free future releases when you subscribe now!
http://p.sf.net/sfu/learndevnow-dev2
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] Improve the web demo of LanguageTool?

2012-01-23 Thread Jan Schreiber
Would anyone mind if I upload a new version of the file squiggle.png
(used for the squiggly underlines in the demo) with its color changed to
blue rather than red?

In my opinion the red color doesn't fit well into the yellow, orange and
brown overall design of the website, and blue or green are the colors
most commonly used for grammar checking, as opposed to spell checking.

--Jan




--
Try before you buy = See our experts in action!
The most comprehensive online learning library for Microsoft developers
is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
Metro Style Apps, more. Free future releases when you subscribe now!
http://p.sf.net/sfu/learndevnow-dev2
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


Re: [Languagetool] Possible way of speeding up LanguageTool

2012-01-14 Thread Jan Schreiber
Dominique Pellé wrote:
 The slowest startup times are for Chinese, German
 and Ukrainian. It would be interesting to find why.

That's true.

Some of my rules in the German file do checks against huge regular
expressions, which is probably quite inelegant. This might be a reason.

--
RSA(R) Conference 2012
Mar 27 - Feb 2
Save $400 by Jan. 27
Register now!
http://p.sf.net/sfu/rsa-sfdev2dev2
___
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel