>> Hi,
>>
>> the load test I did for our HTTPS service showed that we have kind of a
>> performance problem for languages that have a lot of rules. Testing a
>> random German text with 125 sentences takes 2 seconds on my machine at
>> average. About half of that time is spent in rule matching, i.e. in
>> PatternRuleMatcher.match().
>>
>> What can be done about this?
>>
>> -More micro-optimization of the inner loop of pattern matching. I think
>> there's not much potential in that, but I'd love to be proven wrong.
>>
>> -Make the checking process work in parallel to better use multiple
>> cores.
>>
>> -Rewrite the pattern matching to use a finite state machine. I think
>> this
>> could improve performance a lot, if we create one state machine that
>> includes all rules of a language. This state machine would then work on
>> a
>> per-sentence basis.
>>
> Instead af checking a text completely, check gradually; active sentence
> first, then back and forth (client code)
>
> Could rules be 'compiled' instead of interpreted?
>
> Is there a significant difference between direct word matching, postag
> matching and regexp matching? Would it help changing some postag-rules
> into word-match rules for the most common words?
>
> Could rules be ordered by biggest chance to hit first?
>
> Could rules be structured in if-then-else-like structures? (Mutually
> excluding trules e.g.)
>
> Is there time to be won in searching the postags for words?
>
Is part of the time communication time? (client-server-client?) Anyt
option to speed that up (compression/dropping spaces?)
>>
>> Any work in that area is very welcome...
>>
>> Regards
>>  Daniel
>>
>> --
>> http://www.danielnaber.de
>>
>>
>> ------------------------------------------------------------------------------
>> Monitor your physical, virtual and cloud infrastructure from a single
>> web console. Get in-depth insight into apps, servers, databases, vmware,
>> SAP, cloud infrastructure, etc. Download 30-day Free Trial.
>> Pricing starts from $795 for 25 servers or applications!
>> http://p.sf.net/sfu/zoho_dev2dev_nov
>> _______________________________________________
>> Languagetool-devel mailing list
>> Languagetool-devel@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/languagetool-devel
>>
>
>
>
> ------------------------------------------------------------------------------
> Monitor your physical, virtual and cloud infrastructure from a single
> web console. Get in-depth insight into apps, servers, databases, vmware,
> SAP, cloud infrastructure, etc. Download 30-day Free Trial.
> Pricing starts from $795 for 25 servers or applications!
> http://p.sf.net/sfu/zoho_dev2dev_nov
> _______________________________________________
> Languagetool-devel mailing list
> Languagetool-devel@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/languagetool-devel
>



------------------------------------------------------------------------------
Monitor your physical, virtual and cloud infrastructure from a single
web console. Get in-depth insight into apps, servers, databases, vmware,
SAP, cloud infrastructure, etc. Download 30-day Free Trial.
Pricing starts from $795 for 25 servers or applications!
http://p.sf.net/sfu/zoho_dev2dev_nov
_______________________________________________
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel

Reply via email to