Re: [CLucene-dev] Indexing a document

2011-11-30 Thread Emerson Espínola
I included only the CLucene.h.

[]'s
Emerson de Lira Espínola
** emersonespin...@gmail.com
https://profiles.google.com/emersonespinola/buzz?hl=pt-BR
http://www.quora.com/emersonespinola
http://www.facebook.com/emersonespinola
http://www.linkedin.com/in/emersonespinola
http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com
http://emersonespinola.blogspot.com http://twitter.com/emersonespinola
http://www.myebook.com/emersonespinola/

 http://www.myebook.com/emersonespinola/


2011/11/30 Veit Jahns nuncupa...@googlemail.com

 2011/11/30 Emerson Espínola emersonespin...@gmail.com
 
  Here my code goes attached.

 The code looks good.

  If it was a missing include I think it wouldn't even compile, and I also
 used the namespace macros.

 I didn't mean the includes in CLucene, but the includes in your
 application. Did you only include the main CLucene.h? If so, did you
 added the BrazilianAnalyzer.h? Or did you include the
 BrazilianAnalyzer.h directly?

 Veit


 --
 All the data continuously generated in your IT infrastructure
 contains a definitive record of customers, application performance,
 security threats, fraudulent activity, and more. Splunk takes this
 data and makes sense of it. IT sense. And common sense.
 http://p.sf.net/sfu/splunk-novd2d
 ___
 CLucene-developers mailing list
 CLucene-developers@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/clucene-developers

--
All the data continuously generated in your IT infrastructure 
contains a definitive record of customers, application performance, 
security threats, fraudulent activity, and more. Splunk takes this 
data and makes sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-novd2d___
CLucene-developers mailing list
CLucene-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/clucene-developers


Re: [CLucene-dev] Indexing a document

2011-11-22 Thread Emerson Espínola
Has anyone seen these classes before?
What would be another alternative to port them?

[]'s
Emerson de Lira Espínola
** emersonespin...@gmail.com
https://profiles.google.com/emersonespinola/buzz?hl=pt-BR
http://www.quora.com/emersonespinola
http://www.facebook.com/emersonespinola
http://www.linkedin.com/in/emersonespinola
http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com
http://emersonespinola.blogspot.com http://twitter.com/emersonespinola
http://www.myebook.com/emersonespinola/

 http://www.myebook.com/emersonespinola/


2011/11/19 Emerson Espinola emersonespin...@gmail.com

 I don't know. I'm not familiar with those classes. I'm trying to port them.

 Enviado via iPhone

 Em 19/11/2011, às 13:03, Itamar Syn-Hershko ita...@code972.com escreveu:

 ReaderPtr? that's an LPP class?!

 2011/11/18 Emerson Espínola emersonespin...@gmail.com

 Hi. I'm porting BrazillianAnalyzer from Java do C++.
 I could compile already BrazilianStemFilter.cpp/.h.
 Now I'm trying to make BrazillianAnalyzer.cpp/.h compile. But I getting
 some errors.

 For example BrazillianAnalyzer.java extends from StopwordAnalyzerBase,
 but I couldn't find this class in CLucene.
  It also uses LuceneVersion, ReaderPtr, and TokenStreamPtr, but I
 couldn't find these classes too. Are there their implementations in CLucene
 with another name?

 I downloaded clucene-core-2.3.3.4.


 []'s
 Emerson de Lira Espínola
 **
 emersonespin...@gmail.com
 https://profiles.google.com/emersonespinola/buzz?hl=pt-BR
 http://www.quora.com/emersonespinola 
 http://www.facebook.com/emersonespinola
 http://www.linkedin.com/in/emersonespinola
 http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com
 http://emersonespinola.blogspot.com
 http://twitter.com/emersonespinola
 http://www.myebook.com/emersonespinola/

  http://www.myebook.com/emersonespinola/


 2011/11/7 Bořivoj Kostka kos...@tovek.cz

  Well, I was talking about version 2.3; you can download it from
 http://sourceforge.net/projects/clucene/

 I guess the stemming from this version could be adapted for 0.9
 (probably some small modifications will be needed), if you cannot upgrade.
 

 ** **

 Regards

 Borek

 ** **

 *From:* Emerson Espínola [mailto:emersonespin...@gmail.com]
 *Sent:* Monday, November 07, 2011 7:08 PM

 *To:* clucene-developers@lists.sourceforge.net
 *Subject:* Re: [CLucene-dev] Indexing a document

  ** **

 The version that I have here is clucene-core-0.9.21b and I don't see
 this inside src folder. My structure is
 clucene-core-0.9.21b\src\CLucene\analysis\standard. :(


 []'s
 Emerson de Lira Espínola
 ~WRD000.jpg 
 emersonespin...@gmail.com~WRD000.jpghttps://profiles.google.com/emersonespinola/buzz?hl=pt-BR
 ~WRD000.jpg http://www.quora.com/emersonespinola 
 ~WRD000.jpghttp://www.facebook.com/emersonespinola
 image002.jpg 
 http://www.linkedin.com/in/emersonespinola~WRD000.jpghttp://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com
 ~WRD000.jpg 
 http://emersonespinola.blogspot.com~WRD000.jpghttp://twitter.com/emersonespinola
 ~WRD000.jpg http://www.myebook.com/emersonespinola/



 

  2011/11/7 Bořivoj Kostka kos...@tovek.cz

 It is located in src/contribs-lib/CLucene/snowball

 You must enable building contribs-lib when you run cmake.

 As far as I know it is standard part of CLucene source package

 Borek

  

 *From:* Emerson Espínola [mailto:emersonespin...@gmail.com]
 *Sent:* Monday, November 07, 2011 4:53 PM


 *To:* clucene-developers@lists.sourceforge.net
 *Subject:* Re: [CLucene-dev] Indexing a document

  

 Really?



 That would help me a lot. Regarding stemm there are no much differences.
 How do I get it? Is contribs-lib a folder? If so I couldn't find it.

 []'s
 Emerson de Lira Espínola

 ~WRD000.jpg 
 emersonespin...@gmail.com~WRD000.jpghttps://profiles.google.com/emersonespinola/buzz?hl=pt-BR
 ~WRD000.jpg http://www.quora.com/emersonespinola 
 ~WRD000.jpghttp://www.facebook.com/emersonespinola
 image003.jpg 
 http://www.linkedin.com/in/emersonespinola~WRD000.jpghttp://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com
 ~WRD000.jpg 
 http://emersonespinola.blogspot.com~WRD000.jpghttp://twitter.com/emersonespinola
 ~WRD000.jpg http://www.myebook.com/emersonespinola/


 

 2011/11/7 Bořivoj Kostka kos...@tovek.cz

 Emerson

 There is portuguese stemmer in contribs-lib (snowball). I don’t know how
 much written brazillian differs from portuguese, but perhaps this could be
 also (at least partial) solution.

 Regards

 Borek

  

 *From:* Emerson Espínola [mailto:emersonespin...@gmail.com]
 *Sent:* Monday, November 07, 2011 3:28 PM
 *To:* clucene-developers@lists.sourceforge.net
 *Subject:* Re: [CLucene-dev] Indexing a document

  

 Thank you.

 2011/11/7 Itamar Syn-Hershko ita...@code972.com

 Yeah, shouldn't be too hard to pull off

  

 On Mon, Nov 7, 2011 at 4:17 PM, Emerson Espínola 
 emersonespin

Re: [CLucene-dev] Indexing a document

2011-11-19 Thread Itamar Syn-Hershko
ReaderPtr? that's an LPP class?!

2011/11/18 Emerson Espínola emersonespin...@gmail.com

 Hi. I'm porting BrazillianAnalyzer from Java do C++.
 I could compile already BrazilianStemFilter.cpp/.h.
 Now I'm trying to make BrazillianAnalyzer.cpp/.h compile. But I getting
 some errors.

 For example BrazillianAnalyzer.java extends from StopwordAnalyzerBase, but
 I couldn't find this class in CLucene.
  It also uses LuceneVersion, ReaderPtr, and TokenStreamPtr, but I couldn't
 find these classes too. Are there their implementations in CLucene with
 another name?

 I downloaded clucene-core-2.3.3.4.


 []'s
 Emerson de Lira Espínola
 **
 emersonespin...@gmail.com
 https://profiles.google.com/emersonespinola/buzz?hl=pt-BR
 http://www.quora.com/emersonespinola 
 http://www.facebook.com/emersonespinola
 http://www.linkedin.com/in/emersonespinola
 http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com
 http://emersonespinola.blogspot.com http://twitter.com/emersonespinola
 http://www.myebook.com/emersonespinola/

  http://www.myebook.com/emersonespinola/


 2011/11/7 Bořivoj Kostka kos...@tovek.cz

  Well, I was talking about version 2.3; you can download it from
 http://sourceforge.net/projects/clucene/

 I guess the stemming from this version could be adapted for 0.9 (probably
 some small modifications will be needed), if you cannot upgrade.

 ** **

 Regards

 Borek

 ** **

 *From:* Emerson Espínola [mailto:emersonespin...@gmail.com]
 *Sent:* Monday, November 07, 2011 7:08 PM

 *To:* clucene-developers@lists.sourceforge.net
 *Subject:* Re: [CLucene-dev] Indexing a document

  ** **

 The version that I have here is clucene-core-0.9.21b and I don't see this
 inside src folder. My structure is
 clucene-core-0.9.21b\src\CLucene\analysis\standard. :(


 []'s
 Emerson de Lira Espínola
 [image: Obrázek byl odebrán odesílatelem.] emersonespin...@gmail.com[image:
 Obrázek byl odebrán 
 odesílatelem.]https://profiles.google.com/emersonespinola/buzz?hl=pt-BR[image:
 Obrázek byl odebrán odesílatelem.] http://www.quora.com/emersonespinola
  [image: Obrázek byl odebrán 
 odesílatelem.]http://www.facebook.com/emersonespinola [image:
 Obrázek byl odebrán 
 odesílatelem.]http://www.linkedin.com/in/emersonespinola[image:
 Obrázek byl odebrán 
 odesílatelem.]http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com[image:
 Obrázek byl odebrán odesílatelem.] 
 http://emersonespinola.blogspot.com[image:
 Obrázek byl odebrán odesílatelem.] 
 http://twitter.com/emersonespinola[image:
 Obrázek byl odebrán odesílatelem.]http://www.myebook.com/emersonespinola/



 

  2011/11/7 Bořivoj Kostka kos...@tovek.cz

 It is located in src/contribs-lib/CLucene/snowball

 You must enable building contribs-lib when you run cmake.

 As far as I know it is standard part of CLucene source package

 Borek

  

 *From:* Emerson Espínola [mailto:emersonespin...@gmail.com]
 *Sent:* Monday, November 07, 2011 4:53 PM


 *To:* clucene-developers@lists.sourceforge.net
 *Subject:* Re: [CLucene-dev] Indexing a document

  

 Really?



 That would help me a lot. Regarding stemm there are no much differences.
 How do I get it? Is contribs-lib a folder? If so I couldn't find it.

 []'s
 Emerson de Lira Espínola

 [image: Obrázek byl odebrán odesílatelem.] emersonespin...@gmail.com[image:
 Obrázek byl odebrán 
 odesílatelem.]https://profiles.google.com/emersonespinola/buzz?hl=pt-BR[image:
 Obrázek byl odebrán odesílatelem.] http://www.quora.com/emersonespinola
  [image: Obrázek byl odebrán 
 odesílatelem.]http://www.facebook.com/emersonespinola [image:
 Obrázek byl odebrán 
 odesílatelem.]http://www.linkedin.com/in/emersonespinola[image:
 Obrázek byl odebrán 
 odesílatelem.]http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com[image:
 Obrázek byl odebrán odesílatelem.] 
 http://emersonespinola.blogspot.com[image:
 Obrázek byl odebrán odesílatelem.] 
 http://twitter.com/emersonespinola[image:
 Obrázek byl odebrán odesílatelem.]http://www.myebook.com/emersonespinola/


 

 2011/11/7 Bořivoj Kostka kos...@tovek.cz

 Emerson

 There is portuguese stemmer in contribs-lib (snowball). I don't know how
 much written brazillian differs from portuguese, but perhaps this could be
 also (at least partial) solution.

 Regards

 Borek

  

 *From:* Emerson Espínola [mailto:emersonespin...@gmail.com]
 *Sent:* Monday, November 07, 2011 3:28 PM
 *To:* clucene-developers@lists.sourceforge.net
 *Subject:* Re: [CLucene-dev] Indexing a document

  

 Thank you.

 2011/11/7 Itamar Syn-Hershko ita...@code972.com

 Yeah, shouldn't be too hard to pull off

  

 On Mon, Nov 7, 2011 at 4:17 PM, Emerson Espínola 
 emersonespin...@gmail.com wrote:

 So the better way for me would be porting BrazillianAnalyzer from Lucene,
 wouldn't it? Because if I use the [1] I'll probably have problems because
 it depends on something (.h files

Re: [CLucene-dev] Indexing a document

2011-11-07 Thread Emerson Espínola
Thank you Viet.

When will this new version of CLucene be ready? Any idea?

[]'s
Emerson de Lira Espínola
** emersonespin...@gmail.com
https://profiles.google.com/emersonespinola/buzz?hl=pt-BR
http://www.quora.com/emersonespinola
http://www.facebook.com/emersonespinola
http://www.linkedin.com/in/emersonespinola
http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com
http://emersonespinola.blogspot.com http://twitter.com/emersonespinola
http://www.myebook.com/emersonespinola/

 http://www.myebook.com/emersonespinola/


2011/11/5 Veit Jahns nuncupa...@googlemail.com

 Hi Emerson!

 2011/11/4 Emerson Espínola emersonespin...@gmail.com


 I'm willing to write the BrazillianAnalyzer. What should I do? Do you
 have an idea about how much time is needed to write it? 1 week? 2 weeks? 1
 month?


 If the BrazillianAnalyzer from Java Lucene fits your needs, then the only
 thing you have to do is to port this Analyzer to CLucene. I did the same
 once for the GermanAnalyzer and I needed more or less a day.

 BTW, a while ago we started to work on a new code base for CLucene based
 on Lucene++. Lucene++ has already a BrazillianAnalyzer [1]. But it is based
 on a newer version of Java Lucene.

 Kind regards,

 Veit

 [1]
 https://github.com/luceneplusplus/LucenePlusPlus/tree/e28b15b02ff9de2208965e9af8eb80983380cdcd/src/contrib/analyzers/common/analysis/br



 --
 RSA(R) Conference 2012
 Save $700 by Nov 18
 Register now
 http://p.sf.net/sfu/rsa-sfdev2dev1
 ___
 CLucene-developers mailing list
 CLucene-developers@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/clucene-developers


--
RSA(R) Conference 2012
Save $700 by Nov 18
Register now
http://p.sf.net/sfu/rsa-sfdev2dev1___
CLucene-developers mailing list
CLucene-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/clucene-developers


Re: [CLucene-dev] Indexing a document

2011-11-07 Thread Itamar Syn-Hershko
Thats going to take a while, unfortunately. LPP is already available on
github, but we want to have some improvements made to its core before
merging it to CLucene

On Mon, Nov 7, 2011 at 4:01 PM, Emerson Espínola
emersonespin...@gmail.comwrote:

 Thank you Viet.

 When will this new version of CLucene be ready? Any idea?


 []'s
 Emerson de Lira Espínola
 ** emersonespin...@gmail.com
 https://profiles.google.com/emersonespinola/buzz?hl=pt-BR
 http://www.quora.com/emersonespinola 
 http://www.facebook.com/emersonespinola
 http://www.linkedin.com/in/emersonespinola
 http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com
 http://emersonespinola.blogspot.com http://twitter.com/emersonespinola
 http://www.myebook.com/emersonespinola/

  http://www.myebook.com/emersonespinola/


 2011/11/5 Veit Jahns nuncupa...@googlemail.com

 Hi Emerson!

 2011/11/4 Emerson Espínola emersonespin...@gmail.com


 I'm willing to write the BrazillianAnalyzer. What should I do? Do you
 have an idea about how much time is needed to write it? 1 week? 2 weeks? 1
 month?


 If the BrazillianAnalyzer from Java Lucene fits your needs, then the only
 thing you have to do is to port this Analyzer to CLucene. I did the same
 once for the GermanAnalyzer and I needed more or less a day.

 BTW, a while ago we started to work on a new code base for CLucene based
 on Lucene++. Lucene++ has already a BrazillianAnalyzer [1]. But it is based
 on a newer version of Java Lucene.

 Kind regards,

 Veit

 [1]
 https://github.com/luceneplusplus/LucenePlusPlus/tree/e28b15b02ff9de2208965e9af8eb80983380cdcd/src/contrib/analyzers/common/analysis/br



 --
 RSA(R) Conference 2012
 Save $700 by Nov 18
 Register now
 http://p.sf.net/sfu/rsa-sfdev2dev1

 ___
 CLucene-developers mailing list
 CLucene-developers@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/clucene-developers




 --
 RSA(R) Conference 2012
 Save $700 by Nov 18
 Register now
 http://p.sf.net/sfu/rsa-sfdev2dev1
 ___
 CLucene-developers mailing list
 CLucene-developers@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/clucene-developers


--
RSA(R) Conference 2012
Save $700 by Nov 18
Register now
http://p.sf.net/sfu/rsa-sfdev2dev1___
CLucene-developers mailing list
CLucene-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/clucene-developers


Re: [CLucene-dev] Indexing a document

2011-11-07 Thread Emerson Espínola
So the better way for me would be porting BrazillianAnalyzer from Lucene,
wouldn't it? Because if I use the [1] I'll probably have problems because
it depends on something (.h files) that are not implemented already. Is
that right?

[]'s
Emerson de Lira Espínola
** emersonespin...@gmail.com
https://profiles.google.com/emersonespinola/buzz?hl=pt-BR
http://www.quora.com/emersonespinola
http://www.facebook.com/emersonespinola
http://www.linkedin.com/in/emersonespinola
http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com
http://emersonespinola.blogspot.com http://twitter.com/emersonespinola
http://www.myebook.com/emersonespinola/

 http://www.myebook.com/emersonespinola/


2011/11/7 Itamar Syn-Hershko ita...@code972.com

 Thats going to take a while, unfortunately. LPP is already available on
 github, but we want to have some improvements made to its core before
 merging it to CLucene


 On Mon, Nov 7, 2011 at 4:01 PM, Emerson Espínola 
 emersonespin...@gmail.com wrote:

 Thank you Viet.

 When will this new version of CLucene be ready? Any idea?


 []'s
 Emerson de Lira Espínola
 ** emersonespin...@gmail.com
 https://profiles.google.com/emersonespinola/buzz?hl=pt-BR
 http://www.quora.com/emersonespinola 
 http://www.facebook.com/emersonespinola
 http://www.linkedin.com/in/emersonespinola
 http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com
 http://emersonespinola.blogspot.com
 http://twitter.com/emersonespinola
 http://www.myebook.com/emersonespinola/

  http://www.myebook.com/emersonespinola/


 2011/11/5 Veit Jahns nuncupa...@googlemail.com

  Hi Emerson!

 2011/11/4 Emerson Espínola emersonespin...@gmail.com


 I'm willing to write the BrazillianAnalyzer. What should I do? Do you
 have an idea about how much time is needed to write it? 1 week? 2 weeks? 1
 month?


 If the BrazillianAnalyzer from Java Lucene fits your needs, then the
 only thing you have to do is to port this Analyzer to CLucene. I did the
 same once for the GermanAnalyzer and I needed more or less a day.

 BTW, a while ago we started to work on a new code base for CLucene based
 on Lucene++. Lucene++ has already a BrazillianAnalyzer [1]. But it is based
 on a newer version of Java Lucene.

 Kind regards,

 Veit

 [1]
 https://github.com/luceneplusplus/LucenePlusPlus/tree/e28b15b02ff9de2208965e9af8eb80983380cdcd/src/contrib/analyzers/common/analysis/br



 --
 RSA(R) Conference 2012
 Save $700 by Nov 18
 Register now
 http://p.sf.net/sfu/rsa-sfdev2dev1

 ___
 CLucene-developers mailing list
 CLucene-developers@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/clucene-developers




 --
 RSA(R) Conference 2012
 Save $700 by Nov 18
 Register now
 http://p.sf.net/sfu/rsa-sfdev2dev1
 ___
 CLucene-developers mailing list
 CLucene-developers@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/clucene-developers




 --
 RSA(R) Conference 2012
 Save $700 by Nov 18
 Register now
 http://p.sf.net/sfu/rsa-sfdev2dev1
 ___
 CLucene-developers mailing list
 CLucene-developers@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/clucene-developers


--
RSA(R) Conference 2012
Save $700 by Nov 18
Register now
http://p.sf.net/sfu/rsa-sfdev2dev1___
CLucene-developers mailing list
CLucene-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/clucene-developers


Re: [CLucene-dev] Indexing a document

2011-11-07 Thread Itamar Syn-Hershko
Yeah, shouldn't be too hard to pull off

On Mon, Nov 7, 2011 at 4:17 PM, Emerson Espínola
emersonespin...@gmail.comwrote:

 So the better way for me would be porting BrazillianAnalyzer from Lucene,
 wouldn't it? Because if I use the [1] I'll probably have problems because
 it depends on something (.h files) that are not implemented already. Is
 that right?


 []'s
 Emerson de Lira Espínola
 ** emersonespin...@gmail.com
 https://profiles.google.com/emersonespinola/buzz?hl=pt-BR
 http://www.quora.com/emersonespinola 
 http://www.facebook.com/emersonespinola
 http://www.linkedin.com/in/emersonespinola
 http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com
 http://emersonespinola.blogspot.com http://twitter.com/emersonespinola
 http://www.myebook.com/emersonespinola/

  http://www.myebook.com/emersonespinola/


 2011/11/7 Itamar Syn-Hershko ita...@code972.com

 Thats going to take a while, unfortunately. LPP is already available on
 github, but we want to have some improvements made to its core before
 merging it to CLucene


 On Mon, Nov 7, 2011 at 4:01 PM, Emerson Espínola 
 emersonespin...@gmail.com wrote:

 Thank you Viet.

 When will this new version of CLucene be ready? Any idea?


 []'s
 Emerson de Lira Espínola
 ** emersonespin...@gmail.com
 https://profiles.google.com/emersonespinola/buzz?hl=pt-BR
 http://www.quora.com/emersonespinola 
 http://www.facebook.com/emersonespinola
 http://www.linkedin.com/in/emersonespinola
 http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com
 http://emersonespinola.blogspot.com
 http://twitter.com/emersonespinola
 http://www.myebook.com/emersonespinola/

  http://www.myebook.com/emersonespinola/


 2011/11/5 Veit Jahns nuncupa...@googlemail.com

  Hi Emerson!

 2011/11/4 Emerson Espínola emersonespin...@gmail.com


 I'm willing to write the BrazillianAnalyzer. What should I do? Do you
 have an idea about how much time is needed to write it? 1 week? 2 weeks? 1
 month?


 If the BrazillianAnalyzer from Java Lucene fits your needs, then the
 only thing you have to do is to port this Analyzer to CLucene. I did the
 same once for the GermanAnalyzer and I needed more or less a day.

 BTW, a while ago we started to work on a new code base for CLucene
 based on Lucene++. Lucene++ has already a BrazillianAnalyzer [1]. But it is
 based on a newer version of Java Lucene.

 Kind regards,

 Veit

 [1]
 https://github.com/luceneplusplus/LucenePlusPlus/tree/e28b15b02ff9de2208965e9af8eb80983380cdcd/src/contrib/analyzers/common/analysis/br



 --
 RSA(R) Conference 2012
 Save $700 by Nov 18
 Register now
 http://p.sf.net/sfu/rsa-sfdev2dev1

 ___
 CLucene-developers mailing list
 CLucene-developers@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/clucene-developers




 --
 RSA(R) Conference 2012
 Save $700 by Nov 18
 Register now
 http://p.sf.net/sfu/rsa-sfdev2dev1
 ___
 CLucene-developers mailing list
 CLucene-developers@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/clucene-developers




 --
 RSA(R) Conference 2012
 Save $700 by Nov 18
 Register now
 http://p.sf.net/sfu/rsa-sfdev2dev1
 ___
 CLucene-developers mailing list
 CLucene-developers@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/clucene-developers




 --
 RSA(R) Conference 2012
 Save $700 by Nov 18
 Register now
 http://p.sf.net/sfu/rsa-sfdev2dev1
 ___
 CLucene-developers mailing list
 CLucene-developers@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/clucene-developers


--
RSA(R) Conference 2012
Save $700 by Nov 18
Register now
http://p.sf.net/sfu/rsa-sfdev2dev1___
CLucene-developers mailing list
CLucene-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/clucene-developers


Re: [CLucene-dev] Indexing a document

2011-11-07 Thread Emerson Espínola
Really?

That would help me a lot. Regarding stemm there are no much differences.
How do I get it? Is contribs-lib a folder? If so I couldn't find it.

[]'s
Emerson de Lira Espínola
** emersonespin...@gmail.com
https://profiles.google.com/emersonespinola/buzz?hl=pt-BR
http://www.quora.com/emersonespinola
http://www.facebook.com/emersonespinola
http://www.linkedin.com/in/emersonespinola
http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com
http://emersonespinola.blogspot.com http://twitter.com/emersonespinola
http://www.myebook.com/emersonespinola/

 http://www.myebook.com/emersonespinola/


2011/11/7 Bořivoj Kostka kos...@tovek.cz

  Emerson

 There is portuguese stemmer in contribs-lib (snowball). I don’t know how
 much written brazillian differs from portuguese, but perhaps this could be
 also (at least partial) solution.

 Regards

 Borek

 ** **

 *From:* Emerson Espínola [mailto:emersonespin...@gmail.com]
 *Sent:* Monday, November 07, 2011 3:28 PM
 *To:* clucene-developers@lists.sourceforge.net
 *Subject:* Re: [CLucene-dev] Indexing a document

 ** **

 Thank you.

 2011/11/7 Itamar Syn-Hershko ita...@code972.com

 Yeah, shouldn't be too hard to pull off

 ** **

 On Mon, Nov 7, 2011 at 4:17 PM, Emerson Espínola 
 emersonespin...@gmail.com wrote:

 So the better way for me would be porting BrazillianAnalyzer from Lucene,
 wouldn't it? Because if I use the [1] I'll probably have problems because
 it depends on something (.h files) that are not implemented already. Is
 that right?


 

 2011/11/7 Itamar Syn-Hershko ita...@code972.com

 Thats going to take a while, unfortunately. LPP is already available on
 github, but we want to have some improvements made to its core before
 merging it to CLucene

 ** **

 On Mon, Nov 7, 2011 at 4:01 PM, Emerson Espínola 
 emersonespin...@gmail.com wrote:

 Thank you Viet.

 When will this new version of CLucene be ready? Any idea?



 []'s
 Emerson de Lira Espínola


 

 2011/11/5 Veit Jahns nuncupa...@googlemail.com

 Hi Emerson!

 2011/11/4 Emerson Espínola emersonespin...@gmail.com


 I'm willing to write the BrazillianAnalyzer. What should I do? Do you have
 an idea about how much time is needed to write it? 1 week? 2 weeks? 1 month?
 


 If the BrazillianAnalyzer from Java Lucene fits your needs, then the only
 thing you have to do is to port this Analyzer to CLucene. I did the same
 once for the GermanAnalyzer and I needed more or less a day.

 BTW, a while ago we started to work on a new code base for CLucene based
 on Lucene++. Lucene++ has already a BrazillianAnalyzer [1]. But it is based
 on a newer version of Java Lucene.

 Kind regards,

 Veit

 [1]
 https://github.com/luceneplusplus/LucenePlusPlus/tree/e28b15b02ff9de2208965e9af8eb80983380cdcd/src/contrib/analyzers/common/analysis/br
 

 ** **


 --
 RSA(R) Conference 2012
 Save $700 by Nov 18
 Register now
 http://p.sf.net/sfu/rsa-sfdev2dev1


 ___
 CLucene-developers mailing list
 CLucene-developers@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/clucene-developers




 --
 RSA(R) Conference 2012
 Save $700 by Nov 18
 Register now
 http://p.sf.net/sfu/rsa-sfdev2dev1
 ___
 CLucene-developers mailing list
 CLucene-developers@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/clucene-developers

 ** **



 --
 RSA(R) Conference 2012
 Save $700 by Nov 18
 Register now
 http://p.sf.net/sfu/rsa-sfdev2dev1
 ___
 CLucene-developers mailing list
 CLucene-developers@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/clucene-developers

 ** **



 --
 RSA(R) Conference 2012
 Save $700 by Nov 18
 Register now
 http://p.sf.net/sfu/rsa-sfdev2dev1
 ___
 CLucene-developers mailing list
 CLucene-developers@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/clucene-developers

 ** **



 --
 RSA(R) Conference 2012
 Save $700 by Nov 18
 Register now
 http://p.sf.net/sfu/rsa-sfdev2dev1
 ___
 CLucene-developers mailing list
 CLucene-developers@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/clucene-developers

 ** **


 --
 RSA(R) Conference 2012
 Save $700 by Nov 18
 Register now
 http://p.sf.net/sfu/rsa-sfdev2dev1

Re: [CLucene-dev] Indexing a document

2011-11-07 Thread Bořivoj Kostka
It is located in src/contribs-lib/CLucene/snowball
You must enable building contribs-lib when you run cmake.
As far as I know it is standard part of CLucene source package
Borek

From: Emerson Espínola [mailto:emersonespin...@gmail.com]
Sent: Monday, November 07, 2011 4:53 PM
To: clucene-developers@lists.sourceforge.net
Subject: Re: [CLucene-dev] Indexing a document

Really?

That would help me a lot. Regarding stemm there are no much differences. How do 
I get it? Is contribs-lib a folder? If so I couldn't find it.

[]'s
Emerson de Lira Espínola
[Obrázek byl odebrán odesílatelem.]mailto:emersonespin...@gmail.com[Obrázek 
byl odebrán 
odesílatelem.]https://profiles.google.com/emersonespinola/buzz?hl=pt-BR[Obrázek
 byl odebrán odesílatelem.]http://www.quora.com/emersonespinola [Obrázek byl 
odebrán odesílatelem.] http://www.facebook.com/emersonespinola  [Obrázek byl 
odebrán odesílatelem.] http://www.linkedin.com/in/emersonespinola [Obrázek 
byl odebrán odesílatelem.] 
http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com [Obrázek 
byl odebrán odesílatelem.] http://emersonespinola.blogspot.com [Obrázek byl 
odebrán odesílatelem.] http://twitter.com/emersonespinola [Obrázek byl 
odebrán odesílatelem.] http://www.myebook.com/emersonespinola/



2011/11/7 Bořivoj Kostka kos...@tovek.czmailto:kos...@tovek.cz
Emerson
There is portuguese stemmer in contribs-lib (snowball). I don’t know how much 
written brazillian differs from portuguese, but perhaps this could be also (at 
least partial) solution.
Regards
Borek

From: Emerson Espínola 
[mailto:emersonespin...@gmail.commailto:emersonespin...@gmail.com]
Sent: Monday, November 07, 2011 3:28 PM
To: 
clucene-developers@lists.sourceforge.netmailto:clucene-developers@lists.sourceforge.net
Subject: Re: [CLucene-dev] Indexing a document

Thank you.

2011/11/7 Itamar Syn-Hershko ita...@code972.commailto:ita...@code972.com
Yeah, shouldn't be too hard to pull off

On Mon, Nov 7, 2011 at 4:17 PM, Emerson Espínola 
emersonespin...@gmail.commailto:emersonespin...@gmail.com wrote:
So the better way for me would be porting BrazillianAnalyzer from Lucene, 
wouldn't it? Because if I use the [1] I'll probably have problems because it 
depends on something (.h files) that are not implemented already. Is that right?

2011/11/7 Itamar Syn-Hershko ita...@code972.commailto:ita...@code972.com
Thats going to take a while, unfortunately. LPP is already available on github, 
but we want to have some improvements made to its core before merging it to 
CLucene

On Mon, Nov 7, 2011tel:2011 at 4:01 PM, Emerson Espínola 
emersonespin...@gmail.commailto:emersonespin...@gmail.com wrote:
Thank you Viet.

When will this new version of CLucene be ready? Any idea?


[]'s
Emerson de Lira Espínola

2011/11/5 Veit Jahns 
nuncupa...@googlemail.commailto:nuncupa...@googlemail.com
Hi Emerson!

2011/11/4 Emerson Espínola 
emersonespin...@gmail.commailto:emersonespin...@gmail.com

I'm willing to write the BrazillianAnalyzer. What should I do? Do you have an 
idea about how much time is needed to write it? 1 week? 2 weeks? 1 month?

If the BrazillianAnalyzer from Java Lucene fits your needs, then the only thing 
you have to do is to port this Analyzer to CLucene. I did the same once for the 
GermanAnalyzer and I needed more or less a day.

BTW, a while ago we started to work on a new code base for CLucene based on 
Lucene++. Lucene++ has already a BrazillianAnalyzer [1]. But it is based on a 
newer version of Java Lucene.

Kind regards,

Veit

[1] 
https://github.com/luceneplusplus/LucenePlusPlus/tree/e28b15b02ff9de2208965e9af8eb80983380cdcd/src/contrib/analyzers/common/analysis/br

--
RSA(R) Conference 2012tel:2012
Save $700 by Nov 18
Register now
http://p.sf.net/sfu/rsa-sfdev2dev1

___
CLucene-developers mailing list
CLucene-developers@lists.sourceforge.netmailto:CLucene-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/clucene-developers


--
RSA(R) Conference 2012tel:2012
Save $700 by Nov 18
Register now
http://p.sf.net/sfu/rsa-sfdev2dev1
___
CLucene-developers mailing list
CLucene-developers@lists.sourceforge.netmailto:CLucene-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/clucene-developers


--
RSA(R) Conference 2012tel:2012
Save $700 by Nov 18
Register now
http://p.sf.net/sfu/rsa-sfdev2dev1
___
CLucene-developers mailing list
CLucene-developers@lists.sourceforge.netmailto:CLucene-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/clucene-developers


--
RSA(R) Conference

Re: [CLucene-dev] Indexing a document

2011-11-07 Thread Bořivoj Kostka
Well, I was talking about version 2.3; you can download it from 
http://sourceforge.net/projects/clucene/
I guess the stemming from this version could be adapted for 0.9 (probably some 
small modifications will be needed), if you cannot upgrade.

Regards
Borek

From: Emerson Espínola [mailto:emersonespin...@gmail.com]
Sent: Monday, November 07, 2011 7:08 PM
To: clucene-developers@lists.sourceforge.net
Subject: Re: [CLucene-dev] Indexing a document

The version that I have here is clucene-core-0.9.21b and I don't see this 
inside src folder. My structure is 
clucene-core-0.9.21b\src\CLucene\analysis\standard. :(

[]'s
Emerson de Lira Espínola
[Obrázek byl odebrán odesílatelem.]mailto:emersonespin...@gmail.com[Obrázek 
byl odebrán 
odesílatelem.]https://profiles.google.com/emersonespinola/buzz?hl=pt-BR[Obrázek
 byl odebrán odesílatelem.]http://www.quora.com/emersonespinola [Obrázek byl 
odebrán odesílatelem.] http://www.facebook.com/emersonespinola  [Obrázek byl 
odebrán odesílatelem.] http://www.linkedin.com/in/emersonespinola [Obrázek 
byl odebrán odesílatelem.] 
http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com [Obrázek 
byl odebrán odesílatelem.] http://emersonespinola.blogspot.com [Obrázek byl 
odebrán odesílatelem.] http://twitter.com/emersonespinola [Obrázek byl 
odebrán odesílatelem.] http://www.myebook.com/emersonespinola/



2011/11/7 Bořivoj Kostka kos...@tovek.czmailto:kos...@tovek.cz
It is located in src/contribs-lib/CLucene/snowball
You must enable building contribs-lib when you run cmake.
As far as I know it is standard part of CLucene source package
Borek

From: Emerson Espínola 
[mailto:emersonespin...@gmail.commailto:emersonespin...@gmail.com]
Sent: Monday, November 07, 2011 4:53 PM

To: 
clucene-developers@lists.sourceforge.netmailto:clucene-developers@lists.sourceforge.net
Subject: Re: [CLucene-dev] Indexing a document

Really?


That would help me a lot. Regarding stemm there are no much differences. How do 
I get it? Is contribs-lib a folder? If so I couldn't find it.

[]'s
Emerson de Lira Espínola
[Obrázek byl odebrán odesílatelem.]mailto:emersonespin...@gmail.com[Obrázek 
byl odebrán 
odesílatelem.]https://profiles.google.com/emersonespinola/buzz?hl=pt-BR[Obrázek
 byl odebrán odesílatelem.]http://www.quora.com/emersonespinola [Obrázek byl 
odebrán odesílatelem.] http://www.facebook.com/emersonespinola  [Obrázek byl 
odebrán odesílatelem.] http://www.linkedin.com/in/emersonespinola [Obrázek 
byl odebrán odesílatelem.] 
http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com [Obrázek 
byl odebrán odesílatelem.] http://emersonespinola.blogspot.com [Obrázek byl 
odebrán odesílatelem.] http://twitter.com/emersonespinola [Obrázek byl 
odebrán odesílatelem.] http://www.myebook.com/emersonespinola/


2011/11/7 Bořivoj Kostka kos...@tovek.czmailto:kos...@tovek.cz
Emerson
There is portuguese stemmer in contribs-lib (snowball). I don’t know how much 
written brazillian differs from portuguese, but perhaps this could be also (at 
least partial) solution.
Regards
Borek

From: Emerson Espínola 
[mailto:emersonespin...@gmail.commailto:emersonespin...@gmail.com]
Sent: Monday, November 07, 2011 3:28 PM
To: 
clucene-developers@lists.sourceforge.netmailto:clucene-developers@lists.sourceforge.net
Subject: Re: [CLucene-dev] Indexing a document

Thank you.

2011/11/7 Itamar Syn-Hershko ita...@code972.commailto:ita...@code972.com
Yeah, shouldn't be too hard to pull off

On Mon, Nov 7, 2011 at 4:17 PM, Emerson Espínola 
emersonespin...@gmail.commailto:emersonespin...@gmail.com wrote:
So the better way for me would be porting BrazillianAnalyzer from Lucene, 
wouldn't it? Because if I use the [1] I'll probably have problems because it 
depends on something (.h files) that are not implemented already. Is that right?

2011/11/7 Itamar Syn-Hershko ita...@code972.commailto:ita...@code972.com
Thats going to take a while, unfortunately. LPP is already available on github, 
but we want to have some improvements made to its core before merging it to 
CLucene

On Mon, Nov 7, 2011tel:2011 at 4:01 PM, Emerson Espínola 
emersonespin...@gmail.commailto:emersonespin...@gmail.com wrote:
Thank you Viet.

When will this new version of CLucene be ready? Any idea?


[]'s
Emerson de Lira Espínola
2011/11/5 Veit Jahns 
nuncupa...@googlemail.commailto:nuncupa...@googlemail.com
Hi Emerson!

2011/11/4 Emerson Espínola 
emersonespin...@gmail.commailto:emersonespin...@gmail.com

I'm willing to write the BrazillianAnalyzer. What should I do? Do you have an 
idea about how much time is needed to write it? 1 week? 2 weeks? 1 month?

If the BrazillianAnalyzer from Java Lucene fits your needs, then the only thing 
you have to do is to port this Analyzer to CLucene. I did the same once for the 
GermanAnalyzer and I needed more or less a day.

BTW, a while ago we started to work on a new code base for CLucene based on 
Lucene++. Lucene++ has already a BrazillianAnalyzer [1]. But it is based on a 
newer version

Re: [CLucene-dev] Indexing a document

2011-11-05 Thread Veit Jahns
Hi Emerson!

2011/11/4 Emerson Espínola emersonespin...@gmail.com


 I'm willing to write the BrazillianAnalyzer. What should I do? Do you have
 an idea about how much time is needed to write it? 1 week? 2 weeks? 1 month?


If the BrazillianAnalyzer from Java Lucene fits your needs, then the only
thing you have to do is to port this Analyzer to CLucene. I did the same
once for the GermanAnalyzer and I needed more or less a day.

BTW, a while ago we started to work on a new code base for CLucene based on
Lucene++. Lucene++ has already a BrazillianAnalyzer [1]. But it is based on
a newer version of Java Lucene.

Kind regards,

Veit

[1]
https://github.com/luceneplusplus/LucenePlusPlus/tree/e28b15b02ff9de2208965e9af8eb80983380cdcd/src/contrib/analyzers/common/analysis/br
--
RSA(R) Conference 2012
Save $700 by Nov 18
Register now
http://p.sf.net/sfu/rsa-sfdev2dev1___
CLucene-developers mailing list
CLucene-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/clucene-developers


Re: [CLucene-dev] Indexing a document

2011-05-27 Thread Veit Jahns
Hi Emerson!

2011/5/27 Emerson Espínola emersonespin...@gmail.com

 Hi Veit.

 Thank yo very much for your answer. Great explanation. You don't wonder how 
 much you're helping me.

You are welcome!

 1. I'll try english documents.
 2. Ok.
 3. Does BrazilianAnalyzer work similar to StandardAnalyzer? If so, that's 
 what I want. I'm from Brazil. :) There are no much differences between 
 portuguese from Portugal and portuguese from Brazil. But if there is already 
 an analyzer for portuguese from Brazil that's perfect for me.

According to the source code, it is based on the StandardAnalyzer. It
uses the StandardTokenizer and the StandardFilter, and adds a stopword
filter and a brazilian stemming filter.

Kind regards,

Veit

--
vRanger cuts backup time in half-while increasing security.
With the market-leading solution for virtual backup and recovery, 
you get blazing-fast, flexible, and affordable data protection.
Download your free trial now. 
http://p.sf.net/sfu/quest-d2dcopy1
___
CLucene-developers mailing list
CLucene-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/clucene-developers


Re: [CLucene-dev] Indexing a document

2011-05-26 Thread Veit Jahns
Hi Emerson!

2011/5/26 Emerson Espínola emersonespin...@gmail.com

 void Index::NewDoc (const char *contents)

Using char here means that you use CLucene in an ASCII mode. Right?

 {
     Document* doc = _CLNEW Document();

     doc-add(*_CLNEW Field(_T(contents), _T(contents), Field::STORE_YES | 
 Field::INDEX_TOKENIZED));

_T(contents) isn't needed here. Just passing the value of contents
to the Field constructor is sufficient.

     this-writer-addDocument(doc);
     _CLDELETE(doc);

As far as I know, writer---or something inside writer---takes
ownership of doc. So, by deleting it you delete a object that is
still referenced. Maybe this is the reason for you access violation.

 }

 Consider this-writer already instanciated in my class Index. I debuged until 
 I couldn't go deeper and O stopped in the line which is throwing the 
 exception: line 282 from DocumentWriter.cpp method invertDocument().

 void DocumentWriter::invertDocument(const Document* doc) {
 ...
     CL_NS(analysis)::TokenStream* stream = analyzer-tokenStream(fieldName, 
 reader);
 ...
 }

Which version of CLucene do you use? I see there an assertion.

Kind regards,

Veit

--
vRanger cuts backup time in half-while increasing security.
With the market-leading solution for virtual backup and recovery, 
you get blazing-fast, flexible, and affordable data protection.
Download your free trial now. 
http://p.sf.net/sfu/quest-d2dcopy1
___
CLucene-developers mailing list
CLucene-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/clucene-developers