Re: [CLucene-dev] Indexing a document
I included only the CLucene.h. []'s Emerson de Lira Espínola ** emersonespin...@gmail.com https://profiles.google.com/emersonespinola/buzz?hl=pt-BR http://www.quora.com/emersonespinola http://www.facebook.com/emersonespinola http://www.linkedin.com/in/emersonespinola http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com http://emersonespinola.blogspot.com http://twitter.com/emersonespinola http://www.myebook.com/emersonespinola/ http://www.myebook.com/emersonespinola/ 2011/11/30 Veit Jahns nuncupa...@googlemail.com 2011/11/30 Emerson Espínola emersonespin...@gmail.com Here my code goes attached. The code looks good. If it was a missing include I think it wouldn't even compile, and I also used the namespace macros. I didn't mean the includes in CLucene, but the includes in your application. Did you only include the main CLucene.h? If so, did you added the BrazilianAnalyzer.h? Or did you include the BrazilianAnalyzer.h directly? Veit -- All the data continuously generated in your IT infrastructure contains a definitive record of customers, application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-novd2d ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers -- All the data continuously generated in your IT infrastructure contains a definitive record of customers, application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-novd2d___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers
Re: [CLucene-dev] Indexing a document
Has anyone seen these classes before? What would be another alternative to port them? []'s Emerson de Lira Espínola ** emersonespin...@gmail.com https://profiles.google.com/emersonespinola/buzz?hl=pt-BR http://www.quora.com/emersonespinola http://www.facebook.com/emersonespinola http://www.linkedin.com/in/emersonespinola http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com http://emersonespinola.blogspot.com http://twitter.com/emersonespinola http://www.myebook.com/emersonespinola/ http://www.myebook.com/emersonespinola/ 2011/11/19 Emerson Espinola emersonespin...@gmail.com I don't know. I'm not familiar with those classes. I'm trying to port them. Enviado via iPhone Em 19/11/2011, às 13:03, Itamar Syn-Hershko ita...@code972.com escreveu: ReaderPtr? that's an LPP class?! 2011/11/18 Emerson Espínola emersonespin...@gmail.com Hi. I'm porting BrazillianAnalyzer from Java do C++. I could compile already BrazilianStemFilter.cpp/.h. Now I'm trying to make BrazillianAnalyzer.cpp/.h compile. But I getting some errors. For example BrazillianAnalyzer.java extends from StopwordAnalyzerBase, but I couldn't find this class in CLucene. It also uses LuceneVersion, ReaderPtr, and TokenStreamPtr, but I couldn't find these classes too. Are there their implementations in CLucene with another name? I downloaded clucene-core-2.3.3.4. []'s Emerson de Lira Espínola ** emersonespin...@gmail.com https://profiles.google.com/emersonespinola/buzz?hl=pt-BR http://www.quora.com/emersonespinola http://www.facebook.com/emersonespinola http://www.linkedin.com/in/emersonespinola http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com http://emersonespinola.blogspot.com http://twitter.com/emersonespinola http://www.myebook.com/emersonespinola/ http://www.myebook.com/emersonespinola/ 2011/11/7 Bořivoj Kostka kos...@tovek.cz Well, I was talking about version 2.3; you can download it from http://sourceforge.net/projects/clucene/ I guess the stemming from this version could be adapted for 0.9 (probably some small modifications will be needed), if you cannot upgrade. ** ** Regards Borek ** ** *From:* Emerson Espínola [mailto:emersonespin...@gmail.com] *Sent:* Monday, November 07, 2011 7:08 PM *To:* clucene-developers@lists.sourceforge.net *Subject:* Re: [CLucene-dev] Indexing a document ** ** The version that I have here is clucene-core-0.9.21b and I don't see this inside src folder. My structure is clucene-core-0.9.21b\src\CLucene\analysis\standard. :( []'s Emerson de Lira Espínola ~WRD000.jpg emersonespin...@gmail.com~WRD000.jpghttps://profiles.google.com/emersonespinola/buzz?hl=pt-BR ~WRD000.jpg http://www.quora.com/emersonespinola ~WRD000.jpghttp://www.facebook.com/emersonespinola image002.jpg http://www.linkedin.com/in/emersonespinola~WRD000.jpghttp://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com ~WRD000.jpg http://emersonespinola.blogspot.com~WRD000.jpghttp://twitter.com/emersonespinola ~WRD000.jpg http://www.myebook.com/emersonespinola/ 2011/11/7 Bořivoj Kostka kos...@tovek.cz It is located in src/contribs-lib/CLucene/snowball You must enable building contribs-lib when you run cmake. As far as I know it is standard part of CLucene source package Borek *From:* Emerson Espínola [mailto:emersonespin...@gmail.com] *Sent:* Monday, November 07, 2011 4:53 PM *To:* clucene-developers@lists.sourceforge.net *Subject:* Re: [CLucene-dev] Indexing a document Really? That would help me a lot. Regarding stemm there are no much differences. How do I get it? Is contribs-lib a folder? If so I couldn't find it. []'s Emerson de Lira Espínola ~WRD000.jpg emersonespin...@gmail.com~WRD000.jpghttps://profiles.google.com/emersonespinola/buzz?hl=pt-BR ~WRD000.jpg http://www.quora.com/emersonespinola ~WRD000.jpghttp://www.facebook.com/emersonespinola image003.jpg http://www.linkedin.com/in/emersonespinola~WRD000.jpghttp://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com ~WRD000.jpg http://emersonespinola.blogspot.com~WRD000.jpghttp://twitter.com/emersonespinola ~WRD000.jpg http://www.myebook.com/emersonespinola/ 2011/11/7 Bořivoj Kostka kos...@tovek.cz Emerson There is portuguese stemmer in contribs-lib (snowball). I don’t know how much written brazillian differs from portuguese, but perhaps this could be also (at least partial) solution. Regards Borek *From:* Emerson Espínola [mailto:emersonespin...@gmail.com] *Sent:* Monday, November 07, 2011 3:28 PM *To:* clucene-developers@lists.sourceforge.net *Subject:* Re: [CLucene-dev] Indexing a document Thank you. 2011/11/7 Itamar Syn-Hershko ita...@code972.com Yeah, shouldn't be too hard to pull off On Mon, Nov 7, 2011 at 4:17 PM, Emerson Espínola emersonespin
Re: [CLucene-dev] Indexing a document
ReaderPtr? that's an LPP class?! 2011/11/18 Emerson Espínola emersonespin...@gmail.com Hi. I'm porting BrazillianAnalyzer from Java do C++. I could compile already BrazilianStemFilter.cpp/.h. Now I'm trying to make BrazillianAnalyzer.cpp/.h compile. But I getting some errors. For example BrazillianAnalyzer.java extends from StopwordAnalyzerBase, but I couldn't find this class in CLucene. It also uses LuceneVersion, ReaderPtr, and TokenStreamPtr, but I couldn't find these classes too. Are there their implementations in CLucene with another name? I downloaded clucene-core-2.3.3.4. []'s Emerson de Lira Espínola ** emersonespin...@gmail.com https://profiles.google.com/emersonespinola/buzz?hl=pt-BR http://www.quora.com/emersonespinola http://www.facebook.com/emersonespinola http://www.linkedin.com/in/emersonespinola http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com http://emersonespinola.blogspot.com http://twitter.com/emersonespinola http://www.myebook.com/emersonespinola/ http://www.myebook.com/emersonespinola/ 2011/11/7 Bořivoj Kostka kos...@tovek.cz Well, I was talking about version 2.3; you can download it from http://sourceforge.net/projects/clucene/ I guess the stemming from this version could be adapted for 0.9 (probably some small modifications will be needed), if you cannot upgrade. ** ** Regards Borek ** ** *From:* Emerson Espínola [mailto:emersonespin...@gmail.com] *Sent:* Monday, November 07, 2011 7:08 PM *To:* clucene-developers@lists.sourceforge.net *Subject:* Re: [CLucene-dev] Indexing a document ** ** The version that I have here is clucene-core-0.9.21b and I don't see this inside src folder. My structure is clucene-core-0.9.21b\src\CLucene\analysis\standard. :( []'s Emerson de Lira Espínola [image: Obrázek byl odebrán odesílatelem.] emersonespin...@gmail.com[image: Obrázek byl odebrán odesílatelem.]https://profiles.google.com/emersonespinola/buzz?hl=pt-BR[image: Obrázek byl odebrán odesílatelem.] http://www.quora.com/emersonespinola [image: Obrázek byl odebrán odesílatelem.]http://www.facebook.com/emersonespinola [image: Obrázek byl odebrán odesílatelem.]http://www.linkedin.com/in/emersonespinola[image: Obrázek byl odebrán odesílatelem.]http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com[image: Obrázek byl odebrán odesílatelem.] http://emersonespinola.blogspot.com[image: Obrázek byl odebrán odesílatelem.] http://twitter.com/emersonespinola[image: Obrázek byl odebrán odesílatelem.]http://www.myebook.com/emersonespinola/ 2011/11/7 Bořivoj Kostka kos...@tovek.cz It is located in src/contribs-lib/CLucene/snowball You must enable building contribs-lib when you run cmake. As far as I know it is standard part of CLucene source package Borek *From:* Emerson Espínola [mailto:emersonespin...@gmail.com] *Sent:* Monday, November 07, 2011 4:53 PM *To:* clucene-developers@lists.sourceforge.net *Subject:* Re: [CLucene-dev] Indexing a document Really? That would help me a lot. Regarding stemm there are no much differences. How do I get it? Is contribs-lib a folder? If so I couldn't find it. []'s Emerson de Lira Espínola [image: Obrázek byl odebrán odesílatelem.] emersonespin...@gmail.com[image: Obrázek byl odebrán odesílatelem.]https://profiles.google.com/emersonespinola/buzz?hl=pt-BR[image: Obrázek byl odebrán odesílatelem.] http://www.quora.com/emersonespinola [image: Obrázek byl odebrán odesílatelem.]http://www.facebook.com/emersonespinola [image: Obrázek byl odebrán odesílatelem.]http://www.linkedin.com/in/emersonespinola[image: Obrázek byl odebrán odesílatelem.]http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com[image: Obrázek byl odebrán odesílatelem.] http://emersonespinola.blogspot.com[image: Obrázek byl odebrán odesílatelem.] http://twitter.com/emersonespinola[image: Obrázek byl odebrán odesílatelem.]http://www.myebook.com/emersonespinola/ 2011/11/7 Bořivoj Kostka kos...@tovek.cz Emerson There is portuguese stemmer in contribs-lib (snowball). I don't know how much written brazillian differs from portuguese, but perhaps this could be also (at least partial) solution. Regards Borek *From:* Emerson Espínola [mailto:emersonespin...@gmail.com] *Sent:* Monday, November 07, 2011 3:28 PM *To:* clucene-developers@lists.sourceforge.net *Subject:* Re: [CLucene-dev] Indexing a document Thank you. 2011/11/7 Itamar Syn-Hershko ita...@code972.com Yeah, shouldn't be too hard to pull off On Mon, Nov 7, 2011 at 4:17 PM, Emerson Espínola emersonespin...@gmail.com wrote: So the better way for me would be porting BrazillianAnalyzer from Lucene, wouldn't it? Because if I use the [1] I'll probably have problems because it depends on something (.h files
Re: [CLucene-dev] Indexing a document
Thank you Viet. When will this new version of CLucene be ready? Any idea? []'s Emerson de Lira Espínola ** emersonespin...@gmail.com https://profiles.google.com/emersonespinola/buzz?hl=pt-BR http://www.quora.com/emersonespinola http://www.facebook.com/emersonespinola http://www.linkedin.com/in/emersonespinola http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com http://emersonespinola.blogspot.com http://twitter.com/emersonespinola http://www.myebook.com/emersonespinola/ http://www.myebook.com/emersonespinola/ 2011/11/5 Veit Jahns nuncupa...@googlemail.com Hi Emerson! 2011/11/4 Emerson Espínola emersonespin...@gmail.com I'm willing to write the BrazillianAnalyzer. What should I do? Do you have an idea about how much time is needed to write it? 1 week? 2 weeks? 1 month? If the BrazillianAnalyzer from Java Lucene fits your needs, then the only thing you have to do is to port this Analyzer to CLucene. I did the same once for the GermanAnalyzer and I needed more or less a day. BTW, a while ago we started to work on a new code base for CLucene based on Lucene++. Lucene++ has already a BrazillianAnalyzer [1]. But it is based on a newer version of Java Lucene. Kind regards, Veit [1] https://github.com/luceneplusplus/LucenePlusPlus/tree/e28b15b02ff9de2208965e9af8eb80983380cdcd/src/contrib/analyzers/common/analysis/br -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers
Re: [CLucene-dev] Indexing a document
Thats going to take a while, unfortunately. LPP is already available on github, but we want to have some improvements made to its core before merging it to CLucene On Mon, Nov 7, 2011 at 4:01 PM, Emerson Espínola emersonespin...@gmail.comwrote: Thank you Viet. When will this new version of CLucene be ready? Any idea? []'s Emerson de Lira Espínola ** emersonespin...@gmail.com https://profiles.google.com/emersonespinola/buzz?hl=pt-BR http://www.quora.com/emersonespinola http://www.facebook.com/emersonespinola http://www.linkedin.com/in/emersonespinola http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com http://emersonespinola.blogspot.com http://twitter.com/emersonespinola http://www.myebook.com/emersonespinola/ http://www.myebook.com/emersonespinola/ 2011/11/5 Veit Jahns nuncupa...@googlemail.com Hi Emerson! 2011/11/4 Emerson Espínola emersonespin...@gmail.com I'm willing to write the BrazillianAnalyzer. What should I do? Do you have an idea about how much time is needed to write it? 1 week? 2 weeks? 1 month? If the BrazillianAnalyzer from Java Lucene fits your needs, then the only thing you have to do is to port this Analyzer to CLucene. I did the same once for the GermanAnalyzer and I needed more or less a day. BTW, a while ago we started to work on a new code base for CLucene based on Lucene++. Lucene++ has already a BrazillianAnalyzer [1]. But it is based on a newer version of Java Lucene. Kind regards, Veit [1] https://github.com/luceneplusplus/LucenePlusPlus/tree/e28b15b02ff9de2208965e9af8eb80983380cdcd/src/contrib/analyzers/common/analysis/br -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers
Re: [CLucene-dev] Indexing a document
So the better way for me would be porting BrazillianAnalyzer from Lucene, wouldn't it? Because if I use the [1] I'll probably have problems because it depends on something (.h files) that are not implemented already. Is that right? []'s Emerson de Lira Espínola ** emersonespin...@gmail.com https://profiles.google.com/emersonespinola/buzz?hl=pt-BR http://www.quora.com/emersonespinola http://www.facebook.com/emersonespinola http://www.linkedin.com/in/emersonespinola http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com http://emersonespinola.blogspot.com http://twitter.com/emersonespinola http://www.myebook.com/emersonespinola/ http://www.myebook.com/emersonespinola/ 2011/11/7 Itamar Syn-Hershko ita...@code972.com Thats going to take a while, unfortunately. LPP is already available on github, but we want to have some improvements made to its core before merging it to CLucene On Mon, Nov 7, 2011 at 4:01 PM, Emerson Espínola emersonespin...@gmail.com wrote: Thank you Viet. When will this new version of CLucene be ready? Any idea? []'s Emerson de Lira Espínola ** emersonespin...@gmail.com https://profiles.google.com/emersonespinola/buzz?hl=pt-BR http://www.quora.com/emersonespinola http://www.facebook.com/emersonespinola http://www.linkedin.com/in/emersonespinola http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com http://emersonespinola.blogspot.com http://twitter.com/emersonespinola http://www.myebook.com/emersonespinola/ http://www.myebook.com/emersonespinola/ 2011/11/5 Veit Jahns nuncupa...@googlemail.com Hi Emerson! 2011/11/4 Emerson Espínola emersonespin...@gmail.com I'm willing to write the BrazillianAnalyzer. What should I do? Do you have an idea about how much time is needed to write it? 1 week? 2 weeks? 1 month? If the BrazillianAnalyzer from Java Lucene fits your needs, then the only thing you have to do is to port this Analyzer to CLucene. I did the same once for the GermanAnalyzer and I needed more or less a day. BTW, a while ago we started to work on a new code base for CLucene based on Lucene++. Lucene++ has already a BrazillianAnalyzer [1]. But it is based on a newer version of Java Lucene. Kind regards, Veit [1] https://github.com/luceneplusplus/LucenePlusPlus/tree/e28b15b02ff9de2208965e9af8eb80983380cdcd/src/contrib/analyzers/common/analysis/br -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers
Re: [CLucene-dev] Indexing a document
Yeah, shouldn't be too hard to pull off On Mon, Nov 7, 2011 at 4:17 PM, Emerson Espínola emersonespin...@gmail.comwrote: So the better way for me would be porting BrazillianAnalyzer from Lucene, wouldn't it? Because if I use the [1] I'll probably have problems because it depends on something (.h files) that are not implemented already. Is that right? []'s Emerson de Lira Espínola ** emersonespin...@gmail.com https://profiles.google.com/emersonespinola/buzz?hl=pt-BR http://www.quora.com/emersonespinola http://www.facebook.com/emersonespinola http://www.linkedin.com/in/emersonespinola http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com http://emersonespinola.blogspot.com http://twitter.com/emersonespinola http://www.myebook.com/emersonespinola/ http://www.myebook.com/emersonespinola/ 2011/11/7 Itamar Syn-Hershko ita...@code972.com Thats going to take a while, unfortunately. LPP is already available on github, but we want to have some improvements made to its core before merging it to CLucene On Mon, Nov 7, 2011 at 4:01 PM, Emerson Espínola emersonespin...@gmail.com wrote: Thank you Viet. When will this new version of CLucene be ready? Any idea? []'s Emerson de Lira Espínola ** emersonespin...@gmail.com https://profiles.google.com/emersonespinola/buzz?hl=pt-BR http://www.quora.com/emersonespinola http://www.facebook.com/emersonespinola http://www.linkedin.com/in/emersonespinola http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com http://emersonespinola.blogspot.com http://twitter.com/emersonespinola http://www.myebook.com/emersonespinola/ http://www.myebook.com/emersonespinola/ 2011/11/5 Veit Jahns nuncupa...@googlemail.com Hi Emerson! 2011/11/4 Emerson Espínola emersonespin...@gmail.com I'm willing to write the BrazillianAnalyzer. What should I do? Do you have an idea about how much time is needed to write it? 1 week? 2 weeks? 1 month? If the BrazillianAnalyzer from Java Lucene fits your needs, then the only thing you have to do is to port this Analyzer to CLucene. I did the same once for the GermanAnalyzer and I needed more or less a day. BTW, a while ago we started to work on a new code base for CLucene based on Lucene++. Lucene++ has already a BrazillianAnalyzer [1]. But it is based on a newer version of Java Lucene. Kind regards, Veit [1] https://github.com/luceneplusplus/LucenePlusPlus/tree/e28b15b02ff9de2208965e9af8eb80983380cdcd/src/contrib/analyzers/common/analysis/br -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers
Re: [CLucene-dev] Indexing a document
Really? That would help me a lot. Regarding stemm there are no much differences. How do I get it? Is contribs-lib a folder? If so I couldn't find it. []'s Emerson de Lira Espínola ** emersonespin...@gmail.com https://profiles.google.com/emersonespinola/buzz?hl=pt-BR http://www.quora.com/emersonespinola http://www.facebook.com/emersonespinola http://www.linkedin.com/in/emersonespinola http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com http://emersonespinola.blogspot.com http://twitter.com/emersonespinola http://www.myebook.com/emersonespinola/ http://www.myebook.com/emersonespinola/ 2011/11/7 Bořivoj Kostka kos...@tovek.cz Emerson There is portuguese stemmer in contribs-lib (snowball). I don’t know how much written brazillian differs from portuguese, but perhaps this could be also (at least partial) solution. Regards Borek ** ** *From:* Emerson Espínola [mailto:emersonespin...@gmail.com] *Sent:* Monday, November 07, 2011 3:28 PM *To:* clucene-developers@lists.sourceforge.net *Subject:* Re: [CLucene-dev] Indexing a document ** ** Thank you. 2011/11/7 Itamar Syn-Hershko ita...@code972.com Yeah, shouldn't be too hard to pull off ** ** On Mon, Nov 7, 2011 at 4:17 PM, Emerson Espínola emersonespin...@gmail.com wrote: So the better way for me would be porting BrazillianAnalyzer from Lucene, wouldn't it? Because if I use the [1] I'll probably have problems because it depends on something (.h files) that are not implemented already. Is that right? 2011/11/7 Itamar Syn-Hershko ita...@code972.com Thats going to take a while, unfortunately. LPP is already available on github, but we want to have some improvements made to its core before merging it to CLucene ** ** On Mon, Nov 7, 2011 at 4:01 PM, Emerson Espínola emersonespin...@gmail.com wrote: Thank you Viet. When will this new version of CLucene be ready? Any idea? []'s Emerson de Lira Espínola 2011/11/5 Veit Jahns nuncupa...@googlemail.com Hi Emerson! 2011/11/4 Emerson Espínola emersonespin...@gmail.com I'm willing to write the BrazillianAnalyzer. What should I do? Do you have an idea about how much time is needed to write it? 1 week? 2 weeks? 1 month? If the BrazillianAnalyzer from Java Lucene fits your needs, then the only thing you have to do is to port this Analyzer to CLucene. I did the same once for the GermanAnalyzer and I needed more or less a day. BTW, a while ago we started to work on a new code base for CLucene based on Lucene++. Lucene++ has already a BrazillianAnalyzer [1]. But it is based on a newer version of Java Lucene. Kind regards, Veit [1] https://github.com/luceneplusplus/LucenePlusPlus/tree/e28b15b02ff9de2208965e9af8eb80983380cdcd/src/contrib/analyzers/common/analysis/br ** ** -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers ** ** -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers ** ** -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers ** ** -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers ** ** -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1
Re: [CLucene-dev] Indexing a document
It is located in src/contribs-lib/CLucene/snowball You must enable building contribs-lib when you run cmake. As far as I know it is standard part of CLucene source package Borek From: Emerson Espínola [mailto:emersonespin...@gmail.com] Sent: Monday, November 07, 2011 4:53 PM To: clucene-developers@lists.sourceforge.net Subject: Re: [CLucene-dev] Indexing a document Really? That would help me a lot. Regarding stemm there are no much differences. How do I get it? Is contribs-lib a folder? If so I couldn't find it. []'s Emerson de Lira Espínola [Obrázek byl odebrán odesílatelem.]mailto:emersonespin...@gmail.com[Obrázek byl odebrán odesílatelem.]https://profiles.google.com/emersonespinola/buzz?hl=pt-BR[Obrázek byl odebrán odesílatelem.]http://www.quora.com/emersonespinola [Obrázek byl odebrán odesílatelem.] http://www.facebook.com/emersonespinola [Obrázek byl odebrán odesílatelem.] http://www.linkedin.com/in/emersonespinola [Obrázek byl odebrán odesílatelem.] http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com [Obrázek byl odebrán odesílatelem.] http://emersonespinola.blogspot.com [Obrázek byl odebrán odesílatelem.] http://twitter.com/emersonespinola [Obrázek byl odebrán odesílatelem.] http://www.myebook.com/emersonespinola/ 2011/11/7 Bořivoj Kostka kos...@tovek.czmailto:kos...@tovek.cz Emerson There is portuguese stemmer in contribs-lib (snowball). I don’t know how much written brazillian differs from portuguese, but perhaps this could be also (at least partial) solution. Regards Borek From: Emerson Espínola [mailto:emersonespin...@gmail.commailto:emersonespin...@gmail.com] Sent: Monday, November 07, 2011 3:28 PM To: clucene-developers@lists.sourceforge.netmailto:clucene-developers@lists.sourceforge.net Subject: Re: [CLucene-dev] Indexing a document Thank you. 2011/11/7 Itamar Syn-Hershko ita...@code972.commailto:ita...@code972.com Yeah, shouldn't be too hard to pull off On Mon, Nov 7, 2011 at 4:17 PM, Emerson Espínola emersonespin...@gmail.commailto:emersonespin...@gmail.com wrote: So the better way for me would be porting BrazillianAnalyzer from Lucene, wouldn't it? Because if I use the [1] I'll probably have problems because it depends on something (.h files) that are not implemented already. Is that right? 2011/11/7 Itamar Syn-Hershko ita...@code972.commailto:ita...@code972.com Thats going to take a while, unfortunately. LPP is already available on github, but we want to have some improvements made to its core before merging it to CLucene On Mon, Nov 7, 2011tel:2011 at 4:01 PM, Emerson Espínola emersonespin...@gmail.commailto:emersonespin...@gmail.com wrote: Thank you Viet. When will this new version of CLucene be ready? Any idea? []'s Emerson de Lira Espínola 2011/11/5 Veit Jahns nuncupa...@googlemail.commailto:nuncupa...@googlemail.com Hi Emerson! 2011/11/4 Emerson Espínola emersonespin...@gmail.commailto:emersonespin...@gmail.com I'm willing to write the BrazillianAnalyzer. What should I do? Do you have an idea about how much time is needed to write it? 1 week? 2 weeks? 1 month? If the BrazillianAnalyzer from Java Lucene fits your needs, then the only thing you have to do is to port this Analyzer to CLucene. I did the same once for the GermanAnalyzer and I needed more or less a day. BTW, a while ago we started to work on a new code base for CLucene based on Lucene++. Lucene++ has already a BrazillianAnalyzer [1]. But it is based on a newer version of Java Lucene. Kind regards, Veit [1] https://github.com/luceneplusplus/LucenePlusPlus/tree/e28b15b02ff9de2208965e9af8eb80983380cdcd/src/contrib/analyzers/common/analysis/br -- RSA(R) Conference 2012tel:2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.netmailto:CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers -- RSA(R) Conference 2012tel:2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.netmailto:CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers -- RSA(R) Conference 2012tel:2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.netmailto:CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers -- RSA(R) Conference
Re: [CLucene-dev] Indexing a document
Well, I was talking about version 2.3; you can download it from http://sourceforge.net/projects/clucene/ I guess the stemming from this version could be adapted for 0.9 (probably some small modifications will be needed), if you cannot upgrade. Regards Borek From: Emerson Espínola [mailto:emersonespin...@gmail.com] Sent: Monday, November 07, 2011 7:08 PM To: clucene-developers@lists.sourceforge.net Subject: Re: [CLucene-dev] Indexing a document The version that I have here is clucene-core-0.9.21b and I don't see this inside src folder. My structure is clucene-core-0.9.21b\src\CLucene\analysis\standard. :( []'s Emerson de Lira Espínola [Obrázek byl odebrán odesílatelem.]mailto:emersonespin...@gmail.com[Obrázek byl odebrán odesílatelem.]https://profiles.google.com/emersonespinola/buzz?hl=pt-BR[Obrázek byl odebrán odesílatelem.]http://www.quora.com/emersonespinola [Obrázek byl odebrán odesílatelem.] http://www.facebook.com/emersonespinola [Obrázek byl odebrán odesílatelem.] http://www.linkedin.com/in/emersonespinola [Obrázek byl odebrán odesílatelem.] http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com [Obrázek byl odebrán odesílatelem.] http://emersonespinola.blogspot.com [Obrázek byl odebrán odesílatelem.] http://twitter.com/emersonespinola [Obrázek byl odebrán odesílatelem.] http://www.myebook.com/emersonespinola/ 2011/11/7 Bořivoj Kostka kos...@tovek.czmailto:kos...@tovek.cz It is located in src/contribs-lib/CLucene/snowball You must enable building contribs-lib when you run cmake. As far as I know it is standard part of CLucene source package Borek From: Emerson Espínola [mailto:emersonespin...@gmail.commailto:emersonespin...@gmail.com] Sent: Monday, November 07, 2011 4:53 PM To: clucene-developers@lists.sourceforge.netmailto:clucene-developers@lists.sourceforge.net Subject: Re: [CLucene-dev] Indexing a document Really? That would help me a lot. Regarding stemm there are no much differences. How do I get it? Is contribs-lib a folder? If so I couldn't find it. []'s Emerson de Lira Espínola [Obrázek byl odebrán odesílatelem.]mailto:emersonespin...@gmail.com[Obrázek byl odebrán odesílatelem.]https://profiles.google.com/emersonespinola/buzz?hl=pt-BR[Obrázek byl odebrán odesílatelem.]http://www.quora.com/emersonespinola [Obrázek byl odebrán odesílatelem.] http://www.facebook.com/emersonespinola [Obrázek byl odebrán odesílatelem.] http://www.linkedin.com/in/emersonespinola [Obrázek byl odebrán odesílatelem.] http://spaces.live.com/profile.aspx?mem=emersonespin...@hotmail.com [Obrázek byl odebrán odesílatelem.] http://emersonespinola.blogspot.com [Obrázek byl odebrán odesílatelem.] http://twitter.com/emersonespinola [Obrázek byl odebrán odesílatelem.] http://www.myebook.com/emersonespinola/ 2011/11/7 Bořivoj Kostka kos...@tovek.czmailto:kos...@tovek.cz Emerson There is portuguese stemmer in contribs-lib (snowball). I don’t know how much written brazillian differs from portuguese, but perhaps this could be also (at least partial) solution. Regards Borek From: Emerson Espínola [mailto:emersonespin...@gmail.commailto:emersonespin...@gmail.com] Sent: Monday, November 07, 2011 3:28 PM To: clucene-developers@lists.sourceforge.netmailto:clucene-developers@lists.sourceforge.net Subject: Re: [CLucene-dev] Indexing a document Thank you. 2011/11/7 Itamar Syn-Hershko ita...@code972.commailto:ita...@code972.com Yeah, shouldn't be too hard to pull off On Mon, Nov 7, 2011 at 4:17 PM, Emerson Espínola emersonespin...@gmail.commailto:emersonespin...@gmail.com wrote: So the better way for me would be porting BrazillianAnalyzer from Lucene, wouldn't it? Because if I use the [1] I'll probably have problems because it depends on something (.h files) that are not implemented already. Is that right? 2011/11/7 Itamar Syn-Hershko ita...@code972.commailto:ita...@code972.com Thats going to take a while, unfortunately. LPP is already available on github, but we want to have some improvements made to its core before merging it to CLucene On Mon, Nov 7, 2011tel:2011 at 4:01 PM, Emerson Espínola emersonespin...@gmail.commailto:emersonespin...@gmail.com wrote: Thank you Viet. When will this new version of CLucene be ready? Any idea? []'s Emerson de Lira Espínola 2011/11/5 Veit Jahns nuncupa...@googlemail.commailto:nuncupa...@googlemail.com Hi Emerson! 2011/11/4 Emerson Espínola emersonespin...@gmail.commailto:emersonespin...@gmail.com I'm willing to write the BrazillianAnalyzer. What should I do? Do you have an idea about how much time is needed to write it? 1 week? 2 weeks? 1 month? If the BrazillianAnalyzer from Java Lucene fits your needs, then the only thing you have to do is to port this Analyzer to CLucene. I did the same once for the GermanAnalyzer and I needed more or less a day. BTW, a while ago we started to work on a new code base for CLucene based on Lucene++. Lucene++ has already a BrazillianAnalyzer [1]. But it is based on a newer version
Re: [CLucene-dev] Indexing a document
Hi Emerson! 2011/11/4 Emerson Espínola emersonespin...@gmail.com I'm willing to write the BrazillianAnalyzer. What should I do? Do you have an idea about how much time is needed to write it? 1 week? 2 weeks? 1 month? If the BrazillianAnalyzer from Java Lucene fits your needs, then the only thing you have to do is to port this Analyzer to CLucene. I did the same once for the GermanAnalyzer and I needed more or less a day. BTW, a while ago we started to work on a new code base for CLucene based on Lucene++. Lucene++ has already a BrazillianAnalyzer [1]. But it is based on a newer version of Java Lucene. Kind regards, Veit [1] https://github.com/luceneplusplus/LucenePlusPlus/tree/e28b15b02ff9de2208965e9af8eb80983380cdcd/src/contrib/analyzers/common/analysis/br -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers
Re: [CLucene-dev] Indexing a document
Hi Emerson! 2011/5/27 Emerson Espínola emersonespin...@gmail.com Hi Veit. Thank yo very much for your answer. Great explanation. You don't wonder how much you're helping me. You are welcome! 1. I'll try english documents. 2. Ok. 3. Does BrazilianAnalyzer work similar to StandardAnalyzer? If so, that's what I want. I'm from Brazil. :) There are no much differences between portuguese from Portugal and portuguese from Brazil. But if there is already an analyzer for portuguese from Brazil that's perfect for me. According to the source code, it is based on the StandardAnalyzer. It uses the StandardTokenizer and the StandardFilter, and adds a stopword filter and a brazilian stemming filter. Kind regards, Veit -- vRanger cuts backup time in half-while increasing security. With the market-leading solution for virtual backup and recovery, you get blazing-fast, flexible, and affordable data protection. Download your free trial now. http://p.sf.net/sfu/quest-d2dcopy1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers
Re: [CLucene-dev] Indexing a document
Hi Emerson! 2011/5/26 Emerson Espínola emersonespin...@gmail.com void Index::NewDoc (const char *contents) Using char here means that you use CLucene in an ASCII mode. Right? { Document* doc = _CLNEW Document(); doc-add(*_CLNEW Field(_T(contents), _T(contents), Field::STORE_YES | Field::INDEX_TOKENIZED)); _T(contents) isn't needed here. Just passing the value of contents to the Field constructor is sufficient. this-writer-addDocument(doc); _CLDELETE(doc); As far as I know, writer---or something inside writer---takes ownership of doc. So, by deleting it you delete a object that is still referenced. Maybe this is the reason for you access violation. } Consider this-writer already instanciated in my class Index. I debuged until I couldn't go deeper and O stopped in the line which is throwing the exception: line 282 from DocumentWriter.cpp method invertDocument(). void DocumentWriter::invertDocument(const Document* doc) { ... CL_NS(analysis)::TokenStream* stream = analyzer-tokenStream(fieldName, reader); ... } Which version of CLucene do you use? I see there an assertion. Kind regards, Veit -- vRanger cuts backup time in half-while increasing security. With the market-leading solution for virtual backup and recovery, you get blazing-fast, flexible, and affordable data protection. Download your free trial now. http://p.sf.net/sfu/quest-d2dcopy1 ___ CLucene-developers mailing list CLucene-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/clucene-developers