Re: Solr Compile error
Hi there thanx for your reply. I'm using ivy 2.3.0 these files that were not accessible: so I downloaded them from: - http://maven.restlet.com/org/restlet/jee/org.restlet/2.1.1/org.restlet.jar - http://maven.restlet.com/org/restlet/jee/org.restlet.ext.servlet/2.1.1/org.restlet.ext.servlet.jar - http://mvnrepository.com/artifact/com.codahale.metrics/metrics-core/3.0.1/metrics-core.jar - http://mvnrepository.com/artifact/com.codahale.metrics/metrics-healthchecks/3.0.1/metrics-healthchecks.jar - http://www.eu.apache.org/dist/avro/avro-1.7.7/java/avro.jar and put them in the" \.ivy2\cache\ " in the home directory. after that it compiles successfully. It could be some firewall settings that blocked the download of those files. thanks again. Cheers,
Solr Compile error
Hi there when I "ant compile" solr 4.10.x, I got the following error: -- ivy-availability-check: [echo] Building solr-core... ivy-fail: ivy-fail: ivy-configure: [ivy:configure] :: loading settings :: file = E:\Solr\Solr_4_10_Source\lucene\ivy-settings.xml resolve: [ivy:retrieve] [ivy:retrieve] :: problems summary :: [ivy:retrieve] WARNINGS [ivy:retrieve] [FAILED ] org.restlet.jee#org.restlet;2.1.1!org.restlet.jar: (0ms) [ivy:retrieve] shared: tried [ivy:retrieve] C:\Users\Me.MyPC\.ivy2\shared\org.restlet.jee\org.restlet\2.1.1\jars\org.restlet.jar [ivy:retrieve] public: tried [ivy:retrieve] http://repo1.maven.org/maven2/org/restlet/jee/org.restlet/2.1.1/org.restlet-2.1.1.jar [ivy:retrieve] [FAILED ] org.restlet.jee#org.restlet.ext.servlet;2.1.1!org.restlet.ext.servlet.jar: (0ms) [ivy:retrieve] shared: tried [ivy:retrieve] C:\Users\Me.MyPC\.ivy2\shared\org.restlet.jee\org.restlet.ext.servlet\2.1.1\jars\org.restlet.ext.servlet.jar [ivy:retrieve] public: tried [ivy:retrieve] http://repo1.maven.org/maven2/org/restlet/jee/org.restlet.ext.servlet/2.1.1/org.restlet.ext.servlet-2.1.1.jar [ivy:retrieve] :: [ivy:retrieve] :: FAILED DOWNLOADS:: [ivy:retrieve] :: ^ see resolution messages for details ^ :: [ivy:retrieve] :: [ivy:retrieve] :: org.restlet.jee#org.restlet;2.1.1!org.restlet.jar [ivy:retrieve] :: org.restlet.jee#org.restlet.ext.servlet;2.1.1!org.restlet.ext.servlet.jar [ivy:retrieve] :: [ivy:retrieve] [ivy:retrieve] [ivy:retrieve] :: USE VERBOSE OR DEBUG MESSAGE LEVEL FOR MORE DETAILS BUILD FAILED E:\Solr\Solr_4_10_Source\build.xml:119: The following error occurred while executing this line: E:\Solr\Solr_4_10_Source\solr\common-build.xml:365: The following error occurred while executing this line: E:\Solr\Solr_4_10_Source\solr\core\build.xml:65: impossible to resolve dependencies: resolve failed - see output for details Total time: 10 seconds --- what I noticed is that this link does not exist. " http://repo1.maven.org/maven2/org/restlet/jee/org.restlet.ext.servlet/2.1.1/org.restlet.ext.servlet-2.1.1.jar " even when browsing "http://repo1.maven.org/";, I got a page that says: Browsing for this directory has been disabled. View this directory's contents on http://search.maven.org instead. any ideas how to overcome this error?
Re: Issue with multivalued fields in UIMA
Hi there I ran into the same problem. would you please explain how did you solve it. Thanks, Darx On Fri, Aug 29, 2014 at 11:26 PM, Tommaso Teofili wrote: > Hi, > > it'd be good if you could open a Jira issues (with a patch preferably) > describing your findings. > > Thanks, > Tommaso > > > 2014-08-29 18:34 GMT+02:00 mkhordad : > > > I solved it. It was caused by a bug in UIMAUpdateRequestProcessor. > > > > > > > > -- > > View this message in context: > > > http://lucene.472066.n3.nabble.com/Issue-with-multivalued-fields-in-UIMA-tp4155609p4155864.html > > Sent from the Solr - User mailing list archive at Nabble.com. > > >
Re: UIMA Dictionary Annotator
It was some configuration errors this is the new configuration AggregateAE.xml false name org.apache.uima.DictionaryEntry coveredText uimaKeyWords it works fine On Sun, Sep 21, 2014 at 1:23 PM, Darx Oman wrote: > Hi there > I trying to use UIMA Dictionary Annotator with Solr 4.10.0 > > I did the following: > > > 1) added a field to the schema > multiValued="true" /> > > > > 2)Modified solrConfig as follow > > > uima > > > > >class="org.apache.solr.uima.processor.UIMAUpdateRequestProcessorFactory"> > > > DictionaryAnnotator.xml > > > false > > text > > > > > > org.apache.uima.DictionaryEntry > > tokenType > uimaKeyWords > > > > > > > > > > > > 3) copied the following jar files to solr\collection1\lib folder > lucene-analyzers-uima-4.10.0.jar > solr-uima-4.10.0.jar > uima-an-dictionary.jar > uimaj-core-2.3.1.jar > WhitespaceTokenizer-2.3.1.jar > xmlbeans-2.4.0.jar > > > 4)Added some entries to dictionary.xml > > iPod > > > samsung > > > but when I indexed xml documents from example\exampledocs > no annotation happened > > this is the xml response I got > > > IW-02 > iPod & iPod Mini USB 2.0 Cable > Belkin > belkin > > electronics > connector > > > car power adapter for iPod, white > > 2.0 > 11.5 > 11.50,USD > 1 > false > 37.7752,-122.4232 > 2006-02-14T23:55:59Z > 1479845823070076928 > > > > what might went wrong? > > am I missing something? > > >
UIMA Dictionary Annotator
Hi there I trying to use UIMA Dictionary Annotator with Solr 4.10.0 I did the following: 1) added a field to the schema 2)Modified solrConfig as follow uima DictionaryAnnotator.xml false text org.apache.uima.DictionaryEntry tokenType uimaKeyWords 3) copied the following jar files to solr\collection1\lib folder lucene-analyzers-uima-4.10.0.jar solr-uima-4.10.0.jar uima-an-dictionary.jar uimaj-core-2.3.1.jar WhitespaceTokenizer-2.3.1.jar xmlbeans-2.4.0.jar 4)Added some entries to dictionary.xml iPod samsung but when I indexed xml documents from example\exampledocs no annotation happened this is the xml response I got IW-02 iPod & iPod Mini USB 2.0 Cable Belkin belkin electronics connector car power adapter for iPod, white 2.0 11.5 11.50,USD 1 false 37.7752,-122.4232 2006-02-14T23:55:59Z 1479845823070076928 what might went wrong? am I missing something?
Re: Sending pdf files to slor for indexing
Hi there you can use DIH with Tika
Re: Indexing Best Practice
Hi Lance thanx for your reply, but I have a question is this patch committed to trunk?
Indexing Best Practice
Hi guys I'm wondering how to best configure solr to fulfills my requirements. I'm indexing data from 2 data sources: 1- Database 2- PDF files (password encrypted) Every file has related information stored in the database. Both the file content and the related database fields must be indexed as one document in solr. Among the DB data is *per-user* permissions for every document. The file contents nearly never change, on the other hand, the DB data and especially the permissions change very frequently which require me to re-index everything for every modified document. My problem is in process of decrypting the PDF files before re-indexing them which takes too much time for a large number of documents, it could span to days in full re-indexing. What I'm trying to accomplish is eliminating the need to re-index the PDF content if not changed even if the DB data changed. I know this is not possible in solr, because solr doesn't update documents. So how to best accomplish this: Can I use 2 indexes one for PDF contents and the other for DB data and have a common id field for both as a link between them, *and results are treated as one Document*?
Re: How to index PDF file stored in SQL Server 2008
Hi again what you are missing is field mapping no need for TikaEntityProcessor since you are not accessing pdf files
Re: How to index PDF file stored in SQL Server 2008
Hi there TikaEntityProcessor is available as part of DIH-extras*.jar in 3.x and 4.0
Re: Memory leak for Multiple fields faceted searching
How much memory you specify for java heap? -Xmx...
Re: memory not getting released in tomcat after pushing large documents
Hi guys I'm facing a simillar porblem and i find out it is caused by MS SQL that is running in the same machine by just restarting MS SQL service, memory goes down.
Re: Adding the suggest component
Hi Solr 3.x and 4.x (trunk) include a component called Suggester http://wiki.apache.org/solr/Suggester
Synonyms question
Hi guys How to put this in synonyms.txt US USA United States of America
Re: upgrading to Tika 0.9 on Solr 1.4.1
hi if you want to index pdf files then use tika 0.6 because 0.7 and 0.8 does not detect the correctly the pdfParse
Re: UIMA Error
Hi How to apply the AlchemyAPIAnnotator? will this helps me with the *NamedEntityExtractionAnnotator?* *thanx a lot Tommaso for you time*
Re: UIMA Error
Hi Tommaso yes my server isn't connected to the internet. what other UIMA annotators that I can run which doesn't require an internet connection?
Re: DataImportHandler: no queries when using entity=something
sorry add to url "&clean=false" http://solr:8983/solr/dataimport?command=full-import&entity=games&; clean=false this is by mistake it was intended for somebody else
Re: Solr Indexing Performance
I indexed 1000 pdf file with the same configuration, it completed in about 32 min.
UIMA Error
hi guys i'm trying to use UIMA contrib, but i got the following error ... INFO: [] webapp=/solr path=/select params={clean=false&commit=true&command=status&qt=/dataimport} status=0 QTime=0 05/02/2011 10:54:53 ص org.apache.solr.uima.processor.UIMAUpdateRequestProcessor processText INFO: Analazying text 05/02/2011 10:54:53 ص org.apache.solr.uima.processor.ae.OverridingParamsAEProvider getAE INFO: setting cat_apikey : 0449a72fe7ec5cb3497f14e77f338c86f2fe 05/02/2011 10:54:53 ص org.apache.solr.uima.processor.ae.OverridingParamsAEProvider getAE INFO: setting keyword_apikey : 0449a72fe7ec5cb3497f14e77f338c86f2fe 05/02/2011 10:54:53 ص org.apache.solr.uima.processor.ae.OverridingParamsAEProvider getAE INFO: setting concept_apikey : 0449a72fe7ec5cb3497f14e77f338c86f2fe 05/02/2011 10:54:53 ص org.apache.solr.uima.processor.ae.OverridingParamsAEProvider getAE INFO: setting entities_apikey : 0449a72fe7ec5cb3497f14e77f338c86f2fe 05/02/2011 10:54:53 ص org.apache.solr.uima.processor.ae.OverridingParamsAEProvider getAE INFO: setting lang_apikey : 0449a72fe7ec5cb3497f14e77f338c86f2fe 05/02/2011 10:54:53 ص org.apache.solr.uima.processor.ae.OverridingParamsAEProvider getAE INFO: setting oc_licenseID : g6h9zamsdtwhb93nc247ecrs 05/02/2011 10:54:53 ص WhitespaceTokenizer initialize INFO: "Whitespace tokenizer successfully initialized" 05/02/2011 10:54:56 ص org.apache.solr.core.SolrCore execute INFO: [] webapp=/solr path=/select params={clean=false&commit=true&command=status&qt=/dataimport} status=0 QTime=0 05/02/2011 10:54:57 ص WhitespaceTokenizer typeSystemInit INFO: "Whitespace tokenizer typesystem initialized" 05/02/2011 10:54:57 ص WhitespaceTokenizer process INFO: "Whitespace tokenizer starts processing" 05/02/2011 10:54:57 ص WhitespaceTokenizer process INFO: "Whitespace tokenizer finished processing" 05/02/2011 10:54:57 ص org.apache.uima.analysis_engine.impl.PrimitiveAnalysisEngine_impl callAnalysisComponentProcess(405) SEVERE: Exception occurred org.apache.uima.analysis_engine.AnalysisEngineProcessException at org.apache.uima.annotator.calais.OpenCalaisAnnotator.process(OpenCalaisAnnotator.java:206) at org.apache.uima.analysis_component.CasAnnotator_ImplBase.process(CasAnnotator_ImplBase.java:56) at org.apache.uima.analysis_engine.impl.PrimitiveAnalysisEngine_impl.callAnalysisComponentProcess(PrimitiveAnalysisEngine_impl.java:377) at org.apache.uima.analysis_engine.impl.PrimitiveAnalysisEngine_impl.processAndOutputNewCASes(PrimitiveAnalysisEngine_impl.java:295) at org.apache.uima.analysis_engine.asb.impl.ASB_impl$AggregateCasIterator.processUntilNextOutputCas(ASB_impl.java:567) at org.apache.uima.analysis_engine.asb.impl.ASB_impl$AggregateCasIterator.(ASB_impl.java:409) at org.apache.uima.analysis_engine.asb.impl.ASB_impl.process(ASB_impl.java:342) at org.apache.uima.analysis_engine.impl.AggregateAnalysisEngine_impl.processAndOutputNewCASes(AggregateAnalysisEngine_impl.java:267) at org.apache.uima.analysis_engine.impl.AnalysisEngineImplBase.process(AnalysisEngineImplBase.java:267) at org.apache.uima.analysis_engine.impl.AnalysisEngineImplBase.process(AnalysisEngineImplBase.java:280) at org.apache.solr.uima.processor.UIMAUpdateRequestProcessor.processText(UIMAUpdateRequestProcessor.java:122) at org.apache.solr.uima.processor.UIMAUpdateRequestProcessor.processAdd(UIMAUpdateRequestProcessor.java:69) at org.apache.solr.handler.dataimport.SolrWriter.upload(SolrWriter.java:75) at org.apache.solr.handler.dataimport.DataImportHandler$1.upload(DataImportHandler.java:291) at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:626) at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:266) at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:185) at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:335) at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:393) at org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:374) Caused by: java.net.UnknownHostException: api.opencalais.com at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:177) at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366) at java.net.Socket.connect(Socket.java:529) at java.net.Socket.connect(Socket.java:478) at sun.net.NetworkClient.doConnect(NetworkClient.java:163) at sun.net.www.http.HttpClient.openServer(HttpClient.java:394) at sun.net.www.http.HttpClient.openServer(HttpClient.java:529) at sun.net.www.http.HttpClient.(HttpClient.java:233) at sun.net.www.http.HttpClient.New(HttpClient.java:306) at sun.net.www.http.HttpClient.New(HttpClient.java:323) at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:975) at sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:916) at sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:841) at sun.net.www.protocol.http.HttpURLConnection.getOutputStr
Re: DataImportHandler: no queries when using entity=something
check your log file you might have a connection problem
Re: DataImportHandler: no queries when using entity=something
add to url "&clean=false" http://solr:8983/solr/dataimport?command=full-import&entity=games&; clean=false *clean* : (default 'true'). Tells whether to clean up the index before the indexing is started >
Re: [POLL] Where do you get Lucene/Solr from? Maven? ASF Mirrors?
[x] ASF Mirrors (linked in our release announcements or via the Lucene website) [] Maven repository (whether you use Maven, Ant+Ivy, Buildr, etc.) [x] I/we build them from source via an SVN/Git checkout. [] Other (someone in your company mirrors them internally or via a downstream project)
Re: Index MS office
take a look at DIH http://wiki.apache.org/solr/DataImportHandler
Re: Solr Indexing Performance
Thanx Tomas I'll try with different configuration
Re: solr - uima error
Thanx Tommaso now solr starts OK. but what really I don't comprehend is how my text get annotated by UIMA because when I started indexing none of the "UIMA" new fields get data If you please point me to some kind of articles that explain how this thing works...
Solr Indexing Performance
Hi guys I'm running a solr instance (trunk) in my dev. Server to test my configuration. I'm doing a DIH full import to index 49 PDF files with their corresponding database records. Both the PDF files and database are local in the server. *Server : * · Windows 2008 R2 · MS SQL server 2008 R2 · 16 core processor · 16 GB ram *Tomcat (7.0.5) : * · Set JAVA_OPTS = %JAVA_OPTS% -Xms1024M -Xmx8192M *Solrconfig:* · Main index configurations 2048 50 *DIH configuration:* · 2 data sources defined jdbcDataSource and BinFileDataSource · One main entity with 3 sub entities · Total schema fields are 8, three of which are text type and multivalued. *My DIH import Status Messages:* · Total Requests made to DataSource = 99** · Total Rows Fetched = 2124** · Total DocumentsProcessed = 49** · Time Taken = *0:2:3:880*** * Is this time reasonable or it can be improved?*
Re: solr - uima error
Hi I already copied "apache-solr-uima-4.0-SNAPSHOT.jar"tosolr\lib but what causing the error is this < updateRequestProcessorChain name="uima"> this class is not found processor class="org.apache.solr.uima.processor.UIMAProcessorFactory
solr - uima error
I tried to do the uima integration with solr I followed the steps in the readme file using the solr example folder as my solr instance but when I started solr, I got this error message HTTP ERROR 500 Problem accessing /solr/admin/. Reason: Severe errors in solr configuration. Check your log files for more detailed information on what may be wrong. - org.apache.solr.common.SolrException: Error loading class 'org.apache.solr.uima.processor.UIMAProcessorFactory' at org.apache.solr.core.SolrResourceLoader.findClass(SolrResourceLoader.java:408) at org.apache.solr.core.SolrCore.createInstance(SolrCore.java:399) at org.apache.solr.core.SolrCore.createInitInstance(SolrCore.java:422) at org.apache.solr.core.SolrCore.initPlugins(SolrCore.java:1486) at org.apache.solr.update.processor.UpdateRequestProcessorChain.init(UpdateRequestProcessorChain.java:57) at org.apache.solr.core.SolrCore.createInitInstance(SolrCore.java:424) at org.apache.solr.core.SolrCore.initPlugins(SolrCore.java:1470) at org.apache.solr.core.SolrCore.initPlugins(SolrCore.java:1464) at org.apache.solr.core.SolrCore.loadUpdateProcessorChains(SolrCore.java:572) at org.apache.solr.core.SolrCore.(SolrCore.java:514) at org.apache.solr.core.CoreContainer.create(CoreContainer.java:647) at org.apache.solr.core.CoreContainer.load(CoreContainer.java:399) at org.apache.solr.core.CoreContainer.load(CoreContainer.java:283) at org.apache.solr.core.CoreContainer$Initializer.initialize(CoreContainer.java:232) at org.apache.solr.servlet.SolrDispatchFilter.init(SolrDispatchFilter.java:85) at org.mortbay.jetty.servlet.FilterHolder.doStart(FilterHolder.java:97) at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50) at org.mortbay.jetty.servlet.ServletHandler.initialize(ServletHandler.java:713) at org.mortbay.jetty.servlet.Context.startContext(Context.java:140) at org.mortbay.jetty.webapp.WebAppContext.startContext(WebAppContext.java:1282) at org.mortbay.jetty.handler.ContextHandler.doStart(ContextHandler.java:518) at org.mortbay.jetty.webapp.WebAppContext.doStart(WebAppContext.java:499) at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50) at org.mortbay.jetty.handler.HandlerCollection.doStart(HandlerCollection.java:152) at org.mortbay.jetty.handler.ContextHandlerCollection.doStart(ContextHandlerCollection.java:156) at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50) at org.mortbay.jetty.handler.HandlerCollection.doStart(HandlerCollection.java:152) at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50) at org.mortbay.jetty.handler.HandlerWrapper.doStart(HandlerWrapper.java:130) at org.mortbay.jetty.Server.doStart(Server.java:224) at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50) at org.mortbay.xml.XmlConfiguration.main(XmlConfiguration.java:985) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source) at java.lang.reflect.Method.invoke(Unknown Source) at org.mortbay.start.Main.invokeMain(Main.java:194) at org.mortbay.start.Main.start(Main.java:534) at org.mortbay.start.Main.start(Main.java:441) at org.mortbay.start.Main.main(Main.java:119) Caused by: java.lang.ClassNotFoundException: org.apache.solr.uima.processor.UIMAProcessorFactory at java.net.URLClassLoader$1.run(Unknown Source) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(Unknown Source) at java.lang.ClassLoader.loadClass(Unknown Source) at java.net.FactoryURLClassLoader.loadClass(Unknown Source) at java.lang.ClassLoader.loadClass(Unknown Source) at java.lang.Class.forName0(Native Method) at java.lang.Class.forName(Unknown Source) at org.apache.solr.core.SolrResourceLoader.findClass(SolrResourceLoader.java:392) ... 39 more -- *Powered by Jetty://* I Searched for the class "IMAProcessorFactory" but I didn't find it . any body can help ? By the way I'm using solr trunk checked out today Jan 29th
Highlighting default encoder
In Solr admin advance search page the highlighted text is not displayed correctly for Arabic characters! I'm using Solr Trunk 2011-01-10 …. It use to be working in solr 1.4.1. Does anybody knows why?
Re: Apply a patch
when I do a checkout I got this error REPORT of '/repos/asf/!svn/vcc/default': Could not read response body: An existing connection was forcibly closed by the remote host. so i'll try to checkout each folder separately solr, modules, lucene
Re: Apply a patch
Hi Gora Thanx for your quick respnse. I checked out source code svn, and applied the patch but when I build the source code I've got the following error C:\trunk\solr\common-build.xml:245: C\trunk\modules\analysis\phonetic does not exist.
Apply a patch
Hi guys would someone please explain to me how to apply a patch to solr lets say for example "SOLR-2129-version3.patch" i'm Running Solr Trunk "apache-solr-4.0-2011-01-02_08-06-03" on Windows environment
Re: DIH and UTF-8
if you are using tomcat modify server.xml check the URIEncoding="*UTF-8*" is set
Re: Basic Solr Configurations and best practice
thanx Alexey I downloaded Solr 4 and implemented the TikaEntityProcessor, it worked fine with Tika 0.6. didn't work with Tika 0.7 nor Tika 0.8 SNAPSHOT On Sat, Nov 27, 2010 at 4:05 AM, Alexey Serba wrote: > > 1- How to combine data from DIH and content extracted from file > system > > document into one document in the index? > http://wiki.apache.org/solr/TikaEntityProcessor > You can have one sql entity that retrieves metadata from database and > another nested entity that parses binary file into additional fields > in the document. > > > 2- Should I move the per-user permissions into a separate index? > What > > technique to implement? > I would start with keeping permissions in the same index as the actual > content. > > > On Tue, Nov 23, 2010 at 11:35 AM, Darx Oman wrote: > > Hi guys > > > > I'm kind of new to solr and I'm wondering how to configure solr to best > > fulfills my requirements. > > > > Requirements are as follow: > > > > I have 2 data sources: database and file system documents. Every document > in > > the file system has related information stored in the database. Both the > > file content and the related database fields must be indexed. Along with > > the DB data is per-user permissions for every document. I'm using DIH > for > > the DB and Tika for the file System. The documents contents nearly never > > change, while the DB data especially the permissions changes very > > frequently. Total number of documents roughly around 2M and each document > is > > about 500KB. > > > > 1- How to combine data from DIH and content extracted from file > system > > document into one document in the index? > > > > 2- Should I move the per-user permissions into a separate index? > What > > technique to implement? > > >
Basic Solr Configurations and best practice
Hi guys I'm kind of new to solr and I'm wondering how to configure solr to best fulfills my requirements. Requirements are as follow: I have 2 data sources: database and file system documents. Every document in the file system has related information stored in the database. Both the file content and the related database fields must be indexed. Along with the DB data is per-user permissions for every document. I'm using DIH for the DB and Tika for the file System. The documents contents nearly never change, while the DB data especially the permissions changes very frequently. Total number of documents roughly around 2M and each document is about 500KB. 1- How to combine data from DIH and content extracted from file system document into one document in the index? 2- Should I move the per-user permissions into a separate index? What technique to implement?
Re: FW: How to enable Unicode Support in Solr
Hi amier try saving the xml file encoding as UTF-8 On Mon, Sep 6, 2010 at 11:08 AM, Darx Darx wrote: > > > > Date: Mon, 6 Sep 2010 10:10:25 +0500 > > Subject: How to enable Unicode Support in Solr > > From: am...@techarete.com > > > To: solr-user@lucene.apache.org > > > > I have an index that takes textual description and places it in the > index. I > > am creating an XML file and passing it to Solr for indexing, but Solr is > not > > saving Unicode characters as it is showing question mark for those > > characters. I want to know that how to enable Unicode Support for Solr > > index. > > > > Regards, > > Amier Haider >