Forgive my ignorance but I want to be sure, do I add <copyField source="iframe" dest="text"/> to solrindex-mapping.xml? so that my solrindex-mapping.xml looks like this: <fields> <field dest="content" source="content"/> <field dest="title" source="title"/> <field dest="iframe" source="iframe"/> <field dest="host" source="host"/> <field dest="segment" source="segment"/> <field dest="boost" source="boost"/> <field dest="digest" source="digest"/> <field dest="tstamp" source="tstamp"/> <field dest="id" source="url"/> <copyField source="url" dest="url"/> *<copyField source="iframe" dest="text"/> * </fields> <uniqueKey>url</uniqueKey>
And what do you mean by standard tokenization ? Thanks! On Thu, Jun 27, 2013 at 3:43 PM, Jack Krupansky <j...@basetechnology.com>wrote: > Just <copyField> from the string field to a "text" field and use standard > tokenization, then you can search the text field for "youtube" or even > "something" that is a component of the URL path. No wildcard required. > > > -- Jack Krupansky > > -----Original Message----- From: Amit Sela > Sent: Thursday, June 27, 2013 8:37 AM > To: solr-user@lucene.apache.org > Subject: Re: Solr admin search with wildcard > > > The stored and indexed string is actually a url like " > http://www.youtube.com/**somethingsomething<http://www.youtube.com/somethingsomething> > ". > It looks like removing the quotes does the job: iframe:*youtube* or am I > wrong ? For now, performance is not an issue, but accuracy is and I would > like to know for example how many URLS have iframe source leading to > YouTube for example. So query like: iframe:*youtube* with max rows 10 or > something will return in the response numFound field the total number of > pages that have a tag ifarme with a source matching *youtube, No ? > > > On Thu, Jun 27, 2013 at 3:24 PM, Jack Krupansky <j...@basetechnology.com>* > *wrote: > > No, you cannot use wildcards within a quoted term. >> >> Tell us a little more about what your strings look like. You might want to >> consider tokenizing or using ngrams to avoid the need for wildcards. >> >> -- Jack Krupansky >> >> -----Original Message----- From: Amit Sela >> Sent: Thursday, June 27, 2013 3:33 AM >> To: solr-user@lucene.apache.org >> Subject: Solr admin search with wildcard >> >> >> I'm looking to search (in the solr admin search screen) a certain field >> for: >> >> *youtube* >> >> I know that leading wildcards takes a lot of resources but I'm not worried >> with that >> >> My only question is about the syntax, would this work: >> >> field:"*youtube*" ? >> >> Thanks, >> >> I'm using Solr 3.6.2 >> >> >