Can you share the schema that you use for the fulltext search.
Ankush Chadda about.me/iamkhush [image: Ankush Chadda on about.me] <http://about.me/iamkhush> On Mon, Jun 16, 2014 at 9:40 AM, Anand Chitipothu <[email protected]> wrote: > On 16-Jun-2014, at 12:49 AM, Ankush wrote: > > Hey, > > I am trying to implement fulltext search on my website, which uses > openlibrary framework.I dont have prior experience on solr . Can you help > me clear my doubts - > > > - The schema that you currently use for fulltext search is the inside > core of solr-biblio ( > https://github.com/internetarchive/openlibrary/tree/master/conf/solr-biblio > ). > > No, it is used for searching work records in openlibray, not fulltext > search. > > http://openlibrary.org/search > > > - Is solr-biblio used for all the searches on website? > > No, fulltext search uses completely different solr instance with different > schema. > > > - Now in order to index the books, I saw the script inside_all.py( > > https://github.com/internetarchive/openlibrary/tree/master/openlibrary/solr/inside/index_all.py).This > scripts makes hit to fulltext/abbyy_to_text.php, Gets page_count and body > and uses it to index. Now abby_to_text.php is in the BookReaderIA dir, > which uses extract_paragraph.py to return the data. What I cannot > understand is, that extract_paragraphs.py prints page_count in 'meta:...' ( > > https://github.com/openlibrary/bookreader/blob/master/BookReaderIA/fulltext/extract_paragraphs.py#L155) > , but abby_to_text.php is trying to fetch a string 'page count' from the > data ( > > https://github.com/internetarchive/openlibrary/blob/master/openlibrary/solr/inside/index_all.py#L130). > How is this working on your end > > It is not in my head right now (I'm not the one who implemented it). I'll > look at how it works and let you know. > > Anand > > _______________________________________________ > Ol-tech mailing list > [email protected] > http://mail.archive.org/cgi-bin/mailman/listinfo/ol-tech > Archives: http://www.mail-archive.com/[email protected]/ > To unsubscribe from this mailing list, send email to > [email protected] >
_______________________________________________ Ol-tech mailing list [email protected] http://mail.archive.org/cgi-bin/mailman/listinfo/ol-tech Archives: http://www.mail-archive.com/[email protected]/ To unsubscribe from this mailing list, send email to [email protected]
