[CODE4LIB] Greenstone: tweaking Lucene indexing

2009-09-29 Thread Yitzchak Schaffer
Hello, Sorry for any cross-posting annoyance. I have a request for a Greenstone collection I'm working on, to add context snippets to search results; for example a search for yak culture might return this in the list of results: ... addressing the fine points of strongyak culture/strong,

Re: [CODE4LIB] Greenstone: tweaking Lucene indexing

2009-09-29 Thread Erik Hatcher
The Lucene Highlighter doesn't require that the text you want highlighted be stored. In fact, you can pass in any arbitrary text to the Highlighter. See the various getBestFragments from the Highlighter class:

Re: [CODE4LIB] Greenstone: tweaking Lucene indexing

2009-09-29 Thread Yitzchak Schaffer
Erik Hatcher wrote: The Lucene Highlighter doesn't require that the text you want highlighted be stored. In fact, you can pass in any arbitrary text to the Highlighter. Thanks Erik, What I'm looking for is to return the context of the search result, not just the ID of the containing

Re: [CODE4LIB] Greenstone: tweaking Lucene indexing

2009-09-29 Thread Erik Hatcher
On Sep 29, 2009, at 7:33 AM, Yitzchak Schaffer wrote: Erik Hatcher wrote: The Lucene Highlighter doesn't require that the text you want highlighted be stored. In fact, you can pass in any arbitrary text to the Highlighter. Thanks Erik, What I'm looking for is to return the context of

[CODE4LIB] Bookmarking web links - authoritativeness or focused searching

2009-09-29 Thread Cindy Harper
I've been thinking about the role of libraries as promoter of authoritative works - helping to select and sort the plethora of information out there. And I heard another presentation about social media this morning. So I though I'd bring up for discussion here some of the ideas I've been mulling

Re: [CODE4LIB] Bookmarking web links - authoritativeness or focused searching

2009-09-29 Thread Kent Gerber
www.diigo.com is a social bookmarking site like delicious and it has added features like creating groups around specific themes and the ability to annotate the Web pages you bookmark for future reference. You might want to explore this feature and see if it is appropriate for what you

Re: [CODE4LIB] Greenstone: tweaking Lucene indexing

2009-09-29 Thread Yitzchak Schaffer
Erik Hatcher wrote: I'm a bit confused then. You mentioned that somehow Zend Lucene was going to help, but if you don't have the text to highlight anywhere then the Highlighter isn't going to be of any use. Again, you don't need the full text in the Lucene index, but you do need it get it

Re: [CODE4LIB] Bookmarking web links - authoritativeness or focused searching

2009-09-29 Thread Jason Griffey
It's not social bookmarking, but as far as But I'm thinking now about the possibility of a search engine limited to sites cooperatively vetted by librarians, that would incorporate ranking by # links. Something more responsive than cataloging websites in our catalogs., well, that's almost exactly

Re: [CODE4LIB] Greenstone: tweaking Lucene indexing

2009-09-29 Thread Ross Singer
Yitzchak, are you interested in actually searching the fulltext? Or just highlighting the terms? If you're only interested in highlighting it, it might be a whole lot easier to implement this in javascript through something like jQuery:

Re: [CODE4LIB] Bookmarking web links - authoritativeness or focused searching

2009-09-29 Thread MJ Ray
Cindy Harper wrote: I've been thinking about the role of libraries as promoter of authoritative works - helping to select and sort the plethora of information out there. And I heard another presentation about social media this morning. So I though I'd bring up for discussion here some of the

Re: [CODE4LIB] Bookmarking web links - authoritativeness or focused searching

2009-09-29 Thread Joe Hourcle
On Tue, 29 Sep 2009, Cindy Harper wrote: I've been thinking about the role of libraries as promoter of authoritative works - helping to select and sort the plethora of information out there. And I heard another presentation about social media this morning. So I though I'd bring up for

Re: [CODE4LIB] Bookmarking web links - authoritativeness or focused searching

2009-09-29 Thread Donahue, Amy (NIH/NLM) [C]
I feel like a couple years ago a librarian(s?) created a Google Custom Search Engine that did exactly what you describe as focused searching, but I can't find a link any more. You can search the CSEs by scrolling down on this page (and there are a couple of links to directories, too):

Re: [CODE4LIB] Greenstone: tweaking Lucene indexing

2009-09-29 Thread Yitzchak Schaffer
Ross Singer wrote: Yitzchak, are you interested in actually searching the fulltext? Or just highlighting the terms? Sorry this wasn't clearer. Let me re-summarize, and report on a new development: - Greenstone allows for Lucene as one of the indexing plugins - I took advantage of this

Re: [CODE4LIB] Greenstone: tweaking Lucene indexing

2009-09-29 Thread Yitzchak Schaffer
Ross Singer wrote: Yitzchak, are you interested in actually searching the fulltext? Or just highlighting the terms? Just in case my earlier response didn't make it crystal clear: we're trying to search the fulltext, and put the search string in context within the document which includes it.

Re: [CODE4LIB] web archiving - was: Implementing OpenURL for simple web resources

2009-09-29 Thread Erik Hetzner
At Fri, 18 Sep 2009 10:40:08 -0400, Ed Summers wrote: Hi Erik, all […] I haven't been following this thread completely, but you've taken it in an interesting direction. I think you've succinctly described the issue with using URLs as references in an academic context: that the integrity

Re: [CODE4LIB] Bookmarking web links - authoritativeness or focused searching

2009-09-29 Thread Keith Jenkins
AbleGrape.com is a good example of a focused search engine that aims to index only authoritative sources within a particular disciple -- in this case it's wine, enology, and viticulture. It currently crawls about 40,000 vetted websites. It's a great search engine for the subject area it serves,