I have been messing around for several days with configuration settings
trying to get ht://Dig to recognize statutes for the State of
Wisconsin... unfortunately I have been unsuccessful.

Here is a brief description of what I'm trying to accomplish:
We have about 60,000 documents that we are indexing, most of them have
statute numbers (similar to "356.47(b)(a)" )... you'll notice a problem
right off the bat when looking at this... and that is the period.  Now
if I include the period and open/close paranthesis, then I'm going to be
indexing invalid words as well...

So, I thought of two possible of solutions, but I don't think they are
implemented in ht://Dig.  One would be the ability to include a list of
valid words to search and index (i.e. these would be recognized in a
document before the removal of punctuation).  The second would be to
have a regular expression that also searches for valid words.

If someone knows a fix or solution to my problem, I will be forever in
debt to you.  Hopefully, someone else has had a similar issue and has
resolved it relatively easiliy.

Thank you in advance for your prompt response.


Jeffrey Kirby
CCAP Web Team
[EMAIL PROTECTED]
608-264-6253


-------------------------------------------------------
This SF.Net email is sponsored by: IBM Linux Tutorials
Free Linux tutorial presented by Daniel Robbins, President and CEO of
GenToo technologies. Learn everything from fundamentals to system
administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click
_______________________________________________
ht://Dig general mailing list: <[EMAIL PROTECTED]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

Reply via email to