t; in between a H1/H1 pair.
Can someone index
http://www.si.hhs.nl/~jan/doc/java/docs/tooldocs/win32/javadoc.html
for me and test if it works at their site?
Using 3.1.1
Much thanx in advance.
--jesse
------------
J. op den Brouw
Hi all,
upgrading to 3.1.2 makes Acroread play dead.
It is started and then does nothing (0% CPU time).
Anything I missed?
--jesse
J. op den Brouw Johanna Westerdijkplein 75
Haagse Hogeschool
?
--jesse
J. op den Brouw Johanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Sector Techniek Netherlands
Afdeling
db.wordlist
-rw-r--r-- 1 msql www 44359680 Oct 9 21:24 db.words.db
msql@pluto:188
Which files can be deleted? db.images.sorted.gz and db.urls.sorted.gz
gzipped versions of db.{url|images}.
--jesse
J. op den Brouw
Maybe I can persuade people to update the htdig survey
or to add new entries:
http://www.st.hhs.nl/htdig/survey.html
--jesse
J. op den Brouw Johanna Westerdijkplein 75
Haagse Hogeschool
Our web server / mail server connection was down. If anyone
had submitted survey info, it didn't arrive in my mailbox.
It should work now...
(We recently changed to HP-UX servers..)
--jesse
J. op den Brouw
00:yoftheh
htmerge: 4200:ythatabitof
htmerge: Total word count: 4247
htmerge: Total documents: 1
htmerge: Total doc db size (in K): 1002
[msql@chaos scripts]$
--jesse
------------
J. op den Brouw
: installation or configuration problem: C++ compiler cannot
create executables.
Platform is Solaris 2.5.1 SPARC with gcc 2.8.1.
What am I missing?
Thanks
Andreas
--jesse
J. op den Brouw Johanna
: * export controls on cryptography software.
Error: Couldn't read xref table
[msql@chaos xpdf]$
Is there an encryption patch somewhere?
The PDF *is* legal; I can read it with Acroread.
--jesse
J. op den Brouw
. Is there any
way to show http://www.servername.com in the search
results instead of http://www1.servername.com, etc..?
Thanks,
Amir
--jesse
J. op den Brouw Johanna Westerdijkplein 75
Haagse Hogeschool
Hi,
Is there a way to search the mailing list archive?
Or is there a site that has searchable htdig mailing lists?
--jesse
J. op den Brouw Johanna Westerdijkplein 75
Haagse Hogeschool
these products with Word2000 ?
Word 2000 file format is the same as Word 97 file format, isn't it?
--jesse
J. op den Brouw Johanna Westerdijkplein 75
Haagse Hogeschool
strings, whatever) are written to
the database. When you search for those words that were left out,
you will not find them (of course).
I had this problem when I ran out of disk space over night :)
--jesse
----------------
J. op den Brouw
ery-common-word not searchterm
this is almost like
TRUE not searchterm
which is
not searchterm
--jesse
----------------
J. op den Brouw Johanna Wester
0x1e0af in WordList::Word (this=0x1c4140, word=0x806e7e0 "littleendian",
--jesse
----------
J. op den brouw Johanna Westerdijkplein 75
Haagse Hogeschool2521 EN DEN HA
.
Does anyone have any insights?
--jesse
----------
J. op den brouw Johanna Westerdijkplein 75
Haagse Hogeschool2521 EN DEN HAAG
Faculty of E
rowser who started it all..
--
--jesse
----------
J. op den brouw Johanna Westerdijkplein 75
Haagse Hogeschool2521 EN DEN HAAG
Faculty of E
ubstrings of words ?
Thanks
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
--
--jesse
--------------
J. op den brouw Johanna Wester
J. op den Brouw Johanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Faculty of Engeneering Netherlands
Electrical Engeneering+31
? Is it possible?
--jesse
J. op den Brouw Johanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Faculty of Engeneering Netherlands
Electrical
is very easy for U...
thanks for helping
Olivier RICHÉ (france)
--jesse
J. op den Brouw Johanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Faculty
,
Does any have a binary for htDig V3.1.5 for HP-UX10.20?
--jesse
J. op den Brouw Johanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Faculty of Engeneering
the load well over the limit.
Marcel
man nice
and then do something like:
nice nice_value rundig
rundig starts htdig et al. and these will run with the
same priority.
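
A rough sketch of the same idea driven from Python, in case rundig is started from a script rather than by hand. It assumes a POSIX nice(1) on the PATH; run_niced is a made-up helper name, and the child used here is only a stand-in for rundig that reports its own niceness.

```python
import os
import subprocess
import sys

def run_niced(command, increment=10):
    """Run `command` under nice(1); processes it spawns inherit the niceness."""
    return subprocess.run(
        ["nice", "-n", str(increment)] + list(command),
        capture_output=True, text=True, check=True,
    )

# Stand-in for rundig (assumption): a child that prints its own niceness.
report = [sys.executable, "-c", "import os; print(os.nice(0))"]
result = run_niced(report, increment=5)
print("child niceness:", result.stdout.strip())
```

For a real run the command would be something like run_niced(["rundig"]), with the path to rundig adjusted for the local installation.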
--jesse
J. op den Brouw Johanna
J. op den Brouw Johanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Faculty of Engeneering Netherlands
Electrical Engeneering
;, nPages)));
vars.Add("FIRSTDISPLAYED",
There is no singular_suffix equivalent. Sorry 'bout that
On Mon, 7 Aug 2000, Marcel Hicking wrote:
Nope. Nothing in the docs. I looked there before.
At least not for 3.1.5.
And I would like to have a singular_suffix as well.
Cheers, Marcel
--jesse
Yes they do, AFAIK.
Use the '|' to OR two or more restricts:
restrict=g21|g22
Or in an HTML form like:
<INPUT TYPE=HIDDEN NAME="restrict" VALUE="g21|g22">
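
If the search URL is built by a script instead of a form, the '|' has to be URL-encoded. A minimal sketch, assuming a hypothetical htsearch location (HTSEARCH_URL) and a made-up helper name (build_search_url); only the restrict and words parameters come from htsearch itself.

```python
from urllib.parse import urlencode

# Hypothetical htsearch endpoint (assumption); substitute your own CGI location.
HTSEARCH_URL = "http://www.example.com/cgi-bin/htsearch"

def build_search_url(words, restricts):
    """Return an htsearch URL whose restrict parameter ORs the given patterns."""
    params = {
        "words": words,
        "restrict": "|".join(restricts),  # '|' separates the alternatives
    }
    # urlencode percent-encodes the '|' as %7C
    return HTSEARCH_URL + "?" + urlencode(params)

url = build_search_url("java", ["g21", "g22"])
print(url)
```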
"Reich, Stefan" wrote:
Hello,
I don't think logical operators are supported for restrict (at least for
HTDIG = 3.1.5)
You can send
--jesse
J. op den Brouw Johanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Faculty of Engeneering Netherlands
Electrical
--jesse
------------
J. op den Brouw Johanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Faculty of Engeneering Netherlands
Electrical Engeneering+31
I'm not sure, but the parsing is normally done after
the extension is associated with the parser. As htdig
doesn't recognize .css, the parse part is not started
anyway.
Correct me if I'm wrong.
Thomas Rother wrote:
"J. op den Brouw" wrote:
"text/css" not a recogn
I also get this:
115:115:2:http://www.library.yale.edu/pubstation/databases/GPI.htm: More than
one title tag in document! (possible search engine spamming)
--jesse
J. op den Brouw Johanna
Hi All,
About half a year ago I started a survey of htdig users' systems.
Well the results can be found at
http://www.st.hhs.nl/htdig/survey_result.html
Only those who said "yes" to "Ok to publish" are on the list.
Drop me a line if you have any questions...
--jesse
If you have SunOS or Solaris, this means that the C/C++ compiler is not
installed.
GNU C is installed, but maybe c++ is referring to a not-installed version
of Sun C/C++.
J-P Theberge wrote:
Hi,
What is wrong with my machine?
% ./configure
loading cache ./config.cache
checking for a
Hi All,
Just trying to run contrib/htparsedoc/... (Word 6 parser).
Script fails with an awk error, so I built a Perl version
which does the same.
Now htdig digs okay with the external parser. When I do
a search, it returns with 4 matched docs but only
displays 2. Anyone anything?
--jesse
Hi all
if you want to test my Perl proggie
for indexing word 6 docs
http://www.st.hhs.nl/htdig/parse_word_doc.pl.txt
You need catdoc (see contrib) to use this schema...
--jesse
Here is a Perl script that uses the catdoc program (V 0.90).
Download catdoc stuff from URL below, untargz, ./configure
etc. Set $CATDOC to catdoc proggie.
Set external_parsers to something like:
external_parsers: application/msword
/usr/local/htdig/external_parsers/bin/parse_word_doc.pl
Indeed, this seems to be a bug. It also tells us to write good
parsers. If the parsers do the right thing, this "bug" won't
appear
Vadim Chekan wrote:
Hello!
I wrote my own external parser for WinWord documents and made an error in it.
External parser must write to stdout "w \t 3
See for yourself:
Note: this only happens when I'm parsing Word Docs
via an external parser...
Note: the trailing | are written by my own proggie.
Note: first three lines are normal.
htmerge: Removing doc #9012|
htmerge: Removing doc #9017|
htmerge: Removing doc #9022|
htmerge: Removing doc
Well, I've worked on this item some time ago, but, stupid me,
I threw the source away (and Unix doesn't have undelete, not
that I know of on this machine), but here are the essential
ingredients:
+  ->  AND      ->  AND (htdig)
-  ->  AND NOT  ->  NOT (htdig)
So: +words +undelete -windows  ->  words AND
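
Since the original source is gone, here is a small reconstruction of the idea in Python. It is a sketch, not the lost code: plus_minus_to_boolean is a made-up name, and it assumes that words without a prefix count as required and that excluded words are moved to the end so the expression never starts with NOT.

```python
def plus_minus_to_boolean(query):
    """Convert '+a +b -c' style input into an 'a AND b NOT c' boolean query."""
    required, excluded = [], []
    for token in query.split():
        if token.startswith("-"):
            target, word = excluded, token[1:]
        else:
            # A leading '+' (or no prefix at all) marks a required word.
            target, word = required, token.lstrip("+")
        if word:
            target.append(word)
    if not required:
        raise ValueError("need at least one required word")
    return " AND ".join(required) + "".join(" NOT " + w for w in excluded)

print(plus_minus_to_boolean("+words +undelete -windows"))
```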
Geoff Hutchison wrote:
At 10:28 AM -0500 11/18/98, J. op den Brouw wrote:
See for yourself:
Note: this only happens when I'm parsing Word Docs
via an external parser...
Note: the trailing | are written by my own proggie.
Note: first three lines are normal.
What does your db.wordlist
Gordon Hopper wrote:
I just discovered that max_doc_size is different from max_head_length.
Furthermore the default for max_doc_size is 100K (defaults.cc). This is
fine except when indexing large PDF files. The problem is that the
error message is not correct. I got many errors like this
Hello:
I had the same problem too, because Any and All are not
easily translatable into Dutch with one word. First: note that
it should be (method is and or or):
method_names: and All or Any
Then secondly: You can't use " as a delimiter as in "all words"!
The trick I used is to
Hello,
<!-- The engine searched with the following construct: $(LOGICAL_WORDS) -->
Put the above into your header and you will see the expanded search words
created and used by htdig. So if you search with doc* you should see
something like
doc* or docs or document or docent
etc.
Alexei B.Kozlov
You can't use * as a wildcard. Just use
<input type=hidden name=exclude value=".html">
It should work now.
Defranould wrote:
I'm trying to EXCLUDE some files of a server in order to avoid giving
several times the same information.
But EXCLUDE does not work. I tried for example to EXCLUDE
It seems that your DB2 (db-2.4.14) is not installed correctly
because it cannot resolve symbols (undefined references).
Check whether DB2 is configured correctly.
Somewhere in db-2.4.14/dist there has to be a libdb.a
Johann Habakuk Israel wrote:
Hello,
when I try to compile ht-dig version
Hello
I got the "htmerge: Unable to open word list file" error
and in my opinion it showed up when I hit a server that uses
a robots.txt file and I could not get a file; then somewhere
htmerge crashes because nothing is indexed.
I only wanted to index that site, no other.
Any ideas, anyone?
This is what I needed to know. Thanks.
Gilles Detillieux wrote:
According to J. op den Brouw:
I got the "htmerge: Unable to open word list file" error
and to my opinion it showed when I hit a server that uses
a robots.txt file and I could not get a file, then somewhere
htmer
Hi all,
this is an easy one. My perl 5.00502 version doesn't contain
a BerkeleyDB module. Where can I find one?
Output from whatsnew.pl:
msql@pluto:71 whatsnew.pl
Can't locate BerkeleyDB.pm in @INC (@INC contains:
/usr/local/lib/perl5/5.00502/sun4-solaris /usr/local/lib/perl5/5.00502
[EMAIL PROTECTED] wrote:
I have been reading the recent messages about prefix searching and it looks
like it could be very useful, so I tried it on our site but I have been
unable to make it work. I am using htdig 3.1.0b2 on a Solaris 2.6 Sparc
box.
After reading through the mail
Hi all,
if you want to index Word files or have been doing so for a time,
there is a new parse_word_doc.pl at:
http://www.st.hhs.nl/htdig/parse_word_doc.pl.txt
features: code speedup (mucho!)
matching patterns didn't work. now they match .,';: etc
at the beginning or
Hi all,
I was wondering why
db_stat -d db.docs.index
produces 9128 keys and the output from rundig ends with
htmerge: 8960
htmerge: 8970
htmerge: 8980
htmerge: 8990
htmerge: 9000
htmerge: 9010
htmerge: Total documents: 9010
htmerge: Total doc db size (in K): 68210
How is that possible?
.
It should not be a problem if both run on an x86 machine. GDBM compiles
architecture-dependent. But it's always good to re-index.
--jesse
-
J. op den BrouwJohanna Westerdijkplein 75
Haagse Hogeschool
function.
Berkeley DBM Configuration complete
Okay, what to do next?
--jesse
-
J. op den BrouwJohanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Sector Techniek
so worried that it could go wrong... Maybe this should be
renamed to something like "Berkeley DB:".
--jesse
---------
J. op den BrouwJohanna Westerdijkplein 75
Haagse
-
J. op den BrouwJohanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Sector Techniek Netherlands
Afdeling
--jesse
---------
J. op den BrouwJohanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Sector Techniek Netherlands
Afdeling Elekt
s" concatenated with
a pipe sign (|). So you'll need something like
<OPTION VALUE="one|two"> .......
--jesse
-
J. op den BrouwJohanna Westerdijkplei
their data high byte high and
x86 high byte low. Same problem with berkeley databases.
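
To make the byte-order point concrete, a small Python illustration. It only shows how the same 32-bit value is laid out in the two byte orders and what happens when it is read back the wrong way; it does not reproduce the actual GDBM or Berkeley DB file format.

```python
import struct

value = 0x12345678

big = struct.pack(">I", value)     # big-endian: high byte first
little = struct.pack("<I", value)  # little-endian (x86): low byte first

print(big.hex(), little.hex())

# Reading big-endian bytes as if they were little-endian gives a different number.
misread = struct.unpack("<I", big)[0]
print(hex(misread))
```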
--jesse
-
J. op den BrouwJohanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN
t. It works though...
--jesse
-------------
J. op den BrouwJohanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Sector Techniek Netherlands
Afdeling Elektrotechniek
--jesse
---------
J. op den Brouw
that print out the databases.
Maybe someone can modify these to convert http into https. This only
works for GDBM, not for Berkeley DB.
--jesse
-
J. op den BrouwJohanna Westerdijkplein 75
Haagse Hogeschool
searches.
Do you have a clean version of htdig 3.0.8b2? If so, the exclude
is not working properly. There is a patch available at the
htdig patch site (don't know it right now).
--jesse
-
J. op den Brouw
-
J. op den BrouwJohanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Sector Techniek Netherlands
Afdeling Elektrotechniek +31 70 4458936
On Mon, 14 Sep 1998, J. op den Brouw wrote:
First of all, it finds Sun's cc as the compiler, but cc isn't installed.
When running cc I get:
/usr/ucb/cc: language optional software package not installed
Anyway, configure thinks cc is a proggie and doesn't understand
the -O options. I
htsearch.
"word string" should be converted to word AND word. It's not
as good as the real meaning of "..." but better than
nothing.
Anyone ideas on this subject?
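
As a starting point, the rewrite could be done on the query string before it reaches htsearch. A rough Python sketch, assuming a made-up helper name (phrases_to_and); it only touches the quoted parts and does not give real phrase matching.

```python
import re

def phrases_to_and(query):
    """Replace each "quoted phrase" with its words joined by AND."""
    def expand(match):
        # Split the phrase on whitespace and AND the words together.
        return " AND ".join(match.group(1).split())
    return re.sub(r'"([^"]*)"', expand, query)

print(phrases_to_and('"word string"'))
```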
Greetz,
--jesse
-------------
Hi all,
maybe I didn't look very well, but how can I use the NOT in htdig?
I know it is possible, but I forgot.
Is it: word1 AND NOT word2
or
word1 NOT word2
--jesse
-
J. op den Brouw
otmail.com
--jesse
-------------
J. op den BrouwJohanna Weste
-
J. op den BrouwJohanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Sector Techniek Netherlands
Afdeling Elektrotechniek +31
und".
--jesse
-------------
J. op den BrouwJohanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Sector Techniek Netherlands
Afdeling Elektrotechniek
om->FindFirst(url) > 0)
must be
excludeFrom->FindFirst(url) >= 0)
Recompile and install htsearch, and then it should work.
(BTW this problem is also in 3.0.8b2 clean version)
--jesse
-------------
J. op den Brouw
.
--jesse
---------
J. op den BrouwJohanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Sector Techniek Netherlands
Afdeling Elekt
There is such a script but it works only with GDBM and DB 1.85
(AFAIK). I didn't see anything in Perl that worked with 2.x.
Terry Hitzeman wrote:
Wondering if anyone has seen a what's new perl script that will look
at an index by modification date and throw the results to a template
g/FAQ.html
--jesse
------------
J. op den Brouw Johanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Faculty of Engeneering Netherlands
on a previous installation. what do
i have to do to change this? where is it specified? any help would be greatly
appreciated. thanks.
matt
--jesse
J. op den Brouw Johanna Westerdijkplein 75
Haagse
I think we *all* want to share it with you. It should be on the patch
site.
Please do mail us your patch.
Will Ballantyne wrote:
I've got htdig working with ssl by using the ssl patch for 3.1.4 and
making the necessary changes. If anyone wants a new diffs file send me
a note...
--Jesse
.0, when I search with some word, I have an
internal server error and in the log error of the web site I have just
this line :
Premature end of script headers: /home/httpd/cgi-bin/htsearch
How can I debug this problem ?
--jesse
------------
of them all at once.
--jesse
J. op den Brouw Johanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Faculty of Engeneering
I think that's what happens when you copy off the screen ;-)
"Brian W. Spolarich" wrote:
On Tue, 31 Oct 2000, Joe R. Jah wrote:
| I am forwarding your message to the patch author and htdig users
| mailing list, to which the patch was originally posted. Maintainer of
| the patch site
de against a clean version of htdig.
Greetz from windy Holland,
--jesse
------------
J. op den Brouw Johanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Faculty of E
that helps - but it's
not doing much now?
--jesse
J. op den Brouw Johanna Westerdijkplein 75
Haagse Hogeschool 2521 EN DEN HAAG
Faculty of Engeneering
You should not do that. There is a nice function in Perl
called select(). It can be used to redirect output from the
'screen' to a file. Something like this:
open(OUTPUT, ">somefile") or die "Cannot open somefile: $!";
$old_fh = select(OUTPUT);   # print now goes to OUTPUT by default
# ... do your stuff with htmerge ...
select($old_fh);            # restore the previous default filehandle
close(OUTPUT);
This is from 'The Perl
81 matches