I'll check empty words problem on Monday.
By the way, you can unkomment #define DEBUG_SQL
in sql.c and indexer will output every SQL query.
Are there any empty words in this output?


About XLS converter. YOu can also use xls2csv
convertor, it is the part of catdoc project.
Please find it on freshmeat.


Danish Qadri wrote:
> That was actually what concerned me the most. This is a CVS build from
> last night. I updated my copy at 8:30ish EST (GMT -4).
> 
> I've also seen it pushing empty words into the other dictionaries, dict6,
> dict8 etc. I was just going to delete them eventually :). I just tried to
> do this: "select * from dict4 where word = '';" and it returned zero
> search results.
> 
> Could it be possible that either Postgres did not have enough memory to
> spit out the sting? or possibly that it is full of non printable (i.e.
> control ) characters?
> 
> I'm mentioning the memory problem, becaues I'm using the xlHtml converter
> to convert MS Excel files, and it tends to use about 300 to 400 MB of RAM
> on every run.
> 
> 
> 
> - Danish
> 
> 
> 
> On Sat, 29 Sep 2001, Johannes Erdfelt wrote:
> 
> 
>>On Sat, Sep 29, 2001, Danish Qadri <[EMAIL PROTECTED]> wrote:
>>
>>>I caught this in my PostgreSQL logs:
>>>
>>>Sep 29 16:07:30 intranet2.globix.net postgres[18413]: [559702] DEBUG:
>>>query: INSERT INTO dict4 (url_id,word,intag) VALUES(11566,'',-1413086976)
>>>
>>>
>>>I'm using multi mode as you can see from the table name. I was suprised to
>>>see a negative # for intag, what does it signify? A general feeling I got
>>>was that the intag # recorded how many times a word occurs in a document,
>>>and with what relevancy (weight).
>>>

___________________________________________
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]

Reply via email to