Can you look at the error statistics? statistics/errors
You should be able to do it in a separate browser page.


-----Original Message-----
From: [EMAIL PROTECTED] on behalf of The Developers
Sent: Thu 4/27/2006 2:33 PM
To: owfs-developers@lists.sourceforge.net
Subject: [Owfs-developers] Virtual memory growth (& device dropouts)
 
First of all Kudos! to the developers and others helping with the owfs project.

Setup:
11434 ?        Ssl    1:20 /opt/owfs/bin/owserver -u -p 3333
11439 ?        Ssl    0:46 /opt/owfs/bin/owhttpd -s 127.0.0.1:3333 -p 3002
(both run by root)
Kernel    2.6.10-5-386 [Ubuntu 5.04]
owserver and owhttpd were compiled from the owfs-2.3p0.tar.gz
tarball found at http://owfs.sourceforge.net (but without fuse,
I only compiled owserver and owhttpd).

I have a DS9490 with two DS18B20 devices and one DS18S20 on it.
Temperatures are read about once a minute using wget to access
the owhttpd server.
After about 30 hours of successful data collection - no more data.
Initially I suspected the wiring, and reduced the wiring to a
short segment with a single DS2438 device that was known to work
but the owhttpd did not report the detection of ANY devices.
Finally, I pulled out the USB and then re-inserted it. All this without
stopping the owserver or owhttpd. Then the DS2438 was recognized.

I reconnected the 3-temperature-device cable. The three devices
were listed once again. However, more tests showed that the
owhttpd had stopped reporting anything about the first
18B20 other than its "address" , "crc" , and "die" values. Oddly, the
owhttpd was sending a partial webpage for the "misbehaving" device
and in fact ending the page abruptly in the middle of a tag [see #1
below].  The data received is exactly 512-bytes which seems too
nice a round number to be ignored.

The other two devices were reported normally. I restarted my minute
interval wget command -
while (:) ; do date ; wget -q -O - http://127.0.0.1:3002/28.D96BC6000000 \
http://localhost:3002/10.BA19BC000800  | html2text -nobs | \
awk -F'|' '/temperature/ print $2 ": " $3 * 1}; /address/ {printf "%s
%s ", $2, $3}' \
; sleep 60 ; done >> file_out.dat
- this time retrieving temperatures for the two working devices.
Within an hour, by chance, I happened to notice that
the amount of virtual memory used by each of the owserver and
owhttpd processes was very large and growing [see #3 below]
(Over 1.7GB and 875MB respectively as I write this - on a 256MB system)

 After another two hours of collecting valid data,  the 2nd device
 (the 18S20) was now being reported in the same partial fashion as the
 first 18B20.

To further rule out a cable and/or device issue, I moved the
cabling end from the USB dongle on the Ubuntu system, and
connected it to anther system running the dsl-2.3 (with owfs.uci)
and a DS9097U. It worked fine over there. So I put it back.
unfortunately the Ubuntu system does not have a RS232 port.

I have also included some possibly useful diagnostic information
from the /bus.0/statistics directory [see #2 below].

Q1. Does any one have any idea what is going on?
Q2. Is there any reason that so much virtual memory is being requested?
Q3. Why is the owhttpd returning incomplete web pages?

The problem could of course be in my kernel, or elsewhere, but I'm not
having any other issues, and I have never seen any other processes
grabbing virtual memory like these two - that is why I am asking on this
list.

Seeing how quick the response is on this list in general, I am going to
leave everything running for at least another day or two, in hopes of
troubleshooting the problem while it is happening.
If I simply restart everything we may never know what went wrong,
and each cycle may take > 30hours before a problem
arises. So I am hoping that someone can offer suggestions about
how to pin down the culprit AS IS. Of course all thoughts are
welcome.

I am not a developer, nor a C programmer, but with detailed instructions
I can follow, I will do my best to provide troubleshooting information. I am
relatively competent at the bash command line. Sorry if I haven't provided
all the pertinent information.


Mark

=====================
 #1. Abruptly ended html code sent by owhttpd server
 HTML><HEAD><TITLE>1-Wire Web: 28.CC06AE000000</TITLE></HEAD>
 <BODY BGCOLOR='#BBBBBB'><TABLE WIDTH='100%' BGCOLOR='#DDDDDD'
BORDER='1'><TR><TD>OWFS on 127.0.0.1:3333</TD><TD><A HREF='/'>Bus
listing</A></TD><TD><A HREF='http://owfs.sourceforge.net'>OWFS
homepage</A></TD><TD><A
HREF='http://www.maxim-ic.com'>Dallas/Maxim</A></TD><TD>by <A
HREF='mailto://[EMAIL PROTECTED]'>Paul H
Alfille</A></TD></TR></TABLE>
 <H1>28.CC06AE000000</H1><HR>
<BR><small><A href='/uncached/28.CC06AE000000'>uncached
version</A></small><TABLE BGCOLOR="#DDDDDD" BORDER=1><TR><TD><A
HREF='/'><CODE><B><BIG>up</BIG></B></CODE></A></TD><TD>directory</TD></TR><TR><T
D><B>address</B></TD><TD>28CC06AE000000DC</TD></TR>
 <TR><TD><B>crc8</B></TD><TD>DC</TD></TR>
 <TR><TD><B>die</B></TD><TD
 --------------------------------------------------------------------
 #2. Bus errors seen at http://localhost:3002/bus.0/statistics/errors
 [ALL non-zero entries are listed]
 |BUS_byte_errors              |1        |
 |BUS_open_errors              |795      |
 |BUS_reconnect_errors         |796      |
 |BUS_reconnects               |798      |
 |BUS_reset_errors             |3197     |
 |CRC8_tries                   |18601    |
 ---------
 [and from http://localhost:3002/bus.0/statistics/bus]
 |bus_locks.ALL_______|85071____|
 |bus_locks.0_________|85071____|
 |bus_time.ALL________|4338.26__|
 |bus_time.0__________|4338.26__|
 |bus_unlocks.ALL_____|85071____|
 |bus_unlocks.0_______|85071____|
 |elapsed_time________|162716___|
 |other_bus_errors.ALL|3993_____|
 |other_bus_errors.0__|3993_____|
 |reconnect.ALL_______|798______|
 |reconnect.0_________|798______|
 |reconnect_errors.ALL|796______|
 |reconnect_errors.0__|796______|
 |total_bus_locks_____|85071____|
 |total_bus_time______|4338.26__|
 |total_bus_unlocks___|85071____|
 --------------------------------------------------------------------
 #3. Top output displaying growing VIRTual memory usage
 top - 01:37:25 up 1 day, 15:27,  1 user,  load average: 0.88, 0.38, 0.19
 Tasks: 104 total,   1 running, 102 sleeping,   1 stopped,   0 zombie
 Cpu(s): 15.2% us,  5.3% sy,  0.0% ni, 77.9% id,  1.0% wa,  0.3% hi,  0.3% si
 Mem:    240052k total,   225980k used,    14072k free,     4624k buffers
 Swap:   506008k total,    46924k used,   459084k free,    54972k cached

   PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
 11434 root      17   0  571m 1300  772 S  0.3  0.5   1:16.80 owserver
 11439 root      16   0  275m 1604  776 S  0.3  0.7   0:44.28 owhttpd
  6456 root      15   0  163m  18m 5476 S  4.6  7.9  10:43.22 Xorg
 ---------
 top - 02:02:49 up 1 day, 15:52,  1 user,  load average: 0.82, 0.80, 0.55
 Tasks: 103 total,   1 running, 101 sleeping,   1 stopped,   0 zombie
 Cpu(s):  9.3% us,  3.7% sy,  0.0% ni, 86.7% id,  0.0% wa,  0.3% hi,  0.0% si
 Mem:    240052k total,   214708k used,    25344k free,     5388k buffers
 Swap:   506008k total,    47128k used,   458880k free,    38600k cached

   PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
 11434 root      16   0  587m 1308  772 S  0.0  0.5   1:17.39 owserver
 11439 root      16   0  291m 1648  776 S  0.0  0.7   0:44.66 owhttpd
  6456 root      15   0  166m  20m 5472 S  3.6  8.9  11:32.55 Xorg
 ---------
 top - 08:44:12 up 1 day, 22:33,  1 user,  load average: 0.16, 0.06, 0.02
 Tasks: 104 total,   1 running, 102 sleeping,   1 stopped,   0 zombie
 Cpu(s):  1.3% us,  0.7% sy,  0.0% ni, 98.0% id,  0.0% wa,  0.0% hi,  0.0% si
 Mem:    240052k total,   235364k used,     4688k free,    11864k buffers
 Swap:   506008k total,    45604k used,   460404k free,    33280k cached

   PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
 11434 root      16   0 1355m 1708  772 S  0.0  0.7   1:19.99 owserver
 11439 root      16   0  667m 2620  776 S  0.0  1.1   0:46.16 owhttpd
  6456 root      15   0  165m  20m 4976 S  0.7  8.8  12:41.34 Xorg
 --------------------------------------------------------------------
 ======================================


-------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=k&kid0709&bid&3057&dat1642
_______________________________________________
Owfs-developers mailing list
Owfs-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/owfs-developers

<<winmail.dat>>

Reply via email to