First of all, kudos to the developers and others helping with the owfs project.

Setup:

  11434 ?  Ssl  1:20 /opt/owfs/bin/owserver -u -p 3333
  11439 ?  Ssl  0:46 /opt/owfs/bin/owhttpd -s 127.0.0.1:3333 -p 3002
  (both run by root)

  Kernel 2.6.10-5-386 [Ubuntu 5.04]

owserver and owhttpd were compiled from the owfs-2.3p0.tar.gz tarball found at http://owfs.sourceforge.net (without fuse; I only compiled owserver and owhttpd).

I have a DS9490 with two DS18B20 devices and one DS18S20 on it. Temperatures are read about once a minute using wget to access the owhttpd server.

After about 30 hours of successful data collection, no more data arrived. Initially I suspected the wiring, and reduced it to a short segment with a single DS2438 device that was known to work, but owhttpd did not report detecting ANY devices. Finally, I pulled out the USB adapter and re-inserted it, all without stopping owserver or owhttpd. Then the DS2438 was recognized.

I reconnected the 3-temperature-device cable, and the three devices were listed once again. However, more tests showed that owhttpd had stopped reporting anything about the first 18B20 other than its "address", "crc8", and "die" values. Oddly, owhttpd was sending a partial web page for the "misbehaving" device, ending the page abruptly in the middle of a tag [see #1 below]. The data received is exactly 512 bytes, which seems too round a number to ignore. The other two devices were reported normally.

I restarted my one-minute-interval wget command:

  while (:) ; do date ; wget -q -O - http://127.0.0.1:3002/28.D96BC6000000 \
    http://localhost:3002/10.BA19BC000800 | html2text -nobs | \
    awk -F'|' '/temperature/ {print $2 ": " $3 * 1}; /address/ {printf "%s %s ", $2, $3}' \
    ; sleep 60 ; done >> file_out.dat

this time retrieving temperatures for the two working devices.
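
A small check in the polling loop could record the moment a device page first comes back incomplete. The following is only a sketch under assumptions: fetch_page, page_has_temperature, and check_device are hypothetical helper names (not part of owfs), and a page is treated as complete when it contains a "temperature" row, which is what the awk filter above looks for.

```shell
#!/bin/bash
# Sketch only: helper names are hypothetical, not part of owfs.

# Fetch one page to stdout (same wget options as the loop above).
fetch_page() {
    wget -q -O - "$1"
}

# Succeed only if the page on stdin contains a "temperature" row.
page_has_temperature() {
    grep -q 'temperature'
}

# Print the page if it looks complete; otherwise log a warning to stderr.
# Returns 1 on fetch failure, 2 on an incomplete page.
check_device() {
    local url page bytes
    url=$1
    page=$(fetch_page "$url") || {
        echo "$(date) FETCH FAILED $url" >&2
        return 1
    }
    if ! printf '%s\n' "$page" | page_has_temperature; then
        # Byte count is approximate: $(...) strips trailing newlines.
        bytes=$(printf '%s' "$page" | wc -c)
        echo "$(date) INCOMPLETE ($bytes bytes) $url" >&2
        return 2
    fi
    printf '%s\n' "$page"
}
```

In the loop, a call such as check_device http://127.0.0.1:3002/28.D96BC6000000 would stand in for the bare wget, one call per device, with the html2text and awk stages unchanged. The warnings go to stderr, so they stay out of file_out.dat unless redirected there.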

Within an hour, by chance, I happened to notice that the virtual memory used by each of the owserver and owhttpd processes was very large and growing [see #3 below] (over 1.7GB and 875MB respectively as I write this, on a 256MB system).

After another two hours of collecting valid data, the 2nd device (the 18S20) was being reported in the same partial fashion as the first 18B20.

To further rule out a cable and/or device issue, I moved the cable from the USB dongle on the Ubuntu system and connected it to another system running dsl-2.3 (with owfs.uci) and a DS9097U. It worked fine over there, so I put it back. Unfortunately the Ubuntu system does not have an RS232 port.

I have also included some possibly useful diagnostic information from the /bus.0/statistics directory [see #2 below].

Q1. Does anyone have any idea what is going on?
Q2. Is there any reason that so much virtual memory is being requested?
Q3. Why is owhttpd returning incomplete web pages?

The problem could of course be in my kernel, or elsewhere, but I'm not having any other issues, and I have never seen any other processes grabbing virtual memory like these two. That is why I am asking on this list.

Seeing how quick the response is on this list in general, I am going to leave everything running for at least another day or two, in hopes of troubleshooting the problem while it is happening. If I simply restart everything we may never know what went wrong, and each cycle may take more than 30 hours before a problem arises. So I am hoping that someone can offer suggestions about how to pin down the culprit AS IS.

Of course all thoughts are welcome. I am not a developer, nor a C programmer, but given detailed instructions I will do my best to provide troubleshooting information. I am relatively competent at the bash command line.

Sorry if I haven't provided all the pertinent information.

Mark

=====================
#1.
Abruptly ended HTML sent by the owhttpd server

<HTML><HEAD><TITLE>1-Wire Web: 28.CC06AE000000</TITLE></HEAD>
<BODY BGCOLOR='#BBBBBB'><TABLE WIDTH='100%' BGCOLOR='#DDDDDD' BORDER='1'><TR><TD>OWFS on 127.0.0.1:3333</TD><TD><A HREF='/'>Bus listing</A></TD><TD><A HREF='http://owfs.sourceforge.net'>OWFS homepage</A></TD><TD><A HREF='http://www.maxim-ic.com'>Dallas/Maxim</A></TD><TD>by <A HREF='mailto://[EMAIL PROTECTED]'>Paul H Alfille</A></TD></TR></TABLE>
<H1>28.CC06AE000000</H1><HR>
<BR><small><A href='/uncached/28.CC06AE000000'>uncached version</A></small><TABLE BGCOLOR="#DDDDDD" BORDER=1><TR><TD><A HREF='/'><CODE><B><BIG>up</BIG></B></CODE></A></TD><TD>directory</TD></TR><TR><TD><B>address</B></TD><TD>28CC06AE000000DC</TD></TR>
<TR><TD><B>crc8</B></TD><TD>DC</TD></TR>
<TR><TD><B>die</B></TD><TD

--------------------------------------------------------------------
#2. Bus errors seen at http://localhost:3002/bus.0/statistics/errors
[ALL non-zero entries are listed]

|BUS_byte_errors      |1     |
|BUS_open_errors      |795   |
|BUS_reconnect_errors |796   |
|BUS_reconnects       |798   |
|BUS_reset_errors     |3197  |
|CRC8_tries           |18601 |

---------
[and from http://localhost:3002/bus.0/statistics/bus]

|bus_locks.ALL_______|85071____|
|bus_locks.0_________|85071____|
|bus_time.ALL________|4338.26__|
|bus_time.0__________|4338.26__|
|bus_unlocks.ALL_____|85071____|
|bus_unlocks.0_______|85071____|
|elapsed_time________|162716___|
|other_bus_errors.ALL|3993_____|
|other_bus_errors.0__|3993_____|
|reconnect.ALL_______|798______|
|reconnect.0_________|798______|
|reconnect_errors.ALL|796______|
|reconnect_errors.0__|796______|
|total_bus_locks_____|85071____|
|total_bus_time______|4338.26__|
|total_bus_unlocks___|85071____|

--------------------------------------------------------------------
#3.
Top output displaying growing VIRTual memory usage

top - 01:37:25 up 1 day, 15:27,  1 user,  load average: 0.88, 0.38, 0.19
Tasks: 104 total,   1 running, 102 sleeping,   1 stopped,   0 zombie
Cpu(s): 15.2% us,  5.3% sy,  0.0% ni, 77.9% id,  1.0% wa,  0.3% hi,  0.3% si
Mem:    240052k total,   225980k used,    14072k free,     4624k buffers
Swap:   506008k total,    46924k used,   459084k free,    54972k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
11434 root      17   0  571m 1300  772 S  0.3  0.5   1:16.80 owserver
11439 root      16   0  275m 1604  776 S  0.3  0.7   0:44.28 owhttpd
 6456 root      15   0  163m  18m 5476 S  4.6  7.9  10:43.22 Xorg

---------
top - 02:02:49 up 1 day, 15:52,  1 user,  load average: 0.82, 0.80, 0.55
Tasks: 103 total,   1 running, 101 sleeping,   1 stopped,   0 zombie
Cpu(s):  9.3% us,  3.7% sy,  0.0% ni, 86.7% id,  0.0% wa,  0.3% hi,  0.0% si
Mem:    240052k total,   214708k used,    25344k free,     5388k buffers
Swap:   506008k total,    47128k used,   458880k free,    38600k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
11434 root      16   0  587m 1308  772 S  0.0  0.5   1:17.39 owserver
11439 root      16   0  291m 1648  776 S  0.0  0.7   0:44.66 owhttpd
 6456 root      15   0  166m  20m 5472 S  3.6  8.9  11:32.55 Xorg

---------
top - 08:44:12 up 1 day, 22:33,  1 user,  load average: 0.16, 0.06, 0.02
Tasks: 104 total,   1 running, 102 sleeping,   1 stopped,   0 zombie
Cpu(s):  1.3% us,  0.7% sy,  0.0% ni, 98.0% id,  0.0% wa,  0.0% hi,  0.0% si
Mem:    240052k total,   235364k used,     4688k free,    11864k buffers
Swap:   506008k total,    45604k used,   460404k free,    33280k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
11434 root      16   0 1355m 1708  772 S  0.0  0.7   1:19.99 owserver
11439 root      16   0  667m 2620  776 S  0.0  1.1   0:46.16 owhttpd
 6456 root      15   0  165m  20m 4976 S  0.7  8.8  12:41.34 Xorg

--------------------------------------------------------------------
======================================
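
To see whether the growth is steady or happens in jumps, the virtual size could be logged at intervals rather than copied from top by hand. A minimal sketch, assuming a Linux /proc filesystem in which /proc/<pid>/status carries a VmSize line; parse_vmsize, log_vmsize, and the log file name are hypothetical:

```shell
#!/bin/bash
# Sketch only: function names and log file name are hypothetical.

# Print the VmSize value (in kB) from /proc/<pid>/status-style text on stdin.
parse_vmsize() {
    awk '/^VmSize:/ { print $2 }'
}

# Append one timestamped line per PID to the given log file.
# PIDs that no longer exist, or have no VmSize entry, are skipped.
log_vmsize() {
    local logfile pid kb
    logfile=$1
    shift
    for pid in "$@"; do
        [ -r "/proc/$pid/status" ] || continue
        kb=$(parse_vmsize < "/proc/$pid/status")
        [ -n "$kb" ] || continue
        printf '%s pid=%s VmSize=%s kB\n' \
            "$(date '+%Y-%m-%d %H:%M:%S')" "$pid" "$kb" >> "$logfile"
    done
}
```

Called once a minute with the PIDs from the setup listing, for example log_vmsize vmsize.log 11434 11439, this would give a time series of the VIRT column for both processes.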
_______________________________________________
Owfs-developers mailing list
Owfs-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/owfs-developers