this is the strace output

[pid 33318] nanosleep({0, 20000}, NULL) = 0
[pid 33318] nanosleep({0, 20000}, NULL) = 0
[pid 33318] nanosleep({0, 20000}, NULL) = 0
[pid 33318] nanosleep({0, 20000},  <unfinished ...>
[pid 33321] write(4, "HTTP/1.1 200 OK\r\nContent-Encodin"..., 2182 
<unfinished ...>
[pid 33318] <... nanosleep resumed> NULL) = 0
[pid 33321] <... write resumed> )       = 2182
[pid 33318] nanosleep({0, 20000},  <unfinished ...>
[pid 33321] futex(0x1073648, FUTEX_WAKE_PRIVATE, 1) = 1
[pid 33317] <... futex resumed> )       = 0
[pid 33317] nanosleep({0, 3000},  <unfinished ...>
[pid 33321] read(4, 0xc000204000, 4096) = -1 EAGAIN (Resource temporarily 
unavailable)
[pid 33318] <... nanosleep resumed> NULL) = 0
[pid 33321] futex(0xc0000be148, FUTEX_WAIT_PRIVATE, 0, NULL <unfinished ...>
[pid 33318] nanosleep({0, 20000},  <unfinished ...>
[pid 33317] <... nanosleep resumed> NULL) = 0
[pid 33317] futex(0x1073648, FUTEX_WAIT_PRIVATE, 0, NULL <unfinished ...>
[pid 33318] <... nanosleep resumed> NULL) = 0
[pid 33318] futex(0x1072dd0, FUTEX_WAIT_PRIVATE, 0, {60, 0} <unfinished ...>
[pid 42464] <... futex resumed> )       = -1 ETIMEDOUT (Connection timed 
out)
[pid 42464] futex(0x1072dd0, FUTEX_WAKE_PRIVATE, 1) = 1
[pid 33318] <... futex resumed> )       = 0
[pid 42464] futex(0xc00055e148, FUTEX_WAIT_PRIVATE, 0, NULL <unfinished ...>
[pid 33318] nanosleep({0, 20000}, NULL) = 0
[pid 33318] futex(0x1072dd0, FUTEX_WAIT_PRIVATE, 0, {60, 0}


On Saturday, 29 February 2020 12:30:58 UTC-6, Bibin John wrote:
>
> both file have same permissions.
>
> /home/nodeexp/db $ ls -ltr
> total 8
> -rwxr-xr-x 1 user user 400 Feb 29 13:13 1.prom
> -rwxr-xr-x 1 user user 396 Feb 29 13:14 2.prom
>
> Please find content of files
>
> /home/nodeexp/db $ cat 1.prom
> kafka_topic_cluster_last_update{cluster="test", ts="1582844402760"} 
> 1582844402760
> kafka_topic_rf{cluster="test", atopic="APPC-LCM-READ-REGRESSION-1848"} 3
> kafka_topic_partitioncount{cluster="test", 
> atopic="APPC-LCM-READ-REGRESSION-1848"} 8
> kafka_topic_details{cluster="test", 
> atopic="APPC-LCM-READ-REGRESSION-1848", bpartition="0", cleader="2", 
> dreplicas="2,3,1", eisr="1,2,3", ts="1582844402760"} 3
>
> /home/nodeexp/db $ cat 2.prom
> kafka_topic_cluster_last_update{cluster="dev", ts="1582844402760"} 
> 1582844402760
> kafka_topic_rf{cluster="dev", atopic="APPC-LCM-READ-REGRESSION-1848"} 3
> kafka_topic_partitioncount{cluster="dev", 
> atopic="APPC-LCM-READ-REGRESSION-1848"} 8
> kafka_topic_details{cluster="dev", atopic="APPC-LCM-READ-REGRESSION-1848", 
> bpartition="0", cleader="2", dreplicas="2,3,1", eisr="1,2,3", 
> ts="1582844402760"} 3
> /opt/data/tools/node_exporter/data $
>
>
> Data from nodeexporter. this shows, it tried to read from both prom files 
> but shows content of only 1 file.
>
> # HELP kafka_topic_cluster_last_update Metric read from 
> /home/nodeexp/db/1.prom
> # TYPE kafka_topic_cluster_last_update untyped
> kafka_topic_cluster_last_update{cluster="test",ts="1582844402760"} 
> 1.58284440276e+12
> # HELP kafka_topic_details Metric read from /home/nodeexp/db/1.prom
> # TYPE kafka_topic_details untyped
> kafka_topic_details{atopic="APPC-LCM-READ-REGRESSION-1848",bpartition="0",cleader="2",cluster="test",dreplicas="2,3,1",eisr="1,2,3",ts="1582844402760"}
>  
> 3
> # HELP kafka_topic_partitioncount Metric read from /home/nodeexp/db/1.prom
> # TYPE kafka_topic_partitioncount untyped
> kafka_topic_partitioncount{atopic="APPC-LCM-READ-REGRESSION-1848",cluster="test"}
>  
> 8
> # HELP kafka_topic_rf Metric read from /home/nodeexp/db/1.prom
> # TYPE kafka_topic_rf untyped
> kafka_topic_rf{atopic="APPC-LCM-READ-REGRESSION-1848",cluster="test"} 3
> # HELP node_exporter_build_info A metric with a constant '1' value labeled 
> by version, revision, branch, and goversion from which node_exporter was 
> built.
> # TYPE node_exporter_build_info gauge
> node_exporter_build_info{branch="",goversion="go1.13.6",revision="",version=""}
>  
> 1
> # HELP node_scrape_collector_duration_seconds node_exporter: Duration of a 
> collector scrape.
> # TYPE node_scrape_collector_duration_seconds gauge
> node_scrape_collector_duration_seconds{collector="textfile"} 0.000220073
> # HELP node_scrape_collector_success node_exporter: Whether a collector 
> succeeded.
> # TYPE node_scrape_collector_success gauge
> node_scrape_collector_success{collector="textfile"} 1
> # HELP node_textfile_mtime_seconds Unixtime mtime of textfiles 
> successfully read.
> # TYPE node_textfile_mtime_seconds gauge
> node_textfile_mtime_seconds{file=".prom"} 1.580203921e+09
> node_textfile_mtime_seconds{file="1.prom"} 1.583000029e+09
> node_textfile_mtime_seconds{file="2.prom"} 1.583000042e+09
> # HELP node_textfile_scrape_error 1 if there was an error opening or 
> reading a file, 0 otherwise
> # TYPE node_textfile_scrape_error gauge
> node_textfile_scrape_error 0
> # HELP process_cpu_seconds_total Total user and system CPU time spent in 
> seconds.
> # TYPE process_cpu_seconds_total counter
> process_cpu_seconds_total 0.03
> # HELP process_max_fds Maximum number of open file descriptors.
> # TYPE process_max_fds gauge
> process_max_fds 200000
> # HELP process_open_fds Number of open file descriptors.
> # TYPE process_open_fds gauge
> process_open_fds 10
> # HELP process_resident_memory_bytes Resident memory size in bytes.
> # TYPE process_resident_memory_bytes gauge
> process_resident_memory_bytes 9.89184e+06
> # HELP process_start_time_seconds Start time of the process since unix 
> epoch in seconds.
> # TYPE process_start_time_seconds gauge
> process_start_time_seconds 1.58300043655e+09
> # HELP process_virtual_memory_bytes Virtual memory size in bytes.
> # TYPE process_virtual_memory_bytes gauge
> process_virtual_memory_bytes 7.21330176e+08
> # HELP process_virtual_memory_max_bytes Maximum amount of virtual memory 
> available in bytes.
> # TYPE process_virtual_memory_max_bytes gauge
> process_virtual_memory_max_bytes -1
> # HELP promhttp_metric_handler_errors_total Total number of internal 
> errors encountered by the promhttp metric handler.
> # TYPE promhttp_metric_handler_errors_total counter
> promhttp_metric_handler_errors_total{cause="encoding"} 0
> promhttp_metric_handler_errors_total{cause="gathering"} 5
> # HELP promhttp_metric_handler_requests_in_flight Current number of 
> scrapes being served.
> # TYPE promhttp_metric_handler_requests_in_flight gauge
> promhttp_metric_handler_requests_in_flight 1
> # HELP promhttp_metric_handler_requests_total Total number of scrapes by 
> HTTP status code.
> # TYPE promhttp_metric_handler_requests_total counter
> promhttp_metric_handler_requests_total{code="200"} 5
> promhttp_metric_handler_requests_total{code="500"} 0
> promhttp_metric_handler_requests_total{code="503"} 0
>
>
>
>
>
>
>
> On Friday, 28 February 2020 02:52:26 UTC-6, Brian Candler wrote:
>>
>> Look at the stderr output from node_exporter.  My guess is that one of 
>> the metrics is in an invalid format; if so, textfile_collector will report 
>> and abandon the rest of the file (maybe the rest of the directory - I 
>> haven't tested this)
>>
>> Another possibility is permissions on the files.
>>
>> You may also get more clues from strace:
>>
>> strace -f -p <pid-of-node-exporter>
>>
>> Look for accesses to /home/nodeexp/db/
>>
>

-- 
You received this message because you are subscribed to the Google Groups 
"Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/prometheus-users/5da738bf-9b0f-40e0-8f92-6d6ffd244e23%40googlegroups.com.

Reply via email to