[ovirt-users] ETL service sampling has encountered an error. Please consult the service log for more details.

2019-06-06 Thread nicolas

Hi,

We're running oVirt 4.1.9 (cannot upgrade yet until [1] is released). 
Since a few days ago our event list if full of lines like this:


  ETL service sampling has encountered an error. Please consult the 
service log for more details.


Having a look at the log I see events like:

2019-06-06 
13:37:11|NJ4C8T|TOlL8U|FdlWtU|OVIRT_ENGINE_DWH|StatisticsSync|Default|6|Java 
Exception|tJDBCOutput_7|org.postgresql.util.PSQLException:ERROR: current 
transaction is aborted, commands ignored until end of transaction 
block|1
2019-06-06 
13:37:11|NJ4C8T|TOlL8U|FdlWtU|OVIRT_ENGINE_DWH|StatisticsSync|Default|6|Java 
Exception|tJDBCOutput_4|org.postgresql.util.PSQLException:ERROR: current 
transaction is aborted, commands ignored until end of transaction 
block|1

Exception in component tJDBCOutput_5
org.postgresql.util.PSQLException: ERROR: current transaction is 
aborted, commands ignored until end of transaction block
at 
org.postgresql.core.v3.QueryExecutorImpl.receiveErrorResponse(QueryExecutorImpl.java:2157)
at 
org.postgresql.core.v3.QueryExecutorImpl.processResults(QueryExecutorImpl.java:1886)
at 
org.postgresql.core.v3.QueryExecutorImpl.execute(QueryExecutorImpl.java:255)
at 
org.postgresql.jdbc2.AbstractJdbc2Statement.execute(AbstractJdbc2Statement.java:555)
at 
org.postgresql.jdbc2.AbstractJdbc2Statement.executeWithFlags(AbstractJdbc2Statement.java:417)
at 
org.postgresql.jdbc2.AbstractJdbc2Statement.executeUpdate(AbstractJdbc2Statement.java:363)
at 
ovirt_engine_dwh.statisticssync_4_1.StatisticsSync.tJDBCInput_10Process(StatisticsSync.java:9030)
at 
ovirt_engine_dwh.statisticssync_4_1.StatisticsSync$5.run(StatisticsSync.java:16071)
2019-06-06 
13:37:11|NJ4C8T|TOlL8U|FdlWtU|OVIRT_ENGINE_DWH|StatisticsSync|Default|6|Java 
Exception|tJDBCOutput_5|org.postgresql.util.PSQLException:ERROR: current 
transaction is aborted, commands ignored until end of transaction 
block|1

Exception in component tRunJob_5
java.lang.RuntimeException: Child job running failed
at 
ovirt_engine_dwh.samplerunjobs_4_1.SampleRunJobs.tRunJob_5Process(SampleRunJobs.java:1654)
at 
ovirt_engine_dwh.samplerunjobs_4_1.SampleRunJobs.tRunJob_6Process(SampleRunJobs.java:1456)
at 
ovirt_engine_dwh.samplerunjobs_4_1.SampleRunJobs.tRunJob_1Process(SampleRunJobs.java:1228)
at 
ovirt_engine_dwh.samplerunjobs_4_1.SampleRunJobs.tRunJob_4Process(SampleRunJobs.java:1000)
at 
ovirt_engine_dwh.samplerunjobs_4_1.SampleRunJobs.tJDBCConnection_2Process(SampleRunJobs.java:767)
at 
ovirt_engine_dwh.samplerunjobs_4_1.SampleRunJobs.tJDBCConnection_1Process(SampleRunJobs.java:642)
at 
ovirt_engine_dwh.samplerunjobs_4_1.SampleRunJobs$2.run(SampleRunJobs.java:2683)
2019-06-06 
13:37:11|FdlWtU|TOlL8U|KNLNa4|OVIRT_ENGINE_DWH|SampleRunJobs|Default|6|Java 
Exception|tRunJob_5|java.lang.RuntimeException:Child job running 
failed|1

Exception in component tRunJob_1
java.lang.RuntimeException: Child job running failed
at 
ovirt_engine_dwh.sampletimekeepingjob_4_1.SampleTimeKeepingJob.tRunJob_1Process(SampleTimeKeepingJob.java:6067)
at 
ovirt_engine_dwh.sampletimekeepingjob_4_1.SampleTimeKeepingJob.tJDBCInput_2Process(SampleTimeKeepingJob.java:5809)
at 
ovirt_engine_dwh.sampletimekeepingjob_4_1.SampleTimeKeepingJob.tJDBCConnection_1Process(SampleTimeKeepingJob.java:)
at 
ovirt_engine_dwh.sampletimekeepingjob_4_1.SampleTimeKeepingJob.tJDBCConnection_2Process(SampleTimeKeepingJob.java:4319)
at 
ovirt_engine_dwh.sampletimekeepingjob_4_1.SampleTimeKeepingJob.tRowGenerator_2Process(SampleTimeKeepingJob.java:4188)
at 
ovirt_engine_dwh.sampletimekeepingjob_4_1.SampleTimeKeepingJob.tJDBCInput_3Process(SampleTimeKeepingJob.java:3593)
at 
ovirt_engine_dwh.sampletimekeepingjob_4_1.SampleTimeKeepingJob.tJDBCInput_5Process(SampleTimeKeepingJob.java:2977)
at 
ovirt_engine_dwh.sampletimekeepingjob_4_1.SampleTimeKeepingJob.tJDBCInput_4Process(SampleTimeKeepingJob.java:2295)
at 
ovirt_engine_dwh.sampletimekeepingjob_4_1.SampleTimeKeepingJob.tJDBCConnection_3Process(SampleTimeKeepingJob.java:1649)
at 
ovirt_engine_dwh.sampletimekeepingjob_4_1.SampleTimeKeepingJob$2.run(SampleTimeKeepingJob.java:11363)


Can someone tell me how to fix it? I already restarted ovirt-engine, 
ovirt-engine-dwhd, postgresql and the three at a time and still didn't 
fix the issue. Currently the DWHD data is empty in the Dashboard.


Thanks!

  [1]: https://github.com/oVirt/ovirt-web-ui/issues/490
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/PXMHT5K6ZBFMNHEAUSYWWSDZ2GFUIOXQ/


Re: [ovirt-users] ETL service sampling has encountered an error. Please consult the service log for more details.

2017-04-23 Thread Shirly Radco
Hi Nicolas,

These are DWH error. I see in engine.log that around the time you are
referring to there are many connectipn errors to the postgresql database.
When DWH is unable to connect to the engine database and error is sent to
the ovirt-engine-history.log.
I don't see the error you are referring to in the engine.log.

The DWH samples every 20 second for statistics that are used to create the
ovirt dashboards.

If there is a problem with the postgres connection then the hourly
aggregation of the samples failed too.

I see that first postgres connection error started at 2017-04-20
11:06:04,162+01 ERROR .
I see the problem with the database is still not fixed.
Please check for the reason your database connection keeps closing.

Best regards,


--

SHIRLY RADCO

BI SOFTWARE ENGINEER,

Red Hat Israel 

sra...@redhat.com
 
 


On Thu, Apr 20, 2017 at 2:12 PM,  wrote:

> Hi,
>
> We're using oVirt 4.1.1.8 (upgraded yesterday) and since it has been
> showing some strange errors I'd like to know about:
>
> ETL service sampling has encountered an error. Please consult the
> service log for more details.
>
> I'm attaching the engine log FWIW. This message showed up at 11:07:04 but
> in the log I see the exceptions started 1 minute ago.
>
> Also, at 11:07:58 this event showed up:
>
> Engine server is not responding.
>
> And at 11:12:58 it seems to recover:
>
> Engine server is up and running.
>
> At 12:00:00 this message showed up:
>
> ETL service aggregation to hourly tables has encountered an error.
> Please consult the service log for more details.
>
> But relative to this one I see nothing in logs.
>
> Could someone clarify what these error messages mean?
>
> Thanks.
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] ETL service sampling has encountered an error. Please consult the service log for more details.

2017-04-23 Thread Nicolás

Any ideas on this?

Thanks.

El 20/04/17 a las 12:12, nico...@devels.es escribió:

Hi,

We're using oVirt 4.1.1.8 (upgraded yesterday) and since it has been 
showing some strange errors I'd like to know about:


ETL service sampling has encountered an error. Please consult the 
service log for more details.


I'm attaching the engine log FWIW. This message showed up at 11:07:04 
but in the log I see the exceptions started 1 minute ago.


Also, at 11:07:58 this event showed up:

Engine server is not responding.

And at 11:12:58 it seems to recover:

Engine server is up and running.

At 12:00:00 this message showed up:

ETL service aggregation to hourly tables has encountered an error. 
Please consult the service log for more details.


But relative to this one I see nothing in logs.

Could someone clarify what these error messages mean?

Thanks.


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] ETL service sampling has encountered an error

2017-02-23 Thread Shirly Radco
Hi,

This in a known issue.
We have a bug targeted to 4.2 for this.
https://bugzilla.redhat.com/show_bug.cgi?id=137
This error means that the dwh tried to reach the engine database and for
some reason the heartbeat of the engine did not update since the last dwh
sync.


Best regards,

Shirly Radco

BI Software Engineer
Red Hat Israel Ltd.
34 Jerusalem Road
Building A, 4th floor
Ra'anana, Israel 4350109


On Tue, Feb 21, 2017 at 11:17 PM, Christophe TREFOIS <
christophe.tref...@uni.lu> wrote:

> Dear all,
>
> Since upgrading to 4.0.6, we see this error in the engine events tab and
> were wondering how to fix it.
>
> ovirt-engine-dwh.log
>
> —
> 2017-02-20 22:01:00|zPHx4s|6nXgTp|ygadfe|OVIRT_ENGINE_DWH|
> SampleTimeKeepingJob|Default|5|tWarn|tWarn_1|Can not sample data, oVirt
> Engine is not updating the statistics. Please check your oVirt Engine
> status.|9704
> —
>
> And in engine log the only error I see (repeated) since yesterday is
>
> 65366:2017-02-21 22:13:34,357 ERROR [org.ovirt.engine.core.
> vdsbroker.vdsbroker.VdsBrokerObjectsBuilder] (DefaultQuartzScheduler10)
> [2dc75859] null architecture type, replacing with x86_64, %s
>
> Any advice on where to start is welcome.
>
> Kind regards,
> Christophe
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] ETL service sampling has encountered an error

2017-02-21 Thread Christophe TREFOIS
Dear all,

Since upgrading to 4.0.6, we see this error in the engine events tab and were 
wondering how to fix it.

ovirt-engine-dwh.log

—
2017-02-20 
22:01:00|zPHx4s|6nXgTp|ygadfe|OVIRT_ENGINE_DWH|SampleTimeKeepingJob|Default|5|tWarn|tWarn_1|Can
 not sample data, oVirt Engine is not updating the statistics. Please check 
your oVirt Engine status.|9704
—

And in engine log the only error I see (repeated) since yesterday is

65366:2017-02-21 22:13:34,357 ERROR 
[org.ovirt.engine.core.vdsbroker.vdsbroker.VdsBrokerObjectsBuilder] 
(DefaultQuartzScheduler10) [2dc75859] null architecture type, replacing with 
x86_64, %s

Any advice on where to start is welcome.

Kind regards,
Christophe
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users