Ehab El Badrawi created BEAM-10858:
--------------------------------------
Summary: beam.io.jdbc not working on Google Cloud Dataflow
Key: BEAM-10858
URL: https://issues.apache.org/jira/browse/BEAM-10858
Project: Beam
Issue Type: Bug
Components: cross-language, extensions-java-gcp, runner-py-interactive
Affects Versions: 2.23.0
Environment: Apache Beam 2.25.0.dev0 Jupyter Notebook running on
Google Cloud Dataflow
Reporter: Ehab El Badrawi
Using the JDBC python wrapper to connect to an external server is failing on
Cloud Dataflow.
The error produced is;
RuntimeError:
/root/apache-beam-custom/packages/beam/sdks/java/extensions/schemaio-expansion-service/build/libs/beam-sdks-java-extensions-schemaio-expansion-service-2.25.0-SNAPSHOT.jar
not found. Please build the server with cd
/root/apache-beam-custom/packages/beam; ./gradlew
:sdks:java:extensions:schemaio-expansion-service:shadowJar
You can recreate the issue by running the below
{code:java}
from apache_beam.io import jdbc
jdbc.ReadFromJdbc(
table_name='[tablename]',
driver_class_name='com.mysql.cj.jdbc.Driver',
jdbc_url='jdbc:mysql://[host]:3306/[databasename]',
username='[username]',
password='[password]'){code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)