Author: sejer Date: Wed Feb 11 08:32:29 2026 New Revision: 82420 Log: Staging of rc2 of Wayang 1.1.1
Added: dev/wayang/1.1.1/ dev/wayang/1.1.1/rc2/ dev/wayang/1.1.1/rc2/README.md dev/wayang/1.1.1/rc2/RELEASE_NOTES dev/wayang/1.1.1/rc2/apache-wayang-1.1.1-source-release.zip (contents, props changed) dev/wayang/1.1.1/rc2/apache-wayang-1.1.1-source-release.zip.asc dev/wayang/1.1.1/rc2/apache-wayang-1.1.1-source-release.zip.sha512 Added: dev/wayang/1.1.1/rc2/README.md ============================================================================== --- /dev/null 00:00:00 1970 (empty, because file is newly added) +++ dev/wayang/1.1.1/rc2/README.md Wed Feb 11 08:32:29 2026 (r82420) @@ -0,0 +1,280 @@ +<!-- + - Licensed to the Apache Software Foundation (ASF) under one + - or more contributor license agreements. See the NOTICE file + - distributed with this work for additional information + - regarding copyright ownership. The ASF licenses this file + - to you under the Apache License, Version 2.0 (the + - "License"); you may not use this file except in compliance + - with the License. You may obtain a copy of the License at + - + - http://www.apache.org/licenses/LICENSE-2.0 + - + - Unless required by applicable law or agreed to in writing, + - software distributed under the License is distributed on an + - "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY + - KIND, either express or implied. See the License for the + - specific language governing permissions and limitations + - under the License. 
+ --> + +# Apache Wayang™ <img align="right" width="128px" src="https://wayang.apache.org/img/wayang.png" alt="Wayang Logo"> + +## The first open-source cross-platform data processing system + +[](https://img.shields.io/maven-central/v/org.apache.wayang/wayang-core.svg) +[](http://www.apache.org/licenses/LICENSE-2.0) +[]() + + + + +[](https://twitter.com/intent/tweet?text=Apache%20Wayang%20enables%20cross%20platform%20data%20processing,%20star%20it%20via:%20&url=https://github.com/apache/incubator-wayang&via=apachewayang&hashtags=dataprocessing,bigdata,analytics,hybridcloud,developers) [](https://www.reddit.com/r/ApacheWayang/) +## Table of contents + * [Description](#description) + * [Quick Guide for Running Wayang](#quick-guide-for-running-wayang) + * [Quick Guide for Developing with Wayang](#quick-guide-for-developing-with-wayang) + * [Installing Wayang](#installing-wayang) + + [Requirements at Runtime](#requirements-at-runtime) + + [Validating the installation](#validating-the-installation) + * [Getting Started](#getting-started) + + [Prerequisites](#prerequisites) + + [Building](#building) + * [Running the tests](#running-the-tests) + * [Example Applications](#example-applications) + * [Built With](#built-with) + * [Contributing](#contributing) + * [Authors](#authors) + * [License](#license) + +## Description + +In contrast to traditional data processing systems that provide one dedicated execution engine, Apache Wayang can transparently and seamlessly integrate multiple execution engines and use them to perform a single task. We call this *cross-platform data processing*. In Wayang, users can specify any data processing application using one of Wayang's APIs and then Wayang can choose the data processing platform(s), e.g., Postgres or Apache Spark, that best fits the application. Finally, Wayang will orchestrate the execution, thereby hiding the different platform-specific APIs and coordinating inter-platform communication. 
+ +Apache Wayang aims at freeing data engineers and software developers from the burden of learning all the different data processing systems, their APIs, strengths and weaknesses; the intricacies of coordinating and integrating different processing platforms; and the inflexibility of being tied to a fixed set of processing platforms. As of now, Wayang has built-in support for the following processing platforms: +- [Java Streams](https://docs.oracle.com/javase/8/docs/api/java/util/stream/Stream.html) +- [Apache Spark](https://spark.apache.org/) +- [Apache Flink](https://flink.apache.org/) +- [Apache Giraph](https://giraph.apache.org/) +- [GraphChi](https://github.com/GraphChi/graphchi-java) +- [Postgres](http://www.postgresql.org) +- [SQLite](https://www.sqlite.org/) +- [Apache Kafka](https://kafka.apache.org) +- [TensorFlow](https://www.tensorflow.org/) + +Apache Wayang can be used via the following APIs: +- Java (Scala-like) +- Scala +- SQL +- Java native (recommended only for low-level development) + +Apache Wayang provides a flexible architecture which enables easy addition of new operators and data processing platforms without requiring any changes to the internals of the system. For details on how to add new operators, see [here](https://wayang.apache.org/docs/guide/adding-operators). + +## Quick Guide for Running Wayang + +For a quick guide on how to run WordCount see [here](guides/tutorial.md). + +### Spark Dataset / DataFrame pipelines + +Wayang’s Spark platform can now execute end-to-end pipelines on Spark `Dataset[Row]` (aka DataFrames). This is particularly useful when working with lakehouse-style storage (Parquet/Delta) or when you want to plug Spark ML stages into a Wayang plan without repeatedly falling back to RDDs. + +To build a Dataset-backed pipeline: + +1. 
**Use the Dataset-aware plan builder APIs.** + - `PlanBuilder.readParquet(..., preferDataset = true)` (or `JavaPlanBuilder.readParquet(..., ..., true)`) reads Parquet files directly into a Dataset channel. + - `DataQuanta.writeParquet(..., preferDataset = true)` writes a Dataset channel without converting it back to an RDD. +2. **Keep operators dataset-compatible.** Most operators continue to work unchanged; if an operator explicitly prefers RDDs, Wayang will insert the necessary conversions automatically (at an additional cost). Custom operators can expose `DatasetChannel` descriptors to stay in the dataframe world. +3. **Let the optimizer do the rest.** The optimizer now assigns a higher cost to Dataset↔RDD conversions, so once you opt into Dataset sources/sinks the plan will stay in Dataset form by default. + +No extra flags are required; just opt into the Dataset-based APIs where you want dataframe semantics. If you see unexpected conversions in your execution plan, check that the upstream/downstream operators you use can consume `DatasetChannel`s; otherwise Wayang will insert a conversion operator for you. + +## Quick Guide for Developing with Wayang + +For a quick guide on how to use Wayang in your Java/Scala project see [here](guides/develop-with-Wayang.md). + +## Installing Wayang + +You first have to build the binaries as shown [here](guides/tutorial.md). +Once you have the binaries built, follow these steps to install Wayang: + +```shell +tar -xvf wayang-1.0.1-SNAPSHOT.tar.gz +cd wayang-1.0.1-SNAPSHOT +``` + +On Linux +```shell +echo "export WAYANG_HOME=$(pwd)" >> ~/.bashrc +echo 'export PATH=${PATH}:${WAYANG_HOME}/bin' >> ~/.bashrc +source ~/.bashrc +``` +On macOS +```shell +echo "export WAYANG_HOME=$(pwd)" >> ~/.zshrc +echo 'export PATH=${PATH}:${WAYANG_HOME}/bin' >> ~/.zshrc +source ~/.zshrc +``` + +### Requirements at Runtime + +Apache Wayang relies on external execution engines and Java to function correctly. 
Below are the updated runtime requirements: + +- **Java 17**: Make sure `JAVA_HOME` is correctly set to your Java 17 installation. +- **Apache Spark 3.4.4**: Compatible with Scala 2.12. Set the `SPARK_HOME` environment variable. +- **Apache Hadoop 3+**: Set the `HADOOP_HOME` environment variable. + +> 🛠️ **Note:** When using Java 17, you _must_ add JVM flags to allow Wayang and Spark to access internal Java APIs, or you will encounter `IllegalAccessError`. See below. + +### Validating the installation + +To execute your first application with Apache Wayang, you need to execute your program with the 'wayang-submit' command: + +```shell +bin/wayang-submit org.apache.wayang.apps.wordcount.Main java file://$(pwd)/README.md +``` + +### ⚙️ Java 17 Compatibility + +When running Wayang applications using Java 17 (especially with Spark), you must add JVM flags to open specific internal Java modules. These flags resolve access issues with `sun.nio.ch.DirectBuffer` and others. + +Update your `wayang-submit` (wayang-assembly/target/wayang-1.0.1-SNAPSHOT/bin/wayang-submit) script (or command) with: + +```bash +eval "$RUNNER \ + --add-exports=java.base/sun.nio.ch=ALL-UNNAMED \ + --add-opens=java.base/java.nio=ALL-UNNAMED \ + --add-opens=java.base/java.lang=ALL-UNNAMED \ + --add-opens=java.base/java.util=ALL-UNNAMED \ + --add-opens=java.base/java.io=ALL-UNNAMED \ + --add-opens=java.base/java.lang.reflect=ALL-UNNAMED \ + --add-opens=java.base/java.util.concurrent=ALL-UNNAMED \ + --add-opens=java.base/java.net=ALL-UNNAMED \ + --add-opens=java.base/java.lang.invoke=ALL-UNNAMED \ + $FLAGS -cp \"${WAYANG_CLASSPATH}\" $CLASS ${ARGS}" +``` + +## Getting Started + +Wayang is available via Maven Central. 
To use it with Maven, include the following code snippet in your POM file: +```xml +<dependency> + <groupId>org.apache.wayang</groupId> + <artifactId>wayang-***</artifactId> + <version>1.0.0</version> +</dependency> +``` +Note the `***`: Wayang ships with multiple modules that can be included in your app, depending on how you want to use it: +* `wayang-core`: provides core data structures and the optimizer (required) +* `wayang-basic`: provides common operators and data types for your apps (recommended) +* `wayang-api-scala-java`: provides an easy-to-use Scala and Java API to assemble Wayang plans (recommended) +* `wayang-java`, `wayang-spark`, `wayang-graphchi`, `wayang-sqlite3`, `wayang-postgres`: adapters for the various supported processing platforms +* `wayang-profiler`: provides functionality to learn operator and UDF cost functions from historical execution data + +> **NOTE:** The module `wayang-api-scala-java` is intended to be used with Java 11 and Scala 2.12. + +For the sake of version flexibility, you still have to declare the Hadoop (`hadoop-hdfs` and `hadoop-common`) and Spark (`spark-core` and `spark-graphx`) versions of your choice in the POM file. + +In addition, you can obtain the most recent snapshot version of Wayang via the Apache snapshot repository. Just include: +```xml +<repositories> + <repository> + <id>apache-snapshots</id> + <name>Apache Foundation Snapshot Repository</name> + <url>https://repository.apache.org/content/repositories/snapshots</url> + </repository> +</repositories> +``` + +### Prerequisites +Apache Wayang is built with Java 17 and Scala 2.12. However, to run Apache Wayang it is sufficient to have just Java 17 installed. Please also consider that processing platforms employed by Wayang might have further requirements. +``` +Java 17 +Scala 2.12.17 +Spark 3.4.4, Compatible with Scala 2.12. 
+Maven +``` + +> **NOTE:** On Windows, you need to set the `HADOOP_HOME` variable to a directory that provides winutils.exe. An unofficial way to obtain it is [this repository](https://github.com/steveloughran/winutils); alternatively, you can build your own winutils.exe by following the instructions in that repository. You may also need to install [msvcr100.dll](https://www.microsoft.com/en-us/download/details.aspx?id=26999). + +> **NOTE:** Make sure that the `JAVA_HOME` environment variable points to a Java 17 installation: the prerequisite checker script currently supports versions up to Java 17 and checks the latest version of Java if you have a higher version installed. On Linux, it is preferable to use the export JAVA_HOME method inside the project folder. It is also recommended to run `./mvnw clean install` before opening the project in IntelliJ. + + +### Building + +If you need to rebuild Wayang, e.g., to use a different Scala version, you can simply do so via Maven: + +1. Adapt the version variables (e.g., `spark.version`) in the main `pom.xml` file. +2. Build Wayang with the adapted versions. + ```shell + git clone https://github.com/apache/incubator-wayang.git + cd incubator-wayang + ./mvnw clean install -DskipTests + ``` +> **NOTE:** If you receive an error about not finding `MathExBaseVisitor`, then the problem might be that you are trying to build from IntelliJ, without Maven. `MathExBaseVisitor` is generated code, and a Maven build should generate it automatically. + +> **NOTE:** In the current Maven setup, Wayang supports Java 17. The default Scala version is 2.12.17, which is compatible with Java 17. Ensure that your Spark distribution is also built with Scala 2.12 (e.g., `spark-3.4.4-bin-hadoop3-scala2.12`). + +> **NOTE:** Compiling and testing the code requires Hadoop to be installed on your machine. + +> **NOTE:** Note the `standalone` profile, which fixes the Hadoop and Spark versions so that Wayang apps do not explicitly need to declare the corresponding dependencies. 
+ +> **NOTE**: When running applications (e.g., WordCount) with Java 17, you must pass additional flags to allow internal module access: + +>--add-exports=java.base/sun.nio.ch=ALL-UNNAMED \ +--add-opens=java.base/java.nio=ALL-UNNAMED \ +--add-opens=java.base/java.lang=ALL-UNNAMED \ +--add-opens=java.base/java.util=ALL-UNNAMED \ +--add-opens=java.base/java.io=ALL-UNNAMED \ +--add-opens=java.base/java.lang.reflect=ALL-UNNAMED \ +--add-opens=java.base/java.util.concurrent=ALL-UNNAMED \ +--add-opens=java.base/java.net=ALL-UNNAMED \ +--add-opens=java.base/java.lang.invoke=ALL-UNNAMED \ + +> +> Also, note the `distro` profile, which assembles a binary Wayang distribution. +To activate these profiles, you need to specify them when running Maven, i.e., + +```shell +./mvnw clean install -DskipTests -P<profile name> +``` + +## Running the tests +In the incubator-wayang root folder, run: +```shell +./mvnw test +``` + +## Example Applications +You can see examples of how to start using Wayang [here](guides/wayang-examples.md). + +## Built With + +* [Java 17](https://www.oracle.com/java/technologies/javase/17-0-14-relnotes.html) +* [Scala 2.12.17](https://www.scala-lang.org/download/2.12.17.html) +* [Maven](https://maven.apache.org/) + +## Contributing +Before submitting a PR, please read the Apache Wayang contributing guidelines [here](CONTRIBUTING.md). + +There is also a guide on how to compile your code [here](guides/develop-in-Wayang.md). +## Authors +The list of [contributors](https://github.com/apache/incubator-wayang/graphs/contributors). + +## License +All files in this repository are licensed under the Apache Software License 2.0 + +Copyright 2020 - 2026 The Apache Software Foundation. + +Licensed under the Apache License, Version 2.0 (the "License"); +you may not use this file except in compliance with the License. 
+You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + +Unless required by applicable law or agreed to in writing, software +distributed under the License is distributed on an "AS IS" BASIS, +WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +See the License for the specific language governing permissions and +limitations under the License. + +## Acknowledgements +The [Logo](https://wayang.apache.org/img/wayang.png) was donated by Brian Vera. Added: dev/wayang/1.1.1/rc2/RELEASE_NOTES ============================================================================== --- /dev/null 00:00:00 1970 (empty, because file is newly added) +++ dev/wayang/1.1.1/rc2/RELEASE_NOTES Wed Feb 11 08:32:29 2026 (r82420) @@ -0,0 +1,117 @@ +============================================================== +Apache Wayang 1.1.1 +============================================================== +This is a minor release of Apache Wayang. It is the first release as a top-level project. +This release includes mainly bug fixes and minor improvements: + +- REST server can now output the error message in case of failure +- Spark executor can submit dataframe queries to Spark and not only RDD jobs +- Security fixes on the object file sources for Java platform and Spark +- Bug fix in explain( ) to output the right plan +- Changes in documentation to not include "incubator" +- Support for bounded streaming textfile source, DataStream- map, join and collection sink in Flink platform + +============================================================== +Apache Wayang (incubating) 1.1.0 +============================================================== + +This is a minor release of Apache Wayang (incubating). +Based on this release the project aims to move out of incubation. 
+ +This release supports the following new features: + +general: +- migration to Java 17 +- migration to junit5 for tests +- better explain & json utils + +api-sql: +- sort operator +- cross joins +- joins with multiple equals conditions +- search arguments in filter WHERE clauses +- '+/-' algebra in filter statements +- calcite rules for more optimized SQL trees +- better support for sql-types for java udfs +- better serializability for java-based platforms + +wayang-basic: +- Parquet source operator +- LinearSVC operator +- LogisticRegression operator +- DecisionTreeRegression operator with time series +- GoogleCloudStorageSource operator +- NN Batch normalisation +- NN Convolutions +- NN MSE loss + +api-python: +- Parquet source operator + +wayang-api-scala-java: +- api for deep learning + +============================================================== +Apache Wayang (incubating) 1.0.0 +============================================================== + +This is a major release of Apache Wayang (incubating). +Based on this release the project wants to continue the path towards becoming an +Apache TLP. 
+ +This release supports the following new features: +- Support for Tensorflow Java as a platform (Note: this platform is compatible with Apple Silicon chips only when used with a matching ARM-based Java version) +- Support for Kafka source/sink +- Python API +- MultiContext for federated data processing +- Explain utility for displaying execution plans +- Abstracted cost model to be pluggable +- Retired Java 8/Scala 2.11 +- Added Join operator for JDBC and Postgres platforms +- Support for any JDBC-supported database +- Added ML4all abstraction + +Several bug fixes are also included: +- Aligned Flink versions +- Fixed CardinalityRepository to sample measured cardinalities +- Copied cost models for different configurations + + +============================================================== +Apache Wayang (incubating) 0.7.1 +============================================================== + +This is a patch/bugfix release of Apache Wayang (incubating). + +This release supports the following frameworks: +- This patch version fixes a problem in the artifact gpg encrypting key. + +Previous updates that are included in this release: +- Apache Flink v1.7.1 +- Apache Giraph v1.2.0-hadoop2 +- GraphChi v0.2.2 (only available with scala 11.x) +- Java Streams (version depends on the java version) +- JDBC-Template +- Postgres v9.4.1208 (Implementation JDBC-Template) +- Apache Spark v3.1.2 (scala 2.12) and v2.4.8 (scala 2.11) +- SQLite3 v3.8.11.2 (implementation JDBC-Template) + +NOTE: depending on the scala version the list of the supported platforms available +could be different. 
+ + +New Features +------------ +[WAYANG-34] Terasort +[WAYANG-IDE] Recognition of Scala code in IDE +[WAYANG-8] Inclusion of Python as platform of processing +[WAYANG-211] JVM-platform inside of Python-API +Grep Benchmark +Test classes and operators for Flink +Support for S3 files locations +Initial version of the SQL API and suppport for joins + +Basic SQL support + +Incompatible changes +-------------------- \ No newline at end of file Added: dev/wayang/1.1.1/rc2/apache-wayang-1.1.1-source-release.zip ============================================================================== Binary file. No diff available. Added: dev/wayang/1.1.1/rc2/apache-wayang-1.1.1-source-release.zip.asc ============================================================================== --- /dev/null 00:00:00 1970 (empty, because file is newly added) +++ dev/wayang/1.1.1/rc2/apache-wayang-1.1.1-source-release.zip.asc Wed Feb 11 08:32:29 2026 (r82420) @@ -0,0 +1,16 @@ +-----BEGIN PGP SIGNATURE----- + +iQIzBAABCgAdFiEEftBXAsUMc7Glv9YoyBFbNgQIqRUFAmmMN9cACgkQyBFbNgQI +qRV7dw//dUDNDXUnmS+aSOUMi3vEEq31uPNUVCeUaaFjoVzTs3DRqDItJTlWglg3 +vD37UG+7VbaPxwL+P11cZZpdXTUMCtdwpXly96VcFLrqCw0iJaWsVkTg4fndzWJe +ckRPCYEVxRkvUaWlA3yhpA3YvCgGWuwhQWcMPI+TCCb2Fw09b/kAT/Bd5x33ZQSg +pO+4gn7SXlPgwmEgoDCetxaqwWD+Ka0F3YhG6ees3Mt1xwIlX6tIA5ye0HDjSpRC +tOa4RiUrAnXqF2AvSezzBU2xkJuskt55g3XgM7eYKA7BT/q0OwC8rlLE11R3HGS/ +M0HWsPE7guRDyweWTGwElgWgYwK3LHRNH+W/QOAef/joVkRnESGFDY1UicrToOgW +tmFiX853GTt8Whkoa2rFse38/05o0jsuPwXMN5t0Crdhm1+zjlwFT93jtiDcxKnc +pBFRcZZ5t86FMgVVA0aa0FoCcN96jg7qOd690PoBCHvEacizOEzzkgJvuugS/YrO +A1pZ5tU+OFHwz6xKwK7XgNFB4GItm6r4sur93SnkYtsi5x0ITYn0zPsDB3ekcvOI +AgExxMuWgGnHQHKFK1vSqK8Fx59jUWkOV26IP3tek/SWepksBcXnTcBakf7MGgIY +gfCQp+/pxpo1CcqjdqkzYz+BQeTgYxjvhkzgNM8AtcbDBypQlm0= +=xeXv +-----END PGP SIGNATURE----- Added: dev/wayang/1.1.1/rc2/apache-wayang-1.1.1-source-release.zip.sha512 ============================================================================== --- /dev/null 00:00:00 1970 (empty, 
because file is newly added) +++ dev/wayang/1.1.1/rc2/apache-wayang-1.1.1-source-release.zip.sha512 Wed Feb 11 08:32:29 2026 (r82420) @@ -0,0 +1 @@ +190ac0a320d94e1e1798941192cc6e152a4f5d041475b9e303ca07121399e19771075b49a92da0de7a4f63b781f5e1bec74103effdd7f346fb3f7d7692ba57f5 \ No newline at end of file
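
The staged `.sha512` file above holds only the bare hex digest, with no file name and no trailing newline, so `sha512sum -c` cannot read it directly. A minimal sketch of a manual comparison follows. The helper name `verify_sha512` is hypothetical, and the sketch assumes GNU coreutils `sha512sum` is on the `PATH` (on macOS, `shasum -a 512` would be the likely substitute).

```shell
# verify_sha512 FILE DIGEST_FILE   (hypothetical helper, not part of the release)
# Prints "OK" and returns 0 when the SHA-512 of FILE equals the hex digest
# stored in DIGEST_FILE; prints "MISMATCH" and returns 1 otherwise.
verify_sha512() {
    # The digest file carries only the hex string, so strip any whitespace.
    expected=$(tr -d '[:space:]' < "$2")
    # Assumption: GNU coreutils sha512sum; its first field is the hex digest.
    actual=$(sha512sum "$1" | cut -d ' ' -f 1)
    if [ "$expected" = "$actual" ]; then
        echo "OK"
    else
        echo "MISMATCH"
        return 1
    fi
}
```

With the three staged files in the current directory, `verify_sha512 apache-wayang-1.1.1-source-release.zip apache-wayang-1.1.1-source-release.zip.sha512` would check the digest. The detached signature can be checked separately with `gpg --verify apache-wayang-1.1.1-source-release.zip.asc apache-wayang-1.1.1-source-release.zip`, provided the signer's public key has already been imported.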
