[unomi] branch master updated: UNOMI-797 Fixes in migration documentation (#647)

shuber Fri, 01 Sep 2023 03:33:54 -0700

This is an automated email from the ASF dual-hosted git repository.

shuber pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/unomi.git



The following commit(s) were added to refs/heads/master by this push:
     new 98e4d87c8 UNOMI-797 Fixes in migration documentation (#647)
98e4d87c8 is described below

commit 98e4d87c827b05e817e0538793a8a34e5434a00a
Author: Serge Huber <shu...@jahia.com>
AuthorDate: Fri Sep 1 12:31:54 2023 +0200

    UNOMI-797 Fixes in migration documentation (#647)
    
    * UNOMI-797 Fixes in migration documentation
    
    * UNOMI-797 Fixes in migration documentation
    - Simplified the text further based on the review.
    
    * UNOMI-797 Fixes in migration documentation
    - Minor text improvements.
---
 .../asciidoc/migrations/migrate-1.6-to-2.0.adoc    | 57 +++++++++++++---------
 1 file changed, 33 insertions(+), 24 deletions(-)

diff --git a/manual/src/main/asciidoc/migrations/migrate-1.6-to-2.0.adoc b/manual/src/main/asciidoc/migrations/migrate-1.6-to-2.0.adoc
index 15133a229..b175f0e64 100644
--- a/manual/src/main/asciidoc/migrations/migrate-1.6-to-2.0.adoc
+++ b/manual/src/main/asciidoc/migrations/migrate-1.6-to-2.0.adoc
@@ -22,58 +22,67 @@ There are two main steps in preparing your migration to Apache Unomi 2.0:
 
 === Updating applications consuming Unomi
 
-Since Apache Unomi is an engine, you've probably built multiple applications 
consuming its APIs, you might also have built extensions directly running in 
Unomi. 
+Since Apache Unomi is an engine, you've probably built multiple applications 
consuming its APIs, you might also have built extensions directly running in 
Unomi.
 
-As you begin updating applications consuming Apache Unomi, it is generally a 
good practice to <<Enable debug mode>>. 
+As you begin updating applications consuming Apache Unomi, it is generally a 
good practice to <<_enabling_debug_mode,enable debug mode>>.
 Doing so will display any errors when processing events (such as JSON Schema 
validations), and will provide useful indications towards solving issues.
 
 ==== Data Model changes
 
-There has been changes to Unomi Data model, please make sure to review those 
in the << what_s_new>> section of the user manual.
+There has been changes to Unomi Data model, please make sure to review those 
in the <<_whats_new_in_apache_unomi_2_0,What's new in Unomi 2>> section of the 
user manual.
 
 ==== Create JSON schemas
 
 Once you updated your applications to align with Unomi 2 data model, the next 
step will be to create the necessary JSON Schemas.
 
-Any event (and more generally, any object) received through Unomi public 
endpoints do require a valid JSON schema. 
-Apache Unomi ships, out of the box, with all of the necessary JSON Schemas for 
its own operation but you will need to create schemas for any custom event you 
may be using.
+Any event (and more generally, any object) received through Unomi public 
endpoints do require a valid JSON schema.
+Apache Unomi ships, out of the box, with all of the necessary JSON Schemas for 
its own operation as well as all event types generated from the Apache Unomi 
Web Tracker but you will need to create schemas for any custom event you may be 
using.
 
-When creating your new schemas, reviewing debug messages in the logs (using: 
`log:set DEBUG org.apache.unomi.schema.impl.SchemaServiceImpl` in Karaf 
console), 
-will point to errors in your schemas or will help you diagnose why the events 
are not being accepted.
+When creating your new schemas, there are multiple ways of testing them:
+
+- Using a the event validation API endpoint available at the URL : 
`/cxs/jsonSchema/validateEvent`
+- Using debug logs when sending events using the usual ways (using the 
`/context.json` or `/eventcollector` endpoints)
+
+Note that in both cases it helps to activate the debug logs, that may be 
activated either:
+
+- Through the ssh Karaf console command : `log:set DEBUG 
org.apache.unomi.schema.impl.SchemaServiceImpl`
+- Using the UNOMI_LOGS_JSONSCHEMA_LEVEL=DEBUG environment variable and then 
restarting Apache Unomi. This is especially useful when using Docker Containers.
+
+Once the debug logs are active, you will see detailed error messages if your 
events are not matched with any deployed JSON schema.
 
 Note that it is currently not possible to modify or surcharge an existing 
system-deployed JSON schema via the REST API. It is however possible to deploy 
new schemas and manage them through the REST API on the `/cxs/jsonSchema` 
endpoint.
-If you are currently using custom properties on an Apache Unomi-provided event 
type, 
+If you are currently using custom properties on an Apache Unomi-provided event 
type,
 you will need to either change to use a new custom eventType and create the 
corresponding schema or to create a Unomi schema extension. You can find more 
details in the <<JSON schemas,JSON Schema>> section of this documentation.
 
-You can use, as a source of inspiration for creating new schemas, Apache Unomi 
2.0 schema located at: 
+You can use, as a source of inspiration for creating new schemas, Apache Unomi 
2.0 schema located at:
  
https://github.com/apache/unomi/tree/master/extensions/json-schema/services/src/main/resources/META-INF/cxs/schemas[extensions/json-schema/services/src/main/resources/META-INF/cxs/schemas].
 
-Finally, and although it is technically feasible, we recommend against creating permissive JSON Schemas allowing any event payload. This requires making sure that you don't allow open properties by using JSON schema keywords such as https://json-schema.org/understanding-json-schema/reference/object.html#unevaluated-properties[unevaluated properties]
+Finally, and although it is technically feasible, we recommend against creating permissive JSON Schemas allowing any event payload. This requires making sure that you don't allow undeclared properties by setting JSON schema keywords such as https://json-schema.org/understanding-json-schema/reference/object.html#unevaluated-properties[unevaluated properties] to `false`.
 
 === Migrating your existing data
 
 ==== Elasticsearch version and capacity
 
-While still using Unomi 1.6, the first step will be to upgrade your 
Elasticsearch to 7.17.5. 
+While still using Unomi 1.6, the first step will be to upgrade your 
Elasticsearch to 7.17.5.
 Documentation is available on https://www.elastic.co/guide/en/elasticsearch/reference/7.17/setup-upgrade.html[Elasticsearch's website].
 
-Your Elasticsearch cluster must have enough capacity to handle the migration. 
-At a minimum, the required capacity must be greater than the size of the 
dataset in production + the size of the largest index.
+Your Elasticsearch cluster must have enough capacity to handle the migration.
+At a minimum, the required capacity storage capacity must be greater than the 
size of the dataset in production + the size of the largest index. Any other 
settings should at least be as big as the source setup (preferably higher).
 
 ==== Migrate custom data
 
 Apache Unomi 2.0 knows how to migrate its own data from the new model to the 
old one, but it does not know how to migrate custom events you might be using 
in your environment.
 
-It relies on a set of groovy scripts to perform its data migration, 
-located in 
https://github.com/apache/unomi/tree/master/tools/shell-commands/src/main/resources/META-INF/cxs/migration[tools/shell-commands/src/main/resources/META-INF/cxs/migration],
 
+It relies on a set of groovy scripts to perform its data migration,
+located in https://github.com/apache/unomi/tree/master/tools/shell-commands/src/main/resources/META-INF/cxs/migration[tools/shell-commands/src/main/resources/META-INF/cxs/migration],
 these scripts are sorted alphabetically and executed sequentially when 
migration is started. You can use these scripts as a source of inspiration for 
creating your own.
 
 In most cases, migration steps consist of an Elasticsearch painless script 
that will handle the data changes.
 
-Depending of the volume of data, migration can be lengthy. By paying attention 
to when re-indexation is happening (triggered in the groovy scripts by 
`MigrationUtils.reIndex()`), 
+Depending of the volume of data, migration can be lengthy. By paying attention 
to when re-indexation is happening (triggered in the groovy scripts by 
`MigrationUtils.reIndex()`),
 you can find the most appropriate time for your scritps to be executed and 
avoid re-indexing the same indices multiple times.
 
-For example if you wanted to update profiles with custom data (currently 
migrated by `migrate-2.0.0-10-profileReindex.groovy`), you could create a 
script in position "09" that would only contain painless scripts without a 
reindexing step. 
+For example if you wanted to update profiles with custom data (currently 
migrated by `migrate-2.0.0-10-profileReindex.groovy`), you could create a 
script in position "09" that would only contain painless scripts without a 
reindexing step.
 The script in position "10" will introduce its own painless script, then 
trigger the re-indexation. This way you don't have to re-index the same indices 
twice.
 
 You can find existing painless scripts in https://github.com/apache/unomi/tree/master/tools/shell-commands/src/main/resources/requestBody/2.0.0[tools/shell-commands/src/main/resources/requestBody/2.0.0]
@@ -96,14 +105,14 @@ Before starting the migration, please ensure that:
 
 ===== Migration process overview
 
-The migration is performed by means of a dedicated Apache Unomi 2.0 node 
started in a particular migration mode. 
+The migration is performed by means of a dedicated Apache Unomi 2.0 node 
started in a particular migration mode.
 
 In a nutshell, the migration process will consist in the following steps:
 
 - Shutdown your Apache Unomi 1.6 cluster
-- Start an Apache Unomi 2.0 migration node
+- Start one Apache Unomi 2.0 node that will perform the migration (upon 
startup)
 - Wait for data migration to complete
-- Start you Apache Unomi 2.0 cluster
+- Start your Apache Unomi 2.0 cluster
 - (optional) Import additional JSON Schemas
 
 Each migration step maintains its execution state, meaning if a step fails you 
can fix the issue, and resume the migration from the failed step.
@@ -149,7 +158,7 @@ If there is a need for advanced configuratiion, the configuration file used by A
 
 ===== Migrate manually
 
-You can migrate manually using the Karaf console. 
+You can migrate manually using the Karaf console.
 
 After having started Apache Unomi 2.0 with the `./karaf` command, you will be 
presented with the Karaf shell.
 
@@ -166,10 +175,10 @@ At the end of the migration, you can start Unomi 2.0 as usual using: `unomi:star
 
 The migration can also be performed using Docker images, the migration itself 
can be started by passing a specific value to the `KARAF_OPTS` environment 
variable.
 
-In the context of this migration guide, we will asssume that: 
+In the context of this migration guide, we will asssume that:
 
  - Custom migration scripts are located in `/home/unomi/migration/scripts/`
- - Painless scripts, or more generally any migration assets are located in 
`/home/unomi/migration/assets/`, these scripts will be mounted under 
`/tmp/assets/` inside the Docker container. 
+ - Painless scripts, or more generally any migration assets are located in 
`/home/unomi/migration/assets/`, these scripts will be mounted under 
`/tmp/assets/` inside the Docker container.
 
 [source]
 ----
@@ -189,7 +198,7 @@ Using the above command, Unomi 2.0 will not start automatically at the end of th
 
 ===== Step by step migration with Docker
 
-Once your cluster is shutdown, performing the migration will be as simple as 
starting a dedicated docker container. 
+Once your cluster is shutdown, performing the migration will be as simple as 
starting a dedicated docker container.
 
 ===== Post Migration
