Re: Near Real time analytics with Spark and tokenization

2017-10-15 Thread Jörn Franke
Can’t you cache the token vault in a caching solution , such as Ignite? The lookup of single tokens would be really fast. About what volumes one talks about? I assume you refer to PCI DSS, so security might be an important aspect which might be not that easy to achieve with vault-less tokenizat

Near Real time analytics with Spark and tokenization

2017-10-15 Thread Mich Talebzadeh
Hi, When doing micro-batch streaming of trade data we need to tokenization certain columns before data lands in Hbase with Lambda architecture. There are two ways of tokenizing data, vault based and vault less using something like Protegrity tokenization. The vault-based tokenization requires cl