Hi all, I am looking for a good reference for processing data prior to training a model using APACHE BEAM *Phase1:* 30K+ columns of features, partitioned between big query tables - each of 10K, and 100K+ rows.
*Phase 2:* more columns and more rows any reference is highly appreciated. Thank you, Eila -- Eila www.orielresearch.org https://www.meetup.com/Deep-Learning-In-Production/
