Hello,
I am trying to deploy Drill on production systems and am facing a problem
with a JOIN between a Drill table and a Mongo table.
I'll explain the problem using some sample data.
*users*
{"_id": "36479360", "email": "[email protected]"}
{"_id": "36479361", "email": "[email protected]"}
{"_id": "36479362", "email": "[email protected]"}
{"_id": "36479363", "email": "[email protected]"}
{"_id": "36479364", "email": "[email protected]"}
*actions*
{"_id": "374629", "email": "[email protected]", "action_type": ""}
{"_id": "374630", "email": "[email protected]", "action_type": "view"}
{"_id": "374631", "email": "[email protected]", "action_type": "abandoned"}
{"_id": "374632", "email": "[email protected]", "action_type": "view"}
{"_id": "374633", "email": "[email protected]", "action_type": "abandoned"}
Now, when I JOIN these tables and query the data, the results differ from
what I expect. See the query below:
drill> SELECT DISTINCT U.email
       FROM actions AS A
       JOIN users AS U
         ON CAST(A.email AS VARCHAR) = CAST(U.email AS VARCHAR)
       WHERE A.action_type = 'view';
+-----------------------+
| email |
+-----------------------+
| [email protected] |
| [email protected] |
| [email protected] |
| [email protected] |
| [email protected] |
+-----------------------+
I expect only users T2 and T4 to appear, but all the users are returned by
the query. What am I missing here?
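To make the expectation concrete, here is a minimal sketch that runs the same query against an in-memory SQLite database from Python. SQLite only stands in for Drill and Mongo to illustrate the inner-join semantics I expect; it says nothing about how Drill plans the query. The addresses are hypothetical placeholders (t1@example.com and so on), and I am assuming that the n-th action row belongs to the n-th user.

```python
import sqlite3

# Hypothetical placeholder addresses standing in for the real sample data.
users = [
    ("36479360", "t1@example.com"),
    ("36479361", "t2@example.com"),
    ("36479362", "t3@example.com"),
    ("36479363", "t4@example.com"),
    ("36479364", "t5@example.com"),
]
# Assumption: the n-th action row belongs to the n-th user.
actions = [
    ("374629", "t1@example.com", ""),
    ("374630", "t2@example.com", "view"),
    ("374631", "t3@example.com", "abandoned"),
    ("374632", "t4@example.com", "view"),
    ("374633", "t5@example.com", "abandoned"),
]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (_id TEXT, email TEXT)")
conn.execute("CREATE TABLE actions (_id TEXT, email TEXT, action_type TEXT)")
conn.executemany("INSERT INTO users VALUES (?, ?)", users)
conn.executemany("INSERT INTO actions VALUES (?, ?, ?)", actions)

# Same query as above, without the drill> prompt.
query = """
SELECT DISTINCT U.email
FROM actions AS A
JOIN users AS U
  ON CAST(A.email AS VARCHAR) = CAST(U.email AS VARCHAR)
WHERE A.action_type = 'view'
"""
expected = sorted(row[0] for row in conn.execute(query))
print(expected)  # ['t2@example.com', 't4@example.com']
```

With this data only the two users who have a 'view' action come back, which is the result I expected from Drill as well.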
Regards,
--
Milind Utsav
+91-9742565861