I have two tables A and B that have a column id. I want to get identifiers from A that are not in B. The obvious way is:
SELECT id FROM A WHERE id NOT IN (SELECT id FROM B)
Unfortunately, Hive does not support exists or subqueries. Is there a way to achieve the above using compounds?
I thought about the following
SELECT A.id FROM A,B WHERE A.id<>B.id
But it looks like this will return an integer A, since there always exists an id in B that is not equal to any id of A.
source share