Redshift select selected returns duplicate values

Question

Redshift select selected returns duplicate values

I have a database where each property of an object is stored on a separate line. The attached query does not return individual values in the redshift database, but works as expected when testing in any mysql compatible database.

SELECT DISTINCT distinct_value 
FROM
( 
  SELECT
    uri,
    ( SELECT DISTINCT value_string 
      FROM `test_organization__app__testsegment` AS X 
      WHERE X.uri = parent.uri AND name = 'hasTestString' AND parent.value_string IS NOT NULL ) AS distinct_value 
  FROM `test_organization__app__testsegment` AS parent 
  WHERE     
    uri IN ( SELECT uri 
             FROM `test_organization__app__testsegment` 
             WHERE name = 'types' AND value_uri_multivalue = 'Document'
           )
) AS T 
WHERE distinct_value IS NOT NULL
ORDER BY distinct_value ASC
LIMIT 10000 OFFSET 0

+4

sql amazon-redshift

Dkobylarz 30 sept '15 at 21:25

source share

2 answers

AlexYes · Answer 1 · 2017-06-22T22:00:08+0000

This is not a mistake, and the behavior is deliberate, although not straightforward.

Redshift , Redshift , .. , . , SELECT DISTINCT , , , . , .

? Redshift , , . , , , ETL , .

vihaa_vrutti · Answer 2 · 2018-03-07T06:57:05+0000

, , . , 1, 1, 2, - .

- !!

select distinct table1.col1 from table1 left outer join table2 on table1.col1 = table2.col1

, 1 dublicates

Redshift select selected returns duplicate values

More articles: