I am trying to load data from external tables into SQL DW Internal tables. I have compressed data stores in a BLOB storage, and external tables are pointed to the BLOB storage location.
I have about 24 files about 22 GB in size and am trying to load data from an external table into an internal 300 DWU table with a large service account / resource class account.
My insert in the instruction (which is very simple) works for more than 10 hours. insert into Trxdata.Details_data select * from Trxdata.Stage_External_Table_details_data;
I also tried with the instruction below, which also works for more than 10 hours. CREATE TABLE Trxdata.Details_data12 C (DISTRIBUTION = ROUND_ROBIN) TYPE SELECT * FROM Trxdata.Stage_External_Table_details_data;
I see that both SQL servers are running with ACTIVE status in "sys". "dm_pdw_exec_requests" [I thought it might be a problem with the concurrency slot and it does not have concurrency slots to run, but this is not the case]
and I was hoping that increasing / increasing DWUs could improve performance. but looking at using DWU on portal.azure.com - I'm not sure about the increase in DWU, because the DWU usage chart shows <50DWU in the last 12 hours
DWU usage chart
So, I am trying to understand - how can I find - what is such a long time? How to improve data loading performance?
source
share