I have a problem with a large query from ten related tables. I am transferring data from the fact table (f1) to the star chart. I start by filling out the dimension tables from f1, and then populating the new fact table (f2) with a connection to the dimension tables to get the corresponding identifiers.
Unfortunately, I get the error, "the internal partition does not fit into memory." From the log I see:
2012-10-18 16:20:31.607 Init Session:0x2aac6c02b250 [EE] <INFO> ENABLE_JOIN_SPILL may allow this query to run, with reduced performance 2012-10-18 16:20:31.607 Init Session:0x2aac6c02b250 [EE] <INFO> Query Retry action: Setting add_vertica_options('EE','ENABLE_JOIN_SPILL');
but this will not work either since I get:
2012-10-18 16:23:31.138 Init Session:0x2aac6c02b250 [EE] <INFO> Join ((public.owa_search_term_dim x public.page_impressions_with_session) using owa_search_term_dim_projection_node0001 and previous join (PATH ID: 7)) inner partition did not fit in memory; value 2012-10-18 16:23:31.138 Init Session:0x2aac6c02b250 [EE] <INFO> Query Retry action: Swapping join order with override: 1|7|0
This has been going on for some time, while Vertica seems to be trying to find a way to make the connection, but is eventually freed from errors by stating that the connection does not fit into memory.
Are there any tips on how to minimize the amount of memory needed to make connections, or why the drive does not work? I can handle the performance hit, I just need to fulfill the request.
source share