Amazon Redshift
You can quickly import data from your Amazon Redshift Database into Exploratory.
Here is a blog post introducing this support in detail.
Create a connection by following these instructions.
Click the '+' button next to 'Data Frames' and select 'Database Data'.
Select Amazon Redshift.
Click the Preview button to see the data returned from your Redshift database.
If it looks OK, click 'Import' to import the data into Exploratory.
You might want to take a random sample of the data that is a reasonable size for your analysis.
You can use the md5 function to generate a pseudo-random value for each row and filter on it to get a random sample of the data.
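As a rough sketch of the md5-based sampling idea, the Python snippet below builds a query that keeps only rows whose md5 hash starts with a given hex prefix, and mirrors the same test locally with hashlib to show what fraction of rows survives. The table name flights, the column flight_id, and the seed value are hypothetical, and the exact query used in your own setup may differ.

```python
import hashlib


def md5_sample_query(table, key_column, seed, hex_prefix="0"):
    """Build a SELECT that keeps rows whose md5(seed || key) starts with hex_prefix.

    One hex character as the prefix keeps roughly 1/16 of the rows,
    two characters keep roughly 1/256, and so on.
    Names are inserted as-is, so only pass trusted identifiers.
    """
    return (
        f"SELECT * FROM {table} "
        f"WHERE md5('{seed}' || CAST({key_column} AS varchar)) LIKE '{hex_prefix}%'"
    )


def keeps_row(key, seed, hex_prefix="0"):
    """Local mirror of the SQL predicate, for checking the sampling rate."""
    digest = hashlib.md5(f"{seed}{key}".encode("utf-8")).hexdigest()
    return digest.startswith(hex_prefix)


# Hypothetical table and column names, for illustration only.
query = md5_sample_query("flights", "flight_id", "seed1")

# Simulate the filter on 10,000 integer keys; about 1 in 16 should be kept.
sampled = [k for k in range(10000) if keeps_row(k, "seed1")]
```

Because the hash depends only on the seed and the key, the same rows are selected every time the query runs. Changing the seed gives a different sample.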
To use a parameter in your SQL query, first click the parameter link on the SQL Data Import dialog.
Second, define a parameter and click the Save button.
Finally, reference the parameter inside the query by surrounding its name with @{}.
If you type @, the editor suggests the available parameters.
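To make the @{} syntax concrete, here is a small Python illustration of what placeholder substitution amounts to. It is not Exploratory's implementation; the function name fill_parameters and the parameter names carrier and max_rows are hypothetical, and values are assumed to be inserted verbatim without any quoting.

```python
import re


def fill_parameters(query, params):
    """Replace each @{name} placeholder in query with the matching value.

    Values are inserted as plain text (an assumption made for this sketch),
    so any quoting for strings has to be written in the query itself.
    """
    def replace(match):
        name = match.group(1)
        if name not in params:
            raise KeyError(f"parameter not defined: {name}")
        return str(params[name])

    return re.sub(r"@\{(\w+)\}", replace, query)


# Hypothetical query and parameter values, for illustration only.
template = "SELECT * FROM flights WHERE carrier = '@{carrier}' LIMIT @{max_rows}"
filled = fill_parameters(template, {"carrier": "AA", "max_rows": 100})
```

Inserting values as plain text is only safe for values you control yourself.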
Here's a blog post for more detail.
If you encounter a database connection error, go to the AWS console and make sure your client PC's IP address has been added to the inbound rules of the Security Group associated with the Redshift cluster.
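Before changing the Security Group, it can help to confirm whether the cluster endpoint is reachable from your machine at all. The following is a minimal sketch using only the Python standard library. The function name can_reach is hypothetical, the host in the usage comment is a placeholder, and 5439 is assumed as the port because it is the Redshift default; use your cluster's actual endpoint and port.

```python
import socket


def can_reach(host, port=5439, timeout=3.0):
    """Return True if a TCP connection to host:port opens within timeout seconds.

    A False result is consistent with the client IP being blocked by the
    Security Group, but it can also mean a wrong host name or port.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers timeouts, refused connections, and DNS lookup failures.
        return False


# Usage (placeholder endpoint, replace with your own):
# can_reach("your-cluster-endpoint.example.com", 5439)
```

This only tests network reachability. A successful TCP connection does not verify the database name, user, or password.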
For performance reasons, we no longer show the actual number of rows, which can only be obtained by executing the whole query again.
If you still want to see the actual number of rows for your query, you can enable it in System Configuration.
Set "Show Actual Number of Rows on SQL Data Import Dialog" to "Yes".
The dialog will then show the actual number of rows.
Here is the link to the blog post: Exploratory Data Analysis for Amazon Redshift with R & dplyr