**Prerequisites**

* An AWS account
    
* The AWS CLI set up on your local machine
    
* A Redshift cluster
    
* An S3 bucket
    

Follow the link below to create a Redshift cluster:

[Set up Redshift](https://hashnode.com/post/cld2xvwzf00po8anv87cn8ho1)

---

**Set up data in an S3 bucket**

Download the data file to your local system from this [link](https://github.com/hardikpatel29/redshifts-data-demo).

Now copy it to the S3 bucket:

```bash
aws s3 cp part-00000 s3://workshopdemo171222/
```

![](https://cdn.hashnode.com/res/hashnode/image/upload/v1674201617987/2db5f795-adc7-40d6-9777-dc50b445e50b.png align="center")
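
To confirm the upload from the terminal, a minimal sketch like the following could be used. The helper name `verify_upload` is hypothetical; it relies only on `aws s3 ls`, which matches by prefix, so it may also succeed for keys that merely start with the given name.

```bash
# Hypothetical helper: succeed only if something is listed at s3://<bucket>/<key>.
# Usage: verify_upload <bucket> <key>
verify_upload() {
  # `aws s3 ls` exits non-zero when nothing matches the given S3 URI.
  if aws s3 ls "s3://$1/$2" > /dev/null; then
    echo "found s3://$1/$2"
  else
    echo "missing s3://$1/$2" >&2
    return 1
  fi
}

# Example call, using the bucket from the command above:
# verify_upload workshopdemo171222 part-00000
```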

---

**Create a database and table in Redshift**

Connect to the query editor in the Redshift console.

Create the database:

```pgsql
create database mydb
```

![](https://cdn.hashnode.com/res/hashnode/image/upload/v1674203083393/caf47172-6abe-4e32-9fa5-8cc92decce2d.png align="center")

Now connect to the newly created database in the query editor. To do that, click on change connection:

![](https://cdn.hashnode.com/res/hashnode/image/upload/v1674203128926/021d542c-4044-4377-b11a-83150d6b80c3.png align="center")

After changing the database connection in the query editor, create the table:

```pgsql
create table myorders (
    order_id INT PRIMARY KEY,
    order_date DATETIME,
    order_customer_id INT,
    order_status VARCHAR(30)
)
```

![](https://cdn.hashnode.com/res/hashnode/image/upload/v1674203676623/db630925-ae97-441c-a2b2-09626af87a7e.png align="center")
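
The `myorders` table has four columns, so each line of the data file is expected to have four comma-separated fields. As a rough check before loading, assuming the file contains no quoted commas, something like this could flag malformed lines. `check_columns` is a hypothetical helper, not part of any AWS tool.

```bash
# Hypothetical helper: report lines whose field count differs from the expected one.
# Usage: check_columns <file> <expected-field-count>
check_columns() {
  awk -F',' -v want="$2" '
    # Print every line whose number of comma-separated fields is unexpected.
    NF != want { printf "line %d has %d fields\n", NR, NF; bad = 1 }
    # Exit with status 1 if at least one bad line was seen, 0 otherwise.
    END { exit bad }
  ' "$1"
}

# Example call for the downloaded file and the four-column table:
# check_columns part-00000 4
```

A naive comma split is used here for brevity; a file with quoted fields containing commas would need a real CSV parser.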

---

**Create an IAM user to copy data from S3**

Create a new IAM user with programmatic access only and attach the S3 full access policy to it.
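
The same user could also be created from the AWS CLI rather than the console. The sketch below assumes your CLI credentials are allowed to manage IAM; the function name `create_copy_user` and the user name in the usage line are hypothetical.

```bash
# Hypothetical helper: create an IAM user whose keys can be used by COPY.
# Usage: create_copy_user <user-name>
create_copy_user() {
  # Create the user; no console password is set, so access is programmatic only.
  aws iam create-user --user-name "$1" || return 1
  # Attach the AWS managed S3 full access policy.
  aws iam attach-user-policy --user-name "$1" \
    --policy-arn "arn:aws:iam::aws:policy/AmazonS3FullAccess" || return 1
  # Generate the access key pair; the secret key is only shown in this output.
  aws iam create-access-key --user-name "$1"
}

# Example call (the user name is a placeholder):
# create_copy_user redshift-copy-user
```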

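After creating the user, go to the query editor and run the following query, replacing the placeholders with your bucket name and the user's access keys: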

```pgsql
COPY myorders from 's3://xxxx/part-00000'
CREDENTIALS 'aws_access_key_id=xxxxx;aws_secret_access_key=xxxxx'
CSV;
```

Now let's verify the data in the table:

```pgsql
select * from myorders limit 10
```

---

The output looks like this:

![](https://cdn.hashnode.com/res/hashnode/image/upload/v1674204513884/18b2edd7-dd9f-4924-8818-e9c0e168f2e1.png align="center")
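
As an alternative to the query editor, a verification query could also be submitted from the terminal through the Redshift Data API. This is a sketch under assumptions: `submit_sql` is a hypothetical helper, and the cluster identifier and database user in the usage lines are placeholders for your own values. The call is asynchronous and only prints a statement ID.

```bash
# Hypothetical helper: submit a SQL statement and print its statement ID.
# Usage: submit_sql <cluster-identifier> <database> <db-user> <sql>
submit_sql() {
  aws redshift-data execute-statement \
    --cluster-identifier "$1" \
    --database "$2" \
    --db-user "$3" \
    --sql "$4" \
    --query Id --output text
}

# Example calls (cluster and user names are placeholders):
# id="$(submit_sql my-cluster mydb awsuser "select count(*) from myorders")"
# aws redshift-data get-statement-result --id "$id"
```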

We have now walked through how to copy data from an S3 bucket to a Redshift cluster.

For details on how the COPY command works, see this [link](https://docs.aws.amazon.com/redshift/latest/dg/r_COPY.html).

HardikPatel

How to copy data from s3 to AWS Redshift table?