# S3

The **S3** data source in Epsilla allows users to connect their Amazon S3 bucket to import documents directly into the knowledge base. This option is ideal for managing large datasets stored in the cloud, such as research papers, financial reports, official documents, or other files that are frequently updated or distributed.

### **Select Knowledge Base Type**

Begin by choosing **S3** as the data source type. Click **Continue** to proceed.

<figure><img src="https://2532879721-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FM0ZX7fId7ifK45ldHWEp%2Fuploads%2Fgit-blob-f3c87f2e92b61b9a9639799ba208e8edb7897eb1%2Fchoose%20s3%20data%20source.png?alt=media" alt="" width="333"><figcaption></figcaption></figure>

### **Knowledge Base Name**

Provide a **Knowledge Base Name**. The name should begin with a letter or an underscore (`_`), and can contain only letters, digits, underscores, and whitespaces. Enter your desired name in the input box, for example, `MyS3Docs`.

<figure><img src="https://2532879721-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FM0ZX7fId7ifK45ldHWEp%2Fuploads%2Fgit-blob-08ded25b837c4e3c3f494441e0f0e1cbbe9d84b1%2Fknowledge%20base%20name.png?alt=media" alt="" width="563"><figcaption></figcaption></figure>

### **S3 Configuration**

* **Access Key**: Enter your AWS Access Key to grant Epsilla permission to access your S3 bucket.
* **Secret Key**: Provide your AWS Secret Key.
* **Bucket**: Enter the name of the S3 bucket where your files are stored, for example, `research-papers`.
* **Prefix**: Specify a prefix if you wish to narrow the data source to a specific folder within the bucket, for example, `standford/`. Leave it empty if using the bucket root.

<figure><img src="https://2532879721-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FM0ZX7fId7ifK45ldHWEp%2Fuploads%2Fgit-blob-9100ad0d38f523d521c0b2f352c4604e6946762a%2Fs3%20config.png?alt=media" alt="" width="563"><figcaption></figcaption></figure>

### **Data Processing**

After entering the necessary information, click **Create** to initiate the knowledge base setup. Epsilla will start processing the data from your S3 bucket, embedding the files into the knowledge base.

<figure><img src="https://2532879721-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FM0ZX7fId7ifK45ldHWEp%2Fuploads%2Fgit-blob-65d6156efeb17e0648798b1061c10e6e0ce1fa36%2Fknowledge%20base%20creation%20progress.png?alt=media" alt="" width="563"><figcaption></figcaption></figure>

As the data is processed, you can view the status of each file in the **Loaded Files** section. Files that are ready for use will be marked as **Ready**, while those still processing will show a status of **Processing data**.

<figure><img src="https://2532879721-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FM0ZX7fId7ifK45ldHWEp%2Fuploads%2Fgit-blob-5bcba2e9b76c4e777ffd129766934121717e5d36%2Fs3%20processing.png?alt=media" alt="" width="563"><figcaption></figcaption></figure>

You can edit the configuration or view details of your S3 data source at any time by navigating to the **Data Sources** tab under your knowledge base.

<figure><img src="https://2532879721-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FM0ZX7fId7ifK45ldHWEp%2Fuploads%2Fgit-blob-694c2766edfa18be1b5fc9bff6fec304b44e4ef3%2Fs3%20edit.png?alt=media" alt="" width="563"><figcaption></figcaption></figure>
