# Website

The **Website** data source in Epsilla allows users to import and manage content directly from webpages, making it ideal for dynamic or continuously updated information. This type of data source is optimal for retrieving data from websites, blogs, and other web-based platforms into the knowledge base.

### **Select Knowledge Base Type**

To begin, choose **Website** as the data source to load content from webpages.

<figure><img src="/files/7Z06NcFTcKeEO0LQVRJH" alt="" width="375"><figcaption></figcaption></figure>

Click **Continue** to proceed.

### **Knowledge Base Name**

Provide a **Knowledge Base Name**. The name should begin with a letter or an underscore (`_`), and can contain only letters, digits, underscores, and whitespaces. Enter your desired name in the input box, such as `MyWebsiteKnowledge`.

<figure><img src="/files/5SPSAbBfCO0qAmfpBLXK" alt="" width="563"><figcaption></figcaption></figure>

### **Add Webpage URLs**

In the **Webpage URLs** section, input the publicly accessible URLs you want to extract data from.

<figure><img src="/files/focCkMIFDXV0xpxW82pg" alt="" width="563"><figcaption></figcaption></figure>

* The URLs must start with `http://` or `https://`.
* You can add a **Single Webpage** manually, **Crawl** a webpage for subpages, or add multiple webpages at once.

For example, you can manually enter:

```arduino
https://epsilla-inc.gitbook.io/epsilladb
```

<figure><img src="/files/YG4CxatNba8QnpvtWBSR" alt="" width="563"><figcaption></figcaption></figure>

To **crawl a webpage**, click the **Crawl webpage** button. In the dialogue that appears:

* Input the base URL, such as `https://epsilla-inc.gitbook.io/`.
* Set the **Max number of pages** to crawl (e.g., 100).

<figure><img src="/files/1iRCWs0XlkLbnSJHvPqI" alt="" width="563"><figcaption></figcaption></figure>

* Click **Search** to locate subpages. A list of subpages will be displayed.

<figure><img src="/files/hEY7wsQANXFDAk41gr8y" alt="" width="563"><figcaption></figcaption></figure>

* Select the pages you want to include in the knowledge base and click **Add**.

<figure><img src="/files/iYzw2t9DAD6ZhiFUKhKp" alt="" width="563"><figcaption></figcaption></figure>

You can also use the **Add multiple webpages** option to input multiple URLs at once, separated by new lines.

<figure><img src="/files/Jh5C0o36fa8OICbTa7Oq" alt="" width="563"><figcaption></figcaption></figure>

### **Data Processing**

Once you've added the desired URLs, click **Create** to begin processing the data.

<figure><img src="/files/NwYZdMVXtHnpu9T03dEB" alt="" width="563"><figcaption></figcaption></figure>

Epsilla will automatically retrieve the content from the pages, chunk it into manageable pieces, and embed it into vectors. You can monitor the progress during this step.

<figure><img src="/files/M9G2FT8HF5rSEaIF7hj1" alt="" width="563"><figcaption></figcaption></figure>

You can inspect the processed data (chunks) at the [**Data Storage**](/epsilladb/knowledge-base/data-storage.md) tab.[<br>](https://epsilla-inc.gitbook.io/epsilladb/knowledge-base)


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://epsilla-inc.gitbook.io/epsilladb/knowledge-base/website.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
