> For the complete documentation index, see [llms.txt](https://epsilla-inc.gitbook.io/epsilladb/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://epsilla-inc.gitbook.io/epsilladb/knowledge-base/website.md).

# Website

The **Website** data source in Epsilla allows users to import and manage content directly from webpages, making it ideal for dynamic or continuously updated information. This type of data source is optimal for retrieving data from websites, blogs, and other web-based platforms into the knowledge base.

### **Select Knowledge Base Type**

To begin, choose **Website** as the data source to load content from webpages.

<figure><img src="/files/7Z06NcFTcKeEO0LQVRJH" alt="" width="375"><figcaption></figcaption></figure>

Click **Continue** to proceed.

### **Knowledge Base Name**

Provide a **Knowledge Base Name**. The name should begin with a letter or an underscore (`_`), and can contain only letters, digits, underscores, and whitespaces. Enter your desired name in the input box, such as `MyWebsiteKnowledge`.

<figure><img src="/files/5SPSAbBfCO0qAmfpBLXK" alt="" width="563"><figcaption></figcaption></figure>

### **Add Webpage URLs**

In the **Webpage URLs** section, input the publicly accessible URLs you want to extract data from.

<figure><img src="/files/focCkMIFDXV0xpxW82pg" alt="" width="563"><figcaption></figcaption></figure>

* The URLs must start with `http://` or `https://`.
* You can add a **Single Webpage** manually, **Crawl** a webpage for subpages, or add multiple webpages at once.

For example, you can manually enter:

```arduino
https://epsilla-inc.gitbook.io/epsilladb
```

<figure><img src="/files/YG4CxatNba8QnpvtWBSR" alt="" width="563"><figcaption></figcaption></figure>

To **crawl a webpage**, click the **Crawl webpage** button. In the dialogue that appears:

* Input the base URL, such as `https://epsilla-inc.gitbook.io/`.
* Set the **Max number of pages** to crawl (e.g., 100).

<figure><img src="/files/1iRCWs0XlkLbnSJHvPqI" alt="" width="563"><figcaption></figcaption></figure>

* Click **Search** to locate subpages. A list of subpages will be displayed.

<figure><img src="/files/hEY7wsQANXFDAk41gr8y" alt="" width="563"><figcaption></figcaption></figure>

* Select the pages you want to include in the knowledge base and click **Add**.

<figure><img src="/files/iYzw2t9DAD6ZhiFUKhKp" alt="" width="563"><figcaption></figcaption></figure>

You can also use the **Add multiple webpages** option to input multiple URLs at once, separated by new lines.

<figure><img src="/files/Jh5C0o36fa8OICbTa7Oq" alt="" width="563"><figcaption></figcaption></figure>

### **Data Processing**

Once you've added the desired URLs, click **Create** to begin processing the data.

<figure><img src="/files/NwYZdMVXtHnpu9T03dEB" alt="" width="563"><figcaption></figcaption></figure>

Epsilla will automatically retrieve the content from the pages, chunk it into manageable pieces, and embed it into vectors. You can monitor the progress during this step.

<figure><img src="/files/M9G2FT8HF5rSEaIF7hj1" alt="" width="563"><figcaption></figcaption></figure>

You can inspect the processed data (chunks) at the [**Data Storage**](/epsilladb/knowledge-base/data-storage.md) tab.[<br>](https://epsilla-inc.gitbook.io/epsilladb/knowledge-base)


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://epsilla-inc.gitbook.io/epsilladb/knowledge-base/website.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
