# Create New Evaluation

### **Create a New Evaluation**

Click the **Create Evaluation** button, which will prompt the creation of a new evaluation template.

<figure><img src="https://2532879721-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FM0ZX7fId7ifK45ldHWEp%2Fuploads%2Fgit-blob-014d45744ffe20deac7474c9554db164638a8b70%2FScreenshot%202024-10-13%20at%2012.11.34%20AM.png?alt=media" alt="" width="195"><figcaption></figcaption></figure>

You will be taken to an empty evaluation titled **Untitled Evaluation - {Creation Time}**, where you can start adding details.

<figure><img src="https://2532879721-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FM0ZX7fId7ifK45ldHWEp%2Fuploads%2FjL8obChX7jiV3Qs08wcO%2FScreenshot%202024-10-13%20at%2012.11.48%20AM.png?alt=media&#x26;token=992e510b-4515-48c0-8aa9-08f7d6255273" alt=""><figcaption></figcaption></figure>

Click on the title field, which will initially display the default name, and give your evaluation a meaningful name (e.g., "Financial Agent Evaluation").&#x20;

<figure><img src="https://2532879721-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FM0ZX7fId7ifK45ldHWEp%2Fuploads%2Fgit-blob-c716fef95df327ff4b9ff1f1f815f108b96f88e0%2FScreenshot%202024-10-13%20at%2012.12.30%20AM.png?alt=media" alt="" width="563"><figcaption></figcaption></figure>

Unfocus the name input to apply the changes.

<figure><img src="https://2532879721-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FM0ZX7fId7ifK45ldHWEp%2Fuploads%2Fgit-blob-d7f67972af222bad806d2554d8ccd28eb7e6dc94%2FScreenshot%202024-10-13%20at%2012.12.39%20AM.png?alt=media" alt="" width="563"><figcaption></figcaption></figure>

### **Fill in Test Case Details**

There will be one empty test case created by default. Fill in test case details.

**Question**: Input the query or task you want the AI agent to handle. For example, “What was the cumulative total return for Meta Platforms, Inc. at the end of 2022 compared to its peak value during the five-year period ending December 31, 2023?”

<figure><img src="https://2532879721-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FM0ZX7fId7ifK45ldHWEp%2Fuploads%2Fgit-blob-636fe52c7775835f30554130d915559cf01c2e0a%2FScreenshot%202024-10-13%20at%2012.13.37%20AM.png?alt=media" alt="" width="563"><figcaption></figcaption></figure>

**Expected Answer**: Input the correct answer that the AI agent is expected to provide. For example, “The cumulative total return for Meta Platforms, Inc. at the end of 2022 was 90, which is significantly lower compared to its peak value of 275 at the end of 2023.”

<figure><img src="https://2532879721-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FM0ZX7fId7ifK45ldHWEp%2Fuploads%2Fgit-blob-9cc7fe985c59e61a621fb0c8a7d33ef451dac9f7%2FScreenshot%202024-10-13%20at%2012.13.45%20AM.png?alt=media" alt="" width="563"><figcaption></figcaption></figure>

### **Add Test Cases**

You want to create a set of test cases that can be repeatedly executed during AI agent updates, so that we can be confident our configuration changes don't introduce regression.

Click the **Add Test Case** button to input questions and expected answers for your evaluation. A new row will be added for each test case, with columns for **Question**, **Expected Answer** for you to fill in same as above.

<figure><img src="https://2532879721-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FM0ZX7fId7ifK45ldHWEp%2Fuploads%2Fgit-blob-3af8c19081199d1d77fa315ce72fce2bc63dcd6e%2FScreenshot%202024-10-13%20at%2012.13.58%20AM.png?alt=media" alt="" width="563"><figcaption></figcaption></figure>

Add as many test cases as needed.

<figure><img src="https://2532879721-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FM0ZX7fId7ifK45ldHWEp%2Fuploads%2Fgit-blob-2c3f9f1089b431f1393f604114ce08c1c12fc45e%2FScreenshot%202024-10-13%20at%2012.14.13%20AM.png?alt=media" alt=""><figcaption></figcaption></figure>

### Future Changes

After an evaluation is created, you can always go back and make changes by clicking the card from the list.

<figure><img src="https://2532879721-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FM0ZX7fId7ifK45ldHWEp%2Fuploads%2Fgit-blob-eaf26e01c3145ceff497155accd4f3a2ddc352c4%2FScreenshot%202024-10-13%20at%2012.22.29%20AM.png?alt=media" alt=""><figcaption></figcaption></figure>

You can also delete an evaluation from the actions dropdown menu.

<figure><img src="https://2532879721-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FM0ZX7fId7ifK45ldHWEp%2Fuploads%2Fgit-blob-feb10470b54ea77ac9305b2109dda8ccd41e6808%2FScreenshot%202024-10-13%20at%2012.23.51%20AM.png?alt=media" alt="" width="375"><figcaption></figcaption></figure>
