Evaluations
Run models against your data
Evaluations let you test and compare a selection of AI models against your datasets.
Whether you're fine-tuning models or evaluating performance metrics, Oxen Evaluations simplifies the process: you write a prompt once, and it is run against every row of a dataset.
Once you're happy with the results, output the resulting dataset to a new file, another branch, or directly as a new commit.
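As a rough illustration of what running a prompt through an entire dataset means, the sketch below fills a prompt template from each row and collects one response per row. It is plain Python, not the Oxen API: `run_evaluation`, `echo_model`, and the sample rows are hypothetical names and data made up for this example, and the stub stands in for a real model call.

```python
from typing import Callable


def run_evaluation(
    template: str,
    rows: list[dict[str, str]],
    model: Callable[[str], str],
) -> list[dict[str, str]]:
    """Fill the template from each row, call the model, and keep the result.

    Hypothetical helper for illustration only; not part of any Oxen library.
    """
    results = []
    for row in rows:
        # Column names in the row replace the {placeholders} in the template.
        prompt = template.format(**row)
        results.append({**row, "prompt": prompt, "response": model(prompt)})
    return results


def echo_model(prompt: str) -> str:
    """Stand-in for a real model call (assumption: swap in your own client)."""
    return f"(stub answer for a {len(prompt)}-character prompt)"


# Made-up rows with a single `question` column.
rows = [
    {"question": "What is 2 + 2?"},
    {"question": "Name the largest planet."},
]

results = run_evaluation("Answer the question concisely\n\n{question}", rows, echo_model)
```

Each entry in `results` keeps the original columns and adds the rendered prompt and the response, which mirrors the idea of writing the output as a new dataset.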
b182b07a-2979-49b4-96a8-68e1f5701657
OpenAI · OpenAI/GPT-4o · text · text
Ox
ox
9 months ago
Answer the question concisely

{question}
completed · 285 rows · 34739 tokens · 2 iterations
4f3c5c51-711e-4141-893e-ccef076844ba
OpenAI · OpenAI/GPT-4o · text · text
Ox
ox
9 months ago
You are a studied professor. Answer the question from {subject} and pick from the following choices.

Choices:
{choices}

Answer as concisely as possible. Do not state anything beyond the answer.

Question:
{question}
completed · 285 rows · 45673 tokens · 3 iterations
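A prompt like the one above pulls from several columns at once (`{subject}`, `{choices}`, `{question}`), so it can help to check that a dataset has every column the template names before running it. The sketch below does this with Python's standard `string.Formatter`; the helper names `template_columns` and `missing_columns` and the sample row are hypothetical, and this is not a description of how Oxen itself validates prompts.

```python
from string import Formatter

# The multi-column prompt, written as a Python string for the example.
TEMPLATE = (
    "You are a studied professor. Answer the question from {subject} "
    "and pick from the following choices.\n\n"
    "Choices:\n{choices}\n\n"
    "Answer as concisely as possible. Do not state anything beyond the answer.\n\n"
    "Question:\n{question}"
)


def template_columns(template: str) -> list[str]:
    """Return the placeholder names in the template, in order of appearance."""
    return [name for _, name, _, _ in Formatter().parse(template) if name]


def missing_columns(template: str, row: dict[str, str]) -> list[str]:
    """Return the placeholders that have no matching column in the row."""
    return [name for name in template_columns(template) if name not in row]


columns = template_columns(TEMPLATE)

# Made-up row that lacks the `choices` column.
incomplete_row = {"subject": "astronomy", "question": "Which planet is largest?"}
missing = missing_columns(TEMPLATE, incomplete_row)
```

Here `columns` lists the three placeholders and `missing` names the absent `choices` column, so the mismatch surfaces before any tokens are spent.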