Evaluations
Run models against your data
Introducing Evaluations, a powerful feature designed to enable you to effortlessly test and compare a selection of AI models against your datasets.
Whether you're fine-tuning models or evaluating performance metrics, Oxen evaluations simplifies the process, allowing you to quickly and easily run prompts through an entire dataset.
Once you're happy with the results, output the resulting dataset to a new file, another branch, or directly as a new commit.
80c46f2f-f3b9-4695-a745-98f62252df98
OpenAIOpenAI/GPT-4otexttext
Ox
ox
9 months ago
classify if the text is spam or ham depending on if it is SMS spam. Reply with one word all lower case.

{input}
completed 10 row sample502 tokens 2 iterations
03bed1d0-b0e3-4c62-957f-b5ca0c7c89fc
OpenAIOpenAI/GPT-4otexttext
Ox
ox
9 months ago
Classify if the text is spam or ham depending on if it is SMS spam.

{input}

Respond with a single word "spam" or "ham".
completed 10 row sample545 tokens 1 iteration
b0609e3d-a860-42d0-b6b5-2e29c4be86a5
OpenAIOpenAI/GPT-4otexttext
Ox
ox
9 months ago
Classify if the text is spam or ham depending on if it is SMS spam

{input}
completed 10 row sample920 tokens 1 iteration