Evaluations
Run models against your data
Evaluations lets you test and compare a selection of AI models against your own datasets.
Whether you're fine-tuning models or measuring performance, Oxen Evaluations lets you quickly run a prompt through every row of a dataset.
Once you're happy with the results, output the resulting dataset to a new file, another branch, or directly as a new commit.
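As a rough illustration of what "running a prompt through an entire dataset" means, the sketch below fills a prompt template from each row's columns and collects one model response per row. It is not Oxen's implementation or API: `run_prompt_over_rows`, `model_fn`, and the `prediction` column name are hypothetical, and the in-memory rows stand in for a real dataset. The `{query}` and `{context}` placeholders match the prompts shown in the runs listed on this page.

```python
from typing import Callable, Dict, List

def run_prompt_over_rows(
    template: str,
    rows: List[Dict[str, str]],
    model_fn: Callable[[str], str],
) -> List[Dict[str, str]]:
    """Fill the template from each row and store the model's reply.

    Hypothetical helper: `model_fn` is any callable that maps a prompt
    string to a response string (for example, a wrapper around an LLM API).
    """
    results = []
    for row in rows:
        # Placeholders such as {query} are replaced by the row's column values.
        # Columns the template does not mention are simply ignored.
        prompt = template.format(**row)
        # Keep the original columns and add the response as a new column.
        results.append({**row, "prediction": model_fn(prompt)})
    return results

# Stand-in for a real model call, so the sketch runs without network access.
def fake_model(prompt: str) -> str:
    return f"(response to a {len(prompt)}-character prompt)"

template = "Question:\n{query}\n\nContext:\n{context}\n\nAnswer:"
rows = [
    {"query": "What is Oxen?", "context": "Oxen is a data versioning tool."},
]
for result in run_prompt_over_rows(template, rows, fake_model):
    print(result["prediction"])
```

Because this sketch relies on `str.format`, a template containing literal braces would need them doubled (`{{` and `}}`).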
efb482c4-89ee-486c-9935-de4415d6e183
OpenAI/GPT-4o Mini (text → text)
ox, 7 months ago
hello

{query}
completed | 5 row sample | 367 tokens | $0.0002 | 2 iterations
143f5615-eadd-4a39-8cd6-902c7af0a110
Google/Text Embedding 004 (text → embeddings)
ox, 8 months ago
query
completed | 200 rows | 0 tokens | $0.0000 | 2 iterations
dd251178-ac03-4a4e-9b3a-cc01ebb4c519
Google/Text Embedding 004 (text → embeddings)
ox, 8 months ago
query
google-text-embedd
completed | 15 rows | 0 tokens | $0.0000 | 2 iterations
d743f4fe-3431-445d-957f-b311c4ee12ad
OpenAI/Text Embeddings Small v3 (text → embeddings)
ox, 8 months ago
query
openai-embeddings
completed | 15 rows | 110 tokens | $0.0000 | 2 iterations
f78bf8ce-2706-4382-9a4f-985fd23fb0df
OpenAI/Text Embeddings Small v3 (text → embeddings)
ox, 8 months ago
query
open-ai-embeddings
completed | 15 rows | 110 tokens | $0.0000 | 1 iteration
035b7860-4842-4060-a5ff-6272c89a5126
OpenAI/Text Embeddings Small v3 (text → embeddings)
ox, 8 months ago
query
error: An exception occurred indexing, getting dataframe and running evaluation: %CaseClauseError{term: {:error, "An exception occurred when getting model response: %Protocol.UndefinedError{protocol: String.Chars, value: %{\"code\" => nil, \"message\" => \"invalid model ID\", \"param\" => nil, \"type\" => \"invalid_request_error\"}, description: \"\"}"}}
Waiting... | 0 tokens | $0.0000 | 1 iteration
bcbcd637-6171-47ec-92f4-d0fe106b7722
Google/Gemini 1.5 Flash (text → text)
ox, 8 months ago
What is the answer to the question given the context? Only reply with text that is contained in the context.

Question:
{query}

Context:
{context}

Answer:
completed | 200 rows | 63321 tokens | $0.0055 | 2 iterations
4d12281d-891a-419b-bc2a-6622d14d0595
OpenAI/GPT-4o Mini (text → text)
ox, 8 months ago
What is the answer to the question given the context? Only reply with text that is contained in the context.

Question:
{query}

Context:
{context}

Answer:
completed | 200 rows | 59541 tokens | $0.0107 | 2 iterations
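The two completed 200-row runs above use the same prompt, so their totals can be compared directly. The short sketch below works out tokens and cost per row from the figures shown on this page. It assumes the displayed token and dollar totals cover exactly the 200 rows listed; the variable and function names are illustrative only.

```python
# Totals copied from the two completed 200-row runs that share the same prompt.
runs = {
    "Google/Gemini 1.5 Flash": {"rows": 200, "tokens": 63321, "cost_usd": 0.0055},
    "OpenAI/GPT-4o Mini": {"rows": 200, "tokens": 59541, "cost_usd": 0.0107},
}

def per_row(run: dict) -> dict:
    """Average tokens and cost per row (assumes totals cover all rows)."""
    return {
        "tokens_per_row": run["tokens"] / run["rows"],
        "cost_per_row_usd": run["cost_usd"] / run["rows"],
    }

summary = {name: per_row(run) for name, run in runs.items()}

for name, stats in summary.items():
    print(
        f"{name}: {stats['tokens_per_row']:.1f} tokens/row, "
        f"${stats['cost_per_row_usd']:.7f}/row"
    )
```

Per-row figures like these only compare cost; they say nothing about which model's answers are better, which still requires inspecting the output column.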
e8add75e-e9c1-4add-ad87-ed5ead22768b
Google/Gemini 1.5 Flash (text → text)
ox, 8 months ago
What is the answer to the question given the context?

Question:
{query}

Context:
{context}

Answer:
started (80%) | Running 4 / 5 row sample | 857 tokens | $0.0001 | 1 iteration