Evaluations
Run models against your data
Evaluations let you test and compare a selection of AI models against your datasets.
Whether you're fine-tuning models or evaluating performance metrics, Oxen Evaluations simplifies the process by letting you run a prompt through every row of a dataset.
Once you're happy with the results, output the resulting dataset to a new file, another branch, or directly as a new commit.
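As a rough illustration of what running a prompt through an entire dataset means, the sketch below fills a `{text}` placeholder from each row and collects the model's answer in a new column. This is not the Oxen API: `run_prompt_over_rows`, `fake_model`, and the `prediction` column name are hypothetical and used only for illustration, and the model is a local stub standing in for a real model call.

```python
from typing import Callable, Dict, List


def run_prompt_over_rows(
    template: str,
    rows: List[Dict[str, str]],
    model: Callable[[str], str],
) -> List[Dict[str, str]]:
    """Fill {column} placeholders from each row and store the model output."""
    results = []
    for row in rows:
        prompt = template
        # Substitute every {column} placeholder with that row's value.
        for column, value in row.items():
            prompt = prompt.replace("{" + column + "}", str(value))
        # "prediction" is a hypothetical output column name.
        results.append({**row, "prediction": model(prompt)})
    return results


def fake_model(prompt: str) -> str:
    """Hypothetical stand-in for a real model call."""
    return "yes" if "revenue" in prompt.lower() else "no"


template = 'Is the following relevant to company performance? {text} Answer "yes" or "no".'
rows = [
    {"text": "Quarterly revenue grew 12%."},
    {"text": "The office has a new coffee machine."},
]
results = run_prompt_over_rows(template, rows, fake_model)
for result in results:
    print(result["text"], "->", result["prediction"])
```

The output rows keep the original columns, which mirrors the idea of writing the resulting dataset to a new file, branch, or commit once the answers look right.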
38f412a0-81ab-441a-9621-d06bb4a91375
Meta/Llama 3.1 405B Instruct Turbo (text → text)
eloy
2 months ago
Please evaluate the information provided in {text} and determine whether it is relevant and useful for assessing the company's performance. Respond with "yes" if it is useful and relevant, or "no" if it is not.

Just output "yes" or "no" all lowercaps. do not output anything else other than "yes" or "no".
completed | 5253 rows | 851197 tokens | $2.98 | 1 iteration
fa0e3b07-b7da-4a1a-ba14-a1484ce9b7f8
Meta/Llama 3.1 405B Instruct Turbo (text → text)
eloy
2 months ago
Please evaluate the information provided in {text} and determine whether it is relevant and useful for assessing the company's performance. Respond with "yes" if it is useful and relevant, or "no" if it is not.

Just output "yes" or "no" all lowercaps. do not output anything else other than "yes" or "no".
error: An exception occurred indexing, getting dataframe and running evaluation: %Req.TransportError{reason: :closed} | 120 / 5253 rows | 17844 tokens | $0.0523 | 3 iterations
e1e1a2f5-fdf4-43a1-a898-c151641311df
DeepSeek/Deepseek V3 (text → text)
eloy
2 months ago
Please evaluate the information provided in {text} and determine whether it is relevant and useful for assessing the company's performance. Respond with "yes" if it is useful and relevant, or "no" if it is not.

Just output "yes" or "no". do not output anything else other than "yes" or "no".
error: An exception occurred indexing, getting dataframe and running evaluation: %Req.TransportError{reason: :econnrefused} | 5130 / 5253 rows | 643836 tokens | $0.5795 | 5 iterations
39a738ed-357d-4bed-b6cd-12b64d315723
OpenAI/Text Embeddings Small v3 (text → embeddings)
eloy
2 months ago
text
error: no case clause matching: {:error, "resource_not_found", 0, 0} | 887 rows | 299994 tokens | $0.0060 | 3 iterations
a0e9f060-da93-4468-ae62-295bce207ca0
OpenAI/Text Embeddings Small v3 (text → embeddings)
eloy
2 months ago
text
completed | 887 rows | 295176 tokens | $0.0059 | 2 iterations
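Because each run above reports both a token count and a cost, an effective price per million tokens can be worked out when comparing models. The sketch below does that arithmetic using figures copied from the listing; `usd_per_million_tokens` is a hypothetical helper, and the results are approximate derived rates (the displayed costs are rounded), not published prices.

```python
# (model, tokens, cost in USD) copied from the runs listed above.
runs = [
    ("Meta/Llama 3.1 405B Instruct Turbo", 851197, 2.98),
    ("DeepSeek/Deepseek V3", 643836, 0.5795),
    ("OpenAI/Text Embeddings Small v3", 295176, 0.0059),
]


def usd_per_million_tokens(tokens: int, cost_usd: float) -> float:
    """Effective rate implied by a run's reported tokens and cost."""
    return cost_usd / tokens * 1_000_000


for model, tokens, cost in runs:
    rate = usd_per_million_tokens(tokens, cost)
    print(f"{model}: ~${rate:.2f} per 1M tokens")
```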