Introducing Evaluations, a powerful feature designed to enable you to effortlessly test and compare a selection of AI models against your datasets.
Whether you're fine-tuning models or evaluating performance metrics, Oxen evaluations simplifies the process, allowing you to quickly and easily run prompts through an entire dataset.
Once you're happy with the results, output the resulting dataset to a new file, another branch, or directly as a new commit.
efb482c4-89ee-486c-9935-de4415d6e183

ox
7 months agohello {query}
main
143f5615-eadd-4a39-8cd6-902c7af0a110

ox
8 months agoquery
dd251178-ac03-4a4e-9b3a-cc01ebb4c519

ox
8 months agoquery
main
google-text-embedd
d743f4fe-3431-445d-957f-b311c4ee12ad

ox
8 months agoquery
main
openai-embeddings
f78bf8ce-2706-4382-9a4f-985fd23fb0df

ox
8 months agoquery
main
035b7860-4842-4060-a5ff-6272c89a5126

ox
8 months agoquery
main
error An exception occurred indexing, getting dataframe and running evaluation: %CaseClauseError{term: {:error, "An exception occurred when getting model response: %Protocol.UndefinedError{protocol: String.Chars, value: %{\"code\" => nil, \"message\" => \"invalid model ID\", \"param\" => nil, \"type\" => \"invalid_request_error\"}, description: \"\"}"}} Waiting... 0 tokens$ 0.0000 1 iteration
bcbcd637-6171-47ec-92f4-d0fe106b7722

ox
8 months agoWhat is the answer to the question given the context? Only reply with text that is contained in the context. Question: {query} Context: {context} Answer:
results-gemini-flash
4d12281d-891a-419b-bc2a-6622d14d0595

ox
8 months agoWhat is the answer to the question given the context? Only reply with text that is contained in the context. Question: {query} Context: {context} Answer:
gpt-4o-mini-results
e8add75e-e9c1-4add-ad87-ed5ead22768b

ox
8 months agoWhat is the answer to the question given the context? Question: {query} Context: {context} Answer: