Evaluations/Find the Tulips
main
flowers.csv
texttext
OpenAIOpenAI/GPT-4o Mini
OpenAI OpenAI
prediction
response only with true or false
if the {flower_type} includes tulip then true else false
Jul 16, 2025, 10:44 PM UTC
Jul 16, 2025, 10:44 PM UTC
51 rows
1333 tokens$ 0.0002
51 rows processed, 1333 tokens used ($0.0002)
completed
5 columns, 1-100 of 101 rows