Evaluations/show me the roses
main
flowers.csv
texttext
OpenAIOpenAI/GPT-4o Mini
OpenAI OpenAI
prediction
question: does {flower_type} include the word 'rose', ONLY awser true or false, in lowercase with no additional text
answer:
Dec 17, 2024, 11:08 PM UTC
Dec 17, 2024, 11:08 PM UTC
00:00:08
20 row sample
705 tokens$ 0.0001
20 rows processed, 705 tokens used ($0.0001)
Estimated cost for all 101 rows: $0.0006
Sample Results completed
5 columns, 1-20 of 101 rows
image
flower_type
images/tulips/4418204816_018375acd0_m.jpg
tulips
images/roses/11694025703_9a906fedc1_n.jpg
roses
images/dandelion/9200211647_be34ce978b.jpg
dandelion
images/tulips/16283125269_4cfae953f1.jpg
tulips
images/dandelion/5598014250_684c28bd5c_n.jpg
dandelion
images/sunflowers/20753711039_0b11d24b50_n.jpg
sunflowers
images/tulips/8757486380_90952c5377.jpg
tulips
images/roses/3208417632_19138d8e35_n.jpg
roses
images/sunflowers/4745980581_a0b7585258_n.jpg
sunflowers
images/dandelion/4633514720_22e82c5f7c_m.jpg
dandelion
images/sunflowers/15054865768_2cc87ac9d4_m.jpg
sunflowers
images/dandelion/2503875867_2075a9225d_m.jpg
dandelion
images/tulips/4521037085_70d5802e1d_m.jpg
tulips
images/dandelion/2462476884_58c617b26a.jpg
dandelion
images/tulips/8713387500_6a9138b41b_n.jpg
tulips
images/daisy/3711723108_65247a3170.jpg
daisy
images/dandelion/18482768066_677292a64e.jpg
dandelion
images/tulips/8572847041_d0cc07861f_n.jpg
tulips
images/dandelion/3562861685_8b8d747b4d.jpg
dandelion
images/sunflowers/6061175433_95fdb12f32_n.jpg
sunflowers