Here lies all the data 🐉
Testing RAG + Gemini
Measuring Mathematical Problem Solving With the MATH Dataset
This is duplicated from prod for testing