Many suspect that LLMs are 'cheating' on evaluations; to what extent is that accurate?
2-day hackathon project, awarded first place by peer review in the Evaluations Apart Hackathon in Nov '23.