I repeated my experiment with the Miss Manners benchmark using OpenAI GPT4. The results were better than ChatGPT, but not as good as Mistral (Large).
Continue reading “Miss Manners with GPT 4”I repeated my experiment with the Miss Manners benchmark using Mistral.ai (Large). The results were impressive!
Continue reading “Miss Manners with Mistral (Large)”“Miss Manners” is organizing a dinner party and needs to devise a seating arrangement for her guests. She has a large circular table and will be inviting 16 guests: 8 males and 8 females. Miss Manners is an aging lady of a bygone era and isn’t aware that gender is not binary. She would like to ensure that guests are not seated next to someone of the same gender, and that guests seated next to each other share at least one hobby.
In this article I will examine how ChatGTP fares with this venerable optimisation (or product rules) benchmark and present conclusions.
Continue reading “Miss Manners with ChatGPT”