Search

Innovation That Matters

Technology trends for the world of tomorrow

Category

rules and process

Miss Manners with Claude Sonnet

I repeated my experiment with the Miss Manners benchmark using Clause Sonnet (Clause 3.5 Pro). The results were the best I’ve seen so far.

Continue reading “Miss Manners with Claude Sonnet”

Miss Manners with Claude Opus

I repeated my experiment with the Miss Manners benchmark using Clause Opus (Clause Pro). The results were better than GPT4, but inferior to Mistral (Large).

Continue reading “Miss Manners with Claude Opus”

Miss Manners LLM Benchmark

Recently I’ve been evaluating the ability of LLMs to perform simple reasoning, using the Miss Manners benchmark. This article ranks the LLMs on this benchmark and summarises the results.

Continue reading “Miss Manners LLM Benchmark”

Miss Manners with Gemini

I repeated my experiment with the Miss Manners benchmark using Google Gemini. The results were inferior to ChatGPT.

Continue reading “Miss Manners with Gemini”

Miss Manners with GPT 4

I repeated my experiment with the Miss Manners benchmark using OpenAI GPT4. The results were better than ChatGPT, but not as good as Mistral (Large).

Continue reading “Miss Manners with GPT 4”

Miss Manners with Mistral (Large)

I repeated my experiment with the Miss Manners benchmark using Mistral.ai (Large). The results were impressive!

Continue reading “Miss Manners with Mistral (Large)”

Miss Manners with ChatGPT

“Miss Manners” is organizing a dinner party and needs to devise a seating arrangement for her guests. She has a large circular table and will be inviting 16 guests: 8 males and 8 females. Miss Manners is an aging lady of a bygone era and isn’t aware that gender is not binary. She would like to ensure that guests are not seated next to someone of the same gender, and that guests seated next to each other share at least one hobby.

In this article I will examine how ChatGTP fares with this venerable optimisation (or product rules) benchmark and present conclusions.

Continue reading “Miss Manners with ChatGPT”

Contract Digitisation Literature Review

Introduction

The conceptual leap from managing contract text and data, to understanding the real-time rights and obligations of the various parties to a contract is a major one! To get even a partial view of the rights and obligations of contractual parties requires creating a computable representation of the logic and workflow inherent/implicit in the contact, as well as tapping into a digital representation of real-world contract events.

Continue reading “Contract Digitisation Literature Review”

Obligations and Obligation Management

Obligations — they obligate an entity to perform an action. In the context of law and contracts a party to a contract has an obligation under the terms of the contract. Violation of the terms of the contract typically incurs penalties.

In this article I sketch out a path from zero contract management, to real-time visibility into your obligations.

Continue reading “Obligations and Obligation Management”

Website Powered by WordPress.com.

Up ↑