Confused by people throwing words like ontology, taxonomy and data model around? In this post I explain the differences and give some practical advice on which to adopt.
Continue reading “Ontology vs Taxonomy vs Data Model”
I repeated my experiment with the Miss Manners benchmark using Claude Sonnet (Claude 3.5 Pro). The results were the best I’ve seen so far.
Continue reading “Miss Manners with Claude Sonnet”
Over the past few weeks I’ve been researching and building a framework that combines the power of Large Language Models for text parsing and transformation with the precision of structured data queries over Knowledge Graphs for explainable data retrieval.
In this fourth article of the series (one, two, three) I will show a generic web interface that helps explain how the LLM is using tools and graph queries to answer a wide variety of structured and unstructured questions.
Continue reading “Knowledge Graphs: RAG is NOT all you need”
Over the past few weeks I’ve been researching and building a framework that combines the power of Large Language Models for text parsing and transformation with the precision of structured data queries over Knowledge Graphs for explainable data retrieval.
In this third article of the series (one, two) I will show you how to combine structured and unstructured semantic queries, and use LLMs to orchestrate question answering over a knowledge graph.
Continue reading “Knowledge Graphs: Question Answering”
I repeated my experiment with the Miss Manners benchmark using Claude Opus (Claude Pro). The results were better than GPT4, but inferior to Mistral (Large).
Continue reading “Miss Manners with Claude Opus”
Recently I’ve been evaluating the ability of LLMs to perform simple reasoning, using the Miss Manners benchmark. This article ranks the LLMs on this benchmark and summarises the results.
Continue reading “Miss Manners LLM Benchmark”
I repeated my experiment with the Miss Manners benchmark using Google Gemini. The results were inferior to ChatGPT.
Continue reading “Miss Manners with Gemini”
I repeated my experiment with the Miss Manners benchmark using OpenAI GPT4. The results were better than ChatGPT, but not as good as Mistral (Large).
Continue reading “Miss Manners with GPT 4”
I repeated my experiment with the Miss Manners benchmark using Mistral.ai (Large). The results were impressive!
Continue reading “Miss Manners with Mistral (Large)”