🧠 LLM Evaluation Dashboard

Comprehensive Intelligence & Performance Analysis

System Overview

Model Performance Comparison

Intelligence Metrics Analysis

Advanced metrics that evaluate different dimensions of model intelligence and reasoning capability.

Performance by Category

Toggle models to show or hide them on the spider web (radar) chart:

Detailed Test Results

Select a model to view detailed results