Pages List
List view
All charts and visualisations are created using Lava Metrics.
Early access to our Beta? 👉 Sign up here
Marketing AI Performance Leaderboard - June 2025 Results
Research Online Results
Research Online Scores by LLM
What are the overall results?
🏆 Research Online Winner: Gemini: 2.5-Flash-Preview
❌ Research Online Loser: Qwen: qwen-max
Individual Test Winners and Losers
Test Ranking (Best → Worst)
Ranking reflects the average performance per 'research online' test of all LLMs ordered highest to lowest.
- Buyer Person Developement
- Industry Overview Report
- Content Gap Analysis
- Competitor Analysis
- Market Opportunities and Threats
FAQs
What does this Leaderboard represent?
We have designed tests that simulate a marketer’s interaction with native platform UIs (e.g., ChatGPT, Gemini) across several marketing domains:
- Copywriting: Generating ad copy, email subject lines, and social media posts.
- Internal Data Analysis: Interpreting sample CRM data to identify trends and insights.
- Strategic Planning: Creating marketing plans based on given scenarios.
- Online Research: Gathering information from the web to support marketing decisions.
How were the tests scored?
Each test output is evaluated by specialised AI “judges.”
- Judges are themselves AI agents configured with specific evaluation criteria.
- They parse the Test Answer, compare it against expected outcomes or benchmarks, and score on multiple dimensions (e.g., factual correctness, tone, format).
- Final scores are normalized and aggregated to produce a single value per test.