Introduction
The rise of AI chatbots since the debut of the first successful model in 2022 has reshaped how we write, code, learn, and even travel. With dozens of free options on the market, choosing the right assistant can be overwhelming. This article summarizes a hands‑on evaluation of eight popular free chatbots, ranking them on a 120‑point scale that combines text (100 points) and image (20 points) performance.
Testing Methodology
Each chatbot was given ten real‑world text prompts and four image prompts. The text prompts covered:
- Summarization & web access
- Academic concept explanation for a five‑year‑old
- Math sequencing and analysis
- Cultural discussion with context
- Literary analysis
- Travel itinerary creation
- Emotional‑support interview coaching
- Translation with cultural relevance
- Coding challenge
- Long‑form story (minimum 1,500 words)
The image prompts asked the models to generate:
- A flying aircraft carrier
- A giant robot
- A young baseball player in a medieval court
- An homage to “Back to the Future”
Scores were awarded based on accuracy, completeness, creativity, and adherence to the prompt specifications.
Overall Scores
All chatbots showed noticeable improvement over previous evaluations. The top‑scoring models combined strong text answers with decent image generation:
- ChatGPT – 108/120
- Gemini – 104/120
- Perplexity – 96/120
- Copilot – 92/120
- Grok – 88/120
- Claude – 84/120
Top Performers
ChatGPT excelled in every text category, especially explaining concepts to children, math sequencing, and cultural discussions. Its image output captured key visual elements (e.g., the DeLorean and skateboard in the Back to the Future prompt) despite occasional generic characters.
Gemini was the fastest image generator, completing all four images in under ten seconds, and produced high‑quality visuals using the new Nano Banana model. Its text answers were solid, though the travel itinerary table was hard to read.
Notable Strengths & Weaknesses
Key takeaways for each model:
- ChatGPT – strongest overall, reliable on both text and images, free version rivals paid tiers.
- Gemini – best image speed and quality, but occasional formatting issues in tables.
- Copilot – offers a $20‑per‑month Pro plan with deeper Microsoft 365 integration; free version slightly trails in coding accuracy.
- Perplexity – transparent source citations and good translation, limited to three free images after sign‑in.
- Grok – unique for reporting word counts on long‑form stories; tends to over‑fit prompts.
- Claude – impressive long‑form storytelling (2,344 words) but weaker on web search and coding.
Premium Options
While the free tiers are surprisingly capable, some users may benefit from paid plans:
- Microsoft Copilot Pro – $20/month for enhanced AI features inside Office apps.
- Copilot Developer Plan – $10/month for API access and custom tooling.
- Gemini Pro 2.5 – subscription adds priority access to the latest image models.
Conclusion
For most everyday tasks—summarizing articles, solving math problems, generating code snippets, or creating quick images—ChatGPT remains the most well‑rounded free chatbot. Gemini shines when speed and visual fidelity are paramount, while Perplexity and Claude offer niche strengths in source transparency and long‑form storytelling. Which free AI chatbot impressed you the most? Share your experiences in the comments below.