GPT-5.5 scores 93 out of 100 in ZDNET analyst's 10-round evaluation

7 articles · Updated · ZDNet · Apr 24

ZDNET's David Gewirtz tested GPT-5.5 using ChatGPT Plus, finding strong performance in writing, coding, and reasoning, but deducting points for over-enthusiasm and imperfect instruction following.
The model excelled in creative writing, math, and academic explanation, yet lost points for sourcing errors and for providing multiple translations when only one was requested.
GPT-5.5 shows incremental improvements over previous versions and arrives amid faster release cycles and enhanced image generation. Despite minor shortcomings, it is the analyst's new default recommendation.

What new security risks do more capable agentic models like GPT-5.5 introduce?

Is GPT-5.5’s “overeagerness” a simple flaw or a sign of emerging AI autonomy?

With double the API cost, does GPT-5.5 offer enough value for businesses to upgrade?

Will ongoing copyright lawsuits threaten the very existence of models like GPT-5.5?

How can ZDNET’s positive review be trusted while its parent company is suing OpenAI?

If AI just predicts desired answers, can we ever truly achieve genuine AI alignment?