Skip to main content

That moment when I told ChatGPT it needed a history lesson, and it agreed with me

Chat GPT is awfully good at most things, but occasionally it’s just awful

Sam Altman wearing ChatGPT armor
Microsoft CoPilot / Microsoft

I had an experience this week which forcefully reminded me that ChatGPT and Google’s Gemini were great but not perfect. And to be clear, I have jumped into the AI pool with both feet and am enthusiastic about the long-term prospects. However, I believe that we need to tap the brakes on the irrational exuberance and belief that AI can do everything, everywhere all at once.

The specific project that broke ChatGPT’s back was obscure on the one hand but should not really have been that tough. My daughter is finishing her doctoral dissertation and was trying to generate a map that compared the borders of the Byzantine Empire in the years 379 AD versus 457 AD.

Recommended Videos

Here is the prompt that I used against deep research:

Create a detailed map that overlays the borders of the Byzantine empire in 379AD at the start of the reign of Theodosius the Great versus the borders in 457AD at the end of the reign of Marcian. I need both borders shown clearly on a single map.

Use a historical map style and highlight major cities.

The Deep Research option is powerful but often time-consuming. As it runs, I enjoy watching the play-by-play in the details window. ChatGPT did an excellent job of generating a text analysis of the changing borders, major cities, and historical events.

The wheels fell off the bus when I asked ChatGPT to turn its text analysis into an easy-to-read map.

Without digging too deeply into the minutiae of the fifth century world, the point is that it made up names, misspelled names and placed cities at random. Notice that Rome appears twice on the Italian peninsula. What is particularly frustrating about this effort is that the names and locations were correct in the text.

I tried patiently asking for spelling corrections and proper placements of well-known cities without success. Finally, I told ChatGPT that its results were garbage and threw up my hands. To its credit, ChatGPT took the criticism in stride. It replied “ Thank you for your candor. You are right to expect better “. Unfortunately, things did not get better.

After a few minutes of cursing out that platform I decided to give Google Gemini a shot at the identical query. Shockingly its results were even worse. If you look at the image below, you will see “Rome” in the middle of the Iberian Peninsula. Antioch appears three or four times across Europe, but many of the other names are right out of fantasy novels.

I was complaining about this mapping chaos to a friend. He shared a similar story. He entered a photo from a small offsite meeting into ChatGPT. He asked it to add the words “Mahalo from Hawaii 2025” above a photo of a group of colleagues. Instead of just adding the text, the engine totally changed the image. It made people skinnier; it changed men into women and an Asian into a Caucasian. Another friend told me that an AI generated biography of him talked about his twin children which he does not have. It even provided a link to a non-existent source. Yikes.

Ronald Reagan used to say: Trust but verify.

My point is not to suggest that we run away from AI and cancel all our subscriptions. Rather, it is to remind everyone (me included) that we cannot hand the keys to the AI engines and walk away. They are tools that can assist us but, in the end, we need to look at the output, see if it looks and smells right, and decide whether to accept it or not. It is clear that the performance of AI engines is uneven; excellent at some projects and terrible at others–such as mapping.

We will probably see the rise of the machines someday–but today is not the day.

Peter Horan
Peter has published a number of technology magazines and sites over the years. His current passion is around AI.
Key ChatGPT and Gemini features compared. Who did it better?
Opera Mini Aria AI chatbot vs ChatGPT and Google Gemini running on Android phones resting on a blue fabric sofa.

The AI industry has blossomed quickly in recent years, and several companies have been in steep competition with one another. Two brands that have especially been neck and neck are OpenAI and Google. These two companies have many services in common within the AI game. Notably, OpenAI has its ChatGPT chatbot and Google has its Gemini tool as flagship features; however, each brand has since launched additional AI services under their respective umbrellas.

Here’s a rundown of the functions and features that ChatGPT and Gemini have in common, and which are ideal to use.

Read more
Tired of monthly payments? ChatGPT could soon offer a lifetime subscription
ChatGPT giving a response about its knowledge cutoff.

ChatGPT usage is more prevalent than ever, and its current model offers a monthly subscription of $20 for ChatGPT Plus or the mind-boggling steep $200 per month for ChatGPT Pro. Beyond that, there are no other options, but an APK teardown suggests the service might soon offer both annual and lifetime subscription plans.

@M1Astra, a user on X, found code strings in the latest build that point to multiple new subscription tiers. There's no clear price on these, but the second-to-last string suggests a discount will be offered for annual subscribers. There's also a line for a possible weekly subscription that would allow users to use more advanced features as needed, rather than paying one month at a time.

Read more
ChatGPT models explained: How to use each, according to OpenAI
ChatGPT models list.

Although the entire AI boom was triggered by just one ChatGPT model, a lot has changed since 2022. New models have been released, old models have been replaced, updates roll out and roll back again when they go wrong -- the world of LLMs is pretty busy. At the moment, we have six OpenAI LLMs to choose from and, as both users and Sam Altman are aware, their names are completely useless.

Most people have probably just been using the newest model they can get their hands on, but it turns out that each of the six current models is good at different things -- and OpenAI has finally decided to tell us which model to use for which tasks.

Read more