Over 300 managers and executives participated in a HBR study, revealing that those who consulted ChatGPT made less accurate stock predictions and exhibited greater optimism and overconfidence compared to those who engaged in peer discussions.
The experiment designThe experiment commenced with participants being shown a recent stock price chart for Nvidia (NVDA). Nvidia was selected due to its significant share price increase, driven by its integral role in powering AI technologies. Each participant was initially asked to make an individual, private forecast regarding Nvidia’s projected stock price one month into the future.
Following their initial forecasts, participants were randomly divided into two distinct groups:
After their respective consultation periods, all participants submitted a revised forecast for Nvidia’s stock price one month ahead.
Key findings: Optimism, inaccuracy, and overconfidenceThe study’s findings indicated that AI consultation led to more optimistic forecasts. While both groups had similar baseline expectations, the ChatGPT group elevated their one-month price estimates by approximately $5.11 on average after consultation.
In contrast, peer discussions resulted in more conservative forecasts, with the group lowering their price estimates by approximately $2.20 on average.
A critical finding was that AI consultation negatively impacted prediction accuracy. After a one-month waiting period, the analysis revealed:
AI consultation also contributed to increased overconfidence. Consulting ChatGPT significantly heightened participants’ propensity to offer pinpoint predictions (forecasts with decimals), an indicator of overconfidence. Conversely, the peer discussion group became less likely to use pinpoint predictions, indicating a decrease in overconfidence.
Why the disparity? Five key factorsThe disparity in outcomes can be attributed to five key factors:
The study’s findings offer important guidance for integrating AI tools into decision-making:
The study acknowledges its limitations, including its controlled setting, focus on a single stock (Nvidia), and the use of a ChatGPT model without real-time market data. These factors suggest that results might vary in different contexts or with different AI tools.