Bad Data: Why AI Products Fail
- When Salesforce recently deployed an AI agent on its website, the initial results were concerning: the agent began to "hallucinate" information and provide inconsistent responses.
- However, the issue wasn't with the AI itself, but rather with the quality of the data it was trained on.
- Ahuja explained that Salesforce had published multiple "knowledge articles" on its public website that contained conflicting information.
“`html
When Salesforce recently deployed an AI agent on its website, the initial results were concerning: the agent began to “hallucinate” information and provide inconsistent responses. This led to a temporary shutdown of the feature.
However, the issue wasn’t with the AI itself, but rather with the quality of the data it was trained on. Shibani Ahuja,Senior Vice President of enterprise IT Strategy at Salesforce,revealed during a roundtable at fortune‘s Brainstorm Tech conference in Park City,Utah,that the agent exposed contradictory information within Salesforce’s own knowledge base.
The Root Cause: Contradictory Knowledge Articles
Ahuja explained that Salesforce had published multiple “knowledge articles” on its public website that contained conflicting information. The AI agent, attempting to synthesize answers from this data, understandably produced unreliable results. “It wasn’t actually the agent. It was the agent that helped us identify a problem that always existed,” Ahuja stated.
Instead of abandoning the AI agent,Salesforce repurposed it. The agent was transformed into an “auditor agent” tasked with identifying inconsistencies and anomalies across the company’s public-facing content. Once the underlying data was cleaned and standardized,the AI agent was redeployed and functioned as intended.
Data Quality is Paramount for AI Success
This incident underscores a critical lesson for organizations implementing AI: the performance of AI models is directly tied to the quality of the data they are trained on. As Ahuja and other speakers at the conference emphasized, even the most refined AI algorithms are only as good as the information they receive.
poor data quality can manifest in several ways, including:
- Inconsistencies: Conflicting information across different data sources.
- Inaccuracies: Incorrect or outdated data.
- Completeness Issues: Missing data points.
- Duplication: Redundant data entries.
Addressing these issues requires a proactive data governance strategy, including regular data audits, standardization processes, and robust data validation procedures. Investing in data quality is no longer optional; it’s a prerequisite for successful AI implementation.
Implications for AI Deployment
The Salesforce experience highlights the importance of a phased approach to AI deployment. Rather than immediately launching AI-powered features to a broad audience, organizations should consider a controlled rollout with thorough monitoring and evaluation.This allows for the early detection of data quality issues and minimizes the risk of negative user experiences.
Furthermore, organizations should view AI not just as a tool for automation, but also as a tool for data quality improvement. AI agents can be leveraged to identify and flag data inconsistencies,helping to maintain a clean and reliable data foundation. This proactive approach can save significant time and resources in the long run.
