Declining Chatbot Performance: Data Challenges Threaten the Future of Generative AI

2023-7-23 19:15

Modern chatbots are constantly learning, and their behavior always changes. But their performance can decline as well as improve.

Recent studies undermine the assumption that learning always means improving. This has implications for the future of ChatGPT and its peers. To ensure chatbots remain functional, Artificial Intelligence (AI) developers must address emerging data challenges.

ChatGPT Getting Dumber Over Time

A recently published study demonstrated that chatbots can become less capable of performing certain tasks over time.

To come to this conclusion, researchers compared outputs from the Large Language Models (LLM) GPT-3.5 and GPT-4 in March and June 2023. In just three months, they observed significant changes in the models that underpin ChatGPT.

For example, in March, GPT-4 was able to identify prime numbers with 97.6% accuracy. By June, its accuracy had plummeted to just 2.4%.

GPT-4 (Left) and GPT-3.5 (Right) Responses to the Same Question in March and June (Source: arXiv)

The experiment also assessed the rate at which the models were able to answer sensitive questions, how well they could generate code and their capacity for visual reasoning. Among all the skills they tested, the team observed instances of AI output quality deteriorating over time.

Read More: 9 Best AI Crypto Trading Bots to Maximize Your Profits

The Challenge of Live Training Data 

Machine Learning (ML) relies on a training process whereby AI models can emulate human intelligence by processing vast amounts of information. 

For instance, the LLMs that power modern chatbots were developed thanks to the availability of massive online repositories. These include datasets compiled from Wikipedia articles, allowing chatbots to learn by digesting the largest body of human knowledge ever created.

But now, the likes of ChatGPT have been released in the wild. And developers have far less control over their ever-changing training data.

The problem is that such models can also “learn” to give incorrect answers. If the quality of their training data deteriorates, their outputs do too. This poses a challenge for dynamic chatbots that are being fed a steady diet of web-scraped content.

Data Poisoning Could Lead to Chatbot Performance Declining

Because they tend to rely on content scraped from the web, chatbots are especially prone to a type of manipulation known as data poisoning. 

This is exactly what happened to Microsoft’s Twitter bot Tay in 2016. Less than 24 hours after its launch, the predecessor to ChatGPT started to post inflammatory and offensive tweets. Microsoft developers quickly suspended it and went back to the drawing board.

As it turns out, online trolls had been spamming the bot from the start, manipulating its ability to learn from interactions with the public. After being bombarded with abuse by an army of 4channers, it’s little wonder Tay started parroting their hateful rhetoric.

Like Tay, contemporary chatbots are products of their environment and are vulnerable to similar attacks. Even Wikipedia, which has been so important in the development of LLMs, could be used to poison ML training data.

However, intentionally corrupted data isn’t the only source of misinformation chatbot developers need to be wary of.

Read More: Best Crypto Sign-Up Bonuses in 2023

Model Collapse: a Ticking Time Bomb for Chatbots?

As AI tools grow in popularity, AI-generated content is proliferating. But what happens to LLMs trained on web-scraped datasets if a growing proportion of that content is itself created by machine learning?

One recent investigation into the effects of recursivity on ML models explored just this question. And the answer it found has major implications for the future of generative AI.

The researchers discovered that when AI-generated materials are used as training data, ML models start forgetting things they learned previously.

Coining the term “model collapse,” they noted that different families of AI all tend to degenerate when exposed to artificially-created content.

The team created a feedback loop between an image-generating ML model and its output in one experiment. 

Upon observation, they discovered that after each iteration, the model amplified its own mistakes and began to forget the human-generated data it started with. After 20 cycles, the output hardly resembled the original dataset.

Outputs From an Image-Generating ML Model (Source: arXiv

The researchers observed the same tendency to degenerate when they played out a similar scenario with an LLM. And with each iteration, mistakes such as repeated phrases and broken speech occurred more frequently.

From this, the study speculates that future generations of ChatGPT could be at risk of model collapse. If AI generates more and more online content, the performance of chatbots and other generative ML models may worsen.

Reliable Content Needed to Prevent Declining Chatbot Performance

Going forward, reliable content sources will become increasingly important to protect against the degenerative effects of low-quality data. And those companies that control access to the content needed to train ML models hold the keys to further innovation. 

After all, it’s no coincidence that tech giants with millions of users constitute some of the biggest names in AI. 

In the last week alone, Meta revealed the latest version of its LLM Llama 2, Google launched new features for Bard, and reports circulated that Apple is preparing to enter the fray too.

Whether it’s driven by data poisoning, early signs of model collapse, or some other factor, chatbot developers can’t ignore the threat of declining performance.

Read More: 6 Best Copy Trading Platforms in 2023

The post Declining Chatbot Performance: Data Challenges Threaten the Future of Generative AI appeared first on BeInCrypto.

origin »

Bitcoin price in Telegram @btc_price_every_hour

High Performance Blockchain (HPB) на Currencies.ru

$ 0 (+0.00%)
Объем 24H $0
Изменеия 24h: 0.00 %, 7d: 0.00 %
Cегодня L: $0 - H: $0.0064459
Капитализация $0 Rank 99999
Цена в час новости $ 0.0910131 (-100%)

performance data declining generative future threaten chatbot

performance data → Результатов: 126


Ripple investors may re-consider their investments after XRP's performance in the last three days

=In the last few days, the on-chain performance of the XRP token has told a tale of a steady decrease in the buy and sell bids for the leading altcoin. According to data from Santiment, on 20 July, thThe post Ripple investors may re-consider their investments after XRP's performance in the last three days appeared first on AMBCrypto.

2022-7-23 18:30


Entain integrates Syntropy's technology to increase network performance and resiliency

Syntropy is helping enterprises meet the increased requirements for network performance for Web3 applications with its critical data routing protocol that modernizes internet connectivity for the new The post Entain integrates Syntropy's technology to increase network performance and resiliency appeared first on AMBCrypto.

2022-4-8 13:45


Фото:

On-Chain Data Suggests Ether Will Be Blazing Through New All-Time Highs Before Bitcoin

Earlier this month, Bloomberg revealed that Ethereum’s year-to-date performance has been significantly higher than Bitcoin’s for two years now. Bitcoin proponents, unfazed by the recent developments, have their eyes peeled for when Bitcoin finally hits $100,000; as they should, given that multiple indicators have validated the possibility of Bitcoin attaining said price this year. However, […]

2021-8-22 21:55


Фото:

Report Finds Public Trust of Bitcoin Will Soon Outweigh that of Big Banks

Bitcoin’s status as a “safe haven asset” has grown in recent times due to its performance against a backdrop of global uncertainty This narrative has also been bolstered by the massive amounts of money printing and inflation seen throughout the globe The recent halving event shined a spotlight on Bitcoin’s deflationary structure and fixed supply All these factors appear to have built up the public’s trust in the benchmark cryptocurrency New data shows that the […]

2020-6-25 02:00


Фото:

Tether’s USDT Issuance Possibly Led to the Bitcoin Price Surge After March Dip, Data Suggests

Bitcoin (BTC) has recorded a sustained upward momentum over the past couple of months with the digital asset’s price even surpassing the $10,000 level at some point. The virtual currency’s performance continues to defy odds as mainstream financial markets are struggling with the effects of the Covid-19 driven economic crisis. Bitcoin price surge tied to […]

2020-6-9 22:25


Фото:

Coinbase Data Shows Crypto Investors Still Fans of Altcoins Despite Poor Performance

Despite climbing against Bitcoin today, most major altcoins are still trading around their multi-year lows while looking towards their BTC trading pairs This weakness comes as traders flee smaller cryptocurrency’s in favor of Bitcoin, which has been firmly establishing itself as a “safe haven asset” throughout the past couple of months Data from Coinbase, however, indicates that the vast majority of crypto investors on the platform are still interested in investing in altcoins Data from […]

2020-5-19 21:00


BTCS to launch crypto asset data analytics platform in 2H 2020

BTCS, a digital asset and blockchain technology company, announced today it is expanding its business model with the development of a consumer-facing digital-asset data analytics platform. The goal is to enable users to connect multiple digital asset exchanges and wallets to aggregate portfolio holdings into a single seamless platform to view and analyze performance, risk […] CryptoNinjas: BTCS to launch crypto asset data analytics platform in 2H 2020

2020-2-4 19:09


Представлена версия 2.0 блокчейн-платформы Hyperledger Fabric

Возглавляемый Linux Foundation консорциум Hyperledger объявил о выпуске версии 2. 0 блокчейн-платформы Fabric. Hyperledger Fabric 2. 0 is here! New release is optimized for production deployments with decentralized chaincode features, private data enhancements and improved performance.

2020-1-31 10:02