WTF is OpenAI's GPTBot?

604
0
Published 2023-09-08
Publishers have a new tool in their efforts to limit AI’s threat to their businesses. And it’s from the company behind one of the predominant threats.

In August, OpenAI announced that website owners can now block its GPTBot web crawler from accessing their webpages’ content. Since then, 12% of the 1000 most-visited sites online have done so, according to Originality AI. The list of sites shutting themselves off to OpenAI’s web crawlers includes publishers such as Bloomberg, CNN and The New York Times.

For those unfamiliar with what a web crawler like OpenAI’s GPTBot is, not to mention how websites are able block their access, check out this explainer video skit, created by Digiday senior media editor Tim Peterson.

VISIT us: www.digiday.com/
LIKE us on FACEBOOK: www.facebook.com/digiday
FOLLOW us on TWITTER: twitter.com/Digiday
FOLLOW our INSTAGRAM: instagram.com/digiday/

All Comments (2)
  • @PeterSigurdson
    You’re missing the point – ChatGPT doesn’t need to crawl your site because all of the knowledge which went in the building up the content on your site is part of the background knowledge of the world, which it has access to so anybody with careful, prompt engineering can reproduce what’s on your site anyway I don’t need to access Galileo’s and newtons writing about gravity if I can go out side and find out about gravity by myself 1:52