WTF is OpenAI's GPTBot?
604
Published 2023-09-08
In August, OpenAI announced that website owners can now block its GPTBot web crawler from accessing their webpages’ content. Since then, 12% of the 1000 most-visited sites online have done so, according to Originality AI. The list of sites shutting themselves off to OpenAI’s web crawlers includes publishers such as Bloomberg, CNN and The New York Times.
For those unfamiliar with what a web crawler like OpenAI’s GPTBot is, not to mention how websites are able block their access, check out this explainer video skit, created by Digiday senior media editor Tim Peterson.
VISIT us: www.digiday.com/
LIKE us on FACEBOOK: www.facebook.com/digiday
FOLLOW us on TWITTER: twitter.com/Digiday
FOLLOW our INSTAGRAM: instagram.com/digiday/
All Comments (2)
-
You’re missing the point – ChatGPT doesn’t need to crawl your site because all of the knowledge which went in the building up the content on your site is part of the background knowledge of the world, which it has access to so anybody with careful, prompt engineering can reproduce what’s on your site anyway I don’t need to access Galileo’s and newtons writing about gravity if I can go out side and find out about gravity by myself 1:52