Sites scramble to block ChatGPT web crawler after instructions emerge

August 11, 2023

Without announcement, OpenAI recently added details about its web crawler, GPTBot, to its online documentation site. GPTBot is the name of the user agent that the company uses to retrieve webpages to train the AI models behind ChatGPT, such as GPT-4. Earlier this week, some sites quickly announced their intention to block GPTBot’s access to their content.

In the new documentation, OpenAI says that webpages crawled with GPTBot “may potentially be used to improve future models,” and that allowing GPTBot to access your site “can help AI models become more accurate and improve their general capabilities and safety.”
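
This excerpt does not reproduce the blocking instructions themselves. As a rough sketch, assuming a site uses the standard robots.txt convention of a `User-agent: GPTBot` group followed by `Disallow: /`, the rule can be checked locally with Python's standard-library robots.txt parser. The helper name `is_allowed`, the sample URL on `example.com`, and the second user agent `SomeOtherBot` are all hypothetical and chosen only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed robots.txt content: one group that targets the GPTBot
# user agent and disallows every path on the site.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "https://example.com/article"  # hypothetical URL
    print("GPTBot allowed:", is_allowed(ROBOTS_TXT, "GPTBot", page))
    print("SomeOtherBot allowed:", is_allowed(ROBOTS_TXT, "SomeOtherBot", page))
```

A rule like this only affects crawlers that choose to honor robots.txt, and because the group names GPTBot alone, other user agents remain unaffected.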

→ Continue reading at Ars Technica
