Fact Check: Perplexity AI accused of ignoring robots.txt directives.

Fact Check: Perplexity AI accused of ignoring robots.txt directives.

Published June 20, 2025
by TruthOrFake AI
βœ“
VERDICT
True

# Fact Check: Perplexity AI Accused of Ignoring Robots.txt Directives ## What We Know Perplexity AI has been accused of not adhering to the directive...

Fact Check: Perplexity AI Accused of Ignoring Robots.txt Directives

What We Know

Perplexity AI has been accused of not adhering to the directives specified in the robots.txt file, which is a standard used by websites to communicate with web crawlers about which parts of the site should not be accessed. The BBC has formally threatened legal action against Perplexity AI, claiming that the company is reproducing its content "verbatim" without permission. In their communication, the BBC stated that they had disallowed two of Perplexity's crawlers and asserted that the company "is clearly not respecting robots.txt" (BBC, BBC).

Additionally, an investigation by Amazon Web Services (AWS) is underway regarding allegations that Perplexity AI is using a crawler hosted on AWS servers that ignores the Robots Exclusion Protocol (Engadget, RetailWire). Reports indicate that this crawler has been identified as visiting various media properties multiple times, suggesting a pattern of scraping content without consent (Engadget).

Analysis

The claims against Perplexity AI are substantiated by multiple sources. The BBC's legal threat indicates a serious concern regarding copyright infringement and the misrepresentation of its content. The BBC's findings suggest that Perplexity AI's outputs have not only misrepresented their content but have also potentially damaged their reputation (BBC).

The investigation by AWS adds another layer of scrutiny. According to reports, AWS is looking into whether Perplexity AI's activities violate its terms of service, which prohibit abusive and illegal activities (Engadget). Perplexity's spokesperson has denied these allegations, asserting that their crawlers respect robots.txt, except in specific instances where users include URLs in their inquiries (RetailWire). This admission raises questions about the extent to which Perplexity AI is compliant with web scraping standards.

The reliability of the sources is generally high, with major news organizations like the BBC and Engadget providing detailed accounts of the situation. However, it is important to note that Perplexity AI's responses may reflect a defensive posture, potentially biasing their statements.

Conclusion

The claim that Perplexity AI has been accused of ignoring robots.txt directives is True. The evidence from credible sources indicates that Perplexity AI is under scrutiny for potentially scraping content without permission, leading to legal threats from the BBC and an investigation by AWS. The company's own admissions regarding the behavior of its crawlers further support this conclusion.

Sources

  1. BBC threatens AI firm with legal action over unauthorised content use
  2. Amazon reportedly investigated Perplexity AI after accusations it scrapes websites without consent
  3. Amazon Investigates Perplexity AI Web Scraping Allegations
  4. BBC threatens AI firm with legal action over unauthorised content use
  5. Amazon Investigates Perplexity AI Web Scraping Allegations

Have a claim you want to verify? It's 100% Free!

Our AI-powered fact-checker analyzes claims against thousands of reliable sources and provides evidence-based verdicts in seconds. Completely free with no registration required.

πŸ’‘ Try:
"Coffee helps you live longer"
βœ“100% Free
βœ“No Registration
βœ“Instant Results

Comments

Comments

Leave a comment

Loading comments...

More Fact Checks to Explore

Discover similar claims and stay informed with these related fact-checks