Fact Check: Perplexity AI accused of ignoring robots.txt, violating web scraping norms.

Published June 21, 2025

•By

VERDICT

Partially True

# Fact Check: Perplexity AI Accused of Ignoring robots.txt, Violating Web Scraping Norms ## What We Know Perplexity AI is currently under investigati...

Fact Check: Perplexity AI Accused of Ignoring robots.txt, Violating Web Scraping Norms

What We Know

Perplexity AI is currently under investigation by Amazon Web Services (AWS) due to allegations that its web crawler is bypassing the Robots Exclusion Protocol (robots.txt) to scrape content from various websites without permission. The robots.txt file is a standard used by web developers to instruct automated bots on which parts of their site should not be accessed. While compliance with this protocol is voluntary, reputable crawlers typically respect these guidelines (Wired, DailyAI).

Reports indicate that a virtual machine associated with Perplexity AI was found to be scraping content from multiple sites, including those owned by Condé Nast, The Guardian, Forbes, and The New York Times (Wired). Wired's investigation revealed that when specific headlines were input into Perplexity's chatbot, the responses closely paraphrased the articles with minimal attribution, raising concerns about copyright infringement (DailyAI).

Perplexity AI's spokesperson, Sara Platnick, stated that the company’s bot respects the robots.txt protocol, but also acknowledged that it may ignore these instructions when a user specifically requests a URL (Wired). CEO Aravind Srinivas admitted that Perplexity uses third-party web crawlers, which may have contributed to the violations (DailyAI).

Analysis

The investigation by AWS is significant because it reflects the growing scrutiny of AI companies regarding their data collection practices. AWS has a strict policy against abusive activities, and their investigation into Perplexity AI suggests that they take these allegations seriously (Engadget, Wired).

The claims against Perplexity AI are bolstered by multiple credible sources, including Wired and DailyAI, which detail the specific instances of alleged scraping and the responses from both Perplexity and AWS. However, the company's defense—that it respects robots.txt and that any violations may stem from third-party crawlers—introduces ambiguity into the situation. This defense could imply a lack of direct oversight or control over its data collection practices, which is a critical aspect of the allegations (DailyAI, Wired).

Despite the serious nature of the allegations, the situation is complicated by the fact that Perplexity AI has not been definitively proven to be in violation of any laws or regulations at this stage. The investigation is ongoing, and the outcome remains uncertain. The responses from Perplexity's leadership indicate a willingness to cooperate with AWS, which may mitigate some concerns if they can demonstrate compliance with web scraping norms moving forward (Engadget, DailyAI).

Conclusion

The claim that Perplexity AI is accused of ignoring robots.txt and violating web scraping norms is Partially True. While there are credible allegations supported by investigations from reputable sources, Perplexity AI has denied outright violations and claims that it adheres to the robots.txt protocol under normal circumstances. The ongoing investigation by AWS will be crucial in determining the validity of these claims and whether any actions will be taken against Perplexity AI.

Sources

Have a claim you want to verify? It's 100% Free!

Our AI-powered fact-checker analyzes claims against thousands of reliable sources and provides evidence-based verdicts in seconds. Completely free with no registration required.

💡 Try:

"Coffee helps you live longer"

✓100% Free

✓No Registration

✓Instant Results

Comments

Loading comments...

Fact Check: Perplexity AI accused of ignoring robots.txt, violating web scraping norms.

Fact Check: Perplexity AI Accused of Ignoring robots.txt, Violating Web Scraping Norms

What We Know

Analysis

Conclusion

Sources

Have a claim you want to verify? It's 100% Free!

Comments

Leave a comment

More Fact Checks to Explore

Fact Check: Perplexity AI accused of ignoring robots.txt directives for web scraping.

Fact Check: Perplexity AI accused of ignoring robots.txt directives.

Fact Check: Trump in the last 24 hours: Fell asleep at an event Forgot who accompanied him to the event he fell asleep at Accused Obama of writing the Epstein files Made up a story about his uncle teaching the Unabomber Forgot he appointed Jerome Powell as Fed Chair

Fact Check: Trump in the last 24 hours: Fell asleep at an event Forgot who accompanied him to the event he fell asleep at Accused Obama of writing the Epstein files Made up a story about his uncle teaching the Unabomber Forgot he appointed Jerome Powell as Fed Chair

Fact Check: Is perplexity free?