Tag Archives: web crawlers

Webpage direct visits: bots, crawlers, or real visitors?


"Are webpage visits coming from a direct visit (vs a search) from bots or crawlers vs real visitors, or perhaps AI scrapers?"

Yes. A significant portion of direct webpage visits can come from bots, crawlers, and AI scrapers, not just real human visitors. How much depends on the site, its traffic level, and the kind of page you’re looking at.

Below is a clear breakdown.

1. What “Direct” Traffic Really Means

In analytics tools (

... Continue reading
Posted in Ask David!, Website Traffic

AI Gone Rogue Again? Perplexity Bots Bypass IP Blocks and Robots.txt


Just months after Claude’s so-called “blackmail” stunt fueled fears about agentic AI models, the web is facing another wave of rogue behavior. This time, Perplexity AI is under fire for sneaking past website blocks, disguising its bots, and scraping content without permission. While Claude sparked debates about AI ethics and control, Perplexity highlights a more widespread, everyday threat to publishers: unconsented content harvesting at massive scale.

Ignoring Robots.txt

Cloudflare, which recently expanded into bot management, reported that Perplexity’s crawlers often

... Continue reading
Posted in Technology in the News