DEV Community

# webscraping

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Building a Stock Trend Predictor for a Market That Has No API

Building a Stock Trend Predictor for a Market That Has No API

Comments
6 min read
HTTP 200 Is a Lie: A 30-Line Schema Canary for Source Drift

HTTP 200 Is a Lie: A 30-Line Schema Canary for Source Drift

Comments
10 min read
I Tested Every Web Scraping Tool Against Lazada — Here's What Actually Works (May 2026)

I Tested Every Web Scraping Tool Against Lazada — Here's What Actually Works (May 2026)

Comments
7 min read
How to scrape Nextdoor for hyper-local demographics and community sentiment

How to scrape Nextdoor for hyper-local demographics and community sentiment

5
Comments
9 min read
How China-focused funds turn Weibo into alt-data (Python, 2026)

How China-focused funds turn Weibo into alt-data (Python, 2026)

Comments
4 min read
I Built a SaaS Risk Scanner That Collects 35+ Signals Per Vendor. Here's What I Learned About Scraping, LLMs, and Solo Engineering.

I Built a SaaS Risk Scanner That Collects 35+ Signals Per Vendor. Here's What I Learned About Scraping, LLMs, and Solo Engineering.

Comments
8 min read
Scraping Without Tests Is Gambling. And the House Always Wins.

Scraping Without Tests Is Gambling. And the House Always Wins.

1
Comments
3 min read
Anonymous Proxies: How Modern Websites Decide Whether to Trust Your Traffic

Anonymous Proxies: How Modern Websites Decide Whether to Trust Your Traffic

Comments
7 min read
We patched Chromium with 49 C++ hooks to beat Cloudflare — here's how BrowserHand works

We patched Chromium with 49 C++ hooks to beat Cloudflare — here's how BrowserHand works

Comments
1 min read
When a scraping platform is too much for an LLM workflow

When a scraping platform is too much for an LLM workflow

Comments
4 min read
Building a self-hosted browser scraping service (is it more hassle than its worth?)

Building a self-hosted browser scraping service (is it more hassle than its worth?)

Comments
8 min read
Optimizing Browser Fingerprint Spoofing and Session Validation in Automated Scrapers

Optimizing Browser Fingerprint Spoofing and Session Validation in Automated Scrapers

1
Comments
2 min read
How I built a Bluesky scraper using the AT Protocol API (and published it on Apify)

How I built a Bluesky scraper using the AT Protocol API (and published it on Apify)

Comments
3 min read
Data Normalization Across Dublin Rental Portals: How to Make Listings Comparable

Data Normalization Across Dublin Rental Portals: How to Make Listings Comparable

Comments
4 min read
Robots.txt Is Not Enough Anymore: What Developers Need to Know About AI Crawler Controls

Robots.txt Is Not Enough Anymore: What Developers Need to Know About AI Crawler Controls

Comments
6 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.