ManyPI

Scraping Recipe Generator

Build a structured JSON config for your scraper. Define start URLs, pagination selectors, and data extraction rules in a standardized format.

Standardize Your Scraping

Maintaining custom Python scripts for 50 different websites is a nightmare. A "config-driven" approach lets you separate the logic (the scraper code) from the definition (the selectors), so a site redesign means editing a config file rather than code. This tool helps you create those definitions in a clean JSON format.
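
As a rough illustration of that split, here is a minimal sketch in which the definition lives in a JSON string and a small loader checks it before any scraping logic runs. The key names (`start_urls`, `pagination`, `item_selector`, `fields`), the selectors, and the example.com URL are all assumptions made for this example; the tool's actual output schema may differ.

```python
import json

# Hypothetical recipe: every key name, selector, and URL here is illustrative.
RECIPE_JSON = """
{
  "start_urls": ["https://example.com/products"],
  "pagination": {"next_selector": "a.next"},
  "item_selector": "div.product-card",
  "fields": {
    "title": "h2.title",
    "price": "span.price"
  }
}
"""

# Keys this sketch treats as mandatory (an assumption, not a published spec).
REQUIRED_KEYS = ("start_urls", "item_selector", "fields")


def load_recipe(text):
    """Parse a recipe and fail early if a required key is missing."""
    recipe = json.loads(text)
    missing = [key for key in REQUIRED_KEYS if key not in recipe]
    if missing:
        raise ValueError("recipe is missing keys: " + ", ".join(missing))
    return recipe


recipe = load_recipe(RECIPE_JSON)
```

The same loader can serve every site; only the JSON changes from one site to the next.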

Relative Selection

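The key to scraping lists reliably (e-commerce catalogs, news feeds) is "relative selection". You define one parent selector that matches each card, then simple child selectors for the title, price, and other fields, each evaluated inside its own card. Fields from one item therefore never get paired with fields from another, even when a card is missing a value. This JSON format enforces that practice.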
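
A small sketch of relative selection follows. To stay runnable on the standard library alone, it swaps CSS selectors for the limited XPath subset in `xml.etree.ElementTree`, and it parses a well-formed snippet rather than real-world HTML. The markup, key names, and paths are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed markup standing in for a product listing page.
PAGE = """
<html><body>
  <div class="card"><h2>Blue Mug</h2><span class="price">$12</span></div>
  <div class="card"><h2>Red Mug</h2><span class="price">$14</span></div>
  <div class="ad"><h2>Sponsored</h2></div>
</body></html>
"""

# Hypothetical recipe fragment: one parent path, child paths relative to it.
# XPath is used here in place of CSS selectors (a stdlib-only substitution).
RECIPE = {
    "item_selector": ".//div[@class='card']",
    "fields": {
        "title": "./h2",
        "price": "./span[@class='price']",
    },
}


def extract_items(markup, recipe):
    """Find each card, then resolve every field inside that card only."""
    root = ET.fromstring(markup.strip())
    items = []
    for card in root.findall(recipe["item_selector"]):
        item = {name: card.findtext(path) for name, path in recipe["fields"].items()}
        items.append(item)
    return items


ITEMS = extract_items(PAGE, RECIPE)
```

Because the `h2` lookup runs inside each matched card, the heading in the `ad` block is never picked up as a product title.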

Portable Configuration

This JSON structure is generic enough that you can easily adapt it to work with Apify, Scrapy, Puppeteer, or your own custom engine. It serves as a universal blueprint for the data extraction task.
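
One way to picture that portability is a thin adapter per engine that reshapes the same recipe. The sketch below assumes a hypothetical engine that wants a single absolute CSS selector per field, and it builds each one by joining the parent and child selectors with the CSS descendant combinator (a space). The key names and selectors are illustrative, not a documented schema.

```python
# Hypothetical recipe; key names and selectors are assumptions for this sketch.
RECIPE = {
    "item_selector": "div.product-card",
    "fields": {
        "title": "h2.title",
        "price": "span.price",
    },
}


def to_absolute_selectors(recipe):
    """Join parent and child selectors with the CSS descendant combinator."""
    parent = recipe["item_selector"].strip()
    return {
        name: parent + " " + child.strip()
        for name, child in recipe["fields"].items()
    }


ABSOLUTE = to_absolute_selectors(RECIPE)
```

Flattening like this gives up the per-item grouping that relative selection provides, so it only suits engines that cannot scope a query to a parent element. The recipe itself stays unchanged either way.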
