Wikipedia Scraper logo

Wikipedia Scraper API

Extract article content, summaries, infoboxes, and references from Wikipedia. Access the world's largest free encyclopedia programmatically.

Use Cases

Knowledge Extraction

Build knowledge bases and extract structured information for AI applications.

Research Automation

Gather background information and references for academic or business research.

Content Enrichment

Enhance applications with authoritative information from Wikipedia.

Entity Recognition

Extract entities, relationships, and facts from infoboxes and articles.

API Endpoints

Available endpoints for the Wikipedia Scraper API

POST /v1/wikipedia/page

Wikipedia Page API

Scrape a public Wikipedia article page from a search result URL.

Parameters

url string required Wikipedia page URL returned by wikipedia/search.
POST /v1/wikipedia/search

Wikipedia API

Search Wikipedia through the public MediaWiki API with offset pagination.

Parameters

cursor string Cursor returned as next_cursor from the previous response.
limit integer Maximum number of results to return.
query string required Wikipedia article search query.

Frequently Asked Questions

Common questions about the Wikipedia Scraper API

Can I access Wikipedia in other languages?

Yes, we support all Wikipedia language editions. Just specify the language code.

Do you extract citations and references?

Yes, we can extract all citations, references, and external links from articles.

Can I get article revision history?

We extract current versions. Historical data may be available through additional endpoints.

Is the data structured or raw HTML?

We provide both structured JSON data and clean text content, not raw HTML.

Start scraping Wikipedia

Get your free API key