Getting Started
Welcome to wicket! This section will help you get up and running quickly.
wicket extracts plain text from Wikipedia XML dump files. It reads MediaWiki XML dumps (optionally bzip2-compressed), removes wiki markup, and writes clean text in doc or JSON format.
Next Steps
- Installation – install wicket from source or crates.io
- Quick Start – extract text from a Wikipedia dump in minutes