Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

Getting Started

Welcome to wicket! This section will help you get up and running quickly.

wicket extracts plain text from Wikipedia XML dump files. It reads MediaWiki XML dumps (optionally bzip2-compressed), removes wiki markup, and writes clean text in doc or JSON format.

Next Steps

  • Installation – install wicket from source or crates.io
  • Quick Start – extract text from a Wikipedia dump in minutes