Can we extract ALL text from a webpage?
