* jaytea ignores everything Mpdreamz just said and proposes a regex soltn!;D
this might be too basic for you Riamus, but the following is just a small expansion on $htmlfree which removes balanced tags nested inside other balanced tags (much like the type of regexes that handle nested parentheses)
alias htmlfree return $regsubex($1,/^[^<]*>|(<(?:[^<]|(?1))*>)|<.*$/gU,)
use it, build upon it, or ignore it ;p