Hi--three years later, is this something that seems more doable? It feels like Universal Summarizer's pipeline must have a step to strip out only the relevant content of a page (i.e., the actual article vs the nav bar, ads, page footer, etc.) and you could just run that output through a word count and come up with a rough estimate.