Posted by data64 on 07/29/05 15:54
>
> You could use Perl's LWP::Simple module to get the webpage and put it
> into a variable.
>
> Next you could use Perl's HTML::Parser module to extract the plain text
> you want from the HTML. You would likely also have to use the split
> function and regular expressions as suppliments to this.
>
Actually in this case, I would suggest Template::Extract rather than
Html::Parser as an simpler way of extracting data.
But then with Perl there's usually more than one way of doing it.
data64
[Back to original message]
|