Posted by David Dorward on 09/19/07 13:14
On Sep 19, 1:30 pm, SDG <giuffsa...@hotmail.it> wrote:
> Hi, I'm writing a web scraper to extract text from a web page, and I
> need to know what characters can be present inside an attribute of a
> tag.
Any character can appear, although some attributes limit what is
allowed (e.g. the width attribute takes an integer or an integer
followed by a percent sign). Those limits usually aren't expressed by
the DTD. Some characters (& for example) also have special meaning, so
they appear in the markup as character references such as &amp;.
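
For a scraper, it is usually safer to let an HTML parser read the attribute values than to match them by hand. The following is only a minimal sketch using Python's standard html.parser module. The names AttrCollector, collect_attrs and looks_like_width are made up for this illustration, and the width check just mirrors the "integer or integer plus percent sign" rule described above rather than anything a DTD enforces.

```python
import re
from html.parser import HTMLParser


class AttrCollector(HTMLParser):
    """Collects (tag, attribute name, attribute value) for every start tag."""

    def __init__(self):
        super().__init__()
        self.found = []

    def handle_starttag(self, tag, attrs):
        # html.parser has already removed the surrounding quotes and
        # decoded character references such as &amp; in each value.
        for name, value in attrs:
            self.found.append((tag, name, value))


def collect_attrs(html):
    """Return all attributes found in the given HTML fragment."""
    parser = AttrCollector()
    parser.feed(html)
    parser.close()
    return parser.found


def looks_like_width(value):
    """Hypothetical check: an integer, optionally followed by '%'."""
    return re.fullmatch(r"\d+%?", value) is not None


if __name__ == "__main__":
    sample = (
        '<a href="?a=1&amp;b=2" title=\'say "hi" &gt; now\'>link</a>'
        '<img width="50%" alt="caf\u00e9">'
    )
    for tag, name, value in collect_attrs(sample):
        print(tag, name, repr(value))
```

The decoded values can contain quotes, ampersands and non-ASCII characters, so a scraper should not assume any restricted character set unless it validates a specific attribute itself.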
--
David Dorward
http://dorward.me.uk/
http://blog.dorward.me.uk/