Posted by Jim Higson on 08/05/06 10:53
Andy Dingley wrote:
>
> Jim Higson wrote:
>
>> Just curious, but why is this tricky to do?
>
> RSS isn't XML (for most of the non-RDF versions - read their spec!).
> There is no XML-valid RSS 2.0 as there's no way to define validity for
> it. Practical RSS is also very frequently not well-formed RSS -
> references to HTML entities being the usual culprits.
>
> If you try to load RSS through an XML parser then, unless you're just
> dealing with RSS 1.0 and Atom feeds, you'll regularly find parsing
> errors. A practical real-world RSS aggregator has to cope with this
> without failing.
Thanks for the heads-up. Interesting how bad the situation is. I'd have
imagined it was plain XML and pretty easy to parse as such.
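
For anyone reading along, here is a rough sketch of the failure mode Andy describes (HTML entity references in a feed) and one possible way an aggregator might cope with it. It uses only the Python standard library. The names parse_feed, replace_html_entities and SAMPLE are made up for illustration, and the fallback only handles undefined HTML entities; a feed that is broken in some other way would still raise a parse error.

```python
import re
import xml.etree.ElementTree as ET
from html.entities import name2codepoint

# The five entities XML itself predefines; these must be left untouched.
XML_PREDEFINED = {"amp", "lt", "gt", "quot", "apos"}

# Matches named entity references such as &nbsp; or &eacute;
_ENTITY_RE = re.compile(r"&([A-Za-z][A-Za-z0-9]*);")


def replace_html_entities(text):
    """Rewrite HTML-only named entities as numeric character references."""
    def fix(match):
        name = match.group(1)
        if name in XML_PREDEFINED or name not in name2codepoint:
            # Either valid XML already, or unknown: leave it as it is.
            return match.group(0)
        return "&#%d;" % name2codepoint[name]
    return _ENTITY_RE.sub(fix, text)


def parse_feed(text):
    """Try a strict XML parse first; on failure, retry with entities fixed."""
    try:
        return ET.fromstring(text)
    except ET.ParseError:
        return ET.fromstring(replace_html_entities(text))


# Hypothetical feed fragment using HTML entities that XML does not define.
SAMPLE = (
    '<rss version="2.0"><channel>'
    "<title>Caf&eacute;&nbsp;news</title>"
    "</channel></rss>"
)

if __name__ == "__main__":
    root = parse_feed(SAMPLE)
    print(repr(root.find("channel/title").text))
```

A strict ET.fromstring(SAMPLE) on its own raises ParseError because &eacute; and &nbsp; are not defined in XML, which is the kind of error a plain XML parser would regularly hit on real-world feeds.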
--
Jim