Re: Font Tag — HTML — IT news, forums, messages

You are here: Re: Font Tag « HTML « IT news, forums, messages

Posted by Jim Higson on 03/29/06 11:16

Jo wrote:

> Thanks..
> Im writing a HTML parser that removes the tags and keeps using sensible
> text. This is in C#.Its like a tool.But, can i add another tool to it
> like HTML Tidy to cleanup? Wud that be right?
> In webpages, i only want the main txt to be displayed and not the Side
> divisions on the left n right of the web page that mostly shows links
> to the other pages.
> I realised that in the web page im workin on now, has the right n left
> div inside font tag of their own specified class. So i will check
> whether its a font tag, then check for its class, if all are true, then
> i'll remove until a </font> tag comes. This was workin fine until one
> webpage showed me that </font> tag was missing for a <font> tag... Now
> what do i do?
> I have coded in C#..

Writing an error-tollerent HTML/SGML parser takes a long time. Do you have
to do this (ie it is for a school project) or could you use a preexisting
one?

TagSoup is a pretty good parser for bad HTML. See:
http://www.idealliance.org/papers/xml02/dx_xml02/html/abstract/05-06-06.html

TagSoup is in Java, but not every part of a project has to be in the same
language.

--
Jim

Navigation:

Next in forum: Re: html, unicode and character sets
Prev in forum: Re: IE7:Headache?
Thread view: Re: Font Tag

[Reply to this message]

Удаленная работа для программистов • Как заработать на Google AdSense • England, UK • статьи на английском • PHP MySQL CMS Apache Oscommerce • Online Business Knowledge Base • DVD MP3 AVI MP4 players codecs conversion help

Home • Search • Site Map • Set as Homepage • Add to Favourites

Сайт изготовлен в Студии Валентина Петручека —
изготовление и поддержка веб-сайтов, разработка программного обеспечения, поисковая оптимизация