|
Posted by Jochem Maas on 12/08/05 12:03
Roman Ivanov wrote:
> Task:
> Create a script that converts text into HTML with paragraphs.
>
> Problem:
> Input text could use the book notation, as well as the web notation,
> plus it can contain HTML.
>
> ==
> <h1>This is a title</h1>
>
> This is a Book paragraph.
> This is another book paragraph.
> This is yet another book paragraph, but it's not indented with spaces,
> because user wrote it in OpenOffice.
> ==
>
> ==
> This is a web paragraph.
>
> This is another web paragraph.
what is a book paragraph, what is a web paragraph? (exactly)
have you looked at the Tidy extension? in short it kicks ass
at cleaning up junk HTML - possibly a good start.
>
> This is yet another web paragraph, which is indented with spaces for
> some unknown reason.
> ==
>
> Output text should be correctly formatted without using lots of br's and
> 's. Doing so manually is not a problem, I would just use <p> for
> web paragraphs, and <p class="book"> for book paragraphs. However,
> formatting such text with a scrip is very difficult. Does anyone knows a
> good exaple of such script?
>
Navigation:
[Reply to this message]
|