Re: Problem loading html containing scripts using Dom LoadHTML

Posted by Jerry Stuckle on 05/15/07 20:56

loretta wrote:
> On May 14, 2:16 pm, shimmyshack <matt.fa...@gmail.com> wrote:
>> On May 14, 6:08 pm, loretta <lorb...@optonline.net> wrote:
>>
>>> This code just reads HTML and prints it; eventually I want to
>>> modify the HTML. However, the original HTML contains JavaScript, and
>>> the output HTML contains tags that are not in the original.
>>> $url = "http://www.something.com";
>>> $doc = new DOMDocument();
>>> $doc->loadHTMLFile($url);
>>> print $doc->saveHTML();
>>> Original html snippet:
>>> function exampleFunction() {
>>> var doc = '<html><head>';
>>> doc += '<title>Title</title>';
>>> doc += '</head>';
>>> doc += '<body onload="self.focus();">';
>>> doc += '</body></html>';
>>> }
>>> Html after saveHTML:
>>> function exampleFunction() {
>>> ('about:blank','imagemanagerpopup',settings);
>>> var doc = '<html><head>';
>>> doc += '<title>Title</title>';
>>> doc += '</script>
>>> </head>
>>> <body>
>>> <p>';
>>> doc += '</body>
>>> </html><html><body>
>>> <p>';
>>> }
>>> Extra tags that end the script and head and begin a new body are being
>>> added before the </body> tag and after the <body onload="self.focus();">
>>> tag in the JS variable. Is there a way for the DOM to leave the
>>> JavaScript as is without trying to 'fix' the HTML? The changes being
>>> made are causing a JavaScript error.
>>> Thanks
>> Start off with XHTML, so it can be loaded with no errors. Search Google
>> for how to add JavaScript in a way that is compliant with XML standards.
>
> The HTML I am retrieving has an XHTML doctype. I also have no control
> over the original webpage. The original webpage loads with no errors
> in both IE and FF.
>

But does it validate (http://validator.w3.org)? Pages can load in
browsers without error and still not validate. The browsers are very
forgiving, and make a "best guess" as to what the page creator wanted.
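
Short of running the page through the validator, you can also ask the
parser what it objects to. What follows is only a sketch under stated
assumptions, not a definitive fix: the function names
collectHtmlParseErrors() and escapeCloseTagsInScripts() are made up for
illustration, the messages you get back depend on the libxml2 version
PHP was built against, and the escaping step assumes every "</" inside a
script block sits in a JavaScript string literal, where "<\/" means the
same thing.

```php
<?php
// Hypothetical helper: collect whatever libxml reports while parsing HTML.
// Which messages appear (if any) depends on the installed libxml2 version.
function collectHtmlParseErrors(string $html): array
{
    // Buffer parse errors instead of emitting PHP warnings.
    $previous = libxml_use_internal_errors(true);
    libxml_clear_errors();

    $doc = new DOMDocument();
    $doc->loadHTML($html); // $html must be a non-empty string

    $messages = [];
    foreach (libxml_get_errors() as $error) {
        $messages[] = sprintf('line %d: %s', $error->line, trim($error->message));
    }

    // Restore the previous error-handling state.
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    return $messages;
}

// Hypothetical helper: rewrite "</" as "<\/" inside <script> bodies so an
// HTML 4 style parser does not treat markup in JavaScript strings as tags.
// Assumption: "</" only occurs inside string literals in those scripts.
function escapeCloseTagsInScripts(string $html): string
{
    $result = preg_replace_callback(
        '~(<script\b[^>]*>)(.*?)(</script\s*>)~is',
        function (array $m): string {
            // $m[1] = opening tag, $m[2] = script body, $m[3] = closing tag
            return $m[1] . str_replace('</', '<\/', $m[2]) . $m[3];
        },
        $html
    );

    // preg_replace_callback() returns null on a PCRE error; fall back to the input.
    return $result ?? $html;
}
```

A possible way to use it: fetch the page into a string (for example with
file_get_contents()), print collectHtmlParseErrors() on it to see what
the parser complains about, then pass the output of
escapeCloseTagsInScripts() to loadHTML() rather than calling
loadHTMLFile() on the URL directly.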

--
==================
Remove the "x" from my email address
Jerry Stuckle
JDS Computer Training Corp.
jstucklex@attglobal.net
==================