Posted by Alan on 03/24/07 10:45
"Razvan" <defconhaya@gmail.com> wrote in message
news:1174729037.379978.230910@o5g2000hsb.googlegroups.com...
> Hello there,
>
> I have the following problem:
> I have a big HTML file and I want to remove everything between some
> tags and keep the rest, preferably using regex, but any solution
> will be great.
> The number and type of tags may vary. Here is an example:
>
> <body>
> text text text text text text text
> text text text
> text text text text
>
> <remove1>
> text text text text text text
> text text
> text
> text text text
> </remove1>
>
> text text text
> text text
>
> <remove1>
> text text text text
> </remove1>
>
> text text
> text text
> text text text
>
> <remove2>
> text text text text text
> text text text
> text text
> </remove2>
>
> text text text text text
> text text text text
> </body>
>
> Any suggestions will be appreciated !
> Thanks.
>
A regex search and replace of <(/?[^\>]+)> with "" strips the tags
themselves and leaves just your text text text etc.
Some flavours may need the angle brackets escaped: \<(/?[^\>]+)\>
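
For what it's worth, here is a rough Python sketch of that search and replace. The sample text and the names TAG, strip_tags and remove_blocks are made up for illustration. The second function is not the pattern above but an assumed variant, in case the goal is to drop the text inside <remove1> and <remove2> as well as the tags; it uses a lazy match and a backreference and does not cope with nested tags of the same name.

```python
import re

# Made-up sample shaped like the example in the question.
html = """<body>
keep one
<remove1>
drop this
</remove1>
keep two
<remove2>
drop that
</remove2>
keep three
</body>"""

# Pattern from the reply: matches any single tag, opening or closing.
TAG = re.compile(r"<(/?[^>]+)>")


def strip_tags(text):
    """Delete the tags themselves; the text between them is kept."""
    return TAG.sub("", text)


def remove_blocks(text, names):
    """Assumed variant: delete the named elements together with their content.

    `names` is a list of tag names without angle brackets. The backreference
    \\1 makes the closing tag match the opening one; DOTALL lets `.` span
    newlines, and `.*?` stops at the first matching closing tag.
    """
    alternation = "|".join(re.escape(n) for n in names)
    block = re.compile(r"<(%s)>.*?</\1>\s*" % alternation, re.DOTALL)
    return block.sub("", text)


if __name__ == "__main__":
    print(strip_tags(html))
    print(remove_blocks(html, ["remove1", "remove2"]))
```

The first call keeps every line of text and only loses the markup; the second keeps <body> and the text outside the listed tags.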
hth
Alan